Skip to main content
Log in

Improved river water-stage forecasts by ensemble learning

  • Original Article
  • Published:
Engineering with Computers Aims and scope Submit manuscript

Abstract

Forecasting water stages is of significance to river and reservoir management. However, conventional models sometimes fail to perform accurately, as water levels are characterized by high nonstationarity. To provide an improved estimation of water stages, this study develops a new prediction framework by coupling stand-alone machine learning models with ensemble algorithms. As base learners, the optimal regression tree (RT) and extreme learning machine (ELM) are integrated into four ensemble strategies, i.e., bagging (BA), boosting (BO), random forest (RF) and random subspace (RS), leading to eight ensemble models. They are then assessed using daily water-stage records at two hydrological stations on the Yangtze River. Their performance is evaluated by statistical criteria: coefficient of determination (CD), Nash–Sutcliffe efficiency (NSE), root mean square error (RMSE) and mean absolute error (MAE). The RT and the ELM generate satisfactory predictions with deficiency in capturing extreme values. The ensemble models generally enhance the prediction efficiency, with their mean CD and NSE augment by up to 6.9% and 7.0%, and mean RMSE and MAE reduction by up to 47.9% and 47.0%. The BO-based models, namely BO-RT and BO-ELM, result in the highest accuracy, with a mean absolute relative error (ARE) of 1.0% and 1.4%. Ensemble learning gains even in multi-step-ahead forecasts, which satisfactorily extends the lead time up to 14 days. This study illustrates the capability of ensemble learning for improved water-level forecasts, which provides reference for modeling related issues such as sediment load and rainfall-runoff.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig.13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. King CW, Webber ME (2008) Water intensity of transportation. Environ Sci Technol 42:7866–7872

    Google Scholar 

  2. Coops H, Beklioglu M, Crisman TL (2003) The role of water-level fluctuations in shallow lake ecosystems–workshop conclusions. Hydrobiologia 506:23–27

    Google Scholar 

  3. Schuurmans J, Hof A, Dijkstra S, Bosgra O, Brouwer R (1999) Simple water level controller for irrigation and drainage canals. J Irrig Drain Eng 125:189–195

    Google Scholar 

  4. Cohen Y, Radomski P (1993) Water level regulations and fisheries in Rainy Lake and the Namakan Reservoir. Can J Fish Aquat Sci 50:1934–1945

    Google Scholar 

  5. Alsdorf DE, Melack JM, Dunne T, Mertes LA, Hess LL, Smith LC (2000) Interferometric radar measurements of water level changes on the Amazon flood plain. Nature 404:174–177

    Google Scholar 

  6. Li S, He D (2008) Water level response to hydropower development in the upper Mekong River. AMBIO 37:170–7

    Google Scholar 

  7. Zhou T, Jiang Z, Liu X, Tan K (2020) Research on the long-term and short-term forecasts of navigable river’s water-level fluctuation based on the adaptive multilayer perceptron. J Hydrol 591:125285

    Google Scholar 

  8. Aksoy H, Unal N, Eris E, Yuce M (2013) Stochastic modeling of Lake Van water level time series with jumps and multiple trends. Hydrol Earth Syst Sci 17:2297–2303

    Google Scholar 

  9. Kebede S, Travi Y, Alemayehu T, Marc V (2006) Water balance of Lake Tana and its sensitivity to fluctuations in rainfall, Blue Nile basin, Ethiopia. J Hydrol 316:233–247

    Google Scholar 

  10. Fischer P, Öhl U (2005) Effects of water-level fluctuations on the littoral benthic fish community in lakes: a mesocosm experiment. Behav Ecol 16:741–746

    Google Scholar 

  11. Phan T-T-H, Nguyen XH (2020) Combining statistical machine learning models with ARIMA for water level forecasting: the case of the Red river. Adv Water Resour 142:103656

    Google Scholar 

  12. Sanikhani H, Kisi O, Maroufpoor E, Yaseen ZM (2019) Temperature-based modeling of reference evapotranspiration using several artificial intelligence models: application of different modeling scenarios. Theor Appl Climatol 135:449–462

    Google Scholar 

  13. Zhu S, Heddam S, Nyarko EK, Hadzima-Nyarko M, Piccolroaz S, Wu S (2019) Modeling daily water temperature for rivers: comparison between adaptive neuro-fuzzy inference systems and artificial neural networks models. Environ Sci Pollut Res 26:402–420

    Google Scholar 

  14. Yaseen ZM, Sulaiman SO, Deo RC, Chau K-W (2019) An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. J Hydrol 569:387–408

    Google Scholar 

  15. Hadi SJ, Tombul M (2018) Monthly streamflow forecasting using continuous wavelet and multi-gene genetic programming combination. J Hydrol 561:674–687

    Google Scholar 

  16. Altunkaynak A (2007) Forecasting surface water level fluctuations of Lake Van by artificial neural networks. Water Resour Manag 21:399–408

    Google Scholar 

  17. Buyukyildiz M, Tezel G, Yilmaz V (2014) Estimation of the change in lake water level by artificial intelligence methods. Water Resour Manag 28:4747–4763

    Google Scholar 

  18. Hrnjica B, Bonacci O (2019) Lake level prediction using feed forward and recurrent neural networks. Water Resour Manag 33:2471–2484

    Google Scholar 

  19. Le X-H, Ho HV, Lee G, Jung S (2019) Application of long short-term memory (LSTM) neural network for flood forecasting. Water 11:1387

    Google Scholar 

  20. Güldal V, Tongal H (2010) Comparison of recurrent neural network, adaptive neuro-fuzzy inference system and stochastic models in Eğirdir Lake level forecasting. Water Resour Manag 24:105–128

    Google Scholar 

  21. Kisi O, Shiri J, Nikoofar B (2012) Forecasting daily lake levels using artificial intelligence approaches. Comput Geosci 41:169–180

    Google Scholar 

  22. Khan MS, Coulibaly P (2006) Application of support vector machine in lake water level prediction. J Hydrol Eng 11:199–205

    Google Scholar 

  23. Pasupa K, Jungjareantrat S (2016) Water levels forecast in Thailand: A case study of Chao Phraya River. 2016 14th International Conference on Control, Automation, Robotics and Vision (ICARCV): IEEE. 1–6

  24. Yu Z, Lei G, Jiang Z, Liu F (2017) ARIMA modelling and forecasting of water level in the middle reach of the Yangtze River. 2017 4th International Conference on Transportation Information and Safety (ICTIS): IEEE. 172–7

  25. Galavi H, Mirzaei M, Shul LT, Valizadeh N (2013) Klang River–level forecasting using ARIMA and ANFIS models. J Am Water Works Ass 105:E496–E506

    Google Scholar 

  26. Seo Y, Choi E, Yeo W (2017) Reservoir water level forecasting using machine learning models. J Korean Soc Agricult Eng 59:97–110

    Google Scholar 

  27. Yang J-H, Cheng C-H, Chan C-P (2017) A time-series water level forecasting model based on imputation and variable selection method. Comput Intell Neurosci. https://doi.org/10.1155/2017/8734214

    Article  Google Scholar 

  28. Ghorbani MA, Khatibi R, Aytek A, Makarynskyy O, Shiri J (2010) Sea water level forecasting using genetic programming and comparing the performance with artificial neural networks. Comput Geosci 36:620–627

    Google Scholar 

  29. Karimi S, Shiri J, Kisi O, Makarynskyy O (2012) Forecasting water level fluctuations of Urmieh Lake using gene expression programming and adaptive neuro-fuzzy inference system. Int J Ocean Clim Syst 3:109–125

    Google Scholar 

  30. Zhang GP (2003) Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50:159–175

    MATH  Google Scholar 

  31. Seo Y, Kim S, Kisi O, Singh VP (2015) Daily water level forecasting using wavelet decomposition and artificial intelligence techniques. J Hydrol 520:224–243

    Google Scholar 

  32. Zhong C, Jiang Z, Chu X, Guo T, Wen Q (2019) Water level forecasting using a hybrid algorithm of artificial neural networks and local Kalman filtering. Proc Inst Mech Eng Part M J Eng Marit Environ 233:174–185

    Google Scholar 

  33. Xu G, Cheng Y, Liu F, Ping P, Sun J (2019) A water level prediction model based on ARIMA-RNN. 2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService): IEEE. 221–6

  34. Ghorbani MA, Deo RC, Karimi V, Yaseen ZM, Terzi O (2018) Implementation of a hybrid MLP-FFA model for water level prediction of Lake Egirdir, Turkey. Stoch Environ Res Risk Assess 32:1683–1697

    Google Scholar 

  35. Kisi O, Shiri J, Karimi S, Shamshirband S, Motamedi S, Petković D et al (2015) A survey of water level fluctuation predicting in Urmia Lake using support vector machine with firefly algorithm. Appl Math Comput 270:731–743

    Google Scholar 

  36. Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: a review. J Hydrol 598:126266

    Google Scholar 

  37. Kim D, Yu H, Lee H, Beighley E, Durand M, Alsdorf DE et al (2019) Ensemble learning regression for estimating river discharges using satellite altimetry data: Central Congo River as a Test-bed. Remote Sens Environ 221:741–755

    Google Scholar 

  38. Naghibi SA, Dolatkordestani M, Rezaei A, Amouzegari P, Heravi MT, Kalantar B et al (2019) Application of rotation forest with decision trees as base classifier and a novel ensemble model in spatial modeling of groundwater potential. Environ Monit Assess 191:1–20

    Google Scholar 

  39. Liu S, Xu J, Zhao J, Xie X, Zhang W (2014) Efficiency enhancement of a process-based rainfall–runoff model using a new modified AdaBoost. RT Tech Appl Soft Comput 23:521–529

    Google Scholar 

  40. Elkiran G, Nourani V, Abba S (2019) Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J Hydrol 577:123962

    Google Scholar 

  41. Salam R, Islam ARMT (2020) Potential of RT, Bagging and RS ensemble learning algorithms for reference evapotranspiration prediction using climatic data-limited humid region in Bangladesh. J Hydrol 590:125241

    Google Scholar 

  42. Zhao Y, Li Y, Zhang L, Wang Q (2016) Groundwater level prediction of landslide based on classification and regression tree. Geod Geodyn 7:348–355

    Google Scholar 

  43. Shiri J, Shamshirband S, Kisi O, Karimi S, Bateni SM, Nezhad SHH et al (2016) Prediction of water-level in the Urmia Lake using the extreme learning machine approach. Water Resour Manag 30:5217–5229

    Google Scholar 

  44. Deo RC, Şahin M (2016) An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland. Environ Monit Assess 188:90

    Google Scholar 

  45. Brieman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. CRC Press, New York

    Google Scholar 

  46. Gomes CMA, Jelihovschi E (2020) Presenting the regression tree method and its application in a large-scale educational dataset. Int J Res Method Edu 43:201–221

    Google Scholar 

  47. Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501

    Google Scholar 

  48. Atiquzzaman M, Kandasamy J (2016) Prediction of hydrological time-series using extreme learning machine. J Hydroinf 18:345–353

    Google Scholar 

  49. Rezaie-Balf M, Kisi O (2018) New formulation for forecasting streamflow: evolutionary polynomial regression vs. extreme learning machine. Hydrol Res 49:939–953

    Google Scholar 

  50. Breiman L (1996) Bagging predictors. Mach Learn 24:123–140

    MATH  Google Scholar 

  51. Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev 33:1–39

    Google Scholar 

  52. Alfaro E, Gamez M, Garcia N (2013) Adabag: an R package for classification with boosting and bagging. J Stat Softw 54:1–35

    Google Scholar 

  53. Breiman L (2001) Random forests. Mach Learn 45:5–32

    MATH  Google Scholar 

  54. Dong X, Yu Z, Cao W, Shi Y, Ma Q (2020) A survey on ensemble learning. Front Comput Sci 14:241–258

    Google Scholar 

  55. Khosravi K, Cooper JR, Daggupati P, Pham BT, Bui DT (2020) Bedload transport rate prediction: application of novel hybrid data mining techniques. J Hydrol 585:124774

    Google Scholar 

  56. Mosavi A, Golshan M, Janizadeh S, Choubin B, Melesse AM, Dineva AA (2020) Ensemble models of GLM, FDA, MARS, and RF for flood and erosion susceptibility mapping: a priority assessment of sub-basins. Geocarto Int 37:2541–60

    Google Scholar 

  57. Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20:832–844

    Google Scholar 

  58. Nhu V-H, Khosravi K, Cooper JR, Karimi M, Kisi O, Pham BT et al (2020) Monthly suspended sediment load prediction using artificial intelligence: testing of a new random subspace method. Hydrol Sci J 65:2116–2127

    Google Scholar 

  59. Chen W, Zhao X, Tsangaratos P, Shahabi H, Ilia I, Xue W et al (2020) Evaluating the usage of tree-based ensemble methods in groundwater spring potential mapping. J Hydrol 583:124602

    Google Scholar 

  60. Tiwari MK, Chatterjee C (2011) A new wavelet–bootstrap–ANN hybrid model for daily discharge forecasting. J Hydroinf 13:500–519

    Google Scholar 

  61. Moriasi DN, Arnold JG, Van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50:885–900

    Google Scholar 

  62. Ayele GT, Teshale EZ, Yu B, Rutherfurd ID, Jeong J (2017) Streamflow and sediment yield prediction for watershed prioritization in the Upper Blue Nile River Basin. Ethiopia Water 9:782

    Google Scholar 

  63. Xie Q, Yang J, Lundström TS (2021) Sediment and morphological changes along Yangtze River’s 500 km between Datong and Xuliujing before and after Three Gorges Dam commissioning. Sci Rep 11:1–17

    Google Scholar 

  64. Li S, Yang J, Ansell A (2021) Discharge prediction for rectangular sharp-crested weirs by machine learning techniques. Flow Meas Instrum 79:101931

    Google Scholar 

  65. Roushangar K, Ghasempour R (2019) Evaluation of the impact of channel geometry and rough elements arrangement in hydraulic jump energy dissipation via SVM. J Hydroinf 21:92–103

    Google Scholar 

  66. Mehr AD (2018) An improved gene expression programming model for streamflow forecasting in intermittent streams. J Hydrol 563:669–678

    Google Scholar 

  67. Rahmani-Rezaeieh A, Mohammadi M, Mehr AD (2020) Ensemble gene expression programming: a new approach for evolution of parsimonious streamflow forecasting model. Theor Appl Climatol 139:549–564

    Google Scholar 

  68. Sattar AM, Ertuğrul ÖF, Gharabaghi B, McBean EA, Cao J (2019) Extreme learning machine model for water network management. Neural Comput Appl 31:157–169

    Google Scholar 

  69. Matlab. Matlab R2021a (2021) Natick. The Mathworks, Inc, Massachusetts

  70. Valipour M, Banihabib ME, Behbahani SMR (2013) Comparison of the ARMA, ARIMA, and the autoregressive artificial neural network models in forecasting the monthly inflow of Dez dam reservoir. J Hydrol 476:433–441

    Google Scholar 

  71. Heo KY, Ha KJ, Yun KS, Lee SS, Kim HJ, Wang B (2014) Methods for uncertainty assessment of climate models and model predictions over East Asia. Int J Climatol 34:377–390

    Google Scholar 

  72. Cheng H, Tan P-N, Gao J, Scripps J (2006) Multistep-ahead time series prediction. Pacific-Asia Conference on Knowledge Discovery and Data Mining: Springer. 765-74

  73. Barzegar R, Moghaddam AA, Adamowski J, Ozga-Zielinski B (2018) Multi-step water quality forecasting using a boosting ensemble multi-wavelet extreme learning machine model. Stoch Environ Res Risk Assess 32:799–813

    Google Scholar 

  74. Tyralis H, Papacharalampous G, Langousis A (2021) Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms. Neural Comput Appl 33:3053–3068

    Google Scholar 

  75. Zounemat-Kermani M, Stephan D, Barjenbruch M, Hinkelmann R (2020) Ensemble data mining modeling in corrosion of concrete sewer: a comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. Adv Eng Inf 43:101030

    Google Scholar 

  76. Kim S, Alizamir M, Zounemat-Kermani M, Kisi O, Singh VP (2020) Assessing the biochemical oxygen demand using neural networks and ensemble tree approaches in South Korea. J Environ Manag 270:110834

    Google Scholar 

  77. Pandey M, Jamei M, Karbasi M, Ahmadianfar I, Chu X (2021) Prediction of maximum scour depth near spur dikes in uniform bed sediment using stacked generalization ensemble tree-based frameworks. J Irrig Drain Eng 147:04021050

    Google Scholar 

  78. Pan M, Zhou H, Cao J, Liu Y, Hao J, Li S et al (2020) Water level prediction model based on GRU and CNN. IEEE Access 8:60090–60100

    Google Scholar 

  79. Liu D, Jiang W, Mu L, Wang S (2020) Streamflow prediction using deep learning neural network: case study of Yangtze River. IEEE Access 8:90069–90086

    Google Scholar 

  80. Xu W, Jiang Y, Zhang X, Li Y, Zhang R, Fu G (2020) Using long short-term memory networks for river flow prediction. Hydrol Res 51:1358–1376

    Google Scholar 

Download references

Acknowledgements

The study is funded by the Swedish Hydropower Center (SVC). It is part of research projects “Two-phase flow modeling: evaluations and simulations for safer spillway discharge” and “Quality and trust of numerical modeling of water-air flows for safe spillway discharge”, with James Yang and Anders Ansell as project leaders. SVC has been established by the Swedish Energy Agency, Energiforsk and Svenska Kraftnät, together with Royal Institute of Technology (KTH), Luleå University of Technology (LTU), Uppsala University (UU) and Chalmers University of Technology (CTH). Participating companies and industry associations include AFRY, Andritz Hydro, Boliden, Fortum Generation, Holmen Energi, Jämtkraft, Karlstads Energi, LKAB, Mälarenergi, Norconsult, Rainpower, Skellefteå Kraft, Sollefteåforsens, Statkraft Sverige, Sweco Energuide, Sweco Infrastructure, Tekniska verken i Linköping, Uniper, Vattenfall R&D, Vattenfall Vattenkraft, Voith Hydro, WSP Sverige and Zinkgruvan. The assistance from Holger Ecke of Vattenfall R&D and Qiancheng Xie of LTU is acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shicheng Li.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, S., Yang, J. Improved river water-stage forecasts by ensemble learning. Engineering with Computers 39, 3293–3311 (2023). https://doi.org/10.1007/s00366-022-01751-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00366-022-01751-1

Keywords

Navigation