Skip to main content
Log in

Monthly River Discharge Forecasting Using Hybrid Models Based on Extreme Gradient Boosting Coupled with Wavelet Theory and Lévy–Jaya Optimization Algorithm

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

River discharge represents critical hydrological data that can be used to monitor the hydrological status of a river basin. The objective of this study was to forecast the monthly river discharge time-series of two gauging hydrometric sites (USGS 06054500 and USGS 06090800) located on the Missouri River, USA. The forecast was performed using two machine learning models based on extreme gradient boosting (XGB) and K-nearest neighbors (KNN). XGB outperformed the KNN framework in forecasting the river flow. Subsequently, wavelet (W) analysis was incorporated to develop the hybrid W-XGB and W-KNN approaches. Finally, two novel hybrid models were established through the hybridization of XGB and the Lévy–Jaya optimization algorithm (LJA) and simultaneous integration of the wavelet analysis and LJA with the XGB, i.e., XGB-LJA and W-XGB-LJA, respectively. The performances of the models were evaluated using the root mean square error (RMSE), mean absolute error (MAE), mean bias error (MBE), determination coefficient (R ), and Nash–Sutcliffe efficiency (NSE). In the test phase, the best discharge forecasts at USGS 06054500 and USGS 06090800 were obtained using the hybrid WXGB2-LJA (RMSE = 41.303 m /s, MAE = 28.752 m /s, MBE = 3.377 m /s, R = 0.819, NSE = 0.800) and W-XGB4-LJA (RMSE = 39.310 m /s, MAE = 26.804 m /s, MBE = 1.489 m3/s, R = 0.897, NSE = 0.885), respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data Availability

The public data used in this study is available at https://waterdata.usgs.gov/nwis.

References

  • Al-Juboori AM (2021) A hybrid model to predict monthly streamflow using neighboring rivers annual flows. Water Resour Manag 35(2):729–743

    Google Scholar 

  • Apaydin H, Sattari MT, Falsafian K, Prasad R (2021) Artificial intelligence modelling integrated with singular spectral analysis and seasonal-trend decomposition using Loess approaches for streamflow predictions. J Hydrol 600:126506

    Google Scholar 

  • Bakhshi Ostadkalayeh F, Moradi S, Asadi A, Moghaddam Nia A, Taheri S (2023) Performance improvement of LSTM-based deep learning model for streamflow forecasting using Kalman filtering. Water Resour Manage. https://doi.org/10.1007/s11269-023-03492-2

    Article  Google Scholar 

  • Ch S, Anand N, Panigrahi BK, Mathur S (2013) Streamflow forecasting by SVM with quantum behaved particle swarm optimization. Neurocomput 101:18–23

    Google Scholar 

  • Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794)

  • Cheng D, Zhang S, Deng Z, Zhu Y, Zong M (2014) kNN algorithm with data-driven k value. In International Conference on Advanced Data Mining and Applications (pp. 499–512). Springer, Cham

  • Cheng X, Feng ZK, Niu WJ (2020) Forecasting monthly runoff time series by single-layer feedforward artificial neural network and grey wolf optimizer. IEEE Access 8:157346–157355

    Google Scholar 

  • Chu H, Wei J, Qiu J (2018) Monthly streamflow forecasting using EEMD-Lasso-DBN method based on multi-scale predictors selection. Water 10(10):1486

    Google Scholar 

  • Dariane AB, Azimi S (2018) Streamflow forecasting by combining neural networks and fuzzy models using advanced methods of input variable selection. J Hydroinforma 20(2):520–532

    Google Scholar 

  • Fang W, Huang S, Ren K, Huang Q, Huang G, Cheng G, Li K (2019) Examining the applicability of different sampling techniques in the development of decomposition-based streamflow forecasting models. J Hydrol 568:534–550

    Google Scholar 

  • Ferreira RG, da Silva DD, Elesbon AAA, Fernandes-Filho EI, Veloso GV, de Souza FM, Ferreira LB (2021) Machine learning models for streamflow regionalization in a tropical watershed. J Environ Manag 280:111713

    Google Scholar 

  • Freire PKDMM, Santos CAG, da Silva GBL (2019) Analysis of the use of discrete wavelet transforms coupled with ANN for short-term streamflow forecasting. Appl Soft Comput 80:494–505

    Google Scholar 

  • Girihagama L, Naveed Khaliq M, Lamontagne P, Perdikaris J, Roy R, Sushama L, Elshorbagy A (2022) Streamflow modelling and forecasting for Canadian watersheds using LSTM networks with attention mechanism. Neural Comput Appl 34:1–21

    Google Scholar 

  • Granata F, Di Nunno F, de Marinis G (2022) Stacked machine learning algorithms and bidirectional long short-term memory networks for multi-step ahead streamflow forecasting: A comparative study. J Hydrol 613:128431

    Google Scholar 

  • Guo J, Zhou J, Qin H, Zou Q, Li Q (2011) Monthly streamflow forecasting based on improved support vector machine model. Expert Syst Appl 38(10):13073–13081

    Google Scholar 

  • Guo Y, Xu YP, Xie J, Chen H, Si Y, Liu J (2021) A weights combined model for middle and long-term streamflow forecasts and its value to hydropower maximization. J Hydrol 602:126794

    Google Scholar 

  • Ha S, Liu D, Mu L (2021) Prediction of Yangtze River streamflow based on deep learning neural network with El Niño-Southern Oscillation. Sci Rep 11(1):1–23

    Google Scholar 

  • Haznedar B, Kilinc HC (2022) A hybrid ANFIS-GA approach for estimation of hydrological time series. Water Resour Manag 36(12):4819–4842

    Google Scholar 

  • Huang S, Chang J, Huang Q, Chen Y (2014) Monthly streamflow prediction using modified EMD-based support vector machine. J Hydrol 511:764–775

    Google Scholar 

  • Hussain D, Hussain T, Khan AA, Naqvi SAA, Jamil A (2020) A deep learning approach for hydrological time-series prediction: A case study of Gilgit river basin. Earth Sci Informa 13(3):915–927

    Google Scholar 

  • Jiang Y, Bao X, Hao S, Zhao H, Li X, Wu X (2020) Monthly streamflow forecasting using ELM-IPSO based on phase space reconstruction. Water Resour Manag 34(11):3515–3531

    Google Scholar 

  • Kambalimath SS, Deka PC (2021) Performance enhancement of SVM model using discrete wavelet transform for daily streamflow forecasting. Environ Earth Sci 80(3):1–16

    Google Scholar 

  • Khosravi K, Golkarian A, Tiefenbacher JP (2022) Using optimized deep learning to predict daily streamflow: A comparison to common machine learning algorithms. Water Resour Manag 36(2):699–716

    Google Scholar 

  • Kim T, Yang T, Gao S, Zhang L, Ding Z, Wen X, ... Hong Y (2021) Can artificial intelligence and data-driven machine learning models match or even replace process-driven hydrologic models for streamflow simulation?: A case study of four watersheds with different hydro-climatic regions across the CONUS. J Hydrol 598:126423

    Google Scholar 

  • Le XH, Nguyen DH, Jung S, Yeon M, Lee G (2021) Comparison of deep learning techniques for river streamflow forecasting. IEEE Access 9:71805–71820

    Google Scholar 

  • Lee G, Gommers R, Waselewski F, Wohlfahrt K, O’Leary A (2019) PyWavelets: A Python package for wavelet analysis. J Open Sour Softw 4(36):1237

    Google Scholar 

  • Li BJ, Sun GL, Liu Y, Wang WC, Huang XD (2022) Monthly runoff forecasting using variational mode decomposition coupled with gray wolf optimizer-based long short-term memory neural networks. Water Resour Manag 36(6):2095–2115

    Google Scholar 

  • Li S, Yang J (2022) Modelling of suspended sediment load by Bayesian optimized machine learning methods with seasonal adjustment. Eng Appl Comput Fluid Mech 16(1):1883–1901

    Google Scholar 

  • Li W, Peng X, Cheng K, Wang H, Xu Q, Wang B, Che J (2020a) A short-term regional wind power prediction method based on XGBoost and multi-stage features selection. In 2020a IEEE 3rd Student Conference on Electrical Machines and Systems (SCEMS) (pp. 614–618). IEEE

  • Li Y, Liang Z, Hu Y, Li B, Xu B, Wang D (2020b) A multi-model integration method for monthly streamflow prediction: modified stacking ensemble strategy. J Hydroinform 22(2):310–326

    Google Scholar 

  • Lian Y, Luo J, Xue W, Zuo G, Zhang S (2022) Cause-driven streamflow forecasting framework based on linear correlation reconstruction and long short-term memory. Water Resour Manag 36(5):1661–1678

    Google Scholar 

  • Liu X, Lin Z (2021) Impact of COVID-19 pandemic on electricity demand in the UK based on multivariate time series forecasting with bidirectional long short term memory. Energy 227:120455

    Google Scholar 

  • Mirzaei M, Yu H, Dehghani A, Galavi H, Shokri V, Mohsenzadeh Karimi S, Sookhak M (2021) A novel stacked long short-term memory approach of deep learning for streamflow simulation. Sustainability 13(23):13384

    Google Scholar 

  • Nguyen DH, Le XH, Anh DT, Kim SH, Bae DH (2022) Hourly streamflow forecasting using a Bayesian additive regression tree model hybridized with a genetic algorithm. J Hydrol 606:127445

    Google Scholar 

  • Niu WJ, Feng ZK (2021) Evaluating the performances of several artificial intelligence methods in forecasting daily streamflow time series for sustainable water resources management. Sust Cities Soci 64:102562

    Google Scholar 

  • Niu WJ, Feng ZK, Cheng CT, Zhou JZ (2018) Forecasting daily runoff by extreme learning machine based on quantum-behaved particle swarm optimization. J Hydrol Eng 23(3):04018002

    Google Scholar 

  • Pache R, Rung T (2022) Data-driven surrogate modeling of aerodynamic forces on the superstructure of container vessels. Eng Appl Comput Fluid Mech 16(1):746–763

    Google Scholar 

  • Panahi F, Ehteram M, Ahmed AN, Huang YF, Mosavi A, El-Shafie A (2021) Streamflow prediction with large climate indices using several hybrid multilayer perceptrons and copula Bayesian model averaging. Ecol Indic 133:108285

    Google Scholar 

  • Parisouj P, Mohebzadeh H, Lee T (2020) Employing machine learning algorithms for streamflow prediction: a case study of four river basins with different climatic zones in the United States. Water Resour Manag 34(13):4113–4131

    Google Scholar 

  • Rahmani-Rezaeieh A, Mohammadi M, Danandeh Mehr A (2020) Ensemble gene expression programming: a new approach for evolution of parsimonious streamflow forecasting model. Theor Appl Climatol 139(1):549–564

    Google Scholar 

  • Rao R (2016) Jaya: A simple and new optimization algorithm for solving constrained and unconstrained optimization problems. Int J Indust Eng Comput 7(1):19–34

    Google Scholar 

  • Rehamnia I, Benlaoukli B, Heddam S (2020) Modeling of seepage flow through concrete face rockfill and embankment dams using three heuristic artificial intelligence approaches: a comparative study. Environ Process 7(1):367–381

    Google Scholar 

  • Rezaei F, Ghorbani R, Mahjouri N (2022) Improving daily and monthly river discharge forecasts using geostatistical ensemble modeling. Water Resour Manag 36(13):5063–5089

    Google Scholar 

  • Roy DK (2021) Long short-term memory networks to predict one-step ahead reference evapotranspiration in a subtropical climatic zone. Environ Process 8:911–941

    Google Scholar 

  • Saraiva SV, de Oliveira CF, Santos CAG, Barreto LC, Freire PKDMM (2021) Daily streamflow forecasting in Sobradinho Reservoir using machine learning models coupled with wavelet transform and bootstrapping. Appl Soft Comput 102:107081

    Google Scholar 

  • Sharma PJ, Patel PL, Jothiprakash V (2021) Model tree technique for streamflow forecasting: a case study in sub-catchment of Tapi River Basin, India. In Advances in Streamflow Forecasting (pp. 215–237). Elsevier

  • Shu X, Ding W, Peng Y, Wang Z, Wu J, Li M (2021) Monthly streamflow forecasting using convolutional neural network. Water Resour Manag 35(15):5089–5104

    Google Scholar 

  • Shu X, Peng Y, Ding W, Wang Z, Wu J (2022) Multi-step-ahead monthly streamflow forecasting using convolutional neural networks. Water Resour Manag 36(11):3949–3964

    Google Scholar 

  • Tayyab M, Zhou J, Dong X, Ahmad I, Sun N (2019) Rainfall-runoff modeling at Jinsha River basin by integrated neural network with discrete wavelet transform. Meteorol Atmos Phys 131(1):115–125

    Google Scholar 

  • Terzi Ö, Ergin G (2014) Forecasting of monthly river flow with autoregressive modeling and data-driven techniques. Neural Comput Appl 25(1):179–188

    Google Scholar 

  • Tofiq YM, Latif SD, Ahmed AN, Kumar P, El-Shafie A (2022) Optimized model inputs selections for enhancing river streamflow forecasting accuracy using different artificial intelligence techniques. Water Resour Manag 1–18

  • Tongal H, Booij MJ (2022) Simulated annealing coupled with a Naive Bayes model and base flow separation for streamflow simulation in a snow dominated basin. Stoch Environ Res Risk Assess 1–24

  • Viswanathan GM, Afanasyev V, Buldyrev SV, Murphy EJ, Prince PA, Stanley HE (1996) Lévy flight search patterns of wandering albatrosses. Nature 381(6581):413–415

    Google Scholar 

  • Wang J, Wang X, Hui Lei X, Wang H, Hua Zhang X, Jun You J, ... Lian Liu X (2020) Teleconnection analysis of monthly streamflow using ensemble empirical mode decomposition. J Hydrol 582:124411

    Google Scholar 

  • Wang L, Guo Y, Fan M (2022) Improving annual streamflow prediction by extracting information from high-frequency components of streamflow. Water Resour Manag 36(12):4535–4555

    Google Scholar 

  • Xu W, Chen J, Zhang XJ (2022) Scale effects of the monthly streamflow prediction using a state-of-the-art deep learning model. Water Resour Manag 36(10):3609–3625

    Google Scholar 

  • Yafouz A, Ahmed AN, Zaini NA, Sherif M, Sefelnasr A, El-Shafie A (2021) Hybrid deep learning model for ozone concentration prediction: comprehensive evaluation and comparison with various machine and deep learning algorithms. Eng Appl Comput Fluid Mech 15(1):902–933

    Google Scholar 

  • Zhao X, Lv H, Lv S, Sang Y, Wei Y, Zhu X (2021) Enhancing robustness of monthly streamflow forecasting model using gated recurrent unit based on improved grey wolf optimizer. J Hydrol 601:126607

    Google Scholar 

Download references

Funding

This research was funded by Key the National Natural Science Foundation of China under Grant No.61862051; the Science and Technology Foundation of Guizhou Province under Grant No.ZK[2022]549; the Natural Science Foundation of Education of Guizhou province under Grant No.s([2019]203, KY[2019]067); the program of Qiannan Normal University for Nationalities under Grant Nos. (qnsy2018003, qnsy2019rc09).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Shahab S. Band or Changhyun Jun.

Ethics declarations

Ethics Approval

Not Applicable.

Consent to Participate

Not Applicable.

Consent to Publish

Not Applicable.

Competing Interests

There is not any competing interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, J., Wang, D., Band, S.S. et al. Monthly River Discharge Forecasting Using Hybrid Models Based on Extreme Gradient Boosting Coupled with Wavelet Theory and Lévy–Jaya Optimization Algorithm. Water Resour Manage 37, 3953–3972 (2023). https://doi.org/10.1007/s11269-023-03534-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-023-03534-9

Keywords

Navigation