Abstract
In this study, four individual models namely Hammerstein-Weiner (HW), Extreme Learning Machine (ELM), Long Short-Term Memory (LSTM) and Least Square Support Vector Machine (LSSVM) were utilized for modeling the SSL of Katar catchment in Ethiopia. Then, two strategies were applied to improve the overall predictive accuracy. The first strategy involves the development of four ensemble techniques such as simple average ensemble (SE), weighted average ensemble (WE), neuro-fuzzy ensemble (NFE) and HW ensemble (HWE) using the SSL output of individual models. In the second strategy, a hybrid Extreme Gradient Boosting (XGB) model was developed to improve the performance of the base models. The accuracy of the models was evaluated using the percent bias (Pbias), Nash–Sutcliffe coefficient efficiency (NSE), mean absolute error (MAE), root mean square error (RMSE) and determination coefficient (R2). The result showed that the HW model provided the best modeling result among the individual models with NSE and Pbias values of 0.922 and -8.8%, respectively, for the testing period. Among the ensemble techniques, NFE provided the best performance by increasing the NSE values of the individual models by 4.88% to 62.52%, during the testing period. Examination of the hybrid XGB models showed that all hybrid models performed reliably, with the HW-XGB model achieving the best predictive performance (NSE = 0.989). Overall, the results of this study showed the promising potential of ensemble and hybrid XGB models for SSL modeling in an agricultural catchment.
Similar content being viewed by others
Availability of Data
Available from the corresponding author upon reasonable request.
References
Abba SI, Linh NTT, Abdullahi J, Ali SIA, Pham QB, Abdulkadir RA, Costache R, Nam VT, Anh DT (2020) Hybrid machine learning ensemble techniques for modeling dissolved oxygen concentration. IEEE Access 8:157218–157237. https://doi.org/10.1109/ACCESS.2020.3017743
Adnan RM, Liang Z, Trajkovic S, Zounemat-Kermani M, Li B, Kisi O (2019) Daily streamflow prediction using optimally pruned extreme learning machine. J Hydrol 577(July):123981. https://doi.org/10.1016/j.jhydrol.2019.123981
Adulaimi AAA, Pradhan B, Chakraborty S, Alamri A (2021) Based on Machine Learning. Statistical Regression and GIS Energies 14(5095):1–19
Afan HA, El-Shafie A, Yaseen ZM, Hameed MM, Mohtar WHMW, Hussain A (2014) ANN Based Sediment Prediction Model Utilizing Different Input Scenarios. Water Resour Manage 29(4):1231–1245. https://doi.org/10.1007/s11269-014-0870-1
Aga AO, Melesse AM, Chane B (2018) Estimating the sediment flux and budget for a data limited Rift Valley Lake in Ethiopia. Hydrology 6(1):1. https://doi.org/10.3390/hydrology6010001
Al-Mukhtar M (2021) Modeling the monthly pan evaporation rates using artificial intelligence methods: a case study in Iraq. Environ Earth Sci 80(1):1–14. https://doi.org/10.1007/s12665-020-09337-0
Aliye MA, Aga AO, Tadesse T, Yohannes P (2020) Evaluating the Performance of HEC-HMS and SWAT Hydrological Models in Simulating the Rainfall-Runoff Process for Data Scarce Region of Ethiopian Rift Valley Lake Basin. Open Journal of Modern Hydrology 10(04):105–122. https://doi.org/10.4236/ojmh.2020.104007
Asnake Metekia W, Garba Usman A, Hatice Ulusoy B, Isah Abba S, Chirkena Bali K (2022) Artificial intelligence-based approaches for modeling the effects of spirulina growth mediums on total phenolic compounds. Saudi J Biol Sci 29(2):1111–1117. https://doi.org/10.1016/j.sjbs.2021.09.055
Baba NM, Makhtar M, Fadzli SA, Awang MK (2015) Current issues in ensemble methods and its applications. J Theor Appl Inf Technol 81(2):266–276
Bhattacharya B, Price RK, Solomatine DP (2005) Data-Driven Modelling in the Context of Sediment Transport 30:297–302. https://doi.org/10.1016/j.pce.2004.12.001
Buyukyildiz M, Kumcu SY (2017) An Estimation of the Suspended Sediment Load Using Adaptive Network Based Fuzzy Inference System, Support Vector Machine and Artificial Neural Network Models. Water Resour Manage 31(4):1343–1359. https://doi.org/10.1007/s11269-017-1581-1
Cavadias G, Morin G (1986) The Combination of Simulated Discharges of Hydrological Models. Nord Hydrol 17:21–32
Dawson CW, Abrahart RJ, See LM (2007) HydroTest: A web-based toolbox of evaluation metrics for the standardised assessment of hydrological forecasts. Environ Model Softw 22(7):1034–1052. https://doi.org/10.1016/j.envsoft.2006.06.008
Doroudi S, Sharafati A, Mohajeri SH (2021) Estimation of Daily Suspended Sediment Load Using a Novel Hybrid Support Vector Regression Model Incorporated with Observer-Teacher-Learner-Based Optimization Method. Complexity. https://doi.org/10.1155/2021/5540284
Elkiran G, Nourani V, Abba S (2019) Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J Hydrol 577(June):123962. https://doi.org/10.1016/j.jhydrol.2019.123962
Farzin S, Anaraki MV (2021) Modeling and predicting suspended sediment load under climate change conditions: A new hybridization strategy. J Water Clim Chang 12(6):2422–2443. https://doi.org/10.2166/wcc.2021.317
Fenicia F, Solomatine DP, Savenije HHG, Matgen P (2007) Soft combination of local models in a multi-objective framework. Hydrol Earth Syst Sci 11(6):1797–1809. https://doi.org/10.5194/hess-11-1797-2007
Gao P, Du W, Lei Q, Li J, Zhang S, Li N (2023) NDVI Forecasting Model Based on the Combination of Time Series Decomposition and CNN – LSTM. Water Resour Manage 37(4):1481–1497. https://doi.org/10.1007/s11269-022-03419-3
Gaya MS, Zango MU, Yusuf LA, Mustapha M, Muhammad B, Sani A, Tijjani A, Wahab NA, Khairi MTM (2017) Estimation of turbidity in water treatment plant using hammerstein-wiener and neural network technique. Indones J Electr Eng Comput Sci 5(3):666–672. https://doi.org/10.11591/ijeecs.v5.i3.pp666-672
Gelete G, Nourani V, Gokcekus H, Gichamo T (2023a) Ensemble physically based semi-distributed models for the rainfall-runoff process modeling in the data-scarce Katar catchment. Ethiopia Journal of Hydroinformatics 25(2):567. https://doi.org/10.2166/hydro.2023.197
Gelete G, Nourani V, Gokcekus H, Gichamo T (2023b) Physical and artificial intelligence-based hybrid models for rainfall-runoff-sediment process modelling. Hydrol Sci J 0(0). https://doi.org/10.1080/02626667.2023.2241850
Ghali UM, Usman AG, Chellube ZM, Degm MAA, Hoti K, Umar H, Abba SI (2020) Advanced chromatographic technique for performance simulation of anti-Alzheimer agent: an ensemble machine learning approach. SN Appl Sci 2(11):1–12. https://doi.org/10.1007/s42452-020-03690-2
Ghanbari-Adivi E, Ehteram M, Farrokhi A, Sheikh Khozani Z (2022) Combining Radial Basis Function Neural Network Models and Inclusive Multiple Models for Predicting Suspended Sediment Loads. Water Resour Manage 36(11):4313–4342. https://doi.org/10.1007/s11269-022-03256-4
Ghasempour R, Roushangar K, Sihag P (2021) Suspended sediment load prediction in consecutive stations of river based on ensemble pre-post-processing kernel based approaches. Water Supply 21(7):3370–3386. https://doi.org/10.2166/ws.2021.094
Ghorbani MA, Deo RC, Yaseen ZM, Kashani HM, Mohammadi B (2018) Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: case study in North Iran. Theoret Appl Climatol 133(3–4):1119–1131. https://doi.org/10.1007/s00704-017-2244-0
Gu J, Liu S, Zhou Z, Chalov SR, Zhuang Q (2022) A stacking ensemble learning model for monthly rainfall prediction in the Taihu Basin, China. Water 14(192):1–20. https://doi.org/10.3390/w140304923
Hadi SJ, Tombul M (2018) Forecasting Daily Streamflow for Basins with Different Physical Characteristics through Data-Driven Methods. Water Resour Manage 32(10):3405–3422. https://doi.org/10.1007/s11269-018-1998-1
Harmel RD, Smith PK, Migliaccio KW, Chaubey I, Douglas-Mankin KR, Benham B, Shukla S, Muñoz-Carpena R, Robson BJ (2014) Evaluating, interpreting, and communicating performance of hydrologic/water quality models considering intended use: a review and recommendations. Environ Model Softw 57:40–51. https://doi.org/10.1016/j.envsoft.2014.02.013
Hazarika BB, Gupta D, Berlin M (2020) Modeling suspended sediment load in a river using extreme learning machine and twin support vector regression with wavelet conjunction. Environ Earth Sci 79(10):1–15. https://doi.org/10.1007/s12665-020-08949-w
Himanshu SK, Pandey A, Yadav B (2017) Ensemble wavelet-support vector machine approach for prediction of suspended sediment load using hydrometeorological data. J Hydrol Eng 22(7):05017006
Huang G, Wang DH, Lan Y (2011) Extreme learning machines: a survey. Int J Mach Learn Cyber 2:107–122. https://doi.org/10.1007/s13042-011-0019-y
Huang G, Zhu Q, Siew C (2006) Extreme learning machine: Theory and applications. Neurocomputing 70:489–501. https://doi.org/10.1016/j.neucom.2005.12.126
Kakaei LE, Moghaddam Nia A, Ahmadi A (2013) Daily suspended sediment load prediction using artificial neural networks and support vector machines. J Hydrol 478:50–62. https://doi.org/10.1016/j.jhydrol.2012.11.048
Kaveh K, Kaveh H, Bui MD, Rutschmann P (2021) Long short-term memory for predicting daily suspended sediment concentration. In Engineering with Computers (Vol. 37, Issue 3, pp. 2013–2027). https://doi.org/10.1007/s00366-019-00921-y
Khosravi K, Golkarian A, Melesse AM, Deo RC (2022) Suspended sediment load modeling using advanced hybrid rotation forest based elastic network approach. J Hydrol 610(April):127963. https://doi.org/10.1016/j.jhydrol.2022.127963
Kiran RN, Ravi V (2008) Software reliability prediction by soft computing techniques. J Syst Softw 81(4):576–583. https://doi.org/10.1016/j.jss.2007.05.005
Kisi O, Dailr AH, Cimen M, Shiri J (2012) Suspended sediment modeling using genetic programming and soft computing techniques. J Hydrol 450–451:48–58. https://doi.org/10.1016/j.jhydrol.2012.05.031
Koch J, Schneider R (2022) Long short-term memory networks enhance rainfall-runoff modelling at the national scale of Denmark. GEUS Bull 49:1–7. https://doi.org/10.34194/geusb.v49.8292
Kratzert F, Klotz D, Brenner C, Schulz K, Herrnegger M (2018) Rainfall-runoff modelling using Long Short-Term Memory (LSTM) networks. Hydrol Earth Syst Sci 22(11):6005–6022. https://doi.org/10.5194/hess-22-6005-2018
Kumar A, Kumar P, Singh VK (2019) Evaluating Different Machine Learning Models for Runoff and Suspended Sediment Simulation. Water Resour Manage 33(3):1217–1231. https://doi.org/10.1007/s11269-018-2178-z
Lafdani EK, Nia AM, Ahmadi A (2013) Daily suspended sediment load prediction using artificial neural networks and support vector machines. J Hydrol 478:50–62. https://doi.org/10.1016/j.jhydrol.2012.11.048
Liu P, Wang J, Sangaiah AK, Xie Y, Yin X (2019) Analysis and Prediction of Water Quality Using LSTM Deep Neural Networks in IoT Environment. Sustainability 11(2058):1–14. https://doi.org/10.3390/su11072058
Malik A, Jamei M, Ali M, Prasad R, Karbasi M, Yaseen ZM (2022) Multi-step daily forecasting of reference evapotranspiration for different climates of India: A modern multivariate complementary technique reinforced with ridge regression feature selection. Agric Water Manag 272(March):107812. https://doi.org/10.1016/j.agwat.2022.107812
Moeeni H, Bonakdari H (2018) Impact of Normalization and Input on ARMAX-ANN Model Performance in Suspended Sediment Load Prediction. Water Resour Manage 32(3):845–863. https://doi.org/10.1007/s11269-017-1842-z
Moriasi DN, Gitau MW, Pai N, Daggupati P (2015) Hydrologic and water quality models: Performance measures and evaluation criteria. Trans ASABE 58(6):1763–1785. https://doi.org/10.13031/trans.58.10715
Natras R, Soja B, Schmidt M (2022) Ensemble Machine Learning of Random Forest, AdaBoost and XGBoost for Vertical Total Electron Content Forecasting. Remote Sens 14(3547):1–34
Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, Liu J (2020) Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol 586(March):124901. https://doi.org/10.1016/j.jhydrol.2020.124901
Niu W, Feng Z, Feng B, Min Y, Cheng C (2019a) Comparison of Multiple Linear Regression, Artificial Neural Network, Extreme Learning Machine, and Support Vector Machine in Deriving Operation Rule of Hydropower Reservoir. Water 11(1):88. https://doi.org/10.3390/w11010088
Niu W, Feng Z, Zeng M, Feng B, Min Y (2019b) Forecasting reservoir monthly runoff via ensemble empirical mode decomposition and extreme learning machine optimized by an improved gravitational search algorithm. Appl Soft Comput 82:105589. https://doi.org/10.1016/j.asoc.2019.105589
Nourani V, Elkiran G, Abba SI (2018) Wastewater treatment plant performance analysis using artificial intelligence – an ensemble approach. Water Sci Technol 78(10):2064–2076. https://doi.org/10.2166/wst.2018.477
Nourani V, Gokcekus H, Gelete G (2021a) Estimation of Suspended Sediment Load Using Artificial Intelligence-Based Ensemble Model. Complexity (Article ID 6633760):19. https://doi.org/10.1155/2021/6633760
Nourani V, Gökçekuş H, Umar IK (2020) Artificial intelligence based ensemble model for prediction of vehicular traffic noise. Environ Res 180(October 2019):108852. https://doi.org/10.1016/j.envres.2019.108852
Nourani V, Molajou A, Tajbakhsh AD, Najafi H (2019) A Wavelet Based Data Mining Technique for Suspended Sediment Load Modeling. Water Resour Manage 33(5):1769–1784. https://doi.org/10.1007/s11269-019-02216-9
Nourani V, Gökçekuş H, Gichamo T (2021b) Ensemble data-driven rainfall-runoff modeling using multi-source satellite and gauge rainfall data input fusion. Earth Sci Inf 14(4):1787–1808. https://doi.org/10.1007/s12145-021-00615-4
Olyaie E, Banejad H, Chau KW, Melesse AM (2015) A comparison of various artificial intelligence approaches performance for estimating suspended sediment load of river systems: a case study in United States. Environ Monit Assess 187(4). https://doi.org/10.1007/s10661-015-4381-1
Özger M, Kabataş MB (2015) Sediment load prediction by combined fuzzy logic-wavelet method. J Hydroinf 17(6):930–942. https://doi.org/10.2166/hydro.2015.148
Partal T, Cigizoglu HK (2008) Estimation and forecasting of daily suspended sediment data using wavelet – neural networks. J Hydrol 358:317–331. https://doi.org/10.1016/j.jhydrol.2008.06.013
Pham QB, Abba SI, Usman AG, Linh NTT, Gupta V, Malik A, Costache R, Vo ND, Tri DQ (2019) Potential of Hybrid Data-Intelligence Algorithms for Multi-Station Modelling of Rainfall. Water Resour Manage 33(15):5067–5087. https://doi.org/10.1007/s11269-019-02408-3
Rajaee T, Nourani V, Zounemat-Kermani M, Kisi O (2011) River Suspended Sediment Load Prediction: Application of ANN and Wavelet Conjunction Model. J Hydrol Eng 17(5):604–614. https://doi.org/10.1061/(ASCE)HE
Sahour H, Gholami V, Torkaman J, Vazifedan M, Saeedi S (2021) Random forest and extreme gradient boosting algorithms for streamflow modeling using vessel features and tree-rings. Environ Earth Sci 80(22):1–14. https://doi.org/10.1007/s12665-021-10054-5
Salih SQ, Sharafati A, Khosravi K, Faris H, Kisi O, Tao H, Ali M, Yaseen ZM (2020) River suspended sediment load prediction based on river discharge information: application of newly developed data mining models. Hydrol Sci J 65(4):624–637. https://doi.org/10.1080/02626667.2019.1703186
Shamaei E, Kaedi M (2016) Suspended sediment concentration estimation by stacking the genetic programming and neuro-fuzzy predictions. Appl Soft Comput J 45:187–196. https://doi.org/10.1016/j.asoc.2016.03.009
Sharafati A, Haji Seyed Asadollah SB, Motta D, Yaseen ZM (2020) Application of newly developed ensemble machine learning models for daily suspended sediment load prediction and related uncertainty analysis. Hydrol Sci J 65(12):1–21. https://doi.org/10.1080/02626667.2020.1786571
Sharghi E, Nourani V, Behfar N (2018) Earthfill dam seepage analysis using ensemble artificial intelligence based modeling. J Hydroinf 20(5):1071–1084. https://doi.org/10.2166/hydro.2018.151
Shiau JT, Chen TJ (2015) Quantile Regression-Based Probabilistic Estimation Scheme for Daily and Annual Suspended Sediment Loads. Water Resour Manage 29(8):2805–2818. https://doi.org/10.1007/s11269-015-0971-5
Sivakumar B (2006) Suspended sediment load estimation and the problem of inadequate data sampling: A fractal view. Earth Surf Proc Land 31(4):414–427. https://doi.org/10.1002/esp.1273
Suykens JA, Vandewalle J (1999) Least Squares Support Vector Machine Classifier. Neural Process Lett 3(9):293–300. https://doi.org/10.1023/A:1018628609742Downloadcitation
Tao H, Keshtegar B, Yaseen ZM (2019) The Feasibility of Integrative Radial Basis M5Tree Predictive Model for River Suspended Sediment Load Simulation. Water Resour Manage 33(13):4471–4490. https://doi.org/10.1007/s11269-019-02378-6
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res 106(D7):7183–7192. https://doi.org/10.1029/2000JD900719
Tibangayuka N, Mulungu DMM, Izdori F (2022) Evaluating the performance of HBV, HEC-HMS and ANN models in simulating streamflow for a data scarce high-humid tropical catchment in Tanzania. Hydrol Sci J 67(14):1–14. https://doi.org/10.1080/02626667.2022.2137417
Umar IK, Gökçekuş H, Nourani V (2022) An intelligent soft computing technique for prediction of vehicular traffic noise. Arab J Geosci 15(19). https://doi.org/10.1007/s12517-022-10858-0
Umar IK, Nourani V, Gökçekuş H (2021) A novel multi-model data-driven ensemble approach for the prediction of particulate matter concentration. Environ Sci Pollut Res 28(36):49663–49677. https://doi.org/10.1007/s11356-021-14133-9
Usman AG, Işik S, Abba SI (2021) Hybrid data-intelligence algorithms for the simulation of thymoquinone in HPLC method development. J Iran Chem Soc 18(7):1537–1549. https://doi.org/10.1007/s13738-020-02124-5
Yaseen ZM, Deo RC, Hilal A, Abd AM, Bueno LC, Salcedo-Sanz S, Nehdi ML (2018) Predicting compressive strength of lightweight foamed concrete using extreme learning machine model. Adv Eng Soft 115(April 2017):112–125. https://doi.org/10.1016/j.advengsoft.2017.09.004
Yin H, Zhang X, Wang F, Zhang Y, Xia R, Jin J (2021) Rainfall-runoff modeling using LSTM-based multi-state-vector sequence-to-sequence model. J Hydrol 598(October 2020):126378. https://doi.org/10.1016/j.jhydrol.2021.126378
Yu X, Wang Y, Wu L, Chen G, Wang L, Qin H (2020). Comparison of support vector regression and extreme gradient boosting for decomposition-based data-driven 10-day streamflow forecasting. J Hydrol 582(April 2019):124293. https://doi.org/10.1016/j.jhydrol.2019.124293
Zhang D, Skullestad E, Lindholm G, Ratnaweera H (2018) Hydraulic modeling and deep learning based flow forecasting for optimizing inter catchment wastewater transfer. J Hydrol 567:792–802. https://doi.org/10.1016/j.jhydrol.2017.11.029
Zhao J, Jiao L, Xia S, Basto Fernandes V, Yevseyeva I, Zhou Y, Emmerich M (2018) Multiobjective sparse ensemble learning by means of evolutionary algorithms. Decis Support Syst 111(November 2017):86–100. https://doi.org/10.1016/j.dss.2018.05.003
Zhou F, Zhang Q, Sornette D, Jiang L (2019) Cascading logistic regression onto gradient boosted decision trees for forecasting and trading stock indices. Appl Soft Comput 84:105747. https://doi.org/10.1016/j.asoc.2019.105747
Zounemat-kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: A review. J Hydrol 598(December 2020):126266. https://doi.org/10.1016/j.jhydrol.2021.126266
Funding
This research has not received any funding.
Author information
Authors and Affiliations
Contributions
Data processing, modeling and writing-up of the paper were conducted by Gebre Gelete,
Corresponding author
Ethics declarations
Ethical Approval
Not applicable.
Consent to Participate
Not applicable.
Consent to Publish
The author has agreed with the content and has given explicit consent to publish.
Competing Interests
The author declares no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gelete, G. Hybrid Extreme Gradient Boosting and Nonlinear Ensemble Models for Suspended Sediment Load Prediction in an Agricultural Catchment. Water Resour Manage 37, 5759–5787 (2023). https://doi.org/10.1007/s11269-023-03629-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-023-03629-3