Abstract
Reliable runoff forecasting plays an important role in water resource management. In this study, we propose a homogeneous selective ensemble forecasting framework based on modified differential evolution algorithm (MDE) to elucidate the complex nonlinear characteristics of hydrological time series. First, the same type of component learners was selected to form the average ensemble model, which was then trained using the training set to obtain preliminary prediction results. Subsequently, the MDE method was applied to improve the performance of the differential evolution algorithm with respect to low solution accuracy and premature convergence. MDE assigns weights according to the performance of each component learner in the ensemble model to obtain the selective ensemble model structure on the validation set. Finally, the selective ensemble framework was verified on the test set. Experiments were conducted on the runoff data of four important hydrological stations in the Yangtze River Basin. The results showed that the forecast framework can obtain better prediction accuracy and generalization performance than the average ensemble models composed of four classical learners, and can improve prediction accuracy for hydrological forecasting.
Similar content being viewed by others
Data Availability
The authors do not have permission to share data.
References
Aceña V, De Diego IM, Fernández RR, Moguerza JM (2022) Minimally overfitted learners: A general framework for ensemble learning. Knowl-Based Syst 254:109669. https://doi.org/10.1016/j.knosys.2022.109669
Al-Daweri MS, Abdullah S, Ariffin KAZ (2021) A homogeneous ensemble based dynamic artificial neural network for solving the intrusion detection problem. Int J Crit Infrastruct Prot 34:100449. https://doi.org/10.1016/j.ijcip.2021.100449
Alobaidi MH, Ouarda TBMJ, Marpu PR, Chebana F (2021) Diversity-driven ANN-based ensemble framework for seasonal low-flow analysis at ungauged sites. Adv Water Resour 147:103814. https://doi.org/10.1016/j.advwatres.2020.103814
Bashir D, Montañez GD, Sehra S, Segura PS, Lauw J, Gallagher M, Moustafa N, Lakshika E (2020) An information-theoretic perspective on overfitting and underfittinglecture notes in computer science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Science and Business Media Deutschland GmbH, p 347–358. https://doi.org/10.1007/978-3-030-64984-5_27
Breiman L (2001) Random Forests. Mach Learn 45(1):5–32. https://doi.org/10.1023/A:1010933404324
Chauhan P, Akıner ME, Sain K, Kumar A (2022) Forecasting of suspended sediment concentration in the Pindari-Kafni glacier valley in Central Himalayan region considering the impact of precipitation: using soft computing approach. Arab J Geosci 15(8):683. https://doi.org/10.1007/s12517-022-09773-1
Das S, Suganthan PN (2011) Differential evolution: A survey of the state-of-the-art. IEEE Trans Evol Comput 15(1):4–31. https://doi.org/10.1109/TEVC.2010.2059031
Dion P, Martel J, Arsenault R (2021) Hydrological ensemble forecasting using a multi-model framework. J Hydrol 600:126537. https://doi.org/10.1016/j.jhydrol.2021.126537
Feng ZK, Niu WJ, Wan XY, Xu B, Zhu FL, Chen J (2022) Hydrological time series forecasting via signal decomposition and twin support vector machine using cooperation search algorithm for parameter identification. J Hydrol 612. https://doi.org/10.1016/j.jhydrol.2022.128213
Gu Q, Chang Y, Xiong N, Chen L (2021) Forecasting Nickel futures price based on the empirical wavelet transform and gradient boosting decision trees. Appl Soft Comput 109:107472. https://doi.org/10.1016/j.asoc.2021.107472
Guo T, Song S, Yan Y (2022) A time-varying autoregressive model for groundwater depth prediction. J Hydrol 613:128394. https://doi.org/10.1016/j.jhydrol.2022.128394
Ibrahim KSMH, Huang YF, Ahmed AN, Koo CH, El-Shafie A (2022) A review of the hybrid artificial intelligence and optimization modelling of hydrological streamflow forecasting. Alex Eng J 61(1):279–303. https://doi.org/10.1016/j.aej.2021.04.100
Khalid OW, Isa NAM, Mat Sakim HA (2023) Emperor penguin optimizer: A comprehensive review based on state-of-the-art meta-heuristic algorithms. Alex Eng J 63:487–526. https://doi.org/10.1016/j.aej.2022.08.013
Li B, Zhang J, Yu Z, Liang Z, Chen L, Acharya K (2017) Climate change driven water budget dynamics of a Tibetan inland lake. Glob Planet Change 150:70–80. https://doi.org/10.1016/j.gloplacha.2017.02.003
Li Y, Han Y, Liu B, Li H, Du X, Wang Q, Wang X, Zhu X (2023) Construction and application of a refined model for the optimal allocation of water resources — Taking Guantao County, China as an example. Ecol Indic 146:109929. https://doi.org/10.1016/j.ecolind.2023.109929
Li YJ, Liang ZM, Hu YM, Li BQ, Xu B, Wang D (2020) A multi-model integration method for monthly streamflow prediction: modified stacking ensemble strategy. J Hydroinformatics 22(2):310–326. https://doi.org/10.2166/hydro.2019.066
Liu DD, Guo SL, Shao QX, Liu P, Xiong LH, Wang L, Hong XJ, Xu Y, Wang ZL (2018) Assessing the effects of adaptation measures on optimal water resources allocation under varied water availability conditions. J Hydrol 556:759–774. https://doi.org/10.1016/j.jhydrol.2017.12.002
Liu YQ, Ye L, Qin H, Hong XF, Ye JJ, Yin XL (2018) Monthly streamflow forecasting based on hidden Markov model and Gaussian Mixture Regression. J Hydrol 561:146–159. https://doi.org/10.1016/j.jhydrol.2018.03.057
Mengshu S, Yuansheng H, Xiaofeng X, Dunnan L (2021) China’s coal consumption forecasting using adaptive differential evolution algorithm and support vector machine. Resour Policy 74:102287. https://doi.org/10.1016/j.resourpol.2021.102287
Mohammadi K, Eslami HR, Kahawita R (2006) Parameter estimation of an ARMA model for river flow forecasting using goal programming. J HYDROL 331(1–2):293–299. https://doi.org/10.1016/j.jhydrol.2006.05.017
Murtagh F (1991) Multilayer perceptrons for classification and regression. Neurocomputing 2(5–6):183–197. https://doi.org/10.1016/0925-2312(91)90023-5
Oki T, Kanae S (2006) Global hydrological cycles and world water resources. Science 313(5790):1068–1072. https://doi.org/10.1126/science.1128845
Pérez-Alarcón A, Garcia-Cortes D, Fernández-Alvarez JC, Martínez-González Y (2022) Improving monthly rainfall forecast in a watershed by combining neural networks and autoregressive models. Environ Process 9(3):53. https://doi.org/10.1007/s40710-022-00602-x
Schueller A, Saldaña C (2022) Generalizability analysis of tool condition monitoring ensemble machine learning models. J MANUF PROCESS 84:1064–1075. https://doi.org/10.1016/j.jmapro.2022.10.064
Singh D, Vardhan M, Sahu R, Chatterjee D, Chauhan P, Liu S (2023) Machine-learning- and deep-learning-based streamflow prediction in a hilly catchment for future scenarios using CMIP6 GCM data. Hydrol Earth Syst Sci 27(5):1047–1075. https://doi.org/10.5194/hess-27-1047-2023
Smola AJ, Schölkopf B (2004) A tutorial on support vector regression. Stat Comput 14(3):199–222. https://doi.org/10.1023/B:STCO.0000035301.49549.88
Song Y, Zhao G, Zhang B, Chen H, Deng W, Deng W (2023) An enhanced distributed differential evolution algorithm for portfolio optimization problems. Eng Appl Artif Intell 121:106004. https://doi.org/10.1016/j.engappai.2023.106004
Taormina R, Chau KW (2015) Data-driven input variable selection for rainfall-runoff modeling using binary-coded particle swarm optimization and Extreme Learning Machines. J Hydrol 529:1617–1632. https://doi.org/10.1016/j.jhydrol.2015.08.022
Wang J, Zhou H, Hong T, Li X, Wang S (2020) A multi-granularity heterogeneous combination approach to crude oil price forecasting. Energy Econ 91:104790. https://doi.org/10.1016/j.eneco.2020.104790
Wang S, Peng H, Hu Q, Jiang M (2022) Analysis of runoff generation driving factors based on hydrological model and interpretable machine learning method. J Hydrol Reg Stud 42:101139. https://doi.org/10.1016/j.ejrh.2022.101139
Wang W, Du Y, Chau K, Xu D, Liu C, Ma Q (2021) An ensemble hybrid forecasting model for annual runoff based on sample entropy, secondary decomposition, and long short-term memory neural network. Water Resour Manag 35(14):4695–4726. https://doi.org/10.1007/s11269-021-02920-5
Weiß CH, Aleksandrov B, Faymonville M, Jentsch C (2023) Partial autocorrelation diagnostics for count time series. Entropy-Switz 25(1):105
Yang M, Wang H, Jiang Y, Lu X, Xu Z, Sun G (2020) GECA proposed ensemble–KNN method for improved monthly runoff forecasting. Water Resour Manag 34(2):849–863. https://doi.org/10.1007/s11269-019-02479-2
Yang X, Niu X, Li L, Wang L, Liu C, Liu J, Yuan Q, Pei X (2023) Understanding the molecular mechanism of drought resistance in Shanlan upland rice by transcriptome and phenotype analyses. Int J Biol Macromol 231:123387. https://doi.org/10.1016/j.ijbiomac.2023.123387
Yu X, Wang Y, Wu L, Chen G, Wang L, Qin H (2020) Comparison of support vector regression and extreme gradient boosting for decomposition-based data-driven 10-day streamflow forecasting. J Hydrol 582:124293. https://doi.org/10.1016/j.jhydrol.2019.124293
Yu X, Xu Y, Gu H, Guo Y (2023) Multi-objective robust optimization of reservoir operation for real-time flood control under forecasting uncertainty. J Hydrol 620:129421. https://doi.org/10.1016/j.jhydrol.2023.129421
Zhong X, Cheng P, You Z (2023) An improved differential evolution algorithm based on basis vector type and its application in fringe projection 3D imaging. Knowl-Based Syst 268:110470. https://doi.org/10.1016/j.knosys.2023.110470
Zhu X, Guo H, Huang JJ, Tian S, Zhang Z (2023) A hybrid decomposition and Machine learning model for forecasting Chlorophyll-a and total nitrogen concentration in coastal waters. J Hydrol 619:129207. https://doi.org/10.1016/j.jhydrol.2023.129207
Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: A review. J Hydrol 598:126266. https://doi.org/10.1016/j.jhydrol.2021.126266
Acknowledgements
This paper is supported by the National Key Research and Development Program of China (2021YFC3200303), and special thanks are given to the anonymous reviewers and editors for their constructive comments.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, S., Qin, H., Liu, G. et al. Runoff Forecasting of Machine Learning Model Based on Selective Ensemble. Water Resour Manage 37, 4459–4473 (2023). https://doi.org/10.1007/s11269-023-03566-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-023-03566-1