Abstract
The optimal planning and management of modern water resources depends highly on reliable and accurate runoff forecasting. Data preprocessing technology can provide new possibilities for improving the accuracy of runoff forecasting when basic physical relationships cannot be captured using a single prediction model. Yet, few studies have evaluated the performances of various data preprocessing technologies in predicting monthly runoff time series so far. In order to fill this research gap, this paper investigates the potential of five data preprocessing techniques based on the gated recurrent unit network (GRU) model for monthly runoff prediction, namely variational mode decomposition (VMD), wavelet packet decomposition (WPD), complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), extreme-point symmetric mode decomposition (ESMD), and singular spectrum analysis (SSA). In this study, the original monthly runoff data is first decomposed into a set of subcomponents using five data preprocessing methods; second, each component is predicted by developing an appropriate GRU model; and finally, the forecasting results of different two-stage hybrid models are obtained by aggregating the forecast results of the corresponding subcomponents. Four performance metrics are employed as the evaluation benchmarks. The experimental results from two Hydropower Stations in China show that five data preprocessing techniques can attain satisfying prediction results, while VMD and WPD methods can yield better performance than CEEMDAN, ESMD, and SSA in both training and testing periods in terms of four indexes. Indeed, it is significantly important to carefully determine an appropriate data preprocessing method according to the actual characteristics of the study area.
Similar content being viewed by others
Data Availability
All authors made sure that all data and materials support our published claims and comply with field standards.
References
Ali M, Prasad R, Xiang Y, Yaseen ZM (2020) Complete ensemble empirical mode decomposition hybridized with random forest and kernel ridge regression model for monthly rainfall forecasts. J Hydrol 584. https://doi.org/10.1016/j.jhydrol.2020.124647
Bhandari S, Thakur B, Kalra A, Miller WP, Lakshmi V, Pathak P (2019) Streamflow forecasting using singular value decomposition and support vector machine for the upper Rio Grande River basin. J Am Water Resour Assoc 55:680–699. https://doi.org/10.1111/1752-1688.12733
Chen S, Dong SN, Cao ZG, Guo JT (2020a) A compound approach for monthly runoff forecasting based on multiscale analysis and deep network with sequential structure. Water 12. https://doi.org/10.3390/w12082274
Chen X et al (2020b) The importance of short lag-time in the runoff forecasting model based on long short-term memory. J Hydrol 589. https://doi.org/10.1016/j.jhydrol.2020.125359
Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv:1406.1078. https://doi.org/10.48550/arXiv.1406.1078
Dragomiretskiy K, Zosso D (2014) Variational mode decomposition. IEEE Trans Signal Process 62:531–544. https://doi.org/10.1109/TSP.2013.2288675
Duan J et al (2021) Short-term wind power forecasting using the hybrid model of improved variational mode decomposition and Correntropy long short -term memory neural network. Energy 214:118980. https://doi.org/10.1016/j.energy.2020.118980
Elsner JB (2002) Analysis of time series structure: SSA and related techniques. J Am Stat Assoc 97:1207–1208. https://doi.org/10.1198/jasa.2002.s239
Feng Z-k, Niu W-j, Tang Z-y, Jiang Z-q, Xu Y, Liu Y, Zhang H-r (2020) Monthly runoff time series prediction by variational mode decomposition and support vector machine based on quantum-behaved particle swarm optimization. J Hydrol 583. https://doi.org/10.1016/j.jhydrol.2020.124627
Fragkiadaki K, Levine S, Felsen P, Malik J (2015) Recurrent Network Models for Human Dynamics. In: 2015 IEEE International Conference on Computer Vision (ICCV), 7–13 Dec. 2015. pp 4346–4354. https://doi.org/10.1109/ICCV.2015.494
Greff K, Srivastava RK, Koutnik J, Steunebrink BR, Schmidhuber J (2017) LSTM: a search space odyssey. IEEE Trans Neural Netw Learn Syst 28:2222–2232. https://doi.org/10.1109/tnnls.2016.2582924
Hanifi S, Zare-Behtash H, Cammarano A, Lotfian S (2023) Offshore wind power forecasting based on WPD and optimised deep learning methods. Renew Energy 218:119241. https://doi.org/10.1016/j.renene.2023.119241
He F, Zhou J, Feng Z-k, Liu G, Yang Y (2019) A hybrid short-term load forecasting model based on variational mode decomposition and long short-term memory networks considering relevant factors with Bayesian optimization algorithm. Appl Energy 237:103–116. https://doi.org/10.1016/j.apenergy.2019.01.055
Hu F, Yang Q, Yang J, Luo Z, Shao J, Wang G (2024) Incorporating multiple grid-based data in CNN-LSTM hybrid model for daily runoff prediction in the source region of the Yellow River Basin. J Hydrol Reg Stud 51:101652. https://doi.org/10.1016/j.ejrh.2023.101652
Huang NE et al (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc A: Math Phys Eng Sci 454:903–995. https://doi.org/10.1098/rspa.1998.0193
Islam MK, Rahman MM, Ali MS, Mahim SM, Miah MS (2024) Enhancing lung abnormalities diagnosis using hybrid DCNN-ViT-GRU model with explainable AI: a deep learning approach. Image Vis Comput 142:104918. https://doi.org/10.1016/j.imavis.2024.104918
Kisi O, Cimen M (2011) A wavelet-support vector machine conjunction model for monthly streamflow forecasting. J Hydrol 399:132–140. https://doi.org/10.1016/j.jhydrol.2010.12.041
Kratzert F, Klotz D, Brenner C, Schulz K, Herrnegger M (2018) Rainfall-runoff modelling using long short-term memory (LSTM) networks. Hydrol Earth Syst Sci 22:6005–6022. https://doi.org/10.5194/hess-22-6005-2018
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Liu H, Mi X, Li Y, Duan Z, Xu Y (2019) Smart wind speed deep learning based multi-step forecasting model using singular spectrum analysis, convolutional gated recurrent unit network and support vector regression. Renew Energy 143:842–854. https://doi.org/10.1016/j.renene.2019.05.039
Liu H, Mi XW, Li YF (2018) Smart deep learning based wind speed prediction model using wavelet packet decomposition, convolutional neural network and convolutional long short term memory network. Energy Convers Manag 166:120–131. https://doi.org/10.1016/j.enconman.2018.04.021
Lv C, Wang X, Ling M, Xu W, Yan D (2023) Effects of precipitation concentration and human activities on City runoff changes. Water Resour Manag 37:5023–5036. https://doi.org/10.1007/s11269-023-03593-y
Meshram SG, Singh VP, Kisi O, Karimi V, Meshram C (2020) Application of artificial neural networks, support vector machine and multiple model-ANN to sediment yield prediction. Water Resour Manag 34:4561–4575. https://doi.org/10.1007/s11269-020-02672-8
Moreno SR, Mariani VC, Coelho LS (2021) Hybrid multi-stage decomposition with parametric model applied to wind speed forecasting in Brazilian northeast. Renew Energy 164:1508–1526. https://doi.org/10.1016/j.renene.2020.10.126
Niu W-j, Feng Z-k, Chen Y-b, Zhang H-r, Cheng C-t (2020) Annual streamflow time series prediction using extreme learning machine based on gravitational search algorithm and Variational mode decomposition. J Hydrol Eng 25. https://doi.org/10.1061/(asce)he.1943-5584.0001902
Ömer Faruk D (2010) A hybrid neural network and ARIMA model for water quality time series prediction. Eng Appl Artif Intell 23:586–594. https://doi.org/10.1016/j.engappai.2009.09.015
Pan MY, Zhou HN, Cao JY, Liu YS, Hao JL, Li SX, Chen CH (2020) Water level prediction model based on GRU and CNN. Ieee Access 8:60090–60100. https://doi.org/10.1109/access.2020.2982433
Pérez-Alarcón A, Garcia-Cortes D, Fernández-Alvarez JC, Martínez-González Y (2022) Improving monthly rainfall forecast in a watershed by combining neural networks and autoregressive models. Environ Process 9:53. https://doi.org/10.1007/s40710-022-00602-x
Ravanelli M, Brakel P, Omologo M, Bengio Y (2018) Light gated recurrent units for speech recognition. IEEE Trans Emerg Top Comput Intell 2:92–102. https://doi.org/10.1109/TETCI.2017.2762739
Reddy BSN, Pramada SK, Roshni T (2021) Monthly surface runoff prediction using artificial intelligence: a study from a tropical climate river basin. J Earth Syst Sci 130. https://doi.org/10.1007/s12040-020-01508-8
Ren Y, Suganthan PN, Srikanth N (2015) A comparative study of empirical mode decomposition-based short-term wind speed forecasting methods. IEEE Trans Sustain Energy 6:236–244. https://doi.org/10.1109/TSTE.2014.2365580
Santamarta JC, Hernández-Alemán A, Cruz-Pérez N, Gamallo-Paz I, Rodríguez-Martín J, de Martín-Pinillos CI, García-Gil A (2024) Optimizing water sustainability: revealing cost recovery strategies in the Canary Islands (Spain). Stoch Front Model Environ Process 11:13. https://doi.org/10.1007/s40710-024-00692-9
Tang LH, Bai YL, Yang J, Lu YN (2020) A hybrid prediction method based on empirical mode decomposition and multiple model fusion for chaotic time series. Chaos, Solitons Fractals 141. https://doi.org/10.1016/j.chaos.2020.110366
Vatanchi SM, Etemadfard H, Maghrebi MF, Shad R (2023) A comparative study on forecasting of long-term daily streamflow using ANN, ANFIS, BiLSTM and CNN-GRU-LSTM. Water Resour Manag 37:4769–4785. https://doi.org/10.1007/s11269-023-03579-w
Wang H, Wang W, Du Y, Xu D (2021) Examining the applicability of wavelet packet decomposition on different forecasting models in annual rainfall prediction. Water 13:1997. https://doi.org/10.3390/w13151997
Wang W-C, Chau K-W, Cheng C-T, Qiu L (2009) A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol 374:294–306. https://doi.org/10.1016/j.jhydrol.2009.06.019
Wang W-c, Chau K-w, Qiu L, Chen Y-b (2015a) Improving forecasting accuracy of medium and long-term runoff using artificial neural network based on EEMD decomposition. Environ Res 139:46–54. https://doi.org/10.1016/j.envres.2015.02.002
Wang W-c, Chau K-w, Xu D-m, Chen X-Y (2015b) Improving forecasting accuracy of annual runoff time series using ARIMA based on EEMD decomposition. Water Resour Manag 29:2655–2675. https://doi.org/10.1007/s11269-015-0962-6
Wang W-c, Xu D-m, Chau K-w, Chen S (2013) Improved annual rainfall-runoff forecasting using PSO-SVM model based on EEMD. J Hydroinf 15:1377–1390. https://doi.org/10.2166/hydro.2013.134
Wang Y-y, Wang W-c, Xu D-m, Zhao Y-w, Zang H-f (2024) A compound approach for ten-day runoff prediction by coupling wavelet denoising, attention mechanism, and LSTM based on GPU parallel acceleration technology. Earth Sci Inf. https://doi.org/10.1007/s12145-023-01212-3
Xiang Z, Yan J, Demir I (2020) A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water Resour Res 56. https://doi.org/10.1029/2019wr025326
Xu D-m, Liao A-d, Wang W, Tian W-c, Zang H-f (2024) Improved monthly runoff time series prediction using the CABES-LSTM mixture model based on CEEMDAN-VMD decomposition. J Hydroinf 26:255–283. https://doi.org/10.2166/hydro.2023.216
Yang S-Y, Jhong Y-D, Jhong B-C, Lin Y-Y (2024a) Enhancing flooding depth forecasting accuracy in an urban area using a novel trend forecasting method. Water Resour Manag. https://doi.org/10.1007/s11269-023-03725-4
Yang X, Chen Z, Qin M (2024b) Monthly runoff prediction via mode decomposition-recombination technique. Water Resour Manag 38:269–286. https://doi.org/10.1007/s11269-023-03668-w
Yao Z, Wang Z, Wang D, Wu J, Chen L (2023) An ensemble CNN-LSTM and GRU adaptive weighting model based improved sparrow search algorithm for predicting runoff using historical meteorological and runoff data as input. J Hydrol 625:129977. https://doi.org/10.1016/j.jhydrol.2023.129977
Yaseen ZM, Jaafar O, Deo RC, Kisi O, Adamowski J, Quilty J, El-Shafie A (2016) Stream-flow forecasting using extreme learning machines: a case study in a semi-arid region in Iraq. J Hydrol 542:603–614. https://doi.org/10.1016/j.jhydrol.2016.09.035
Yaseen ZM et al (2019) Implementation of univariate paradigm for streamflow simulation using hybrid data-driven model: case study in tropical region. Ieee Access 7:74471–74481. https://doi.org/10.1109/access.2019.2920916
Yu JX, Zhang X, Xu LL, Dong J, Zhangzhong LL (2021) A hybrid CNN-GRU model for predicting soil moisture in maize root zone. Agric Water Manag 245. https://doi.org/10.1016/j.agwat.2020.106649
Yue ZX, Ai P, Xiong CS, Hong M, Song YH (2020) Mid- to long-term runoff prediction by combining the deep belief network and partial least-squares regression. J Hydroinf 22:1283–1305. https://doi.org/10.2166/hydro.2020.022
Zhang Q, Wang B-D, He B, Peng Y, Ren M-L (2011) Singular Spectrum analysis and ARIMA hybrid model for annual runoff forecasting. Water Resour Manag 25:2683–2703. https://doi.org/10.1007/s11269-011-9833-y
Zhang X, Liu F, Yin Q, Qi Y, Sun S (2023) A runoff prediction method based on hyperparameter optimisation of a kernel extreme learning machine with multi-step decomposition. Sci Rep 13:19341. https://doi.org/10.1038/s41598-023-46682-z
Zhang X, Wang J, Zhang K (2017) Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by cuckoo search algorithm. Electr Power Syst Res 146:270–285. https://doi.org/10.1016/j.epsr.2017.01.035
Zuo GG, Luo JG, Wang N, Lian YN, He XX (2020) Two-stage variational mode decomposition and support vector regression for streamflow forecasting. Hydrol Earth Syst Sci 24:5491–5518. https://doi.org/10.5194/hess-24-5491-2020
Zuo H, Yan G, Lu R, Li R, Xiao S, Pang Y (2024) A multi-task learning based runoff forecasting model for multi-scale chaotic hydrological time series. Water Resour Manag 38:481–503. https://doi.org/10.1007/s11269-023-03681-z
Funding
Project of key science and technology of the Henan province (No: 202102310259; No: 202102310588), Henan province university scientific and technological innovation team (No: 18IRTSTHN009).
Author information
Authors and Affiliations
Contributions
Wen-chuan Wang: Conceptualization, Methodology, Writing-original draft. Yu-jin Du: Methodology, data curation, Writing - original draft preparation. Kwok-wing Chau: Writing and editing-original draft. Chun-Tian Cheng: Formal analysis and data collection. Dong-mei Xu: Formal analysis. Wen-Tao Zhuang: Investigation.
Corresponding author
Ethics declarations
Ethics Approval
All authors kept the ‘Ethical Responsibilities of Authors’.
Consent to Participate
All authors gave explicit consent to participate in this work.
Consent to Publish
All authors gave explicit consent to publish this manuscript.
Competing of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, Wc., Du, Yj., Chau, Kw. et al. Evaluating the Performance of Several Data Preprocessing Methods Based on GRU in Forecasting Monthly Runoff Time Series. Water Resour Manage (2024). https://doi.org/10.1007/s11269-024-03806-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11269-024-03806-y