Abstract
Accurate rainfall-runoff modelling is particularly challenging due to complex nonlinear relationships between various factors such as rainfall characteristics, soil properties, land use, and temporal lags. Recently, with improvements to computation systems and resources, data-driven models have shown good performances for runoff forecasting. However, the relative performance of common data-driven models using small temporal resolutions is still unclear. This study presents an application of data-driven models using artificial neural network, support vector regression and long-short term memory approaches and distributed forcing data for runoff predictions between 2010 and 2019 in the Russian River basin, California, USA. These models were used to predict hourly runoff with 1–6 h of lead time using precipitation, soil moisture, baseflow and land surface temperature datasets provided from the North American Land Data Assimilation System. The predicted results were evaluated in terms of seasonal and event-based performance using various statistical metrics. The results showed that the long-short term memory and support vector regression models outperforms artificial neural network model for hourly runoff forecasting, and the predictive performance of the models was greater during the wet seasons compared to the dry seasons. In addition, a comparison of the data-driven model results with the National Water Model, a fully distributed physical-based hydrologic model, showed that the long-short term memory and support vector regression models provide comparable performance. The results demonstrate that data-driven models for hourly runoff forecasting are sufficiently predictive and useful in areas where observation systems are not available.
Similar content being viewed by others
Availability of data and material
The NLDAS dataset used in this study can be downloaded from Land Data Assimilation System wetsite (https://ldas.gsfc.nasa.gov/nldas/nldas-get-data). Data used in this study is available at https://www.hydroshare.org/resource/7d13f677c28440a4800c531d93471000/
Data availability
Data used in this study is available at https://www.hydroshare.org/resource/7d13f677c28440a4800c531d93471000/
References
Abebe AJ, Price RK (2003) Managing uncertainty in hydrological models using complementary models. Hydrol Sci J 48(5):679–692. https://doi.org/10.1623/hysj.48.5.679.51450
Abebe AJ, Price RK (2004) Information theory and neural networks for managing uncertainty in flood routing. J Comput Civ Eng 18(4):373–380. https://doi.org/10.1061/(ASCE)0887-3801(2004)18:4(373)
Ayzel G (2019) Does deep learning advance hourly runoff predictions. In: Proceedings of the V international conference information technologies and high-performance computing (ITHPC-2019), Khabarovsk, Russia, pp. 16–19
Ba H, Guo S, Wang Y, Hong X, Zhong Y, Liu Z (2018) Improving ANN model performance in runoff forecasting by adding soil moisture input and using data preprocessing techniques. Hydrol Res 49(3):744–760. https://doi.org/10.2166/nh.2017.048
Badrzadeh H, Sarukkalige R, Jayawardena AW (2015) Hourly runoff forecasting for flood risk management: application of various computational intelligence models. J Hydrol 529:1633–1643. https://doi.org/10.1016/j.jhydrol.2015.07.057
Bui DT, Hoang ND, Martínez-Álvarez F, Ngo PTT, Hoa PV, Pham TD, Samui P, Costache R (2020) A novel deep learning neural network approach for predicting flash flood susceptibility: a case study at a high frequency tropical storm area. Sci Total Environ 701:134413. https://doi.org/10.1016/j.scitotenv.2019.134413
Breiman L (2001) Random forests. Machine Learn 45(1):5–32. https://doi.org/10.1023/A:101093340
Cai X, Yang ZL, Xia Y, Huang M, Wei H, Leung LR, Ek MB (2014) Assessment of simulated water balance from Noah, Noah-MP, CLM, and VIC over CONUS using the NLDAS test bed. J Geophys Res Atmos 119(24):13–751. https://doi.org/10.1002/2014JD022113
Castelletti A, Galelli S, Restelli M, Soncini-Sessa R (2010) Tree-based reinforcement learning for optimal water reservoir operation. Water Resour Res. https://doi.org/10.1029/2009WR008898
Chang FJ, Tsai MJ (2016) A nonlinear spatio-temporal lumping of radar rainfall for modeling multi-step-ahead inflow forecasts by data-driven techniques. J Hydrol 535:256–269. https://doi.org/10.1016/j.jhydrol.2016.01.056
Choi C, Kim J, Han H, Han D, Kim HS (2020) Development of water level prediction models using machine learning in wetlands: a case study of upo wetland in South Korea. Water 12(1):93. https://doi.org/10.3390/w12010093
Doycheva K, Horn G, Koch C, Schumann A, König M (2017) Assessment and weighting of meteorological ensemble forecast members based on supervised machine learning with application to runoff simulations and flood warning. Adv Eng Inf 33:427–439. https://doi.org/10.1016/j.aei.2016.11.001
Drucker H, Burges CJ, Kaufman L, Smola AJ, Vapnik V (1997) Support vector regression machines. Advances in neural information processing systems. MIT Press, Cambridge, MA, USA
Espinoza-Dávalos GE, Arctur DK, Teng W, Maidment DR, García-Martí I, Comair G (2016) Studying soil moisture at a national level through statistical analysis of NASA NLDAS data. J Hydroinf 18(2):277–287. https://doi.org/10.2166/hydro.2015.231
Fan H, Jiang M, Xu L, Zhu H, Cheng J, Jiang J (2020) Comparison of long short term memory networks and the hydrological model in runoff simulation. Water 12(1):175. https://doi.org/10.3390/w12010175
Feng Y, Cui N, Hao W, Gao L, Gong D (2019) Estimation of soil temperature from meteorological data using different machine learning models. Geoderma 338:67–77. https://doi.org/10.1016/j.geoderma.2018.11.044
Feng D, Fang K, Shen C (2020) Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales. Water Resour Res 56(9):e2019WR026793. https://doi.org/10.1029/2019WR026793
Gholami VCKW, Chau KW, Fadaee F, Torkaman J, Ghaffari A (2015) Modeling of groundwater level fluctuations using dendrochronology in alluvial aquifers. J Hydrol 529:1060–1069. https://doi.org/10.1016/j.jhydrol.2015.09.028
Han H, Kim J, Chandrasekar V, Choi J, Lim S (2019) Modeling streamflow enhanced by precipitation from atmospheric river using the NOAA national water model: a case study of the Russian river basin for February 2004. Atmos 10(8):466. https://doi.org/10.3390/atmos10080466
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hu C, Wu Q, Li H, Jian S, Li N, Lou Z (2018) Deep learning with a long short-term memory networks approach for rainfall-runoff simulation. Water 10(11):1543. https://doi.org/10.3390/w10111543
Islam S, Kothari R (2000) Artificial neural networks in remote sensing of hydrologic processes. J Hydrol Eng 5(2):138–144. https://doi.org/10.1061/(ASCE)1084-0699(2000)5:2(138)
Jajarmizadeh M, Harun S, Salarpour M (2014) An assessment of a proposed hybrid neural network for daily flow prediction in arid climate. Modell Simul Eng. https://doi.org/10.1155/2014/635018
Johnson LE, Hsu C, Zamora R, Cifelli R (2016) Assessment and applications of distributed hydrologic model-Russian-Napa River Basins, CA. NOAA Technical Memorandum PSD-316, NOAA Printing Office, Silver Spring, MD. https://doi.org/10.7289/V5M32SS9
Jothiprakash V, Magar RB (2012) Multi-time-step ahead daily and hourly intermittent reservoir inflow prediction by artificial intelligent techniques using lumped and distributed data. J Hydrol 450:293–307. https://doi.org/10.1016/j.jhydrol.2012.04.045
Kim J, Han H, Johnson LE, Lim S, Cifelli R (2019) Hybrid machine learning framework for hydrological assessment. J Hydrol 577:123913. https://doi.org/10.1016/j.jhydrol.2019.123913
Liang J, Li W, Bradford SA, Šimůnek J (2019) Physics-informed data-driven models to predict surface runoff water quantity and quality in agricultural fields. Water 11(2):200. https://doi.org/10.3390/w11020200
Lin GF, Chou YC, Wu MC (2013a) Typhoon flood forecasting using integrated two-stage support vector machine approach. J Hydrol 486:334–342. https://doi.org/10.1016/j.jhydrol.2013.02.012
Lin GF, Jhong BC, Chang CC (2013b) Development of an effective data-driven model for hourly typhoon rainfall forecasting. J Hydrol 495:52–63. https://doi.org/10.1016/j.jhydrol.2013.04.050
Maier HR, Dandy GC (2000) Neural networks for the prediction and forecasting of water resources variables: a review of modelling issues and applications. Environ Model Softw 15(1):101–124. https://doi.org/10.1016/S1364-8152(99)00007-9
McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biol 5(4):115–133. https://doi.org/10.1007/BF02478259
McClelland JL, Rumelhart DE, PDP Research Group (1986) Parallel distributed processing. Explorations in the Microstructure of Cognition, 2:216-271
Moriasi DN, Arnold JG, van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50(3):885–900. https://doi.org/10.13031/2013.23153
Mosavi A, Ozturk P, Chau KW (2018) Flood prediction using machine learning models: literature review. Water 10(11):1536. https://doi.org/10.3390/w10111536
Namin AH, Leboeuf K, Wu H, Ahmadi M (2009) Artificial neural networks activation function HDL coder. In: 2009 IEEE international conference on electro/information technology, pp. 389–392
Senthil Kumar AR, Sudheer KP, Jain SK, Agarwal PK (2005) Rainfall-runoff modelling using artificial neural networks: comparison of network types. Hydrol Process Int J 19(6):1277–1291. https://doi.org/10.1002/hyp.5581
Seo Y, Kim S, Singh VP (2018) Machine learning models coupled with variational mode decomposition: a new approach for modeling daily rainfall-runoff. Atmos 9(7):251. https://doi.org/10.3390/atmos9070251
Shen C (2018) A transdisciplinary review of deep learning research and its relevance for water resources scientists. Water Resour Res 54(11):8558–8593. https://doi.org/10.1029/2018WR022643
Solomatine DP, Dulal KN (2003) Model trees as an alternative to neural networks in rainfall—runoff modelling. Hydrol Sci J 48(3):399–411. https://doi.org/10.1623/hysj.48.3.399.45291
Solomatine DP, Ostfeld A (2008) Data-driven modelling: some past experiences and new approaches. J Hydroinf 10(1):3–22. https://doi.org/10.2166/hydro.2008.015
Souffront Alcantara MA, Kesler C, Stealey MJ, Nelson EJ, Ames DP, Jones NL (2018) Cyberinfrastructure and web apps for managing and disseminating the national water model. JAWRA J Am Water Resour Assoc 54(4):859–871. https://doi.org/10.1111/1752-1688.12608
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106(D7):7183–7192. https://doi.org/10.1029/2000JD900719
Tsai MJ, Abrahart RJ, Mount NJ, Chang FJ (2014) Including spatial distribution in a data-driven rainfall-runoff model to improve reservoir inflow forecasting in Taiwan. Hydrol Process 28(3):1055–1070. https://doi.org/10.1002/hyp.9559
Vapnik V (1995) The nature of statistical learning theory. Springer, New York, NY, USA
Wang W (2006) Stochasticity, nonlinearity and forecasting of streamflow processes. Ios Press, Amsterdam
Xia Y, Hobbins MT, Mu Q, Ek MB (2015) Evaluation of NLDAS-2 evapotranspiration against tower flux site observations. Hydrol Process 29(7):1757–1771. https://doi.org/10.1002/hyp.10299
Xiang Z, Yan J, Demir I (2020) A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water Resour Res. https://doi.org/10.1029/2019WR025326
Young CC, Liu WC, Wu MC (2017) A physically based and machine learning hybrid approach for accurate rainfall-runoff modeling during extreme typhoon events. Appl Soft Comput 53:205–216. https://doi.org/10.1016/j.asoc.2016.12.052
Zhuo L, Han D, Dai Q, Islam T, Srivastava PK (2015) Appraisal of NLDAS-2 multi-model simulated soil moistures for hydrological modelling. Water Resour Manag 29(10):3503–3517. https://doi.org/10.1007/s11269-015-1011-1
Zia H, Harris N, Merrett G, Rivers M (2015) Predicting discharge using a low complexity machine learning model. Comput Electron Agric 118:350–360. https://doi.org/10.1016/j.compag.2015.09.012
Acknowledgements
The NLDAS dataset used in this study can be downloaded from Land Data Assimilation System wetsite (https://ldas.gsfc.nasa.gov/nldas/nldas-get-data)
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Conceptualization: [HH], [RM]; Methodology: [HH]; Data collection: [HH], Formal analysis and investigation: [HH], [RM]; Writing—original draft preparation: [HH], [RM]; Writing—review and editing: [HH], [RM]; Supervision: [RM]; All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Rights and permissions
About this article
Cite this article
Han, H., Morrison, R.R. Data-driven approaches for runoff prediction using distributed data. Stoch Environ Res Risk Assess 36, 2153–2171 (2022). https://doi.org/10.1007/s00477-021-01993-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-021-01993-3