Skip to main content
Log in

Comparison of stochastic and machine learning methods for multi-step ahead forecasting of hydrological processes

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Research within the field of hydrology often focuses on the statistical problem of comparing stochastic to machine learning (ML) forecasting methods. The performed comparisons are based on case studies, while a study providing large-scale results on the subject is missing. Herein, we compare 11 stochastic and 9 ML methods regarding their multi-step ahead forecasting properties by conducting 12 extensive computational experiments based on simulations. Each of these experiments uses 2000 time series generated by linear stationary stochastic processes. We conduct each simulation experiment twice; the first time using time series of 100 values and the second time using time series of 300 values. Additionally, we conduct a real-world experiment using 405 mean annual river discharge time series of 100 values. We quantify the forecasting performance of the methods using 18 metrics. The results indicate that stochastic and ML methods may produce equally useful forecasts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

Download references

Acknowledgements

We thank the Associate Editor and two reviewers for their useful suggestions. Part of the Discussion section, in particular the comments on the no free lunch theorem and the use of exogenous variables, has been inspired by the “Energy Forecasting” blog (http://blog.drhongtao.com/).

Author information

Authors and Affiliations

Authors

Contributions

HT conceived the idea of comparing stochastic and machine learning methods in hydrological univariate time series forecasting using large datasets. GP designed the experiments, performed the computations and wrote the manuscript under the supervision of HT and DK during her MSc thesis. All authors have discussed the results and edited the manuscript.

Corresponding author

Correspondence to Georgia Papacharalampous.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Statistical software and supplementary material

Appendix: Statistical software and supplementary material

The analyses and visualizations have been performed in R Programming Language (R Core Team 2018). We have used the following contributed R packages: cgwtools (Witthoft 2015), devtools (Wickham and Chang 2018), EnvStats (Millard 2013, 2018), forecast (Hyndman and Khandakar 2008; Hyndman et al. 2018), fracdiff (Fraley et al. 2012), gdata (Warnes et al. 2017), ggplot2 (Wickham 2016a; Wickham et al. 2018), HKprocess (Tyralis 2016), kernlab (Karatzoglou et al. 2004, 2018), knitr (Xie 2014, 2015, 2018), nnet (Venables and Ripley 2002; Ripley 2016), plyr (Wickham 2011, 2016b), randomForest (Liaw and Wiener 2002; Liaw 2018), readr (Wickham et al. 2017), rmarkdown (Allaire et al. 2018), rminer (Cortez 2010, 2016) and tidyr (Wickham and Henry 2018).

The supplementary material is available in Papacharalampous and Tyralis (2018). We provide the fully reproducible reports together with their codes. We also provide the reports entitled “Definitions of the stochastic processes’’, “Definitions of the forecast quality metrics’’ and “Selected figures for the qualitative comparison of the forecasting methods’’, which we suggest to be read alongside with Sects. 2.1, 2.4 and 3.1 respectively.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Papacharalampous, G., Tyralis, H. & Koutsoyiannis, D. Comparison of stochastic and machine learning methods for multi-step ahead forecasting of hydrological processes. Stoch Environ Res Risk Assess 33, 481–514 (2019). https://doi.org/10.1007/s00477-018-1638-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-018-1638-6

Keywords

Navigation