Skip to main content
Log in

Do non-linearity and non-Gaussianity truly matter in streamflow forecasting? A comparative study between PAR(p) and vine copula for Brazilian streamflow time series

  • Research
  • Published:
Environmental Monitoring and Assessment Aims and scope Submit manuscript

Abstract

This study evaluates the joint impact of non-linearity and non-Gaussianity on predictive performance in 23 Brazilian monthly streamflow time series from 1931 to 2022. We consider point and interval forecasting, employing a PAR(p) model and comparing it with the periodic vine copula model. Results indicate that the Gaussian hypothesis assumed by PAR(p) is unsuitable; gamma and log-normal distributions prove more appropriate and crucial for constructing accurate confidence intervals. This is primarily due to the assumption of the Gaussian distribution, which can lead to the generation of confidence intervals with negative values. Analyzing the estimated copula models, we observed a prevalence of the bivariate Normal copula, indicating that linear dynamic dependence is frequent, and the Rotated Gumbel 180°, which exhibits lower tail dependence. Overall, the temporal dynamics are predominantly shaped by combining these two types of effects. In point forecasting, both models show similar behavior in the estimation set, with slight advantages for the copula model. The copula model performs better during the out-of-sample analysis, particularly for certain power plants. In interval forecasting, the copula model exhibits pronounced superiority, offering a better estimation of quantiles. Consistently demonstrating proficiency in constructing reliable and accurate intervals, the copula model reveals a notable advantage over the PAR(p) model in interval forecasting.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request. The time series were provided by the National System Operator (ONS) and are available at https://sintegre.ons.org.br.

Notes

  1. A vine copula is a d-dimensional copula built from the combination of bivariate copulas. Because any bivariate copulas can be employed, vine copulas are renowned for their ability to model any dependency structure that can arise from a d-dimensional vector. For more details, see Aas et al. (2009).

References

  • Aas, K., Czado, C., Frigessi, A., & Bakken, H. (2009). Pair-copula constructions of multiple dependences. Insurance: Mathematics and Economics, 44(2), 182–198.

    Google Scholar 

  • Ávila, L., Mine, M. R. M., & Kaviski, E. (2020). Probabilistic long-term reservoir operation employing copulas and implicit stochastic optimization. Stochastic environmental research and risk assessment, 34, 931–947. https://doi.org/10.1007/s00477-020-01826-9

    Article  Google Scholar 

  • Chen, L., Singh, V. P., Guo, S., Zhou, J., & Zhang, J. (2015). Copula-based method for multisite monthly and daily streamflow simulation. Journal of Hydrology, 528, 369–384.

    Article  Google Scholar 

  • Cheng, M., Fang, F., Kinouchi, T., Navon, I. M., & Pain, C. C. (2020). Long lead-time daily and monthly streamflow forecasting using machine learning methods. Journal of Hydrology, 590, 125376. https://doi.org/10.1016/j.jhydrol.2020.125376

    Article  Google Scholar 

  • De Paiva, L. F. G., Montenegro, S. M., & Cataldi, M. (2020). Prediction of monthly flows for Três Marias reservoir (São Francisco river basin) using the CFS climate forecast model. RBRH, 25, e16. https://doi.org/10.1590/2318-0331.252020190067

    Article  Google Scholar 

  • Detzel, D. H. M., Bessa, M. R., Ávila, L., Cantão, M. P., & Geus, K. D. (2023). Generation of synthetic flow scenarios by means of multivariate sampling of contemporaneous ARMA model outputs. RBRH, 28, e46. https://doi.org/10.1590/2318-0331.282320230117

    Article  Google Scholar 

  • Dolatabadi, N., & Zahraie, B. (2023). A stochastic deep-learning-based approach for improved streamflow simulation. Stochastic Environmental Research and Risk Assessment, 38(1), 107–26. https://doi.org/10.1007/s00477-023-02567-1

    Article  Google Scholar 

  • Energy Research Company (EPE) (2023) 2023 Statistical yearbook of electricity—2022 baseline year. https://dashboard.epe.gov.br/apps/anuario-livro-eng/. Accessed 03 January 2024

  • Hao Z, Singh VP (2012). Entropy-copula method for single-site monthly streamflow simulation. Water Resources Research 48 6

  • Hipel, K. W., & McLeod, A. I. (1994). Time series modelling of water resources and environmental systems. Elsevier.

    Google Scholar 

  • Joe, H. (2014). Dependence modeling with copulas. CRC Press.

    Book  Google Scholar 

  • Kumar, V., Kedam, N., Sharma, K. V., Mehta, D. J., & Caloiero, T. (2023). Advanced machine learning techniques to improve hydrological prediction: A comparative analysis of streamflow prediction models. Water, 15, 2572. https://doi.org/10.3390/w15142572

    Article  Google Scholar 

  • Kupiec P (1995) Techniques for verifying the accuracy of risk measurement models. Journal of Derivatives 73–84

  • Lappicy, T., & Lima, C. H. (2023). Enhancing monthly streamflow forecasting for Brazilian hydropower plants through climate index integration with stochastic methods. RBRH, 28, e48. https://doi.org/10.1590/2318-0331.282320230118

    Article  Google Scholar 

  • Copula-based stochastic simulation of hydrological data applied to Nile river flows. (2011). Hydrology Research, 42(4), 318–330.

  • Mai, J. F., & Scherer, M. (2012). Simulating copulas: Stochastic models, sampling algorithms, and applications. Imperial College Press.

    Book  Google Scholar 

  • Marsaglia, G., & Marsaglia, J. (2004). Evaluating the Anderson-Darling distribution. Journal of Statistical Software, 9(2), 1–5. https://doi.org/10.18637/jss.v009.i02

    Article  Google Scholar 

  • Ng, K. W., Huang, Y. F., Koo, C. H., Chong, K. L., El-Shafie, A., & Ahmed, A. N. (2023). A review of hybrid deep learning applications for streamflow forecasting. Journal of Hydrology, 625, 130141. https://doi.org/10.1016/j.jhydrol.2023.130141

    Article  Google Scholar 

  • Oliveira, V. G. D., & Lima, C. H. R. (2016). Multiscale streamflow forecasts for the Brazilian hydropower system using Bayesian model averaging (BMA). RBRH, 21, 618–635. https://doi.org/10.1590/2318-0331.011616032

    Article  Google Scholar 

  • Oliveira (ed) (2020) O planejamento da operação energética no sistema interligado nacional. Conceitos, modelagem matemática, previsão de geração e carga. (2020). Artlibere Editora Ltda, São Paulo

  • Pereira, G., & Veiga, A. (2018). Par(p)-vine copula based model for stochastic streamflow scenario generation. Stochastic Environmental Research and Risk Assessment, 32(3), 833–842.

    Article  Google Scholar 

  • Pereira, G. A., & Veiga, A. (2019). Periodic copula autoregressive model designed to multivariate streamflow time series modelling. Water Resources Management, 33, 3417–3431.

    Article  Google Scholar 

  • Reddy, N. M., Saravanan, S., & Abijith, D. (2023). Streamflow simulation using conceptual and neural network models in the Hemavathi sub-watershed India. Geosystems and Geoenvironment, 2(2), 100153.

    Article  Google Scholar 

  • Sharma, R. K., Kumar, S., Padmalal, D., & Roy, A. (2023). Streamflow prediction using machine learning models in selected rivers of Southern India. International Journal of River Basin Management. https://doi.org/10.1080/15715124.2023.2196635

    Article  Google Scholar 

  • Sklar M (1959) Fonctions de repartition a n dimensions et leurs marges. Publications de l’Institut de Statistique de L’Universit? de Paris 8 229–231

  • Song, S., & Singh, V. P. (2010). Frequency analysis of droughts using the Plackett copula and parameter estimation by genetic algorithm. Stochastic Environmental Research and Risk Assessment, 24(5), 783–805.

    Article  Google Scholar 

  • Souza Filho, F. D. A. D., Rocha, R. V., Estácio, Á. B., Rolim, L. Z. R., Pontes Filho, J. D. D. A., Porto, V. C., & Guimarães, S. O. (2023). Enhancing streamflow forecasting for the Brazilian electricity sector: A strategy based on a hyper-multimodel. RBRH, 28, e45. https://doi.org/10.1590/2318-0331.282320230120

    Article  Google Scholar 

  • Wang, S., Pa, Z., Zhu, F., et al. (2023). Streamflow forecasting method with a hybrid physical process-mathematical statistic. Stochastic Environmental Research and Risk Assessment, 37, 4805–4826. https://doi.org/10.1007/s00477-023-02542-w

    Article  Google Scholar 

  • Zhu, F., Wang, Y., Liu, B., et al. (2023). Quantitative evaluation of the impact of hydrological forecasting uncertainty on reservoir real-time optimal operation. Stochastic Environmental Research and Risk Assessment. https://doi.org/10.1007/s00477-023-02588-w

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: [Guilherme Armando de Almeida Pereira, Álvaro de Lima Vega Filho]; Methodology: [Guilherme Armando de Almeida Pereira]; Formal analysis and investigation: [Guilherme Armando de Almeida Pereira], Writing: original draft preparation: [Guilherme Armando de Almeida Pereira, Álvaro de Lima Vega Filho]; Writing: review and editing: [Guilherme Armando de Almeida Pereira, Álvaro de Lima Vega Filho]; All authors read and approved the final manuscript.

Corresponding author

Correspondence to Guilherme Armando de Almeida Pereira.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Figure 4

Table 9

Table 10

Fig. 4
figure 4

These are contour plots of the bivariate copula density functions employed in this research. Copulas can represent distinct types of dependence, such as asymmetry and tail dependence

Table 9 List of employed hydropower plants and their corresponding indexes based to the information from the Brazilian National Electric System Operator (ONS)
Table 10 Dimensions of selected vine copulas for each month and hydropower plant (HPP)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

de Almeida Pereira, G.A., de Lima Veiga Filho, Á. Do non-linearity and non-Gaussianity truly matter in streamflow forecasting? A comparative study between PAR(p) and vine copula for Brazilian streamflow time series. Environ Monit Assess 196, 486 (2024). https://doi.org/10.1007/s10661-024-12645-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10661-024-12645-8

Keywords

Navigation