Abstract
Extreme rainfall data are usually scarce due to the low frequency of these events. However, prior knowledge of the precipitation depth and return period of a design event is crucial to water resource management and engineering. This study presents a model-based selection approach associated with regional frequency analysis to examine the lack of maximum daily rainfall data in Brazil. A generalized extreme values (GEV) distribution was hierarchically fitted using a Bayesian approach and data that were collected from rainfall gauge stations. The GEV model parameters were submitted to a model-based cluster analysis, resulting in regions of homogeneous rainfall regimes. Time-series data of the individual rainfall gauges belonging to each identified region were joined into a new dataset, which was divided into calibration and validation sets to estimate new GEV parameters and to evaluate model performance, respectively. The results identified two distinct rainfall regimes in the region: more and less intense rainfall extremes in the southeast and northwest regions, respectively. According to the goodness of fit measures that were used to evaluate the models, the aggregation level of the parameters in clustering influenced their performance.
Similar content being viewed by others
References
Banerjee S, Rosenfeld A (1993) Model-based cluster analysis. Pattern Recogn 26:963–974. https://doi.org/10.1016/0031-3203(93)90061-Z
Beijo LA, Vivanco MJF, Muniz JA (2009) Bayesian analysis for estimating the return period of maximum precipitation at Jaboticabal São Paulo state Brazil. Cienc Agrotecnol 33:261–270. https://doi.org/10.1590/S1413-70542009000100036
Bigiarini MZ (2014) Goodness-of-fit functions for comparison of simulated and observed hydrological time series. “HydroGOF Package” R Statistical Software Documentation. http://cran.r-project.org/web/packages/hydroGOF/hydroGOF.pdf. Accessed 29 Apr 2016
Burn DH (1990) Evaluation of regional flood frequency analysis with a region of influence approach. Water Resour Res 26(10):2257–2265. https://doi.org/10.1029/WR026i010p02257
Coles SG, Tawn JA (1996) A Bayesian analysis of extreme rainfall data. Appl Stat J R Stat C 45(4):463–478. http://www.jstor.org/stable/2986068. Accessed 29 Apr 2016
Crisci A, Gozzini B, Meneguzzo F, Pagliara S, Maracchi G (2002) Extreme rainfall in a changing climate: regional analysis and hydrological implications in Tuscany. Hydrol Process 16:1261–1274. https://doi.org/10.1002/hyp.1061
Criss RE, Winston WE (2008) Do Nash values have value? Discussion and alternate proposals. Hydrol Process 22:2723–2725. https://doi.org/10.1002/hyp.7072
El Adlouni S, Ouarda TBMJ, Zhang X, Roy R, Bobée B (2007) Generalized maximum likelihood estimators for the nonstationary generalized extreme value model. Water Resour Res 43:W03410. https://doi.org/10.1029/2005WR004545
Fraley C, Raftery AE (1998) How many clusters? Which clustering method? Answers via model-based cluster analysis. Comput J 41:578–588. https://doi.org/10.1093/comjnl/41.8.578
Gaioni E, Dey D, Ruggeri F (2010) Bayesian modeling of flash floods using generalized extreme value distribution with prior elicitation. Chil J Stat 1:75–90. http://chjs.soche.cl/papers/volumen1_2010/ChJS-01-01-05.pdf. Accessed 29 Apr 2016
Gupta HV, Kling H, Yilmaz KK, Martinez GF (2009) Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling. J Hydrol 377:80–91. https://doi.org/10.1016/j.jhydrol.009.08.003
Herrington T, Mahon A, Hires R, Miller J (2008) A comparison of methods used to calculate extreme water levels. In: Proceedings of solutions to coastal disasters congress 2008, Turtle Bay, Oahu, Hawaii, USA, April 13–16. ASCE, pp 198–209. http://doi.org/10.1061/40968(312)18. Accessed 29 Apr 2015
Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press, Cambridge
Kling H, Fuchs M, Paulin M (2012) Runoff conditions in the upper Danube basin under an ensemble of climate change scenarios. J Hydrol 424–425:264–277. https://doi.org/10.1016/j.hydrol.2012.01.011
Koutsoyiannis D (2004) Statistics of extremes and estimation of extreme rainfall: I. Theoretical investigation. Hydrolog Sci J 49:575–590. https://doi.org/10.1623/hysj.49.4.591.54424
Krause P, Boyle DP, Bäse F (2005) Comparison of different efficiency criteria for hydrological model assessment. Adv Geosci 5:89–97. https://doi.org/10.5194/adgeo-5-89-2005
Kyselý J, Gaál L, Picek J (2011) Comparison of regional and at-site approaches to modelling probabilities of heavy precipitation. Int J Climatol 31:1457–1472. https://doi.org/10.1002/joc.2182
Liang Z, Chang W, Li B (2012) Bayesian flood frequency analysis in the light of the model parameter uncertainties. Stoch Environ Res Risk Assess 26:721–730. https://doi.org/10.1007/s00477-011-0552-y
Lima KC, Satyamurty P, Fernandez JPR (2010) Large-scale atmospheric conditions associated with heavy rainfall episodes in Southeast Brazil. Theor Appl Climatol 101:121–135. https://doi.org/10.1007/s00704-009-0207-9
Michele CD, Salvadori G (2005) Some hydrological applications of small sample estimators of generalized pareto and extreme value distributions. J Hydrol 301:37–53. https://doi.org/10.1016/j.jhydrol.2004.06.015
Milligan GW, Cooper MC (1985) An examination of procedures for determining the number of clusters in a data set. Psychometrika 50(2):159–179. https://doi.org/10.1007/BF02294245
Müllner D (2013) Fastcluster: fast hierarchical, agglomerative clustering routines for R and Python. J Stat Softw 53(9):1–18. http://www.jstatsoft.org/v53/i09/paper. Accessed 29 Apr 2015
Najafi MR, Moradkhani H (2013a) A hierarchical Bayesian approach for the analysis of climate change impact on runoff extremes. Hydrol Process. https://doi.org/10.1002/hyp.10113
Najafi MR, Moradkhani H (2013b) Analysis of runoff extremes using spatial hierarchical Bayesian modeling. Water Resour Res 49:1–15. https://doi.org/10.1002/wrer.20381
Nguyen V-T-V, Tao D, Bourque A (2002) On selection of probability distributions for representing annual extreme rainfall series. Global solutions for urban drainage. In: Proceedings of the ninth international conference on urban drainage [S.l.], pp 1–10. http://doi.org/10.1061/40644(2002)250. http://ascelibrary.org/doi/pdf/10.1061/40644(2002)250. Accessed 29 Apr 2015
Norbiato D, Borga M, Sangati M, Zanon F (2007) Regional frequency analysis of extreme precipitation in the easter Italian Alps and the August 29, 2003 flash flood. J Hydrol 345:149–166. https://doi.org/10.1016/j.jhydrol.2007.07.009
Ouarda TBMJ, El-Adlouni S (2011) Bayesian nonstationarity frequency analysis of hydrological variables. J Am Water Resour Assoc 47(3):496–505. https://doi.org/10.1111/j.1752-1688.2011.00544.x
Park J-S, Kang H-S, Lee YS, Kim M-K (2011) Changes in the extreme daily rainfall in South Korea. Int J Climatol 31:2290–2299. https://doi.org/10.1002/joc.2236
R Development Core Team (2014) A language and environment for statistical computing. R Development Core Team, Vienna
Reis DS Jr, Stedinger JR (2005) Bayesian MCMC flood frequency analysis with historical information. J Hydrol 313:97–116. https://doi.org/10.1016/j.jhydrol.2005.02.028
Renard B, Kochanek K, Lang M, Garavagia F, Paquet E, Neppel L, Najib K, Carreau J, Arnaud P, Aubert Y, Borchi F, Soubeyroux J-M, Jourdain S, Vaysseire J-M, Sauquet E, Cipriani T, Auffray A (2013) Data-based comparison of frequency analysis methods: a general framework. Water Resour Res 49(2):825–843. https://doi.org/10.1002/wrcr.20087
Shahzadi A, Akhter AS, Saf B (2013) Regional frequency analysis of annual maximum rainfall in monsoon region of Pakistan using L-moments. Pak J Stat Oper Res 9(1):111–136. http://www.pjsor.com/index.php/pjsor/article/view/461/297. Accessed 29 Apr 2015
Silva ED, Reboita MS (2013) Precipitation study for Minas Gerais State, Brazil. Braz J Climatol 13:120–136. http://ojs.c3sl.ufpr.br/ojs/index.php/revistaabclima/article/view/33345. Accessed 5 Apr 2015
Thomas A, O’Hara B, Ligges U, Sturtz S (2006) Making BUGS open. R News 6:12–17. http://www.rni.helsinki.fi/~boh/publications/Rnews2006-1.pdf. Accessed 29 Apr 2015
Viglione A, Laio F, Claps P (2007) A comparison of homogeneity tests for regional frequency analysis. Water Resour Res 43:1–10. https://doi.org/10.1029/2006WR005095
Viglione A, Merz R, Salinas J, Blöschl G (2013) Flood frequency hydrology: 3. A Bayesian analysis. Water Resour Res 49:675–692. https://doi.org/10.1029/2011WR0107
Wallis JR, Schaefer MG, Barker BL, Taylor GH (2007) Regional precipitation-frequency analysis and spatial mapping for 24-hour and 2-hour durations for Washington State. Hydrol Earth Syst Sci 11(1):415–442. https://doi.org/10.5194/hess-11-415-2007
Acknowledgements
We thank the Foundation for Research Support of the State of Minas Gerais (FAPEMIG) for financial support of our research. The authors also thank the reviewers’ comments and suggestions that tremendously contributed to improve the paper quality. The time-series of daily rainfall data used in this work are freely available online for the public at the Hydrological Information System—HidroWeb (http://hidroweb.ana.gov.br/) of the National Agency of Water (ANA).
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A
Appendix B
Appendix C
Algorithm C2. The algorithm of clustering from Müllner (2013)
-
1.
Let S be the current set of nodes with implicit or explicit dissimilarity information. Determine a pair of mutually closest points (a; b).
-
2.
Join a and b into a new node n. Delete a and b from the set of nodes and add n to the set.
-
3.
Output the node labels a and b and their dissimilarity d(a; b).
-
4.
Update the dissimilarity information by specifying the distance from n to all other nodes. This update can be performed explicitly by specifying the distances or by denying a cluster representative in the stored data approach.
-
5.
Repeat steps 1–4 until there is a single node left that contains all of the original nodes.
Distance formula for the complete agglomerative clustering procedure:
where I and J are clusters that are joined into a new cluster, and K is any other cluster.
Rights and permissions
About this article
Cite this article
Assis, L.C., Calijuri, M.L., Silva, D.D. et al. A model-based site selection approach associated with regional frequency analysis for modeling extreme rainfall depths in Minas Gerais state, Southeast Brazil. Stoch Environ Res Risk Assess 32, 469–484 (2018). https://doi.org/10.1007/s00477-017-1481-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-017-1481-1