Abstract
When making inferences about extreme quantiles, using simple parametric models for the entire distribution can be problematic in that a model that accurately describes the bulk of the distribution may lead to substantially biased estimates of extreme quantiles if the model is misspecified. One way to address this problem is to use flexible parametric families of distributions. For the setting where extremes in both the upper and lower tails are of interest, this paper describes various approaches to quantifying notions of flexibility and then proposes new parametric classes of distributions that satisfy these notions and are computable without requiring numerical integration. A semiparametric extension of these distributions is proposed when the parametric classes are not sufficiently flexible. Some of the new models are applied to daily temperature in July from an ensemble of 50 climate model runs that can be treated as independent realizations of the climate system over the period studied. The large ensemble makes it possible to compare estimates of extreme quantiles based on a single model run to estimates based on the full ensemble. For these data, at the four largest US cities, Chicago, Houston, Los Angeles and New York City, the parametric models generally dominate estimates based on fitting generalized Pareto distributions to some fraction of the most extreme observations, sometimes by a substantial margin. Thus, in at least this setting, parametric models not only provide a way to estimate the whole distribution, they also result in better estimates of extreme quantiles than traditional extreme value approaches.
Similar content being viewed by others
References
Azzalini, A., Regoli, G.: Some properties of skew-symmetric distributions. Ann. Inst. Stat. Math. 64, 857–859 (2012). https://doi.org/10.1007/s10463-011-0338-5
Carreau, J., Bengio, Y.: A hybrid Pareto model for asymmetric fat-tailed data: The univariate case. Extremes 12(1), 53–76 (2009). https://doi.org/10.1007/s10687-008-0068-0
Davison, A.C., Smith, R.L.: Models for exceedances over high thresholds. J. R. Stat. Soc.: Series B (Methodol.) 52(3), 393–425 (1990). https://doi.org/10.1111/j.2517-6161.1990.tb01796.x. https://rss.onlinelibrary.wiley.com/doi/abs/10.1111/j.2517-6161.1990.tb01796.x
de Haan, L., Ferreira, A.: Extreme Value Theory: An Introduction. Springer, New York (2007)
Ferreira, J.T.A.S., Steel, M.F.J.: A constructive representation of univariate skewed distributions. J. Am. Stat. Assoc. 101(474), 823–829 (2006). https://doi.org/10.1198/016214505000001212
Field, C., Barros, V., Stocker, T., Qin, D., Dokken, D., Ebi, K., Mastrandrea, M., Mach, K., Plattner, G.K., Allen, S., Tignor, M., Midgley, P (eds.): Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation. A Special Report of Working Groups I and II of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge (2012)
Frigessi, A, Haug, O, Rue, H: A dynamic mixture model for unsupervised tail estimation without threshold selection. Extremes 5(3), 219–235 (2002). https://doi.org/10.1023/A:1024072610684
Genton, M.G.: Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality. Chapman and Hall, Abingdon (2004). http://cds.cern.ch/record/997651
Haugen, M.A., Stein, M.L., Sriver, R.L., Moyer, E.J.: Assessing changes in temperature distributions using ensemble model simulations. J. Climate 31(20), 8573–8588 (2018). https://doi.org/10.1175/JCLI-D-17-0782.1
Huang, WK, Nychka, DW, Zhang, H: Estimating precipitation extremes using the log-histospline. Environmetrics 0(0), e2543 (2018). https://doi.org/10.1002/env.2543. https://onlinelibrary.wiley.com/doi/abs/10.1002/env.2543
Huser, R., Davison, A., Genton, M.: Likelihood estimators for multivariate extremes. Extremes 19, 79–103 (2016). https://doi.org/10.1007/s10687-015-0230-4
Ledford, A.W., Tawn, J.A.: Statistics for near independence in multivariate extreme values. Biometrika 83(1), 169–187 (1996). https://doi.org/10.1093/biomet/83.1.169
McDonald, JB, Xu, YJ: A generalization of the beta distribution with applications. J. Econometr. 66 (1), 133–152 (1995). https://doi.org/10.1016/0304-4076(94)01612-4. http://www.sciencedirect.com/science/article/pii/0304407694016124
Naveau, P, Huser, R, Ribereau, P, Hannart, A: Modeling jointly low, moderate, and heavy rainfall intensities without a threshold selection. Water Resour. Res. 52(4), 2753–2769 (2016). https://doi.org/10.1002/2015WR018552. https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1002/2015WR018552
Opitz, T., Huser, R., Bakka, H., Rue, H.: Inla goes extreme: Bayesian tail regression for the estimation of high spatio-temporal quantiles. Extremes 21(3), 441–462 (2018). https://doi.org/10.1007/s10687-018-0324-x
Papastathopoulos, I, Tawn, JA: Extended generalised Pareto models for tail estimation. J. Statist. Plan. Infer. 143(1), 131–143 (2013). https://doi.org/10.1016/j.jspi.2012.07.001. http://www.sciencedirect.com/science/article/pii/S0378375812002388
Salvadori, G., De Michele, C.: Multivariate multiparameter extreme value models and return periods: A copula approach. Water Resour. Res. 46, 10 (2010). https://doi.org/10.1029/2009WR009040
Sardeshmukh, P.D., Compo, G.P., Penland, C.: Need for caution in interpreting extreme weather statistics. J. Climate 28(23), 9166–9187 (2015). https://doi.org/10.1175/JCLI-D-15-0020.1
Schoelzel, C, Friederichs, P: Multivariate non-normally distributed random variables in climate research – introduction to the copula approach. Nonlinear Process. Geophys. 15(5), 761–772 (2008). https://hal-cea.archives-ouvertes.fr/cea-004404313
Smith, R.L., Tawn, J.A., Coles, S.G.: Markov chain models for threshold exceedances. Biometrika 84(2), 249–268 (1997). https://doi.org/10.1093/biomet/84.2.249
Sriver, R.L., Forest, C.E., Keller, K.: Effects of initial conditions uncertainty on regional climate variability: An analysis using a low-resolution CESM ensemble. Geophys. Res. Lett. 42(13), 5468–5476 (2015). https://doi.org/10.1002/2015GL064546
Tencaliec, P, Favre, AC, Naveau, P, Prieur, C, Nicolet, G: Flexible semiparametric generalized Pareto modeling of the entire range of rainfall amount. Environmetrics 31(2), e2582 (2020). https://doi.org/10.1002/env.2582. https://onlinelibrary.wiley.com/doi/abs/10.1002/env.2582, e2582 env.2582
Acknowledgements
This material was based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research (ASCR) under Contract DE-AC02-06CH11347.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Stein, M.L. Parametric models for distributions when interest is in extremes with an application to daily temperature. Extremes 24, 293–323 (2021). https://doi.org/10.1007/s10687-020-00378-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10687-020-00378-z