Abstract
We consider bias reduced estimators of the extreme value index (EVI) in case of Pareto-type distributions and under all max-domains of attraction. To this purpose we revisit the regression approach started in Feuerverger and Hall (Ann. Stat. 27, 760–781, 1999) and Beirlant et al. (Extremes 2, 177–200, 1999) in the case of a positive EVI, and in Beirlant et al. (2005) for real-valued EVI. We generalize these approaches using ridge regression exploiting the mathematical fact that the bias tends to 0 when the number of top data points used in the estimation is decreased. The penalty parameter is selected by minimizing the asymptotic mean squared error of the proposed estimator. The accuracy and utility of the ridge regression estimators are studied using simulations and are illustrated with case studies on reinsurance claim size data as well as daily wind speed data.
Similar content being viewed by others
References
Beirlant, J., Dierckx, G., Goegebeur, Y., Matthys, G.: Tail index estimation and an exponential regression model. Extremes 2, 177–200 (1999)
Beirlant, J., Dierckx, G., Guillou, A., Starica, C.: On exponential representations of log-spacings of extreme order statistics. Extremes 5, 157–180 (2002)
Beirlant, J., Goegebeur, Y., Segers, J., Teugels, J.: Statistics of Extremes: Theory and Applications. Wiley, Chichester (2004)
Beirlant, J., Dierckx, G., Guillou, A.: Estimation of the extreme-value index and generalized Quantile plots. Bernoulli 11(6), 949–970 (2005)
Beirlant, J., Gaonyalelwe, M., Verster, A.: Using shrinkage estimators to reduce bias and mse in estimation of heavy tails. To appear in RevStat (2017)
Caeiro, F., Gomes, I., Pestana, D.: Direct reduction of bias of the classical hill estimator. RevStat 3(2), 113–136 (2005)
Dekkers, A.L.M., Einmahl, J.H.J., de Haan, L.: A moment estimator for the index of an extreme-value distribution. Ann. Stat. 17, 1833–1855 (1989)
Feuerverger, A., Hall, P.: Estimating a tail exponent by modelling departure from a pareto distribution. Ann. Stat. 27, 760–781 (1999)
Fraga Alves, M.I., de Haan, L., Lin, T.: Estimation of the parameter controlling the speed of convergence in extreme value theory. Math. Methods Stat. 12, 155–176 (2003)
Gomes, M.I., Martins, M.J.: Asymptotically unbiased estimators of the tail index based on external estimation of the second order parameter. Extremes 5, 387–414 (2002)
Guillou, A., Hall, P.: A diagnostic for selecting the threshold in extreme value analysis. J. Royal Stat. Soc. B 63, 293–305 (2001)
Hall, P.: On some simple estimates of an exponent of regular variation. J. Royal Stat. Soc. B 44, 37–42 (1982)
Hastie, T., Tibshirani, R., Friedman, J.H.: The Elements of Statistical Learning. Springer-Verlag, New York (2008)
Hill, B.M.: A simple general approach to inference about the tail of a distribution. Ann. Stat. 3(5), 1163–1174 (1975)
Hosking, J.R.M., Wallis, J.R.: Parameter and quantile estimation for the generalized pareto distribution. Technometrics 29, 339–349 (1987)
Hoerl, A.E., Kennard, R.W.: Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 12, 55–67 (1970)
Matthys, G., Beirlant, J.: Adaptive threshold selection in tail index estimation. In: Embrechets, P (ed.) Extremes and Integrated Risk Management, pp 37–49. Risk Books, London (2000)
Smith, R.L.: Estimating tails of probability distributions. Ann. Stat. 15, 1174–1207 (1987)
Acknowledgments
The authors are grateful to the referees for their constructive comments and suggestions, which lead to improvements in the paper.
This work is based on research supported in part by the National Research Foundation of South Africa (Grant Number 108874). The authors acknowledge that the opinions, findings and conclusions or recommendations expressed in any publication by NRF-supported research is that of the authors, and that the NRF accepts no liability whatsoever in this regard.
Author information
Authors and Affiliations
Corresponding author
Appendix: Asymptotics
Appendix: Asymptotics
Proof of Theorem 1
Note that
Using the consistency of \(\hat {\beta }_{k}\) and \(\hat {\rho }\) it follows that as M ≠ 0 we have \(\hat {\tau }_{k}^{+} \to _{p} \gamma ^{2}/M^{2}\) as \(\sqrt {k}b_{n,k} \to M\). So
So the limit distribution is obtained from the limit distribution of
In case M = 0 we have similarly that \((S_{cc}+ \hat {\tau }_{k}^{+})^{-1} \to _{p} 0\) so that we obtain the limit distribution of
□
Proof of Theorem 2
It follows from Beirlant et al. (2005) that the generalised log spacings {Yj} have the following asymptotic expansions
where
Furthermore, write {λj(τ)} as
where \(\alpha (\tau )= 1+ \frac {\bar {c}^{2}}{S_{cc}+\tau }\) and \(\beta (\tau ) = -\frac {\bar {c}}{S_{cc}+\tau }\). Also note that \(\bar {c} \to \frac {1}{1-{\tilde {\rho }}}\) and \(S_{cc} +\bar {c}^{2}\to \frac {1}{1-2\tilde \rho }\) as k →∞ so \(\beta (\tau ) = \left (1-\alpha (\tau )\right )(1-{\tilde {\rho }})\left (1+o_{p}(1)\right )\). The proof of Theorem 2 follows then similarly as in the proof of Theorem 1, leading to the asymptotic distribution of
The expected value of the limit distribution of Sk is given by
The asymptotic variance of Sk follows from a more tedious calculation, both in case γ > 0 and γ < 0. □
The asymptotic variance when γ > 0. The covariance of the error terms {𝜖j} is given by
It follows that the asymptotic variance of Sk is given by
where \(\xi _{1}=\frac {-{\tilde {\rho }} \bar {c}}{S_{cc}}\) and \(\xi _{2}=\bar {c}\).
Indeed, as k →∞
and
The asymptotic variance when γ < 0. Here
It follows that the asymptotic variance of Sk is given by
where \(\xi _{1}(\gamma )=\frac {-2{\tilde {\rho }}(1-\gamma )}{S_{cc}(1-\gamma -{\tilde {\rho }})(1-2\gamma )}\) and \(\xi _{2}(\gamma )=\frac {2(1-\gamma )^{2}}{(1-\gamma -{\tilde {\rho }})(1-2\gamma )}\). Indeed, as k →∞
and
Rights and permissions
About this article
Cite this article
Buitendag, S., Beirlant, J. & de Wet, T. Ridge regression estimators for the extreme value index. Extremes 22, 271–292 (2019). https://doi.org/10.1007/s10687-018-0338-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10687-018-0338-4