Bias Reduction in Kernel Tail Index Estimation for Randomly Truncated Pareto-Type Data

Mancer, Saida; Necir, Abdelhakim; Benchaira, Souad

doi:10.1007/s13171-022-00303-5

Bias Reduction in Kernel Tail Index Estimation for Randomly Truncated Pareto-Type Data

Published: 21 December 2022

Volume 85, pages 1510–1547, (2023)
Cite this article

Sankhya A Aims and scope Submit manuscript

Saida Mancer¹,
Abdelhakim Necir¹ &
Souad Benchaira¹

98 Accesses
Explore all metrics

Abstract

A bias reduction to a kernel estimator of the tail index of randomly right-truncated Pareto-type distributions is made. The asymptotic normality of the derived estimator is established by assuming the second-order condition of regular variation. A simulation study is carried out to evaluate the finite sample behavior of the proposed estimator and compare it to those with non-reduced bias. An application to a real dataset of lifetimes of automobile brake pads is done.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Kernel based estimation of the distribution function for length biased data

Article 13 January 2022

Estimating the Second-Order Parameter of Regular Variation and Bias Reduction in Tail Index Estimation Under Random Truncation

Article 17 October 2018

Regression Estimator for the Tail Index

Article Open access 24 June 2020

References

Beirlant, J., Bardoutsos, A., de Wet, T. and Gijbels, I. (2016). Bias reduced tail estimation for censored Pareto type distributions. Statist. Probab. Lett.109, 78–88.
Article MathSciNet MATH Google Scholar
Beirlant, J., Maribe, G. and Verster, A. (2018). Penalized bias reduction in extreme value estimation for censored Pareto-type data, and long-tailed insurance applications. Insurance Math. Econom. 78, 114–122.
Article MathSciNet MATH Google Scholar
Benchaira, S., Meraghni, D. and Necir, A. (2016a). Tail product-limit process for truncated data with application to extreme value index estimation. Extremes19, 219–251.
Article MathSciNet MATH Google Scholar
Benchaira, S., Meraghni, D. and Necir, A. (2016b). Kernel estimation of the tail index of a right-truncated Pareto-type distribution. Statist. Probab. Lett.119, 186–193.
Article MathSciNet MATH Google Scholar
Caeiro, F. and Gomes, M.I. (2015). Threshold selection in extreme value analysis. Chapter In Extreme Value Modeling and Risk Analysis: Methods and Applications (Dipak, D. and Jun, Y.). (pp. 69-87), Chapman-Hall/CRC, ISBN 9781498701297.
Ciuperca, G. and Mercadier, C. (2010). Semi-parametric estimation for heavy tailed distributions. Extremes 13, 55–87.
Article MathSciNet MATH Google Scholar
Csörgő, S., Deheuvels, P. and Mason, D. (1985). Kernel estimates of the tail index of a distribution. Ann. Statist. 13, 1050–1077.
Article MathSciNet MATH Google Scholar
Gardes, L. and Stupfler, G. (2015). Estimating extreme quantiles under random truncation. TEST 24, 207–227.
Article MathSciNet MATH Google Scholar
Groeneboom, P., Lopuhaä, H.P. and de Wolf, P.P. (2003). Kernel-type estimators for the extreme value index. Ann. Statist. 31, 1956–1995.
Article MathSciNet MATH Google Scholar
de Haan, L. and Stadtmüller, U. (1996). Generalized regular variation of second order. J. Australian Math. Soc. (Series A) 61, 381–395.
Article MathSciNet MATH Google Scholar
de Haan, L. and Ferreira, A. (2006). Extreme Value Theory: An Introduction. Springer, Berlin.
Book MATH Google Scholar
He, S. and Yang, G.L. (1998). Estimation of the truncation probability in the random truncation model. Ann. Statist. 26, 1011–1027.
Article MathSciNet MATH Google Scholar
Hill, B.M. (1975). A simple general approach to inference about the tail of a distribution. Ann. Statist. 3, 1163–1174.
Article MathSciNet MATH Google Scholar
Haouas, N., Necir, A. and Brahimi, B. (2019). Estimating the second-order parameter of regular variation and bias reduction in tail index estimation under random truncation. J. Stat. Theory Pract. 13, Paper 7, 33.
MATH Google Scholar
Lawless, J.F. (2002). Statistical Models and Methods for Lifetime Data, 2nd edn. Wiley Series in Probability and Statistics, New York.
Book MATH Google Scholar
Neves, C. and Fraga Alves, M.I. (2004). Reiss and Thomas’ automatic selection of the number of extremes. Comput. Statist. Data Anal. 47, 689–704.
Article MathSciNet MATH Google Scholar
Reiss, R.D. and Thomas, M. (2007). Statistical analysis of extreme values with applications to insurance, finance hydrology and other fields, 3rd edn. Birkhäuser, Berlin.
MATH Google Scholar
Strzalkowska-Kominiak, E. and Stute, W. (2009). Martingale representations of the Lynden-Bell estimator with applications. Statist. Probab. Lett. 79, 814–820.
Article MathSciNet MATH Google Scholar
Weissman, I. (1978). Estimation of parameters and large quantiles based on the k largest observations. J. Am. Statist. Assoc. 73, 812–815.
MathSciNet MATH Google Scholar
Worms, J. and Worms, R. (2016). A Lynden-Bell integral estimator for extremes of randomly truncated data. Statist. Probab. Lett. 109, 106–117.
Article MathSciNet MATH Google Scholar
Woodroofe, M. (1985). Estimating a distribution function with truncated data. Ann. Statist. 13, 163–177.
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

We gratefully acknowledge the insights, suggestions, and queries by anonymous referees. Discussions with Djamel Meraghni are also greatly appreciated.

Author information

Authors and Affiliations

Department of Mathematics, Mohamed Khider University of Biskra, Biskra, Algeria
Saida Mancer, Abdelhakim Necir & Souad Benchaira

Authors

Saida Mancer
View author publications
You can also search for this author in PubMed Google Scholar
Abdelhakim Necir
View author publications
You can also search for this author in PubMed Google Scholar
Souad Benchaira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Saida Mancer.

Ethics declarations

• The authors have no relevant financial or non-financial interests to disclose.

• The authors have no competing interests to declare that are relevant to the content of this article.

• All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

• The authors have no financial or proprietary interests in any material discussed in this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix: A

A.1: Instrumental Result

Proposition A.1.

Assume that $\overline {\mathbf {F}}\in \mathcal {R}\mathcal {V}_{2}\left (-1/\gamma _{1};\tau _{1},A_{\mathbf {F}}\right ) $ and K satisfies assumptions $\left [ A1\right ] -\left [ A4\right ] ,$ then: $\left (i\right ) $ $E_{t}\left (\beta \right ) =\eta _{2}+\eta _{3}A_{\mathbf {F}}\left (t\right ) \left (1+o\left (1\right ) \right ) ,$ for β > 0, and for a given twice-differentiable function f : $\left (ii\right ) $ $f\left (L_{t,K}\right ) =f\left (\gamma _{1}\right ) +f^{\prime }\left (\gamma _{1}\right ) \eta _{1}A_{\mathbf {F}}\left (t\right ) \left (1+o\left (1\right ) \right ) ,$ where η_i = η_i,K, i = 1, 2, 3 are stated in (1.7) and Section 1.1 respectively. Moreover

$$ \left( iii\right) \text{ }A_{\mathbf{F}}\left( t\right) =\frac {E_{t}\left( \beta\right) -f_{\ast}\left( L_{t}\right) }{\eta_{3}-f_{\ast }^{\prime}\left( \gamma_{1}\right) \eta_{1}}\left( 1+o\left( 1\right) \right) ,\text{ as }t\rightarrow\infty, $$

where f_∗ is as in (1.12), $E_{t}\left (\beta \right ) :=E_{t,K}\left (\beta \right ) $ and L_t := L_t,K.

Proof 1.

Recall that (1.10), and let us decompose $E_{t}\left (\beta \right ) $ into the sum of

$$ E_{t}^{\left( 1\right) }\left( \beta\right) :=1-\beta{\int}_{1}^{\infty }x^{-\beta-1}K^{\ast}\left( x^{-1/\gamma_{1}}\right) dx $$

and

$$ E_{t}^{\left( 2\right) }\left( \beta\right) :=-\beta{\int}_{1}^{\infty }x^{-\beta-1}\left( K^{\ast}\left( \frac{\overline{\mathbf{F}}\left( tx\right) }{\overline{\mathbf{F}}\left( t\right) }\right) -K^{\ast}\left( x^{-1/\gamma_{1}}\right) \right) dx. $$

Using the change of variables $s=x^{-1/\gamma _{1}}$ we readily show that $E_{t}^{\left (1\right ) }\left (\beta \right ) =\eta _{2},$ for β > 0. Applying Taylor’s expansion to K^∗, we may rewrite $E_{t}^{\left (2\right ) }\left (\beta \right ) $ into the sum of

$$ E_{t}^{\left( 2,1\right) }\left( \beta\right) :=-\beta{\int}_{1}^{\infty }x^{-\beta-1}\left( \frac{\overline{\mathbf{F}}\left( tx\right) } {\overline{\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1}}\right) K\left( x^{-1/\gamma_{1}}\right) dx $$

and

$$ E_{t}^{\left( 2,2\right) }\left( \beta\right) :=-\frac{\beta}{2}{\int}_{1}^{\infty}x^{-\beta-1}\left( \frac{\overline{\mathbf{F}}\left( tx\right) }{\overline{\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1}}\right)^{2}K^{\prime}\left( \xi_{t}\left( x\right) \right) dx, $$

where $\xi _{t}\left (x\right ) $ is between $\overline {\mathbf {F}}\left (tx\right ) /\overline {\mathbf {F}}\left (t\right ) $ and $x^{-1/\gamma _{1}}.$ Since HCode $\overline {\mathbf {F}}\in \mathcal {R}\mathcal {V}_{2}(-1/\gamma _{1};\tau _{1},$ A_F), then making use of Potters inequalities corresponding to the second-order condition of df F, we write

$$ \left\vert \frac{\frac{\overline{\mathbf{F}}\left( tx\right) } {\overline{\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1}}}{A_{\mathbf{F} }\left( t\right) }-x^{-1/\gamma_{1}}\frac{x^{\tau_{1}/\gamma_{1}}-1} {\tau_{1}\gamma_{1}}\right\vert \leq\epsilon x^{-1/\gamma_{1}+\tau_{1} /\gamma_{1}+\epsilon}, $$

(A.1)

for any 𝜖 > 0, for all x > 1 and for all large t, see for instance Theorem 2.3.9 in de Haan and Ferreira (2006). Using this inequality, we end up with

$$ E_{t}^{\left( 2,1\right) }\left( \beta\right) =-\left( 1+o\left( 1\right) \right) A_{\mathbf{F}}\left( t\right) \beta{\int}_{1}^{\infty }x^{-\beta-1/\gamma_{1}-1}\frac{x^{\tau_{1}/\gamma_{1}}-1}{\tau_{1}\gamma_{1} }K\left( x^{-1/\gamma_{1}}\right) dx. $$

Once again, using change of variables $s=x^{-1/\gamma _{1}},$ we show easily that the previous integral equals to η₃. In view of assumption $\left [ A4\right ] ,$ the function $K^{\prime }$ is bounded, it follows that

$$ E_{t}^{\left( 2,2\right) }\left( \beta\right) =O\left( 1\right) {\int}_{1}^{\infty}x^{-\beta-1}\left\vert \frac{\overline{\mathbf{F}}\left( tx\right) }{\overline{\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1} }\right\vert^{2}dx. $$

Let 𝜖 > 0 so small such that − 1/γ₁ + 𝜖 < 0. The Potters inequalities corresponding to the first order condition of regularly varying functions says:

$$ \left\vert \frac{\overline{\mathbf{F}}\left( tx\right) }{\overline {\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1}}\right\vert \leq\epsilon x^{-1/\gamma_{1}+\epsilon}<\epsilon, $$

for all x ≥ 1 and for all large t, see for instance Proposition B.1.9, assertion 5 in de Haan and Ferreira (2006). It follows that

$$ E_{t}^{\left( 2,2\right) }\left( \beta\right) =o\left( 1\right) {\int}_{1}^{\infty}x^{-\beta-1}\left\vert \frac{\overline{\mathbf{F}}\left( tx\right) }{\overline{\mathbf{F}}\left( t\right) }-x^{-1/\gamma_{1} }\right\vert dx. $$

Let us write

$$ E_{t}^{\left( 2,2\right) }\left( \beta\right) =o\left( A_{\mathbf{F} }\left( t\right) \right) {\int}_{1}^{\infty}x^{-\beta-1}\left\vert \frac{\frac{\overline{\mathbf{F}}\left( tx\right) }{\overline{\mathbf{F} }\left( t\right) }-x^{-1/\gamma_{1}}}{A_{\mathbf{F}}\left( t\right) }\right\vert dx. $$

Subtracting $x^{-1/\gamma _{1}}\frac {x^{\tau _{1}/\gamma _{1}}-1}{\tau _{1} \gamma _{1}},$ inside the sign of the previous absolute value, and adding the same quantity, then applying inequality (A.1), we may readily show that $E_{t}^{\left (2,2\right ) }\left (\beta \right ) =o\left (A_{\mathbf {F}}\left (t\right ) \right ) ,$ as $t\rightarrow \infty ,$ that we omit further details. Thereby $E_{t}^{\left (2,1\right ) }\left (\beta \right ) =\left (1+o\left (1\right ) \right ) \eta _{3}$ leading to the result of assertion $\left (ii\right ) .$ To show the second assertion $\left (ii\right ) ,$ is suffices to use Taylor’s expansion to function f and similar arguments as used to first assertion $\left (i\right ) ,$ that we omit details. Let us now focus on the third one $\left (iii\right ) .$ Multiplying, the equation of assertion $\left (i\right ) $ by $f\left (\gamma _{1}\right ) ,$ we write

$$ f\left( \gamma_{1}\right) E_{t}\left( \beta\right) =f\left( \gamma_{1}\right) \eta_{2}+f\left( \gamma_{1}\right) \eta_{3}A_{\mathbf{F} }\left( t\right) \left( 1+o\left( 1\right) \right) , $$

(A.2)

and from assertion $\left (ii\right ) $ we have $f\left (\gamma _{1}\right ) =f\left (L_{t}\right ) -\eta _{1}f^{\prime }\left (\gamma _{1}\right ) A_{\mathbf {F}}\left (t\right ) \left (1+o\left (1\right ) \right ) .$ Substituting the previous expression into the first term of the right-side of equation (A.2), yields

$$ f\left( \gamma_{1}\right) E_{t}\left( \beta\right) =\left( f\left( L_{t}\right) -\eta_{1}f^{\prime}\left( \gamma_{1}\right) A_{\mathbf{F} }\left( t\right) \right) \eta_{2}\left( 1+o\left( 1\right) \right) +f\left( \gamma_{1}\right) \eta_{3}A_{\mathbf{F}}\left( t\right) \left( 1+o\left( 1\right) \right) , $$

which gives

$$ A_{\mathbf{F}}\left( t\right) =\frac{E_{t}\left( \beta\right) -\frac{f\left( L_{t}\right) }{f\left( \gamma_{1}\right) }\eta_{2}} {\eta_{3}-\frac{f^{\prime}\left( \gamma_{1}\right) }{f\left( \gamma_{1}\right) }\eta_{2}\eta_{1}}\left( 1+o\left( 1\right) \right) . $$

In particular, for f = f_∗, we have $f_{\ast }\left (\gamma _{1}\right ) =\eta _{2},$ it follows that

$$ A_{\mathbf{F}}\left( t\right) =\frac{E_{t}\left( \beta\right) -f_{\ast }\left( L_{t}\right) }{\eta_{3}-f_{\ast}^{\prime}\left( \gamma_{1}\right) \eta_{1}}\left( 1+o\left( 1\right) \right) , $$

which gives the third assertion $\left (iii\right ) .$ The proof of Proposition A.1 is now completed. □

A.2: High Quantile Estimation

An extreme quantile of df F is a value q_ν defined in terms of the generalized inverse by $q_{\nu }:=U_{\mathbf {F}}\left (1/\nu \right ) $ for v ↓ 0. In other words, it is an X-value which is sufficiently large so that the probability of exceeding it is very small. Also known as value-at-risk (VaR), this quantity is largely used, as a risk measure, in several fields such as in finance, insurance, hydrology and reliability. For asymptotic needs, we suppose that v is a function of the observed sample size n, denoted by v = v_n, and assumed to be much smaller than 1/n. The estimation of high quantiles of heavy-tailed distributions, in the case of complete data, has been extensively studied in the literature (de Haan and Ferreira, 2006, see, for instance,). The well-known Weissman estimator (Weissman, 1978) of high quantile q_ν adapted to our new tail index estimator $\widehat {\gamma }_{1,K}^{\ast }$ is given by

$$ \widehat{q}_{\nu}:=X_{n-k:n}\left( \frac{\nu}{\overline{\mathbf{F}}_{n}\left( X_{n-k:n}\right) }\right)^{-\widehat{\gamma}_{1,K}^{\ast}}, $$

where F_n is Woodroofe’s nonparametric estimator of df F, given in Section 1.

Appendix: B

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Mancer, S., Necir, A. & Benchaira, S. Bias Reduction in Kernel Tail Index Estimation for Randomly Truncated Pareto-Type Data. Sankhya A 85, 1510–1547 (2023). https://doi.org/10.1007/s13171-022-00303-5

Download citation

Received: 07 February 2022
Accepted: 22 November 2022
Published: 21 December 2022
Issue Date: August 2023
DOI: https://doi.org/10.1007/s13171-022-00303-5

Keywords and phrases

AMS (2000) subject classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bias Reduction in Kernel Tail Index Estimation for Randomly Truncated Pareto-Type Data

Abstract

Access this article

Similar content being viewed by others

Kernel based estimation of the distribution function for length biased data