Skip to main content
Log in

Hazard rate estimation for left truncated and right censored data

  • Published:
Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Abstract

Left truncation and right censoring (LTRC) presents a unique challenge for nonparametric estimation of the hazard rate of a continuous lifetime because consistent estimation over the support of the lifetime is impossible. To understand the problem and make practical recommendations, the paper explores how the LTRC affects a minimal (called sharp) constant of a minimax MISE convergence over a fixed interval. The corresponding theory of sharp minimax estimation of the hazard rate is presented, and it shows how right censoring, left truncation and interval of estimation affect the MISE. Obtained results are also new for classical cases of censoring or truncation and some even for the case of direct observations of the lifetime of interest. The theory allows us to propose a relatively simple data-driven estimator for small samples as well as the methodology of choosing an interval of estimation. The estimation methodology is tested numerically and on real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Andersen, P. K., Borgan, O., Gill, R. D., Keiding, N. (1993). Statistical models based on counting processes. New York: Springer.

  • Antoniadis, A., Gregoire, G., Nason, G. (1999). Density and hazard rate estimation for right-censored data by using wavelets methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 61, 63–84.

  • Bagkavos, D., Patil, P. (2012). Variable bandwidths for nonparametric hazard rate estimation. Communications in Statistics - Theory and Methods, 38, 1055–1078.

  • Bindera, N., Schumache, M. (2014). Missing information caused by death leads to bias in relative risk estimates. Journal of Clinical Epidemiology, 67, 1111–1120.

  • Bremhorsta, V., Lamberta, F. (2016). Flexible estimation in cure survival models using Bayesian p-splines. Computational Statistics & Data Analysis, 93, 270–284.

  • Brody, T. (2012). Clinical trials: Study design, endpoints and biomarkers, drug safety, and FDA and ICH guidelines. New York: Elsevier.

  • Brunel, E., Comte, F. (2008). Adaptive estimation of hazard rate with censored data. Communications in Statistics - Theory and Methods, 37, 1284–1305.

  • Cao, R., Janssen, P., Veraverbeke, N. (2005). Relative hazard rate estimation for right censored and left truncated data. Test, 14, 257–280.

  • Cox, D. R., Oakes, D. (1984). Analysis of survival data. London: Chapman&Hall.

  • Daepp, I., Hamilton, M., West, G., Bettencourt, L. (2015). The mortality of companies. Journal of the Royal Society Inference, 12, 1–8.

  • Döhler, S., Rüschendorf, L. (2002). Adaptive estimation of hazard functions. Probability and Mathematical Statistics, 22, 355–379.

  • Efromovich, S. (1999). Nonparametric curve estimation: Methods, theory and applications. New York: Springer.

  • Efromovich, S. (2001). Density estimation under random censorship and order restrictions: From asymptotic to small samples. Journal of the American Statistical Association, 96, 667–685.

    Article  MathSciNet  MATH  Google Scholar 

  • Efromovich, S. (2016). Minimax theory of nonparametric hazard rate estimation: Efficiency and adaptation. Annals of the Institute of Mathematical Statistics, 68, 25–75.

    Article  MathSciNet  MATH  Google Scholar 

  • Efromovich, S., Pinsker, M. (1982). Estimation of a square-integrable probability density of a random variable. Problems of Information Transmission, 18, 19–38.

  • Fleming, T., Harrington, D. (2011). Counting processes and survival analysis. New York: Wiley.

  • Gill, R. (2006). Lectures on survival analysis. New York: Springer.

    MATH  Google Scholar 

  • Hagar, Y., Dukic, V. (2015). Comparison of hazard rate estimation in R. arXiv: 1509.03253v1

  • Huang, C., Qin, J. (2013). Semiparametric estimation for the additive hazard model with left-truncated and right-censored data. Biometrika, 100, 877–888.

  • Jankowski, H., Wellner, J. (2009). Nonparametric estimation of a convex bathtub-shaped hazard function. Bernoulli, 15, 1010–1035.

  • Klein, J. P., Moeschberger, M. L. (2003). Survival analysis: Techniques for censored and truncated data. New York: Springer.

  • Klugman, S., Panjer, H., Willmot, G. (2012). Loss models: From data to decisions. New York: Wiley.

  • Lee, E., Wang, J. (2013). Statistical methods for survival data analysis. New York: Wiley.

  • Lu, X., Min, L. (2014). Hazard rate function in dynamic environment. Reliability Engineering and System Safety, 130, 50–60.

  • Müller, H., Wang, J. (2007). Density and hazard rate estimation. In F. Ruggeri, R. Kenett, F. Faltin (Eds.), Encyclopedia of statistics in quality and reliability (pp. 517–522). Chichester: Wiley.

  • Patil, P., Bagkavos, D. (2012). Histogram for hazard rate estimation. Sankhya B, 74, 286–301.

  • Petrov, V. (1975). Sums of independent random variables. New York: Springer.

    Book  MATH  Google Scholar 

  • Pierce, J., Natarajan, L., Caan, B., Parker, B., Greenberg, R., Flatt, S., et al. (2007). Influence of a diet very high in vegetables, fruit, and fiber and low in fat on prognosis following treatment for breast cancer: The women’s healthy eating and living (WHEL) randomized trial. Journal of the American Medical Association, 298, 289–298.

    Article  Google Scholar 

  • Qian, J., Betensky, R. (2014). Assumptions regarding right censoring in the presence of left truncation. Statistics and Probability Letters, 87, 12–17.

  • Rock, C., Flatt, S., Nataryan, L., Thomson, C., Bardwell, W., Newman, V., et al. (2005). Plasma carotenoids and recurrence-free survival in women with a history of breast cancer. Journal of Clinical Oncology, American Society of Clinical Oncology, 23, 6631–6638.

    Article  Google Scholar 

  • Shi, J., Chen, X., Zhou, Y. (2015). The strong representation for the nonparametric estimation of length-biased and right-censored data. Statistics and Probability Letters, 104, 49–57.

  • Silverman, B. (1986). Density estimation for statistics and data analysis. London: Chapman&Hall.

    Book  MATH  Google Scholar 

  • Su, Y., Wang, J. (2012). Modeling left-truncated and right censored survival data with longitudinal covariates. Annals of Statistics, 40, 1465–1488.

  • Talamakrouni, M., Van Keilegom, I., El Ghouch, A. (2016). Parametrically guided nonparametric density and hazard estimation with censored data. Computational Statistics and Data Analysis, 93, 308–323.

  • Uzunogullari, U., Wang, J. (1992). A comparison of hazard rate estimators for left truncated and right censored data. Biometrika, 79, 297–310.

  • Wang, J.-L. (2005). Smoothing hazard rate. In P. Armitage, T. Colton (Eds.), Encyclopedia of biostatistics (2nd ed., Vol. 7, pp. 4486–4497). Chichester: Willey.

  • Wang, Y., Liang, B., Tong, X., Marder, K., Bressman, S., Orr-Urtreger, A., et al. (2015). Efficient estimation of nonparametric genetic risk function with censored data. Biometrika, 102, 515–532.

    Article  MathSciNet  MATH  Google Scholar 

  • Wu, S., Wells, M. (2003). Nonparametric estimation of hazard functions by wavelet methods. Journal of Nonparametric Statistics, 15, 187–203.

  • Zhang, F., Zhou, Y. (2013). Analyzing left-truncated and right-censored data under cox model with long-term survivors. Acta Mathematicae Applicatae Sinica, 29, 241–252.

Download references

Acknowledgements

The research is supported by NSF Grant DMS-1513461. Suggestions of the reviewers and the AE are appreciated.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sam Efromovich.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 1135 KB)

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Efromovich, S., Chu, J. Hazard rate estimation for left truncated and right censored data. Ann Inst Stat Math 70, 889–917 (2018). https://doi.org/10.1007/s10463-017-0617-x

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10463-017-0617-x

Keywords

Navigation