Abstract
This paper presents a novel class of semiparametric estimating functions for the additive model with right-censored data that are obtained from general biased-sampling. The new estimator can be obtained using a weighted estimating equation for the covariate coeffcients, by embedding the biased-sampling data into left-truncated and right-censored data. The asymptotic properties (consistency and asymptotic normality) of the proposed estimator are derived via the modern empirical processes theory. Based on the cumulative residual processes, we also propose graphical and numerical methods to assess the adequacy of the additive risk model. The good finite-sample performance of the proposed estimator is demonstrated by simulation studies and two applications of real datasets.
Similar content being viewed by others
References
Aalen O O. A linear regression model for the analysis of life times. Stat Med, 1989, 8: 907–925
Andersen P K, Borgan O, Gill R D, et al. Statistical Models Based on Counting Processes. New York: Springer Verlag, 1993
Asgharian M, M’Lan C, Wolfson D. Length-biased sampling with right censoring. J Amer Statist Assoc, 2002, 97: 201–209
Bilias Y, Gu M, Ying Z, et al. Towards a general asymptotic theory for Cox model with staggered entry. Ann Statist, 1997, 25: 662–682
Breslow N, Day N. Statistical methods in cancer research, volume II: The design and analysis of cohort studies. IARC Sci Publ, 1986, 82: 1–406
Chen H Y, Little R J. Proportional hazards regression with missing covariates. J Amer Statist Assoc, 1999, 94: 896–908
Chen Y Q. Semiparametric regression in size-biased sampling. Biometrics, 2010, 66: 149–158
Cox D R, Oakes D. Analysis of Survival Data. London: Chapman and Hall, 1984
Crowley J, Hu M. Covariance analysis of heart transplant survival data. J Amer Statist Assoc, 1977, 72: 27–36
Ghosh D. Proportional hazards regression for cancer studies. Biometrics, 2008, 64: 141–148
Gross S. Weighted estimation in linear regression for truncated survival data. Scand J Statist, 1996, 23: 179–193
Hao M, Song X, Sun, L. Reweighting estimators for the additive hazards model with missing covariates. Canad J Statist, 2014, 42: 285–307
Huang C Y, Qin J. Nonparametric estimation for length-biased and right-censored data. Biometrika, 2011, 98: 177–186
Huang C Y, Qin J. Semiparametric estimation for the additive hazards model with left-truncated and right-censored data. Biometrika, 2013, 100: 877–888
Huang C Y, Qin J, Follmann D A. A maximum pseudo-profile likelihood estimator for the Cox model under length-biased sampling. Biometrika, 2012, 99: 199–210
Kim J, Song M S, Lee S. Goodness-of-fit tests for the additive risk model with (p > 2)-dimensional time-invariant covariates. Lifetime Data Anal, 1998, 4: 405–416
Kim J P, Lu W, Sit T, et al. A unified approach to semiparametric transformation models under general biased sampling schemes. J Amer Statist Assoc, 2013, 108: 217–227
Kim J P, Sit T, Ying Z. Accelerated failure time model under general biased sampling scheme. Biostatistics, 2016, 17: 576–588
Kong L, Cai J. Case-cohort analysis with accelerated failure time model. Biometrics, 2009, 65: 135–142
Kong L, Cai J, Sen P K. Weighted estimating equations for semiparametric transformation models with censored data from a case-cohort design. Biometrika, 2004, 91: 305–319
Lai T, Ying Z. Estimating a distribution function with truncated and censored data. Ann Statist, 1991, 19: 417–442
Lai T, Ying Z. Rank regression methods for left-truncated and right-censored data. Ann Statist, 1991, 19: 531–556
Li Y, Ma H, Wang D, et al. Analyzing the general biased data by additive risk model. Sci China Math, 2017, 60: 685–700
Lin D Y, Wei L J, Yang I, et al. Semiparametric regression for the mean and rate functions of recurrent events. J R Stat Soc Ser B Stat Methodol, 2000, 62: 711–730
Lin D Y, Wei L J, Ying Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika, 1993, 80: 557–572
Lin D Y, Ying Z. Semiparametric analysis of the additive risk model. Biometrika, 1994, 81: 61–71
Lin W. Missing covariates and high-dimensional variable selection in additive hazards regression. PhD Thesis. Los Angeles: University of Southern California, 2011
Lu W, Tsiatis A A. Semiparametric transformation models for the case-cohort study. Biometrika, 2006, 93: 207–214
Lu X, Qi Y. Empirical likelihood for the additive risk model. Probab Math Statist, 2004, 24: 419–431
Luo X, Tsai W Y. Nonparametric estimation for right-censored length-biased data: A pseudo-partial likelihood ap-proach. Biometrika, 2009, 96: 873–886
Luo X, Tsai W Y, Xu Q. Pseudo-partial likelihood estimators for the Cox regression model with missing covariates. Biometrika, 2009, 96: 617–633
Ma H, Zhang F, Zhou Y. Composite estimating equation approach for additive risk model with length-biased and right-censored data. Statist Probab Lett, 2015, 96: 45–53
Miller R, Halpern J. Regression with censored data. Biometrika, 1982, 69: 521–531
Muttlak H, McDonald L. Ranked set sampling with size-biased probability of selection. Biometrics, 1990, 46: 435–445
Paik M C, Tsai W-Y. On using the Cox proportional hazards model with missing covariates. Biometrika, 1997, 84: 579–593
Pollard D. Empirical Processes: Theory and Applications. Hayward: Institute of Mathematics Statistics, 1990
Qi L, Wang C, Prentice R L. Weighted estimators for proportional hazards regression with missing covariates. J Amer Statist Assoc, 2005, 100: 1250–1263
Qin J, Ning J, Liu H, et al. Maximum likelihood estimations and EM algorithms with length-biased data. J Amer Statist Assoc, 2011, 106: 1434–1449
Qin J, Shen Y. Statistical methods for analyzing right-censored length-biased data under Cox model. Biometrics, 2010, 66: 382–392
Robins J M, Rotnitzky A, Zhao L P. Estimation of regression coeffcients when some regressors are not always observed. J Amer Statist Assoc, 1994, 89: 846–866
Rubin D B. Inference and missing data. Biometrika, 1976, 63: 581–592
Shen Y, Ning J, Qin J. Analyzing length-biased data with semiparametric transformation and accelerated failure time models. J Amer Statist Assoc, 2009, 104: 1192–1202
Shen Y, Ning J, Qin J. Nonparametric and semiparametric regression estimation for length-biased survival data. Lifetime Data Anal, 2017, 23: 3–24
Tsai W Y. Pseudo-partial likelihood for proportional hazards models with biased-sampling data. Biometrika, 2009, 96: 601–615
Tsai W Y, Jewell N P, Wang M C. A note on the product-limit estimator under right censoring and left truncation. Biometrika, 1987, 74: 883–886
Tsui K-L, Jewell N P, Wu C. A nonparametric approach to the truncated regression problem. J Amer Statist Assoc, 1988, 83: 785–792
Turnbull B W. The empirical distribution function with arbitrarily grouped, censored and truncated data. J R Stat Soc Ser B Stat Methodol, 1976, 38: 290–295
Van der Vaart A W. Asymptotic Statistics. Cambridge: Cambridge University Press, 1998
Vardi Y. Nonparametric estimation in the presence of length bias. Ann Statist, 1982, 10: 616–620
Vardi Y. Multiplicative censoring, renewal processes, deconvolution and decreasing density: Nonparametric estimation. Biometrika, 1989, 76: 751–761
Wang C, Chen H Y. Augmented inverse probability weighted estimator for Cox missing covariate regression. Biometrics, 2001, 57: 414–419
Wang M C. Hazards regression analysis for length-biased data. Biometrika, 1996, 83: 343–354
Wang M C, Brookmeyer R, Jewell N P. Statistical models for prevalent cohort data. Biometrics, 1993, 49: 1–11
Wang M C, Jewell N P, Tsai W Y. Asymptotic properties of the product limit estimate under random truncation. Ann Statist, 1986, 14: 1597–1605
Xu Q, Paik M C, Luo X, et al. Reweighting estimators for Cox regression with missing covariates. J Amer Statist Assoc, 2009, 104: 1155–1167
Yuen K, Burke M. A test of fit for a semiparametric additive risk model. Biometrika, 1997, 84: 631–639
Zhang F, Chen X, Zhou Y. Proportional hazards model with varying coeffcients for length-biased data. Lifetime Data Anal, 2014, 20: 132–157
Zhao M, Lin C, Zhou Y. Analyzing right-censored length-biased data with additive hazards model. Acta Math Appl Sin Engl Ser, 2017, 33: 893–908
Acknowledgements
Zhang’s work was supported by National Natural Science Foundation of China (Grant Nos. 11771133 and 11401194) and the Natural Science Foundation of Hunan Province of China (Grant No. 2017JJ3021). Zhao’s work was supported by National Natural Science Foundation of China (Grant No. 11771366). Zhou’s work was supported by the State Key Program of National Natural Science Foundation of China (Grant No. 71331006) and the State Key Program in the Major Research Plan of National Natural Science Foundation of China (Grant No. 91546202).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, F., Zhao, X. & Zhou, Y. An embedded estimating equation for the additive risk model with biased-sampling data. Sci. China Math. 61, 1495–1518 (2018). https://doi.org/10.1007/s11425-017-9268-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11425-017-9268-0