Linear regression analysis of survival data with missing censoring indicators

Wang, Qihua; Dinse, Gregg E.

doi:10.1007/s10985-010-9175-8

Published: 18 June 2010

Lifetime Data Analysis, Volume 17, pages 256–279 (2011)

Abstract

Linear regression analysis has been studied extensively in a random censorship setting, but typically all of the censoring indicators are assumed to be observed. In this paper, we develop synthetic data methods for estimating regression parameters in a linear model when some censoring indicators are missing. We define estimators based on regression calibration, imputation, and inverse probability weighting techniques, and we prove all three estimators are asymptotically normal. The finite-sample performance of each estimator is evaluated via simulation. We illustrate our methods by assessing the effects of sex and age on the time to non-ambulatory progression for patients in a brain cancer clinical trial.
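
The idea of combining a synthetic response with inverse probability weighting can be sketched in a few lines of Python. This is a toy illustration under strong assumptions, not the authors' estimator: the censoring distribution and the probability pi that a censoring indicator is observed are both treated as known, the indicators are missing completely at random, and the synthetic response follows the Koul, Susarla and van Ryzin (1981) form delta * Z / P(C >= Z). All function names, parameter values, and the simulation design are hypothetical.

```python
import math
import random


def simulate(n, seed=0, pi=0.7, rate=0.1):
    """Simulate (x, z, delta, xi) from a toy model with T = 1 + 2*x + noise."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.random()                          # covariate, Uniform(0, 1)
        t = 1.0 + 2.0 * x + (rng.random() - 0.5)  # true survival time
        c = rng.expovariate(rate)                 # censoring time, independent of (x, t)
        z = min(t, c)                             # observed time
        delta = 1 if t <= c else 0                # censoring indicator (1 = uncensored)
        xi = 1 if rng.random() < pi else 0        # 1 if delta is observed (assumed MCAR)
        rows.append((x, z, delta, xi))
    return rows


def ipw_synthetic_response(z, delta, xi, pi, rate):
    """Synthetic response with the indicator replaced by xi * delta / pi."""
    if xi == 0:
        return 0.0                    # delta is missing and is never used
    surv_c = math.exp(-rate * z)      # P(C >= z), assumed known in this sketch
    return delta * z / (pi * surv_c)


def least_squares(xs, ys):
    """Ordinary least squares for one covariate; returns (intercept, slope)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def fit(n=50000, seed=0, pi=0.7, rate=0.1):
    """Regress the IPW synthetic response on x for one simulated data set."""
    rows = simulate(n, seed=seed, pi=pi, rate=rate)
    xs = [row[0] for row in rows]
    ys = [ipw_synthetic_response(z, d, xi, pi, rate) for _, z, d, xi in rows]
    return least_squares(xs, ys)


intercept, slope = fit()
print("intercept:", round(intercept, 3), "slope:", round(slope, 3))
```

Because the synthetic response has the same conditional mean as the survival time under these assumptions, the fitted coefficients should land near the simulated values (intercept 1, slope 2). In practice the censoring distribution and the missingness probability would have to be estimated from the data.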

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Yunnan University, Kunming, 650091, China
Qihua Wang
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
Qihua Wang
Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, 27709, USA
Gregg E. Dinse

Corresponding author

Correspondence to Gregg E. Dinse.

About this article

Cite this article

Wang, Q., Dinse, G.E. Linear regression analysis of survival data with missing censoring indicators. Lifetime Data Anal 17, 256–279 (2011). https://doi.org/10.1007/s10985-010-9175-8

Received: 05 November 2008
Accepted: 02 June 2010
Published: 18 June 2010
Issue Date: April 2011
DOI: https://doi.org/10.1007/s10985-010-9175-8
