Skip to main content
Log in

Estimation of the cumulative incidence function under multiple dependent and independent censoring mechanisms

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

Competing risks occur in a time-to-event analysis in which a patient can experience one of several types of events. Traditional methods for handling competing risks data presuppose one censoring process, which is assumed to be independent. In a controlled clinical trial, censoring can occur for several reasons: some independent, others dependent. We propose an estimator of the cumulative incidence function in the presence of both independent and dependent censoring mechanisms. We rely on semi-parametric theory to derive an augmented inverse probability of censoring weighted (AIPCW) estimator. We demonstrate the efficiency gained when using the AIPCW estimator compared to a non-augmented estimator via simulations. We then apply our method to evaluate the safety and efficacy of three anti-HIV regimens in a randomized trial conducted by the AIDS Clinical Trial Group, ACTG A5095.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Andersen PK, Borgan Ø, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer series in statistics. Springer, New York

    MATH  Google Scholar 

  • Andersen JW, Fass R, van der Horst C (2007) Factors associated with early study discontinuation in AACTG studies, DACS 200. Contemp Clin Trials 28:583–592

    Article  Google Scholar 

  • Arici C, Ripamonti D, Maggiolo F, Rizzi M, Finazzi MG, Pezzotti P, Suter F (2002) Factors associated with the failure of HIV-positive persons to return for scheduled medical visits. HIV Clin Trials 3(1):52–57

    Article  Google Scholar 

  • Bender R, Augustin T, Blettner M (2005) Generating survival times to simulate cox proportional hazards models. Stat Med 24(11):1713–1723

    Article  MathSciNet  Google Scholar 

  • Bryant J, Dignam JJ (2004) Semiparametric models for cumulative incidence functions. Biometrics 60(1):182–190

    Article  MathSciNet  MATH  Google Scholar 

  • Dudley J, Jin S, Hoover D, Metz S, Thackeray R, Chmiel J (1995) The multicenter AIDS cohort study: retention after 9 1/2 years. Am J Epidemiol 142(3):323–330

    Article  Google Scholar 

  • Fine JP (2001) Regression modeling of competing crude failure probabilities. Biostatistics 2(1):85–97

    Article  MATH  Google Scholar 

  • Fine JP, Gray RJ (1999) A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc 94(446):496–497

    Article  MathSciNet  MATH  Google Scholar 

  • Fleishman JA, Yehia BR, Moore RD, Korthuis PT, Gebo KA, HIV Reseach Network (2012) Establishment, retention, and loss to follow-up in outpatient HIV care. J Acquir Immune Defic Syndr 60(3):249–259

  • Fleming T, Harrington D (1991) Counting processes and survival analysis. Wiley, New York

    MATH  Google Scholar 

  • Gray RJ (1988) A class of K-sample tests for comparing the cumulative incidence of a competing risk. Ann Stat 16(3):1141–1154

    Article  MathSciNet  MATH  Google Scholar 

  • Gulick R, Ribaudo H, Shikuma C, Lustgarten S, Squires K, Meyer W III, Acosta E, Schackman B, Pilcher C, Murphy R et al (2004) Triple-nucleoside regimens versus efavirenz-containing regimens for the initial treatment of HIV-1 infection. N Engl J Med 350(18):1850–1861

    Article  Google Scholar 

  • Gulick R, Ribaudo H, Shikuma C, Lalama C, Schackman B, Meyer W III, Acosta E, Schouten J, Squires K, Pilcher C et al (2006) Three-vs four-drug antiretroviral regimens for the initial treatment of HIV-1 infection. JAMA 296(7):769–781

    Article  Google Scholar 

  • Heitjan DF, Rubin DB (1991) Ignorability and coarse data. Ann Stat 19(4):2244–2253

    Article  MathSciNet  MATH  Google Scholar 

  • Ioannidis JPA, Bassett R, Hughes MD, Volberding PA, Sacks HS, Lau J (1997) Predictors and impact of patients lost to follow-up in a long-term randomized trial of immediate versus deferred antiretroviral treatment. J Acquir Immune Defic Syndr Hum Retrovirol 16(1):22–30

    Article  Google Scholar 

  • Jeong JH, Fine J (2006) Direct parametric inference for the cumulative incidence function. J R Stat Soc Ser C (Appl Stat) 55(2):187–200

    Article  MathSciNet  MATH  Google Scholar 

  • Kalbfleisch JD, Prentice RL (1980) The statistical analysis of failure time data. Wiley, New York

    MATH  Google Scholar 

  • Krishnan S, Wu K, Smurzynski M, Bosch RJ, Benson CA, Collier AC, Klebert MK, Feinberg J, Koletar SL, ALLRT, A5001 team (2010) Incidence rate of and factors associated with loss to follow-up in a longitudinal cohort of antiretroviral-treated HIV-infected persons: an AIDS Clinical Trials Group (ACTG) Longitudinal Linked Randomized Trials (ALLRT) analysis. HIV Clin Trials 12(4):190–200

  • Lanoy E, Mary-Krause M, Tattevin P, Dray-Spira R, Duvidier C, Fischer P, Obadia Y, Lert F, Costagliola D, the Clinical Epidemiology Group of the French Hospital Database on HIV infection (2006) Predicators identified for losses to follow-up among HIV-seropositive patients. J Clin Epidemiol 59:829–835

  • Lin DY (1997) Non-parametric inference for cumulative incidence functions in competing risks studies. Stat Med 16(8):901–910

    Article  Google Scholar 

  • Lok JJ, Hughes MD (2016) Evaluating predictors of competing risk outcomes when censoring depends on time-dependent covariates, with application to safety and efficacy of HIV treatment. Stat Med 35(13):2183–2194

    Article  MathSciNet  Google Scholar 

  • Pintilie M (2006) Competing risks: a practical perspective. Wiley, New York

    Book  MATH  Google Scholar 

  • Robins JM, Rotnitzky A, Zhao LP (1995) Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. J Am Stat Assoc 90(429):106–121

    Article  MathSciNet  MATH  Google Scholar 

  • Robins J, Rotnitzky A (1992) Recovery of information and adjustment for dependent censoring using surrogate markers. In: Jewell NP, Dietz K, Farewell VT (eds) AIDS Epidemiology (Methodological issues), Birkhäuser Boston, pp. 297–331

  • Rotnitzky A, Farall A, Bergesio A, Scharfstein D (2007) Analysis of failure time data under competing censoring mechanisms. J R Stat Soc Ser B (Stat Methodol) 69(3):307–327

    Article  MathSciNet  Google Scholar 

  • Rotnitzky A, Bergesio A, Farall A (2009) Analysis of quality-of-life adjusted failure time data in the presence of competing, possibly informative, censoring mechanisms. Lifetime Data Anal 15(1):1–23

    Article  MathSciNet  MATH  Google Scholar 

  • Rotnitzky A, Robins JM (2005) Inverse probability weighted estimation in survival analysis. In: Armitage P, Coulton T (eds) Encyclopedia of biostatistics, 2nd edn. Wiley, New York

    Google Scholar 

  • Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592

    Article  MathSciNet  MATH  Google Scholar 

  • Tsiatis AA (2006) Semiparametric theory and missing data. Springer, Berlin

    MATH  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Andrea Rotnitzky for her review and valuable comments on this paper. We are grateful to the ACTG for providing data used in the motivating application. This work was partially supported by Grants AI024643, AI068634, and AI007358 from the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Judith J. Lok.

Ethics declarations

Ethical approval

Analysis of data from ACTG A5095 was approved by the Institutional Review Board of the Harvard School of Public Health. Informed consent was obtained from all individual participants included in the study.

Appendix

Appendix

Equation (3) is an unbiased estimating equation for \(F_{j}(t)\) since

$$\begin{aligned}&{\mathbf {E}\left( \frac{\tilde{\varDelta }}{\pi (\tilde{T}|\bar{V}_{\tilde{T}};\varLambda _{0})}\left\{ \mathbf {1}(T\le t, J=j)-{F}_{j}(t)\right\} \right) }\\&\quad = \mathbf {E} \left( \mathbf {E}\left\{ \frac{\tilde{\varDelta }}{\pi (\tilde{T}|\bar{V}_{\tilde{T}};\varLambda _{0})}\left\{ \mathbf {1}(T\le t, J=j)-{F}_{j}(t)\right\} | \bar{V}_{T},T \right\} \right) \\&\quad = \mathbf {E}\left( \frac{\mathbf {E}[\tilde{\varDelta } | \bar{V}_{T},T ]}{\pi (\tilde{T}|\bar{V}_{\tilde{T}};\varLambda _{0})} \left\{ \mathbf {1}(T\le t, J=j)-{F}_{j}(t)\right\} \right) \\&\quad = \mathbf {E}\left( 1 \cdot \left\{ \mathbf {1}(T\le t, J=j)-{F}_{j}(t)\right\} \right) \\&\quad = 0. \end{aligned}$$

The third equality follows from the fact that \(\mathbf {E}\left( \tilde{\varDelta } | \bar{V}_{T},T \right) = Pr[\tilde{\varDelta } = 1 | \bar{V}_{T}, T]\) which under our coarsening at random assumption equals \( \pi (\tilde{T}|\bar{V}_{\tilde{T}};\varLambda _{0})\). From this it is clear that Eq. (3) is an unbiased estimating equation for \(F_{j}(t)\).

One can use the results in Robins and Rotnitzky (1992) to prove that with one type of censoring the solution to

$$\begin{aligned} \sum _{i=1}^{n}\bigg \{\frac{\tilde{\varDelta }_{i}}{\pi _{i}(\tilde{T}_{i}|\bar{V}_{\tilde{T}_{i}};\varLambda _{0})}\left\{ \mathbf {1}(T_{i}\le t, J_{i}=j)-{F}_{j}(t)\right\} - A_{i}\{F_{j}(t),\gamma ,b(\cdot )\} \bigg \}=0 \end{aligned}$$
(16)

with

$$\begin{aligned} A_{i}\{F_{j}(t),\gamma ,b(\cdot )\} \equiv \int \frac{b(u,\bar{V}_{u})}{\pi (u\text {-}|\bar{V}_{u};\varLambda _{0}(u))} dM_{C}(u) \end{aligned}$$
(17)

and \(b(u,\bar{V}_{u})\) the same as in Eq. (9), is a doubly robust, locally efficient estimator for \(F_{j}(t)\). Here, \(dM_{C}(u) = dN_{C}(u) - d\varLambda (u)\). In our situation, with multiple types of censoring, it is easy to show that \(dM_{C}(u) = \sum _{r=1}^{r*}dM_{C,r}(u)\), which leads to our augmentation term in (8). Note that with one type of failure and one type of censoring, our method reduces to that of (Rotnitzky and Robins 2005).

Now, it can be shown that for each r, since

$$\begin{aligned} \frac{b(u,\bar{V}_{u})}{\pi (u\text {-}|\bar{V}_{u};\varLambda _{0}(u))} \end{aligned}$$

is a bounded and predictable process, defined on the same filtration as \(M_{C,r}(u)\),

$$\begin{aligned} \int \frac{b(u,\bar{V}_{u})}{\pi (u\text {-}|\bar{V}_{u};\varLambda _{0}(u))} dM_{C,r}(u) \end{aligned}$$

is a mean zero martingale Fleming and Harrington (1991, Thm 1.5.1). Note that the left-continuous versions of \(b(u,\bar{V}_{u})\) and \(\pi (u|\bar{V}_{u};\varLambda _{0}(u))\) are needed here. As a result

$$\begin{aligned} \sum _{r=1}^{r^{*}}\int \frac{b(u,\bar{V}_{u})}{\pi (u\text {-}|\bar{V}_{u};\varLambda _{0}(u))} dM_{C,r}(u) \end{aligned}$$

is also a mean zero martingale. Thus, \(A_{i}\{F_{j}(t),\gamma ,b(\cdot )\}\) has mean zero. Since \(A_{i}\{F_{j}(t),\gamma ,b(\cdot )\}\) has mean zero, it also follows trivially that Eq. (7) is an unbiased estimating equation for \(F_{j}(t)\).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lok, J.J., Yang, S., Sharkey, B. et al. Estimation of the cumulative incidence function under multiple dependent and independent censoring mechanisms. Lifetime Data Anal 24, 201–223 (2018). https://doi.org/10.1007/s10985-017-9393-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-017-9393-4

Keywords

Navigation