Skip to main content
Log in

Using the EM-algorithm for survival data with incomplete categorical covariates

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

Incomplete covariate data is a common occurrence in many studies in which the outcome is survival time. With generalized linear models, when the missing covariates are categorical, a useful technique for obtaining parameter estimates is the EM by the method of weights proposed in Ibrahim (1990). In this article, we extend the EM by the method of weights to survival outcomes whose distributions may not fall in the class of generalized linear models. This method requires the estimation of the parameters of the distribution of the covariates. We present a clinical trials example with five covariates, four of which have some missing values.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. A. Agresti, Categorical Data Analysis, Wiley: New York, 1990.

    Google Scholar 

  2. G. Falkson, A. Cnaan and I. W. Simson, “A randomized Phase II study of acivicin and 4'Deoxydoxorubicin in patients with hepatocellular carcinoma in an Eastern Cooperative Oncology Group Study,” American Journal of Clinical Oncology vol. 13, pp. 510–515, 1990.

    Google Scholar 

  3. G. Falkson, S. Lipsitz, E. Borden, I. W. Simson, and D. Haller, “A ECOG randomized Phase II study of beta Interferon and Menogoril,” American Journal of Clinical Oncology vol. 18, pp. 287–292, 1994.

    Google Scholar 

  4. J. Ibrahim, “Incomplete data in generalized linear models,” Journal of the American Statistical Association, vol. 85, pp. 765–769, 1990.

    Google Scholar 

  5. J. Ibrahim and S. Weisberg, “Incomplete data in generalized linear models with continuous covariates,” Australian Journal of Statistics vol. 34, pp. 461–470, 1992.

    Google Scholar 

  6. J. F. Lawless, Statistical models and methods for lifetime data. Wiley: New York. 1982.

    Google Scholar 

  7. R. Little and M. Schluchter, “Maximum likelihood estimation for mixed continuous and categorical data with missing values,” Biometrika vol. 72, pp. 497–512, 1985.

    Google Scholar 

  8. T. Louis, “Finding the observed information matrix when using the EM algorithm,” JRSS-B vol. 44, pp. 226–233, 1982.

    Google Scholar 

  9. I. Meilijson, “A fast improvement to the EM algorithm on its own terms,” JRSS-B vol. 51, pp. 127–138, 1989.

    Google Scholar 

  10. M. Pugh, D. Harrington, S. Lipsitz and J. M. Robins, “A Simulation Study of an Estimator of Relative Risk in the Cox Model with Missing Covariate Data.” Division of Biostatistics, Dana-Farber Cancer Institute, Boston, MA, Technical Report No. 777Z, 1993.

    Google Scholar 

  11. J. M. Robins, A. Rotnitsky and L. P. Zhao, “Estimation of regression coefficients when some regressors are not always observed,” JASA vol. 89, pp. 846–866, 1994.

    Google Scholar 

  12. M. Schluchter and K. Jackson, “Log-linear analysis of censored survival data with partially observed covariates,” Journal of the American Statistical Association vol. 84, pp. 42–52, 1989.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lipsitz, S.R., Ibrahim, J.G. Using the EM-algorithm for survival data with incomplete categorical covariates. Lifetime Data Anal 2, 5–14 (1996). https://doi.org/10.1007/BF00128467

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00128467

Keywords

Navigation