Skip to main content
Log in

Missing genetic information in case-control family data with general semi-parametric shared frailty model

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

Case-control family data are now widely used to examine the role of gene-environment interactions in the etiology of complex diseases. In these types of studies, exposure levels are obtained retrospectively and, frequently, information on most risk factors of interest is available on the probands but not on their relatives. In this work we consider correlated failure time data arising from population-based case-control family studies with missing genotypes of relatives. We present a new method for estimating the age-dependent marginalized hazard function. The proposed technique has two major advantages: (1) it is based on the pseudo full likelihood function rather than a pseudo composite likelihood function, which usually suffers from substantial efficiency loss; (2) the cumulative baseline hazard function is estimated using a two-stage estimator instead of an iterative process. We assess the performance of the proposed methodology with simulation studies, and illustrate its utility on a real data example.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Becher H, Schmidt S, Chang-Claude J (2003) Reproductive factors and familial predisposition for breast cancer by age 50 years. A case-control-family study for assessing main effects and possible gene-environment interaction. Int J Epidemiol 32: 38–48

    Article  Google Scholar 

  • Chatterjee N, Kalaylioglu Z, Shih JH, Gail MH (2006) Case-control and case-only designs with genotype and family history data: estimating relative risk, residual familial aggregation, and cumulative risk. Biometrics 62: 36–48

    Article  MATH  MathSciNet  Google Scholar 

  • Chen L, Hsu L, Malone K (2009) A frailty-model based approach to estimating the age dependent penetrance function of candidate genes using population based case-control study designs: an application to data on BRCA1 gene. Biometrics 65: 1105–1114

    Article  MATH  Google Scholar 

  • Clayton DG (1978) A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 65: 141–151

    Article  MATH  MathSciNet  Google Scholar 

  • Cox DR (1972) Regression models and life tables (with discussion). J R Stat Soc B 34: 187–220

    Google Scholar 

  • Duchateau L, Janssen P (2008) The frailty model. Springer, New York

    MATH  Google Scholar 

  • Fine JP, Glidden DV, Lee KE (2003) A simple estimator for a shared frailty regression model. J R Stat Soc 65: 317–329

    Article  MATH  MathSciNet  Google Scholar 

  • Genest C, MacKay J (1986) The joy of copulas: bivariate distributions with given marginals. Am Stat 40: 280–283

    Article  MathSciNet  Google Scholar 

  • Gill RD (1985) Discussion of the paper by D. Clayton and J. Cuzick. J R Stat Soc A 148: 108–109

    Google Scholar 

  • Gill RD (1989) Non- and semi-parametric maximum likelihood estimators and the von Mises method (Part 1). Scand J Stat 16: 97–128

    MATH  MathSciNet  Google Scholar 

  • Glidden DV (1999) Checking the adequacy of the gamma frailty model for multivariate failure times. Biometrika 86: 381–393

    Article  MATH  MathSciNet  Google Scholar 

  • Gorfine M, Zucker DM, Hsu L (2009) Case-control survival analysis with a general semiparametric shared frailty model—a pseudo full likelihood approach. Ann Stat 37: 1489–1517

    Article  MATH  MathSciNet  Google Scholar 

  • Henderson R, Oman P (1999) Effect of frailty on marginal regression estimates in survival analysis. J R Stat Soc B 61: 367–379

    Article  MATH  MathSciNet  Google Scholar 

  • Hopper JL (2003) Comentary: case-control-family design: a paradigm for future epidemiology reserach. Int J Epidemiol 32: 48–50

    Article  Google Scholar 

  • Hougaard P (1986) Survival models for heterogeneous populations derived from stable distributions. Biometrika 73: 387–396

    Article  MATH  MathSciNet  Google Scholar 

  • Hougaard P (2000) Analysis of multivariate survival data. Springer, New York

    MATH  Google Scholar 

  • Hsu L, Chen L, Gorfine M, Malone K (2004) Semiparametric estimation of marginal hazard function from casef́bcontrol family studies. Biometrics 60: 936–944

    Article  MathSciNet  Google Scholar 

  • Hsu L, Gorfine M (2006) Multivariate survival analysis for case-control family data. Biostatistics 7: 387–398

    Article  MATH  Google Scholar 

  • Hsu L, Gorfine M, Malone K (2007) On robustness of marginal regression coefficient estimates and hazard functions in multivariate survival analysis of family data when the frailty distribution is misspecified. Stat Med 26: 4657–4678

    Article  MathSciNet  Google Scholar 

  • Klein JP (1992) Semiparametric estimation of random effects using the Cox model based on the EM algorithm. Biometrics 48: 795–806

    Article  Google Scholar 

  • Kosorok MR, Lee BL, Fine JP (2004) Robust inference for univariate proportional hazards frailty regression models. Ann Stat 32: 1448–1491

    Article  MATH  MathSciNet  Google Scholar 

  • Malone KE, Daling JR, Neal C, Suter NM, O’Brien C, Cushing-Haugen K, Jonasdottir TJ, Thompson JD, Ostrander EA (2000) Frequency of BRCA1/BRCA2 mutations in a population-based sample of young breast carcinoma cases. Cancer 88: 1393–1402

    Article  Google Scholar 

  • Malone KE, Daling JR, Doody DR, Hsu L, Bernstein L, Coates RJ, Marchbanks PA, Simon MS, McDonald JA, Norman SA, Strom BL, Burkman RT, Ursin G, Deapen D, Weiss LK, Folger S, Madeoy JJ, Friedrichsen DM, Suter NM, Humphrey MC, Spirtas R, Ostrander EA (2006) Prevalence and predictors of BRCA1 and BRCA2 mutations in a population-based study of breast cancer in white and black American women ages 35 to 64 years. Cancer Res 66: 8297–8308

    Article  Google Scholar 

  • Marchbanks PA et al (2002) The NICHD women’s contracetive and reproductive experiences study: methods and results. Ann Epidemiol 26: 213–221

    Article  Google Scholar 

  • Marshall AW, Olkin I (1988) Families of multivariate distributions. J Am Stat Assoc 83: 834–841

    Article  MATH  MathSciNet  Google Scholar 

  • McGilchrist CA (1993) REML estimation for survival models with frailty. Biometrics 49: 221–225

    Article  Google Scholar 

  • Nielsen GG, Gill RD, Andersen PK, Sørensen TIA (1992) A counting process approach to maximum likelihood estimation in frailty models. Scand J Stat 19: 25–43

    MATH  Google Scholar 

  • Oakes D (1989) Bivariate survival models induced by frailties. J Am Stat Assoc 84: 487–493

    Article  MATH  MathSciNet  Google Scholar 

  • Ripatti S, Palmgren J (2000) Estimation of multivariate frailty models using penalized partial likelihood. Biometrics 56: 1016–1022

    Article  MATH  MathSciNet  Google Scholar 

  • Shih JH, Chatterjee N (2002) Analysis of survival data from case-control family studies. Biometrics 58: 502–509

    Article  MathSciNet  Google Scholar 

  • Shih JH, Louis TA (1995) Inferences on the association parameter in copula models for bivariate survival data. Biometrics 51: 1384–1399

    Article  MATH  MathSciNet  Google Scholar 

  • Vaida F, Xu RH (2000) Proportional hazards model with random effects. Stat Med 19: 3309–3324

    Article  Google Scholar 

  • Zeger SL, Liang K-Y, Albert PS (1988) Models for longitudinal data: a generalized estimating equation approach. Biometrics 44: 1049–1060

    Article  MATH  MathSciNet  Google Scholar 

  • Zhao LP, Hsu L, Holte S, Chen Y, Quiaoit F, Prentice RL (1998) Combined association and aggregation analysis of data from case-control family studies. Biometrika 85: 299–315

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Malka Gorfine.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Graber-Naidich, A., Gorfine, M., Malone, K.E. et al. Missing genetic information in case-control family data with general semi-parametric shared frailty model. Lifetime Data Anal 17, 175–194 (2011). https://doi.org/10.1007/s10985-010-9178-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-010-9178-5

Keywords

Navigation