Missing genetic information in case-control family data with general semi-parametric shared frailty model

Graber-Naidich, Anna; Gorfine, Malka; Malone, Kathleen E.; Hsu, Li

doi:10.1007/s10985-010-9178-5

Missing genetic information in case-control family data with general semi-parametric shared frailty model

Published: 12 December 2010

Volume 17, pages 175–194, (2011)
Cite this article

Lifetime Data Analysis Aims and scope Submit manuscript

Anna Graber-Naidich¹,
Malka Gorfine¹,
Kathleen E. Malone² &
…
Li Hsu²

125 Accesses
4 Citations
Explore all metrics

Abstract

Case-control family data are now widely used to examine the role of gene-environment interactions in the etiology of complex diseases. In these types of studies, exposure levels are obtained retrospectively and, frequently, information on most risk factors of interest is available on the probands but not on their relatives. In this work we consider correlated failure time data arising from population-based case-control family studies with missing genotypes of relatives. We present a new method for estimating the age-dependent marginalized hazard function. The proposed technique has two major advantages: (1) it is based on the pseudo full likelihood function rather than a pseudo composite likelihood function, which usually suffers from substantial efficiency loss; (2) the cumulative baseline hazard function is estimated using a two-stage estimator instead of an iterative process. We assess the performance of the proposed methodology with simulation studies, and illustrate its utility on a real data example.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust Identification of Gene-Environment Interactions Under High-Dimensional Accelerated Failure Time Models

Generalized mean residual life models for case-cohort and nested case-control studies

Article 11 June 2020

Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks

Article Open access 16 September 2022

References

Becher H, Schmidt S, Chang-Claude J (2003) Reproductive factors and familial predisposition for breast cancer by age 50 years. A case-control-family study for assessing main effects and possible gene-environment interaction. Int J Epidemiol 32: 38–48
Article Google Scholar
Chatterjee N, Kalaylioglu Z, Shih JH, Gail MH (2006) Case-control and case-only designs with genotype and family history data: estimating relative risk, residual familial aggregation, and cumulative risk. Biometrics 62: 36–48
Article MATH MathSciNet Google Scholar
Chen L, Hsu L, Malone K (2009) A frailty-model based approach to estimating the age dependent penetrance function of candidate genes using population based case-control study designs: an application to data on BRCA1 gene. Biometrics 65: 1105–1114
Article MATH Google Scholar
Clayton DG (1978) A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 65: 141–151
Article MATH MathSciNet Google Scholar
Cox DR (1972) Regression models and life tables (with discussion). J R Stat Soc B 34: 187–220
Google Scholar
Duchateau L, Janssen P (2008) The frailty model. Springer, New York
MATH Google Scholar
Fine JP, Glidden DV, Lee KE (2003) A simple estimator for a shared frailty regression model. J R Stat Soc 65: 317–329
Article MATH MathSciNet Google Scholar
Genest C, MacKay J (1986) The joy of copulas: bivariate distributions with given marginals. Am Stat 40: 280–283
Article MathSciNet Google Scholar
Gill RD (1985) Discussion of the paper by D. Clayton and J. Cuzick. J R Stat Soc A 148: 108–109
Google Scholar
Gill RD (1989) Non- and semi-parametric maximum likelihood estimators and the von Mises method (Part 1). Scand J Stat 16: 97–128
MATH MathSciNet Google Scholar
Glidden DV (1999) Checking the adequacy of the gamma frailty model for multivariate failure times. Biometrika 86: 381–393
Article MATH MathSciNet Google Scholar
Gorfine M, Zucker DM, Hsu L (2009) Case-control survival analysis with a general semiparametric shared frailty model—a pseudo full likelihood approach. Ann Stat 37: 1489–1517
Article MATH MathSciNet Google Scholar
Henderson R, Oman P (1999) Effect of frailty on marginal regression estimates in survival analysis. J R Stat Soc B 61: 367–379
Article MATH MathSciNet Google Scholar
Hopper JL (2003) Comentary: case-control-family design: a paradigm for future epidemiology reserach. Int J Epidemiol 32: 48–50
Article Google Scholar
Hougaard P (1986) Survival models for heterogeneous populations derived from stable distributions. Biometrika 73: 387–396
Article MATH MathSciNet Google Scholar
Hougaard P (2000) Analysis of multivariate survival data. Springer, New York
MATH Google Scholar
Hsu L, Chen L, Gorfine M, Malone K (2004) Semiparametric estimation of marginal hazard function from casef́bcontrol family studies. Biometrics 60: 936–944
Article MathSciNet Google Scholar
Hsu L, Gorfine M (2006) Multivariate survival analysis for case-control family data. Biostatistics 7: 387–398
Article MATH Google Scholar
Hsu L, Gorfine M, Malone K (2007) On robustness of marginal regression coefficient estimates and hazard functions in multivariate survival analysis of family data when the frailty distribution is misspecified. Stat Med 26: 4657–4678
Article MathSciNet Google Scholar
Klein JP (1992) Semiparametric estimation of random effects using the Cox model based on the EM algorithm. Biometrics 48: 795–806
Article Google Scholar
Kosorok MR, Lee BL, Fine JP (2004) Robust inference for univariate proportional hazards frailty regression models. Ann Stat 32: 1448–1491
Article MATH MathSciNet Google Scholar
Malone KE, Daling JR, Neal C, Suter NM, O’Brien C, Cushing-Haugen K, Jonasdottir TJ, Thompson JD, Ostrander EA (2000) Frequency of BRCA1/BRCA2 mutations in a population-based sample of young breast carcinoma cases. Cancer 88: 1393–1402
Article Google Scholar
Malone KE, Daling JR, Doody DR, Hsu L, Bernstein L, Coates RJ, Marchbanks PA, Simon MS, McDonald JA, Norman SA, Strom BL, Burkman RT, Ursin G, Deapen D, Weiss LK, Folger S, Madeoy JJ, Friedrichsen DM, Suter NM, Humphrey MC, Spirtas R, Ostrander EA (2006) Prevalence and predictors of BRCA1 and BRCA2 mutations in a population-based study of breast cancer in white and black American women ages 35 to 64 years. Cancer Res 66: 8297–8308
Article Google Scholar
Marchbanks PA et al (2002) The NICHD women’s contracetive and reproductive experiences study: methods and results. Ann Epidemiol 26: 213–221
Article Google Scholar
Marshall AW, Olkin I (1988) Families of multivariate distributions. J Am Stat Assoc 83: 834–841
Article MATH MathSciNet Google Scholar
McGilchrist CA (1993) REML estimation for survival models with frailty. Biometrics 49: 221–225
Article Google Scholar
Nielsen GG, Gill RD, Andersen PK, Sørensen TIA (1992) A counting process approach to maximum likelihood estimation in frailty models. Scand J Stat 19: 25–43
MATH Google Scholar
Oakes D (1989) Bivariate survival models induced by frailties. J Am Stat Assoc 84: 487–493
Article MATH MathSciNet Google Scholar
Ripatti S, Palmgren J (2000) Estimation of multivariate frailty models using penalized partial likelihood. Biometrics 56: 1016–1022
Article MATH MathSciNet Google Scholar
Shih JH, Chatterjee N (2002) Analysis of survival data from case-control family studies. Biometrics 58: 502–509
Article MathSciNet Google Scholar
Shih JH, Louis TA (1995) Inferences on the association parameter in copula models for bivariate survival data. Biometrics 51: 1384–1399
Article MATH MathSciNet Google Scholar
Vaida F, Xu RH (2000) Proportional hazards model with random effects. Stat Med 19: 3309–3324
Article Google Scholar
Zeger SL, Liang K-Y, Albert PS (1988) Models for longitudinal data: a generalized estimating equation approach. Biometrics 44: 1049–1060
Article MATH MathSciNet Google Scholar
Zhao LP, Hsu L, Holte S, Chen Y, Quiaoit F, Prentice RL (1998) Combined association and aggregation analysis of data from case-control family studies. Biometrika 85: 299–315
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Industrial Engineering and Management, Technion City, Haifa, 32000, Israel
Anna Graber-Naidich & Malka Gorfine
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109-1024, USA
Kathleen E. Malone & Li Hsu

Authors

Anna Graber-Naidich
View author publications
You can also search for this author in PubMed Google Scholar
Malka Gorfine
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen E. Malone
View author publications
You can also search for this author in PubMed Google Scholar
Li Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Malka Gorfine.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Graber-Naidich, A., Gorfine, M., Malone, K.E. et al. Missing genetic information in case-control family data with general semi-parametric shared frailty model. Lifetime Data Anal 17, 175–194 (2011). https://doi.org/10.1007/s10985-010-9178-5

Download citation

Received: 25 June 2009
Accepted: 15 June 2010
Published: 12 December 2010
Issue Date: April 2011
DOI: https://doi.org/10.1007/s10985-010-9178-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Missing genetic information in case-control family data with general semi-parametric shared frailty model

Abstract

Access this article

Similar content being viewed by others

Robust Identification of Gene-Environment Interactions Under High-Dimensional Accelerated Failure Time Models

Generalized mean residual life models for case-cohort and nested case-control studies

Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Missing genetic information in case-control family data with general semi-parametric shared frailty model

Abstract

Access this article

Similar content being viewed by others

Robust Identification of Gene-Environment Interactions Under High-Dimensional Accelerated Failure Time Models

Generalized mean residual life models for case-cohort and nested case-control studies

Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation