Skip to main content

Advertisement

Log in

Prevalent cohort studies and unobserved heterogeneity

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

Consider lifetimes originating at a series of calendar times \( t_{1} ,t_{2} , \ldots \). At a certain time \( t_{0} \) a cross-sectional sample is taken, generating a sample of current durations (backward recurrence times) of survivors until \( t_{0} \) and a prevalent cohort study consisting of survival times left-truncated at the current durations. A Lexis diagram is helpful in visualizing this situation. Survival analysis based on current durations and prevalent cohort studies is now well-established as long as all covariates are observed. The general problems with unobserved covariates have been well understood for ordinary prospective follow-up studies, with the good help of hazard rate models incorporating frailties: as for ordinary regression models, the added noise generates attenuation in the regression parameter estimates. For prevalent cohort studies this attenuation remains, but in addition one needs to take account of the differential selection of the survivors from initiation \( t_{i} \) to cross-sectional sampling at \( t_{0} \). This paper intends to survey the recent development of these matters and the consequences for routine use of hazard rate models or accelerated failure time models in the many cases where unobserved heterogeneity may be an issue. The study was inspired by concrete problems in the study of time-to-pregnancy, and we present various simulation results inspired by this particular application.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Redrawn from Keiding (2006)

Fig. 2

Redrawn from Keiding (2006)

Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. Hald (1949).

References

  • Aalen OO (1994) Effects of frailty in survival analysis. Stat Methods Med Res 3:227–243

    Article  Google Scholar 

  • Aalen OO, Borgan Ø, Gjessing HK (2008) Survival and event history analysis: a process point of view. Springer, New York

    Book  Google Scholar 

  • Aalen OO, Cook RJ, Røysland K (2015a) Does Cox analysis of a randomized survival study yield a causal treatment effect? Lifetime Data Anal 21:579–593

    Article  MathSciNet  Google Scholar 

  • Aalen OO, Valberg M, Grotmol T, Tretli S (2015b) Understanding variation in disease risk: the elusive concept of frailty (with discussion). Int J Epidemiol 44:1408–1421

    Article  Google Scholar 

  • Ali MM, Marshall T, Babiker AG (2001) Analysis of incomplete durations with application to contraceptive use. J R Stat Soc A 164:549–563

    Article  MathSciNet  Google Scholar 

  • Andersen PK, Borgan Ø, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer, New York, p 767

    Book  Google Scholar 

  • Anderson JE, Louis TA (1995) Survival analysis using a scale change random effects model. J Am Stat Assoc 90:669–679

    Article  MathSciNet  Google Scholar 

  • Anonymous [W. Dale] (1777). A Supplement to Calculations of the Value of Annuities, Published for the Use of Societies Instituted for Benefit of Age Containing Various Illustration of the Doctrine of Annuities, and Compleat Tables of the Value of 1£. Immediate Annuity. (Being the Only Ones Extant by Half-Yearly Interest and Payments). Together with Investigations of the State of the Laudable Society of Annuitants; Showing What Annuity Each Member Hath Purchased, and Real Mortality Therein, from its Institution Compared with Dr. Halley’s Table. Also Several publications, Letters, and Anecdotes Relative to that Society. And Explanatory of Proceedings to the Present year. London: Ridley

  • Cox DR (1972) Regression models and life tables (with discussion). J R Stat Soc B 34:187–220

    MATH  Google Scholar 

  • Duron S, Slama R, Ducot B, Bohet A, Sørensen DN, Keiding N, Moreau C, Bouyer J (2013) Cumulative incidence rate of medical consultation for fertility problems—analysis of a prevalent cohort using competing risks. Hum Reprod 28:2872–2879

    Article  Google Scholar 

  • Eijkemans MJC, Leridon H, Keiding N, Slama R (2019) A systematic comparison of designs to study human fecundity. Epidemiology 30:120–129

    Article  Google Scholar 

  • Farewell VT, Prentice RL (1977) A study of distributional shape in life testing. Technometrics 19:69–75

    MATH  Google Scholar 

  • Hald A (1949) Maximum likelihood estimation of the parameters of a normal distribution which is truncated at a known point. Skandinavisk Aktuarietidskrift 32:119–134

    MathSciNet  MATH  Google Scholar 

  • Henderson R, Oman P (1999) Effect of frailty on marginal regression estimates in survival analysis. J R Stat Soc B 61:367–379

    Article  MathSciNet  Google Scholar 

  • Hernan MA (2010) The hazards of hazard ratios. Epidemiology 21:13–15

    Article  Google Scholar 

  • Hougaard P (2000) Analysis of multivariate survival data. Springer, New York

    Book  Google Scholar 

  • Hougaard P, Myglegaard P, Borch-Johnsen K (1994) Heterogeneity models of disease susceptibility, with application to diabetic nephropathy. Biometrics 50:1178–1188

    Article  Google Scholar 

  • Kaplan EL, Meier P (1958) Non-parametric estimation from incomplete observations. J Am Stat Assoc 53:457–481

    Article  Google Scholar 

  • Keiding N (1987) The method of expected number of deaths 1786–1886–1986. Int Stat Rev 55:1–20

    Article  MathSciNet  Google Scholar 

  • Keiding N (1992) Independent delayed entry (with discussion). In: Klein JP, Goel PK (eds) Survival analysis: state of the art. Kluwer, Dordrecht, pp 309–326

    Chapter  Google Scholar 

  • Keiding N (2006) Event history analysis and the cross-section. Stat Med 25:2343–2364

    Article  MathSciNet  Google Scholar 

  • Keiding N, Sørensen DN, Ducot B, Bouyer J, Slama R. The current duration approach to analysing time-to-pregnancy: direct validation using an imbedded prevalent cohort study (in preparation)

  • Keiding N, Andersen PK, Klein JP (1997) The role of frailty models and accelerated failure time models in describing heterogeneity due to omitted covariates. Stat Med 16:215–224

    Article  Google Scholar 

  • Keiding N, Kvist K, Hartvig H, Tvede M, Juul S (2002) Estimating time to pregnancy from current durations in a cross-sectional sample. Biostatistics 3:565–578

    Article  Google Scholar 

  • Keiding N, Fine JP, Hansen OH, Slama R (2011) Accelerated failure time regression for backward recurrence times and current durations. Stat Prob Lett 81:724–729

    Article  MathSciNet  Google Scholar 

  • Keiding N, Hansen OHH, Sørensen DN, Slama R (2012) The current duration approach to estimating time to pregnancy (with discussion). Scand J Stat 39:185–213

    Article  MathSciNet  Google Scholar 

  • Leridon H (2004) Can assisted reproduction technology compensate for the natural decline in fertility with age? A model assessment. Hum Reprod 19:1548–1553

    Article  Google Scholar 

  • Louis JF, Thoma ME, Sørensen DN, McLain AC, King RB, Sundaram R, Keiding N, Louis GMB (2013) The prevalence of couple infertility in the United States from a male perspective: evidence from a nationally representative sample. Andrology 1:741–748

    Article  Google Scholar 

  • Martinussen T, Vansteelandt S (2013) On collapsibility and confounding bias in Cox and Aalen regression models. Lifetime Data Anal 19:279–296

    Article  MathSciNet  Google Scholar 

  • Martinussen T, Vansteelandt S, Andersen PK (2018) Subtleties in the interpretation of hazard ratios. arXiv:1810.09192 [math.ST]

  • McLain AC, Sundaram R, Thoma M, Louis GMB (2018) Cautionary note on “Semiparametric modeling of grouped current duration data with preferential reporting”. arXiv:1801.00775 [stat.AP]

  • McLain AC, Sundaram R, Thoma M, Louis GMB (2014) Semiparametric modeling of grouped current duration data with preferential reporting. Stat Med 33:3961–3972

    Article  MathSciNet  Google Scholar 

  • McLaughlin KA, Green JG, Gruber MJ, Sampson NA, Zaslavsky AM, Kessler RC (2010) Childhood adversities and adult psychiatric disorders in the National Comorbidity Survey Replication II. Arch Gen Psychiatry 67:124–132

    Article  Google Scholar 

  • Munda M, Rotolo F, Legrand C (2012) Parfm: parametric frailty models in R. J Stat Softw 51:11

    Article  Google Scholar 

  • Polis CB, Cox CM, Tuncalp Ö, McLain AC, Thoma ME (2017) Estimating infertility prevalence in low-to-middle-income countries: an application of a current duration approach to Demographic and Health Survey data. Hum Reprod 32:1064–1074

    Article  Google Scholar 

  • Scheike T, Keiding N (2006) Design and analysis of time to pregnancy. Stat Methods Med Res 15:127–140

    Article  MathSciNet  Google Scholar 

  • Slama R, Ducot B, Carstensen L, Lorente C, de La Rochebrochard E, Leridon H, Keiding N, Bouyer J (2006) Feasibility of the current duration approach to study human fecundity. Epidemiology 17:440–449

    Article  Google Scholar 

  • Slama R, Hansen O, Ducot B, Bohet A, Sorensen D, Giorgis Allemand L, Eijkemans MJC, Rosetta L, Thalabard JC, Keiding N, Bouyer J (2012) Estimation of the frequency of involuntary infertility on a nation-wide basis. Hum Reprod 27:1489–1498

    Article  Google Scholar 

  • Therneau T (2012) survival: survival analysis, including penalised likelihood. R package version 2.36-14. http://CRAN.R-project.org/package=survival

  • Thoma ME, McLain AC, Louis JF, King RB, Trumble AC, Sundaram R, Louis GMB (2013) Prevalence of infertility in the United States as estimated by the current duration approach and a traditional constructed approach. Fertil Steril 99:1324–1331

    Article  Google Scholar 

  • van Es B, Klaassen CAJ, Oudshoorn K (2000) Survival analysis under cross sectional sampling: length bias and multiplicative censoring. J Stat Plan Inf 91:295–312

    Article  MathSciNet  Google Scholar 

  • Vaupel JW, Manton KG, Stallard E (1979) The impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 16:439–454

    Article  Google Scholar 

  • Weinberg CS, Gladen BC (1986) The beta-geometric distribution applied to comparative fecundity studies. Biometrics 42:547–560

    Article  Google Scholar 

  • Yamaguchi K (2003) Accelerated failure-time mover-stayer regression models for the analysis of last episode data. Sociol Methodol 33:81–110

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niels Keiding.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Keiding, N., Albertsen, K.L., Rytgaard, H.C. et al. Prevalent cohort studies and unobserved heterogeneity. Lifetime Data Anal 25, 712–738 (2019). https://doi.org/10.1007/s10985-019-09479-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-019-09479-9

Keywords

Navigation