Skip to main content
Log in

Regression models for binary longitudinal responses

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Some conditional models to deal with binary longitudinal responses are proposed, extending random effects models to include serial dependence of Markovian form, and hence allowing for quite general association structures between repeated observations recorded on the same individual. The presence of both these components implies a form of dependence between them, and so a complicated expression for the resulting likelihood. To handle this problem, we introduce, as a first instance, what Follmann and Wu (1995) called, in a different setting, an approximate conditional model, which represents an optimal choice for the general framework of categorical longitudinal responses. Then we define two more formally correct models for the binary case, with no assumption about the distribution of the random effect. All of the discussed models are estimated by means of an EM algorithm for nonparametric maximum likelihood. The algorithm, an adaptation of that used by Aitkin (1996) for the analysis of overdispersed generalized linear models, is initially derived as a form of Gaussian quadrature, and then extended to a completely unknown mixing distribution. A large scale simulation work is described to explore the behaviour of the proposed approaches in a number of different situations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Akaike, H. (1973) Information theory and an extension of the maximum likelihood principle, in Second International Symposium on Information Theory, Petrov, B. N. and Csaki, F. (eds), Akademiai Kiado, Budapest. pp. 267–281.

    Google Scholar 

  • Aitkin, M. (1996) A general maximum likelihood analysis of overdispersion in generalized linear models. Statistics and Computing, 6, 251–262.

    Google Scholar 

  • Aitkin, M. (1997) A general maximum likelihood analysis of variance components in generalized linear models. Biometrics, 55, 218–234.

    Google Scholar 

  • Crouch, E. A. C. and Spiegelman, D. (1990) The evaluation of integrals of the form ∫ −∞ f(t) exp(−t 2)dt: application to logistic-normal models. Journal of American Statistical Association, 85, 464–469.

    Google Scholar 

  • Dempster, A. P., Laird, N. M. and Rubin D. A. (1977) Maximum likelihood estimation from incomplete data via the EM algorithm (with discussion). Journal of Royal Statistical Society B, 39, 1–38.

    Google Scholar 

  • Dietz, E. and Böhning, D. (1995) Statistical inference based on a general model of unobserved heterogeneity, in Statistical Modelling. Springer-Verlag, New York.

    Google Scholar 

  • Diggle, P. J. (1988) An approach to the analysis of repeated measurements. Biometrics, 44, 959–971.

    Google Scholar 

  • Diggle, P. J., Liang, K.-Y. and Zeger, S. L. (1994) Analysis of Longitudinal Data. Oxford Science Publications, Oxford.

    Google Scholar 

  • Follmann, D. and Wu, M. (1995) An approximate generalized linear model with random effects for informative missing data. Biometrics, 51, 151–168.

    Google Scholar 

  • Fotouhi, A. R. and Davies, R. B. (1997) Modelling repeated durations: the initial conditions problem, in Statistical Modelling (Proceedings of the 11th International Workshop on Statistical Modelling; Orvieto, Italy, 15–19 July 1996), pp. 159–166.

  • Kaufmann, H. (1987) Regression models for nonstationary categorical time series: asymptotic estimation theory. Annals of Statistics, 15, 79–98.

    Google Scholar 

  • Leroux, B. G. (1992) Consistent estimation of a mixing distribution. Annals of Statistics, 20, 1350–1360.

    Google Scholar 

  • Laird, N. M. (1978) Nonparametric maximum likelihood estimation of a mixing distribution. Journal of American Statistical Association, 73, 805–811.

    Google Scholar 

  • Lindsay, B. G. (1983a) The geometry of mixture likelihoods: a general theory. Annals of Statistics, 11, 86–94.

    Google Scholar 

  • Lindsay, B. G. (1983b) The geometry of mixture likelihoods, part II: the exponential family. Annals of Statistics, 11, 783–792.

    Google Scholar 

  • Manning, W. G., Duan, N. and Rogers, W. H. (1987) Monte Carlo evidence on the choice between sample selection and two-part models. Journal of Econometrics, 35, 59–82.

    Google Scholar 

  • McLachlan, G. J. (1987) On bootstrapping the likelihood ratio test statistic for the number of components in a normal mixture. Applied Statistics, 36, 318–324.

    Google Scholar 

  • Ware, J. H. (1985) Linear models for the analysis of longitudinal studies. The American Statistician, 39, 95–101.

    Google Scholar 

  • Wu, M. C. and Carroll, R. J. (1988) Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process. Biometrics, 44, 175–188.

    Google Scholar 

  • Wu, M. C. and Bailey, K. R. (1989) Estimation and comparison of changes in the presence of informative right censoring: conditional linear model. Biometrics, 45, 939–955.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

AITKIN, M., ALFÓ, M. Regression models for binary longitudinal responses. Statistics and Computing 8, 289–307 (1998). https://doi.org/10.1023/A:1008847820371

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008847820371

Navigation