Abstract
Multilevel covariance structure models have become increasingly popular in the psychometric literature in the past few years to account for population heterogeneity and complex study designs. We develop practical simulation based procedures for Bayesian inference of multilevel binary factor analysis models. We illustrate how Markov Chain Monte Carlo procedures such as Gibbs sampling and Metropolis-Hastings methods can be used to perform Bayesian inference, model checking and model comparison without the need for multidimensional numerical integration. We illustrate the proposed estimation methods using three simulation studies and an application involving student's achievement results in different areas of mathematics.
Similar content being viewed by others
References
Albert, J., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data.Journal of the American Statistical Association, 88, 669–679.
Albert, J., & Chib, S. (1995). Bayesian residual analysis for binary response regression models.Biometrika, 82, 747–759.
Arminger, G., & Muthén B. (1998). A Bayesian approach to nonlinear latent variable models using the Gibbs sampler and the Metropolis-Hastings algorithm.Psychometrika, 63, 271–300.
Bartholomew, D.J. (1980). Factor analysis for categorical data.Journal of the Royal Statistical Society, Series B, 42, 293–321.
Bartholomew, D.J. (1981). Posterior analysis of the factor model.British Journal of Mathematical and Statistical Psychology, 34, 93–99.
Bartholomew, D.J. (1984). Scaling binary data using a factor model.Journal of the Royal Statistical Society, Series B, 46, 120–123.
Bartholomew, D.J. (1987).Latent variable models and factor analysis, New York, NY: Oxford University Press.
Best, N.G., Cowles, M.K., & Vines, S.K. (1995).CODA: Convergence diagnostics and output analysis software for Gibbs sampler output, Version 0.3. (Tech. Rep.). Cambridge, UK: Biostatistics Unit-MRC.
Bock, R.D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm.Psychometrika, 46, 443–445.
Bock, R.D., & Gibbons, R.D., (1996). High-dimensional multivariate probit analysis.Biometrics, 52, 1183–1194.
Brooks, S.P., & Gelman, A. (1998). General methods for monitoring convergence of iterative simulations.Journal of Computational and Graphical Statistics, 7, 434–455.
Brooks, S.P., & Roberts, G.O. (in press). Assessing convergence of Markov chain Monte Carlo algorithms.Journal of Computational and Graphical Statistics.
Chambers, R.G. (1982). Correlation coefficients from 2 × 2 tables and from biserial data.British Journal of Mathematical and Statistical Psychology, 35, 216–227.
Chen, Ming-Hui, & Dey, D.K. (1998). Bayesian modeling of correlated binary responses via scale mixture of multivariate normal link functions.Sankhya, Series A, 60, 322–343.
Chen, Ming-Hui, & Schmeiser, B.W. (1993). Performance of the Gibbs, Hit-and-Run, and Metropolis Samplers.Journal of Computational and Graphical Statistics, 2, 251–272.
Chib, S., & Greenberg, E. (1995). Understanding the Metropolis-Hastings Algorithm.American Statistician, 49, 327–35.
Chib, S., & Greenberg, E. (1998). Analysis of Multivariate Probit Models.Biometrika, 85(2), 347–361.
Christofferson, A. (1975). Factor analysis of dichotomized variables.Psychometrika, 40, 5–32.
Cowles, M.K., & Carlin, B.P. (1996). Markov chain Monte Carlo convergence diagnostics: a comparative review.Journal of the American Statistical Association, 91, 883–904.
Crosswhite, F.J., Dossey, J.A., Swafford, J.O., McKnight, C.C., & Cooney, T.J. (1985).Second International Mathematics Study: Summary report for the United States. Champaign, IL: Stipes.
Chen, M.H., & Dey, Dipak K. (1998). Bayesian analysis of correlated binary data models.Sankhya, Series A, 60, 322–343.
Gelfand, A.E. (1996). Model determination using sampling-based methods. In W.R. Gilks, S. Richardson, & D. J. Spiegelhalter (Eds.),Markov chain Monte Carlo in practice (pp. 145–161). London: Chapman & Hall.
Gelfand, A.E., & Smith, A.F.M. (1990). Sampling-based approaches to calculating marginal densities.Journal of the American Statistical Association, 85, 972–985.
Gelman, A., Carlin, J.B., Stern, H. S., & Rubin, D. R. (1996). Posterior predictive assessment of model fitness (with discussion).Statistica Sinica, 6, 733–807.
Gelman, A., & Rubin, D.R. (1992). Inference from iterative simulation using multiple sequences (with discussion).Statistical Science, 7, 457–511.
Geman S., & Geman, D. (1984). Stochastic Relaxation, Gibbs Distributions and the Bayesian Restoration of Images.IEEE Transactions of Pattern Analysis and Machine Intelligence, 6, 721–741.
Geweke, J. (1992). Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments (with discussion). In J. M. Bernardo et al. (Eds.),Bayesian Statistics 4 (pp. 156–163). Oxford: Oxford University Press.
Geyer, C.J. (1992). Practical Markov chain Monte Carlo (with discussion).Statistical Science, 7, 473–511.
Goldstein, H., & McDonald, R.P. (1988). A general model for the analysis of multilevel data.Psychometrika, 53, 455–467.
Kass, R.E., & Raftery, A.E. (1995). Bayes factors.Journal of American Statistical Association, 90, 773–795.
Lee, S.-Y (1981). A Bayesian approach to confirmatory factor analysis.Psychometrika, 46, 153–160.
Longford, N.T., & Muthén, B. (1992). Factor analysis for clustered observations.Psychometrika, 57, 581–597.
Mardia, K.V. (1970).Families of bivariate distributions, London: Griffin.
Martin, J.K. & McDonald, R.P. (1975). Bayesian estimation in unrestricted factor analysis; a treatment for Heywood cases.Psychometrika, 40, 505–517.
McDonald, R.P., & Goldstein, H. (1989). Balanced versus unbalanced designs for linear structural relations in two-level data.British Journal of Mathematical and Statistical Psychology, 42, 214–232.
Muthén, B. (1978). Contributions to factor analysis of dichotomous variables.Psychometrika, 43, 551–560.
Muthén, B. (1979). A structural probit model with latent variables.Journal of the American Statistical Association, 74, 807–811.
Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent variable indicators.Psychometrika, 49, 115–132.
Muthén, B. (1987).LISCOMP: Analysis of linear structural equations with a comprehensive measurement model (User's Guide). Mooresville, IN: Scientific Software.
Muthén, B. (1989). Latent variable modeling in heterogeneous populations.Psychometrika, 54, 557–585.
Muthén, B. (1994). Multilevel covariance structure analysis.Sociological Methods & Research 22, 376–398.
Muthén, B., & Christofferson, A. (1981). Simultaneous factor analysis of dichotomous variables in several groups.Psychometrika, 46, 407–419.
Muthén, B., & Satorra, A. (1989). Multilevel aspects of varying parameters in structural models. In R. D. Bock (Ed.),Multilevel analysis of educational data (pp. 87–99). New York, NY: Academic Press.
Sahu, S.K. (1998). Bayesian estimation and model choice in item response models (Tech. Rep.). Cardiff, Wales, UK: Cardiff University, School of Mathematics.
Shi, J., & Lee, S.-Y. (1997). A Bayesian estimation of factor score in confirmatory factor model with polytomous, censored or truncated data.Psychometrika, 62, 29–50.
Tanner, M.A., & Wong, W.H. (1987). The calculation of posterior distributions by data augmentation (with discussion).Journal of American Statistical Association, 82, 528–550.
Tierney, L. (1994). Markov chains for exploring posterior distributions (with discussion),Annals of Statistics, 22, 1701–1762.
Author information
Authors and Affiliations
Corresponding author
Additional information
The authors thank Ian Westbury, University of Illinois at Urbana Champaign for kindly providing the SIMS data for the application.
Rights and permissions
About this article
Cite this article
Ansari, A., Jedidi, K. Bayesian factor analysis for multilevel binary observations. Psychometrika 65, 475–496 (2000). https://doi.org/10.1007/BF02296339
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02296339