Abstract
Item factor analysis (IFA) refers to the factor models and statistical inference procedures for analyzing multivariate categorical data. IFA techniques are commonly used in social and behavioral sciences for analyzing item-level response data. Such models summarize and interpret the dependence structure among a set of categorical variables by a small number of latent factors. In this chapter, we review the IFA modeling technique and commonly used IFA models. Then we discuss the estimation methods for IFA models and their computation, with a focus on the situation where the sample size, the number of items, and the number of factors are all large. Existing statistical software for IFA are surveyed. This chapter is concluded with suggestions for practical applications of IFA methods and discussions of future directions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Orthogonal rotational methods (e.g., varimax rotation; 32) are available in factor analysis that requires the estimated factors to be orthogonal to each other. As the orthogonal requirement of the latent factors is somewhat artificial, we do not discuss them in this chapter.
References
Andersen, E.B.: Conditional Inference and Models for Measuring. Mentalhygiejnisk Forlag, Copenhagen (1973)
Anderson, T.W., Rubin, H.: Statistical inference in factor analysis. In: Neyman J. (ed.) Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume V: Contributions to Econometrics, Industrial Research, and Psychometry, pp. 111–150. University of California Press, Berkerley, CA (1956)
Asparouhov, T., Muthén, B.: Comparison of computational methods for high dimensional item factor analysis. Unpublished manuscript retrieved from www.statmodel.com (2012)
Béguin, A.A., Glas, C.A.: MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika 66(4), 541–561 (2001)
Birnbaum, A.: Some latent trait models and their use in inferring an examinee’s ability. In Lord F.M., Novick, M.R. (eds.) Statistical Theories of Mental Test Scores, pp. 397–479. Addison-Wesley, Reading, MA (1968)
Bock, R.D., Aitkin, M.: Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika 46(4), 443–459 (1981)
Bolt, D.M., Lall, V.F.: Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo. Appl. Psychol. Meas. 27(6), 395–414 (2003)
Browne, M.W.: An overview of analytic rotation in exploratory factor analysis. Multivar. Behav. Res. 36(1), 111–150 (2001)
Cai, L.: High-dimensional exploratory item factor analysis by a Metropolis–Hastings Robbins–Monro algorithm. Psychometrika 75(1), 33–57 (2010)
Cai, L.: Metropolis-Hastings Robbins-Monro algorithm for confirmatory item factor analysis. J. Educ. Behav. Stat. 35(3), 307–335 (2010)
Cai, L., Wirth, R.: flexMIRT version 2: Flexible multilevel multidimensional item analysis and test scoring [Computer software]. Vector Psychometric Group, Chapel Hill, NC (2013)
Cai, L., Du Toit, S., Thissen, D.: IRTPRO: Flexible, Multidimensional, Multiple Categorical IRT Modeling [Computer software]. Scientific Software International, Chicago, IL (2011)
Celeux, G., Diebolt, J.: The SEM algorithm: A probabilistic teacher algorithm derived from the EM algorithm for the mixture problem. Comput. Stat. Q. 2, 73–82 (1985)
Chalmers, R.P.: mirt: A multidimensional item response theory package for the R environment. J. Stat. Softw. 48(6), 1–29 (2012)
Chatterjee, S.: Matrix estimation by universal singular value thresholding. Ann. Stat. 43(1), 177–214 (2015)
Chen, Y., Li, X., Zhang, S.: Joint maximum likelihood estimation for high-dimensional exploratory item factor analysis. Psychometrika 84(1), 124–146 (2019)
Chen, Y., Li, X., Zhang, S.: Structured latent factor analysis for large-scale data: Identifiability, estimability, and their implications. J. Am. Stat. Assoc 115(532), 1756–1770 (2019)
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., Robins, J.: Double/debiased machine learning for treatment and structural parameters. Econom. J. 21(1), C1–C68 (2018)
Chung, K.L.: On a stochastic approximation method. Ann. Math. Stat. 25(3), 463–483 (1954)
Dagum, L., Menon, R.: OpenMP: An industry standard API for shared-memory programming. Comput. Sci. Eng. IEEE 5(1), 46–55 (1998)
Davis, C., Kahan, W.M.: The rotation of eigenvectors by a perturbation. III. SIAM J. Numer. Anal. 7(1), 1–46 (1970)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 39(1), 1–22 (1977)
Edwards, M.C.: A Markov chain Monte Carlo approach to confirmatory item factor analysis. Psychometrika 75(3), 474–497 (2010)
Fabian, V.: On asymptotic normality in stochastic approximation. Ann. Math. Stat. 39(4), 1327–1332 (1968)
Ghosh, M.: Inconsistent maximum likelihood estimators for the Rasch model. Stat. Probab. Lett. 23(2), 165–170 (1995)
Gu, M.G., Kong, F.H.: A stochastic approximation algorithm with Markov chain Monte-Carlo method for incomplete data estimation problems. Proc. Natl. Acad. Sci. 95(13), 7270–7274 (1998)
Haberman, S.J.: Maximum likelihood estimates in exponential response models. Ann. Stat. 5(5), 815–841 (1977)
Holland, P.W.: On the sampling theory roundations of item response theory models. Psychometrika 55(4), 577–601 (1990)
Ip, E.H.: On single versus multiple imputation for a class of stochastic algorithms estimating max. Comput. Stat. 17, 517–524 (2002)
Jöreskog, K.G.: On the estimation of polychoric correlations and their asymptotic covariance matrix. Psychometrika 59(3), 381–389 (1994)
Jöreskog, K.G., Moustaki, I.: Factor analysis of ordinal variables: A comparison of three approaches. Multivar. Behav. Res. 36(3), 347–387 (2001)
Kaiser, H.F.: The varimax criterion for analytic rotation in factor analysis. Psychometrika 23(3), 187–200 (1958)
Kotov, R., Krueger, R.F., Watson, D., Achenbach, T.M., Althoff, R.R., Bagby, R.M., Brown, T.A., Carpenter, W.T., Caspi, A., Clark, L.A., et al.: The hierarchical taxonomy of psychopathology (HiTOP): A dimensional alternative to traditional nosologies. J. Abnorm. Psychol. 126(4), 454–477 (2017)
Lee, S.-Y., Poon, W.-Y., Bentler, P.M.: A three-stage estimation procedure for structural equation models with polytomous variables. Psychometrika 55(1), 45–51 (1990)
Mair, P., Hatzinger, R.: Extended Rasch modeling: The eRm package for the application of IRT models in R. J. Stat. Softw. 20(9), 1–20 (2007)
Martin, A.D., Quinn, K.M., Park, J.H.: MCMCpack: Markov chain Monte Carlo in R. J. Stat. Softw. 42(9), 1–21 (2011)
Meng, X.-L., Schilling, S.: Fitting full-information item factor models and an empirical investigation of bridge sampling. J. Am. Stat. Assoc. 91(435), 1254–1267 (1996)
Muraki, E.: A generalized partial credit model: Application of an EM algorithm. Appl. Psychol. Meas. 16(2), 159–176 (1992)
Muraki, E., Bock, D.: IRT Item Analysis and Test Scoring for Rating Scale Data [Computer software]. Scientific Software, Chicago, IL (1997)
Muraki, E., Carlson, J.E.: Full-information factor analysis for polytomous item responses. Appl. Psychol. Meas. 19(1), 73–90 (1995)
Muthén, B.: A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika 49(1), 115–132 (1984)
Muthén, L.K.: Mplus: The comprehensive modeling program for applied researchers: User’s guide. Muthén & Muthén, Los Angeles, CA (1998)
Neyman, J., Scott, E.L.: Consistent estimates based on partially consistent observations. Econometrica 16(1), 1–32 (1948)
Nielsen, S.F.: The stochastic EM algorithm: Estimation and asymptotic results. Bernoulli 6(3), 457–489 (2000)
Patz, R.J., Junker, B.W.: Applications and extensions of MCMC in IRT: Multiple item types, missing data, and rated responses. J. Educ. Behav. Stat. 24(4), 342–366 (1999)
Rasch, G.: Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research, Copenhagen (1960)
Reckase, M.: Multidimensional Item Response Theory. Springer, New York, NY (2009)
Revelle, W., Condon, D.M., Wilt, J., French, J.A., Brown, A., Elleman, L.G.: Web and phone based data collection using planned missing designs. In: Fielding, N.G., Lee, R.M., Blank, G. (eds.) Handbook of Online Research Methods, pp. 578–595. Sage Publications, Thousand Oaks, CA (2016)
Rizopoulos, D.: ltm: An R package for latent variable modeling and item response theory analyses. J. Stat. Softw. 17(5), 1–25 (2006)
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Stat. 22(3), 400–407 (1951)
Robitzsch, A., Kiefer, T., Wu, M.: TAM: Test analysis modules. R package version 3.3-10 (2019)
Rupp, A.A.: Item response modeling with BILOG-MG and MULTILOG for Windows. Int. J. Test. 3(4), 365–384 (2003)
Samejima, F.: Estimation of latent ability using a response pattern of graded scores. ETS Res. Bull. Ser. 1968(1), i–169 (1968)
Skitka, L.J., Sargis, E.G.: The internet as psychological laboratory. Annu. Rev. Psychol. 57(1), 529–555 (2006)
Spiegelhalter, D., Thomas, A., Best, N., Lunn, D.: WinBUGS user manual: Version 1.4. MRC Biostatistics Unit, Cambridge (2003)
Stock, J.H., Watson, M.W.: Forecasting using principal components from a large number of predictors. J. Am. Stat. Assoc. 97(460), 1167–1179 (2002)
Sun, J., Chen, Y., Liu, J., Ying, Z., Xin, T.: Latent variable selection for multidimensional item response theory models via L 1 regularization. Psychometrika 81(4), 921–939 (2016)
van der Vaart, A.W.: Asymptotic Statistics. Cambridge University Press, Cambridge (1998)
Varin, C., Reid, N., Firth, D.: An overview of composite likelihood methods. Stat. Sin. 21(1), 5–42 (2011)
Wedin, P.-Å.: Perturbation bounds in connection with singular value decomposition. BIT Numer. Math. 12(1), 99–111 (1972)
Wei, G.C.G., Tanner, M.A.: A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms. J. Am. Stat. Assoc. 85(411), 699–704 (1990)
Yao, L., Schwarz, R.D.: A multidimensional partial credit model with associated item and test statistics: An application to mixed-format tests. Appl. Psychol. Meas. 30(6), 469–492 (2006)
Zhang, S., Chen, Y.: lvmcomp: Stochastic EM algorithms for latent variable models with a high-dimensional latent space. R package version 1.3.0 (2019)
Zhang, S., Chen, Y., Li, X.: mirtjml: Joint maximum likelihood estimation for high-dimensional item factor analysis. R package version 1.3.0 (2019)
Zhang, H., Chen, Y., Li, X.: A note on exploratory item factor analysis by singular value decomposition. Psychometrika 85, 358–372 (2020)
Zhang, S., Chen, Y., Liu, Y.: An improved stochastic EM algorithm for large-scale full-information item factor analysis. Br. J. Math. Stat. Psychol. 73(1), 44–71 (2020)
Zhao, Y., Joe, H.: Composite likelihood estimation in multivariate data analysis. Can. J. Stat. 33(3), 335–356 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Chen, Y., Zhang, S. (2021). Estimation Methods for Item Factor Analysis: An Overview. In: Zhao, Y., Chen, (.DG. (eds) Modern Statistical Methods for Health Research. Emerging Topics in Statistics and Biostatistics . Springer, Cham. https://doi.org/10.1007/978-3-030-72437-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-72437-5_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72436-8
Online ISBN: 978-3-030-72437-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)