Loglinear multidimensional IRT models for polytomously scored items

Kelderman, Henk; Rijkes, Carl P. M.

doi:10.1007/BF02295181

Loglinear multidimensional IRT models for polytomously scored items

Published: June 1994

Volume 59, pages 149–176, (1994)
Cite this article

Psychometrika Aims and scope Submit manuscript

Henk Kelderman¹ &
Carl P. M. Rijkes¹

76 Citations
Explore all metrics

Abstract

A loglinear IRT model is proposed that relates polytomously scored item responses to a multidimensional latent space. The analyst may specify a response function for each response, indicating which latent abilities are necessary to arrive at that response. Each item may have a different number of response categories, so that free response items are more easily analyzed. Conditional maximum likelihood estimates are derived and the models may be tested generally or against alternative loglinear IRT models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Article Open access 22 August 2014

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

References

Akaike, H. (1977). On entropy maximization principle. In P. R. Krisschnaiah (Ed.),Applications of statistics (pp. 27–41). Amsterdam: North Holland.
Google Scholar
Andersen, E. B. (1973). Conditional inference and multiple choice questionnaires.British Journal of Mathematical and Statistical Psychology, 26, 31–44.
Google Scholar
Andersen, E. B. (1983). A general latent structure model for contingency table data. In H. Wainer & S. Messick (Eds.),Principals of modern psychological measurement (pp. 117–138). Hillsdale, NJ: Lawrence Erlbaum.
Google Scholar
Andrich, D. (1978). A rating scale formulation for ordered response categories.Psychometrika, 43, 561–573.
Google Scholar
Andrich, D. (1982). An extension of the Rasch model for ratings providing both location and dispersion parameters.Psychometrika, 47, 105–113.
Google Scholar
Baglivo, J., Olivier, D., & Pagano, M. (1992). Methods for exact goodness-of-fit tests,Journal of the American Statistical Association, 87, 464–469.
Google Scholar
Bishop, Y. M. M., Fienberg, S. E., & Holland, P. W. (1975).Discrete multivariate analysis. Cambridge, MA: MIT Press.
Google Scholar
Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories.Psychometrika, 37, 29–51.
Google Scholar
Carpenter, P. A., Just, M. A., & Shell, P. (1990). What one intelligence test measures: A theoretical account of the processing in the Raven Progressive Matrices Test.Psychological Review, 97, 404–431.
Google Scholar
Cox, M. A. A., & Placket, R. L. (1980). Small samples in contingency tables.Biometrika, 67, 1–13.
Google Scholar
Cressie, N., & Holland, P. W. (1983). Characterizing the manifest probabilities of latent trait models.Psychometrika, 48, 129–142.
Google Scholar
de Leeuw, J., & Verhelst, N. D. (1986). Maximum likelihood estimation in generalized Rasch models.Journal of Educational Statistics, 11, 183–196.
Google Scholar
Duncan, O. D. (1984). Rasch measurement: Further examples and discussion. In C. F. Turner & E. Martin (Eds.),Surveying subjective phenomena, Vol. 2 (pp. 367–403). New York: Russell Sage Foundation.
Google Scholar
Duncan, O. D., & Stenbeck, M. (1987). Are Likert scales unidimensional?Social Science Research, 16, 245–259.
Google Scholar
Embretson, S. E. (1984). A general latent trait model for response processes.Psychometrika, 49, 175–186.
Google Scholar
Embretson, S. E. (1985). Multicomponent latent trait models for test design. In S. E. Embretson (Ed.),Test design: Developments in psychology and psychometrics (pp. 195–218). Orlando, FL: Academic Press.
Google Scholar
Embretson, S. E. (1991).Measuring and validating the cognitive modifiability construct. Poster Presentation at the Annual Meeting of the American Educational Research Association.
Fischer, G. H. (1972). A measurement model for the effect of mass-media.Acta Psychologica, 36, 207–220.
Google Scholar
Fischer, G. H. (1973). The linear logistic test model as an instrument in educational research.Acta Psychologica, 36, 359–374.
Google Scholar
Fischer, G. H. (1974).Einführung in die Theorie psychologischer Tests [Introduction to the theory of psychological test]. Bern: Huber (In German).
Google Scholar
Fischer, G. H. (1976). Some probabilistic models for measuring change. In D. N. M. de Gruyter & L. J. Th. van der Kamp (Eds.),Advances in psychological and educational measurement (pp. 97–110). New York: Wiley.
Google Scholar
Fischer, G. H., & Forman, A. K. (1982). Some applications of logistic latent trait models with linear constraints on the parameters.Applied Psychological Measurement, 6, 397–416.
Google Scholar
Forbes, A. R. (1964). An item analysis of the advanced matrices.British Journal of Educational Psychology, 34, 1–14.
Google Scholar
Frederiksen, J. R. (1982). A componential theory of reading skills and their interactions. In R. J. Sternberg (Ed.),Advances in the psychology of human intelligence Vol. 1 (pp. 125–180). Hillsdale, NJ: Lawrence Erlbaum.
Google Scholar
Glas, CA. W., & Verhelst, N. D. (1989). Extensions of the Partial Credit Model.Psychometrika, 54, 635–660.
Google Scholar
Haberman, S. J. (1977). Log-linear models and frequency tables with small cell counts,Annals of Statistics, 5, 1124–1147.
Google Scholar
Haberman, S. J. (1979).Analysis of qualitative data: New developments, Vol. 2. New York: Academic Press.
Google Scholar
Hunt, E. B. (1974). Quote the Raven? Nevermore! In L. W. Greg (Ed.),Knowledge and cognition (pp. 129–158). Hillsdale, NJ: Lawrence Erlbaum.
Google Scholar
Imrey, P. B., Koch, G. C., & Stokes, M. E. (1981). Categorical data analysis: Some reflections on the loglinear model and logistic regression. Part I: Historical and methodological overview.International Statistical Overview, 49, 265–283.
Google Scholar
Kelderman, H. (1984). Loglinear Rasch model tests.Psychometrika, 49, 223–245.
Google Scholar
Kelderman, H. (1989). Item bias detection using loglinear IRT.Psychometrika, 54, 681–697.
Google Scholar
Kelderman, H. (1992). Computing maximum likelihood estimates of loglinear IRT models from marginal sums.Psychometrika, 57, 437–450.
Google Scholar
Kelderman, H., & Steen, R. (1988).Logimo: Loglinear IRT modeling [Program Manual]. Enschede, The Netherlands: University of Twente.
Google Scholar
Koehler, K. J. (1977).Goodness-of-fit statistics for large sparse multinomials. Unpublished doctoral dissertation, University of Minnesota, School of Statistics.
Koehler, K. J. (1986). Goodness-of-fit tests for log-linear models in sparse contingency tables.Journal of the American Statistical Association, 81, 483–493.
Google Scholar
Lancaster, H. O. (1961). Significance tests in discrete distributions.Journal of the American Statistical Association, 56, 223–234.
Google Scholar
Lehmann, E. L. (1983).The theory of point estimation. New York: John Wiley.
Google Scholar
Marshalek, B., Lohman, D. F., & Snow, R. E. (1983). The complexity continuum in the radex and hierarchical models of intelligence.Intelligence, 7, 107–127.
Google Scholar
Masters, G. N. (1982). A Rasch model for partial credit scoring.Psychometrika, 47, 149–174.
Google Scholar
Muraki, E. (1990). Fitting a polytomous item response model to Likert type data.Applied Psychological Measurement, 14, 59–71.
Google Scholar
Newell, A. (1977). On the analysis of human problem solving protocols. In P. N. Johnson-Laird & P. C. Wason,Thinking (pp. 46–61). London: Cambridge University Press.
Google Scholar
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations.Econometrica, 16, 1–32.
Google Scholar
Rao, C. R. (1973).Linear statistical inference and its applications. New York: Wiley.
Google Scholar
Rasch, G. (1961). On general laws and the meaning of measurement in psychology.Proceedings of the fourth Berkeley symposium on mathematical statistics and probability (pp. 321–333). Berkeley, CA: University of California Press.
Google Scholar
Rasch, G. (1980).Probabilistic models for some intelligence and attainment tests. Chicago: The University of Chicago Press.
Google Scholar
Raven, J., Raven, J. C., & Court, J. H. (1991).Manual for Raven's progressive matrices and vocabulary scales (section 1): General overview. Oxford: Oxford Psychologists Press.
Google Scholar
Read, T. R. C., & Cressie, N. (1988).Goodness-of-fit statistics for discrete multivariate data. New York: Springer-Verlag.
Google Scholar
Rost, J. (1988). Measuring attitudes with a threshold model drawing on a traditional scaling concept.Applied Psychological Measurement, 12, 397–409.
Google Scholar
Samejima, F. (1972). A general model for free-response data.Psychometrika Monograph No. 18, 37 (4, Pt. 2).
Scheiblechner, H. (1972). Das lernen und lòsen komplexer Denkaufgaben [Learning and solving complex cognitive problems].Zeitschrift für experimentelle und Angewandte Psychologie, 19, 476–506. (In German)
Google Scholar
Spada, H. (1976).Modelle des Denkens und Lernens [Models of thinking and learning]. Bern: Huber. (In German)
Google Scholar
Stenner, A. J., Smith, III, M., & Burdick, D. (1983). Toward a theory of construct definition.Journal of Educational Measurement, 20, 303–316.
Google Scholar
Sternberg, R. J. (Ed.). (1982).Advances in the psychology of human intelligence, Vol. 1. Hillsdale, NJ: Lawrence Erlbaum.
Google Scholar
Thissen, D., & Steinberg, L. (1984). A response model for multiple choice items.Psychometrika, 49, 501–519.
Google Scholar
Theunissen, T. J. J. M. (1985). Binary programming and test design.Psychometrika, 50, 411–420.
Google Scholar
Tjur, T. (1982). A connection between Rasch's item analysis model and a multiplicative Poisson model.Scandinavian Journal of Statistics, 9, 23–30.
Google Scholar
van den Wollenberg, A. L. (1982). Two new test statistics for the Rasch model.Psychometrika, 47, 123–140.
Google Scholar
van der Linden, W. J., & Boekkooi-Timminga, E. (1989). A maximum model for test design with practical constraints.Psychometrika, 54, 237–248.
Google Scholar
Wilson, M. (1989).The partial order model. Paper presented at the Fifth International Objective Measurement Workshop, Berkeley, CA.
Wilson, M. (1990).An extension of the partial credit model to incorporate diagnostic information. Unpublished manuscript, University of California, Graduate School of Education, Berkeley, CA.
Google Scholar
Wright, B. D., & Masters, G. N. (1982).Rating scale analysis. Chicago: MESA Press.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Twente, The Netherlands
Henk Kelderman & Carl P. M. Rijkes

Authors

Henk Kelderman
View author publications
You can also search for this author in PubMed Google Scholar
Carl P. M. Rijkes
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Hank Kelderman is currently affiliated with Vrije Universiteit, Amsterdam.

We thank Linda Vodegel-Matzen of the Division of Developmental Psychology of the University of Amsterdam for making available the data used in the example in this article.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kelderman, H., Rijkes, C.P.M. Loglinear multidimensional IRT models for polytomously scored items. Psychometrika 59, 149–176 (1994). https://doi.org/10.1007/BF02295181

Download citation

Received: 19 October 1988
Revised: 18 November 1992
Issue Date: June 1994
DOI: https://doi.org/10.1007/BF02295181

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Loglinear multidimensional IRT models for polytomously scored items

Abstract

Access this article

Similar content being viewed by others

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Key words

Navigation

Loglinear multidimensional IRT models for polytomously scored items

Abstract

Access this article

Similar content being viewed by others

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation