Abstract
The most probable distribution method is applied to derive the logistic model as the distribution accounting for the maximum number of possible outcomes in a dichotomous test while introducing latent traits and item characteristics as constraints to the system. The item response theory logistic models, with a particular focus on the one-parameter logistic model, or Rasch model, and their properties and assumptions, are discussed for both infinite and finite populations.
Similar content being viewed by others
References
Adams, E.W. (1965). Elements of a theory of inexact measurement. Philosophy of Science, 32(3), 205–228.
Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561–573.
Andrich, D. (1982). An extension of the Rasch model for ratings providing both location and dispersion parameters. Psychometrika, 47(1), 105–113.
Aczel, J. (1966). Lectures on functional equations and their applications. New York: Academic Press.
Aczel, J., & Dohmbres, J. (1989). Functional equations in several variables. Cambridge: Cambridge University Press.
Bradley, R.A., & Terry, M.E. (1952). Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika, 39(3/4), 324–345.
Bernardo, J.M., & Smith, A.F.M. (1994). Bayesian theory. Chichester: Wiley.
Barton, M.A., & Lord, F.M. (1981). An upper asymptote for the three-parameter logistic item-response model. Princeton: Educational testing service.
Clinton, W.L., & Massa, L.J. (1972). Derivation of a statistical mechanical distribution function by a method of inequalities. American Journal of Physics, 40, 608–610.
Davis-Stober, C.P. (2009). Analysis of multinomial models under inequalities constraints: applications to measurement theory. Journal of Mathematical Psychology, 53, 1–13.
Fischer, G.H. (1973). The linear logistic test model as an instrument in educational research. Acta Psychologica, 37, 359–374.
Fischer, G.H. (1995). Some neglected problems in IRT. Psychometrika, 60(4), 459–487.
Fischer, G.H., & Molenaar, I.W. (1995). Rasch models: foundations, recent developments, and applications. New York: Springer.
Fishburn, P.C. (1981). Uniqueness properties in finite-continuous additive measurement. Mathematical Social Sciences, 1(2), 145–153.
Gonzales, C. (2000). Two factor additive conjoint measurement with one solvable component. Journal of Mathematical Psychology, 44, 285–309.
Holland, P.W. (1990). On the sampling theory foundations of item response theory models. Psychometrika, 55(4), 577–601.
Huang, K. (1987). Statistical mechanics. New York: Wiley.
Irtel, H. (1987). On specific objectivity as a concept in measurement. In E.E. Roskam & R. Suck (Eds.), Progress in mathematical psychology-1. Amsterdam: Elsevier.
Irtel, H. (1993). The uniqueness of simple latent trait models. In G.H. Fischer & D. Laming (Eds.), Contributions to mathematical psychology, psychometrics, and methodology. New York: Springer.
Jaynes, E.T. (1957). Information theory and statistical mechanics. The Physical Review, 106(4), 620–630.
Jaynes, E.T. (1968). Prior probabilities. IEEE Transactions on Systems Science and Cybernetics, 4(3), 227–241.
Kagan, A.M., Linnik, V.Y., & Rao, C.R. (1973). Characterization problems in mathematical statistics. New York: Wiley.
Karabatsos, G. (2001). The Rasch model, additive conjoint measurement, and new models of probabilistic measurement theory. Journal of Applied Measurement, 2(4), 389–423.
Kyngdon, A. (2008). The Rasch model from the perspective of the representational theory of measurement. Theory & Psychology, 18, 89–109.
Kyngdon, A. (2011). Plausible measurement analogies to some psychometric models of test performance. British Journal of Mathematical & Statistical Psychology, 64, 478–497.
Krantz, D.H., Luce, R.D., Suppes, P., & Tversky, A. (1971). Foundations of measurement. Vol. 1: Additive and polynomial representations. San Diego: Academic Press.
Landsberg, P.T. (1954). On most probable distributions. Proceedings of the National Academy of Sciences, 40, 149–154.
Lord, F.M., & Novik, M.R. (1968). Statistical theories of mental test scores. London: Addison-Wesley.
Luce, R.D. (1959). Individual choice behavior: a theoretical analysis. New York: Wiley.
Luce, R.D., Krantz, D.H., Suppes, S., & Tversky, A. (1990). Foundations of measurement. Vol. 3: Representation, axiomatization and invariance. San Diego: Academic Press.
Luce, R.D., & Narens, L. (1994). Fifteen problems concerning the representational theories of measurement. In P. Humpreys (Ed.), Patrick suppes: scientific philosopher, (Vol. 2). Dordrecht: Kluwer Academic.
Luce, R.D., & Tukey, J.W. (1964). Simultaneous conjoint measurement: a new scale type of fundamental measurement. Journal of Mathematical Psychology, 1, 1–27.
Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149–174.
Michell, J. (1990). An introduction to the logic of psychological measurement. Hillsdale: Erlbaum.
Michell, J. (2009). The psychometricians’ fallacy: too clever by half? British Journal of Mathematical & Statistical Psychology, 62, 41–55.
Perline, R., Wright, B.D., & Wainer, H. (1979). The Rasch model as additive conjoint measurement. Applied Psychological Measurement, 3, 237–255.
Pfanzagl, J. (1971). Theory of measurement. Wurzburg and Vienna: Physica-Verlag.
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Nielsen & Lydiche.
Rasch, G. (1972). On specific objectivity. An attempt at formalizing the request for generality and validity of scientific statements. In M. Blegvad (Ed.), The Danish yearbook of philosophy. Copenhagen: Munksgaard.
Scott, D. (1964). Measurement structures and linear inequalities. Journal of Mathematical Psychology, 1, 233–247.
Suppes, P., & Zinnes, J.L. (1963). Basic theory of measurement. In R.D. Luce, R.R. Bush, & E. Galanter (Eds.), Handbook Math. Psych.: Vol. 1. New York: Wiley.
Scheiblechner, H. (1972). Das Lernen und Lösen komplexer Denkaufgaben [The learning and solving of complex reasoning items]. Zeitschrift für Experimentelle und Angewandte Psychologie, 3, 456–506.
Scheiblechner, H. (1995). Isotonic psychometrics models. Psychometrika, 60, 281–304.
Scheiblechner, H. (1999). Additive conjoint isotonic probabilistic models. Psychometrika, 64, 295–316.
Tversky, A. (1967). A general theory of polynomial conjoint measurement. Journal of Mathematical Psychology, 4, 1–20.
Acknowledgements
We wish to thank the two anonymous reviewers of the journal for their insight into the work and their helpful comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A. Derivation of Condition (24)
Inequalities (22) and (23) in the case of Equation (21) (and omitting the constant term μ for simplicity of notation) become
so that expanding the logarithm and simplifying the common terms gives
Since, now, by definition of factorial, ln(n+1)!=ln(n+1)+lnn!, the previous inequalities become
which, by setting the finite and backward differences of h jk as
can finally be rewritten as
which when divided by g jk leads to condition (24).
Appendix B. Derivation of Equation (38) Applying Stirling Formula
The case of an infinite population can be described by the limits n jk →∞, g jk →∞ and g jk −n jk →∞. Under these limits, variations Δn jk in the discrete variables n jk are often considered to be negligible compared to the value of n jk , namely Δn jk ≪n jk , so that it can be directly considered the limit Δn jk →0.
In such a case the forward difference equation can be written as
while the backward difference equation can be given by switching the operator Δ with ∇. Since, now, due to linearity, variation passes under the sign of summation, the previous equation becomes
and the summations can be dropped, leading to
which can be rewritten as
The previous equation is usually approximated by means of the Stirling formula, lnN!≈NlnN−N, in order to get rid of the factorials. In passing, notice that the Stirling formula misses a term \(\frac{1}{2}\ln 2\pi N\) with respect to the approximation (62). This is not a problem since this additional term vanishes in the limit for N→∞. Hence,
which simplified gives
Since the forward difference implies n jk →n jk +Δn jk (while the backward difference implies n jk →n jk −Δn jk ), the equation becomes
which, dividing by Δn jk , and taking the limit Δn jk →0, gives
where, by definition of a derivative,
so that the forward finite variation Δ has been switched with the right derivative \(\partial^{+}_{n_{jk}}\), thus giving
A similar result can be obtained for the backward difference equation, so that the operator ∇ can be switched with the left derivative \(\partial^{-}_{n_{jk}}\), giving
Hence, both Equations (B.1) and (B.2) are satisfied when the function h(α j ,δ k ,n jk ) is derivable so that both right and left derivatives exist and coincide, \(\partial^{+}_{n_{jk}}h=\partial^{-}_{n_{jk}}h\) , thus giving Equation (38).
Appendix C. Other IRT Logistic Models
Other parameters in the logistic item response model can be introduced by modifying the constraint and the multiplicity in order to account for lucky guesses, careless errors, and single item discriminability.
3.1 C.1 Permutations
If in the (jk)th cluster there are l jk lucky guesses and c jk careless errors, it must be considered how n jk −l jk responses can distribute themselves inside g jk −c jk −l jk cells in the matrix, so the total number of possible outcomes (15) becomes
3.2 C.2 Constraints
Items possessing different discriminating power can be accounted for by means of constraint (44),
where the coefficient a k is the Birnbaum parameter (see Birnbaum in Lord & Novik 1968) and becomes just a dilation parameter for all the items when a k =a for every item as in (44).
3.3 C.3 Derivation
For handiness of calculations, one can follow the same continuous derivation given in Appendix B (omitting the constant parameter μ for simplicity of calculation) in the case of n jk →∞, g jk →∞ and g jk −n jk →∞. Due to linearity, the finite variation Δ passes under the sign of summation, which gives
and the summation can be dropped, leading to
which can be rewritten as
Since in the population limit the Stirling approximation can be applied to the logarithm of a factorial, the difference formula becomes
which simplified gives
Consider, now, n jk as a continuous variable, as has already been done in Appendix B, so that the finite variation Δ can be switched into the derivative \(\partial_{n_{jk}}\) thus giving
which, simplifying, is
so that
which can be rewritten as
or equivalently as
which gives
Since the proportion c jk /g jk gives the probability of a careless error \(P^{C}_{jk}\), and the proportion l jk /g jk gives the probability of a lucky guess \(P^{L}_{jk}\), dividing the previous equation by g jk returns the most probable distribution for the number of cells filled in a cluster,
which is exactly model (47), if one defines \(b_{k}=P^{L}_{jk}\) and \(c_{k}=1-P^{C}_{jk}\). Notice that the choice of omitting the index j is just a simplification in which lucky guesses and careless errors are considered to depend only on the item difficulty.
Rights and permissions
About this article
Cite this article
Noventa, S., Stefanutti, L. & Vidotto, G. An Analysis of Item Response Theory and Rasch Models Based on the Most Probable Distribution Method. Psychometrika 79, 377–402 (2014). https://doi.org/10.1007/s11336-013-9348-y
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11336-013-9348-y