Abstract
When item characteristic curves are nondecreasing functions of a latent variable, the conditional or local independence of item responses given the latent variable implies nonnegative conditional covariances between all monotone increasing functions of a set of item responses given any function of the remaining item responses. This general result provides a basis for testing the conditional independence assumption without first specifying a parametric form for the nondecreasing item characteristic curves. The proposed tests are simple, have known asymptotic null distributions, and possess certain optimal properties. In an example, the conditional independence hypothesis is rejected for all possible forms of monotone item characteristic curves.
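One way to make the testable implication concrete is sketched below; it is an illustration under stated assumptions, not necessarily the exact procedure of the paper. For a pair of dichotomous items, examinees are stratified by the total score on the remaining items (one particular function of those responses), and a one-sided Mantel-Haenszel-type statistic checks whether the pooled conditional association is negative. The function names `mh_conditional_association` and `simulate_rasch`, the logistic curves used to simulate data, the normal approximation, and the absence of a continuity correction are all assumptions made for this sketch.

```python
import math
import random
from collections import defaultdict


def mh_conditional_association(responses, i, j):
    """Mantel-Haenszel-type z statistic for items i and j, stratified by rest score.

    responses: list of equal-length lists of 0/1 item scores (hypothetical layout).
    Returns (z, p_lower), where p_lower = P(Z <= z) under a standard normal
    approximation. A small p_lower suggests negative conditional association,
    which would be inconsistent with local independence plus monotone curves.
    """
    # Per stratum: [count of (1,1), count with item i = 1, count with item j = 1, size]
    strata = defaultdict(lambda: [0, 0, 0, 0])
    for row in responses:
        rest = sum(row) - row[i] - row[j]  # score on the remaining items
        cell = strata[rest]
        cell[0] += row[i] * row[j]
        cell[1] += row[i]
        cell[2] += row[j]
        cell[3] += 1

    observed = expected = variance = 0.0
    for n11, n1, m1, n in strata.values():
        if n < 2:
            continue  # a stratum with one examinee carries no information
        observed += n11
        expected += n1 * m1 / n
        # Hypergeometric variance of n11 given the margins of the 2 x 2 table
        variance += n1 * (n - n1) * m1 * (n - m1) / (n * n * (n - 1))

    if variance == 0:
        raise ValueError("no informative strata")
    z = (observed - expected) / math.sqrt(variance)  # no continuity correction
    p_lower = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return z, p_lower


def simulate_rasch(n_persons, difficulties, seed=0):
    """Simulate locally independent responses with logistic (monotone) curves."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_persons):
        theta = rng.gauss(0.0, 1.0)  # latent variable
        data.append([
            int(rng.random() < 1.0 / (1.0 + math.exp(-(theta - b))))
            for b in difficulties
        ])
    return data


if __name__ == "__main__":
    data = simulate_rasch(2000, [-1.0, -0.5, 0.0, 0.5, 1.0])
    z, p_lower = mh_conditional_association(data, 0, 1)
    print(f"z = {z:.2f}, one-sided p = {p_lower:.3f}")
```

In data that satisfy both assumptions, the statistic should not be significantly negative for any item pair. Because many pairs can be examined, some adjustment for multiple testing would be needed before drawing a conclusion from a single small p-value.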
Additional information
The author acknowledges Paul W. Holland for valuable conversations on the subject of this paper; Henry Braun and Fred Lord for comments at a presentation on this subject which led to improvements in the paper; Carl H. Haag for permission to use the data in §4; Bruce Kaplan for assistance with computing; and two referees for helpful suggestions. Requests for reprints should be sent to Paul R. Rosenbaum
Cite this article
Rosenbaum, P.R. Testing the conditional independence and monotonicity assumptions of item response theory. Psychometrika 49, 425–435 (1984). https://doi.org/10.1007/BF02306030