A Multi-Dimensional Continuous Item Response Model for Probability Testing

Zhang, Yiping; Watanabe, Hiroshi

doi:10.2333/bhmk.39.183

A Multi-Dimensional Continuous Item Response Model for Probability Testing

Published: 15 July 2012

Volume 39, pages 183–197, (2012)
Cite this article

Behaviormetrika Aims and scope Submit manuscript

Yiping Zhang¹ &
Hiroshi Watanabe²

16 Accesses
Explore all metrics

Abstract

Probability testing (PT) is a way to respond to multiple-choice test items. In PT the examinee gives to each response option his/her subjective probability of its being correct as an expression of partial knowledge. By using PT more item information can be drawn from the subjects than the other scoring methods that can be used for multiple-choice items. In this research, a multi-dimensional continuous item response model for PT is proposed. Moreover, the matrix of information function, a method of estimating item parameter, a method of estimating the subject’s vector of latent traits are introduced.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ben-Simon, A., & Budescu, D. V., & Nevo, B. (1997). A comparative study of measures of partial knowledge in multiple-choice tests. Applied Psychological Measurement, 21, 65–88.
Article Google Scholar
de Finetti, B. (1965). Methods for discriminating levels of partial knowledge concerning a test item. British Journal of Mathematical and Statistical Psychology, 18, 87–123.
Article Google Scholar
Elderton, W. P., & Johnson, N. L. (1969). System of frequency curves. Cambridge: Combeidge University Press.
Book Google Scholar
Hambleton, R. K., Roberts, D. M., & Traub, R. E. (1970). A comparison of the reliability and validity of two methods for assessing partial knowledge on a multiple choice testing. Journal of Educational Measurement, 7, 75–82.
Article Google Scholar
Johnson, N. L., & Kotz, S., & Balakrishnan, N. (1995). Continuous univariate distributions, Vol. 2, 2th ed. New York: John Wiley and Sons.
Kansup, W., & Hakstian, A. R. (1975). A comparison of several methods of assessing partial knowledge in multiple-choice tests: I. Scoring procedures. Journal of Educational Measurement, 12, 212–230.
Article Google Scholar
Lawley, D. N. and A. E. Maxwell. (1971). Factor analysis as a statistical method. (2nd edition) New York: American Elsevier.
MATH Google Scholar
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
MATH Google Scholar
Michael, J. C. (1968). The reliability of a multiple choice examination under various test-taking instructions. Journal of Educational Measurement, 5, 307–314.
Article Google Scholar
Press, S. J., & Shigemasu, K. (1989). Bayesian inference in factor analysis. In Gleser, L., & Perleman, M., & Press, S. J. (Eds.), Contributions to probability and statistics. New York: Springer-Verlag.
Google Scholar
Pugh, R. C., & Brunza, J. J. (1975). Effects of a confidence weighted scoring system on measures of test reliability and validity. Educational and Psychological Measurement, 35, 73–78.
Article Google Scholar
Rippey, R. M. (1970). A comparison of five different scoring function for confidence tests. Journal of Educational Measurement, 7, 165–170.
Article Google Scholar
Samejima, F. (1974). Normal ogive model on the continuous response level in the multidimensional latent space. Psychometrika, 39, 111–121.
Article MathSciNet Google Scholar
Shuford, E. H., Albert, A., & Massengill, H. E. (1966). Admissible probability measurement procedures. Psychometrika, 31, 125–145.
Article Google Scholar
Suhadolnik, D., & Weiss, D. J. (1983). Effect of examinee certainty on probabilistic test scores and a comparison of scoring methods for probabilistic responses (Research Report 83-3). University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory, Minneapolis.
Google Scholar
Zhang, Y. P. (2007). An item response model on Probability-Testing. University of Tokyo Press. (in Japanese)
Google Scholar
Zhang, Y. P., & Watanabe, H. (2007). Test scoring methods and their problems. IMPS2007.
Google Scholar

Download references

Author information

Authors and Affiliations

South China Normal University, China
Yiping Zhang
Benesse Educational Research and Development Center, Japan
Hiroshi Watanabe

Authors

Yiping Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Watanabe
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yiping Zhang.

About this article

Cite this article

Zhang, Y., Watanabe, H. A Multi-Dimensional Continuous Item Response Model for Probability Testing. Behaviormetrika 39, 183–197 (2012). https://doi.org/10.2333/bhmk.39.183

Download citation

Received: 19 October 2011
Revised: 02 July 2012
Published: 15 July 2012
Issue Date: July 2012
DOI: https://doi.org/10.2333/bhmk.39.183

Key Words and Phrases

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Multi-Dimensional Continuous Item Response Model for Probability Testing

Abstract

Access this article

Similar content being viewed by others

A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models

Item Response Thresholds Models: A General Class of Models for Varying Types of Items

Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling

References

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Key Words and Phrases

Navigation

A Multi-Dimensional Continuous Item Response Model for Probability Testing

Abstract

Access this article

Similar content being viewed by others

A comparison of Monte Carlo methods for computing marginal likelihoods of item response theory models

Item Response Thresholds Models: A General Class of Models for Varying Types of Items

Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling

References

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Share this article

Key Words and Phrases

Search

Navigation