A Paired Comparison IRT Model Using 3-Value Judgment: Estimation of Item Difficulty Parameters Prior to the Administration of the Test

Ozaki, Koken; Toyoda, Hideki

doi:10.2333/bhmk.33.131

Published: 15 September 2006

Behaviormetrika, Volume 33, pages 131–147 (2006)

Abstract

Currently, operating a test under Item Response Theory (IRT) requires that item parameters be estimated from examinee response data. After equating, the items may be placed in an item pool and reused across several tests. This procedure, however, carries a risk that item content will leak. It would therefore be useful to estimate item parameters while keeping item content secret. To make this possible, this study proposes a model in which item parameters are estimated from paired comparisons of item difficulty made by a rater familiar with the field. The estimation accuracy of the model was confirmed in a simulation study, and its feasibility in practical settings was demonstrated using actual data.
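
The abstract does not give the model equations, so the following is only a minimal sketch of what a 3-value paired comparison might look like, not the authors' formulation. It assumes a generic cumulative-logit model in which a rater judges item i as harder, about equal, or easier than item j. The names `judgment_probs` and `log_likelihood`, the tie threshold `tau`, and the outcome coding are all hypothetical choices made for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def judgment_probs(b_i: float, b_j: float, tau: float) -> tuple[float, float, float]:
    """Return (P(i judged harder), P(judged about equal), P(j judged harder)).

    Assumed model (illustrative only): the rater reports "i harder" when a
    logistic-noise comparison of b_i - b_j exceeds the tie threshold tau,
    "j harder" when it falls below -tau, and "about equal" otherwise.
    """
    if tau <= 0:
        raise ValueError("tau must be positive so that ties have non-zero probability")
    p_i_harder = sigmoid(b_i - b_j - tau)
    p_j_harder = sigmoid(b_j - b_i - tau)
    p_equal = 1.0 - p_i_harder - p_j_harder
    return p_i_harder, p_equal, p_j_harder


def log_likelihood(difficulties, tau, judgments):
    """Log-likelihood of rater judgments under the assumed model.

    judgments: iterable of (i, j, outcome) with outcome coded as
    0 = item i harder, 1 = about equal, 2 = item j harder.
    """
    total = 0.0
    for i, j, outcome in judgments:
        probs = judgment_probs(difficulties[i], difficulties[j], tau)
        total += math.log(probs[outcome])
    return total


if __name__ == "__main__":
    # Hypothetical difficulties for three items and a handful of judgments.
    b = [0.0, 1.0, -0.5]
    data = [(1, 0, 0), (0, 2, 1), (2, 1, 2)]
    print(judgment_probs(b[1], b[0], tau=0.5))
    print(log_likelihood(b, 0.5, data))
```

Under these assumptions, maximizing `log_likelihood` over the difficulties (with one item fixed to set the scale) would yield difficulty estimates from rater judgments alone, without examinee responses. Linking such estimates to the IRT scale used operationally is a separate step that this sketch does not address.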

Author information

Authors and Affiliations

Graduate School of Letters, Arts and Sciences, Waseda University, Japan
Koken Ozaki
School of Letters, Arts and Sciences, Waseda University, Japan
Hideki Toyoda

Corresponding author

Correspondence to Koken Ozaki.

About this article

Cite this article

Ozaki, K., Toyoda, H. A Paired Comparison IRT Model Using 3-Value Judgment: Estimation of Item Difficulty Parameters Prior to the Administration of the Test. Behaviormetrika 33, 131–147 (2006). https://doi.org/10.2333/bhmk.33.131

Received: 31 May 2005
Revised: 27 March 2006
Published: 15 September 2006
Issue Date: July 2006
DOI: https://doi.org/10.2333/bhmk.33.131
