A Recursive Algorithm for IRT Weighted Observed Score Equating

Chien, Yuehmei; Shin, Ching David

doi:10.1007/978-1-4614-9348-8_24

Yuehmei Chien⁵ &
Ching David Shin⁵

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 66))

1718 Accesses

Abstract

There are various reasons for placing different weights on items of test forms such as increasing test reliability or validity and improving measurement precision. Different weighting schemes have been used to accommodate different purposes under different testing situations. However, when the items are weighted, the question is how to equate the test forms containing those weighted items. Under IRT, there are two commonly used equating methods—IRT true score equating and IRT observed score equating. Applying the weights on items to IRT true score equating is straightforward and the software WITSE (Chien and Shin, WITSE: A program for weighted IRT true score equating, Version 1.0. Iowa City, IA: Pearson, 2008) had been specifically developed for weighted scores using IRT true score equating. Yet, currently, there is no procedure or algorithm available for the IRT weighted observed score equating due to the great complexity augmented by imposing weights on items. The regular IRT observed score equating constructs the estimated observed score distributions for two test forms, which are typically obtained using recursive algorithm. However, when items have different weights, the recursive algorithm is no longer feasible. Therefore, an extended recursive algorithm based on the recursive algorithm is proposed in this paper to construct the estimated observed score distribution and is illustrated with a real data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Brennan, R. L., Wang, T., Kim, S., & Seol, J. (2009). Equating recipes (CASMA Monograph Number 1). Iowa City, IA: Center for Advanced Studies in Measurement and Assessment, the University of Iowa. Available from the web address: http://www.uiowa.edu/~casma
Chang, S. (2009). Choice of weighting scheme in forming the composite. Bulletin of Educational Psychology, 40(3), 489–510.
Google Scholar
Chien, Y., & Shin, D. C. (2008). WITSE: A program for weighted IRT true score equating, Version 1.0. Iowa City, IA: Pearson.
Google Scholar
Ercikan, K., Schwarz, R. D., Julian, M. W., Burket, G. R., Weber, M. M., & Link, V. (1998). Calibration and scoring of tests with multiple-choice and constructed-response item types. Journal of Educational Measurement, 35, 137–154.
Article Google Scholar
Gulliksen, H. (1950). Theory of mental tests. New York: Wiley.
Book Google Scholar
Ito, K., & Sykes, R. C. (2000, June). An evaluation of “intentional” weighting of extended-response or constructed-response items in tests with mixed item types. Paper presented at the annual national conference on large scale assessment, Snowbird, Utah.
Google Scholar
Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking. New York, NY: Springer.
Book MATH Google Scholar
Lord, F. M. & Wingersky, M. S. (1984). Comparison of IRT true-score and equipercentile observed-score equatings. Applied Psychological Measurement, 8, 453–461.
Google Scholar
Lukhele, R., & Sireci, G. (1995). Using IRT to combine multiple-choice and free-response sections of a test onto a common scale using a priori weights. Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco, CA.
Google Scholar
McDonald, R. P. (1968). A unified treatment of the weighting problem. Psychometrika, 33, 351–381.
Article MathSciNet MATH Google Scholar
Rudner, L. M. (2001). Informed test component weighting. Educational Measurement: Issues and Practice, 20(1), 16–19.
Google Scholar
Schaeffer, G. A., Henderson-Montero, D., & Julian, M. (2002). A comparison of three scoring methods for tests with selected-response and constructed-response items. Educational Assessment, 8(4), 317–340.
Article Google Scholar
Stucky, B. D. (2009). Item response theory for weighted summed scores (Unpublished manuscript).
Google Scholar
Sykes, R. C., & Hou, L. (2003). Weighting constructed-response items in IRT-based exams. Applied Measurement in Education, 16, 257–275.
Article Google Scholar
Sykes, R. C., Truskosky, D., & White, H. (2001, April). Determining the representation of constructed-response items in mixed-item format exams. Paper presented at the annual meeting of the National Council on Measurement in Education, Seattle, WA.
Google Scholar
Wainer, H., & Thissen, D. (1993). Combining multiple-choice and constructed response test scores: Toward a Marxist theory of test construction. Applied Measurement in Education, 6(2), 103–118.
Article Google Scholar
Wang, M. D., & Stanley, J. C. (1970). Differential weighting: A review of methods and empirical studies. Review of Educational Research, 40, 663–705.
Article Google Scholar
Wilson, M., & Wang, W. (1995). Complex composites: Issues that arise in combining different modes of assessment. Applied Psychological Measurement, 19, 51–71.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Pearson, 2510 N Dodge St., Iowa City, IA, 52245, USA
Yuehmei Chien & Ching David Shin

Authors

Yuehmei Chien
View author publications
You can also search for this author in PubMed Google Scholar
Ching David Shin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuehmei Chien .

Editor information

Editors and Affiliations

Department of Psychology, Arizona State University, Tempe, AZ, USA
Roger E. Millsap
Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands
L. Andries van der Ark
Department of Educational Psychology, University of Wisconsin, Madison, WI, USA
Daniel M. Bolt
Department of Psychology, University of Kansas, Lawrence, KS, USA
Carol M. Woods

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chien, Y., Shin, C.D. (2013). A Recursive Algorithm for IRT Weighted Observed Score Equating. In: Millsap, R.E., van der Ark, L.A., Bolt, D.M., Woods, C.M. (eds) New Developments in Quantitative Psychology. Springer Proceedings in Mathematics & Statistics, vol 66. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-9348-8_24

Download citation

DOI: https://doi.org/10.1007/978-1-4614-9348-8_24
Published: 13 January 2014
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-9347-1
Online ISBN: 978-1-4614-9348-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics