Abstract
Optimal sampling designs for an IRT linking with improved efficiency are often sought in analyzing assessment data. In practice, the skill distribution of an assessment sample may be bimodal, and this warrants special consideration when trying to create these designs. In this study we explore optimal sampling designs for IRT linking of bimodal data. Our design paradigm is modeled and presents a formal setup for optimal IRT linking. In an optimal sampling design, the sample structure of bimodal data is treated as being drawn from a stratified population. The optimum search algorithm proposed is used to adjust the stratum weights and form a weighted compound sample that minimizes linking errors. The initial focus of the current study is the robust mean–mean transformation method, though the model of IRT linking under consideration is adaptable to generic methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Angoff WH (1984) Scales, norms, and equivalent scores. Educational Testing Service, Princeton
Berger MPF (1991) On the efficiency of IRT models when applied to different sampling designs. Appl Psychol Meas 15:293–306
Berger MPF (1997) Optimal designs for latent variable models: a review. In: Rost J, Langeheine R (eds) Application of latent trait and latent class models in the social sciences. Waxmann, Muenster, pp 71–79
Berger MPF, van der Linden WJ (1992) Optimality of sampling designs in item response theory models. In: Wilson M (ed) Objective measurement: theory into practice, vol 1. Ablex, Norwood, pp 274–288
Berger MPF, King CYJ, Wong WK (2000) Minimax D-optimal designs for item response theory models. Psychometrika 65:377–390
Beveridge GSG, Schechter RS (1970) Optimization: theory and practice. McGraw-Hill, New York
Buyske S (2005) Optimal design in educational testing. In: Berger MPF, Wong WK (eds) Applied optimal designs. Wiley, New York, pp 1–19
Cochran WG (1977) Sampling techniques, 3rd edn. Wiley, New York
Dorans NJ, Holland PW (2000) Population invariance and equitability of tests: basic theory and the linear case. J Educ Meas 37:281–306
Duong M, von Davier AA (2012) Observed-score equating with a heterogeneous target population. Int J Test 12:224–251
Haberman SJ (2009) Linking parameter estimates derived from an item response model through separate calibrations (research report 09–40). Educational Testing Service, Princeton
Haberman SJ, Lee Y, Qian J (2009) Jackknifing techniques for evaluation of equating accuracy (research report 09–39). Educational Testing Service, Princeton
Haebara T (1980) Equating logistic ability scales by a weighted least squares method. Jpn Psychol Res 22(3):144–149
Hua L-K, Wang Y, Heijmans JGC (1989) Optimum seeking methods (single variable). In: Lucas WF, Thompson M (eds) Mathematical modelling, vol 2. Springer, New York, pp 57–78
Jones DH, Jin Z (1994) Optimal sequential designs for on-line item estimation. Psychometrika 59:59–75
Kish L (1965) Survey sampling. Wiley, New York
Kolen MJ, Brennan RL (2004) Test equating, scaling, and linking: methods and practices. Springer, New York
Kuhn HW, Tucker AW (1951) Nonlinear programming. In: Neyman J (ed) Proceedings of the second Berkeley symposium on mathematical statistics and probability, University of California Press, Berkeley, pp 481–492
Lord FM (1980) Applications of item response theory to practical testing problems. Erlbaum, Hillsdale
Lord MF, Wingersky MS (1985) Sampling variances and covariances of parameter estimates in item response theory. In: Weiss DJ (ed) Proceedings of the 1982 IRT/CAT conference, Department of Psychology, CAT Laboratory, University of Minnesota, Minneapolis
Loyd BH, Hoover HD (1980) Vertical equating using the Rasch model. J Educ Meas 17:179–193
Mislevy RJ, Bock RD (1990) BILOG 3, 2nd edn. Scientific Software, Mooresville
Muraki E, Bock RD (2002) PARSCALE (Version 4.1) [Computer software]. Scientific Software, Lincolnwood
Nocedal J, Wright SJ (2006) Numerical optimization. Springer, New York
Qian J, Spencer B (1994). Optimally weighted means in stratified sampling. In: Proceedings of the section on survey research methods, American Statistical Association, pp 863–866
Qian J, von Davier AA, Jiang Y (2013) Achieving a stable scale for an assessment with multiple forms: weighting test samples in IRT linking. In: Millsap RE, van der Ark LA, Bolt DM, Woods CM (eds) Springer proceedings in mathematics & statistics, new developments in quantitative psychology. Springer, New York, pp 171–185
Silvey SD (1970) Statistical inference. Penguin Books, Baltimore
Stocking ML (1990) Specifying optimum examinees for item parameter estimation in item response theory. Psychometrika 55:461–475
Stocking ML, Lord FM (1983) Developing a common metric in item response theory. Appl Psychol Meas 7:201–210
van der Linden WJ, Luecht RM (1998) Observed-score equating as a test assembly problem. Psychometrika 63:401–418
von Davier M, von Davier AA (2011) A general model for IRT scale linking and scale transformation. In: von Davier AA (ed) Statistical models for test equating, scaling, and linking. Springer, New York, pp 225–242
von Davier AA, Wilson C (2008) Investigating the population sensitivity assumption of item response theory true-score equating across two subgroups of examinees and two test formats. Appl Psychol Meas 32:11–26
von Davier AA, Holland PW, Thayer DT (2004) The kernel method of test equating. Springer, New York
Wilde DJ (1964) Optimum seeking methods. Prentice-Hall, Englewood Cliffs
Wolter K (2007) Introduction to variance estimation, 2nd edn. Springer, New York
Zumbo BD (2007) Validity: foundational issues and statistical methodology. In: Rao CR, Sinharay S (eds) Handbook of statistics, vol 26, Psychometrics. Elsevier Science B.V, Amsterdam, pp 45–79
Acknowledgments
The authors thank Jim Carlson, Shelby Haberman, Yi-Hsuan Lee, Ying Lu, and Daniel Bolt for their suggestions and comments. The authors also thank Shuhong Li and Jill Carey for their assistance in assembling data and Kim Fryer for editorial help. Any opinions expressed in this paper are solely those of the authors and not necessarily those of ETS.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Qian, J., von Davier, A.A. (2015). Optimal Sampling Design for IRT Linking with Bimodal Data. In: Millsap, R., Bolt, D., van der Ark, L., Wang, WC. (eds) Quantitative Psychology Research. Springer Proceedings in Mathematics & Statistics, vol 89. Springer, Cham. https://doi.org/10.1007/978-3-319-07503-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-07503-7_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07502-0
Online ISBN: 978-3-319-07503-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)