Empirical Bayes estimates of domain scores under binomial and hypergeometric distributions for test scores

Lin, Miao-Hsiang; Hsiung, Chao A.

doi:10.1007/BF02296128

Published: September 1994

Psychometrika, Volume 59, pages 331–359 (1994)

Miao-Hsiang Lin¹ &
Chao A. Hsiung¹

Abstract

We introduce two simple empirical approximate Bayes estimators (EABEs), \(\widetilde{d}_N (x)\) and \(\widetilde\delta _N (x)\), for estimating domain scores under binomial and hypergeometric distributions, respectively. Both EABEs (derived from the corresponding marginal distributions of the observed test score \(x\), without relying on knowledge of prior domain score distributions) have been proven to hold Δ-asymptotic optimality in Robbins' sense of convergence in mean. We found that, where \(\widetilde{d}^* _N\) and \(\widetilde\delta ^* _N\) are the monotonized versions of \(\widetilde{d}_N\) and \(\widetilde\delta _N\) under Van Houwelingen's monotonization method, respectively, the convergence rate of the overall expected loss of Bayes risk in either \(\widetilde{d}^* _N\) or \(\widetilde\delta ^* _N\) depends on test length, sample size, and the ratio of test length to the size of the item domain. In terms of conditional Bayes risk, \(\widetilde{d}^* _N\) and \(\widetilde\delta ^* _N\) outperform their maximum likelihood counterparts over the middle range of domain scales. In terms of mean-squared error, we also found that: (a) given a unimodal prior distribution of domain scores, \(\widetilde\delta ^* _N\) performs better than both \(\widetilde{d}^* _N\) and a linear EBE of the beta-binomial model when the domain item size is small or when test items reflect a high degree of heterogeneity; (b) \(\widetilde{d}^* _N\) performs as well as \(\widetilde\delta ^* _N\) when the prior distribution is bimodal and test items are homogeneous; and (c) the linear EBE is extremely robust when a large pool of homogeneous items plus a unimodal prior distribution exists.
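The abstract does not give formulas for the two EABEs, so they are not reproduced here. As a rough illustration of the comparison it describes, the sketch below implements only the linear empirical Bayes estimator of the beta-binomial model (the comparator named in the abstract) and checks its mean-squared error against the maximum likelihood estimate x/n in a small simulation. The beta prior parameters, test length, sample size, and all function names are assumptions chosen for illustration, not values taken from the article.

```python
import random
from statistics import mean, pvariance


def kr21(scores, n_items):
    """KR-21 reliability estimate from number-correct scores on an n_items test.

    Under a beta-binomial model the population value equals n / (n + a + b),
    which is the weight the posterior mean places on the observed proportion.
    """
    m = mean(scores)
    v = pvariance(scores)
    if v == 0:
        return 0.0
    rel = (n_items / (n_items - 1)) * (1 - m * (n_items - m) / (n_items * v))
    return min(max(rel, 0.0), 1.0)  # clamp to [0, 1] for small samples


def linear_eb(scores, n_items):
    """Shrink each observed proportion toward the group mean proportion."""
    rel = kr21(scores, n_items)
    grand = mean(scores) / n_items
    return [rel * (x / n_items) + (1 - rel) * grand for x in scores]


def simulate(n_examinees=2000, n_items=20, a=4.0, b=2.0, seed=0):
    """Return (MSE of x/n, MSE of linear EB) under an assumed Beta(a, b) prior."""
    rng = random.Random(seed)
    # True domain scores drawn from the assumed unimodal prior.
    theta = [rng.betavariate(a, b) for _ in range(n_examinees)]
    # Binomial test scores: n_items independent items per examinee.
    scores = [sum(rng.random() < t for _ in range(n_items)) for t in theta]
    mle = [x / n_items for x in scores]
    eb = linear_eb(scores, n_items)

    def mse(est):
        return mean((e - t) ** 2 for e, t in zip(est, theta))

    return mse(mle), mse(eb)


if __name__ == "__main__":
    mle_mse, eb_mse = simulate()
    print(f"MSE of x/n: {mle_mse:.5f}   MSE of linear EB: {eb_mse:.5f}")
```

Because the simulated prior here is itself a beta distribution, this setting favours the linear estimator; it says nothing about the bimodal-prior or heterogeneous-item cases that the abstract discusses.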

References

American Psychological Association, American Educational Research Association, & National Council on Measurement in Education. (1985). Standards for educational and psychological tests. Washington, DC: American Psychological Association.
Berk, R. (1980). A consumer's guide to criterion-referenced test reliability. Journal of Educational Measurement, 17, 323–349.
Chung, K. L. (1974). A course in probability theory. New York: Academic Press.
Cressie, N. (1982). A useful empirical Bayes identity. The Annals of Statistics, 10, 625–629.
Cressie, N., & Seheult, A. (1985). Empirical Bayes estimation in sampling inspection. Biometrika, 72, 451–458.
Deely, J. J., & Lindley, D. V. (1981). Bayes Empirical Bayes. Journal of the American Statistical Association, 76, 833–841.
Johnson, N., & Kotz, S. (1969). Discrete distribution in statistics: Distributions. New York: Wiley.
Keats, J. A., & Lord, F. M. (1962). A theoretical distribution for mental test scores. Psychometrika, 27, 59–72.
Lin, M. H., Hsiung, C. A., & Hsiao, C. F. (1994). A computing program for monotonizing two empirical Bayes estimators in binomial and hypergeometric data distributions. Psychometrika, 59, 423–424.
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
Maritz, J. S., & Lwin, T. (1975). Construction of simple empirical Bayes estimators. Journal of the Royal Statistical Society, Series B, 39, 421–425.
Maritz, J., & Lwin, J. (1989). Empirical Bayes methods. London: Chapman and Hall.
Meredith, W., & Kearns, J. (1973). Empirical Bayes point estimates of latent trait scores without knowledge of the trait distribution. Psychometrika, 38, 533–554.
Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures. Psychological Bulletin, 105(1), 156–166.
Millman, J. (1974). Criterion referenced measurement. In W. J. Popham (Ed.), Evaluation in education: Current application. Berkeley, CA: McCutcheon.
Mood, A., Graybill, F., & Boes, D. (1974). Introduction to the theory of statistics. New York: McGraw-Hill.
Nichols, W. G., & Tsokos, C. P. (1972). Empirical Bayes point estimation in a family of probability distributions. International Statistical Review, 40, 147–151.
Popham, W. J. (1984). Specifying the domain of content or behaviors. In R. A. Berk (Ed.), A guide to criterion-referenced test construction (pp. 29–48). Baltimore: Johns Hopkins University Press.
Robbins, H. (1964). The empirical Bayes approach to statistical decision problems. Annals of Mathematical Statistics, 35, 1–20.
Rutherford, J. R., & Krutchkoff, R. G. (1969). Some empirical Bayes techniques in point estimation. Biometrika, 56, 133–137.
van der Linden, W. J. (1979). Binomial test models and item difficulty. Applied Psychological Measurement, 3, 401–411.
Van Houwelingen, J. C. (1977). Monotonizing empirical Bayes estimators for a class of discrete distributions with monotone likelihood ratio. Statistica Neerlandica, 31, 95–104.
Wilcox, R. R. (1979). A lower bound to the probability of choosing the optimal passing scores for a mastery test when there is an external criterion. Psychometrika, 44, 245–249.

Author information

Authors and Affiliations

Institute of Statistical Science, Academia Sinica, 11529, Taipei, Taiwan, R.O.C.
Miao-Hsiang Lin & Chao A. Hsiung

Additional information

The authors are indebted to both anonymous reviewers, especially Reviewer 2, and the Editor for their invaluable comments and suggestions. Thanks are also due to Yuan-Chin Chang and Chin-Fu Hsiao for their help with our simulation and programming work.

About this article

Cite this article

Lin, MH., Hsiung, C.A. Empirical bayes estimates of domain scores under binomial and hypergeometric distributions for test scores. Psychometrika 59, 331–359 (1994). https://doi.org/10.1007/BF02296128

Received: 09 December 1991
Revised: 01 September 1993
Issue Date: September 1994
DOI: https://doi.org/10.1007/BF02296128
