Skip to main content
Log in

Empirical bayes estimates of domain scores under binomial and hypergeometric distributions for test scores

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

We introduce two simple empirical approximate Bayes estimators (EABEs)—\(\widetilde{d}_N (x)\) and\(\widetilde\delta _N (x)\)—for estimating domain scores under binomial and hypergeometric distributions, respectively. Both EABEs (derived from corresponding marginal distributions of observed test scorex without relying on knowledge of prior domain score distributions) have been proven to hold Δ-asymptotic optimality in Robbins' sense of convergence in mean. We found that, where\(\widetilde{d}^* _N\) and\(\widetilde\delta ^* _N\) are the monotonized versions of\(\widetilde{d}_N\) and\(\widetilde\delta _N\) under Van Houwelingen's monotonization method, respectively, the convergence rate of the overall expected loss of Bayes risk in either\(\widetilde{d}^* _N\) or\(\widetilde\delta ^* _N\) depends on test length, sample size, and ratio of test length to size of domain items. In terms of conditional Bayes risk,\(\widetilde{d}^* _N\) and\(\widetilde\delta ^* _N\) outperform their maximum likelihood counterparts over the middle range of domain scales. In terms of mean-squared error, we also found that: (a) given a unimodal prior distribution of domain scores,\(\widetilde\delta ^* _N\) performs better than both\(\widetilde{d}^* _N\) and a linear EBE of the beta-binomial model when domain item size is small or when test items reflect a high degree of heterogeneity; (b)\(\widetilde{d}^* _N\) performs as well as\(\widetilde\delta ^* _N\) when prior distribution is bimodal and test items are homogeneous; and (c) the linear EBE is extremely robust when a large pool of homogeneous items plus a unimodal prior distribution exists.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • American Psychological Association, American Educational Research Association, & National Council on Measurement in Education. (1985).Standards for educational and psychological tests. Washington, DC: American Psychological Association.

    Google Scholar 

  • Berk, R. (1980). A consumer's guide to criterion-referenced test reliability.Journal of Educational Measurement, 17, 323–349.

    Article  Google Scholar 

  • Chung, K. L. (1974).A course in probability theory. New York: Academic Press.

    Google Scholar 

  • Cressie, N. (1982). A useful empirical Bayes identity.The Annals of Statistics, 10, 625–629.

    Google Scholar 

  • Cressie, N., & Seheult, A. (1985). Empirical Bayes estimation in sampling inspection.Biometrika, 72, 451–458.

    Google Scholar 

  • Deely, J. J., & Lindley, D. V. (1981). Bayes Empirical Bayes.Journal of the American Statistical Association, 76, 833–841.

    Google Scholar 

  • Johnson, N., & Kotz, S. (1969).Discrete distribution in statistics: Distributions. New York: Wiley.

    Google Scholar 

  • Keats, J. A., & Lord, F. M. (1962). A theoretical distribution for mental test scores.Psychometrika, 27, 59–72.

    Article  Google Scholar 

  • Lin, M. H., Hsiung, C. A., & Hsiao, C. F. (1994). A computing program for monotonizing two empirical Bayes estimators in binomial and hypergeometric data distributions.Psychometrika, 59, 423–424.

    Google Scholar 

  • Lord, F. M., & Novick, M. R. (1968).Statistical theories of mental test scores. Reading, MA: Addison-Wesley.

    Google Scholar 

  • Maritz, J. S., & Lwin, T. (1975). Construction of simple empirical Bayes estimators.Journal of the Royal Statistical Society, Series B, 39, 421–425.

    Google Scholar 

  • Maritz, J., & Lwin, J. (1989).Empirical Bayes methods. London: Chapman and Hall.

    Google Scholar 

  • Meredith, W., & Kearns, J. (1973). Empirical Bayes point estimates of latent trait scores without knowledge of the trait distribution.Psychometrika, 38, 533–554.

    Article  Google Scholar 

  • Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures.Psychological Bulletin, 105(1), 156–166.

    Article  Google Scholar 

  • Millman, J. (1974). Criterion referenced measurement. In W. J. Popham (Ed.).Evaluation in education: Current application. Berkeley, CA: McCutcheon.

    Google Scholar 

  • Mood, A., Graybill, F., & Boes, D. (1974).Introduction to the theory of statistics. New York: McGraw-Hill.

    Google Scholar 

  • Nichols, W. G., & Tsokos, C. P. (1972). Empirical Bayes point estimation in a family of probability distributions.International Statistical Review, 40, 147–151.

    Google Scholar 

  • Popham, W. J. (1984). Specifying the domain of content or behaviors. In R. A. Berk (Ed.).A guide to criterion-referenced test construction (pp. 29–48). Baltimore: Johns Hopkins University Press.

    Google Scholar 

  • Robbins, H. (1964). The empirical Bayes approach to statistical decision problems.Annals Mathematical Statistics, 35, 1–20.

    Google Scholar 

  • Rutherford, J. R., & Krutchkoff, R. G. (1969). Some empirical Bayes techniques in point estimation.Biometrika, 56, 133–137.

    Google Scholar 

  • van der Linden, W. J. (1979). Binomial test models and item difficulty.Applied Psychological Measurement, 3, 401–411.

    Google Scholar 

  • Van Houwelingen, J. C. (1977). Monotonizing empirical Bayes estimators for a class of discrete distributions with monotone likelihood ratio.Statistica Neerlandica, 31, 95–104.

    Google Scholar 

  • Wilcox, R. R. (1979). A lower bound to the probability of choosing the optimal passing scores for a mastery test when there is an external criterion.Psychometrika, 44, 245–249.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

The authors are indebted to both anonymous reviewers, especially Reviewer 2, and the Editor for their invaluable comments and suggestions. Thanks are also due to Yuan-Chin Chang and Chin-Fu Hsiao for their help with our simulation and programming work.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, MH., Hsiung, C.A. Empirical bayes estimates of domain scores under binomial and hypergeometric distributions for test scores. Psychometrika 59, 331–359 (1994). https://doi.org/10.1007/BF02296128

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02296128

Key words

Navigation