Efficiency of multiple-choice tests as a function of spread of item difficulties

Cronbach, Lee J.; Warrington, Willard G.

doi:10.1007/BF02288778

Efficiency of multiple-choice tests as a function of spread of item difficulties

Published: June 1952

Volume 17, pages 127–147, (1952)
Cite this article

Psychometrika Aims and scope Submit manuscript

Lee J. Cronbach¹ &
Willard G. Warrington¹

176 Accesses
27 Citations
Explore all metrics

Abstract

The validity of a univocal multiple-choice test is determined for varying distributions of item difficulty and varying degrees of item precision. Validity is a function ofσ ²_d +σ ²_v , whereσ _d measures item unreliability andσ _v measures the spread of item difficulties. When this variance is very small, validity is high for one optimum cutting score, but the test gives relatively little valid information for other cutting scores. As this variance increases, eta increases up to a certain point, and then begins to decrease. Screening validity at the optimum cutting score declines as this variance increases, but the test becomes much more flexible, maintaining the same validity for a wide range of cutting scores. For items of the type ordinarily used in psychological tests, the test with uniform item difficulty gives greater over-all validity, and superior validity for most cutting scores, compared to a test with a range of item difficulties. When a multiple-choice test is intended to reject the poorestF per cent of the men tested, items should on the average be located at or above the threshold for men whose true ability is at theFth percentile.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

Sander Greenland, Stephen J. Senn, … Douglas G. Altman

Small is beautiful: In defense of the small-N design

Article Open access 19 March 2018

Philip L. Smith & Daniel R. Little

Violating the normality assumption may be the lesser of two evils

Article Open access 07 May 2021

Ulrich Knief & Wolfgang Forstmeier

References

Brogden, H. E. Variation in test validity with variation in the distribution of item difficulties, number of items, and degree of their intercorrelation.Psychometrika, 1946,11, 197–214.
Google Scholar
Carroll, J. B. The effect of difficulty and chance success on correlations between items or between tests.Psychometrika, 1945,10, 1–19.
Google Scholar
Gulliksen, H. The relation of item difficulty and interitem correlation to test variance and reliability.Psychometrika, 1945,10, 79–91.
Google Scholar
Lord, F. M. A theory of test scores and their relation to the trait measured.Res. Bull. 51–13, Educational Testing Service, 1951. See also A theory of test scores. Psychometric Monograph No. 7, 1952.
Richardson, M. W. The relation between the difficulty and the differential validity of a test.Psychometrika, 1936,1, 33–49.
Google Scholar
Tucker, L. R. Maximum validity of a test with equivalent items.Psychometrika, 1946,11, 1–13.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Illinois, USA
Lee J. Cronbach & Willard G. Warrington

Authors

Lee J. Cronbach
View author publications
You can also search for this author in PubMed Google Scholar
Willard G. Warrington
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

This research was performed under contract Nop 536 with the Bureau of Naval Personnel, and received additional support from the Bureau of Research and Service, College of Education, University of Illinois.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cronbach, L.J., Warrington, W.G. Efficiency of multiple-choice tests as a function of spread of item difficulties. Psychometrika 17, 127–147 (1952). https://doi.org/10.1007/BF02288778

Download citation

Received: 06 August 1951
Revised: 08 October 1951
Issue Date: June 1952
DOI: https://doi.org/10.1007/BF02288778

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Efficiency of multiple-choice tests as a function of spread of item difficulties

Abstract

Access this article

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Small is beautiful: In defense of the small-N design

Violating the normality assumption may be the lesser of two evils

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Efficiency of multiple-choice tests as a function of spread of item difficulties

Abstract

Access this article

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Small is beautiful: In defense of the small-N design

Violating the normality assumption may be the lesser of two evils

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation