Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

Puhan, Gautam; Gall, Leanne

doi:10.1007/s12646-012-0147-9

Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

Assessment
Published: 04 February 2012

Volume 57, pages 273–282, (2012)
Cite this article

Psychological Studies Aims and scope Submit manuscript

Gautam Puhan¹ &
Leanne Gall¹

176 Accesses
1 Citation
Explore all metrics

Abstract

The study evaluated the reliability of pass and fail classifications for several teacher certification tests. Since these tests are used in the context of a cut score to classify examinees as pass and fail, evaluating the accuracy and consistency of these classifications is important. The classification accuracy and consistency statistics were estimated using the RELCLASS software. Results indicated the following. (1) The 29 teacher certification tests that were examined had a relatively high classification accuracy (0.827 to 0.999) and consistency (0.760 to 0.999). (2) Both classification accuracy and consistency increased as the difference between the mean and cut score increased. (3) Classification accuracy and consistency was higher for multiple-choice (MC) as compared to tests consisting of only constructed-response (CR) items or a combination of CR and MC items.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cut-scores revisited: feasibility of a new method for group standard setting

Article Open access 07 June 2018

From Standards to Rubrics: Comparing Full-Range to At-Level Applications of an Item-Level Scoring Rubric on an Oral Proficiency Assessment

Measurement precision at the cut score in medical multiple choice exams: Theory matters

Article Open access 28 May 2020

Notes

Researchers interested in conducting similar studies can use the procedural steps documented in Livingston and Lewis (1995) for computing the reliability of classification.

References

Anderson, D. O., & Schneider, C. (October 2002). Reliability of tests used for classification. Paper presented at the annual conference of the Northeastern Educational Research Association, Kerhonkson, NY.
Breyer, F. J., & Lewis, C. (1994). Pass-fail reliability for tests with cut scores: A simplified method (ETS Research Report No. 94–39). Princeton: Educational Testing Service.
Google Scholar
Lee, W., Hanson, B. A., & Brennan, R. L. (2000). Procedures for computing classification and accuracy indices with multiple categories (ACT Research Report No. 2000–10). Iowa city: American College Testing.
Google Scholar
Livingston, S. A., & Lewis, C. (1995). Estimating the consistency and accuracy of classifications based on test scores. Journal of Educational Measurement, 32(2), 179–197.
Article Google Scholar
Livingston, S. A., & Wingersky, M. S. (1979). Assessing the reliability of tests used to make pass/fail decisions. Journal of Educational Measurement, 16(4), 247–260.
Article Google Scholar
Mroczka, R. C. (2000). RELCLASS-COMP (SOSA8P Version 4.11). Princeton: Educational Testing Service.
Google Scholar
Subkoviak, M. J. (1984). Estimating the reliability of mastery-nonmastery classifications. In R. A. Berk (Ed.), A guide to criterion-referenced test construction. Baltimore: John Hopkins University Press.
Google Scholar

Download references

Author information

Authors and Affiliations

Educational Testing Service, Princeton, NJ, USA
Gautam Puhan & Leanne Gall

Authors

Gautam Puhan
View author publications
You can also search for this author in PubMed Google Scholar
Leanne Gall
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gautam Puhan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Puhan, G., Gall, L. Reliability of Pass and Fail Decisions on Tests Employing Cut Scores. Psychol Stud 57, 273–282 (2012). https://doi.org/10.1007/s12646-012-0147-9

Download citation

Received: 27 April 2011
Accepted: 18 January 2012
Published: 04 February 2012
Issue Date: September 2012
DOI: https://doi.org/10.1007/s12646-012-0147-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

Abstract

Access this article

Similar content being viewed by others

Cut-scores revisited: feasibility of a new method for group standard setting

From Standards to Rubrics: Comparing Full-Range to At-Level Applications of an Item-Level Scoring Rubric on an Oral Proficiency Assessment

Measurement precision at the cut score in medical multiple choice exams: Theory matters

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Reliability of Pass and Fail Decisions on Tests Employing Cut Scores

Abstract

Access this article

Similar content being viewed by others

Cut-scores revisited: feasibility of a new method for group standard setting

From Standards to Rubrics: Comparing Full-Range to At-Level Applications of an Item-Level Scoring Rubric on an Oral Proficiency Assessment

Measurement precision at the cut score in medical multiple choice exams: Theory matters

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation