Abstract
Clinical skills assessments have traditionally been scored via experts' ratings of examinee performance. However, this approach to scoring may be impractical in a large-scale context due to logistical and cost considerations as well as the increased probability of rater error. The purpose of this investigation was therefore to identify, using discriminant analysis, weighted score-based models that maximize the accuracy with which mastery level can be estimated for examinees taking a nationally administered standardized patient test. The accuracy with which the resulting classification functions predicted mastery level for a cross-validation sample of examinees was also examined. Results suggest that it might be feasible to implement an automated scoring procedure in a cost-effective manner while still retaining the important facets of the decision-making process of expert raters. Cost-benefit, test development, and psychometric implications of these results are discussed in the full paper.
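The general approach described in the abstract can be sketched as follows. This is an illustrative sketch only, not the authors' analysis: the examinee data, case structure, and score weights are simulated, and scikit-learn's linear discriminant analysis stands in for whatever discriminant procedure the paper used. It shows fitting a classification function on case scores to predict mastery level, then checking its accuracy on a held-out cross-validation sample.

```python
# Hedged sketch: simulated data, hypothetical sample sizes and score scales.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Hypothetical data: 400 examinees x 10 case scores, with masters scoring
# higher on average than non-masters.
n_examinees, n_cases = 400, 10
mastery = rng.integers(0, 2, size=n_examinees)     # 1 = master, 0 = non-master
scores = rng.normal(60, 10, size=(n_examinees, n_cases)) + mastery[:, None] * 8

# Hold out a cross-validation sample, mirroring the calibration/validation design.
X_train, X_test, y_train, y_test = train_test_split(
    scores, mastery, test_size=0.25, random_state=0)

# Fit the discriminant function on the calibration sample.
lda = LinearDiscriminantAnalysis()
lda.fit(X_train, y_train)

# Classification accuracy on calibration and cross-validation samples.
print(f"calibration accuracy:      {lda.score(X_train, y_train):.2f}")
print(f"cross-validation accuracy: {lda.score(X_test, y_test):.2f}")
```

Comparing the two accuracy figures gives a rough sense of how much the classification function capitalizes on chance in the calibration sample, which is the question the cross-validation step in the study is designed to answer.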
Cite this article
De Champlain, A.F., Margolis, M.J., Macmillan, M.K. et al. Predicting Mastery Level on a Large-scale Standardized Patient Test: A Comparison of Case and Instrument Score-based Models Using Discriminant Function Analysis. Adv Health Sci Educ Theory Pract 6, 151–158 (2001). https://doi.org/10.1023/A:1011421706300