Abstract
The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by “hiding” the variance attributable to the sample of attributes used on an evaluation form. This hidden source of variance is the set of rating items that typically make up an MMI evaluation form. Because the MMI has a multi-faceted, repeated-measures format, its reliability has been evaluated primarily using generalizability (G) theory. A key assumption of G theory is that G studies model the most important sources of variance over which a researcher plans to generalize. Because a G study can attribute variance only to the facets it models, failure to model potentially substantial sources of variation in MMI scores can result in biased estimates of variance components. This study demonstrates the implications of hiding the item facet in MMI studies when true item-level effects exist. An extensive Monte Carlo simulation study was conducted to examine whether a commonly used hidden-item person-by-station (p × s|i) G study design yields biased variance component estimates. Estimates from this hidden-item model were compared with estimates from a more complete person-by-station-by-item (p × s × i) model. Results suggest that when true item-level effects exist, the hidden-item model (p × s|i) produces biased variance component estimates, which can in turn bias reliability estimates; therefore, researchers should consider using the more complete person-by-station-by-item model (p × s × i) when evaluating the generalizability of MMI scores.
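The comparison described above can be illustrated with a small simulation. The following is a minimal sketch, not the study's actual procedure: it assumes a balanced, fully crossed design, independent normal effects, ANOVA (expected mean square) estimators, and a hidden-item analysis that averages scores over items before fitting p × s. The sample sizes, the variance values in `TRUE_VAR`, and all function names are hypothetical.

```python
import numpy as np

N_P, N_S, N_I = 100, 8, 5  # persons, stations, items (hypothetical sizes)

# Hypothetical true variance components, not values taken from the study.
TRUE_VAR = {"p": 1.0, "s": 0.3, "i": 0.2, "ps": 0.8,
            "pi": 0.5, "si": 0.2, "psi": 1.0}


def simulate_scores(var, rng):
    """Draw one balanced, fully crossed p x s x i array of scores."""
    def effect(name, shape):
        return rng.normal(0.0, np.sqrt(var[name]), shape)
    return (effect("p", (N_P, 1, 1)) + effect("s", (1, N_S, 1))
            + effect("i", (1, 1, N_I)) + effect("ps", (N_P, N_S, 1))
            + effect("pi", (N_P, 1, N_I)) + effect("si", (1, N_S, N_I))
            + effect("psi", (N_P, N_S, N_I)))


def full_components(x):
    """ANOVA variance component estimates for the p x s x i random model."""
    g = x.mean()
    mp, ms, mi = x.mean(axis=(1, 2)), x.mean(axis=(0, 2)), x.mean(axis=(0, 1))
    mps, mpi, msi = x.mean(axis=2), x.mean(axis=1), x.mean(axis=0)
    ms_p = N_S * N_I * np.sum((mp - g) ** 2) / (N_P - 1)
    ms_s = N_P * N_I * np.sum((ms - g) ** 2) / (N_S - 1)
    ms_i = N_P * N_S * np.sum((mi - g) ** 2) / (N_I - 1)
    r_ps = mps - mp[:, None] - ms[None, :] + g
    r_pi = mpi - mp[:, None] - mi[None, :] + g
    r_si = msi - ms[:, None] - mi[None, :] + g
    ms_ps = N_I * np.sum(r_ps ** 2) / ((N_P - 1) * (N_S - 1))
    ms_pi = N_S * np.sum(r_pi ** 2) / ((N_P - 1) * (N_I - 1))
    ms_si = N_P * np.sum(r_si ** 2) / ((N_S - 1) * (N_I - 1))
    r_psi = (x - mps[:, :, None] - mpi[:, None, :] - msi[None, :, :]
             + mp[:, None, None] + ms[None, :, None] + mi[None, None, :] - g)
    ms_psi = np.sum(r_psi ** 2) / ((N_P - 1) * (N_S - 1) * (N_I - 1))
    return {
        "p": (ms_p - ms_ps - ms_pi + ms_psi) / (N_S * N_I),
        "s": (ms_s - ms_ps - ms_si + ms_psi) / (N_P * N_I),
        "i": (ms_i - ms_pi - ms_si + ms_psi) / (N_P * N_S),
        "ps": (ms_ps - ms_psi) / N_I,
        "pi": (ms_pi - ms_psi) / N_S,
        "si": (ms_si - ms_psi) / N_P,
        "psi": ms_psi,
    }


def hidden_components(x):
    """ANOVA estimates for a p x s model fitted to item-averaged scores."""
    y = x.mean(axis=2)  # the item facet is hidden by averaging over items
    g, mp, ms = y.mean(), y.mean(axis=1), y.mean(axis=0)
    ms_p = N_S * np.sum((mp - g) ** 2) / (N_P - 1)
    ms_s = N_P * np.sum((ms - g) ** 2) / (N_S - 1)
    resid = y - mp[:, None] - ms[None, :] + g
    ms_ps = np.sum(resid ** 2) / ((N_P - 1) * (N_S - 1))
    return {"p": (ms_p - ms_ps) / N_S, "s": (ms_s - ms_ps) / N_P, "ps": ms_ps}


def run_study(var, n_reps, seed):
    """Average both sets of estimates over n_reps simulated data sets."""
    rng = np.random.default_rng(seed)
    data = [simulate_scores(var, rng) for _ in range(n_reps)]
    hid = [hidden_components(x) for x in data]
    ful = [full_components(x) for x in data]
    mean = lambda ests: {k: float(np.mean([e[k] for e in ests])) for k in ests[0]}
    return mean(hid), mean(ful)


hidden, full = run_study(TRUE_VAR, n_reps=200, seed=1)
for k in ("p", "s", "ps"):
    print(f"{k}: true={TRUE_VAR[k]:.2f} hidden={hidden[k]:.3f} full={full[k]:.3f}")
```

Under these assumptions, the hidden-item estimate of the person component equals the full-model person estimate plus the person-by-item estimate divided by the number of items, so any true person-by-item variance is absorbed into the apparent person variance. The station and person-by-station components absorb the station-by-item and residual variance in the same way.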








Cite this article
Zaidi, N.L.B., Swoboda, C.M., Kelcey, B.M. et al. Hidden item variance in multiple mini-interview scores. Adv in Health Sci Educ 22, 337–363 (2017). https://doi.org/10.1007/s10459-016-9706-5
DOI: https://doi.org/10.1007/s10459-016-9706-5
