Introduction

Disparities in health and health care continue to plague our nation. Extensive evidence shows differences across racial/ethnic, gender, and geographic groupings.1 Access to health care, cultural and language differences, communication and trust barriers, and socioeconomic differences are clearly contributors to these disparities.1 However, there is evidence that stereotyping or decisional biases by clinicians in some situations may also play a role.112 For example, Schulman, et al. found that recommendations among practicing clinicians for standardized video vignette patients presenting with cardiac symptoms varied according to patient race and gender.13 Another study showed variations by patient race and socioeconomic status in clinician expectations of post-angiogram cardiac patients.14 Similarly, pain medication administered for major fractures in the emergency setting was shown to vary by patient ethnicity.15

Evidence of decisional bias among practicing clinicians raises questions regarding the origin of such biases as well as ways to eliminate them. Do they originate from similar biases that exist prior to entry into a medical career, are they a reflection of training processes, or are they a product of the practice environment? Answers to these questions could be of value to medical educators and policymakers.

Preliminary investigations seeking similar evidence of decisional bias among medical students have shown mixed findings. In one study, students shown videotapes of standardized black female or white male patients ascribed a lower value to the quality of life of the black female “patient” than to the white male “patient.”16 On the other hand, a more recent study failed to show that student assessments of patient vignettes varied by race or social class.17 Both of these studies, however, were based on students who were early in their medical school training and had no clinical experience, and the studies did not assess clinical decision-making. A third study used clinical-level students from a single institution to demonstrate evidence of situational gender bias in vignette patient diagnoses.18 We are unaware of any other published studies investigating evidence of bias regarding race, gender, or socioeconomic status in clinical decision-making among clinical-level medical students.

As a first step toward determining the origins of these biases, we undertook a study to test for the presence of such bias among medical students nearing the end of their medical school clinical training. Specifically, we tested for bias with respect to patient race, gender, and socioeconomic status (SES) in student recommendations for cardiac care vignette patients. We chose to study senior medical students in order to test whether such biases are present early in clinical training.

Methods

Design and Overview

We invited a national sample of senior medical students during the 2011–2012 academic year to participate in a brief online survey in which they were asked to choose between two clinically equivalent diagnostic or therapeutic options for a set of patient vignettes. We evaluated differences in student recommendations among the vignettes, when we varied the patient race, gender, and SES across the sample, with all other elements remaining fixed.

Survey Instrument

Three academic practicing cardiologists developed a set of eight vignettes of patients requiring cardiac diagnostic or treatment services. We focused these vignettes on cardiac care, given the well-supported evidence that blacks and women, despite clinically equivalent circumstances, are less likely to receive cardiac care procedures.1,1931 Two clinically equivalent options for recommendations, one involving a procedure and the other a non-procedural option, were developed for each vignette. The vignettes were created under the premise that in situations where clinical equivalency does not allow a clear choice, social psychology concepts would suggest that students would then be forced to base their decisions on secondary, non-clinical factors, such as patient demographics.

We included an additional 15 patient vignettes (“non-cardiac” vignettes) in the survey in order to divert attention from the research questions related to cardiac care patients. These non-cardiac vignettes presented the student with clinical scenarios involving controversy in preventive care services (e.g., age to begin mammography), medication choices (brand vs. generic equivalent), or management strategies for newly diagnosed chronic illness (behavior change vs. medication).

In the cardiac care vignettes, we varied the race and gender of the patient in each vignette across the sample such that individual students receiving the same vignette in their survey versions might see the patient as either male or female and/or black or white. Patient SES (as determined solely by the Hollingshead occupational scale32) was fixed for each individual vignette, but varied across the set of eight cardiac vignettes. Patient attributes were not varied for the non-cardiac vignettes.

We designed all vignettes to be quickly readable. Each participating student received a total of eight vignettes—three cardiac and five non-cardiac—randomly mixed and distributed across the sample to reduce the probability of neighboring students receiving the same vignettes. In order to further support students’ use of non-clinical factors in decision-making, the accompanying instructions clearly emphasized the clinical equivalence of the options presented and that students should not attempt to determine the “correct” option, as either option would be appropriate.

After students had completed the vignettes, they were asked to provide certain demographic information. They were also offered an opportunity to be entered into drawings to win one of two $50 gift certificates to be drawn at each medical school and one of 20 iPad 2s to be drawn nationally.

Survey invitations and administration were conducted online. We tracked screen time to record the length of time students took to answer each vignette. We preserved participant anonymity by directing survey data and contact email addresses for the drawings to separate, unlinked databases.

Survey Piloting

Cardiac care vignettes were piloted first with family medicine residents at two medical schools and then with earlier classes of graduating seniors at seven medical schools. Readers may view a final version of the survey at: http://www.medical-decision-making.com/.

Distribution Process

We contacted students through their medical schools. Depending on school preference, we sent invitations with embedded links to the survey either directly to the students (with up to five solicitations to non-responders), to a student LISTSERV, or to a contact person at the school, who then forwarded the invitation to the senior students. When a student clicked on the link to the survey, a unique version of the survey in mix of vignettes and cardiac patient race/gender was generated. In addition, we varied the order in which the options for each vignette were presented to the student. Automated re-invitation messages were forwarded every 7 days after the initial invitation.

Review and Approval Process

The study protocol was reviewed by the University of New Mexico Institutional Review Board and was determined to be exempt. Subsequently, we sought approval to survey school seniors from the administrators at 130 of the 131 U.S. allopathic medical school campuses graduating seniors in 2012. We also sought approval from the institutional review board at each school whose administrators agreed to participate.

Analysis

We eliminated from analysis any returned survey for which the mean recorded viewing time for the cardiac vignettes was less than 10 seconds—considered as the minimum valid length of time needed to read and respond to a vignette—in order to reduce contamination by respondents participating solely to be entered in the incentive drawings. We examined our remaining sample using standard descriptive statistics. We then compared proportions of seniors recommending cardiac procedural options by vignette patient gender, race, and SES using chi-square statistics. We tested for variation in procedural recommendations by student demographics, again with chi-square statistics. Because national database studies have suggested some regional disparities in delivery of cardiac services, we next examined the data for geographic regional differences in recommendations.33,34 To search for similar geographic patterns in our findings, we grouped students by the Association of American Medical Colleges (AAMC) approach to regional grouping of states: Northeast, South, Central, and West (Appendix).35We also used each school’s self-description (obtained from its website) as “public” or “private” to categorize the schools. Finally, we used weighted multilevel multivariate models to estimate the effects of patient gender, race, and SES on student recommendations, while controlling for student and school descriptors, as well as for clustering by medical school and for variation in medical school senior class size and response rate. These models were weighted by size of medical school class and individual school response rate. We looked for and found in one of the eight cardiac vignettes response patterns indicating a first position response bias (p < .01), so we included a variable for response order.

Results

Sample Characteristics

We successfully contacted administrators of 109 of the 130 school campuses over the 9-month enrollment period from August 2011 to March 2012. Twenty-five contacted schools declined to participate (Fig. 1), for a variety of reasons (e.g., administrator denial, local institutional review board process not open to outside investigators, etc.), resulting in a total of 84 participating schools (77 % of successfully contacted schools, 65 % of eligible schools). Data showed that public schools were more likely than private schools to participate (public 73 % vs. private 50 % participation, p < 0.01), and participating schools had a higher median percentage of students entering primary care in 2011 (41 % vs. 38 %, p < .01, Wilcoxon signed-rank test).36

Fig. 1
figure 1

Flow chart of participation in senior medical student survey

We received a total of 4,603 valid surveys from 11,438 seniors at the 84 schools (overall response rate 40.2 %). Demographic characteristics of these students compared with the students participating in the 2012 AAMC Graduate Questionnaire are presented in Table 1. Overall, the two samples were similar, though participants in our survey were slightly more likely to be white and non-Hispanic, and slightly more likely to be entering family medicine, internal medicine, or pediatrics.

Table 1 Comparison of Survey Participants to Association of American Medical Colleges (AAMC) Senior Medical Student Survey Participants, 2012*

Principal Analysis: Relationship of Vignette Race, Gender, and SES to Student Recommendations

Table 2 presents the results of bivariate analyses of student recommendations for cardiac vignette patients based on patient race and gender. Overall, students were slightly but statistically significantly more likely to recommend a procedural option for patients who were described as black (43.9 %) than patients described as white (42.0 %) (p = .03). With regard to gender, there was no statistically significant difference in recommendations between vignette patients presented as male (43.5 % procedural recommendation) vs. female (42.4 %) (p = .18). On the other hand, patient SES was a strong and significant predictor of student recommendations, as shown in Table 3, with the highest SES grouping (SES levels 7–9) most likely to receive procedural recommendations (p < .001).

Table 2 Comparison of Senior Medical Student Recommendations for Cardiac Vignette Patients Based on Vignette Patient Gender and Race, 2012
Table 3 Comparison of Senior Medical Student Recommendations for Cardiac Vignette Patients Based on Vignette Patient Socioeconomic Status (SES), 2012

Interactions of Race, Gender, and SES

We next examined recommendation proportions for the four combinations of patient race and gender (Table 4). Overall, there was a non-significant difference in white females being offered procedures less frequently than any other group. With inclusion of SES, when the patient was presented as being in the lowest SES group (SES 1–2), students were more likely to recommend procedures for black patients (Table 2), and least likely to do so for white female patients (Table 4).

Table 4 Comparison of Senior Medical Student Recommendations for Cardiac Vignette Patients Based on Combinations of Vignette Patient Race and Gender, 2012

Subgroup Analysis: Relationship of Student Demographics, Geographic Region, and School Type to Student Recommendations. Student Demographics

We found that students who described their family of origin as upper middle class or who described their race/ethnicity as other than African American, Hispanic, or Native American were more likely to recommend procedural options for patients described as black (Table 2). Recommendations based on patient vignette gender and on patient SES did not vary across student demographic subgroups, with one exception: students who described themselves as African American, Hispanic, or Native American showed no differences in recommendations based on the patient SES (Table 3).

Geographic Region

In examining geographic area subgroups, we found that students from schools in the Northeast and the South demonstrated greater preference for procedural options with black patients (Table 2), and those from schools in the Northeast were significantly less likely to recommend procedural options if the patient was presented as a female (Table 2). Students across geographic areas showed the same preference for procedural options among higher SES patients (Table 3), with the exception of students attending schools in the West, where that difference was not seen.

School Type

Type of school, private or public, had no influence on overall group patterns with regard to patient race or SES (Tables 2 and 3), but those from private schools were significantly less likely to recommend procedural options for female patients (Table 2).

Multilevel Analysis

Results of multilevel modeling (Table 5) showed findings similar to bivariate analyses. Black male patients were statistically significantly more likely to receive procedural recommendations than white female patients; patients in the highest SES were most likely to receive procedural recommendations. Students from the Northeast were least likely and those from the South most likely to recommend procedural options. The intraclass correlation coefficient was 0.005, indicating a low level of cluster bias.

Table 5 Results of Multilevel Multivariable Analysis of Student Recommendations for Cardiac Vignette Patients

Discussion

We found no evidence in the overall sample of racial bias in clinical decision-making leading to fewer recommendations for cardiac procedural services for black vignette patients. Indeed, students recommended procedural services slightly more frequently for black than white patients. We also found no evidence in our overall sample of clinical decision-making bias related to patient gender. We did find a clear variation in student recommendations by patient SES, with the highest SES patients more likely to receive procedural recommendations.

Our results present a more complex picture, however, when subjected to detailed secondary analyses. We found regional differences, with students graduating from schools in the Northeast more likely to recommend procedures if the patient was black or was male. Students from private schools were less likely to recommend procedural choices for female patients.

Our findings are open to a variety of interpretations, the most optimistic of which is that, overall, they reveal evolving clinical decision-making, perhaps reflecting increasing sensitivity to the problem of racial and gender-based disparities. The variations by patient SES, however, suggest that even with the overall findings related to race, more work needs to be done to fully eliminate bias in clinical decision-making.

An alternative interpretation is that overall rates related to racial and gender-based decision-making may obscure important differences within the population of students. Counterbalancing pockets (e.g., geographic regions) of bias might still exist within the larger medical student population. Evidence of differences in rates of procedural choice by region and type of school support this interpretation, especially with regard to white female patients. These latter results emphasize the work that lies ahead in determining the basis for these observations, and they also underscore the importance of considering the interaction between race, gender, and SES in disparities in decision-making.

A further interpretation of our findings is that, in general, they demonstrate success in reducing or eliminating explicit bias, while not addressing implicit bias. Explicit bias, operating at a conscious level, is under an individual’s control, and is therefore subject to training, reflection, social pressure, and correction.37 Implicit bias, on the other hand, operates at a subconscious level, is not under voluntary control, and surfaces only under certain conditions, such as fatigue, decisional time pressures, or situational stresses, without the individual’s awareness.38

Haider et al. studied matriculating medical students and found evidence among these students that implicit bias may exist in the absence of explicit bias.17 Our design attempted to elicit implicit as well as explicit bias through our emphasis on rapid response to the survey questions and use of toss-up scenarios. However, it is quite possible that this effort was ineffective in evoking implicit biases. This study, therefore, should not be considered a test for the presence or absence of implicit bias.

Limitations

Although our study had none of the quality flaws noted in a recent systematic review of research on racial bias in health care practitioners,11 it does have several potential limitations. It is possible that our efforts at blinding students to our interest in stereotyping in decision-making were not successful, and as a result, our findings are not valid representations of the students’ true decision-making tendencies. In this case, a social desirability bias may have influenced student recommendations, resulting in the slightly increased rate of procedural recommendations for blacks. However, this possibility would be difficult to reconcile with our findings demonstrating evidence of decision-making bias in interactions between race, gender, and SES. Another concern might be the use of vignettes to search for evidence of stereotyping in decision-making, although several studies have shown vignettes to be accurate in reflecting actual clinical practice.3943 The survey response rate among students at participating schools (40.2 %) is low enough that some might question the validity of the sample. Although our data show relatively few differences among survey respondents and the larger population of senior medical students as depicted by the AAMC survey, suggesting sample validity, it is possible that non-participants and students from non-participating schools may demonstrate different decision-making tendencies than those reflected in these data. Finally, with regard to variations in student recommendations by patient SES, it is possible that cost-of-care perspectives led students to select care options based on expected ability to pay.

Because of the large sample size, we were able to demonstrate statistically significant results despite relatively small absolute differences in recommendations among study groups. Given the high prevalence of cardiac disease, however, even small variations across the population can translate into large numbers of individuals affected. This importance of small differences is comparable to that seen in the post-myocardial infarction use of beta-blocker medications, which have been shown to produce a 1.8 % reduction in long-term mortality rates compared with patients not using beta-blockers.44

Conclusions

Our national survey of senior medical students is reason for cautious optimism that racial bias in clinical decision-making may be less common in the future. Much work still needs to be done, however, as reflected in our findings of variations by region and between public and private schools, of the strong influence of patient SES on students’ approach to toss-up clinical scenarios, and of the interplay between race, gender, and SES. We need to better understand these differences and their origins, whether subsequent medical training changes the picture painted by our findings, and the influence, if any, of implicit bias on clinical decision-making. Research to explore the driving influences on decision-making among medical students and the elements present in the training environment that promote or eliminate bias is an important next step. Most importantly, we need to use such understanding to promote effective solutions for preventing these tendencies in the future and for eliminating any lingering biases in current clinical decision-making.