Establishing Crosswalks Between Common Measures of Burnout in US Physicians

Brady, Keri J. S.; Ni, Pengsheng; Carlasare, Lindsey; Shanafelt, Tait D.; Sinsky, Christine A.; Linzer, Mark; Stillman, Martin; Trockel, Mickey T.

doi:10.1007/s11606-021-06661-4

Establishing Crosswalks Between Common Measures of Burnout in US Physicians

Original Research
Open access
Published: 31 March 2021

Volume 37, pages 777–784, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of General Internal Medicine Aims and scope Submit manuscript

Establishing Crosswalks Between Common Measures of Burnout in US Physicians

Download PDF

Keri J. S. Brady PhD, MPH ORCID: orcid.org/0000-0001-6417-0840¹,
Pengsheng Ni MD, MPH^1,2,
Lindsey Carlasare MBA³,
Tait D. Shanafelt MD⁴,
Christine A. Sinsky MD³,
Mark Linzer MD⁵,
Martin Stillman MD, JD⁵ &
…
Mickey T. Trockel MD, PhD^4,6

4753 Accesses
5 Altmetric
Explore all metrics

Abstract

Background

Physician burnout is often assessed by healthcare organizations. Yet, scores from different burnout measures cannot currently be directly compared, limiting the interpretation of results across organizations or studies.

Objective

To link common measures of burnout to a single metric in psychometric analyses such that group-level scores from different assessments can be compared.

Design

Cross-sectional survey.

Setting

US practices.

Participants

A total of 1355 physicians sampled from the American Medical Association Physician Masterfile.

Main Measures

We linked the Stanford Professional Fulfillment Index (PFI) and Mini-Z Single-Item Burnout (MZSIB) scale to the Maslach Burnout Inventory (MBI) in item response theory (IRT) fixed-calibration and equipercentile analyses and created crosswalks mapping PFI and MZSIB scores to corresponding MBI scores. We evaluated the accuracy of the results by comparing physicians’ actual MBI scores to those predicted by linking and described the closest cut-point equivalencies across scales linked to the same MBI subscale using the resulting crosswalks.

Key Results

IRT linking produced the most accurate results and was used to create crosswalks mapping (1) PFI Work Exhaustion (PFI-WE) and MZSIB scores to MBI Emotional Exhaustion (MBI-EE) scores and (2) PFI Interpersonal Disengagement (PFI-ID) scores to MBI Depersonalization (MBI-DP) scores. The commonly used MBI-EE raw score cut-point of ≥27 corresponded most closely with respective PFI-WE and MZSIB raw score cut-points of ≥7 and ≥3. The commonly used MBI-DP raw score cut-point of ≥10 corresponded most closely with a PFI-ID raw score cut-point of ≥9.

Conclusions

Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

Dear Mental Health Practitioners, Take Care of Yourselves: a Literature Review on Self-Care

Article 23 May 2019

Burnout in nursing: a theoretical review

Article Open access 05 June 2020

The Effectiveness of Mindfulness-Based Stress Reduction on the Psychological Functioning of Healthcare Professionals: a Systematic Review

Article 24 September 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

INTRODUCTION

In the US, burnout is more common in physicians than in workers in other fields,¹ and is characterized by work-related feelings of exhaustion and depersonalization or interpersonal disengagement.^{2, 3} Physician burnout is associated with poor physician health outcomes, reduced quality of care, and at least 4.6 billion dollars in excess health system costs annually.^4,5,6 In an effort to curb physician burnout,^{7, 8} health systems across the nation are integrating measures of burnout into routine organizational assessments to monitor system functioning and evaluate the effectiveness of practice changes designed to improve physician well-being.^9,10,11 This practice is recommended in the National Academy of Medicine’s consensus report on clinician burnout and regarded by healthcare leaders as a basic first step to addressing the problem.^{7, 10,11,12,13,14}

With the widespread adoption of physician burnout assessment within US healthcare systems has come the problem of comparing outcomes across different burnout measures. With several validated options available that vary in length and cost, a number of different measures are currently in use in the US,^{9, 10} including the Maslach Burnout Inventory-Human Services Survey for Medical Personnel (MBI),¹⁵ Stanford Professional Fulfillment Index (PFI),¹⁶ and the Mini-Z Single-Item Burnout (MZSIB) scale.¹⁷ When two different burnout measures are used across organizations or within an organization over time, the scores are not comparable unless they are placed onto the same metric, or “linked,” in psychometric analyses. To date, no studies to our knowledge have linked common measures of physician burnout onto a single metric, which would allow healthcare organizations to compare burnout scores/rates across different measures.

The primary aim of this study was to link the PFI and MZSIB to the MBI metric and create crosswalks that map scores from the PFI and MZSIB to corresponding scores on the MBI. Using the crosswalks, we aimed to describe the closest cut-point equivalences for scales linked to the same metric. Our secondary aim was to examine the psychometric properties of scales linked to the same metric, including each scale’s reliability and associations with relevant adverse outcomes.

METHODS

Linking refers to the statistical process of placing two or more measures with different content and/or construct severity levels onto the same scale.¹⁸ Through this process, a relationship is established between the linked measures, such that for each score on Burnout Measure A, an equivalent score (within standard error) on Burnout Measure B is established.

Design and Participants

This study used a single-group linking design, whereby items from each burnout instrument were administered in a confidential, cross-sectional survey to all respondents from February to March 2019. To obtain a representative convenience sample, we randomly sampled physicians of all ages, sexes, and specialties from the American Medical Association Physician Masterfile. Physicians were emailed the survey and offered a small financial incentive to participate. The survey was administered in waves until we reached a target sample size of ≥1200 respondents, which was estimated as the minimum sample size needed for item response theory linking analyses. Physicians (including postgraduate trainees) practicing in the US at the time of the survey were eligible for inclusion.

Measures

We measured physician burnout using the MBI 9-item Emotional Exhaustion (MBI-EE) and 5-item Depersonalization (MBI-DP) subscales (0 = never, 1 = a few times a year or less, 2 = once a month or less, 3 = a few times a month, 4 = once a week, 5 = a few times a week, 6 = every day); the PFI 4-item Work Exhaustion (PFI-WE) and 6-item Interpersonal Disengagement (PFI-ID) subscales (0 = not at all, 1 = very little, 2 = moderately, 3 = a lot, 4 = extremely); and the single-item MZSIB (1 = no burnout; 2 = under stress; 3 = have one or more burnout symptom; 4 = burnout won’t go away; 5 = completely burned out; see Supplemental Appendix 1 for the complete MZSIB response options).¹⁷ The sequence in which each instrument was administered was randomized to prevent ordering effects.

The MBI and PFI are outcome measures, whereas the MZSIB scale is a screening measure. Commonly used raw (total) score cut-points for each scale are ≥27, ≥10, and ≥3 on the MBI-EE, MBI-DP, and MZSIB scales, respectively.^{1, 19, 20} The raw (total) score cut-point for the PFI Burnout Composite (PFI-BC) Scale is ≥14.¹⁶ Cut-points for PFI-WE and PFI-ID subscales have not been published and are identified in the current study.¹⁶

We also assessed physicians’ demographics, depressive symptoms (4-item PROMIS depression measure),²¹ distress as measured by the original, 7-item Physician Well-Being Index (WBI),^22,23,24 and intent to leave one’s current practice or intent to leave medicine (for attending physicians and postgraduate trainees, respectively) in the next 2 years (1 item). ¹⁷ All measures were scored such that higher scores indicate more of each construct.

Linking Analyses

Our methods were informed by those used in the PROsetta Stone Project.^{25, 26} Scales were linked in item sets, consisting of two scales: a target measure and an anchor measure. In linking analyses, a target measure is linked to an anchor measure, which places the target measure onto the metric of the anchor measure. Because the MBI is historically the most common physician burnout assessment,²⁷ we selected the MBI-EE and MBI-DP scales as anchor measures. Target measures included the PFI-WE, PFI-ID, and MZSIB scales.

Prior to conducting linking analyses, we qualitatively and quantitatively examined the degree to which the scales that we aimed to link assess essentially the same construct, a key assumption of linking.^{18, 28} Scales assessing essentially the same construct were expected to (1) have very similar item content as determined by two independent subject domain expert raters (TS, ML); (2) be highly correlated (inter-scale Pearson’s r of ≥0.75); and (3) be essentially unidimensional as determined in confirmatory factor analyses (CFAs) (see Supplemental Appendix 2 for additional assumption assessment details).²⁵

For each item set, we conducted item response theory (IRT) fixed-calibration linking and equipercentile linking analyses using a fivefold cross validation process (Supplemental Appendix 3). In IRT linking, raw (total) scores on each target measure were linked to t-scores on each MBI anchor scale. A t-score is a standardized score ranging from 0 to 100, with a mean score and standard deviation equal to 50 and 10, respectively. T-scores on each MBI anchor scale were then mapped to corresponding MBI raw scores. In our IRT linking analyses, we derived the MBI-EE and MBI-DP anchor metrics from a prior IRT calibration of the MBI in a 2014 national sample of US physicians.^{29, 30} In equipercentile linking, the MBI metric was derived from the primary survey data collected in this study. We evaluated the accuracy of each linking method for each item set by calculating the correlation, mean difference, and standard deviation (SD) of the difference between physicians’ predicted and actual t-scores on the MBI anchor scale, using pooled predicted and actual t-scores produced from a fivefold cross validation process. The method that yielded the highest correlations, lowest mean differences, and lowest SD of difference across all item sets was used to create a crosswalk mapping raw scores on the target measure to corresponding t-scores and raw scores on the MBI anchor measure. Once each item set was linked, we (1) identified the closest cut-point equivalencies across scales linked to the same metric and (2) described the reliability of scales linked to the same metric (Supplemental Appendix 4).³¹ We used the Brady et al. ²⁹ IRT analysis to identify the t-scores corresponding with (1) each MBI-EE and MBI-DP raw score cut-point and (2) each raw score on the MBI predicted by equipercentile linking.

Finally, we computed correlations between each scale and measures of physician depressive symptoms, distress, and intent to leave to compare the magnitude of each scale’s associations with these outcomes. Analyses were conducted in R (v3.5.1) psych, lavaan, mirt, and equate packages.^{32,33,34,35,36} This study was approved by the University of Illinois at Chicago Institutional Review Board.

RESULTS

Sample

The overall sample included 1355 US physicians (Table 1). The most common demographic characteristics of respondents were White race, male sex, non-primary care specialty, and <44 years of age. Thirty-one percent of respondents were trainees. In subgroup invariance analyses, we found support for the invariance of our linking results across early versus late responders (where late responders were used as a proxy for non-responders; Supplemental Appendix 5, Table 5.5). Overall, mean raw scores on the MBI-EE, PFI-WE, MZSIB, MBI-DP, and PFI-ID scales were 21.82, 6.06, 2.45, 7.86, and 6.63, respectively (Table 2) (see Supplemental Appendix 6 for specialty-level descriptive scale statistics).

Table 1 Overall Sample Characteristics (n = 1355)

Full size table

Table 2 Overall Descriptive Scale Statistics by Domain and Measure (n = 1346)

Full size table

Assumption Assessment

In qualitative evaluations of each target and anchor scale’s item content overlap, both raters agreed that the following item sets assess essentially the same underlying construct: PFI-WE and MBI-EE (item set 1), PFI-ID and MBI-DP (item set 2), and MZSIB and MBI-EE (item set 3). Inter-scale correlations between the target and anchor scales in item sets 1–3 were 0.80, 0.76, and 0.76, respectively. Item sets 1–3 met all other linking assumptions in quantitative analyses (Supplemental Appendix 5).

Crosswalks and Closest Cut-Point Equivalents

Overall, IRT (versus equipercentile) linking produced the most accurate results (Supplemental Appendices 7 - 9) and was used to create crosswalks mapping raw scores on the PFI-WE, PFI-ID, and MZSIB (target) scales to corresponding t-scores and raw scores on their respective MBI-EE, MBI-DP, and MBI-EE anchor scales (Table 3).

The commonly used raw score cut-point of ≥27 (t-score = 50.70) ²⁹ on the MBI-EE scale corresponded most closely with raw score cut-points of ≥7 and ≥3 on the respective PFI-WE and MZSIB scales (Table 3). The commonly used raw score cut-point of ≥10 (t-score = 53.76) ²⁹ on the MBI-DP scale corresponded most closely with a raw score cut-point of ≥9 on the PFI-ID scale. The raw score cut-point of ≥3 on the MZSIB scale corresponded most closely with a raw score of ≥8 on the PFI-WE scale.

Table 3 Crosswalks Produced from IRT Linking Mapping Raw Scores from the PFI and MZSIB to Corresponding Predicted MBI T-scores and Raw Scores

Full size table

Reliability

Both the MBI-EE and PFI-WE scales demonstrated ≥0.70 reliability to assess a wide range of low and high emotional exhaustion levels on the MBI-EE t-score metric (Fig. 1a). The MZSIB scale showed less than 0.70 reliability to assess emotional exhaustion across the MBI-EE t-score metric. Both the MBI-DP and PFI-ID scales also demonstrated ≥0.70 reliability to assess a range of low and high depersonalization levels on the MBI-DP t-score metric (Fig. 1b). Compared to the PFI-WE scale, the MBI-EE scale possessed ≥0.70 reliability over a wider range of below average emotional exhaustion t-scores, whereas, compared to the MBI-DP scale, the PFI-ID scale possessed ≥0.70 reliability over a wider range of above average depersonalization t-scores.

Associations with Adverse Outcomes

All scales correlated with physician depressive symptoms, physician distress, and physicians’ intent to leave their practice or medicine within 2 years (Table 4). Among measures assessing the same underlying construct (i.e., the MBI-EE, PFI-WE, and MZSIB measures of emotional exhaustion and the MBI-DP and PFI-ID measures of depersonalization), there were no major differences in the magnitude of correlations between each burnout scale and depressive symptom, distress, and intent to leave outcomes (Table 4). The MBI-DP scale showed a modestly lesser correlation with intent to leave compared to the PFI-ID scale.

Table 4 Correlation Analysis of Each Scale’s Raw Scores with Adverse Outcomes

Full size table

DISCUSSION

Healthcare organizations across the US are monitoring physician burnout as an indicator of health system performance.⁹ Common applications of physician burnout measurement as a performance indicator are to make inferences regarding the quality of physicians’ medical practice environments, workforce sustainability, and healthcare quality.⁹ Yet, comparisons of performance over time, across organizations, or across studies are not possible when different burnout measures have been employed. In this study, we used IRT linking to place common burnout measures—the PFI and MZSIB—onto the metric of the MBI, and created crosswalks that map raw scores on the PFI-WE, PFI-ID, and MZSIB scales to corresponding MBI subscale scores. For scales linked to the same metric, we identified the closest cut-point equivalencies across all linked metrics and compared the reliability across linked outcome metrics.

By linking the PFI, MZSIB, and MBI to the same metric, the crosswalks we produced allow investigators using these measures to make several useful comparisons.²⁵ First, investigators can compare summary sample scores across the PFI, MZSIB, and MBI. That is, using the crosswalks produced in this study, group-level emotional exhaustion scores can be compared across the MBI-EE, PFI-WE, and MZSIB scales, and group-level depersonalization scores can be compared across the MBI-DP or PFI-ID scales.²⁵ Second, investigators can use the crosswalks to calculate emotional exhaustion/depersonalization rates across metrics by substituting respondents’ raw (total) scores on the PFI or MZSIB with the corresponding MBI t-score. The corresponding MBI t-scores can then be used to calculate the percent of physicians scoring at or above a selected MBI cut-point. The substituted MBI scores can be further analyzed in descriptive and inferential analyses.²⁵ The crosswalks can also be used to calculate emotional exhaustion/depersonalization rates across metrics using only aggregated data. In Supplemental Appendix 10, we demonstrate how to calculate emotional exhaustion/depersonalization rates on the MBI metric using frequency tables of physicians’ raw scores on the PFI. The crosswalks can facilitate comparisons of burnout scores/rates across organizations using different measures, within organizations using different measures over time, and to published regional/national benchmarks. The use of our crosswalks to convert burnout scores from different measures to a common metric may also improve comparative effectiveness and meta-analysis research by reducing error associated with the use of different scales across studies.^{25, 37}

Our reliability assessment provides important information regarding the psychometric performance of each measure, each of which has its own strengths and weaknesses that should be considered within the intended purpose of an organization’s assessment.⁹ For example, the MBI-EE scale provides >0.90 reliability to assess a wide range of emotional exhaustion levels, but at the cost of additional items. With less than half the items of the MBI-EE, the PFI-WE scale offers >0.80 reliability to assess a similar range of above-average emotional exhaustion levels as the MBI-EE scale, but has less precision at below average emotional exhaustion levels than the MBI-EE scale. Similarly, with only one item, the MZSIB offers the least response burden but has less precision to assess emotional exhaustion than the MBI-EE and PFI-WE scales (an expected result given the MZSIB was originally designed as a brief screening tool, not an outcome assessment). However, this level of precision may be sufficient, for example, if the intended purpose of assessment is for screening followed by additional assessment, or to predict the risk of occupational outcomes of depression symptoms, distress, or intent to leave one’s practice at a group-level. The PFI-ID scale offers the most reliable assessment of depersonalization across the widest range of depersonalization levels, with one additional item compared to MBI-DP scale. We should note that, to our knowledge, this is the first assessment of the MZSIB’s reliability (as internal consistency reliability is not applicable to single-item scales and test-retest reliability has not yet been investigated for this measure).

All scales showed significant correlations with important, adverse outcomes, including physician depression, distress, and intent to leave. The association between each measure and each adverse outcome underscores the importance of including measures of physician burnout in institutional assessments.

To our knowledge, this is the first study to crosswalk common measures of burnout among US physicians. Strengths of this study include the use of a single-group linking design (permitting the direct comparison of physicians’ actual MBI scores to those predicted by linking to determine the accuracy of our results) and the use and agreement of two different linking methods.

However, this study has several limitations. First, because the MBI-EE and MBI-DP metrics to which the PFI and MZSIB are linked were derived from a prior IRT analysis of 2014 MBI data from the Shanafelt et al. (2015) national physician burnout prevalence study,^{29, 30} the mean of each MBI anchor scale is fixed to the mean EE and DP scores of US physicians in 2014. Therefore, when interpreting a score on a target scale relative to its SDs above/below the mean score on its MBI anchor scale, it should be known that the comparison is relative to the underlying mean MBI score of US physicians in 2014. Despite this limitation, the crosswalks remain valid assuming that the MBI subscales function equivalently across the 2014 US physician sample and US general physician population. Second, although our findings provide support for the invariance of our crosswalks across early and late responder groups and, therefore, provide potential support for the representativeness of our sample, this support relies on the assumption that late responders are an adequate proxy for non-respondents. Nevertheless, several studies have demonstrated no significant differences in burnout estimates across respondent and non-respondent groups, despite the low response rates that are common in physician survey research.^{1, 38} Third, we chose to highlight the closest cut-point equivalencies across linked measures using commonly used cut-points on each metric. Because raw scores on each target metric are linked to continuous scores on each anchor metric, the closest cut-point equivalencies across metrics are an approximation. Although we identified the closest cut-point equivalency for scores ≥27 and ≥10 on the respective MBI-EE and MBI-DP scales, investigators can use crosswalks published in Brady et al. ²⁹ in conjunction with the crosswalks presented herein to identify cut-point equivalencies on the PFI and MZSIB at other MBI raw score cut-points.

It is important to note that the crosswalk tables rendered with this research allow reasonable approximate translation of aggregate, group-level scores from one measure of burnout to another. They are not intended to translate individual-level respondent scores from one measure of burnout to another, and attempting to do so would produce unreliable results. In addition, it is important to note that crosswalking scores from one measure of burnout to another is only appropriate across measures that assess the same construct. A measure of emotional exhaustion (such as the MZSIB) cannot be crosswalked to derive an equivalent score on a metric of depersonalization.

CONCLUSIONS

As US healthcare organizations are increasingly measuring physician burnout as an indicator of health system performance, there is a need to compare burnout outcomes across different assessments. Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

Abbreviations

IRT:: Item response theory
MBI:: Maslach Burnout Inventory-Human Services Survey for Medical Personnel
MBI-EE:: Maslach Burnout Inventory-Human Services Survey for Medical Personnel Emotional Exhaustion scale
MBI-DP:: Maslach Burnout Inventory-Human Services Survey for Medical Personnel Depersonalization scale
MZSIB:: Mini-Z Single-Item Burnout scale
PFI:: Stanford Professional Fulfillment Index
PFI-WE:: Stanford Professional Fulfillment Index Work Exhaustion scale
PFI-ID:: Stanford Professional Fulfillment Index Interpersonal Disengagement scale

References

Shanafelt TD, West CP, Sinsky C, et al. Changes in burnout and satisfaction with work-life integration in physicians and the general US working population between 2011 and 2017. Mayo Clinic Proceedings 2019;94(9):1681-1694.
Maslach C, Jackson SE. The measurement of experienced burnout. J Occup Behav 1981;2(2):99-113.
Article Google Scholar
Shanafelt TD, Boone S, Tan L, et al. Burnout and satisfaction with work-life balance among US physicians relative to the general US population. Arch Intern Med 2012;172(18):1377-1385.
Article Google Scholar
Dyrbye LN, T.D. Shanafelt, C.A. Sinsky, P.F. Cipriano, J. Bhatt, A. Ommaya, C.P. West, and D. Meyers. Burnout among health care professionals: A call to explore and address this underrecognized threat to safe, high-quality care. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2019; https://doi.org/10.31478/201707b
Tawfik DS, Scheid A, Profit J, et al. Evidence Relating Health Care Provider Burnout and Quality of Care: A Systematic Review and Meta-analysis. Ann Intern Med 2019;171(8):555-567.
Article Google Scholar
Han S, Shanafelt TD, Sinsky CA, et al. Estimating the Attributable Cost of Physician Burnout in the United States. Ann Intern Med 2019;170(11):784-790.
Article Google Scholar
Jha A, Ilif A, Chaoui A. A crisis in health care: a call to action on physician burnout. In: Massachusetts Medical Society. Available at: http://www.massmed.org/Publications/Research,-Studies,-and-Reports/A-Crisis-in-Health-Care--A-Call-to-Action-on--Physician-Burnout/#.X99sCJNKh_k. Accessed June 19, 2020.
Dzau VJ, Kirch DG, Nasca TJ. To Care Is Human — Collectively Confronting the Clinician-Burnout Crisis. N Engl J Med 2018;378(4):312-314.
Article Google Scholar
Brady KJS, Kazis LE, Sheldrick RC, Ni P, Trockel MT. Selecting Physician Well-Being Measures to Assess Health System Performance and Screen for Distress: Conceptual and Methodological Considerations. Curr Probl Pediatr Adolesc Health Care 2019;49(12):100662.
Article Google Scholar
Dyrbye LN, Meyers D, Ripp J, Dalal N, Bird SB, Sen S. A Pragmatic Approach for Organizations to Measure Health Care Professional Well-Being. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2018; https://doi.org/10.31478/201810b
Shanafelt TD, Noseworthy JH. Executive leadership and physician well-being: nine organizational strategies to promote engagement and reduce burnout. Mayo Clin Proc 2017;92(1):129-146.
Article Google Scholar
National Academy of Medicine. Measuring Burnout. Available at: https://nam.edu/clinicianwellbeing/solutions/measuring-burnout/. Accessed June 19, 2020.
National Academy of Medicine. Validated instruments to assess work-related dimensions of well-being. Available at: https://nam.edu/valid-reliable-survey-instruments-measure-burnout-well-work-related-dimensions/. Accessed June 19, 2020.
National Academies of Sciences, Engineering, and Medicine. Taking Action Against Clinician Burnout: A Systems Approach to Professional Well-Being. The National Academies Press; 2019.
Maslach C, Jackson SE, Leiter MP. Maslach Burnout Inventory Manual. 4th ed: Mind Garden, Inc.; 2017. https://www.mindgarden.com/maslach-burnout-inventory-mbi/686-mbi-manualprint.html
Trockel M, Bohman B, Lesure E, et al. A Brief Instrument to Assess Both Burnout and Professional Fulfillment in Physicians: Reliability and Validity, Including Correlation with Self-Reported Medical Errors, in a Sample of Resident and Practicing Physicians. Acad Psychiatry 2017;42(1):11-24.
Article Google Scholar
Konrad TR, Williams ES, Linzer M, et al. Measuring physician job satisfaction in a changing workplace and a challenging environment. Med Care 1999;37(11):1174-1182.
Article CAS Google Scholar
Kolen MJ, Brennan RL. Test equating, scaling, and linking: Methods and practices. Springer Science & Business Media; 2014.
Maslach C, Jackson S, Leiter M. Maslach Burnout Inventory Manual. 3rd ed. Consulting Psychologists Press; 1996.
Williams ES, Konrad TR, Linzer M, et al. Refining the measurement of physician job satisfaction: results from the Physician Worklife Survey. SGIM Career Satisfaction Study Group. Society of General Internal Medicine. Med Care 1999;37(11):1140-1154.
Article CAS Google Scholar
Pilkonis PA, Choi SW, Reise SP, Stover AM, Riley WT, Cella D. Item Banks for Measuring Emotional Distress From the Patient-Reported Outcomes Measurement Information System (PROMIS®): Depression, Anxiety, and Anger. Assessment. 2011;18(3):263-283.
Article Google Scholar
Dyrbye LN, Szydlo DW, Downing SM, Sloan JA, Shanafelt TD. Development and preliminary psychometric properties of a well-being index for medical students. BMC Med Educ 2010;10(1):8.
Article Google Scholar
Dyrbye LN, Schwartz A, Downing SM, Szydlo DW, Sloan JA, Shanafelt TD. Efficacy of a brief screening tool to identify medical students in distress. Acad Med 2011;86(7):907-914.
Article Google Scholar
Dyrbye LN, Satele D, Sloan J, Shanafelt TD. Utility of a brief screening tool to identify physicians in distress. J Gen Intern Med 2013;28(3):421-427.
Article Google Scholar
Choi SW, Schalet B, Cook KF, Cella D. Establishing a Common Metric for Depressive Symptoms: Linking the BDI-II, CES-D, and PHQ-9 to PROMIS Depression. Psychol Assess 2014;26(2):513-527.
Article Google Scholar
Choi SW, Prodrabsky T, McKinney N, Schalet BD, Cook KF, Cella D. PROSetta Stone Methodology. Available at: http://www.prosettastone.org/Methodology/Documents/PROSetta%20Methodology%20Report.pdf. Accessed June 19, 2020.
Rotenstein LS, Torre M, Ramos MA, et al. Prevalence of burnout among physicians: A systematic review. Jama. 2018;320(11):1131-1150.
Article Google Scholar
Dorans NJ, Holland PW. Population invariance and the equatability of tests: Basic theory and the linear case. J Educ Meas 2000;37(4):281-306.
Article Google Scholar
Brady KJS NP, Sheldrick RC, Trockel MT, Shanafelt T, Rowe SG, Schneider JI, Kazis LE. Describing the Emotional Exhaustion, Depersonalization, and Low Personal Accomplishment Symptoms Associated with Maslach Burnout Inventory Subscale Scores in US Physicians. J Patient Rep Outcomes 2020;4(1):1-14.
Article Google Scholar
Shanafelt TD, Hasan O, Dyrbye LN, et al. Changes in Burnout and Satisfaction With Work-Life Balance in Physicians and the General US Working Population Between 2011 and 2014. Mayo Clin Proc 2015;90(12):1600-1613.
Article Google Scholar
HealthMeasures. PROMIS Instrument Development and Scientific Standards Version 2.0. 2013. Available at: https://www.healthmeasures.net/images/PROMIS/PROMISStandards_Vers2.0_Final.pdf. Accessed June 19, 2020.
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. 2018.
Revelle W. psych: Procedures for Personality and Psychological Research. https://CRAN.R-project.org/package=psych. 2018.
Albano AD. equate: An R package for observed-score linking and equating. J Stat Softw 2016;74(8):1-36.
Article Google Scholar
Chalmers P. mirt: A Multidimensional Item Response Theory Package for the R Environment. J Stat Softw 2012;48(6):1-29.
Article Google Scholar
Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J Stat Softw 2012;48(2):1-36.
Article Google Scholar
Lai J-S, Cella D, Yanez B, Stone A. Linking fatigue measures on a common reporting metric. J Pain Symptom Manag 2014;48(4):639-648.
Article Google Scholar
Simonetti JA, Clinton WL, Taylor L, et al. The impact of survey nonresponse on estimates of healthcare employee burnout. Healthcare. 2020;8(3):100451.
Article Google Scholar

Download references

Funding

This study was funded by the American Medical Association.

Author information

Authors and Affiliations

Health Law, Policy & Management Department, Boston University School of Public Health, Boston, MA, USA
Keri J. S. Brady PhD, MPH & Pengsheng Ni MD, MPH
Biostatistics & Epidemiology Data Analytic Center, Boston University School of Public Health, Boston, MA, USA
Pengsheng Ni MD, MPH
American Medical Association, Chicago, IL, USA
Lindsey Carlasare MBA & Christine A. Sinsky MD
Stanford Medicine WellMD Center, Stanford University, Stanford, CA, USA
Tait D. Shanafelt MD & Mickey T. Trockel MD, PhD
Hennepin Healthcare Research Institute and Department of Medicine, Hennepin Healthcare, University of Minnesota, Minneapolis, MN, USA
Mark Linzer MD & Martin Stillman MD, JD
Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, USA
Mickey T. Trockel MD, PhD

Authors

Keri J. S. Brady PhD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
Pengsheng Ni MD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
Lindsey Carlasare MBA
View author publications
You can also search for this author in PubMed Google Scholar
Tait D. Shanafelt MD
View author publications
You can also search for this author in PubMed Google Scholar
Christine A. Sinsky MD
View author publications
You can also search for this author in PubMed Google Scholar
Mark Linzer MD
View author publications
You can also search for this author in PubMed Google Scholar
Martin Stillman MD, JD
View author publications
You can also search for this author in PubMed Google Scholar
Mickey T. Trockel MD, PhD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Keri J. S. Brady PhD, MPH.

Ethics declarations

Conflict of Interest

Dr. Shanafelt is co-inventor of the Well-being Index instruments and the Participatory Management Leadership Index. Mayo Clinic holds the copyright for these instruments and has licensed them for use outside of Mayo Clinic. Dr. Shanafelt receives a portion of any royalties paid to Mayo Clinic. Dr. Linzer is supported in part through grants to Hennepin Healthcare from the AMA, Institute for Healthcare Improvement, the American Board of Internal Medicine Foundation, and the American College of Physicians for research and training in burnout prevention. All other authors report no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

ESM 1

(DOCX 80.7 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Brady, K.J.S., Ni, P., Carlasare, L. et al. Establishing Crosswalks Between Common Measures of Burnout in US Physicians. J GEN INTERN MED 37, 777–784 (2022). https://doi.org/10.1007/s11606-021-06661-4

Download citation

Received: 20 December 2020
Accepted: 11 February 2021
Published: 31 March 2021
Issue Date: March 2022
DOI: https://doi.org/10.1007/s11606-021-06661-4

Key words

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Establishing Crosswalks Between Common Measures of Burnout in US Physicians

Abstract

Background

Objective

Design

Setting

Participants

Main Measures

Key Results

Conclusions

Similar content being viewed by others

Dear Mental Health Practitioners, Take Care of Yourselves: a Literature Review on Self-Care

Burnout in nursing: a theoretical review

The Effectiveness of Mindfulness-Based Stress Reduction on the Psychological Functioning of Healthcare Professionals: a Systematic Review

INTRODUCTION

METHODS

Design and Participants

Measures

Linking Analyses

RESULTS

Sample

Assumption Assessment

Crosswalks and Closest Cut-Point Equivalents

Reliability

Associations with Adverse Outcomes

DISCUSSION

CONCLUSIONS

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Additional information

Publisher’s Note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation