INTRODUCTION

Increased attention has been paid to clinical quality of care and whether it differs by socioeconomic status or race/ethnicity.1,2,3,4,5,6 However, there has been less recent focus on gender health equity. A small body of research suggests unexplained differences in the quality of health care received by women and men.7,8 In general, care-seeking and adherence are higher among women than among men, with women scoring higher on preventive care measures, including many screening measures. Less clear is whether this advantage extends to other aspects of care or intermediate outcomes. Some have found lower quality of care for cardiovascular disease and HIV/AIDS among women9,10,11,12,13 and have argued that these exceptions suggest a lingering bias in how women are treated for these diseases.14,15

Gender gaps in care for seniors could have substantial health and cost consequences due to high prevalence of comorbidities (which may confound prevention and treatment), higher socioeconomic vulnerability than the general population, and higher mortality risk.

Prior work finds that racial/ethnic gaps in quality of care vary across Medicare Advantage (MA) plans.5,6,16 This study examines the extent to which gender differences in quality of care vary across MA plans and whether quality of care is higher among women or men on cardiovascular disease measures.1,2,3,4,7,9,11,12 Here, we examine the direction, size, and nature of gender gaps in MA plans. We focus on three questions.

First, how did performance scores differ by gender among MA beneficiaries? Consistent gaps across measures would indicate a need to improve overall care for one gender. Alternatively, gaps that differ by measure would suggest focusing on the care of one gender in those areas. These patterns would warrant different quality improvement strategies to close observed gaps. For example, if women are generally more internally motivated to seek care, men might especially benefit from nudges that help those who are less activated.

Second, did the MA plans that provided the highest quality care for women also tend to do so for men? Here, we ask whether there is a plan-level gender gap that varies among plans. If the gap varies little, both women and men can use plan performance scores as an accurate quality indicator for their gender. If gender gaps vary substantially, gender-specific performance reporting might better inform MA beneficiaries about the plans that would offer them the best care. Such reporting might also illustrate differential improvement in men’s or women’s care in response to interventions.

Third, if gender gaps in care differ by plan, are gaps more favorable to one gender where quality of care is high? Determining whether and how care falls short for one or both genders could inform interventions to improve overall quality and to address gender gaps.

METHODS

Data

We used 2011 and 2012 Healthcare Effectiveness Data and Information Set (HEDIS) data. HEDIS data are collected by the Centers for Medicare and Medicaid Services (CMS) and consist of health care process measures and intermediate outcome measures based on individual-level administrative data, supplemented in some cases by information from medical records.17 These measures represent the state of the art in measuring health plan performance; they are used to evaluate services for which evidence indicates that measure improvement improves health outcomes.18,19 Each measure specifies inclusion criteria based on age and disease presence, as well as relevant exclusions (such as contraindications); together these define the sample for each HEDIS measure.20

An MA contract, hereafter called a plan, is a set of offerings (or benefit packages) from a single sponsor, usually in a specific geographic area.

Sample

The analytic sample includes 23.8 million HEDIS records (unique combinations of measure, person, and year) for MA beneficiaries enrolled in any of 456 reporting plans operating in 2011 (Footnote 1).

Variables

Our dependent variables were 32 HEDIS measures: of the 34 measures available in both measurement years, we excluded 2 with limited variation (a pass rate above 95% or below 5%). We examined (see Table 1) 9 screening measures (3 primary screening measures and 6 secondary screening measures for beneficiaries with specific conditions), 17 treatment measures, 5 intermediate outcome measures, and 1 access measure. There are 6 “count” measures, for which an individual beneficiary may have more than one eligible event (the anticonvulsant and summary medicine monitoring measures, the 2 measures for pharmacotherapy management of Chronic Obstructive Pulmonary Disease exacerbation, and the 2 measures for follow-up after hospitalization for mental illness). All other HEDIS measures are coded 1 = yes and 0 = no. We reverse-coded 3 measures of drug-disease interaction prevalence so that a higher score corresponds to better quality of care for all measures examined.
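The variable construction described above can be sketched in a few lines of Python. This is a minimal illustration, not the study's code: the measure names, the toy scores, and the helper names (`pass_rate`, `keep_measure`, `reverse_code`) are hypothetical, and only the 5%/95% thresholds and the reverse-coding rule are taken from the text.

```python
# Hypothetical sketch of the variable construction; names and data are made up.

def pass_rate(scores):
    """Share of records coded 1 for a binary (yes/no) measure."""
    return sum(scores) / len(scores)

def keep_measure(scores, low=0.05, high=0.95):
    """Keep a measure unless its pass rate is below 5% or above 95%."""
    return low <= pass_rate(scores) <= high

def reverse_code(scores):
    """Flip 0/1 so that a higher value always means better care."""
    return [1 - s for s in scores]

# Toy records: measure name -> list of 0/1 scores.
measures = {
    "near_ceiling_measure": [1] * 97 + [0] * 3,        # 97% pass rate
    "typical_measure": [1] * 70 + [0] * 30,            # 70% pass rate
    "drug_disease_interaction": [1] * 20 + [0] * 80,   # 1 = interaction present
}
reverse_coded = {"drug_disease_interaction"}

analytic = {}
for name, scores in measures.items():
    if name in reverse_coded:
        scores = reverse_code(scores)
    if keep_measure(scores):
        analytic[name] = scores
```

In this toy example the near-ceiling measure is dropped, and the interaction measure enters the analysis with the share of beneficiaries without an interaction as its score.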

Table 1 National HEDIS Performance Scores by Gender

Analytic Approach

For each HEDIS measure, we calculate national performance scores by gender.

To address our first research question, we estimate female-male differences both within plan and overall. We fit two-level binomial mixed-effect models21 using individual-level HEDIS scores as outcomes, fixed effects for gender, random plan intercepts, and random plan slopes for female-male; these models account for clustering of patients within plans. This approach reduces the likelihood that any apparent convergence of quality of care by gender at high or low levels of quality of care reflects a mere ceiling or floor effect. Because the official scoring specifications for HEDIS measures do not involve case-mix adjustment, no other covariates were included.
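One way to write down a model of this kind is sketched below. The symbols are introduced here for illustration only, and the bivariate normal distribution for the plan effects is the conventional assumption for such models rather than something stated in the text. Here y_ij is the 0/1 score of beneficiary i in plan j, and female_ij is 1 for women and 0 for men.

```latex
\[
\operatorname{logit}\Pr(y_{ij}=1)
  = \beta_0 + \beta_1\,\mathrm{female}_{ij}
  + u_{0j} + u_{1j}\,\mathrm{female}_{ij},
\qquad
\begin{pmatrix} u_{0j} \\ u_{1j} \end{pmatrix}
  \sim N\!\left(\mathbf{0},\, \Sigma\right)
\]
```

Under this sketch, exp(β1) is the within-plan odds ratio for women versus men, and the variance of the random slope u_1j describes how much the gender gap differs from plan to plan.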

We employ the same models to address the second and third research questions by calculating the informativeness of gender-specific plan scores. Conceptually, a measure is informative if the gender gap (here the female minus male difference within a plan) varies from plan to plan, and the best plan for men is not necessarily the best plan for women. More formally, informativeness is the proportion of variance in plan scores for one gender that cannot be predicted from the overall plan scores.16 It is 0 if gender gaps are constant across plans and 1 if men’s and women’s scores are uncorrelated at the plan level. See Table 2 notes regarding the calculation of informativeness.
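The two endpoints of informativeness can be illustrated with a deliberately simple stand-in. The sketch below is not the calculation described in the Table 2 notes: it uses one minus the squared plan-level correlation between women's and men's scores, chosen only because it equals 0 when the gap is constant across plans and 1 when the two sets of scores are uncorrelated. The function names and the plan scores are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of plan scores."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def unexplained_share(women, men):
    """Illustrative stand-in only: 1 - r^2 between plan-level scores."""
    return 1.0 - pearson_r(women, men) ** 2

# Made-up plan-level scores for four plans.
men = [0.60, 0.70, 0.70, 0.60]
women_constant_gap = [m + 0.05 for m in men]   # same gap in every plan
women_unrelated = [0.50, 0.60, 0.70, 0.80]     # uncorrelated with men's scores

share_constant = unexplained_share(women_constant_gap, men)   # close to 0
share_unrelated = unexplained_share(women_unrelated, men)     # close to 1
```

With a constant gap, knowing a plan's score for men pins down its score for women, so nothing is left unexplained; with unrelated scores, the ranking of plans for men says nothing about their ranking for women.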

Table 2 Odds Ratios and Correlations of Plan HEDIS Performance by Gender

To illustrate the correlations between plan performance and plan gender gap graphically for each HEDIS measure, we classified plans into quintiles based on their performance score on that measure. Within each quintile, we calculated and plotted women’s and men’s performance scores.
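The quintile summary can be sketched as follows. This is an illustration under stated assumptions, not the study's procedure: the function name `quintile_profile` and the plan scores are hypothetical, quintiles are formed by rank with equal numbers of plans, and scores within a quintile are averaged across plans without weighting by enrollment.

```python
def quintile_profile(plans):
    """plans: list of (overall, women, men) plan-level scores, at least 5 plans.

    Returns one (mean women's score, mean men's score) pair per quintile of
    overall score, ordered from the lowest-performing quintile to the highest.
    """
    ranked = sorted(plans, key=lambda p: p[0])
    n = len(ranked)
    profile = []
    for q in range(5):
        group = ranked[q * n // 5:(q + 1) * n // 5]
        women = sum(p[1] for p in group) / len(group)
        men = sum(p[2] for p in group) / len(group)
        profile.append((women, men))
    return profile

# Ten made-up plans in which the gap favoring women narrows as scores rise.
plans = []
for k in range(10):
    overall = 0.50 + 0.04 * k
    gap = 0.10 - 0.01 * k
    plans.append((overall, overall + gap / 2, overall - gap / 2))

profile = quintile_profile(plans)
gaps = [women - men for women, men in profile]
```

Plotting the two columns of `profile` against quintile number gives the kind of paired lines used to show whether the gap narrows or widens with overall plan performance.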

RESULTS

Table 1 shows the percentage of MA beneficiaries receiving the indicated care (the performance score) by gender for each measure. All gender differences were statistically significant (p < 0.05), except for antidepressant medication management, acute phase (p = 0.56). Performance scores were higher for women than men for 22/32 measures; differences ranged from 7.5 percentage points higher for women on 30-day follow-up after hospitalization for mental illness to 0.1 percentage point higher for monitoring medications: diuretics. The measures favoring women included 3/3 primary screening measures, 4/6 secondary screening measures, 12/17 treatment measures, 2/5 intermediate outcome measures, and the 1 access measure. Of the 9 measures favoring men, 4 relate to cardiovascular disease and 3 to potentially harmful drug-disease interactions. Although 10 gender differences were small (< 1 percentage point), 6 exceeded 5 percentage points, including 1 measure favoring men.

The first column of Table 2 shows the odds of women receiving the indicated care compared to men, controlling for plan. When calculated within plans, 2 ORs were no longer statistically significant (1 favoring men) and the OR for 1 measure reversed from favoring women to favoring men.

As seen in the second column of Table 2, the informativeness of single-gender scores relative to overall scores was low. For 11/32 measures, informativeness was zero, indicating no evidence of variation in gender gap by plan (including 5/5 intermediate outcome measures). These consistent gender gaps occurred for 7/9 measures for which performance for men exceeds that for women.

Informativeness greater than zero implies that differences between women’s and men’s scores (hereafter the gender gap) vary by plan. Statistically significant plan-level correlations for 21 measures (7/9 screening measures, 13/17 treatment measures, and the single access measure, p < 0.001) indicate plan-level variation in the gender gap. Of these measures, 15 have informativeness above 0.30, with the highest (0.47) for pharmacotherapy management of chronic obstructive pulmonary disease exacerbation: bronchodilator. For these 15 measures, the best plans for men and women may differ meaningfully.

Specifically, relative to women in low-performing plans, men in low-performing plans had even lower scores for these 15 measures than would otherwise be expected. As seen in the third column of Table 2, the correlation between gender differences (women vs. men) and overall plan performance was negative and statistically significant for 8 measures. On these 8 measures, women had higher performance scores than men. The correlation between gender differences and overall plan performance was positive and significant for 2 measures, 1 of which favored men and the other of which had no average gender gap within plans. Thus, for these 2 measures, the gap is more favorable to women in plans with higher scores.

The different ways gender gaps and overall scores are related can be illustrated with three examples from Table 2. Within plans, women were more likely to receive the adult BMI assessment (significant OR of 1.11) and the informativeness of 0 is consistent with a constant true gender gap across plans. Diabetic women were also more likely than diabetic men to receive an eye exam (significant OR of 1.28), but the statistically significant informativeness (0.29) indicates that the gender gap varies across plans. However, we did not find evidence that this gender gap was correlated with overall plan performance; it was not likely to be larger (or smaller) in low-performing plans. Finally, women were more likely to receive a colorectal cancer screening (OR = 1.07) and, as with eye exams, there was evidence that the gender gap varies across plans (informativeness = 0.26). However, for this measure, the gender gap was also significantly negatively correlated with overall plan performance (r = −0.25), indicating that the gender gap tends to be less favorable to women in high-performing plans (and to men in low-performing plans).

Figure 1 illustrates how the gender gap varies across quintiles of plan performance for 4 measures with significant negative correlations between plan performance and the difference between performance scores of women and men. In the case of Rx therapy for rheumatoid arthritis and HbA1c testing for diabetics, the gender gap (favoring women) closes at high levels of overall plan performance, where men’s care is equivalent to women’s. In the case of colorectal cancer screening, the gender gap closes and men’s care exceeds women’s care at high levels. Similarly, for adult access to preventive/ambulatory services, performance scores are high in general, and the gaps favoring women are large but narrow as overall performance increases. Thus, for all 4 of these measures, while quality of care increases for both genders across quintiles of care quality, the relative gains are greater for men than women, which results in a smaller gender disparity in the highest-performing quintile.

Figure 1 Women’s and men’s HEDIS performance scores by plan performance quintile. Notes: p ≤ 0.001. Male = dashed line; female = solid line.

DISCUSSION

Our study demonstrates that among MA beneficiaries, women generally experienced better care than men (on 22/32 measures, almost all of which are screening or treatment measures). The only measures for which men’s care was more than 1 percentage point higher than women’s were intermediate outcome measures related to control of LDL-C and of high blood pressure, as well as treatment measures regarding potentially harmful drug-disease interactions. Although the plan-level correspondence between plan scores for women and men was generally high, the best-performing plan for women was not high-performing for men in some cases.

The two areas with gaps favoring men, intermediate outcome measures related to control of cardiovascular risk factors and treatment measures regarding potentially harmful drug-disease interactions, point to aspects of health care that might place women at increased risk of poorer quality of care. Mosca and colleagues found that providers were more likely to assign women with intermediate cardiovascular risk as assessed by the Framingham Risk Score to a lower risk category than men with identical risk factors.22 They also found that providers were less likely to prescribe statins to women and to increase the dose to achieve adequate LDL control, though this only partly explained the observed gender gaps in care. Similarly, in an analysis of preventive cardiovascular care among commercial managed care members in four metropolitan areas, Bird and colleagues found that LDL cholesterol control rates were 5 and 15 percentage points lower for women than men with diabetes mellitus (p < .0001) and coronary artery disease (p < .0001), respectively.23 They found that younger women were under-identified by a widely used algorithm to identify individuals for referral to disease management and wellness behavior support programs. In the case of potentially harmful drug-disease interactions, women’s poorer quality of care may be a function of their higher rates of comorbidities and associated risks of polypharmacy. Although women’s health may expose them to greater risk of drug-disease interactions, this would not justify their not receiving guideline-concordant care.

We also found that the gender gap is often larger in plans that perform poorly overall. On some measures for which women score higher than men, the difference tends to be smaller for high-performing plans. On measures for which men score higher than women, the gender gap typically varies little across plans. The findings suggest that stratifying quality assessments by gender could identify plans where either women or men are receiving worse care than expected based on what is known about plan performance. Gender-based quality reporting may also motivate quality improvement efforts for the lagging group.14 This approach could be used to set priorities and to monitor whether improvement efforts benefit both women and men. The information may also raise clinicians’ awareness of potential gender gaps in care of seniors. Provider groups and plans may need to coordinate to improve services. Moreover, our findings confirm the additional challenges of achieving equity for women on control measures compared to process measures, also a challenge for racial/ethnic disparities.5,24

Even small unexplained disparities in performance by gender signal faults in the delivery system that should be addressed. By analogy, very few planes crash and cause passenger deaths, but when they do, they present an opportunity to examine factors that contribute and create remedies that reduce future crashes and prevent other problems. Although gender differences in quality-of-care measures were often less than five percentage points, they reflect the care of millions of seniors. Therefore, a substantial number of people might benefit if gender gaps were closed through improvement for lagging groups. For example, 70.2% of women and 64.7% of men received glaucoma screening. In 2011, men accounted for 44% of approximately 56 million beneficiaries. Matching the female rate would add approximately 1.4 million glaucoma screenings. Likewise, given the importance of cardiovascular care for reducing mortality, many deaths might be prevented by increasing women’s quality of cardiovascular care to the average level currently received by men.
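The glaucoma screening estimate follows from the figures quoted above, as the short calculation below shows. The variable names are illustrative, and the calculation makes the simplifying assumption that every male beneficiary is eligible for the measure, which the text does not state.

```python
# Figures quoted in the text; variable names are illustrative.
beneficiaries = 56_000_000   # approximate number of beneficiaries in 2011
male_share = 0.44            # proportion of beneficiaries who are men
rate_women = 0.702           # glaucoma screening rate among women
rate_men = 0.647             # glaucoma screening rate among men

# Simplifying assumption: all male beneficiaries are eligible for screening.
men = beneficiaries * male_share
additional_screenings = men * (rate_women - rate_men)

print(f"{additional_screenings / 1e6:.2f} million additional screenings")
```

A gap of 5.5 percentage points applied to roughly 24.6 million men yields about 1.36 million screenings, which rounds to the 1.4 million cited.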

Gender differences, which are nearly constant across plans for some measures, suggest an opportunity to make gender gaps visible to plans, providers, and older adults and their families so that they will take actions to improve care and reduce gender disparities in quality. HEDIS data reporting has been used in the MA program since 2002 to measure disparities in care.25 MA plans have several tools at their disposal to improve quality, including education, reminders, and prompts for both beneficiaries and providers. They can also use payment incentives and contracting requirements to motivate improvement. Addressing gender gaps among MA beneficiaries might also drive efforts to address gaps in employer-sponsored care, which in turn could lead to improved health trajectories of older women and men as well as Medicare cost savings.

This study has several limitations. We can only speculate about the underlying causes of these gender differences. They may reflect differential treatment by the same providers, differences in the quality of providers seen within plans, differences in patients’ health behaviors (including preferences and adherence to provider recommendations), or, for control measures, gender differences in tolerance of medications and response to treatment.26 However, for some plans and measures, particularly for high-performing plans and screening or treatment measures, no gender gap exists. Although we lacked information on patient health and comorbidities, these quality measures refer specifically to care for which there is clinical consensus that it is indicated for the entire population for which it is assessed; the denominator specification for each measure includes only beneficiaries who meet the eligibility criteria for the service specified by the measure numerator.

Notwithstanding these limitations, the results have important clinical implications. HEDIS measures offer evidence-based standards of care for which there is general agreement. The observed gender disparities could result in adverse outcomes for men across the wide range of measures for which they experienced worse care and adverse outcomes for women in the areas of cardiovascular care and potentially harmful drug-disease interactions. Further reduction in women’s morbidity and mortality from cardiovascular disease depends on better addressing the disease and its risk factors in ambulatory practice and on reducing drug-disease interactions. Further research is needed to assess the underlying causes of gender-specific gaps in system performance and associated opportunities to improve care and outcomes.