figure a

Introduction

Educational inequality in type 2 diabetes is evident in many high-income countries, in which less education is associated with markedly higher incidence of type 2 diabetes [1, 2]. The aetiology of type 2 diabetes is multifactorial and complex, but being overweight and obese are key modifiable risk factors [3, 4]. Since being overweight and obese are often more prevalent among people with a lower education level in high-income countries [5], it is plausible that an interrelationship between education level and being overweight/obese could explain the educational inequality in type 2 diabetes incidence.

This interrelationship can be studied within a theoretical framework of the social basis of disparities in health [6, 7], previously used by the WHO in their work on social determinants of health [8]. The framework points to two key mechanisms (Fig. 1): (1) differential exposure, in which the distribution of being overweight and obese is different across educational levels. More cases of type 2 diabetes develop in groups with lower education levels because they are more exposed to being overweight and obese; (2) differential susceptibility, in which the effect of being overweight and obese on the risk of type 2 diabetes is different across educational levels. More cases of type 2 diabetes develop in groups with lower education levels because being overweight and obese is more harmful in these groups. The increased susceptibility is caused by simultaneous exposure to other complementary risk factors for type 2 diabetes, such as lack of physical activity, poor diet, smoking and chronic stress, which all tend to accumulate in individuals with less education [9, 10]. The concept of differential susceptibility is closely linked with the sufficient-component-cause model [11], in which the effect of one cause is dependent on the presence of other interacting causes of the same disease. These mechanisms are not mutually exclusive [6, 7, 12] and may work simultaneously to create and uphold educational inequality in type 2 diabetes incidence.

Fig. 1
figure 1

The proposed mechanisms of differential exposure (red arrow) and differential susceptibility (blue arrow) to being overweight and obese across educational levels, leading to educational inequality in type 2 diabetes incidence. All other pathways from education level to type 2 diabetes, not working through being overweight and obese, are depicted by (purple arrow)

Previous studies have found that differential exposure to being overweight/obese could partly explain the association between education level and type 2 diabetes incidence [13,14,15,16,17,18,19,20,21]. Most often, however, these studies assess differential exposure using methods inspired by Baron and Kenny [22], typically comparing estimates from regression models with and without adjustment for BMI [13,14,15,16,17,18,19,20]. This is often referred to as the difference method [23]. This method adequately estimates the strength of a mediator (e.g. being overweight/obese) for an association (e.g. between education level and type 2 diabetes) when used in linear regression models without interactions between the exposure and the mediator [24, 25]. However, the method is typically not viable for nonlinear models or in settings where the effect of a mediator is different across exposure categories (i.e. differential susceptibility) [25]. This limitation generally also applies to other so-called traditional approaches to mediation analysis, such as the product-of-coefficient method [23].

Counterfactual-based approaches address some of these limitations [23]. Nonetheless, we have only identified one study that used such an approach to address the relationship between education level, being overweight/obese and diabetes incidence. The study found that a sizeable part of the educational inequality in diabetes was mediated through BMI [21], but it did not assess a potential interaction between education level and BMI. Therefore, to our knowledge, no previous study has assessed the impact of differential susceptibility to being overweight/obese on the educational inequality in type 2 diabetes incidence.

Knowledge of the impact of differential susceptibility to being overweight and obese can be used to guide policies and interventions aimed at reducing rates of type 2 diabetes [7]. Specific educational groups could be targeted if they are found to be more susceptible to the adverse effects of being overweight and obese. Furthermore, interventions that would reduce the number of people who are overweight and obese to the same extent in all educational groups would also provide a greater reduction in the rates of type 2 diabetes among the groups that are more susceptible.

In this study, we aimed to assess the educational inequality in type 2 diabetes incidence, and to quantify the contributions of differential exposure and susceptibility to being overweight and obese to this association.

Methods

In this longitudinal cohort study, we applied a counterfactual approach to mediation analysis based on a marginal structural model [26] and a three-way decomposition of effects [27]. This approach enabled simultaneous estimation of mediation (differential exposure) and interaction (differential susceptibility). We compared these results with estimates from a conventional approach to mediation analysis, specifically a difference method comparison of regression model estimates [23]. This method can only assess mediation, so we, therefore, used a separate method to assess interaction, namely an additive joint effects model [28]. We refer to these methods as conventional approaches throughout the paper.

Study population

The data used was previously collected for the Diet, Cancer and Health (DCH) study, which was a population-based prospective cohort of 57,053 Danish men and women who entered the study between 1993 and 1997 [29]. All participants were between 50 and 64 years old at baseline and were born in Denmark [29]. Access to DCH study data was obtained through the Social Inequality in Cancer (SIC) cohort, which is a consortium of several Danish population-based cohorts linked to social and health registers [30]. Following the exclusion of 1247 individuals because of a cancer diagnosis before baseline or because of previous participation in another cohort in the consortium, data on 55,806 participants in the DCH study were available through the SIC cohort. We excluded 2647 participants who had missing information on key variables, had a BMI below 18.5 or had diabetes at baseline. The study population available for the analyses comprised 53,159 individuals (see the flowchart in electronic supplementary material [ESM] Fig. 1).

The DCH study was approved by the Regional Ethical Committees in Copenhagen and Aarhus ([KF] 01–345/93) and the Danish Data Protection Agency (1991–1110-635). The participants gave informed consent to participation in the study and informed consent to search for personal data in medical registers.

Education

Education was categorised into three levels according to highest educational attainment: low (primary or lower secondary education), medium (upper secondary, vocational or technical education) and high (tertiary education of short- and medium-cycle non-university programmes and long-cycle university programmes). Information was drawn from the Population Education Register 1 year prior to baseline in the DCH study.

Overweight and obesity

Participants were categorised as being overweight and obese using BMI (weight in kg/[height in m]2) calculated from baseline anthropometric measurements conducted by a laboratory technician. Participants were divided into three weight categories: normal weight (18.5–24.9 kg/m2), overweight (25–29.9 kg/m2) and obese (≥30 kg/m2). People with a BMI below 18.5 kg/m2 were excluded because it is unclear whether diabetes in these individuals is pathophysiologically similar to type 2 diabetes [31].

Covariates

Potential confounders were identified based on prior knowledge and the method of directed acyclic graphs [32]. ESM Fig. 2 shows a simplified depiction of the assumed causal relationships between education level, being overweight/obese and type 2 diabetes. We identified age, sex, ethnicity, family socioeconomic position, family history of type 2 diabetes and fetal/neonatal development as potential baseline confounders. Data were not available on familial and early-life factors. Accordingly, we adjusted our analyses for sex, ethnicity and age (underlying timescale in the additive hazards model). Ethnicity was assessed by country of origin and dichotomised into Danish if participants descended from Danish parents and non-Danish if participants descended from foreign-born parents not holding Danish citizenship. We acknowledge that country of origin is an imperfect proxy for ethnicity. Information was drawn from the Danish Civil Registration System.

We also identified several potential confounders of the association between being overweight/obese and type 2 diabetes, namely cohabitation status, chronic stress exposure, physical activity levels, dietary factors, alcohol intake and smoking. However, since these factors were also associated with education level, they were simultaneously mediators and confounders (so-called intermediate confounders). Controlling for these factors was necessary to reduce confounding in analysis of the relationship between being overweight/obese and type 2 diabetes. Doing so, however, would block some of the direct effect of education level on type 2 diabetes. Therefore, both adjusting and not adjusting for these may have led to biased estimates, possibly even in the same direction [24]. Hence, these factors were omitted in the main analysis but were included in a sensitivity analysis (with the exception of chronic stress exposure, for which no data was available).

Type 2 diabetes

Incident type 2 diabetes was ascertained as the date of inclusion in the National Diabetes Register (NDR), which requires meeting one of several diabetes-related criteria in central Danish health registries [33]. Briefly, the criteria were registration of: (1) diabetes diagnosis defined by ICD-8 (www.wolfbane.com/icd/icd8h.htm) and ICD-10 (http://apps.who.int/classifications/icd10/browse/2016/en) codes (ICD-9 was never used in Denmark); (2) use of chiropody (foot therapy) as a diabetic patient; (3) second purchase of oral glucose-lowering drug (OGLD) or insulin; and (4) five instances of blood glucose measurement within 1 year or two instances of blood glucose measurements in 5 consecutive years. Previous studies have indicated that up to 20% of cases in the NDR might be false positives owing to the inclusion criteria of five blood glucose measurements during 1 year [34]. We conducted a sensitivity analysis of this issue by excluding people registered by the blood glucose measurement criteria. Also, the registration dates are systematically delayed for more than 30% of the registrants in NDR before 1997 [34]. However, incidence data are complete from 1995 [33]. We conducted two sensitivity analyses of this issue by excluding people registered before 1995 and 1997, respectively.

The participants were followed from the date of study entry into the DCH study until the date of incident diabetes, death, emigration or end of follow-up (31 December 2012), whichever came first. Follow-up ended because the NDR was discontinued at the end of 2012. Information about migration and death, used for censoring purposes, was drawn from the Danish Civil Registration System. Due to emigration, 1.2% were lost to follow-up. The mean follow-up time was 14.7 years (SD: 4.1).

Statistical analyses

In both the counterfactual and the conventional approaches, we applied Aalen’s semiparametric additive hazards regression model [35]. We estimated the absolute rate differences in incident cases of type 2 diabetes per 100,000 person-years between groups with different educational levels (reference: high education) and in different BMI categories (reference: normal-weight BMI category).

In the conventional approach, we applied the difference method to estimate the proportion of the rate differences of type 2 diabetes between educational levels that could be attributed to being overweight/obese. Here, mediation through being overweight/obese is quantified as the difference between rate differences of type 2 diabetes estimated with and without adjustment for BMI. Also, we applied an additive joint effects model [28] to estimate the proportion of the educational rate differences of type 2 diabetes that could be attributed to an interaction between education level and BMI. Here, interaction is quantified as deviation from additivity of effects [36], i.e. the extent to which the rate difference in a doubly exposed group (to e.g. low education level and being obese) deviates from the sum of rate differences in the separately exposed groups ([e.g. low education level, being normal weight] + [high education level, being obese]).

In the counterfactual approach, we defined the effects of education level acting through being overweight/obese on type 2 diabetes using a nested counterfactual framework [23]. Briefly, we estimated the rate differences of counterfactually changing a given educational level from the reference level (high education) to an exposure level (medium or low education), which corresponds to the total effect of education level on type 2 diabetes. We applied the marginal structural approach by Lange et al. [26] and the three-way decomposition of effects by VanderWeele [27] to decompose the total effect of education level into three effects: (1) the pure natural direct effect (PNDE) of education level on type 2 diabetes; (2) the pure natural indirect effect (PNIE) of education level on type 2 diabetes mediated through BMI; and (3) the mediated interactive effect (MIE) of an interaction between education level and BMI on type 2 diabetes. The PNIE corresponds to differential exposure and the MIE corresponds to differential susceptibility [12]. The PNDE corresponds to all other (unaccounted) pathways than through being overweight/obese. The analytical framework in the counterfactual approach is further outlined in the ESM Methods. In an auxiliary analysis, we also stratified this analysis by sex.

Age was the underlying timescale in all additive hazards models. The time-constant hazard difference assumption was tested for education level and BMI and was not violated. We used robust standard errors to compute 95% CIs for the decomposed effects. To obtain 95% CI for the total effect and the proportions of the decomposed effects, we repeated the analysis on 10,000 bootstrapped samples. The statistical technique and its implementation in R are described elsewhere [37, 38]. All analyses were conducted using SAS 9.4 [39] and R 3.5.0. [40].

Results

Descriptive characteristics stratified by educational level for the 53,159 participants are shown in Table 1. Education was associated with being overweight and obese at baseline as more people in the high level of education group were normal weight and more people in the low level of education group were obese. There were more women in the low-level education group compared with the medium- and high-level education groups. A total of 7564 participants developed type 2 diabetes during follow-up (Table 2) and the proportion who developed type 2 diabetes was higher in groups with lower education level and markedly higher in groups with higher BMI.

Table 1 Descriptive characteristics for the 53,159 participants by level of education
Table 2 Numbers and proportions of cases of type 2 diabetes and rate differences in incident cases of type 2 diabetes per 100,000 person-years (95% CI) by education level and BMI

The results of the assessment of differential exposure using the conventional approach (Table 2) show that low education level was associated with 455 (CI 398, 511) additional cases of type 2 diabetes and medium education level was associated with 316 (CI 269, 360) additional cases per 100,000 person-years, as compared with high education level. According to the difference method, 34% of the additional cases of type 2 diabetes in the medium-level education group and 45% in the low-level education group could be attributed to differential exposure to being overweight and obese across educational groups (without accounting for differential susceptibility).

The result of the assessment of differential susceptibility using the conventional approach (Table 3) show an overall additive interaction between education level and BMI (p = 0.003). Compared with the associations in the high-level education group, being overweight was associated with 14% of additional cases of type 2 diabetes and obesity was associated with 19% of additional cases in the low-level education group. In the group with a medium level of education, being overweight was associated with 13% of additional cases of type 2 diabetes and obesity was associated with 19% of additional cases. Thus, without accounting for differential exposure, the joint effects method suggests that between 13% and 19% of the educational inequality in type 2 diabetes incidence can be attributed to differential susceptibility to being overweight/obese across educational groups.

Table 3 Rate differences in incident cases of type 2 diabetes per 100,000 person-years (95% CI) in groups jointly categorised by education and BMI

Table 4 shows the results from the counterfactual approach, which simultaneously quantifies differential exposure and susceptibility. Low-level education was associated with 454 (CI 398, 510) additional cases of type 2 diabetes and medium-level education was associated with 316 (CI 268, 363) additional cases per 100,000 person-years, as compared with a high level of education. These rate differences were essentially similar to the rate differences shown in Table 2, with only modelling variation. Decomposing the rate differences showed that differential exposure to being overweight and obese accounted for 37% (CI 31%, 45%) of the additional cases of type 2 diabetes in the low-level education group and 29% (CI 24%, 36%) in the medium-level education group. Differential susceptibility to being overweight/obese accounted for 9% (CI 4%, 14%) of the additional cases of type 2 diabetes in the low-level education group and 6% (CI 3%, 10%) of the addition cases in the medium-level education group.

Table 4 Natural effects model estimating the total and decomposed rate differences in incident cases of type 2 diabetes per 100,000 person-years due to counterfactual changes in educational level (95% CI)

In an auxiliary analysis, we stratified the counterfactual analysis by sex (ESM Table 1). Here, the overall educational inequality in type 2 diabetes incidence was larger for men than for women, but the proportion attributable to differential exposure to being overweight/obese was larger for women. The proportions attributable to differential susceptibility were similar. We also conducted a number of sensitivity analyses. First, we adjusted the counterfactual analyses for the intermediate confounders that were omitted from the main analysis, namely physical activity level, dietary factors, alcohol intake, smoking and cohabitation status. Descriptive characteristics are shown in ESM Table 2. Adjusting for these factors showed somewhat similar results to those without adjustment (ESM Table 3). However, as expected, the effects of differential exposure and susceptibility to being overweight/obese were stronger and the effect of other pathways was weaker compared with the analysis without adjustment. Further, excluding people registered in the NDR by the blood glucose measurement criteria did not substantially influence the results (data not shown). Finally, excluding people registered in the NDR with type 2 diabetes before 1995 or 1997 did not influence the results (data not shown).

Discussion

In this longitudinal cohort study of 53,159 men and women, we found educational inequality in type 2 diabetes incidence. In accordance with the applied theoretical framework, we hypothesised that at least some of this inequality could be attributed to the mechanisms of differential exposure and susceptibility to being overweight and obese across educational levels. We found that roughly one-third of the inequality could be attributed to differential exposure to being overweight/obese and that a smaller, yet considerable, proportion could be attributed to differential susceptibility to being overweight/obese across educational levels. These findings are consistent with the employed theoretical framework as both mechanisms play important roles for the educational inequality in type 2 diabetes. Comparing counterfactual and conventional approaches, we found that the mechanisms of differential exposure and differential susceptibility to being overweight and obese were not mutually exclusive and that they should be considered simultaneously to fully understand the impact of being overweight/obese on the educational inequality in type 2 diabetes incidence.

The magnitude of the educational inequality in type 2 diabetes incidence in this study is in accordance with other studies from high-income countries [1, 2]. Previous studies have found that between 8% [17] and 64% [18] of the association between education level and type 2 diabetes could be attributed to differential exposure to being overweight/obese [13,14,15,16,17,18,19,20,21]. Our findings corroborate this evidence and underscore that differential exposure to being overweight/obese is an important mechanism in the association between education level and type 2 diabetes incidence.

The impact of the potential interaction between education level and being overweight/obese on type 2 diabetes incidence have previously been assessed on a multiplicative scale [15, 16, 41, 42]. From a public health perspective, it has been argued that interactions should be assessed as deviation from additivity (i.e. interaction on an additive scale), since this approach directly identifies the subgroups in which most cases could potentially be prevented [28, 36]. To our knowledge, the finding of an additive interaction between education level and being overweight/obese on the risk of type 2 diabetes (i.e. differential susceptibility) is novel, and it illustrates the complex interplay between social factors and being overweight/obese that underlies the educational inequalities in type 2 diabetes.

We also compared conventional and counterfactual approaches to mediation and interaction analysis and found that the estimates of both differential exposure and susceptibility were weaker when evaluated simultaneously in the counterfactual approach than when evaluated separately using the conventional approaches. The finding that a mediator could be attributable to a larger part of an association when conventional methods are applied, compared with when counterfactual methods are applied, has previously been shown in other settings within social epidemiology [43]. It is, therefore, important to highlight that the choice of methodological approach may affect the evaluation of the importance of differential exposure and differential susceptibility to a particular risk factor. It is essential for the understanding of social inequalities in health to acknowledge that these two mechanisms are not mutually exclusive and that analyses of these mechanisms should consequently accommodate them simultaneously.

Strengths

We used longitudinal data with a clear temporal ordering and separation between education level, being overweight/obese and type 2 diabetes incidence. The large observational cohort, with a long follow-up period and a large number of cases, gave us sufficient power to investigate mediation and interaction simultaneously. With this setup, we employed counterfactual mediation methods to disentangle the contribution of differential exposure and differential susceptibility to being overweight/obese in the association between education level and type 2 diabetes incidence, which is a novel approach in this setting. In addition, we assessed differential exposure and differential susceptibility on an absolute scale and, thus, directly estimated the number of type 2 diabetes cases that could potentially be prevented.

Limitations

Identification of natural direct and indirect effects requires that four non-confounding assumptions are fulfilled, and at least two of these are likely violated in our study. First, the effect estimates of educational level on type 2 diabetes may be inflated due to unmeasured confounding from early-life factors, such as family socioeconomic position [44], family history of type 2 diabetes [45] and fetal/neonatal factors [46].

Second, intermediate confounders of the relationship between being overweight/obese and type 2 diabetes may bias the estimated contributions of differential exposure and susceptibility (but not the overall association between education level and type 2 diabetes). Factors including cohabitation status, chronic stress exposure, physical activity level, dietary factors, alcohol intake and smoking are associated with both being overweight/obese and type 2 diabetes incidence. In our data, those with a lower education level were less physically active, smoked more, drank less and had a lower intake of fruit and vegetables than those with a high education level, while cohabitation was most common among those with a medium level of education (ESM Table 2). A measure of chronic stress exposure was not available. Adjusting for these variables showed somewhat similar results to those without adjustment (ESM Table 3). However, as expected, the effect of education level through other pathways (i.e. the direct effect) was weaker, and the effects of differential exposure and susceptibility to being overweight/obese were stronger. While we may not conclude that the true estimate lies between the unadjusted and adjusted estimates, it is encouraging that we still find evidence of differential exposure and susceptibility when adjusting for these variables.

Selection bias may have attenuated both the overall association between education level and type 2 diabetes and the proportion of the effect attributable to differential exposure and susceptibility. Participants in the DCH study had higher education [29] and a lower mortality rate [47] than non-participants. Our study population is, thus, disproportionately comprised of higher-educated and healthier individuals than the source population, which may affect the translatability of our findings to other settings.

Bias in the measurement of being overweight/obese is also a potential limitation. The BMI of the participants was likely to fluctuate throughout the follow-up period, yet we only had baseline measures available. One study has shown that models using either single or longitudinal measures of BMI did not differ substantially in explaining occupational inequality in type 2 diabetes incidence [48]. This suggests that measurement bias caused by having only one measurement of BMI might not be severe in our study.

The measurement of type 2 diabetes incidence might also be subject to bias. First, it has been suggested that 26% of individuals who would clinically qualify for a type 2 diabetes diagnosis in Denmark are not diagnosed [49]. The proportion is possibly higher in groups with a lower level of education [49], which could lead to differential misclassification, resulting in an underestimation of the association between education level and type 2 diabetes. Next, several validity issues have been reported with the NDR, including that up to 20% of cases included might be false-positive cases and that registration dates before 1997 are systematically delayed [34]. We assessed these issues in separate analyses and did not find a substantial impact on the results (data not included). Additionally, the NDR identifies patients with type 2 diabetes by the purchase of OGLDs, among other criteria. These drugs can be prescribed to patients with impaired glucose tolerance and/or impaired fasting glucose who do not meet the diagnostic criteria for type 2 diabetes, resulting in a false-positive case. However, the OGLD criteria has been shown to have a high positive predictive value of 95% when validated against medical records [33]. A new diabetes register based on Danish national registries has been developed [50] but was not available for this study.

Conclusions and implications

The finding of differential susceptibility to being overweight and obese (i.e. that being overweight and obese cause more cases of type 2 diabetes in groups with lower levels of education) might inform practitioners and policymakers. Practitioners may include information on education level or risk factors associated with education level to more accurately assess type 2 diabetes risk in patients. Policymakers may take into account that obesity-reducing interventions could have a higher impact on reducing type 2 diabetes rates if groups with lower education levels are specifically targeted. Such knowledge is especially useful in settings with sparse resources. Additionally, a population-wide reduction in the number of people who are overweight/obese may benefit groups with lower education levels more than groups with higher education levels in terms of lowering type 2 diabetes rates.

Future studies will need to replicate the finding of differential susceptibility to being overweight and obese. It would also be important to disentangle the mechanism of differential susceptibility to investigate whether it is the lower education level in itself or the associated contributing risk factors that are responsible for the higher rate of cases among those with low education levels. In addition, studies that include data on early-life determinants of education level, being overweight or obese, and type 2 diabetes might shed further light on the causal mechanisms behind these associations. This study has also shown the potential of the recent developments in counterfactual mediation analysis and we suggest that other researchers implement these methods in their inquiry into the social inequalities in type 2 diabetes.