Introduction

The cluster of cardiovascular risk factors known as the metabolic syndrome appears to be driven by underlying pathophysiological processes associated with insulin resistance, including adipocyte dysfunction, systemic inflammation, oxidative stress and disruption of cellular processes in target tissues [1,2,3]. Individuals classified as having the metabolic syndrome using traditional binary criteria such as those of the Adult Treatment Panel III (ATP-III) [4] have a high risk of disease progression, with an odds ratio of 5.16 for future type 2 diabetes [5]. Nevertheless, in most studies, a diagnosis of the metabolic syndrome does not confer additional predictive abilities beyond that of the individual metabolic syndrome components of elevated waist circumference (WC), high BP, high fasting triacylglycerol, low HDL-cholesterol and high fasting glucose [5]. Although abnormalities in these individual factors are not thought of as being causative of diabetes, researchers have questioned the value of the concept of the metabolic syndrome as an indication of the underlying pathophysiological processes (i.e. adipocyte dysfunction, systemic inflammation and oxidative stress) that themselves may contribute to the development of type 2 diabetes [6].

We developed a continuous metabolic syndrome severity z score that is specific to sex and race/ethnicity [7,8,9]. This score correlates, among children and adults, with risk of future CVD [10] and diabetes [11]. However, these results were observed in a relatively small cohort of children originally enrolled over 40 years ago that did not allow for analyses by race or sex, nor did it allow for in-depth examination of the risk associated with the severity of the metabolic syndrome above and beyond the individual metabolic syndrome components. In a more recent combined cohort of adults, we demonstrated that this score remains a significant predictor of CHD, even in models that include the individual metabolic syndrome components [12], suggesting that this score may capture additional risk conferred by biological processes underlying the metabolic syndrome [1].

The goal of this study was to assess the following: (1) the extent to which this continuous metabolic syndrome severity score was associated with future diabetes in a large, combined bi-racial cohort; (2) whether this score conferred additional risk prediction after accounting for each of the individual components of the metabolic syndrome; (3) whether the severity of the metabolic syndrome conferred differential risk by sex and race; (4) how the metabolic syndrome severity-related risk compared with the risk associated with ATP-III MetS; (5) whether a change in score over time conferred additional risk; and (6) how this risk compared with risk of CHD. We hypothesised that as a linear estimate of metabolic syndrome-related pathophysiology, this score would offer increased risk prediction for diabetes, both at baseline and over time, with potential relevance as a clinical tool.

Methods

Cohorts and exclusion criteria

This study and/or its analysis was approved by the Institutional Review Boards of the University of Florida, the University of Virginia and the study sites for the Atherosclerosis Risk in Communities Study (ARIC) and the Jackson Heart Study (JHS), and was carried out in accordance with the Declaration of Helsinki.

ARIC is a large community-based epidemiological cohort study that began in 1987–1989 across four field centres in the USA. Further details regarding the study design and objectives are published elsewhere [13]. ARIC data are available through ancillary study collaboration with ARIC [14]. A total of 15,792 primarily white and black participants aged 45–64 years old provided informed consent and were enrolled. We excluded individuals who did not consent (n = 395) and those who identified as a race/ethnicity other than white or black (n = 46). We used metabolic syndrome and risk factors from visits 1 and 2, and data on diabetes from visits 1–4.

The JHS began as an extension study of black participants in the Jackson, MS, site of ARIC and used similar methodologies. JHS data are available through ancillary study collaboration with JHS [15]. Starting in 2000–2004, 5301 participants aged 21–95 years provided informed consent and were enrolled; this included 1626 participants who had been followed as part of ARIC and for whom data from ARIC rather than the JHS were used for the present analysis [16]. For JHS participants who did not participate in ARIC, we used metabolic syndrome data from visits 1 and 2, as well as diabetes data from visits 1–3 (i.e. the final visit).

After combining the two cohorts as described above (n = 19,026), we further excluded participants with baseline diabetes (n = 2485), CHD (n = 973) or stroke (n = 393), as well as participants who were missing baseline data on metabolic syndrome components (n = 792) or who had non-fasting laboratory results (n = 507), and/or those without follow-up data regarding outcomes (n = 2992). This left 13,094 individuals for our primary analysis. For this group of participants, we performed a subsequent analysis on those who had a follow-up visit at which the metabolic syndrome score could be calculated and who had yet to develop diabetes in order to estimate the association between change in metabolic syndrome severity and future diabetes.

Assessment of covariates and outcomes

Exposure: the metabolic syndrome

Components of the metabolic syndrome were measured using similar approaches for both cohorts as previously described [16, 17]. The metabolic syndrome was defined using the criteria established by the ATP-III, i.e. the presence of three or more of the following criteria: elevated WC (≥102 cm for men, ≥88 cm for women), elevated fasting triacylglycerol (≥1.69 mmol/l [150 mg/dl]), reduced HDL-cholesterol (<1.04 mmol/l [40 mg/dl] for men, <1.29 mmol/l [50 mg/dl] for women), elevated BP (≥130 mmHg systolic or ≥85 mmHg diastolic, or drug treatment for hypertension) and elevated fasting blood glucose (≥5.55 mmol/l [100 mg/dl]) [4].

Continuous metabolic syndrome severity z scores at baseline were calculated for participants using sex- and race-based formulas. As described elsewhere [7, 8], these scores were derived using a confirmatory factor analysis approach for the five traditional metabolic syndrome components (WC, triacylglycerol, HDL-cholesterol, systolic BP, fasting glucose) to determine the weighted contribution of each component to a latent metabolic syndrome ‘factor’ on a sex- and race/ethnicity-specific basis. Confirmatory factor analysis was performed among adults aged 20–64 years from the National Health and Nutrition Examination Survey with categorisation into six subgroups based on sex and race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic). For each of these six population subgroups, loading coefficients for the five metabolic syndrome components were transformed into a single metabolic syndrome factor and used to generate equations to calculate a standardised metabolic syndrome severity score for each subgroup (http://mets.health-outcomes-policy.ufl.edu/calculator/, accessed 20 March 2017). The resulting metabolic syndrome severity scores are z scores (normally distributed and ranging from theoretical negative to positive infinity with mean = 0 and SD = 1) of relative metabolic syndrome severity on a sex- and race/ethnicity-specific basis. These scores correlate strongly with other markers of risk of the metabolic syndrome [18], including high-sensitivity C-reactive protein (hsCRP), uric acid and the homeostasis model of insulin resistance [8], with adiponectin [19] and with long-term risk of CVD [10, 12] and diabetes [11].

Outcome: type 2 diabetes

In ARIC, participants were defined as having type 2 diabetes if they reported that a physician had told them they had diabetes, if they had a fasting serum glucose ≥6.99 mmol/l (126 mg/dl) or a non-fasting serum glucose ≥11.10 mmol/l (200 mg/dl), or if they reported that they were taking insulin or oral hypoglycaemic medications [20, 21]. In the JHS, participants were defined as having diabetes if they had a fasting serum glucose ≥6.99 mmol/l (126 mg/dl) or an HbA1c ≥ 6.5% (48 mmol/mol) or if they had taken a diabetic medication within the 2 weeks before the clinic visit. These definitions of diabetes were used at visits 1–4 for ARIC participants and at visits 1–3 for JHS participants. As the area of primary interest was the incidence of diabetes, time to diabetes was, for those individuals without diabetes at visit 1, defined as the number of years between visit 1 and the first visit where diabetes was reported, regardless of diabetes status at subsequent visits.

Statistics

All statistical analyses were performed using SAS Version 9.4 (SAS Institute, Cary, NC, USA), and statistical significance was set to α = 0.05. To account for the interval censored time-to-event data (i.e. we do not know precisely when the diabetes developed), accelerated failure time models with an assumed Weibull distribution (using SAS PROC LIFEREG) were used, adjusting for baseline age and site (four sites from ARIC plus the JHS ‘site’). HRs of interest were estimated as a function of the variables and the Weibull shape parameter estimate.

ATP-III MetS was modelled with and without its individual (binary) components to establish whether the metabolic syndrome was a risk factor above and beyond its components, similar to others [17]. Interactions were also examined to determine whether risk varied by sex and race. In a similar fashion, metabolic syndrome severity was modelled, using quartiles of metabolic syndrome severity (defined naturally by the associated percentiles since it is a z score) and including quartiles for each of the metabolic syndrome components (defined by the analytical sample). This quartile approach allowed for an assessment of non-linear associations and threshold effects as well a reduction in the possibility of collinearity. Nonetheless, variance inflation factors were computed to assess the degree of collinearity when including the metabolic syndrome (or metabolic syndrome severity) and its components in the same model, with variance inflation factors greater than ten representing severe collinearity. Model fit was assessed and compared using Akaike’s information criterion (AIC), with smaller values indicating a better fit. An unadjusted Kaplan–Meier estimate of survival by quartile of metabolic syndrome severity was performed, accounting for interval censored data (PROC ICLIFETEST). With a linear relationship observed between metabolic syndrome severity and risk of incident diabetes, a model with only the continuous score was fitted, and a time-dependent receiver operating characteristic (ROC) curve (and AUC) was generated [22] using a SAS macro. For those participants who had two metabolic syndrome scores calculated at visits 1 and 2, time to incident diabetes was modelled with both the visit 1 score and the change in score (categorised). Given that this was an evaluation of the risk of future diabetes using a metabolic syndrome score that included fasting glucose as a component, we also performed a sensitivity analysis using a metabolic syndrome severity score without glucose as one of the contributing components (i.e. incorporating only the remaining four metabolic syndrome components) for the association of baseline levels of this score with later diabetes.

Results

Participant characteristics and disease incidence

We evaluated data from 13,094 participants meeting the inclusion criteria: 10,957 from ARIC and 2137 from the JHS. Participant characteristics at baseline by sex and race are shown in Table 1. At baseline, the prevalence of ATP-III MetS criteria was 31.2% and metabolic syndrome severity z scores at baseline were on average 0 and were highest among white men.

Table 1 Descriptive characteristics of included participants

Incident diabetes during a median of 7.8 years of follow-up was 12.2% for the entire cohort and differed significantly by sex and race, being highest among black women (17.1%) and black men (16.8%), intermediate among white men (11.7%) and lowest among white women (8.0%). Electronic supplementary material (ESM) Table 1 displays quartiles of metabolic syndrome severity score and diabetes incidence by ATP-III MetS status. Incident diabetes was higher among those with ATP-III MetS and higher by metabolic syndrome severity quartile, regardless of ATP-III MetS status.

Survival models for the metabolic syndrome and incident diabetes

Survival models for time to incident diabetes are shown in Table 2. ATP-III MetS was associated with future diabetes (HR 4.36, p < 0.01), as were abnormalities in the individual components when assessed together in a model. When ATP-III MetS was assessed in a model with the individual metabolic syndrome components, the association between the ATP-III MetS and future diabetes was not significant.

Table 2 Survival models: time to incident diabetes

Metabolic syndrome severity z score (assessed as quartiles for the cohort) was also associated with future diabetes, with an HR for the highest vs lowest metabolic syndrome severity score quartile of 17.4 (p < 0.01) (Table 2). Each of the individual metabolic syndrome components was also associated with future diabetes when assessed in a model together. In a model that included metabolic syndrome severity z score and the individual components, the metabolic syndrome z score remained significantly associated with future diabetes, with an HR for the highest vs lowest quartile of 3.69 (p < 0.01). In our sensitivity analysis assessing a score that lacked glucose concentration as one of the metabolic syndrome components, we found an overall similar degree of association of that score with future diabetes as noted for the metabolic syndrome severity score, both with and without the individual metabolic syndrome components (including glucose) in the hazard model (ESM Table 2).

Table 3 shows HRs by sex and race for the metabolic syndrome predicting incident diabetes in models with and without adjustment for individual metabolic syndrome components. Models for ATP-III MetS unadjusted for individual components exhibited both race × metabolic syndrome and sex × metabolic syndrome interactions, with white women having the highest association (HR 6.28) and black men the lowest association (HR 2.88). Following adjustment for the individual components, no interaction remained, with a non-significant protective association of ATP-III MetS among all participants (HR = 0.95, p = 0.594).

Table 3 Survival models: time to incident diabetes

Assessing correlations between the metabolic syndrome severity score and future diabetes in models unadjusted for the individual metabolic syndrome components, there was a sex × race × metabolic syndrome interaction, with black women showing the strongest association (HR for highest vs lowest quartile 33.6) and white men having the weakest association (HR 6.48) (Table 3). In a model that included the individual components, there was only a race × metabolic syndrome interaction, with black men and women having a higher degree of association (HR for highest vs lowest quartile 5.30) than white men and women (HR 2.24).

Kaplan–Meier plot of time to disease by metabolic syndrome severity quartile

Figure 1 displays Kaplan–Meier plots of time to incident diabetes by quartile of metabolic syndrome severity over a 10 year period. There was an increase in probability of 10 year diabetes-free survival ranging from 96% for the first quartile to 60% for the fourth quartile.

Fig. 1
figure 1

Kaplan–Meier curve of diabetes-free survival by metabolic syndrome severity quartile. Baseline metabolic syndrome percentile: brown line, <25th; green line, 25th–50th; red line, 50th–75th; blue line, >75th

ROC curves, sensitivity and specificity

ESM Fig. 1 shows a ROC curve for 10 year risk of diabetes by baseline metabolic syndrome z score. The AUC for metabolic syndrome severity as a predictor of diabetes was 0.77. This compared with a c statistic from logistic regression of 0.68 for ATP-III MetS. The cut-off of metabolic syndrome z score that maximised sensitivity and specificity for incident diabetes was 0.25, which had a sensitivity of 0.68 and a specificity of 0.72 (ESM Table 3). This compared with ATP-III MetS, which had a sensitivity of 0.64 and specificity of 0.73 for predicting diabetes.

Change in metabolic syndrome severity over time and incident diabetes

Metabolic syndrome z scores were available for 11,292 individuals at both visits 1 and 2, with a mean (±SD) change in z score of 0.2 (±0.4) units over 3.2 (±0.7) years. Of these, 700 (6.2%) individuals went on to develop diabetes over a mean 5.1 (±1.4) years after visit 2. Table 4 shows the results of a model that included metabolic syndrome severity at visit 1 and change in z score between visits 1 and 2. Compared with individuals with a change in score < 0, participants with a change >0.5 had a HR of 2.66 for incident diabetes (p < 0.001), independent of their visit 1 score.

Table 4 Survival model of time to incident diabetes after two clinic visits (n = 11,292)

Discussion

Although the abnormalities in CVD risk factors that comprise the metabolic syndrome have long been known to be associated with insulin resistance and metabolic dysfunction, the majority of prior studies have not found that the presence of the metabolic syndrome provided additional predictive power for future disease beyond the metabolic syndrome components themselves [5]. In the current study of a large, combined bi-racial cohort, we present novel findings: (1) that a continuous metabolic syndrome severity z score remained strongly associated with future diabetes, even in models that included the individual components of the metabolic syndrome; (2) that this association varied by race, with a higher risk among black participants; and (3) that the degree of change in metabolic syndrome score over a 4 year period conferred the additional risk. This study expands on our previous similar findings with respect to CHD [12] and our findings regarding risk of diabetes in a small, regional cohort [11]. These findings are consistent with the hypothesis that a higher degree of metabolic syndrome severity may contribute to future risk of diabetes, potentially as an estimate of the underlying metabolic dysfunction driving the abnormalities in the individual metabolic syndrome components [2, 3]. Furthermore, the heightened risk associated with the change in metabolic syndrome score over time suggests the potential to follow such a score over time (potentially provided automatically using an electronic medical record calculator) for ominous worsening of risk, as well potentially to motivate patients [3] and gauge the response to therapies [23].

It should be noted that adjustment for the individual metabolic syndrome components has a more logical basis in considering risk for CHD than for type 2 diabetes. This is because the suggestion has been that if risk of CHD could be reduced by merely addressing the abnormalities in the individual components (i.e. lowering triacylglycerol levels, decreasing BP and raising HDL-cholesterol), there might not be as much utility in knowing about the underlying processes that drive these abnormalities [6]. The case of type 2 diabetes is, however, different in that these individual components—while potentially linked by adipocyte dysfunction, systemic inflammation and oxidative stress—are not likely themselves to be individual drivers of diabetes. Nevertheless, the fact that the severity of the metabolic syndrome persists in its association with future diabetes in models that include each of these abnormalities suggests that the strength of their association together is itself a potentially important estimate of risk.

Prior to adjustment for the individual metabolic syndrome components, the metabolic syndrome severity-related risk for diabetes was threefold higher among women than men, and 30–80% higher among black than white participants. This occurred despite the differential weights of the metabolic syndrome components determined by sex and race to account for how the metabolic syndrome was manifested among each group. We had not previously noted interactions of sex or race with metabolic syndrome severity in the prediction of future CHD [12], suggesting that the sex- and race-specificity of the score may have estimated metabolic syndrome-related CHD risk similarly for all groups. The higher association of metabolic syndrome severity and diabetes risk among women (prior to adjustment for metabolic syndrome components) may reflect an ability to better separate individuals with ‘healthy’ obesity [24] compared with those with higher metabolic risk. Before adjustment for the individual metabolic syndrome components, women in the fourth quartile of metabolic syndrome severity (compared with the first quartile) had HRs of 25.5 and 33.6 for white and black women, respectively, emphasising the potential utility of such a score for risk identification that is more powerful than that of ATP-III MetS, which conferred HRs of 6.3 and 3.6. This sex × race × metabolic syndrome severity interaction was not present when the individual components of the metabolic syndrome were included in the model, providing further support that the elevated metabolic syndrome severity-related risk for women may be related to abnormalities in specific metabolic syndrome components. Nevertheless, metabolic syndrome severity continued to add predictive ability for future diabetes among white and black women in models adjusted for the components, supporting additional risk related to the processes driving the metabolic syndrome.

The differential metabolic syndrome-related risk between races had not previously been observed, possibly because earlier studies had fewer black participants [5, 25]. Indeed, it has been noted that the metabolic syndrome did not provide adequate risk prediction among black participants [26, 27], who have a relatively low prevalence of the metabolic syndrome as assessed using traditional binary criteria despite having high rates of diabetes [28] and CHD [29], as though the metabolic syndrome were failing to detect some degree of risk, particularly among black men [30,31,32,33]. It is partly for this reason that we formulated scores in an attempt to reflect how the metabolic syndrome is manifested on a sex- and race/ethnicity-specific basis [8, 34]. Although population-based metabolic syndrome severity was similar at baseline by sex and race, the higher metabolic syndrome-related risk of diabetes among black participants suggests an important interplay between metabolic syndrome severity and other factors contributing to diabetes progression.

Overall metabolic syndrome severity in these combined cohorts was less associated with risk of CHD [12] than of diabetes, as has previously been seen with respect to ATP-III MetS [5, 35]. In models of future disease prediction without the individual metabolic syndrome components, the fourth quartile of metabolic syndrome severity (compared with the first) had an HR of 4.0 for CHD and of 17.4 for future diabetes. This appears to emphasise that a greater degree of insulin resistance (potentially contributing to earlier beta cell exhaustion) and other processes associated with the metabolic syndrome are more directly related to diabetes risk than to risk of atherosclerosis per se. Nevertheless, for both disease outcomes, metabolic syndrome severity remained a significant risk factor, even when the individual metabolic syndrome components were included—and in a sensitivity analysis, metabolic syndrome severity was associated with risk of future diabetes, even when using a metabolic syndrome score without glucose concentration as a component.

Although a continuous score such as this provides many benefits, including improvements in power for statistical analysis and the ability to track changes over time, clinical application would probably benefit from cut-off values to identify individuals at highest risk. We found that a metabolic syndrome severity score cut-off of 0.25 provided reasonable sensitivity (0.68) and specificity (0.72) for future diabetes, although clearly such a cut-off could be adjusted to emphasise the detection of a larger number of individuals at risk or to try to maximise specificity. Overall, this was similar to the sensitivity and specificity for ATP-III MetS of 0.64 and 0.73, but with the added benefit of being able to track for changes in risk over time. We had not noted an obvious cut-off in the association between metabolic syndrome severity and CHD risk [12], and ultimately different cut-offs may be needed for different disease states.

This study had many limitations. We relied on identification of a diagnosis of diabetes at study visits, whereas the actual development of disease probably occurred at intermediate time points. In addition, the definition of type 2 diabetes differed between the two cohorts, with the JHS including elevated HbA1c as an indicator of diabetes; this would potentially identify more individuals who had previously unidentified diabetes but may have had normal fasting blood glucose levels. Because JHS participants made up 48% of the black participants analysed, there is the possibility that a higher identification of diabetes in this cohort could have affected the differences in ability of the metabolic syndrome severity score to predict diabetes by race/ethnicity. Participants were followed through the 1990s (ARIC) and 2000s (JHS), and interval changes in health maintenance may limit the generalisability to current populations. Nevertheless, participants from ARIC and other cohorts from the 1990s were also used in deriving the recent American College of Cardiology/American Heart Association risk prediction algorithm cohorts for risk of CHD, supporting the idea that population-based risk has not changed significantly [36]. This study had several strengths in the longitudinal follow-up of two large cohorts, with a significant representation of white and black, and male and female participants.

In conclusion, we found that a sex- and race/ethnicity-specific continuous metabolic syndrome z score was associated with risk of future diabetes in models that included each of the individual metabolic syndrome components. A score such as this may thus provide additional risk prediction related to the underlying processes that drive the clustering of abnormalities among metabolic syndrome components. Temporal increases in the score escalated risk, supporting the potential to track score levels using electronic medical records to trigger additional therapies and follow treatment response over time.