figure b

Introduction

The prevalence of type 2 diabetes is increasing with the rising burden of obesity and sedentary lifestyle [1]. Among patients with type 2 diabetes, CVD is the leading cause of morbidity and mortality, highlighting the need for implementation of effective cardiovascular preventive strategies that may match the intensity of therapy to the baseline risk [2]. A critical next step for CVD prevention is to identify individuals with type 2 diabetes who have heightened but modifiable risk for adverse cardiovascular events and may derive the greatest benefit from initiating or intensifying disease-modifying therapies, such as sodium-glucose cotransporter 2 (SGLT-2) inhibitors and glucagon-like peptide 1 (GLP-1) receptor agonists [3, 4]. Accordingly, recent studies have developed clinical risk scores to identify those who are at the highest risk for adverse cardiovascular events among cohorts of individuals with type 2 diabetes [5,6,7]. However, these traditional risk assessment models do not account for multidimensional data and do not differentiate between modifiable vs non-modifiable risk.

Prior studies have utilised machine learning to identify clusters of patients with traditional risk factors, like primary hypertension [8], and subgroups of patients with prevalent CVD, including CHD [9] and heart failure [10,11,12]. While clustering may identify clinically unique subtypes in a given population, unsupervised machine learning may identify subtypes that are unrelated to patient outcomes. Conversely, supervised machine learning may yield subgroups that are not biologically meaningful. Few studies have characterised the benefits and shortcomings of different clustering strategies to identify type 2 diabetes patient phenotypes. The aim of the present study was to compare semi-supervised clustering methodologies to identify phenotypic subgroups within a cohort of individuals with type 2 diabetes at high CVD risk enrolled in the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial [13]. For this, we evaluated a novel semi-supervised methodology in addition to several clustering techniques that have been previously studied [14]. Subsequently, the phenogroups derived from the optimal clustering strategy were externally validated in two separate cohorts of patients with type 2 diabetes from the Look Action for Health in Diabetes (Look AHEAD) and Bypass Angioplasty Revascularisation Investigation 2 Diabetes (BARI 2D) trials [15, 16].

Methods

Study populations

The derivation cohort comprised participants from the ACCORD trial and its long-term observational follow-up study (ACCORD Follow-on [ACCORDION]) [13, 17, 18]. A detailed description of the ACCORD trial is provided in the electronic supplementary material (ESM) Methods. Participants with a history of CVD (i.e. previous stroke or CHD) or >10% missing data were excluded. External validation of the phenomapping approach was performed in participants from the Look AHEAD trial. The prognostic usefulness in identifying patients who will benefit from type 2 diabetes therapies was also assessed in the ACCORD and Look AHEAD cohorts. Furthermore, to assess the generalisability of the phenomapping approach in a higher-risk cohort, we additionally evaluated the prognostic significance of the phenogroups and their interaction with revascularisation among participants of the BARI 2D trial who had type 2 diabetes and coronary artery disease. The study design and results from the Look AHEAD and BARI 2D trials have been published previously and descriptions are provided in the ESM Methods [15, 16, 19]. All participants in ACCORD, Look AHEAD and BARI 2D had type 2 diabetes at baseline.

Outcomes of interest

The primary outcome of interest for the present study was a major CHD event defined as a composite of fatal myocardial infarction (MI), non-fatal MI or unstable angina. Each component of the primary outcome was evaluated as a secondary outcome and adjudicated by a central committee masked to randomised assignments in ACCORD. In the external validation cohorts, the primary outcome included a composite of fatal or non-fatal MI in BARI 2D with the addition of hospitalisation for angina in Look AHEAD. Descriptions of the outcomes evaluated in each of the three trial cohorts are included in the ESM Methods [13, 15, 16]. The secondary outcomes of interest for the present study were all-cause mortality, cumulative severe/symptomatic hypoglycaemic (SH) events and treatment non-adherence. A detailed description of the secondary outcomes is provided in the ESM Methods.

Phenotypic variable selection

The detailed variable selection and clustering methodologies are described in the ESM Methods and outlined in ESM Fig. 1. A total of 97 candidate variables were considered for inclusion. After excluding covariates with >10% missingness, high correlation (|r| > 0.70), or composites of already present covariates a total of 44 continuous and categorical variables were included in the final analysis. Prior to clustering, missing data (not including the outcome variable) were singularly imputed [20].

The Wald index was used to identify relevant covariates [21]. The Wald index is a hypothesis test of statistical significance and is analogous to the Cox score (a measure of the correlation between the covariate and outcome of interest). The Wald index ranges from zero to infinity and a higher value indicates a stronger association between the covariate and outcome. The Wald index for each covariate was calculated and ranked. Due to the limited performance increase in clustering more than 20 covariates, only variables with the highest 20 Wald indices were included [11].

Clustering methodologies

After identifying the 20 covariates to be included in the analyses, continuous variables were scaled and standardised (mean = 0; standard deviation = 1). Four clustering methodologies were evaluated: Gaussian mixture model (GMM)-based clustering of continuous variables, latent class analysis (LCA) of categorical and binned continuous variables, finite mixture model (FMM)-based clustering of both continuous and categorical (mixed) data, and dichotomisation of the principal component (principal component analysis [PCA]) of mixed data into two clusters. An explanation of each method is detailed in the ESM Methods. For each phenomapping method, the Bayesian information criterion (BIC) and Dunn index were calculated to determine the optimal number of clusters (evaluated two to seven, except for PCA) [22, 23].

Optimal clustering method

The optimal clustering method was defined according to the best-performing BIC (lower values considered better), Dunn index (higher values considered better) and prognostic utility. The prognostic value of the phenogroup membership was determined by evaluating the improvement in performance in the 2013 pooled cohort equation (PCE) with vs without the phenogroup variable [24]. Improvement in mean positive predictive value (PPV) was used to assess the comparative utility of PCE-only and PCE + phenogroup [25]. The different clustering methodologies were also independently compared in the Look AHEAD trial validation cohort.

Once identified, we further assessed the prognostic utility of the optimal clustering method by evaluating the improvement in mean PPV with the Swedish National Diabetes Register (SNDR) score with and without phenogroup membership [26]. Finally, decision curves, a measure of the number of true positive cases identified without an increase in the false-positive rate, were constructed to compare phenogroup ± PCE and phenogroup ± SNDR scores [27].

Statistical analysis

In the derivation cohort, after identification of the optimal clustering method and respective clusters, phenogroup characteristics were compared across the subgroups. Categorical variables were summarised as percentages and differences across phenogroups were evaluated using the χ2 test. Continuous variables were summarised as means (standard deviations) and differences across phenogroups were evaluated using one-way ANOVA. Time-to-event analyses and logrank tests were performed to evaluate and compare the unadjusted risk of the primary outcome across phenogroups. Multiplicative interaction terms were included in multivariable-adjusted Cox models to evaluate whether phenogroup modifies the association between the treatment group and risk of the primary outcome with adjustment for the following covariates: age, sex, race, education level, treatment arm, BMI, systolic BP, alcohol use, smoking, total cholesterol, serum creatinine, high-density lipoprotein cholesterol and diabetes duration. The cumulative incidence of all-cause mortality across phenogroups was also assessed. For SH event and glycaemic medication non-adherence outcomes, one-way ANOVA and negative binomial regression were used to evaluate for differences in the cumulative number of events across phenogroups with the same model adjustments described above. To further assess the robustness of the primary analysis results, multiple sensitivity analyses were performed as outlined in ESM Methods.

In the external validation cohort of the Look AHEAD trial, we predicted which phenogroup each individual most closely resembled using multinomial logistic regression (MLR). Agreement between the MLR-predicted phenogroups and FMM phenomapping in the external validation cohort alone was determined by the Cohen’s κ statistic [28]. In the higher-risk cohort of type 2 diabetes patients from the BARI 2D trial, a similar MLR approach was also used to stratify the participants across the three phenogroups. The differences between predicted phenogroups with respect to baseline characteristics, risk of primary and secondary outcomes, and treatment interactions were analysed in both the validation cohorts similarly to the primary analysis. Analyses were performed with R version 3.6.3 (R Foundation for Statistical Computing, Vienna, Austria) with p < 0.05 used as the criterion for statistical significance.

Ethics approval

All participants provided written and informed consent and the study was approved by the ethics committee at each participating trial site. The present study was considered exempt by the Institutional Review Board at the University of Texas Southwestern Medical Center, Dallas, Texas.

Results

In the ACCORD trial, there were 6537 (63.8%) participants without CVD at baseline of which 6466 (63.1%) had <10% missing data and were included in the analyses. A detailed description of the covariates and inclusion/exclusion criteria is provided in ESM Table 1. A total of 44 variables were included in the semi-supervised clustering procedure to identify the top 20 most significant variables to be used in the clustering analysis (Fig. 1). The validation analysis included 4211 of 4906 (85.8%) participants with available follow-up data and no prior CVD who were enrolled in the Look AHEAD trial and 1495 of 2368 (63.1%) participants enrolled in the BARI 2D trial. Baseline characteristics of the derivation and validation cohorts are provided in ESM Table 2.

Fig. 1
figure 1

Top 20 variables based on univariate Wald index scores in the derivation cohort (ACCORD). HDL-c, HDL-cholesterol; MNSI, Michigan Neuropathy Screening Instrument

Determination of the optimal phenomapping strategy and development of a phenogroup classifier

When comparing the four phenomapping methods, clustering with FMMs and three phenogroups had the best-performing internal validation metrics including lowest BIC and highest Dunn index (ESM Table 3, ESM Fig. 2). When phenogroup membership was added to the PCE risk score, the FMM method with three clusters had the greatest improvement in model discrimination (ESM Table 3). Similar to the derivation cohort, the FMM method was the top-performing phenomapping method in the external validation cohort (Look AHEAD) (ESM Table 4).

Using the FMM phenogroups derived in the derivation cohort, we developed an MLR classifier to predict phenogroup membership for individuals (ESM Table 5). Using the MLR classifier, each participant in the validation cohort was successfully matched into one of the three previously defined clusters. We observed high agreement between the MLR classifier and phenomapping in the Look AHEAD dataset (Cohen’s κ = 0.91 [95% CI 0.88, 0.93]) and a minimum of ten non-missing covariates was needed to achieve an 80% accurate prediction in the MLR classifier (ESM Fig. 3).

Prognostic utility of the FMM phenogroups in improving atherosclerotic cardiovascular disease risk prediction

Over 9.1 years of follow-up, 789 (12.2%) and 963 (14.9%) participants had a primary outcome and all-cause mortality event, respectively. Phenogroup membership from the FMM clustering method was significantly associated with risk of the primary composite outcome and all-cause mortality on follow-up, with a graded decrease in risk from phenogroup 1 (highest risk) to phenogroup 3 (lowest risk) (ESM Fig. 4). The mean PPV of the SNDR risk score was 0.16 (95% CI 0.14, 0.19). Addition of the FMM phenogroup significantly increased the mean PPV (PPV = 0.21 [95% CI 0.20, 0.23]). Similarly, in decision curve analysis, phenogroup membership also improved the prognostic utility of both risk scores (ESM Fig. 5). In the Look AHEAD external validation cohort, addition of the FMM phenogroup to the SNDR risk score significantly increased the mean PPV (increase from 0.14 [0.12, 1.17] to 0.16 [0.14, 0.0.19]). A similar improvement was observed in decision curve analysis (ESM Fig. 5).

Characterisation of FMM-based phenogroups

Baseline characteristics of study participants across the three phenogroups as determined by the FMM method are shown in Tables 1 and 2. Compared with the other groups, phenogroup 1 participants (n = 663, 10.3%) were more likely to be men, to be of self-reported black race and to have higher burden of traditional cardiovascular risk factors, including higher BP, smoking prevalence, HbA1c, fasting blood glucose, low-density lipoprotein cholesterol and triacylglycerol levels. Phenogroup 2 (n = 2388, 36.9%) had participants with intermediate burden of CVD risk factors. In contrast, phenogroup 3 (n = 3415, 52.8%) participants had the lowest burden of CVD risk factors, with lower BP, low-density lipoprotein cholesterol levels, HbA1c and smoking prevalence. The prevalence of type 2 diabetes-related complications such as foot ulceration, lower extremity amputations, proteinuria and eye surgery history was highest in phenogroup 1 and lowest in phenogroup 2 members. The pattern of baseline characteristics was similar when phenomapping separately in men and women (ESM Table 6). The pattern of baseline characteristics across the FMM-based phenogroups in the Look AHEAD and BARI 2D validation cohorts was mostly similar to that observed in the derivation cohort (ESM Tables 7, 8).

Table 1 Baseline demographic and clinical characteristics of the study participants stratified by FMM-based phenogroup in the derivation cohort (ACCORD)
Table 2 Baseline ECG, laboratory and diabetes complication characteristics of the study participants stratified by FMM-based phenogroup in the derivation cohort (ACCORD)

Effect modification by FMM-based phenogroups for cardiovascular benefits associated with different therapies in type 2 diabetes

The proportion of participants randomised to intensive glycaemic control was similar across the three phenogroups in the derivation cohort (Table 1). We observed a significant interaction between randomisation to intensive glycaemic control (vs standard) and phenogroup for the risk of the primary composite outcome (p-interaction = 0.042) (Fig. 2). Intensive glycaemic control was associated with lower risk of the primary composite outcome in phenogroup 3 (adjusted HR [aHR] 0.65; 95% CI 0.51, 0.83; p value <0.001) and there was no significant association with risk in phenogroup 1 (aHR 1.25; 95% CI 0.91, 1.77; p value = 0.19). A similar pattern of association was noted with randomisation to combination lipid therapy with a significant treatment interaction by phenogroup membership (p-interaction < 0.001). Specifically, combination lipid therapy was associated with significantly lower risk in phenogroup 3 (aHR 0.71; 95% CI 0.52, 0.98; p value = 0.04) and higher risk in phenogroup 1 (aHR 1.49; 95% CI 0.98, 2.24; p value = 0.06). There was no significant treatment effect of intensive BP control on risk of the primary composite outcome across the phenogroups (Fig. 2). Event rates and number of events prevented across treatment groups and phenogroups are shown in ESM Table 9. Among participants in phenogroup 1, the risk of all-cause mortality was numerically higher in the intensive vs standard glycaemic control groups (28.5% vs 25.8%, respectively; p value = 0.34). In contrast, the risk of all-cause mortality was lower in the intensive vs standard glycaemic control group in phenogroup 3 (8.9% vs 10.9%, respectively; p value = 0.05) (ESM Fig. 6).

Fig. 2
figure 2

Multivariable-adjusted associations of randomised treatment groups with risk of primary outcome events stratified by FMM-based phenogroups in the derivation (ACCORD) and validation (Look AHEAD and BARI 2D) cohorts. Cox models were adjusted for age, sex, race, education level, BMI, systolic BP, smoking status, alcohol use, serum creatinine, total cholesterol, serum HDL-cholesterol and diabetes duration. The primary outcome was a composite of fatal MI, non-fatal MI or unstable angina in the ACCORD trial cohort. In the Look AHEAD trial cohort, the primary outcome included fatal or non-fatal MI plus hospitalisation for angina. In BARI 2D, the primary outcome was a composite of fatal MI or non-fatal MI

In the Look AHEAD external validation cohort, we observed a significant interaction between intensive lifestyle intervention (ILI) therapy (vs standard) and phenogroup for the risk of the primary composite outcome (p-interaction = 0.002) (Fig. 2). Specifically, ILI was associated with a lower risk of the primary composite outcome in phenogroup 3 (aHR 0.77; 95% CI 0.61, 0.98; p value = 0.03) and higher risk in phenogroup 1 (aHR 1.58; 95% CI 1.10, 2.47; p value = 0.03).

In the BARI 2D external validation cohort, we similarly observed a significant interaction between early coronary revascularisation (vs medical therapy) and phenogroup for the risk of the primary composite outcome (p-interaction = 0.003) (Fig. 2). Early revascularisation was associated with lower risk of the primary composite outcome in phenogroup 3 (aHR 0.64; 95% CI 0.47, 0.91; p value = 0.008) and higher risk in phenogroup 1 (aHR 1.90; 95% CI 1.07, 3.35; p value = 0.03).

Association of FMM-based phenogroups with SH events

In the derivation cohort (ACCORD trial), there were a total of 1478 SH events over a median follow-up of 8.8 (IQR = 5.7–10.1) years. The mean number of cumulative SH events per participant was highest in phenogroup 1 (0.32 events) followed by phenogroup 2 (0.26 events) and phenogroup 3 (0.19 events) (p value = 0.0004) (Fig. 3a). In adjusted analysis, the incidence rate ratio of an SH event was significantly higher in phenogroups 1 and 2 as compared with phenogroup 3 (phenogroup 1: 0.41; 95% CI 0.13, 0.69; p value = 0.005; and phenogroup 2: 0.22; 95% CI 0.04, 0.41; p value = 0.03). There was no significant interaction between phenogroup membership and intensive glycaemic control for the risk of SH events (p-interaction = 0.68). Additional adjustment for baseline glucose-lowering medications did not attenuate the associations between phenogroups and SH events.

Fig. 3
figure 3

The mean cumulative number of (a) hypoglycaemic events and (b) visits with <80% medication adherence across FMM-based phenogroups in the derivation cohort (ACCORD). The error bars represent the upper limit of the 95% CI

In the Look AHEAD validation cohort, there were a total of 637 SH events over a median follow-up of 9.5 (IQR = 8.8–10.3) years. Similar to the derivation cohort, the mean cumulative number of SH events per participant was highest in phenogroup 1 (0.21 events) followed by phenogroup 2 (0.17 events) and phenogroup 3 (0.15 events) (p value = 0.03). In adjusted analysis, the incidence ratio of an SH event was significantly higher in phenogroup 1 (0.33; 95% CI 0.11, 0.54; p value = 0.002) as compared with phenogroup 3. There was no significant difference in SH events between phenogroups 2 and 3 and no significant interaction between phenogroup membership and randomisation to ILI for the risk of SH events (p-interaction = 0.40).

Association of FMM-based phenogroups with treatment non-adherence

In the derivation cohort, phenogroup 1 participants also had the highest rates of medication non-adherence, with a mean of 2.09 visits during follow-up with <80% medication adherence compared with 1.09 and 1.08 visits for phenogroups 2 and 3, respectively (p value = 0.008) (Fig. 3b). In adjusted analysis, the incidence rate ratio of medication non-adherence was significantly higher in phenogroup 1 compared with phenogroup 3 (0.35; 95% CI 0.10, 0.60) but not compared with phenogroup 2 (0.05; 95% CI –0.12, 0.29; p value = 0.65). Among phenogroup 1 participants randomised to either intensive glycaemic control or combination lipid therapy, medication non-adherence was significantly higher in individuals with a primary composite outcome event on follow-up compared with those without (ESM Table 10).

In the Look AHEAD validation cohort, similar to the derivation cohort, phenogroup 1 had the highest rates of treatment non-adherence with a mean of 26.5% participants with <80% of attended intervention clinic visits compared with 17.9% and 16.4% in phenogroups 2 and 3, respectively (p value < 0.001). Similar to the derivation cohort, among individuals randomised to ILI in phenogroup 1, treatment non-adherence was significantly higher in individuals with a primary composite outcome event compared with those without (ESM Table 10).

Discussion

In this study, several important findings were observed. First, in a cohort of patients with type 2 diabetes enriched for CVD risk, of the four phenomapping approaches tested, the FMM approach was noted to be the most optimal clustering strategy to identify unique phenogroups with distinct cardiometabolic and risk factor profiles. Second, the FMM-based phenogroups had distinct clinical profiles and downstream risk of major CHD events across both the derivation and external validation cohorts. Third, there was a significant interaction between the FMM-based phenogroup and treatment effect of different therapeutic approaches such as intensive glycaemic control, combination lipid therapy, ILI and early coronary revascularisation for the risk of a major CHD event. Finally, we have developed an online, publicly accessible tool for researchers to determine a patient’s phenogroup membership, available at www.cvriskscores.com.

The present study directly compares the performance of multiple clustering strategies for identifying subgroups in a large cohort of patients with type 2 diabetes enriched for CVD risk. A prior study by Ahlqvist et al. identified clusters of individuals with type 2 diabetes that had distinct characteristics and risk of diabetes complications [29]. The clustering technique applied by Ahlqvist et al., and later re-examined by Dennis et al. [30], evaluated phenogroups by focusing on specific pathophysiological abnormalities such as insulin deficiency, ageing and insulin resistance. Similarly, Li et al. used a novel method of topological analysis integrating data from the electronic medical record to identify three subgroups within type 2 diabetes using a proprietary, third-party software platform to perform the analysis [31]. Our study adds to the existing literature by developing and validating a phenomapping approach by leveraging machine-learning-based clustering methods that take into consideration mixed-type data and multiple interactions between covariates. Finally, we were able to compare different clustering methods, including the GMM method that utilised continuous data, LCA that analysed categorical data, and the FMM method and PCA that evaluated mixed data, to identify the most optimal phenomapping approach [32,33,34]. Among these clustering techniques, novel implementation of FMM phenomapping demonstrated the best performance according to BIC, Dunn index and model discrimination in an intermediate CVD risk cohort (ACCORD). Furthermore, FMM phenomapping performed well among participants with type 2 diabetes across different strata of baseline risk ranging from the lower-risk Look AHEAD trial cohort to the higher-risk BARI 2D trial cohort. These findings highlight the generalisability of our phenomapping approach across the spectrum of baseline risk.

Each FMM-based phenogroup had a unique profile of cardiometabolic factors, history of complications and risk of CVD, hypoglycaemic events and medication non-adherence. Phenogroup 1 had the greatest burden of comorbidities, cardiometabolic risk factors and insulin use and the highest risk of adverse events on follow-up. Phenogroup 2 participants had the lowest burden of microvascular complications and had an intermediate risk for CVD and hypoglycaemic events compared with the other subgroups. In contrast, phenogroup 3 participants had a more favourable risk factor profile with the lowest risk of adverse CVD and hypoglycaemic events. The proportion of participants belonging to a specific phenogroup varied across cohorts and was likely related to the baseline CVD risk as well as to the heterogeneity in the population. We further observed that markers of diabetes complications, such as the neuropathy score, filament sensation and urine albumin/creatinine ratio, were the most important covariates for determining phenogroup membership. Our findings add to the existing literature that suggests peripheral neuropathy is associated with an increased risk of adverse cardiovascular events. Similar to our study, Brownrigg et al [35] observed that addition of peripheral neuropathy to existing risk scores significantly improved model discrimination and reclassification. It is possible that the observed benefit in PCE and SNDR model performance with addition of phenogroup membership, as seen in our study, may be due to the addition of new risk factors. Furthermore, our findings support prior studies that identified albuminuria as a key risk factor for CVD among adults with diabetes [36].

We observed substantial heterogeneity in treatment effects of intensive glycaemic control and combination lipid therapy across different phenogroups. Specifically, patients in the lowest-risk phenogroup benefited from intensive glycaemic control and combination lipid therapy in the ACCORD trial dataset. In contrast, participants in the highest-risk group showed potential harm with these interventions. A similar treatment interaction was also observed in the validation cohorts, with greatest benefit of ILI and early coronary revascularisation noted in the low-risk phenogroup 3 of the Look AHEAD and BARI 2D cohorts, respectively. This pattern of lack of benefit from these interventions in the highest-risk group is consistent with prior observations and may be related to presence of a more advanced disease state in which intensive treatment may not modify risk [11, 37, 38]. It is plausible that the trajectory of CVD development is more modifiable in patients with earlier stages of type 2 diabetes with lower burden of associated complications and hypoglycaemic events. This is consistent with prior observations from the United Kingdom Prospective Diabetes Study [39]. Similarly, we also observed that participants in the phenogroup with highest (phenogroup 1) vs lowest (phenogroup 3) risk had higher burden of hypoglycaemic events and the lowest adherence to prescribed medications, which may contribute to the lack of benefit from the tested therapies.

Our study findings have important clinical implications. The phenomapping strategy developed and validated in the present study may help identify distinct subgroups of participants with ‘high-risk’ vs ‘low-risk’ profiles with distinct response to preventive therapies. Such an approach can have important implications for management of patients with type 2 diabetes with regards to resource allocation, downstream follow-up and implementation of targeted preventive strategies. The patients with low-risk profiles may benefit from the therapies tested in the present study including intensive glycaemic control, combination lipid therapy, ILI and early revascularisation. In contrast, the high-risk patients identified by our phenomapping strategy need to be evaluated with newer type 2 diabetes therapies such as SGLT-2 inhibitors, which have been shown to be effective even at the higher end of the baseline CVD risk spectrum [40, 41].

The study has several limitations. First, the phenogroups identified in the present study were derived in an intermediate-risk cohort (ACCORD) that may not be generalisable to most individuals with type 2 diabetes or a more contemporary population on newer therapies such as SGLT-2 inhibitors and GLP-1 receptor agonists. However, we externally validated the phenogroups in low- (Look AHEAD) and high-CVD-risk (BARI 2D) populations and demonstrated consistent patterns of risk and treatment effects. Second, the present study was a post hoc analysis of the ACCORD trial that began enrolment over a decade ago, and prognostic biomarkers, such as natriuretic peptides and high-sensitivity cardiac troponin, and echocardiographic measures of cardiac structure and function were not available for the present analysis [42]. Inclusion of additional phenotypic measures may lead to clustering patterns that differ or improve upon those observed here. Finally, considering the lack of benefit of each of the interventions in the ACCORD, Look AHEAD and BARI 2D trials, the significant treatment interactions observed in the present study should be considered hypothesis generating. Future studies are needed to further validate these observations and test potential targeted interventions in prospective, randomised controlled studies.

In conclusion, among patients with type 2 diabetes, FMM clustering of mixed data was the optimal phenomapping method to identify unique subgroups of patients with distinct cardiovascular risk trajectories and response to established therapeutic approaches. Such semi-supervised clustering techniques may inform development of personalised treatment approaches for patients with type 2 diabetes guided by their baseline risk and unique clinical characteristics.