FormalPara Key Summary Points

Current understanding states that CVOTs are crucial for assessing cardiovascular efficacy of diabetes treatments

The research challenge is that many CVOTs end early because of less precise background cardiovascular risk estimation

Using placebo groups from past RCTs, we improved the accuracy of background cardiovascular risk estimation

Modified background risk for the SOUL trial predicts early termination at 3.78 years and a significant reduction in MACE compared to placebo through predictive modeling

This strategy offers a more accurate and cost-effective blueprint for conducting future cardiovascular outcome trials

Introduction

Type 2 diabetes (T2D) is one of the most common noncommunicable diseases and is strongly associated with cardiovascular disease and death [1]. It is also one of the most common causes of end-stage renal disease [2]. Treatment with RAAS blockade, statins, anti-platelets, and SGLT2 inhibitors is recommended by all major associations to reduce major adverse cardiovascular and renal outcomes, and these drugs are considered the standard of care [3]. Although renal and cardiac outcomes have improved over time with these medications, a therapeutic gap persists; renal and cardiovascular outcomes remain suboptimal in patients with T2D.

The LEADER trial of liraglutide, conducted on 9340 patients, was the first trial in which a GLP1RA was used to document an improvement in MACE in patients with T2D and established ASCVD [4]. Since then, other trials of multiple other GLP1RAs have demonstrated similar improvements in MACE and a probable improvement in renal outcomes [5].

Semaglutide, the newest GLP1RA, is widely used as the most powerful and efficacious GLP1RA [6]. The SUSTAIN 6 trial, conducted on 3297 patients and followed up for 104 weeks, compared once weekly injections of semaglutide with placebo in patients with T2D and established cardiovascular disease (previous cardiovascular, cerebrovascular or peripheral vascular disease), chronic heart failure (New York Heart Association class II or III) or chronic kidney disease ≥ stage 3 or age ≥ 60 years with at least one cardiovascular risk factor [7]. The primary outcome occurred in 6.6% of patients in the semaglutide group and in 8.9% of patients in the placebo group (hazard ratio, 0.74; 95% confidence interval [CI], 0.58–0.95; P < 0.001 for noninferiority). This difference was driven primarily by a reduction in nonfatal stroke, which occurred in 1.6% of patients in the semaglutide arm compared to 2.7% in the placebo group (hazard ratio, 0.61; 95% CI 0.38–0.99; P = 0.04). The rates of new or worsening nephropathy were lower in the semaglutide group. The recently published FLOW trial, conducted on 3533 patients and followed up for 177 weeks, comparing injectable once weekly semaglutide with placebo, revealed that semaglutide reduced the risk of clinically important kidney outcomes and death from cardiovascular causes in patients with type 2 diabetes and chronic kidney disease [8].

Oral semaglutide, the first and only orally available GLP1RA, is considered a major scientific breakthrough because of its unique mechanism and mode of action [9]. Naturally, the oral route makes it more attractive than its parenteral counterparts. The PIONEER trial program has studied the effects of oral semaglutide in patients with T2D in detail [10]. PIONEER 6 was conducted on 3183 patients and followed up for 69 weeks to assess the cardiovascular outcomes of once daily oral semaglutide in an event-driven, randomized, double-blind, placebo-controlled trial involving patients at high cardiovascular risk (aged ≥ 50 years with established cardiovascular or chronic kidney disease or aged ≥ 60 years with cardiovascular risk factors only) and revealed that oral semaglutide was safe in patients with T2D [11].

However, proof of its positive effect on cardiovascular and renal outcomes in patients with T2D remains elusive. The SOUL trial aims to provide this evidence [12].

In anticipation of this publication, a mathematical model was developed, and the results of the SOUL trial are predicted by this study. The search was conducted using the PICO search strategy [13].

P = Type 2 diabetes patients with high cardiovascular risk.

I = Semaglutide.

C = Placebo.

O = Reduction in composite CV events (MACE) and their individual components. In addition, a prediction analysis will be performed to predict the effective duration and outcome benefits in the ongoing SOUL study, aiming to suggest a means of reducing the cost of studies being designed for the future.

Methods

Study Design and Participants

The Semaglutide cardiOvascular oUtcomes (SOUL) trial is a multicenter, double-blind, placebo-controlled, randomized controlled trial (RCT). The primary aim of this study was to evaluate the reduction in major adverse cardiac events (MACE) associated with oral semaglutide compared with placebo in patients with type 2 diabetes (T2D) and established cardiovascular disease. Based on a 90% power calculation and a type 1 error of 0.025, the sample size was estimated to be 9642 participants. The plan is to follow the patients for 5 years and 5 weeks to target the first 1225 MACE [12]. This meta-analysis was conducted according to the recommendations of the PRISMA statement and registered with the International Platform of Registered.

Systematic Review and Meta-Analysis Protocols (INPLASY) on 17 June 2024 with INPLASY registration number INPLASY202460061 [14].

Systematic Literature Search

A comprehensive literature search was conducted in the Cochrane Library database to identify relevant RCTs with semaglutide and cardiovascular outcomes as the primary endpoints. The key search terms included “Type 2 diabetes” [MeSH], “Semaglutide,” and “Cardiovascular outcomes trial.” The date of publication was not limited. However, only articles published in English were selected. Additionally, manual searches for relevant citations were conducted using the Google Scholar search engine. The search strategy followed the PRISMA guidelines.

This article is based on previously conducted studies and does not contain any new studies with human participants or animals performed by any of the authors.

Study Selection and Data Extraction

The preliminary search yielded 2556 citations. After removing duplicates and using trials as a filter, 145 records were available for additional selection. After removing articles lacking full text, articles that were not relevant to the aim, and those that did not conform to the predefined inclusion criteria, three articles remained for analysis (Fig. 1). The data were extracted by two authors (SG and BS), and eligibility based on the predetermined inclusion criteria was assessed by both. Any discrepancy was addressed based on consensus.

Fig. 1
figure 1

Study selection process

The following predetermined inclusion criteria were used:

  • Age ≥ 18 years.

  • Patients with T2D.

  • The intervention arm included semaglutide.

  • Placebo was used as the comparator arm.

  • MACE was used as the primary endpoint.

  • The minimum duration of follow-up was 12 months.

Risk of Bias Assessment

The Cochrane risk of bias tool was used to assess the quality of the articles included in the analysis (Supplementary appendix A). The assessment was performed independently by both the authors, and disagreements were resolved through consensus.

Meta-Analysis of Prior Trials

To develop a robust predictive model for the SOUL trial, historical data on semaglutide and its impact on MACE are needed. These data were obtained by performing a meta-analysis of hazard ratios (HRs) from three previous RCTs (SUSTAIN 6, PIONEER 6, and FLOW). The relevant HRs and their 95% confidence intervals (95% CIs) were extracted from these studies. The ‘meta’ package in R was used to estimate the pooled HR.

Development of the Predictive Model

The predictive model for the SOUL trial was developed using the pooled HR estimate along with the preadjudicated assumptions mentioned in the trial protocol.

These assumptions include the following:

  • The annualized placebo MACE rate was 3.5%.

  • The true HR for the treatment effect was 0.83.

  • Uniform recruitment was allowed for 18 months.

  • The annual loss to follow-up rate was 1%.

  • The durations of the trials were 5 years and 5 weeks.

Effective Sample Size Estimation

The effective sample size was calculated using the following formula:

Effective sample size = N × (1 − annual loss to follow-up) total years, where N is the number of participants, and total years indicate the trial duration.

Event Rate Estimation

The following steps were used to calculate the event rates in the intervention and placebo groups:

  • The effective sample size was calculated for 1 year.

  • The annual number of events in the placebo group was estimated based on the placebo MACE rate.

  • The annual number of events in the intervention group was estimated based on the pooled HR from the meta-analysis.

  • The total annual number of events was calculated by summing events from both abovementioned groups.

Premature Termination Analysis

Because the desired number of events could be achieved well before the 5 years and 5 weeks of the trial had been completed, we calculated the time needed to reach the predetermined number of 1225 events using the following strategy:

  • The total number of events per year was estimated.

  • The total number of events was divided by the annual event rate to determine the estimated duration required to reach 1225 events.

Statistical Analysis

Statistical analysis was conducted using R software (R version 4.2.3 (2023-03-15)). The ‘metagen’ function from the ‘meta’ package was used to perform the meta-analysis, and the results are reported as logHRs (log-transformed HRs) and standard errors (SEs). The predictability and accuracy of the predictive model were assessed using a sensitivity analysis, varying key factors such as annual loss to follow-up and the placebo MACE rate. The statistical significance of the difference between the intervention and placebo groups was analyzed using the chi-square test. The dataset and codes used to conduct the analyses can be obtained from the corresponding author upon reasonable request.

Results

Baseline Characteristics of the Studies

This study analyzed 10,013 patients from three CVOTs and compared the benefit of semaglutide to that of a placebo in patients with T2D and established CVD or high CV risk. The semaglutide group included 5006 patients, and the placebo group included 5007 patients. The mean age ranged between 64.7 ± 7.1 and 66.6 ± 9.0 years, and the cohort was predominantly male. The patients in this analysis had a long duration of diabetes, with a mean baseline weight ranging between 89.5 ± 19.8 and 87 ± 1.5 kg, a mean HbA1c ranging between 7.8 ± 1.3% (61.7 ± 14.2 mmol/mol) and 8.7 ± 1.5% (70.9 ± 16.4 mmol/mol), and a duration of follow-up ranging between 69 and 177 weeks (Table 1). The quality of the individual studies was assessed using the Cochrane risk of bias algorithm, and publication bias was assessed using funnel plot asymmetry. The latter was assessed qualitatively when the number of studies was < 10 (Supplementary Appendix B).

Table 1 Baseline characteristics of the studies included in the meta-analysis

Pooled Meta-Analysis

The pooled analysis showed a significant reduction in the primary endpoint (MACE), with an HR of 0.79 (95% CI 0.69–0.91). The primary endpoint was predominantly driven by a reduction in CV death (HR 0.73, 95% CI 0.54–0.99) (Fig. 2).

Fig. 2
figure 2

Meta-analysis of the effect of semaglutide versus placebo on a major adverse cardiac events (MACE), b non-fatal myocardial infarction (NFMI), c non-fatal stroke (NFS), and d cardiovascular (CV) death

Development of the Predictive Model

In addition to the pooled HR from the meta-analysis, estimation of the annual event rates was the principal component needed to construct the model. The following formula was used to calculate the effective annualized sample size:

$${\text{Effective sample size}}\, = \,{9642} \times \left( {{1} - 0.0{1}} \right)^{{{5} + {5}/{52}}} = {9}0{61}.$$

The expected annual numbers of events in the intervention and placebo groups were estimated to be 147 and 177, respectively. The total annual expected number of events was 324.

Predictive Model for SOUL

Based on the duration of the study mentioned in the protocol (5 years and 5 months) as well as the predetermined assumptions, the SOUL study was estimated to accumulate a total number of 1464 events, with 817 events in the semaglutide group and 647 in the placebo group. The chi-squared test was used to determine whether this difference in events between the two groups was statistically significant. The chi-squared statistic was 19.74 (df = 1), with a df value for significance < 0.001.

Premature Termination Analysis

Because most RCTs terminate trials once the desired number of events has been achieved, we analyzed the duration needed to reach 1225 events. Based on the annualized event rates in the semaglutide and placebo groups, the SOUL trial was calculated to accumulate 1225 events at the end of 3.78 years, resulting in premature termination (Fig. 3). The predicted numbers of MACE events in the semaglutide and placebo groups were 578 and 647, respectively. The chi-squared test (3.88, df = 1) indicated a significant difference between the two groups (p = 0.04).

Fig. 3
figure 3

Premature trial termination timeline based on the SOUL trial protocol and meta-analysis of historical data

Sensitivity Analysis

Sensitivity analyses were conducted to evaluate the robustness of the predictive model by varying key parameters, such as the annual loss to follow-up rate and the placebo MACE rate. The results of these analyses confirmed the stability of the model, with the estimated duration to reach 1225 events remaining within a reasonable range under different scenarios.

Subgroup Predictive Analysis

The predicted number of events for NFMI, NFS, and CV death required additional calculation to obtain a historical baseline in the placebo arm for each of these variables. This calculation was based on the total number of MACE events reported in the three studies and using the proportion of each on the predicted MACE (1225 events). The predicted numbers of events are detailed in Table 2.

Table 2 Predicted number of events based on the assumptions from the SOUL trial protocol and the pooled meta-analysis

A chi-squared test was performed to identify whether the difference between the two groups was statistically significant for each of the individual components. The difference was not significant for NFMI (χ2 2.22, df 1, p 0.14) or NFS (χ2 1.78, df 1, p 0.18). However, the difference was significant for the CV death endpoint (χ2 13.66, df 1, p 0.0002).

Discussion

Individual patient-level data combined with SUSTAIN 6 (once weekly injectable semaglutide versus placebo) and PIONEER 6 (oral semaglutide versus placebo) were collected to assess MACE and HF in patients with and without established CV disease and/or chronic kidney disease, prior MI or stroke, and prior HF. This post hoc analysis of both studies revealed that semaglutide had consistent effects on MACE across varying CV risk groups. This benefit was driven mainly by a significant reduction in all-cause stroke caused by semaglutide. No effect of semaglutide on MACE was observed in subjects with prior HF [15].

A pooled analysis of these data also revealed that semaglutide significantly slowed the rate of kidney function decline, irrespective of kidney function at baseline [16]. This finding has been convincingly reiterated by FLOW, a randomized controlled trial that has shown significant improvement in renal and cardiovascular outcomes in patients with established CKD [8].

This meta-analysis of all CVOTs conducted with oral and parenteral semaglutide to date revealed that the use of semaglutide as a molecule resulted in a significant 21% reduction in MACE, which was driven by reduction in CV death (HR 0.79, 95% CI 0.61–0.90).

In the past, several trials were terminated prematurely because the required number of events was accrued earlier than the estimated duration. These studies include the CARMELINA, CAROLINA, CREDENCE, EMPAREG, DECLARE TIMI-58, LEADER, SUSTAIN 6, PIONEER 6, REWIND, EXSCEL, ELIXA, DAPA-CKD, and EMPA-kidney trials [7, 11, 22–27]. The estimated duration of these studies and the required events were based on the CV risk in the population determined by published prevalence or cross-sectional studies. While earlier trials did not have dedicated RCTs at their disposal, studies such as the SOUL trial had 3 prior RCTs to refer to, as far as baseline CV risk estimation was concerned. In fact, both PIONEER 6 and SUSTAIN 6 were prematurely terminated, reflecting a lack of accurate estimation of the time needed to attain the required number of CV events.

With the availability of data from three CVOTs with semaglutide, this meta-analysis provides a platform for designing a mathematical model to predict the outcomes of the ongoing SOUL trial, a CVOT comparing oral semaglutide with placebo.

Because the placebo arms in all the CVOTs now reflect the background CV risk in the population, the study duration required to obtain the desired number of events can be estimated far more accurately based on power calculations.

The calculated duration of the SOUL trial was 5 years and 5 weeks based on a baseline annualized CV risk of 3.5%. However, using the background MACE rate in the population in the placebo arm of the PIONEER 6, SUSTAIN 6, and FLOW trials, the duration required to reach 1225 events can be accurately estimated to be 3.78 years.

The duration required to obtain the desired number of events can be estimated from previous CVOTs, and the number of probable events in the placebo and intervention arms can both be estimated while predicting the direction of the effect size (hazard ratio). This analysis predicts that the required number of events (1225) will have occurred after 3.78 years. This analysis also predicted that the estimated number of events (MACE) in the semaglutide and placebo groups would be 647 and 578, respectively.

This difference in events was significant at 0.05, indicating that we can expect superiority from the primary end point of the SOUL trial.

The annualized MACE rate in the pooled placebo arm was 5.66% (per our calculation), in contrast to the 3.5% reported in the SOUL trial. This difference explains why the trial will likely be terminated early, and a more objective background risk estimation needs to be performed (if available) compared to heterogeneous prevalence studies.

This mathematical prediction will be subject to scrutiny once the original SOUL trial has been published. However, once it is proven to be accurate, this mathematical prediction may be used to calculate not only the power but also the duration of such studies. In addition to making such RCTs more practical, they will significantly reduce the cost of conducting such expensive projects, providing relief for both industry and researchers.

Limitations

The principal limitations are related to the fact that we analyzed means from the RCTs instead of individual patient-related data. The number of studies included in the present study was too low to conclusively assess heterogeneity. In addition, the annual CV event rate was assumed to be constant but may have been affected by the COVID-19 pandemic.

Strengths

The main strength of this analysis is related to the pooling of the placebo CV event rates from the three trials, providing us with a robust backdrop to compare the intervention-related outcomes. Traditionally, the background event rates in the population are based on cross-sectional or prevalence studies. In addition, the selection of a conservative random effect model would increase the credibility of the outcomes.

Conclusion

This meta-analysis clearly demonstrated that semaglutide reduces MACEs primarily by a reduction in CV-related deaths. More interestingly, we predict that the SOUL trial will be prematurely terminated after 3.78 years and that oral semaglutide will significantly reduce MACEs. If proven accurate, this model has great potential for improving structuring and, more importantly, reducing expenses while planning future CVOTs.