Introduction

Metabolic surgery improves mortality and morbidity outcomes in patients with obesity [1] and has been shown to be cost-effective [2, 3]. A Cochrane systematic review found metabolic surgery was more effective than non-surgical obesity treatment [4]. Data from the Swedish Obese Subjects Study found a 25% weight loss at 10 years in those undergoing gastric bypass surgery with a 24% decrease in mortality compared with the control group [1]. Metabolic surgery improves functional impairment and cardiovascular disease and reduces cancer risk and mortality related to type 2 diabetes (T2DM) [5,6,7,8]. Furthermore, metabolic surgery has been shown in a number of trials to induce remission of T2DM and has been endorsed by 200 diabetes organisations worldwide as a standard of care [9]. People living with obesity have been shown to have higher intensive care admissions and mortality during the recent coronavirus pandemic [10, 11] which has brought focus to the delivery of metabolic surgical services [12, 13]. However, studies demonstrate only a small proportion of the eligible population receive surgical intervention for obesity-related metabolic disease [14, 15].

Metabolic surgery is part of a well-established treatment intensification strategy for obesity-related comorbidities [16, 17], but the role of individual operative approaches is less certain. The Fourth Global International Federation for the Surgery of Obesity (IFSO) Registry in 2018 reported that of the 190,177 primary metabolic operations, sleeve gastrectomy (SG) was the most common (87,467; 46.0%) followed by Roux-en-Y gastric bypass (RYGB) (72,645; 38.2%) and one-anastomosis gastric bypass (OAGB) (14,516; 7.6%) [18]. Previously published standard pairwise meta-analyses have produced inconsistent findings [19,20,21,22]. With the evolution of procedures and the increasing use and understanding of one-anastomosis gastric bypass [23], an updated comparative efficacy assessment is required to aid healthcare decision making.

This study employed network meta-analysis (NMA) to compare weight loss, type 2 diabetes remission and perioperative complications in adults between Roux-en-Y gastric bypass, sleeve gastrectomy and one-anastomosis gastric bypass.

Methods

A systematic review was undertaken to identify all published randomised controlled trials (RCTs) evaluating Roux-en-Y gastric bypass, sleeve gastrectomy and one-anastomosis gastric bypass. The review was conducted in line with the PRISMA statement [24] and PRISMA-Network meta-analysis extension [25]. The review was registered in the PROSPERO database (CRD42020199779).

Search Strategy

Medline, Embase, the Cochrane Register of Clinical Trials and the International Standard Randomised Controlled Trial Number (ISRCTN) Registry (controlled-trials.com) were searched from January 2000 until August 2020 using a search strategy based on excess weight loss, remission of type 2 diabetes and perioperative complications for (1) Roux-en-Y gastric bypass (RYGB), (2) sleeve gastrectomy (SG) or (3) one-anastomosis gastric bypass (OAGB) (Appendix). Randomised controlled trials evaluating at least two of the techniques in adult patients for the treatment of obesity or related comorbidities were included. Trials recruiting only children and those not reporting any of the primary outcomes of interest were excluded. No language restrictions were employed. Abstracts and conference proceedings were not included as they provide insufficient detail. Two authors trained in systematic review techniques (ACC & AA) completed the search strategy and screening, with disagreements resolved through consensus of the wider study team.

Outcome Measures

All RCTS including the primary outcomes of interest, including percentage excess weight loss, remission of diabetes (trialist-defined) and perioperative complications (trialist-defined), were included.

Data Extraction

Arm level data were extracted from the included studies, from communication with authors or from previous meta-analyses where the original study authors had been contacted for data verification. After a calibration exercise, three authors (ACC, AA & AF) independently performed data extraction. Disagreement at all stages was resolved by discussion with the senior author (KM).

Quality Assessment

Potential biases were identified using the Cochrane risk of bias tool [26]: sequence generation, allocation concealment, blinding, if the outcomes reported were prespecified, completeness of outcome data and other potential sources of bias. Three authors (ACC, AA & AF) independently performed the quality assessment. Disagreement was resolved by discussion with the wider study team.

Statistical Methods

Random-effects network meta-analysis using the frequentist approach was applied to synthesis evidence for the primary outcomes. Evidence was summarised in a network map for each outcome. A common within network estimate for heterogeneity was estimated with the restricted maximum likelihood method [27, 28]. Consistency between direct and indirect evidence or between studies involving different sets of treatments for the same comparison was tested using the design by treatment interaction model [29]. Inconsistency was assessed using the node-splitting method [30, 31].

Where outcomes demonstrated statistically significant differences between procedures, the ranking probabilities of each metabolic surgical procedure (RYGB was the reference) were calculated and presented as rankograms. The ranking (surface under the cumulative ranking curve, SUCRA) scores for each treatment for the most commonly reported effectiveness (percentage excess weight loss and remission of diabetes) and adverse (perioperative complication) outcomes were then combined by time point into clustered ranking plots, to enable a simultaneous comparison of benefits and risks. Sensitivity analyses were undertaken excluding trials where the risk of bias was considered to be high due to poor allocation concealment or randomisation processes.

The assessment of statistical heterogeneity in the entire network was based on the magnitude of the heterogeneity variance parameter (τ) estimated from the NMA models, which is the estimated standard deviation treatment effects. We applied a 0.5 zero-cell correction before meta-analysis. To investigate potential small-study effects, a comparison-adjusted funnel plot was constructed. Statistical analysis was performed in STATA 16.0 (StataCorp, College Station, TX, USA) using ‘mvmeta' and ‘network’ commands and other routines as reported [32,33,34]. All statistical tests were two sided, with the threshold of significance set at a p value of less than 0.05.

Role of the Funding Source

There was no funding source for this study. All authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Results

Following systematic searches in MEDLINE, EMBASE and trial registries, 2291 abstracts were identified, and following exclusions, 25 articles regarding 20 RCTs (1803 patients) were included [35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59] (Fig. 1). The characteristics of the included trials are shown in Table 1 and the risk-of-bias assessment shown in Fig. 2. The most commonly studied randomised comparison was between RYGB and SG (13 RCTs) for excess weight loss (EWL) (13 RCTs) and T2DM remission (12 RCTs) (Fig. 3). Whilst the overall risk of bias was low in the included articles, 8/25 had an unclear risk of bias related the randomisation process, and 4/25 had an unclear risk of bias regarding outcome reporting. No evidence of small-study effects on the network were identified on the comparison-adjusted funnel plot (Fig. 4).

Fig. 1
figure 1

PRISMA diagram for study inclusion

Table 1 Characteristics of the included studies
Fig. 2
figure 2

Cochrane risk of bias—2 tool assessments of quality of included studies

Fig. 3
figure 3

Network plot for eligible comparisons in perioperative strategies for a excess weight loss (1 year) and b T2DM remission. The size of the nodes is proportional to the number of patients (n) randomised to receive the treatment. The width of the lines is proportional to the number of trials comparing the connected treatment strategies

Fig. 4
figure 4

Comparison-adjusted funnel plot for the outcome of excess weight loss (1 year). Comparisons are made comparing newer treatments against older treatments (RYGB being the oldest, followed by SG and then OAGB). The horizontal axis is the study-specific effect sizes centred to the respective comparison-specific pooled effect size, and the vertical axis is the inverted standard error of the effect sizes as used in a standard funnel plot. The solid red line represents the null hypothesis that the study-specific effect sizes do not differ from the respective comparison-specific pooled treatment effect estimates. Symmetrical distribution of this funnel plot suggests there is no small-study effect in this comparison. SMD standardised mean difference

Excess Weight Loss

At 1 year, 13 trials including 1270 randomised patients were included in the comparison of percentage excess weight loss [35, 37, 39,40,41,42,43,44, 46, 52, 54, 57, 58] (Table 2). For 1-year EWL, 8 RCTs compared RYGB with SG, 2 RCTs compared RYGB v OAGB and 3 RCTS compared SG with OAGB. In standard meta-analysis, no approach was favoured over another for EWL at 1 year. On network meta-analysis, RYGB and SG had similar EWL at 1 year, but OAGB had greater EWL at 1 year than both RYGB and SG. In the NMA, there was no inconsistency seen between the direct and network effect sizes (χ2 = 8.56, 3 d.f., P = 0.0357). No evidence of statistical heterogeneity was seen in the network (τ2 < 0.001).

Table 2 Standard pairwise meta-analysis and network meta-analysis of excess weight loss following metabolic surgery

At 2 years, 4 RCTs compared RYGB with SG, 2 RCTs compared RYGB v OAGB and 2 RCTs compared SG with OAGB [39, 41, 43, 46, 47, 53, 57, 59] (Table 2). In standard meta-analysis, no approach was favoured over another for EWL at 2-year follow-up. On network meta-analysis, RYGB and SG had similar EWL at 2 years, whereas OAGB has slightly greater EWL at 2 years when compared to RYGB. SG and OAGB had similar EWL at 2 years. In the NMA, there was no inconsistency seen between the direct and network effect sizes (χ2 = 9.50, 3 d.f., P = 0.0187). No evidence of statistical heterogeneity was seen in the network (τ2 < 0.001).

At 3–5 years, 7 RCTs compared RYGB with SG, 1 RCT compared RYGB v OAGB and 2 RCTS compared SG with OAGB [39, 41, 46, 48, 55,56,57,58,59] (Table 2). In standard meta-analysis, RYGB had greater EWL than SG at between 3 and 5 years postoperatively. No differences in EWL at this time point were noted in standard meta-analysis of the two trials of SG and OAGB. On network meta-analysis, RYGB had greater EWL than SG at 3–5 years, whereas RYGB and OAGB had similar EWL at this time point. SG and OAGB had similar EWL at 3–5 years. In the NMA, there was no inconsistency seen between the direct and network effect sizes (χ2 = 10.16, 3 d.f., P = 0.0257). No evidence of statistical heterogeneity was seen in the network (τ2 < 0.001).

Diabetes Remission

Fourteen trials including 697 randomised patients with T2DM were included in the comparison of diabetes remission [36,37,38, 41, 43,44,45, 47, 48, 50, 52, 53, 55, 58, 60] (Table 3). Ten RCTS compared RYGB with SG, 2 RCTs compared RYGB v OAGB and 2 RCTS compared SG with OAGB. In standard meta-analysis, RYGB was 20% more likely to result in remission of type 2 diabetes compared to SG. There was no difference in T2DM remission between OAGB and either RYGB or SG. On network meta-analysis, both RYGB and OAGB had increased postoperative remission of type 2 diabetes. There were no differences between RYGB and OAGB for remission of type 2 diabetes. In the NMA, there was no inconsistency seen between the direct and network effect sizes (χ2 = 7.99, 3 d.f., P = 0.0397). No evidence of statistical heterogeneity was seen in the network (τ2 < 0.001).

Table 3 Standard pairwise meta-analysis and network meta-analysis of T2DM remission following metabolic surgery

Perioperative Complications

Thirteen RCTs including 1593 randomised patients were included in the comparison of perioperative complications [35, 39,40,41,42,43,44, 46,47,48, 52, 55,56,57,58] (Table 4). Nine RCTs compared RYGB with SG, 2 RCTs compared RYGB v OAGB and 2 RCTS compared SG with OAGB. In standard meta-analysis, RYGB was more likely to result in perioperative complications compared to both SG and OAGB. There was no difference in perioperative complications between OAGB and SG. Similarly, on network meta-analysis, RYGB had increased perioperative complications compared to both SG and OAGB, and there were no differences between SG and OAGB for the development of perioperative complications. In the NMA, there was no inconsistency seen between the direct and network effect sizes (χ2 = 1.92, 3 d.f., P = 0.0340). No evidence of statistical heterogeneity was seen in the network (τ2 < 0.001).

Table 4 Standard pairwise meta-analysis and network meta-analysis of perioperative complications following metabolic surgery

Cluster Plots of Benefits and Risk

SUCRA clustered ranking plots are shown in Fig. 5. For all years, OAGB consistently ranked highest in terms of maximising EWL and T2DM remission whilst reducing the risk of perioperative complications.

Fig. 5
figure 5

Clustered ranking plots of the metabolic surgery network based on cluster analysis of SUCRA values for two sets of different outcomes: i) excess weight loss (EWL) and perioperative complications, and ii) type 2 diabetes (T2DM) remission and perioperative complications. Treatments lying in the upper right corner are more effective and carry less risk than the other treatments

Discussion

This network meta-analysis has uniquely summarised all RCTs to date comparing the most commonly performed current metabolic surgical procedures, i.e. RYGB, SG and OAGB. The EWL demonstrated minor differences at the 1-year and 2-year time points, but no differences between procedures at longer term follow-up. Remission of type 2 diabetes was more likely to occur with either RYGB or OAGB when compared to SG. The risk of perioperative complications was higher with RYGB when compared to either SG or OAGB. When the potential benefits of EWL and T2DM remission were considered against the risk of perioperative complications, OAGB appears to offer the optimal balance of these factors across all time points. The findings from this NMA suggest that OAGB offers comparable metabolic control through weight loss and T2DM remission to RYGB and SG whilst minimising perioperative complications. Therefore, OAGB should be considered more strongly as a primary metabolic surgical procedure.

Previously published NMA studies in bariatric surgery have reported conflicting results. Park and colleagues [61] found that RYGB, SG or biliopancreatic diversion procedures, but not OAGB, produced equally good short and long-term T2DM remission. They considered a larger number of randomised interventions in the formation of their network; however, not all would be currently considered reasonable metabolic surgery interventions. The premise of NMA is that the included interventions could be ‘jointly randomisable’ as though offered in a contemporary RCT [62]. Banded bypass, gastric plication and even adjustable gastric banding are not considered standard practice in many parts of the world. In comparison, the current NMA compares the three most commonly performed metabolic surgical procedures worldwide.

According to the most recent IFSO registry report, the three most commonly performed procedures are RYGB, SG and OAGB [18]. IFSO endorses all of these procedures for use as metabolic surgical interventions [60, 63, 64]. RYGB is the acknowledged gold standard metabolic surgical intervention and has the most direct and indirect evidence supporting its use in this NMA. SG demonstrates excellent weight loss and metabolic control but is shown to be slightly inferior to RYGB when considering treatment of T2DM alone. This reason for the differences between bypass procedures and SG in metabolic control may be due to improved hepatic insulin sensitivity [38]. This is reflected in the increased use of RYGB, but also OAGB, compared to SG when considering metabolic surgery for T2DM in a recent IFSO report [65]. This NMA indicates that OAGB may offer acceptable weight loss and glycaemic control as a metabolic surgical procedure. However, there are concerns in the literature about the use of OAGB particularly with regard to risks of protein-calorie malnutrition and upper gastrointestinal malignancies related to bile reflux [66]. Whilst the rates of malnutrition reported in large multinational cohorts are low [67], a recent IFSO survey on OAGB recognised the potential serious risks of malnutrition, liver failure and bile reflux that may require surgical correction after OAGB [68]. Specifically relating to biliary reflux, there are case reports of biliary reflux associated cancers in the context of OAGB [69]. Whilst the overall risk of this complication is unclear, this should merit further investigation. There has been some debate that the limb length in OAGB may influence malnutrition [70], but there remains a lack of consensus on the optimal limb lengths in OAGB [68]. In the current NMA, the included trials had limb lengths from 200 to 350 cm [43, 47]. Some groups have found that reducing the limb length to 150 cm may obviate the risks of malnutrition [71], without a significant effect on the weight loss outcomes, whereas other groups have reported tailoring the limb length to one-third of the total bowel length offers the optimal metabolic outcome [72]. The latter approach may however be associated with higher risk of bowel injuries.

This NMA has identified comparative efficacies of RYGB, SG and OAGB to effect weight loss and induce remission of T2DM. Other studies have used data points in order to define which metabolic surgical procedures should be preferred for different indications. Aminian and colleagues, using data from 654 patients with diabetes who underwent RYGB or SG at the Cleveland Clinic, produced the Individualised Metabolic Surgical Score (IMSS) [73]. The IMSS incorporates the duration of T2DM, the number of preoperative antidiabetic medications, preoperative insulin use and glycated haemoglobin percentage (HbA1c (%)) to score patients into mild, moderate or severe categories. Patients with severe IMSS are proposed by the authors to undergo RYGB as it produces higher remission in this category on their validation set at a Spanish bariatric centre. No patients undergoing OAGB were included in that study, and there was minimal consideration of perioperative complications. Chiapetta and colleagues using data from the German register for obesity and metabolic surgery evaluated the risk of perioperative complications in patients with varying degree of obesity-related comorbidity using the Edmonton Obesity Staging System [74]. They found that patients with EOSS scores > 2 had a higher risk of morbidity which was reduced when either SG or OAGB was employed as the metabolic surgical procedure. The authors propose SG or OAGB is used to minimise risk in patients with severe obesity-related comorbidity. However, that study did not evaluate weight loss or T2DM remission outcomes. The current NMA extends these findings to suggest OAGB should be considered alongside RYGB and SG as an option for metabolic surgery in a patient presenting for treatment of obesity or obesity and T2DM.

This study has shown that both bypass procedures offer comparable weight loss and control of T2DM. The mechanism of action of RYGB is comparatively well characterised [75]. The pathways are likely to be multifactorial including changes in eating behaviour, humoral mediated through glucagon-like peptide 1 (GLP-1), gastroinhibitory peptide, glucagon and ghrelin signalling, bile acid kinetics and microbiome-associated effects. Conversely, the mechanism by which OAGB offers metabolic control comparable to RYGB remains to be established. Preliminary work indicates the mechanism of action of OAGB and RYGB may be similar [76]. Additional work in the experimental animal indicates the mechanism of metabolic improvement may be mediated by GLP-1 and bile acid pathways [77]. Overall, more work is required to characterise the mechanistic effect of OAGB in the human population.

Despite comprehensive reporting of all RCTS comparing RYGB, SG and OAGB, there are some limitations to this study that require consideration. By restricting the analysis to the three most commonly performed metabolic surgery procedures worldwide, some comparisons between techniques have limited direct comparative evidence—in particular, there are only two comparative trials of OAGB and RYGB at 1 and 2-year follow-up and only one at 3–5-year follow-up. Whilst this provides limited direct evidence, the strength of network meta-analysis is that the statistical technique permits use of indirect evidence to increase the precision of the treatment effect estimates. Indeed, by demonstrating consistency between direct and indirect treatment effects at all time points, this NMA analysis has shown the benefit of expanding the trial network beyond direct meta-analysis to maximise use of the randomised trial evidence in the area. However, more direct trials of OAGB and both RYGB and SG are needed to improve our understanding of their respective treatment effects. Whilst this study has been able to make a comprehensive comparison of trialist-defined complications, there is limited standardisation of reporting and grading of complications which precludes a more in-depth assessment. By including only RCTs, this NMA has limited reporting of longer term data. This may mean that some aspects of outcomes following these particular metabolic procedures may not have been captured, such as weight regain and the need for revisional procedures. Both RYGB and OAGB are associated with a long-term risk of the development of internal hernias and chronic abdominal pain. SG is associated with a long-term risk of GERD [78], and there are reports of long-term gastro-oesophageal and bile reflux following OAGB [64, 69]. By comparison with other NMAs, this current study assessed fewer procedural types, and therefore the confidence interval estimates are wider as fewer patients overall are included. However, the strength of this NMA lies in the fact that it meets the requirement of joint randomisability of included interventions [62], which other reviews do not by including procedures that are either not practiced by contemporary surgeons or not endorsed by major societies.

In summary, the current NMA has provided a synthesis of the evidence on the three most commonly used metabolic surgical procedures at present. RYGB is associated with the highest rate of perioperative complications. Weight loss at 1 and 2 years appeared greatest with OAGB, but at longer follow-up the differences were less certain. Remission of T2DM was more common with RYGB or OAGB compared to SG. This study has confirmed that RYGB, OAGB or SG should be considered as primary metabolic surgical procedures, and individualised approaches would be reasonable with any of these operations.