Introduction

Due to the excellent short- and long-term weight loss (WL) and durable improvements in obesity-associated comorbidities, bariatric and metabolic surgery (BMS) is considered the best treatment for extreme obesity [1,2,3,4,5,6]. However, not all patients maintain their achieved WL after BMS, and some experience weight regain (WR). For instance, the prevalence of WR ranged between 5.7 and 76% at 2–6 years after laparoscopic sleeve gastrectomy (LSG) [7, 8]; and a meta-analysis of patients with ≥ 7 years follow-up demonstrated a long-term WR rate of 27.8% (range 14–37%) [9].

Such wide variability in the proportion of patients who regain weight after BMS probably reflects the inconsistencies in the way WR is assessed. While standard definitions for WL after BMS surgery exist [10], the lack of a standardized WR definition results in variations and therefore imprecise comparisons across studies, procedures, settings, patient groups, and countries. This contributes to our poor understanding of WR and its significance [11].

Indeed, the definitions of WR after BMS are numerous and employ a wide range of different parameters, e.g., BMI or excess weight loss percentage (EWL%) as well as different cutoffs [12,13,14,15,16,17,18,19,20]. There have been calls to rectify this deficiency but identifying a single definition of clinically significant WR is challenging [21, 22].

Despite such calls, sparse research has assessed the prevalence of WR using the different definitions, and the literature remains limited in number and scope. For instance, in terms of the BMS procedure, most studies assessed the effects of WR definitions after Roux en Y gastric bypass (RYGB) only or after other BMS procedures, but not after  laparoscopic sleeve gastrectomy (LSG), despite its popularity and different outcomes [12, 14, 21, 23,24,25]. Hence, the current study focused on LSG to bridge this gap. In addition, some studies appraised the effects of various definitions on WR rates at single time point, e.g., at five years [12, 21] without providing the prevalence of WR at multiple successive time points. While such an approach is useful, it provides a limited “single snapshot” view rather than the ongoing mechanics of the WR definitions across time. Equally important is that very few studies have evaluated how various WR definitions are associated with different comorbidities. Such understanding is critical to appraise the clinical implications of each of these definitions [14, 23]. WR has important health consequences including relapse of obesity-associated comorbidities including type 2 diabetes (T2DM), hypertension, and dyslipidemia [12, 24,25,26,27,28]

Given the above knowledge gaps, the aim of the present study was to assess the prevalence of WR after primary LSG using the different definitions. The specific objectives were to evaluate the extent of variability of WR rates based on six definitions measured at four time points (years 2, 3, 4, and 5 after LSG); and, appraise the associations between the different WR definitions and a range of patient characteristics as well as the remission of three comorbidities at 5 years after LSG. Clearer definitions will help to understand the clinical significance of such definitions, and hence provide guidance when intervention is required [11, 29]. We selected LSG as it is the most common procedure performed at our institution, comprising almost 95% of our surgeries. Patients with revisional BMS were excluded as nadir weight, which is essential for four of the six WR definitions we examined, would be difficult to determine precisely [14].

Materials and Methods

Study Design, Ethics, and Participants

This retrospective study was approved by the Medical Research Centre at Hamad Medical Corporation in Doha, Qatar (IRB Protocol #MRC-01–20-658). The inclusion criteria were all adults aged ≥ 18 years with BMI ≥ 40 or BMI ≥ 35 kg/m2 with comorbidities who underwent primary LSG at the BMS Centre in our institution and had ≥ 5 years follow-up data. Hence, patients operated upon from February 2016 to September of 2017 were eligible. A total of 598 patients underwent LSG during this time period. Eleven of these subsequently underwent revisional surgery and were excluded. The remaining 587 patients were included in the current analysis. The follow-up rate was 88.5, 85.8, 81, 74.9, and 61.1% at years 1, 2, 3, 4, and 5 respectively. The BMS service at our institution includes pre- and postoperative care provided in line with standard international guidelines [30]. The surgical technique of LSG at our institution has been described elsewhere and has not changed much since the establishment of the BMS department [5, 31].

Procedures and Data Collection

We searched patients’ electronic records and retrieved pre- as well postoperative data across the five years. Information retrieved included demographics (age, sex) anthropometric measures (weight, height) data, and other patient characteristics [(number of follow-up visits, number of comorbidities, medication use (based on pharmacy dispense records and physician documentation)]. We also retrieved clinical data [systolic and diastolic blood pressure (SBP, DBP)], as well as cardiometabolic/laboratory values [total cholesterol (TC), triglycerides (TG), high-density lipoprotein (HDL), low-density lipoprotein (LDL)].

Weight Loss and Regain Measures

We computed the BMI, BMI change, excess weight (EW), excess weight loss percentage (EWL%), weight loss (WL), and total weight loss percentage (TWL%) at 1, 2, 3, 4, and 5 years after surgery using formulas recommended in previous studies [10, 31, 32]. Nadir weight was the lowest value achieved of all postoperative weight measures available prior to year five. WR was calculated at 4 time points (starting from the second year) based on the six definitions. Definitions, cutoffs, and calculations of WR are shown in Table 1.

Table 1 Six definitions of weight regain

Definitions of Comorbidities and their Remission

The presence of the three comorbidities prior to surgery were retrieved and ascertained from the patients’ electronic records including type 2 diabetes (T2DM), hypertension (HTN), and dyslipidemia. Such patients were either already diagnosed before presenting to our bariatric clinic or diagnosed when screened prior to surgery according to agreed international standards [10, 33, 34]. At 5-year follow-up, the status of these comorbidities was determined using the ASMBS guidelines [10].

The Bariatric and Metabolic Service: Standard Postoperative Care

The evolution and components of the Bariatric and Metabolic Surgery Department at our institution have been detailed elsewhere [35]. After surgery, patients are routinely followed by a multidisciplinary team of bariatric surgeons, physicians, dietitians and physiotherapists. Follow-up visits are scheduled at 2 weeks and then at 1, 3, 6, and 12 months, and yearly thereafter. Dieticians and physical therapists individually counsel patients on the routine post-surgery dietary intake and physical activity in accordance with international guidelines.

Statistical Analysis

All values for weight loss and regain were calculated in Microsoft Excel using basic descriptive statistics and customized formulae. Categorical variables were expressed as frequencies and percentages, and continuous variables as means and standard deviations. Logic tests were used to determine patients who met individual definitions at each time point according to the definitions outlined in Table 1 and by others [21].

Binary logistic regression was undertaken to assess the associations between baseline patient characteristics (age, sex, preop BMI, number of follow-up visits, number of comorbidities) and WR at five years according to each of the six definitions. Logistic regression was also used to test the associations between WR by each definition with the remission of the three comorbidities under examination (hypertension, type 2 diabetes mellitus, and dyslipidemia). Assumptions regarding regression were tested and met. IBM SPSS Statistics version 20 (UK) was used for the analysis, and P value < 0.05 was considered statistically significant.

Results

Study Population: Preoperative and Selected Postoperative Characteristics

Table 2 depicts that the mean age of patients was 34 years and females were slightly more represented. More than half the sample were married and about three quarters were current smokers. Mean BMI was 43.13 kg/m2. The most common comorbidities were dyslipidemia, T2DM, and hypertension (about one fifth of the sample each). These were followed by prediabetes, asthma, hypothyroidism and back pain (10–11% each). Gastroesophageal reflux disease, obstructive sleep apnea, and coronary artery disease were less common (2–9% each). Other comorbidities were rare (< 1%). Mean blood pressure of the sample was normal. Mean number of follow-up visits with the bariatric clinics were 3.1 visits.

Table 2 Preoperative and selected postoperative characteristics of patients (N = 587)

Anthropometric Characteristics at 5 Time Points

Across the total population, both weight and BMI showed a steady decrease until year 3 followed by a slight increase to year 5, echoed by the weight loss that displayed a steady decrease until year 3 followed by a slight increase to year 5 (Table 3). BMI change fluctuated to reach its maximum in year 2 (13.96 kg/m2) and its minimum in year 5 (11.67 kg/m2). Both %EWL and %TWL increased until year 2 and then started slightly decreasing up to year 5.

Table 3 Postoperative characteristics at 5 time points for the total population

The number and percentage of patients reaching their nadir weight at each of the 4 time points were: at year 1 (189, 36.4%), year 2 (218, 43.3%), year 3 (89, 18.7%), year 4 (73, 16.82%); and for the whole sample, mean nadir weight was 75.40 ± 13.71 kg. A total of 529 (92.97%) patients achieved successful weight loss (defined as %EWL at nadir > 50), with a mean time to achieve minimum weight of 2.08 ± 1.00 years.

%TWL at 5 Time Points by WR Definition

The %TWL is presented in Fig. 1 for all patients and for every weight regain group separately. The %TWL for all groups was calculated relative to preoperative weight. As WR was calculated relative to nadir and the earliest time point where patients could have measured nadir was at year one, WR definitions were only applied from year 2 onwards. Definition 5 (any WR) consistently yielded the highest %TWL for any given year across all the examined years. All other definitions generated relatively lower %TWL for any given year across all the years.

Fig. 1
figure 1

Percentage total weight loss (%TWL) over a period of 5 years for the total population and weight regain groups separately

Weight Regain Rates

Table 4 depicts that during years 2, 3, 4, and 5, the percentage of patients with WR fluctuated between 2.53% and 94.18%, subject to the definition and time point. At the 5-year follow-up, the highest prevalence of WR was that of definition 5 (any WR), where about 86% of patients had experienced WR. For the remaining definitions, the percentages of patients classified as experiencing weight regain ranged between 18 and 32%.

Table 4 Proportion of patients with weight regain at 4 time points using different definitions

The definition yielding the least prevalence of WR differed by year but generally, definitions 3 and 4 were responsible for the lowest WR rates across the five time points. The remaining three definitions generated WR rates that fell in between the maximum and minimum rates (Fig. 2).

Fig. 2
figure 2

Percentage of patients with weight regain over a period of 5 years for the total population and by weight regain definition

Associations of Weight Regain Definitions with Patient Characteristics at 5 Years

Table 5 shows that at 5 years, male sex was less likely to be associated with two definitions (defs 1 and 6; P range < 0.026 to 0.032). Patients with higher preoperative BMI had a higher likelihood of experiencing weight regain in three definitions (defs. 1, 3, and 4; P < 0.001–0.049). The number of comorbidities was positively associated with one definition (def. 2; P = 0.01). The remaining parameters under examination were not associated with any WR definitions.

Table 5 Associations of weight regain definitions with selected patient characteristics at 5 years

On the other hand, definition 1 was significantly associated with two characteristics (sex, preoperative BMI), while definitions 2, 3, 4, and 6 were each significantly associated with only one parameter (def. 2 with number of comorbidities; definitions 3 and 4 with preoperative BMI; def. 6 with sex).

Associations of Weight Regain Definitions with Remission of Comorbidities at 5 Years

At five years, remission of hypertension was the only one associated with any of the WR definitions (def. 5, any WR, P = 0.025) (Table 6). No other WR definition was significantly associated with the remission of the comorbidities under examination.

Table 6 Associations of weight regain definitions with remission of comorbidities* at 5 years

Discussion

To date, definitions of WR after BMS are premised on a variety of different parameters, and within each parameter, employ a wide range of different cutoffs. Despite the variability in the way WR is defined and measured, it is frequently used as a key outcome and employed in comparisons of mid- and long-term effectiveness of bariatric procedures vis-a-vis each other [7, 9, 27, 36]. Such a situation inevitably raises the question: When definitions differ, are comparisons meaningful and do differences matter? In response, the current study appraised the extent of WR among a large sample of patients and explored its dynamics across five years using 6 different definitions. We also assessed the associations between different WR definitions and  patient's characteristics as well as the remission of comorbidities at five years.

The main findings were that in terms of the time points, during years 2, 3, 4, and 5, the percentage of patients with WR fluctuated between 2.53 and 94.18%, subject to the definition and time point. At the 5-year follow-up, definition 5 (any WR) generated the highest prevalence of WR. The regression analysis at 5 years showed that sex, preoperative BMI and number of comorbidities were related to experiencing WR using various definitions. On the other hand, for remission of comorbidities, only hypertension remission was related to any of the WR definitions under examination. As expected, the definition of WR certainly influenced the WR prevalence, yielding considerable variations in the WR rates.

In terms of the breadth of the WR, at five years, 17.7–86% patients in the present study experienced WR subject to the definition used. Such a wide range of WR is very similar to the 16–87% WR prevalence five years after RYGB/LSG reported by Voorwinde et al. [21]. However, Lauti and colleagues [12] observed a higher WR (40–91%) using similar definitions as in the current study. Likewise, others observed WR ranging between 43.6–86.5% at five years after RYGB [14]. Two reasons might account for the lower WR rates observed by the current study and by Voorwinde et al. [21] compared to that of Lauti [12]. The first could be the lower mean preoperative BMI in our and Voorwinde’s studies [21] (43.12 and 44.8 kg/m2 respectively), compared to the higher baseline BMI of patients (50.7 kg/m2) in Lauti’s study [12]. Higher preoperative BMI is associated with a greater likelihood of WR [11, 37]. A second reason that cannot be ruled out could be the variations between centers and countries regarding the extent of multidisciplinary program before and after surgery and extent of follow-up on the recommended dietary, physical activity, and behavior modifications that might influence the sustainability of positive changes accomplished after BMS, hence playing a role in preventing WR or otherwise [11, 30, 38,39,40,41]. It is also not straight forward to rule out how patient characteristics might play a role if any, as 64.23% of our patients were females and mean age was 34 years; while these studies [12, 14, 21] had slightly higher percentage of females (79–81%) and an older population (46–49 years).

As for the magnitude of WR, based on the definition used, our highest proportion of patients with WR was with definition 6 (any WR, 86%), concurring with the findings of previous studies where this same definition yielded the highest WR rate (87–91%) [12, 21]. Notwithstanding, others found that the WR definition of “ ≥ 10% WR of maximum weight lost” was the one associated with highest proportion of patients with WR at five years (86.5%) [14]. On the other hand, in the current study, definition 4 yielded the lowest WR rate (17.7%), concurring with the findings of previous studies where this same definition resulted in the lowest WR rate (16%) [21].

Regarding the dynamics of WR across time, the proportion of patients with WR in the present study increased throughout the study period, regardless of the definition. This agrees with a study among RYGB patients and although the rate increased throughout the five years, the authors observed that the largest WR occurred during the first year after reaching nadir weight [14].

Collectively, the above findings highlight two points. The first is that some degree of WR occurs (and perhaps should be reasonably unsurprising) among most patients after BMS [11, 12, 21, 42] The second is that our findings as well as the literature illustrate a situation that requires overdue attention. They highlight the importance of debating and where possible standardizing the reporting of WR across the bariatric literature. The use of different WR definitions applied to the same population significantly alters the proportion of patients categorized as regaining weight. Such inconsistencies result in significant and clinically important findings.

As for the regression analysis, we observed that male sex was less likely to be associated with two definitions (definitions 1 and 6; P range < 0.026 to 0.032). Findings are inconsistent across studies, with some reporting no relation between sex and any of the 6 WR definitions [21]; or male gender being a predictor of WR [43]; or no association between gender and WR [23]. In connection with age, the current study noted no associations between age and WR across any of the definitions, where others found that age was inversely related to WR [21, 43].

As for preoperative BMI, it was positively associated with three WR definitions that incorporated weight and/or BMI (defs. 1, 3 and 4; P range < 0.001 to 0.049). This concurs with other studies where preoperative BMI was positively related to WR when definitions were based on changes in BMI, %EWL, and kilograms [21]. Likewise, one study showed that pre-surgery BMI was sometimes a positive significant predictor and at other times a negative predictor of WR subject to the definition used and its components (WR of “10 kg nadir,” “an increase of > 20% of maximum weight loss,” or “increase of > 25% EWL from nadir”) [23]. A recent study found that a preoperative BMI of > 45 kg/m2 was associated with WR after LSG, regardless of the definition of WR [44].

In terms of the number of comorbidities, we found it was positively associated with one definition (def. 2; P = 0.01). While there is no literature to directly compare this finding with, inconsistencies again exist. Some authors found that WR was associated with several comorbidities including obstructive sleep apnea and fatty liver disease [44], while others noted that the presence of T2DM or HTN were both not associated with WR definition [23].

Regarding to the remission of comorbidities, in the present study, WR was only significantly associated with remission of hypertension (def. 5 “any WR,” P = 0.025), albeit in a direction opposite to what would be expected. This suggests that comorbidity remission might not be entirely weight dependent. Such view is further supported by our observation that the remission of T2D and dyslipidemia were also not associated with any of the WR definitions. Our findings concur with other studies suggesting that WR might not largely be related to comorbidity remission. For instance, Voorwinde found that WR was associated with only one of the five comorbidities they examined (obstructive sleep apnea) [21]. Other mechanisms frequently play a role in resolution. For instance, several factors contribute to hypertension remission, including gut hormones (peptide YY and glucagon-like peptide-1), resolution of other obesity-related comorbidities that share pathophysiologic mechanisms with HTN (e.g., obstructive sleep apnea), decreases in systemic inflammation leading to reduced arterial stiffness, as well as decreased sodium reabsorption and diminished sympathetic activation [45,46,47,48].

As to the question of which WR definition is most appropriate to assess patient with WR, Table 7 provides a summary of design, patient populations, type of surgery, time points of assessment, and findings of the current and another four published studies that attempted to push the boundaries in order to answer this critical question. The table clearly illustrates the extent of the range of the variables that were assessed, when and how many times, as well as the inconsistencies in the findings and the wide variability in the reported range of values of WR and many dissimilarities in the associations/non-associations of different WR definitions with patient characteristics, resolution/remission of comorbidities, and patient perspectives (e.g., health-related QoL or satisfaction after surgery). The table undoubtedly shows that identifying one single categorical definition of clinically significant WR is difficult. Certainly, answering the question of which WR definition is the most appropriate to assess patient with WR requires further efforts and research from the bariatric community in order to reach a consensus.

Table 7 Towards a single definition of WR: Summary characteristics and findings of current study and comparison with the literature

This study has limitations. Nadir weight could have been reached between yearly assessments time points. Appraisal of remissions of obstructive sleep apnea and osteoarthritis would have been beneficial, although their prevalence was low among our sample. We had some missing data due to loss of follow-up and we are unable to confirm whether such missing values could be related to WR, as some studies found that patients that attend their postoperative appointments have better %EWL compared to those who failed to attend [49]. Data on patient perspectives would have been useful to collect, e.g., health-related quality of life or patient satisfaction with surgery. Future research should address these limitations. Despite this, the present study has many strengths. It is one of the very few studies to test different WR definitions, and one of the fewer to appraise such definitions across patients who had undergone LSG. Unlike others [21] where subjective patient-reported medication use and blood pressure were employed, the status of comorbidities and their remission in the current study was premised on objective findings (blood tests and medication used ascertained from pharmacy dispense records).

Final Thoughts—is Weight Regain a Useful Clinical Concept?

‘Exploring the unknown requires tolerating uncertainty’.

Brian Greene

Realistically, some WR is expected after BMS, thus multidisciplinary bariatric teams need to encourage long-term healthy lifestyle after surgery. Notwithstanding, the study findings uncover two critically challenging approaches.

On the one hand, using the findings of the current study pragmatically, awareness that higher preoperative BMI may be associated with longer term WR after LSG could prompt patients, when feasible, to seek earlier treatment, although this might not be entirely under their control. For the bariatric team, preoperative knowledge of patient groups at risk of WR, namely males with higher BMI and multiple comorbidities will help clinicians, where indicated, to offer earlier surgical intervention. Postoperatively, awareness of at-risk groups will also aid in providing early prevention or intervention when patients are not on their expected/optimal weight loss trajectory.

Conversely, on the other hand, the study findings raise concern about whether asking the question “Which WR definition is the most appropriate?” represents a suitable approach, and whether identifying a single categorical definition of clinically significant WR is a useful exercise. Perhaps the validity of WR as an outcome employed in comparisons and gauging of effectiveness of BMS procedures deserves a serious revisit. In particular, our findings as well as those of more recent studies [21] highly suggest that the current dichotomous WR definitions represent minor clinical significance as: (a) they showed mostly no associations or odd weak association with clinical outcomes; and (b) such associations were significant with only one of the comorbidities examined. Deliberations are required by expert groups, regional and international BMS agencies as to whether using metric definitions of WR after BMS represent an advantageous concept.

Given the above contrasting views, it might be argued that dichotomous WR definitions could offer some guidance in the assessment and management of individual patients. However, as a comparator metric across patients and procedures, its utility might need to be refined. Some authors have suggested a potential alternative concept that is increasingly being employed across the bariatric literature [50], where health benefits are assessed according to weight loss (e.g., sustained 5%, 10%, 15%, and 20% weight loss) [10, 51].

Conclusion

The prevalence WR after BMS remains inconsistent and its clinical significance uncertain. Our findings indicate that some WR could be reasonably expected after BMS. If preventing WR per se is viewed as an important dimension of success of BMS due to amelioration of comorbidities, then our findings indicate that the present dichotomous WR definitions are of minor clinical significance due to the weak associations with limited comorbidities. A deeper debate is required, as identifying a single categorical definition of clinically significant WR is difficult, and its full potential remains to be determined. Future research should address this challenge.