Introduction

The overall prevalence of non-alcoholic fatty liver disease (NAFLD) among patients with type 2 diabetes mellitus (T2D) is 55.5% [1]. Patients with T2D and obesity are at the highest risk of progression from fatty liver to non-alcoholic steatohepatitis (NASH) and cirrhosis [23]. Vice versa, non-alcoholic fatty liver disease (NAFLD) is associated with an approximate twofold higher risk of developing T2D and might precede the development of T2D [2]. The risk of T2D seems to correlate with severity of NAFLD, especially the degree of hepatic steatosis and fibrosis [4]. Current evidence suggests that bariatric-metabolic surgery (BS) decreases the grade of steatosis, hepatic inflammation, and fibrosis in patients with severe obesity [56]. Mechanisms include substantial weight loss, but also simultaneous effects on glucose homeostasis, lipid metabolism, and inflammatory pathways involved in NAFLD pathophysiology [7]. When comparing surgical procedures, Roux-en-Y gastric bypass (RYGB) might be superior to sleeve gastrectomy (SG) in reducing liver steatosis [89]. With regard to diabetes resolution, randomized clinical trials have found no differences in efficacy between RYGB and SG [10,11,12]. However, according to a large cohort study of 9710 adults with T2D, patients with RYGB seemed to have greater weight loss, a slightly higher T2D remission rate, less T2D relapse, and better long-term glycemic control compared with those with SG comparing 5-year diabetes outcomes [13]. Studies suggest that remission of T2D requires preferential and rapid reduction in liver fat, further supporting the ectopic fat hypothesis for diabetes [1415]. Recently, the presence of liver steatosis at the time of surgery has been found to be an independent predictor of long-term diabetes remission following RYGB [16].

Therefore, in the present study, we addressed the following question: Does the presence and severity of NAFLD in patients with obesity and T2D affect long-term diabetes remission following different types of BS?

Materials and Methods

Study Population

A total of 367 adult patients (≥ 18 years) who underwent either sleeve gastrectomy (SG) or Roux-en-Y gastric bypass (RYGB) according to the S3 Leitlinie (Guidelines) Chirurgie der Adipositas [17] were included in this study. Patients with second step procedures prior were considered as having SG (n = 1). Between 2014 and 2022, patients attended our university obesity clinic, which is certified by the European Accreditation Council for Bariatric Surgery as center of excellence for obesity and metabolic surgery. We obtained intraoperative liver biopsies in case of macroscopically suspected liver pathology. Liver biopsies in our cohort have been previously evaluated with a different aim and methodology approved by the ethics committee of the Hamburg Medical Chamber (TV-4889). An independent specialist in gastrointestinal pathology examined the liver biopsies and scored them according to Kleiner et al. [18]. We excluded patients with incomplete records, patients with presence of acute/chronic hepatitis apart from NAFLD, history of acute inflammation (pulmonary, gastrointestinal, urogenital, cutaneous or history of chronic autoinflammatory disease), hyperthyroidism, thyreostatic medication, history of alcohol abuse, and pregnancy (n = 72). Of the remaining 295 patients, 188 patients without T2D were excluded (n = 107).

All procedures performed in this study involving human participants were in accordance with the ethical standards of the institutional and/or national committee on human research and with the 1964 World Medical Association Declaration of Helsinki and its later amendments or comparable ethical standards. Retrospective data collection and anonymized analysis was conducted in accordance with local government law (HmbKHG. §12) without the requirement for informed consent.

Study Design

Follow-up data were retrospectively collected from 107 patients at baseline. To provide reasonable comparability between the cases, the available data were allocated 4 “visits” by time in relation to the procedure. In addition to baseline data − 3.6 ± 5.1 (mean ± SD) months before surgery, data from visit 1 (n = 100) were analyzed 6.0 ± 2.0 (mean ± SD) months after surgery, data from visit 2 (n = 74) 24.0 ± 4.9 (mean ± SD) months after surgery, data from visit 3 (n = 52) 47.4 ± 5.9 (mean ± SD) months, and data from visit 4 (n = 16) 95.4 ± 7.4 (mean ± SD) months post BS.

Variables

Data on height (cm), weight (kg), body mass index (BMI; kg/m2), sex, age, type of surgery, hemoglobin (g/dl), platelets (103/µl), CRP (mg/l), aspartate aminotransferase (AST; U/l), alanine aminotransferase (ALT; U/l), gamma-glutamyl transpeptidase (GGT; U/l), triglycerides (mg/dl), high-density lipoprotein cholesterol (HDL; mg/dl), low-density lipoprotein cholesterol (LDL; mg/dl) and HbA1c (%) were analyzed at baseline and during follow-up. Excess weight loss (EWL) in % was calculated by dividing the difference between initial BMI and final BMI by the difference between initial BMI and a target BMI of 25 kg/m2. Percent of total weight loss (TWL[%]) was calculated by dividing the difference between initial weight and postoperative weight by initial weight multiplied by 100 [19]. Insulin use, use of oral antidiabetic agents and glucagon-like peptide-1 receptor agonists (GLP-1 RA), was assessed at baseline and during follow-up visits. Patients with HbA1c ≥ 7% were defined as having poor glycemic control. Remission of T2D was defined as a return of HbA1c to < 6.5% following BS, that persisted for at least 3 months in the absence of usual glucose-lowering pharmacotherapy [20]. Liver histology was evaluated according to Kleiner et al. [18]. Steatosis was graded on a scale from 0 to 3 with grade 0 defined as < 5% liver fat, grade 1 as 5–33% liver fat, grade 2 as > 33–66%, and grade 3 as > 66%. Lobular inflammation was assessed according to the presence of inflammatory foci on a scale from 0 to 3: no foci (grade 0), < 2 foci per 200 × field (grade 1), 2–4 foci per 200 × field (grade 2), > 4 foci per 200 × field (grade 3). Liver cell injury was recorded as “hepatocyte ballooning” on a scale from 0 to 2: none (grade 0), few balloon cells (grade 1) and many cells/prominent ballooning (grade 2). Fibrosis stage (0–4) was defined as follows: none (F0), perisinusoidal or periportal (F1), mild, zone 3, perisinusoidal (F1A), moderate, zone 3, perisinusoidal (F1B), portal/periportal (F1C), perisinusoidal and portal/periportal (F2), bridging fibrosis (F3), and cirrhosis (F4). The NAFLD activity score (NAS) score was calculated as the sum of scores for steatosis, lobular inflammation and ballooning, ranging from 0 to 8 [18]. A NAS of ≥ 5 corresponded to the diagnosis of NASH, scores of < 3 to “no NASH” [18]. Noninvasive assessment to predict steatosis and fibrosis included the Heaptic-Steatosis Index (HSI) and the Fibrosis-4 index (FIB-4) [2122].

Statistical Methods

Standard descriptive statistics were used for all study end points. Distributions of continuous variables were described with mean and standard deviation (SD). Categorical variables were described with absolute and relative frequencies. Continuous variables at baseline were compared with Student’s t-test or Mann–Whitney test. Normality of continuous variables and equality of variances was tested using Shapiro–Wilk test and Levene’s test. Categorical variables were compared using chi-square statistics. Continues data between multiple visits were compared using one-way ANOVA with Dunnett’s test for multiple comparisons. Continuous data between different subgroups over the follow-up period was compared using two-way ANOVA with Šidáks test for multiple comparisons. Multiple logistic regression analysis was conducted to identify significant factors of long-term T2D remission. Odds ratio (OR) with 95% confidence intervals (CI) and p-values were calculated. P-values below 0.05 were considered statistically significant.

Results

Baseline Characteristics of Patients with T2D

Baseline characteristics are presented in Table 1 (n = 107). Mean age was 49.2 ± 11.3 years, 60.7% of patients were female. 69.2% of patients underwent SG, 30.8% underwent RYGB as initial weight loss procedure. Mean HbA1c prior to surgery was 7.6 ± 1.7%. 40.2% of patients were treated with insulin and 55.1% were treated with any oral antidiabetic medication other than insulin. 10.3% (n = 11) of patients were treated with GLP-1 RA. Sixty patients (57.0%) showed poor glycemic control.

Table 1 Baseline characteristics of patients with T2D

In total, liver steatosis (grade 1–3) was present in 103 patients (96.2%). Histological diagnosis of NASH defined by a NAS Score ≥ 5 was present in 43 patients (40.1%; data not shown).

84.1% (n = 90) of patients showed lobular inflammation (grade 1–3), 74.7% (n = 80) of patients showed hepatocyte ballooning (grade 1–2). Histological diagnosis of fibrosis (grade 1–4) was present in 48.6% of patients (n = 52), but only 8 patients (= 7.5%) had a score of F3 or F4 corresponding to significant fibrosis [23]. Accordingly, only 2 patients had a FIB-4 score of > 3.25 strongly suggestive of advanced fibrosis [19]. Mean Hepatic-Steatosis Index (HSI) was 64.6 ± 9.8 indicating a high likelihood of NAFLD [21].

Baseline Histological Characteristics of Patients without and with T2D Remission

Liver steatosis scores at baseline were different between patients without and with T2D remission (p = 0.007). Patients with T2D remission showed a higher percentage of low-grade steatosis (score 1) compared with patients without T2D remission. In those patients without T2D remission, steatosis grade 2 was most common. Interestingly, only 4 patients without hepatic steatosis showed no T2D remission. There were no significant differences in preoperative presence of lobular inflammation, hepatocyte ballooning or fibrosis between patients who achieved T2D remission or not post BS (Fig. 1).

Fig. 1
figure 1

Baseline histological characteristics of patients without and with T2D remission. Liver steatosis score grade 0–3 (A), lobular inflammation score grade 0–3 (B), hepatocyte ballooning score grade 0–2 (C), and liver fibrosis grade 0–4 (D) at baseline

Baseline and Long-term Follow-up Data of Patients Without and with T2D remission

At baseline, patients with T2D remission (n = 60) showed a lower mean HbA1c (6.6 ± 1.2 vs. 8.7 ± 1.4%; p < 0.001) and lower triglyceride levels (211.0 ± 104.2 vs. 275.9 ± 173.7 mg/dl; p < 0.05) than those without T2D remission (n = 47). There were no significant differences in liver enzymes, HSI and FIB-4.

Mean postoperative follow-up was 95.4 ± 7.4 months. At long-term follow-up, 50% (n = 8) of patients achieved remission of T2D. In patients with T2D remission, EWL% (66.1 ± 4.8 vs. 31.7 ± 25.9; p < 0.05) and TWL% (33.3 ± 9.5 vs. 15.5 ± 11.3; p < 0.005) were significantly higher compared to patients without T2D remission. There were no significant differences in liver enzymes, HSI and FIB-4. In both subgroups, mean HSI was ≥ 36, indicating that NAFLD diagnosis was highly likely. There was a significant decline in HSI in patients without T2D remission (p = 0.006) and with T2D remission (p < 0.001) (Fig. 2, Table S1).

Fig. 2
figure 2

Comparison between patients without and with T2D remission. BMI (A), TWL (B), HbA1c (C), and HSI (D) during long-term follow-up. BMI, Body Mass Index; T2D, type 2 diabetes; TWL (%), total weight loss; HSI, hepatic steatosis index; yr, years; *p < 0.005

T2D Remission at Long-term Follow-up Depending on Liver Steatosis Score

Since T2D remission rates were highest in patients with low-grade liver steatosis (grade 1), we compared patients with liver steatosis scores < 2 and ≥ 2 at baseline. HbA1c, triglycerides, AST, ALT, and GGT were significantly higher in patients with liver steatosis score ≥ 2 at baseline. Long-term, no significant differences were found in weight loss, metabolic and hepatic outcomes (Table 2, Figure S1).

Table 2 Comparison between patients with liver steatosis score < 2 (top) and ≥ 2 (bottom)

T2D Remission and NAFLD at Long-term Follow-up Depending on Type of Surgery

Regarding type of surgery, there was no significant difference in T2D remission rates, total weight loss, and HbA1c at long-term follow-up between SG and RYGB. Patients who have had RYGB (n = 33) showed greater improvement in HSI at short-term follow-up (p < 0.05), but not long-term (Fig. 3, Table S2).

Fig. 3
figure 3

Comparison between patients with SG and RYGB. BMI (A), TWL (B), HbA1c (C), and HSI (D) during long-term follow-up. BMI, Body Mass Index; SG, sleeve gastrectomy; RYGB, Roux-en-Y gastric bypass; TWL (%), total weight loss; HSI, hepatic steatosis index; yr, years; *p < 0.05

Discussion

In this retrospective analysis, we aimed to understand whether the presence of NAFLD in BS patients with T2D may affect the long-term remission rates of T2D following different surgical procedures. Here, we found that the presence of low-grade hepatic steatosis at the time of surgery was positively associated with long-term remission rates of T2D up to 8 years following both, RYGB and SG. According to previous research, T2D remission rates vary between 24 and 84% depending on patient baseline characteristics and the type of surgery with slightly superior efficacy of RYGB [24]. The prevalence of NAFLD varies among subgroups of patients with T2D identified by cluster analyses and occurs in up to 77% of patients with obesity and T2D [25]. Recently, liver steatosis at the time of surgery has been found to predict a higher chance of diabetes remission 5 years following RYGB suggesting primordial importance of intrahepatic fat. The analysis was performed on patients who underwent RYGB only [16].

In our cohort, T2D remission rates were 53.8% (n = 28) and 50.0% (n = 8) at 4-year and 8-year follow-up. In patients with T2D remission, TWL% at long-term was significantly greater compared to patients without T2D remission. Remission rates were highest in steatosis grade 1, which was accompanied by lower HbA1c, triglycerides, AST, ALT, and GGT levels at baseline. NAFLD is strictly linked to peripheral and hepatic insulin resistance and associated with pancreatic β cell dysfunction, even though a comprehensive understanding of the association between β-cell dysfunction and NAFLD is still missing [26]. A decrease in liver and pancreas fat content is required for remission of T2D, which is considered to be dependent upon capacity for β cell recovery [27]. Prasad et al. prospectively assessed measures of ß-cell function and found that preintervention β cell function and its change after RYGB predicted remission, but not changes in weight loss or insulin sensitivity [28]. Short-term improvements in beta-cell function using an intravenous glucose tolerance test were similar between SG and RYGB. However, it remained unclear if longer-term outcomes are better after RYGB due to greater weight loss [29].

There are only 13 observational studies evaluating the effects of BS on histopathological NAFLD and most of them assessed the effect of RYGB [30]. In general, improvements in steatosis, inflammation and fibrosis were reported [30]. In the only randomized controlled trial available, RYGB and SG were found to be similarly effective in reducing hepatic steatosis, with an almost complete clearance of liver fat 1 year after RYGB (100%) and SG (94%) assessed by magnetic resonance imaging [31]. In our cohort, type of surgery did neither affect the chance of achieving T2D remission, nor HSI score at long-term follow-up, presuming because weight reduction has to be considered as the main driver for ß-cell recovery. SG led to similar changes in total weight loss compared with RYGB, even though SG as primary intervention is considered to be inferior to RYGB in terms of weight loss (average weight loss at 1 year 25% vs. 31% with RYGB) and diabetes remission in the short and long term [32]. To establish predictive factors of diabetes remission after BS is of great clinical interest. Even though the DiaRem-Score is considered a valid, simple, and reproducible tool to predict the outcome of T2D after BS, it might not be complete since BMI and extent of weight loss were not considered in the model [33]. In our multivariate analysis, higher BMI (OR 1.19; p = 0.002), lower preoperative insulin use (OR 0.8; p = 0.001), and lower HbA1c (OR 0.29; p < 0.001) at baseline were associated with long-term T2D remission (n = 16) (Table S4), which is in line with previous data [16]. Our results support the hypothesis that predictors for diabetes remission after BS may be classified as belonging to either indirect indices for preserved pancreatic beta-cell function, including younger age, shorter duration of diabetes, higher C-peptide, lower HbA1c and no use of insulin or reflect the potential for an insulin resistance reduction, including higher baseline BMI and visceral fat area [34]. In our cohort, preserved beta-cell function seemed to be the major determinant of remission which links to the concept of metabolic memory, since patients with poor glycemic control and insulin use at baseline were more likely to experience treatment failure with regard to diabetes remission despite relevant weight loss.

Since it was not feasible to perform liver biopsy 8 years post surgical intervention, we calculated HSI as simple and efficient screening tool for NAFLD at long-term follow-up [35]. We observed a significant improvement in HSI over time in patients without and with T2D remission, but HSI at long-term follow-up was not significantly different between the groups and still indicating a high likelihood of NAFLD diagnosis. Even though diagnostic accuracy of HSI might be limited in the postbariatric population, this aspect may further support the results of Vangoitsenhoven et al. that the preoperative presence of steatosis plays a weight-independent additive predictive role, but is of secondary importance compared to indirect markers of ß-cell reserve and weight loss [16]. Prospective studies are needed to further evaluate whether the degree of hepatic steatosis might serve as a surrogate parameter for glycemic control predicting T2D remission over time following BS T2D remission over time. The duration of the benefit of surgery in terms of diabetes remission is still unclear. In our cohort, HbA1c was still decreasing by trend at 8 years follow-up without reaching statistical significance and paralleled the decline in HSI during follow-up.

Our study has a few limitations. Here, we report a retrospective analysis, at 96-month follow-up, only 16 patients have been included into the analysis. Measurement of direct markers of ß cell function and insulin resistance were not performed.

In conclusion, the presence of NAFLD should routinely be assessed in patients with obesity and T2D and impact treatment decisions. Metabolic surgery, independent of type of surgery, should be considered at an early NAFLD stage in patients with T2D. Moreover, mechanistic studies that further elucidate the bidirectional relationship between T2D remission and NAFLD are needed.