Introduction

The clinical implications of a novel concept named “metabolic dysfunction-associated fatty liver disease” (MAFLD) intrigue clinical interest. FLD is becoming more common and a serious public health concern. A global panel of experts proposed the new terminology MAFLD in 2020 [1], which replaced the term “non-alcoholic fatty liver disease” (NAFLD). Regardless of alcohol consumption habits, MAFLD has its own inclusion criteria based on a number of metabolic abnormalities. Diabetes mellitus (DM) is a metabolic abnormality included in the diagnostic criteria for MAFLD. Reduction in body weight can bring about improvements in glucose homeostasis and lessen cardiometabolic risk factors in patients with DM; however, lifestyle‐based weight loss interventions (e.g., exercise, diet, and behavior modification) may not be effective in the long term [2]. On the other hand, a recent systematic review shows that some glucose-lowering drugs are effective in inducing weight loss in patients with DM [3]. Sodium-glucose cotransporter 2 (SGLT2) inhibitors (inhibiting the reabsorption of glucose in the proximal tubule, resulting in the promotion of urinary glucose excretion and improvement in glycemic control) were originally developed as drugs for DM. Recent clinical trials have demonstrated the robust cardiovascular and kidney protective effects of SGLT2 inhibitors [4,5,6,7,8,9,10,11,12,13,14]. Furthermore, several small clinical trials have shown that SGLT2 inhibitors could also be effective against FLD [15,16,17,18]. However, little is known regarding whether the results of previous RCTs focusing on the advantage of SGLT2 inhibitors for FLD could be applicable to a broader range of patients with DM and MAFLD encountered in real-world clinical practice. Therefore, we analyzed a nationwide epidemiological database and sought to validate the potential benefits of SGLT2 inhibitor use in patients with DM and MAFLD.

Materials and methods

Anonymized data are publicly available for purchase from JMDC Inc.

Study population

This retrospective cohort study used the JMDC Claims Database, a large-scale administrative claims database [19,20,21]. The JMDC includes annual health checkup data (e.g., blood tests and anthropometric measurements) and health insurance records between 2005 and 2022. In Japan, annual health checkups for employees are a legal requirement. The JMDC Claims Database accumulates insurance claims data. Medical diagnoses were coded according to the International Classification of Diseases, 10th revision (ICD-10). We extracted the data of 21,883 individuals with DM (ICD-10 codes E10–E14) and MAFLD defined as ICD-10 codes of E10–E14 and fatty liver index (FLI) ≥ 30 [22, 23], who newly initiated SGLT2 (empagliflozin, dapagliflozin, canagliflozin, ipragliflozin, tofogliflozin, and luseogliflozin) or dipeptidyl peptidase 4 (DPP4) inhibitors (Supplementary Fig. 1). FLI was calculated using the following formula: FLI = 1/1 + e−(0.953×ln(triglycerides) + 0.139×(BMI) + 0.718×ln(γ−glutamyl transpeptidase [γ−GTP]) + 0.053×(waist circumference)−15.745) × 100. We defined initiating either drug class among individuals who had not previously used either drug class within the previous year as a new use. Furthermore, only individuals with available repeated data for the assessment of FLI during health checkups were included in this study. Among 21,883 individuals, we excluded participants for the following reasons: age < 20 years (n = 1); a history of liver disease defined as liver cancer (ICD-10 code: C22), fibrosis and cirrhosis of the liver (ICD-10 code: K74), hepatitis B (ICD-10 code: B16), hepatitis C (ICD-10 code: B182), autoimmune hepatitis (ICD-10 code: K754), and cholangitis (ICD-10 code: K830) (n = 317); and missing cigarette smoking (n = 262), alcohol consumption (n = 1257), and physical activity (n = 521) data. Finally, 19,525 individuals were included in this study (Supplementary Fig. 2).

Ethics

This study was approved by the Ethics Committee of the University of Tokyo (approval number: 2018-10862), and informed consent was not required because all data included in the JMDC Claims Database were anonymized and de-identified.

Measurements and definitions

We obtained the following data from the health checkups: body mass index (BMI), waist circumference, blood pressure, laboratory data (fasting plasma glucose, hemoglobin A1c [HbA1c], low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, triglycerides, aspartate aminotransferase [AST], alanine aminotransferase [ALT], and gamma-glutamyl transpeptidase [γ-GTP]), cigarette smoking (current or noncurrent/never), alcohol consumption (daily or not every day), and physical activity (active or inactive). Cigarette smoking and alcohol consumption were assessed using a self-report questionnaire during the health checkup. Physical inactivity was defined as not exercising for 30 min ≥ 2 times a week or not walking for more than an hour per day. Based on the ICD-10 code, we obtained data on the presence of diabetic nephropathy (ICD-10 codes: E102, E112, E122, E132, and E142), diabetic retinopathy (ICD-10 codes E103, E113, E123, E133, and E143), and diabetic neuropathy (ICD-10 codes: E104, E114, E124, E134, and E144) at the prescription date of SGLT2 or DPP4 inhibitors. Data on concomitant medications at the prescription date of SGLT2 or DPP4 inhibitors were extracted from administrative claims records.

Propensity score matching

A propensity score matching algorithm was used to generate a matched cohort to compare the benefits of SGLT2 and DPP4 inhibitor use. We estimated the propensity score for SGLT2 inhibitor users using a logistic regression model. To estimate the propensity score, we included the following variables: age, sex, BMI, waist circumference, systolic blood pressure, diastolic blood pressure, fasting plasma glucose, HbA1c, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, triglycerides, AST, ALT, γ-GTP, cigarette smoking, alcohol consumption, physical inactivity, diabetic nephropathy, diabetic retinopathy, diabetic neuropathy, use of medications (insulin, glucagon-like peptide-1 receptor agonist, biguanide, sulfonylurea, α-glucosidase inhibitor, thiazolidine, glinide, renin-angiotensin system inhibitor, β-blocker, calcium channel blocker, mineralocorticoid receptor antagonist, diuretics, and statins), year of SGLT2 or DPP4 inhibitors prescription, and FLI at the initial health checkup. We matched SGLT2 and DPP4 inhibitor users using a 1:1 matching protocol (caliper width equal to 0.2 standard deviations of the logit score).

Outcomes

Outcomes were obtained from the annual health checkup data between 2005 and 2022. The primary outcome was the change in FLI after the initiation of SGLT2 or DPP4 inhibitors. The secondary outcomes were changes in γ-GTP, AST, ALT, BMI, waist circumference, and HbA1c levels. We followed the study participants for a maximum of 5 years after the initial health checkup.

Statistical analysis

The median (interquartile range) and number (percentage) were used to report descriptive statistics. We used a linear mixed-effects model for repeated measures with random intercept and slope, assuming an unstructured covariance structure, to compare the change in outcomes, including FLI, γ-GTP, AST, ALT, BMI, waist circumference, and HbA1c, among SGLT2 and DPP4 inhibitors. This model included the treatment group (SGLT2 or DPP4 inhibitors), time, and interaction between the treatment group and time. To examine the difference in the outcome changes among SGLT2 or DPP4 inhibitors, the P-value for the interaction between the treatment group and time was tested using Wald test.

If a significant difference in the primary outcome change was detected between SGLT2 and DPP4 inhibitors, we compared the change in FLI between individual SGLT2 inhibitors to examine whether the effects of SGLT2 inhibitors would be considered a class effect. We also performed a linear mixed-effects model for repeated measures to compare the change in FLI among empagliflozin, dapagliflozin, canagliflozin, ipragliflozin, tofogliflozin, and luseogliflozin. Presently, these six SGLT2 inhibitors are commercially available in Japan.

This model included individual SGLT2 inhibitors, time, the interaction between the individual SGLT2 inhibitors and time, age, sex, BMI, waist circumference, systolic blood pressure, diastolic blood pressure, fasting plasma glucose, HbA1c, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, triglycerides, AST, ALT, γ-GTP, cigarette smoking, alcohol consumption, physical inactivity, diabetic nephropathy, diabetic retinopathy, diabetic neuropathy, use of the following medications (insulin, glucagon-like peptide-1 receptor agonist, biguanide, sulfonylurea, α-glucosidase inhibitor, thiazolidine, glinide, renin-angiotensin system inhibitor, β-blocker, calcium channel blocker, mineralocorticoid receptor antagonist, diuretics, and statins), and year of SGLT2 inhibitor prescription. To examine the difference in outcome changes among individual SGLT2 inhibitors, the P-value for the interaction between individual SGLT2 inhibitors and time was tested using a Wald test.

Three sensitivity analyses were performed to validate the primary findings. First, we examined the changes in FLI only in individuals who continued to use SGLT2 or DPP4 inhibitors for > 3 months. Second, we performed a linear mixed-effects model using a restricted cubic spline function with 4 knots to confirm the shape of the change in FLI from the initiation of SGLT2 inhibitors or DPP4 inhibitors. Third, we performed subgroup analyses stratified by age (≥ 50 and < 50 years), sex, BMI (≥ 30 and < 30 kg/m2), and HbA1c level at the initial health checkup (≥ 7.5 and < 7.5%). All statistical analyses were performed using STATA version 17 (StataCorp LLC, College Station, TX, USA).

Results

Clinical characteristics

Table 1 shows the baseline clinical characteristics of study participants before and after propensity score matching. After 1:1 propensity score matching, 6547 pairs were created. After propensity score matching, the individual distributions were well balanced between SGLT2 and DPP4 inhibitor users. The median age was 51 (46–56) years for SGLT2 inhibitor users and 51 (45–56) years for DPP4 inhibitor users. In addition, 5476 (83.6%) individuals were men in SGLT2 inhibitors users, and 5449 (83.2%) individuals were men in DPP4 inhibitors users. The median FLI was 78.2 (59.1–90.9) in SGLT2 inhibitor users and 78.7 (59.5–91.2) in DPP4 inhibitor users.

Table 1 Baseline Characteristics

Change in outcomes among SGLT2 and DPP4 inhibitors

The mean follow-up period was 750 ± 437 days. Figure 1 shows the changes in outcomes after the initiation of SGLT2 or DPP4 inhibitors. SGLT2 inhibitor users showed a greater decline in FLI than that of DPP4 inhibitor users. The predicted difference in FLI among SGLT2 and DPP4 inhibitors at 1-year measurement was -3.8 (95% confidence interval [95% CI], − 4.7 to − 3.0). A significant interaction was detected between the treatment group (SGLT2 or DPP4 inhibitors) and the time spent on FLI (P for interaction < 0.001).

Fig. 1
figure 1

Comparison of the Change in Outcomes among SGLT2 Inhibitors and DPP4 Inhibitors. We performed a linear mixed-effects model to compare the change in fatty liver index (A), γ-glutamyl transpeptidase (B), aspartate aminotransferase (C), alanine aminotransferase (D), body mass index (E), waist circumference (F), and hemoglobin A1c (G) among SGLT2 inhibitors and DPP4 inhibitors. The model included treatment group (SGLT2 inhibitors or DPP4 inhibitors), time, and the interaction between the treatment group and time. Error bars represented a 95% confidence interval

Furthermore, SGLT2 inhibitors users had a greater decline in γ-GTP, AST, ALT, BMI, waist circumference, and HbA1c at 1-year measurement than that of DPP4 inhibitors users (predicted difference [95% CI]; − 6.2 [95% CI − 8.4 to − 4.0] U/L; − 3.1 [95% CI − 3.8 to − 2.5] U/L; − 5.1 [95% CI − 6.1 to − 4.0] U/L; − 0.6 [95% CI − 0.7 to − 0.4] kg/m2; − 1.3 [95% CI − 1.7 to − 0.9] cm; and − 0.1% [95% CI − 0.2 to − 0.1], respectively). We detected a significant interaction between the treatment group (SGLT2 or DPP4 inhibitors) and time on γ-GTP, AST, ALT, BMI, waist circumference, and HbA1c (all P < 0.001).

Change in FLI among individual SGLT2 inhibitors

We compared the change in FLI between individual SGLT2 inhibitors because of the significant difference in FLI change among SGLT2 and DPP4 inhibitors. We analyzed 6,535 SGLT2 inhibitor users after excluding individuals prescribed multiple SGLT2 inhibitors (n = 12) from 6547 SGLT2 inhibitor users. SGLT2 inhibitor users were categorized into six groups: empagliflozin (n = 1593), dapagliflozin (n = 1389), canagliflozin (n = 1137), ipragliflozin (n = 955), tofogliflozin (n = 703), and luseogliflozin (n = 758). Figure 2 shows the change in FLI after the initiation of SGLT2 inhibitors. Each SGLT2 inhibitor user showed a similar reduction in FLI. The differences in FLI at 1-year measurement were 0.7 (95% CI − 1.0 to 2.4) for dapagliflozin, 0.3 (95% CI − 1.5 to 2.2) for canagliflozin, 0.8 (95% CI − 1.2 to 2.7) for ipragliflozin, − 1.1 (95% CI − 3.2 to 1.1) for tofogliflozin, and 0.9 (95% CI − 1.2 to 2.9) for luseogliflozin compared with empagliflozin. We detected no statistically significant interaction between individual SGLT2 inhibitors and time on the FLI (P = 0.2122).

Fig. 2
figure 2

Comparison of the Change in Fatty Liver Index among SGLT2 Inhibitors. We performed a linear mixed-effects model to compare the change in fatty liver index among individual SGLT2 inhibitors. The model included individual SGLT2 inhibitors, time, the interaction between the individual SGLT2 inhibitors and time, age, sex, systolic blood pressure, diastolic blood pressure, fasting plasma glucose, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, triglycerides, cigarette smoking, alcohol consumption, physical inactivity, diabetic nephropathy, diabetic retinopathy, diabetic neuropathy, use of the following medications (insulin, glucagon-like peptide-1 receptor agonist, biguanide, sulfonylurea, α-glucosidase inhibitor, thiazolidine, glinide, renin-angiotensin system inhibitor, β-blocker, calcium channel blocker, mineralocorticoid receptor antagonist, diuretics, and statin), year at the prescription of SGLT2 inhibitors. Error bars represented a 95% confidence interval

Sensitivity analyses

First, we analyzed 5783 SGLT2 inhibitor users and 5783 DPP4 inhibitor users who continued to use SGLT2 or DPP4 inhibitors for > 3 months. The main findings remained unchanged in this population (Supplementary Fig. 3). Second, we confirmed the robustness of the shape of the change in the FLI using the cubic spline function (Supplementary Fig. 4). Third, the results in terms of changes in FLI among SGLT2 and DPP4 inhibitors were generally consistent across subgroups stratified by age, sex, BMI, and HbA1c (Fig. 3).

Fig. 3
figure 3

Subgroup Analysis. We performed a linear mixed-effects model to compare the change in fatty liver index among SGLT2 inhibitors and DPP4 inhibitors stratified by age, sex, body mass index, and hemoglobin A1c. The model included treatment group (SGLT2 inhibitors or DPP4 inhibitors), time, and the interaction between the treatment group and time. Error bars represented a 95% confidence interval

Discussion

In the present study, using a large-scale health checkup and claims database including approximately 20,000 patients with MAFLD and DM, we compared the change in FLI between SGLT2 and DPP4 inhibitors after propensity score matching. Administration of SGLT2 inhibitors was associated with improved FLI and liver enzymes (γ-GTP, AST, and ALT) and decreased BMI and waist circumference. Various sensitivity analyses have shown consistent results. These findings were observed across all subgroups stratified by age, sex, BMI, and HbA1c level at baseline. The association of SGLT2 inhibitors with amelioration of FLI was comparable between individual SGLT2 inhibitors, suggesting a potential class effect of SGLT2 inhibitors for DM and MAFLD. To the best of our knowledge, this is the first study to show the possible advantage of SGLT2 inhibitors used for MAFLD using a large-scale real-world dataset.

Several clinical trials have validated the effect of SGLT2 inhibitor administration on FLD. A prospective observational study included 21 patients with type 2 diabetes, and the use of ipragliflozin for 16 weeks was associated with a decrease in FLI, HbA1c, body weight, and visceral adipose tissue [24]. In the Liraglutide Effect and Action in Diabetes trial, a prospective, single-arm trial, 40 patients with type 2 diabetes and NAFLD were treated using luseogliflozin for 6 months. Treatment with luseogliflozin was associated with decreased HbA1c and transaminase activity, as well as improvements in hepatic fat content. Serum ferritin levels were reduced and serum albumin increased after treatment with luseogliflozin [25]. Further, a prospective randomized controlled pilot study included 32 patients with type 2 diabetes and NAFLD and randomly assigned study participants to receive either luseogliflozin or metformin. The changes in the liver-to-spleen attenuation ratio, changes in the visceral fat area, HbA1c, and BMI after 6 months were significantly greater in the luseogliflozin group than that in the metformin group [18]. The Effect of Empagliflozin on Liver Fat Content in Patients With Type 2 diabetes trial was an investigator-initiated, prospective, open-label, randomized clinical study and randomly assigned 50 patients to the empagliflozin (standard treatment for type 2 diabetes plus empagliflozin) or control groups (standard treatment for type 2 diabetes alone) for 5 months. The reduction in liver fat assessed using magnetic resonance imaging-derived proton density fat fraction was significantly greater in the empagliflozin group. Furthermore, the empagliflozin group showed a significant decrease in serum ALT levels [26]. A randomized, 48-week, open-label, active-controlled trial randomly assigned 40 patients with biopsy-confirmed NAFLD and type 2 diabetes to receive tofogliflozin or glimepiride. The fibrosis score was improved in the tofogliflozin-treated group. The histological variables of steatosis, hepatocellular ballooning, and lobular inflammation improved in the tofogliflozin group, whereas only hepatocellular ballooning improved in the glimepiride group [15].

Although the present study is in agreement with previous studies in that we demonstrated the potential benefit of SGLT2 inhibitors for FLD, our study is distinguishable from previous studies in the following points and has clinical implications. We analyzed approximately 20,000 patients with DM and MAFLD using a large-scale epidemiologic cohort and compared approximately 6,500 well-balanced pairs of new users of SGLT2 or DPP4 inhibitors with propensity score matching. Given that clinical trials investigating the effects of SGLT2 inhibitors on FLD have been limited to a maximum of approximately 50 patients, our study is the first to examine the association between SGLT2 inhibitor administration and outcomes using a large-scale real-world dataset. Several potential pathological mechanisms for the possible benefits of SGLT2 inhibitors for MAFLD have been suggested (e.g., calorie restriction, improvement in systemic insulin resistance, and reduction in body weight). In this study, there was a greater improvement in HbA1c in the SGLT2 inhibitor group than in the DPP4 inhibitor group. The improvement in glucose tolerance in the SGLT2 inhibitor group could contribute to the greater improvement in fatty liver as well. Currently, cardiovascular and kidney protective effects have been demonstrated for SGLT2 inhibitors in patients with and without DM, and the pathological or pharmacological mechanisms underlying these effects of SGLT2 inhibitors are attracting interest. Basic or experimental studies are needed to determine the hepatoprotective effects of SGLT2 inhibitors. Furthermore, we need to clarify whether SGLT2 inhibitors could provide clinical benefits for patients with MAFLD without DM. On the other hand, the differences between the SGLT2 and DPP4 inhibitor groups were seemingly attenuated over time. Changes in medication (e.g., addition of SGLT2 inhibitors to the DPP4 inhibitor group) and lifestyle modifications during the follow-up period could have contributed to these results. Further data accumulation and investigation are needed in this regard. The large sample size of our database allows for various sensitivity analyses to confirm the robustness of our findings. In particular, it is important that the various subgroup analyses stratified by age, sex, BMI, and HbA1c level suggest a potential advantage of SGLT2 inhibitors over DPP4 inhibitors. We found that the influence of SGLT2 inhibitors on MAFLD would be similar among individual SGLT2 inhibitors, suggesting a potential “class effect” of SGLT2 inhibitors, which is consistent with our previous studies [21, 27]. As it may not be feasible to conduct a randomized controlled trial in this point of view, we believe that our study could have provided intriguing clinical data.

We acknowledge the study limitations mainly due to the use of the JMDC Claims Database, which we previously discussed [21, 27]. Because of the observational and retrospective nature of the present study, and despite robust statistical procedures, including propensity score matching and a multitude of sensitivity analyses, the possibility of unmeasured residual confounding could not be eliminated. For instance, although socioeconomic status or the duration of DM could have affected clinical outcomes, the JMDC Claims Database did not include these data. Because most of the people registered in our dataset are employees (or their family members) who work for relatively large companies in Japan, the socioeconomic status of the study participants would not be significantly different. However, the absence of these data must be considered a potential limitation of our study. It is unknown whether our findings may be applied to the older population because the JMDC Claims Database does not contain those over 75 years of age. We did not take into account the dosage of each medication. We evaluated the effects of SGLT2 inhibitors on liver enzyme levels, BMI, and waist circumference. However, it should be considered that liver enzymes alone may not reflect the liver histological responses. There is a possibility of treatment changes during the clinical course following the initiation of SGLT2 or DPP4 inhibitors, and this should be considered as a potential factor that could have influenced the results of the present study.

Conclusion

In real-world clinical practice, patients with MAFLD and DM who were newly prescribed SGLT2 inhibitors showed significantly better improvement in liver enzymes, reduction in BMI and waist circumference, and a decrease in FLI than those who were newly prescribed DPP4 inhibitors. Our findings were consistent irrespective of age, sex, BMI, and HbA1c level at baseline. This study, using a large-scale epidemiological cohort, complements the results of previous randomized clinical trials and confirms the potential liver-protective benefits of SGLT2 inhibitors in patients with MAFLD and DM.