Background

Metabolic dysfunction-associated fatty liver disease (MAFLD) is a new conceptual definition of fatty liver disease that is based on metabolic abnormalities (regardless of the etiology of chronic liver disease) that emphasizes the role of metabolic dysfunction in the clinical outcomes of patients with hepatic steatosis [1, 2]. MAFLD is defined according to the presence of hepatic steatosis along with one or more of the following criteria: (i) overweight or obesity, (ii) diabetes mellitus (DM), or (iii) ≥ 2 metabolic abnormalities [3]. Recent studies have investigated the predictive role of MAFLD in cardiovascular disease (CVD) [4, 5] and all-cause mortality [6,7,8].

However, MAFLD includes a heterogeneous group of patients, and the following subgroups should be recognized: lean and non-lean patients, alcoholic and nonalcoholic patients, and patients with isolated hepatic steatosis (vs. those with underlying liver disease) [9]. A previous study reported that MAFLDs are heterogeneous in terms of mortality outcomes and cardiovascular risk and that these outcomes vary according to the accompanying metabolic dysfunctions [10]. A previous study demonstrated that MAFLD better identifies cardiovascular risk than nonalcoholic fatty liver disease (NAFLD), which was attributed to the inclusion criteria of metabolic dysfunctions [11]. Moreover, a statistically significant difference in the degree of liver fibrosis has been reported among MAFLD subgroups, suggesting differential risks of advanced liver disease across MAFLD subgroups [12].

Since overweight/obesity is typically defined according to body mass index (BMI), it does not always entail metabolic abnormalities [13]. As MAFLD is a spectrum encompassing diverse disease groups defined according to metabolic abnormalities and overweight/obesity, there may be heterogeneity in clinical outcomes according to disease subcategorizations. Therefore, we aimed to investigate the differential risk of all-cause and disease-specific mortality according to MAFLD subgroups divided by metabolic risk factors and overweight/obesity status within a nationally representative Korean study population.

Methods

Data source

The present study obtained data from the Korean National Health Insurance System (NHIS). This is a national insurer managed by the Korean government, and approximately 97% of the Korean population subscribe to it [14]. The NHIS database contains health records, including sociodemographic data (age, sex, income level), anthropometric measurements, laboratory tests (e.g., lipid profiles, blood glucose levels), lifestyle behaviors (smoking, alcohol consumption, regular exercise), medical diagnoses (based on the 10th revision of the International Classification of Diseases [ICD-10]), and treatment data [15]. This database has been widely used for conducting previous epidemiologic studies [16, 17].

Study sample

This investigation included 10,585,844 adults aged ≥ 20 who underwent health screening examinations between January 1, 2009, and December 31, 2009. We ascertained outcome events after a lag of 1 year; those with outcome events within 1 year were excluded (n = 28,478). After excluding participants with incomplete information (n = 622,052), 9,935,314 subjects were analyzed.

The study protocol was approved by the Institutional Review Board of Soongsil University (SSU-202007-HR-236–01). This work conformed to the ethical guidelines delineated within the Declaration of Helsinki and its later amendments. The requirement for patient informed consent was waived as this was a retrospective study conducted exclusively using de-identified secondary data.

Measurement of hepatic steatosis

Although ultrasonography is a first-line screening technique used in clinical practice [18], it is not included in the NHIS mass screening program. Therefore, the fatty liver index (FLI), as a surrogate marker of hepatic steatosis, was used to assess hepatic steatosis. The calculation of FLI was based on the following equation, using BMI, waist circumference (WC), triglyceride, and gamma-glutamyl transferase (GGT) data [19]. The lower cut-off of FLI ≥ 30 was used in this study [4].

$$\mathrm{FLI }=\left[\left({\mathrm{e}}^{0.953 \times \mathrm{ ln\ triglyceride }+ 0.139 \times \mathrm{ BMI }+ 0.718 \times \mathrm{ ln\ GGT}\left[\mathrm{gamma}-\mathrm{glutamyl\ transferase}\right]+ 0.053 \times \mathrm{ WC}\left[\mathrm{waist\ circumference}\right]- 15.745}\right)/ (1 + {\mathrm{e}}^{0.953 \times \mathrm{ ln\ triglyceride }+ 0.139 \times \mathrm{ BMI }+ 0.718 \times \mathrm{ ln\ GGT }+ 0.053 \times \mathrm{ WC}-15.745})\right] \times 100$$

Definitions of MAFLD and subgroups

MAFLD was defined as the presence of metabolic risk factors in hepatic steatosis, not excluding other concomitant liver diseases and significant alcohol consumption, based on diagnostic criteria proposed in an international expert consensus statement in 2020 [3]. MAFLD was diagnosed as the presence of hepatic steatosis and one or more of the following criteria: (i) overweight or obesity (BMI ≥ 23 kg/m2), (ii) DM, and (iii) ≥ 2 metabolic abnormalities. The cut-off value of BMI for overweight and obesity was determined based on the criteria for the Asia–Pacific region. Metabolic abnormalities were as follows: (i) WC of ≥ 90 cm for men and ≥ 80 cm for women, (ii) blood pressure ≥ 130/85 mmHg or specific drug treatment for elevated blood pressure, (iii) fasting plasma triglycerides ≥ 150 mg/dL or specific drug treatment for elevated triglycerides, (iv) plasma high-density lipoprotein-cholesterol < 40 mg/dL for men and < 50 mg/dL for women or specific drug treatment for high cholesterol, and (v) fasting glucose ≥ 100 mg/dL. As the homeostasis model assessment of insulin resistance scores and high-sensitivity C-reactive protein levels were not available through the NHIS screening program, these criteria were not implemented in this study.

In the present study, participants identified as affected by MAFLD were classified into three groups, as described previously [12, 20]. First, we determined the diabetic MAFLD group based on the presence of DM regardless of BMI; subsequently, in subjects without DM, we classified MAFLD subgroups according to BMI: (i) overweight/obese (BMI ≥ 23 kg/m2) or (ii) lean or normal weight (BMI < 23 kg/m2). Finally, the study population was divided into four subgroups: no MAFLD, MAFLD-diabetes, MAFLD-overweight/obese, and MAFLD-lean (Fig. 1).

Fig. 1
figure 1

Flow chart of the study population enrollment. BMI, body mass index; DM, diabetes mellitus; MAFLD, metabolic-associated fatty liver disease

Study outcomes

The primary outcomes evaluated in this study were all-cause and disease-specific mortality. Information on mortality and cause of death was available for all subjects in the cohort; cause-of-death variables were classified according to the Korean Standard Classification of Diseases and Causes of Death, based on the ICD-10 (with data provided by the Korean National Statistical Office) [21].

Based on the ICD-10 diagnostic codes, the cause of death was categorized into CVD mortality (I00-I99), cancer mortality other than hepatocellular carcinoma (C00-C97, except for C22), and liver-associated mortality (K70-76, C22) [22]. The study population was followed from baseline to the date of death or until December 31, 2019, allowing for a minimum of 12 months of follow-up for each individual.

Covariates

As described previously [23], standardized self-reported questionnaires were used to collect data during enrollment. Briefly, smoking status (non-smokers, ex-smokers, current smokers), alcohol consumption (none, mild, heavy [≥ 30 g for males and ≥ 20 g for females per day]), and regular exercise were evaluated using a self-administered questionnaire. When participants exercised at a high intensity at least three times per week or at a moderate intensity at least five times per week, they were defined as undertaking regular physical exercise. Household income was dichotomized at the lowest 20%. Comorbidities were defined using ICD-10 diagnosis codes, prescription information in the year prior to the health screening, and health screening results. Criteria for hypertension were I10–13 or I15 claim codes in addition to ≥ 1 prescription for an antihypertensive agent or a systolic/diastolic blood pressure reading of ≥ 140/90 mmHg. Criteria for diabetes were E11–14 claim codes in addition to ≥ 1 prescription for an antidiabetic medication per year or a fasting glucose level of ≥ 126 mg/dL. The criteria for dyslipidemia were the E78 claim code and ≥ 1 prescription for a lipid-lowering agent or a total cholesterol level of ≥ 240 mg/dL. The Charlson Comorbidity Index (CCI) [24] was determined from claims data during the period spanning 2 years prior to baseline. K74 and B15–B19 claim codes were the criteria used to determine liver cirrhosis and viral hepatitis.

Height, weight, and WC were measured and evaluated by healthcare professionals during nationally mandated health screenings. BMI was calculated as the participant’s weight in kilograms divided by their height in meters squared. Blood specimens were obtained from each participant after an overnight fast. The estimated glomerular filtration rate (eGFR) was calculated from serum creatinine using an equation determined by the Modification of Diet in Renal Disease Study [25].

Statistical analyses

Data are presented as means ± standard deviations for normally distributed continuous variables and as proportions for categorical variables (unless otherwise indicated). Comparisons of baseline characteristics were conducted using independent t-tests and analysis of variance for continuous variables and chi-square tests for categorical variables.

All-cause and disease-specific mortality rates were calculated by dividing the number of incident cases by the total follow-up period and were presented as rates per 1000 person-years. Hazard ratios (HRs) and 95% confidence interval (CI) values for all-cause and disease-specific mortality according to MAFLD subgroups were analyzed using Cox proportional hazards models. We selected potential prognostic factors a priori based on clinical relevance and a comprehensive literature review; potential prognostic factors included age, sex, BMI, income, smoking, alcohol consumption, regular exercise, and CCI scores. Statistical analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC, USA). Two-sided P-values of < 0.05 were considered the threshold for statistical significance.

Results

Baseline characteristics

This analysis included 9,935,314 participants (median age [interquartile range] 46 [36–57] years; mean age, 47.2 years; 56.0% males). The prevalence of MAFLD was 35.8%. The prevalence of each MAFLD subtype was 5.5% (MAFLD-diabetes), 28.4% (MAFLD-overweight/obese), and 1.9% (MAFLD-lean). The baseline characteristics for the study population are shown in Table 1. Compared with the no MAFLD group, people with MAFLD were more likely to be male and current smokers and had higher alcohol consumption (P < 0.001 for all). Additionally, subjects in the MAFLD-diabetes group and the MAFLD-lean group were more likely to have hypertension and dyslipidemia than those in the no MAFLD group (P < 0.001). Most anthropometric and laboratory variables (including systolic/diastolic blood pressure, fasting glucose, total cholesterol, triglycerides, and eGFR) were less metabolically favorable in the MAFLD-diabetes and MAFLD-lean groups than in the no MAFLD group (all P < 0.001).

Table 1 Baseline characteristics of the study population

All-cause and disease-specific mortality by MAFLD subgroup

A total of 417,593 of the 9,935,314 subjects (4.2%) died during the median follow-up of 9.3 (interquartile range: 9.1, 9.6) years. Table 2 summarizes the observed associations between MAFLD subgroups and all-cause or cause-specific mortality. In the age- and sex-adjusted model, individuals in the MAFLD-diabetes group had the highest all-cause mortality among the four evaluated groups (HR: 1.43, 95% CI: 1.42–1.45). In the multivariable model adjusting for age, sex, BMI, income, smoking, alcohol consumption, exercise, and CCI score, individuals with MAFLD-diabetes had the highest increased risk of all-cause mortality (HR: 1.61, 95% CI: 1.59–1.63) followed by the MAFLD-lean and MAFLD-overweight/obese groups (HR: 1.36, 95% CI: 1.34–1.38 and HR: 1.19, 95% CI: 1.18–1.20, respectively, Table 2). When further adjusting for serum glucose, cholesterol, alanine aminotransferase, eGFR, liver cirrhosis, and hepatitis, these associations persisted. (Additional file 1: Table S1).

Table 2 All-cause and cause-specific mortality by MAFLD subgroup

Overall, cancer and CVD represented the two most common causes of mortality in the current study; there were a total of 152,212, 80,262, and 31,290 cancer-associated, CVD-associated, and liver disease-associated deaths, respectively. In the multivariate model, individuals in the MAFLD-diabetes group had the highest CVD-specific mortality (HR 1.61, 95% CI: 1.58–1.65). Individuals in the MAFLD-lean and MAFLD-overweight/obese groups also had higher CVD-specific mortality than those without MAFLD (HR 1.45, 95% CI: 1.40–1.51 and HR 1.22, 95% CI: 1.19–1.25, respectively). Cancer-specific mortality increased significantly in all MAFLD subgroups compared to those without MAFLD. Individuals in the MAFLD-diabetes group showed the highest HR (1.31, 95% CI: 1.29–1.33), followed by the MAFLD-lean and MAFLD-overweight/obese groups (HR 1.27, 95% CI: 1.23–1.30 and HR 1.11, 95% CI: 1.09–1.13, respectively). Individuals in the MAFLD-diabetes group had a 2.85-fold higher liver disease-related mortality (HR: 2.85, 95% CI: 2.75–2.95), with similar risk for those in the MAFLD-lean group (HR: 2.84, 95% CI: 2.72–2.97), followed by those in the MAFLD-overweight/obese group (HR 1.76, 95% CI: 1.70–1.82), compared to those without MAFLD.

Because FLI has different cut-offs for men and women and bears a large gray zone of undetermined presence/absence of hepatic steatosis, we conducted analysis using two different sex-specific cut-offs [26, 27]. Similar findings were observed in the increase of HR according to the MAFLD subtypes (Additional file 1: Table S2).

To minimize the effect of age difference among the MAFLD subtypes, we performed sensitivity analysis according to subjects aged 60 years or older, and the results were similar to those of the entire population (Additional file 1: Table S3 and Table S4).

Stratified analyses

We performed secondary analyses stratified by BMI. The increased risk for all-cause, cardiovascular, cancer, and liver-related mortality in the MAFLD-diabetes group was most prominent in individuals with underweight (P for interaction < 0.05). Liver-related mortality was the highest in MAFLD-lean individuals in the underweight group (HR, 5.03, 95% CI: 4.23–5.97, Table 3).

Table 3 Stratified analysis by body mass index: risk for all-cause and disease-specific mortality by MAFLD subgroup

Discussion

In this large nationwide, population-based cohort study, we calculated all-cause and disease-specific mortality in MAFLD subgroups using a database of health insurance claims in Korea. All-cause and disease-specific mortality were highest in the MAFLD-diabetes group, followed by the MAFLD-lean and MAFLD-overweight/obese groups, suggesting differential prognoses according to the MAFLD subtype. Interestingly, the liver-related mortality in the MAFLD-lean group was comparable to that in the MAFLD-diabetes group, emphasizing the poor liver-related prognosis in the MAFLD-lean group. The classification of MAFLD subtypes based on the metabolic phenotypes used in this study might help risk stratification of MAFLD patients.

Among the MAFLD subtypes, MAFLD-diabetes showed the highest increased risk for all-cause and disease-specific mortality. Diabetes is associated with a thrombotic and inflammatory condition and a higher incidence of atherosclerosis and carcinogenesis due to sustained hyperglycemia [28, 29]. As a consequence, diabetes has been recognized as a risk factor for all-cause and cause-specific mortality [30,31,32]. The results of this study showing that MAFLD-diabetes can be a strong predictor of all-cause and disease-specific mortality are in line with previous literature.

In addition to diabetes, MAFLD subcategorizations according to overweight/obesity may help elucidate the differential mortality risk among MAFLD subtypes. However, since the proportion of lean MAFLD is relatively low in real clinical practice and it is difficult to collect enough samples, the clinical and prognostic features of lean MAFLD have not been defined well in prior work. In the current study, although the prevalence of MAFLD-lean was 1.9% of the total population, the characteristics of the nationwide population study design enabled reliable analysis by securing a significant sample size.

In this study, individuals in the MAFLD-lean group were at a higher risk of all-cause and disease-specific mortality than those in the MAFLD-overweight/obese group. Similar to our results, several studies have reported that the cumulative CVD incidence rate was higher in lean MAFLD than in overweight/obese MAFLD [4, 19], and the risk of all-cause mortality was higher in those with lean MAFLD without diabetes (HR: 2.34) than in overweight/obese MAFLD subjects without diabetes (HR: 1.23) [10]. These findings are in line with poor clinical outcomes previously documented in patients with lean NAFLD [33,34,35]. A recent study reported that NAFLD subjects with normal BMI had a higher risk of all-cause mortality than those with obese NAFLD and that the major causes of death in this subgroup were cancer and CVD [36]. Similarly, people with lean NAFLD show a comparable risk of advanced liver disease, metabolic comorbidities, CVD, and liver-associated mortality to those with non-lean NAFLD without weight gain during follow-up and independent of genotype [37].

As lean NAFLD represents a unique subset related to the interaction of genetics, underlying medical conditions, and environmental exposures [38], the mechanism-related poor prognosis in lean MAFLD is not completely elucidated. Although lean and obese NAFLD share common altered metabolic and cardiovascular profiles, NAFLD patients having normal body weight show excess abdominal adipose tissue and metabolic abnormalities [39]. Recent studies reported that the progression of lean NAFLD is affected by multiple epigenetic mechanisms such as modification of cytosine and histones and changes in nucleosome localization [40], and circulating histones may discriminate the severity of steatosis in lean MAFLD patients [41]. In addition, a previous study reported that lean MAFLD subjects had lower fatty tissue index, lean tissue index, and total body water than did non-lean MAFLD subjects, which means that lean MAFLD subjects had less adiposity and less muscle mass compared to obese MAFLD subjects [42]. These results suggest that sarcopenia, which is the loss of muscle mass and function with aging, is a possible underlying reason for poor prognosis in people with lean MAFLD.

Because MAFLD is closely related to metabolic diseases, various pharmacological approaches that have been attempted to manage NAFLD’s metabolic traits, such as anti-obesity, antidiabetic, antioxidants, and cytoprotective agents, may also be effective in MAFLD. Emerging data from various clinical trials with drugs targeting diverse molecular mechanisms show promising results [43], and a recent phase II trial reported that semaglutide might be an attractive potential therapeutic option for NASH [44]. Future potential treatments for NAFLD include agents that act through peroxisomal proliferator-activated receptors, glucagon-like peptide-1 receptor agonists, sodium-glucose cotransporter 2 inhibitors, or farnesoid X receptor agonists, and several investigating agents have entered phase III clinical trials [45].

Overweight/obesity increases the risk for CVD, cancer, and liver-associated adverse outcomes. However, subjects in the MAFLD-overweight/obese group showed better prognoses than did those in the MAFLD-lean group in the present study. This phenomenon may be associated with the obesity paradox (i.e., elevated BMI may improve survival in individuals with CVD, primarily those with congestive heart failure [46, 47], cancer [48, 49], and type 2 DM [50]). Potential explanations for the obesity paradox include reverse causation, the inability to distinguish fat tissue from muscle mass using BMI, and even the possibility that adipose tissue may provide some level of protection. Although the evidence for an obesity paradox is mixed for individuals with a BMI between 22 and 24.9 kg/m2 [51, 52], the findings of increased mortality in individuals with lower BMI values are indisputable.

Our study presents the substantial strengths of a large sample size, a population-based design, and the categorization of MAFLD considering diabetes and metabolic risk factors in addition to BMI. This study provides new insights into the understanding of mortality-related outcomes of MAFLD, which can be used for risk stratification strategies in patients with MAFLD. However, we also acknowledge some limitations of the present work. First, although histological or radiological methods are more accurate for detecting hepatic steatosis than FLI, we used FLI as a surrogate marker of fatty liver and, thus, cannot accurately quantify the presence and severity of steatosis [53]. In addition, FLI has different cut-offs for men and women and bears a large gray zone of undetermined presence/absence of hepatic steatosis [26, 27]. However, similar results were observed when we used two different sex-specific cut-off values of FLI. Moreover, the use of FLI is practical for screening the general population in epidemiologic studies [10, 54], and FLI is regarded as an acceptable surrogate for diagnosing hepatic steatosis in MAFLD [55]. Second, our study cannot establish a causal relationship because of its population-based observational design. Third, there might be residual confounding factors such as family history or occupational exposures related to cancer development since the NHIS does not collect these data. Fourth, although most studies used BMI to define lean NAFLD/MAFLD [38], total body fat quantity (assessed based on BMI) does not reflect regional body fat distribution. Because visceral adiposity can be a target for interventions in the treatment of MAFLD, future studies need to consider visceral adiposity. Finally, it is possible that the evaluated conditions were under- or overestimated because the diagnoses were based on claims data (ICD-10 codes). However, the definitions we used in this study were validated in several previous studies [21, 22]. Further replicative research is warranted to validate and elucidate our results’ underlying mechanisms.

Conclusions

Lean or diabetic MAFLD showed a poor prognosis for all-cause and disease-specific mortality. Classifying MAFLD subgroups based on metabolic phenotypes that account for integrated metabolic disorders might help risk stratification of patients with MAFLD.