Background

Acute fatty liver of pregnancy (AFLP) is an uncommon but potentially life-threatening perinatal complication [1]. Even with modern treatment, mortality of AFLP can still reach 12-15% [2, 3]. Moreover, risk factors for AFLP-related mortality are not well established. Up to now, lacking of prognostic factors or models for AFLP renders prediction of clinical outcomes difficult. We aimed to explore prognostic factors and develop a more accurate model to predict mortality of AFLP.

So far, there are four scoring systems designed for hepatic failure as potential models to predict outcomes of AFLP. Model for end-stage liver disease (MELD) consisting of serum bilirubin and creatinine levels, international normalized ratio (INR) for prothrombin time, and etiology of liver disease was validated in patients stratified according to different causes of liver disease, such as acute liver failure and liver cirrhosis of various causes [4, 5]. In recent years, some scholars have also used it to evaluate prognosis of liver disease during pregnancy [6, 7]. Subsequently, serum sodium levels were incorporated into MELD to form the MELD-Na model, which provides more accurate survival predictions than MELD alone [8, 9]. Similarly, MELD incorporating with serum sodium levels and patients’ age (integrated MELD, iMELD) also performed better than original MELD in predicting 3-, 6- and 12-month mortality in patients with cirrhosis [10].

Murali et al. developed a model to predict 1-month mortality in patients with pregnancy-specific liver disease (PSLD) [11]. PSLDs mainly include preeclampsia, AFLP and haemolysis, elevated liver enzymes and low platelet count (HELLP) syndrome. And the area under the receiver operating characteristic curve (AUC) of this new model to predict 1-month mortality of PSLDs is 0.86. However, we found that this new PSLD model did not accurately predict outcomes of AFLP patients in our hospital. Moreover, the three PSLDs listed above differed not only morbidity and mortality but also pathogenesis, clinical manifestations and risk factors for death [2, 12]. Therefore, a generalised model cannot include relevant high-risk factors for all PSLDs.

In addition to the above scoring systems for end-stage liver disease, several general scoring systems are used to determine prognosis of severely ill patients, such as Sequential Organ Failure Assessment (SOFA) and Acute Physiology and Chronic Health Evaluation II (APACHE II) scoring systems [13, 14]. However, the prognostic efficiency of these scoring systems is diminished when they are applied to specific subpopulation of severely ill patients. Thus, a model based on a general scoring system may not be as accurate as one based on a specific scoring system.

Hence, we aim to search for relevant information to explore factors related to AFLP outcomes and to develop a new prognostic model to accurately predict mortality of AFLP. The new prognostic model will be based on the clinicopathological features of AFLP. We also intend to compare the predictive values of all five models (the new model, MELD, MELD-Na, iMELD and PSLD model) to determine which one best predicted clinical outcome.

Methods

We performed a retrospective study of 99 consecutive patients who were suspected to have AFLP admitted to Xiangya Hospital and the Second Xiangya Hospital, Central South University, Hunan, China between November 2009 and September 2019. Throughout the period covered by this report, AFLP patients were treated in relatively uniform fashion by residents who were supervised by obstetrics faculty and fellows. Data of all patients were entered into a registry and retrieved for the study with approval of the two medical centers. All patients were followed up for 3 months. This work has been carried out in accordance with the Declaration of Helsinki (2000) of the World Medical Association. The study was approved by the Ethics Committee of Xiangya Hospital Central South University. All participants provided written informed consent.

Diagnosis of AFLP was made on the basis of both clinical and laboratory criteria and Swansea criteria [15, 16]. Clinical and laboratory criteria were as follows: (i) patients with symptoms of anorexia, fatigue, nausea, vomiting, jaundice and abnormal liver function in the third trimester of pregnancy; (ii) characteristic laboratory examination; (iii) ultrasound imaging showing fatty liver; (iv) viral hepatitis, pharmaceutic hepatitis, toxic hepatic and other hepatic diseases associated with pregnancy were excluded by clinical feature and laboratory examination; and (v) liver biopsy in accordance with pathologic changes. While the Swansea criteria were as follows. Patients with six or more of the following features in the absence of any other explanation for their symptoms were diagnosed with AFLP: Vomiting; Abdominal pain; Polydipsia/polyuria; Encephalopathy; Elevated bilirubin (>14µmol/L); Hypoglycaemia (<4mmol/L); Elevated urate (>340µmol/L); Leucocytosis (>11 × 109/L); Ascites or bright liver on ultrasound scan (USS); Elevated transaminases (Alanine Transaminase/Aspartate Aminotransferase>42IU/L); Elevated ammonia (>47µmol/L); Renal impairment (Creatinine>150umol/L); Coagulopathy (prothrombin time>14s or activated partial thromboplastin time>34s); Microvesicular steatosis on liver biopsy.

We analysed clinical and laboratory variables from admission to one week after delivery to identify predictive factors of mortality in AFLP patients. Clinical variables that closely related to outcome of AFLP were maternal age, gestational age, referral situation, parity, delivery mode, twin pregnancy, interval time from AFLP attack to pregnancy termination, serious complications and artificial liver support (ALS) treatment or not. The indication for ALS treatment of our study was for patients with severe AFLP. Patients with any of the following features were identified as severe AFLP: total bilirubin (TBIL)>171µmol/L; prothrombin time activity (PTA) < 40%; hepatorenal syndrome; and hepatic encephalopathy. ALS treatment was promptly administered after pregnancy termination, subject to the consent of both the patients and their families if deemed appropriate. Laboratory variables that indicated AFLP were INR, white blood cell count (WBC), haemoglobin, haematocrit (HCT), platelet counts (PLT), TBIL, alanine aminotransferase (ALT), Aspartate Aminotransferase (AST), serum creatinine (Scr), blood glucose, blood urea nitrogen, blood ammonia, blood urate, lactate, serum sodium (Na) and serum calcium (Ca). Univariate and multivariate cox regression analyses were used to identify predictive factors of AFLP related mortality. Due to the small sample size of outcome events and the numerous prognostic factors associated with AFLP in this study, we initially conducted a univariate Cox regression analysis to eliminate irrelevant independent variables, followed by a multivariate Cox regression analysis to ensure the robustness of our findings.

MELD, MELD-Na, iMELD and PSLD scores were calculated as follows:

MELD score [4] = 9.57 × Loge (creatinine [mg/dL]) + 3.78 × Loge (bilirubin [mg/dL]) + 11.20 × Loge (INR) + 6.43× (etiology: 0 if cholestatic or alcoholic, 1 otherwise).

MELD-Na score [8] = MELD – Na [mmol/l] − [0.025 × MELD × (140 − Na [mmol/l]) + 140.

iMELD score [10] = MELD + (0.3 × age [years]) − (0.7 × Na [mmol/l]) + 100.

PSLD score [11] = (1.17 × total bilirubin [mg/dl]) + (2.09 × INR).

The area under the receiver operating characteristic curve (AUC) was used to compare prognostic efficiency of different models.

All statistical analyses were conducted using the SPSS 18.0 (IBM, Armonk, NY, USA) statistical software program. Continuous variables were expressed as means and standard deviations, tested for normal distribution, assessed with t-tests and variables not normally distributed were assessed by the Wilcoxon’s rank-sum test. Non continuous variables were presented as median and range. Categorical variables were assessed with chi-square tests. Time-covariate test was used to assess proportional hazards assumes. Univariate analysis of risk factors for death was performed using the Cox proportional hazard model, and covariates with P ≤ 0.10 were considered significant (to avoid eliminating significant variables). Variables found to be significant on univariate analysis were included in a multivariate Cox regression analysis, performed using a likelihood ratio (LR) approach, P ≤ 0.10 (two-tailed) was considered statistically significant.

Results

There were 99 consecutive patients who were highly suspected to have AFLP admitted to our centers. All patients were separately diagnosed using both the clinical and laboratory criteria and the Swansea criteria. Among them, 3 patients were excluded for incomplete data. For the remaining 96 patients, cases complicated with hepatitis B virus infection (n = 6) and HELLP syndrome (n = 2) were eliminated. And 10 cases were excluded because they did not satisfy six Swansea Criteria for diagnosis of AFLP. Thus, a total of 78 confirmed cases fulfilled both clinical and laboratory criteria and Swansea Criteria were included in our study. All cases were assessed by at least three professors.

After three months of follow-up, 11 of the 78 patients died (14.10%) in our study. Most deaths occurred due to encephalopathy, multi-organ failure, postpartum haemorrhage, disseminated intravascular coagulation or severe infection. 25 patients were transferred to our centers after delivery. The three-month mortality rate of transferred patients was 16% (4/25). Of the 53 patients admitted to our centers before delivery, 49 cases terminated gestation within 24 h after admission. Mean maternal age of the 78 cases was 27 years (range 19–41 years) and mean gestational age at delivery was 35.55 weeks (range 28–42 weeks). The perinatal mortality of the 78 cases was 17.95% (14/78).

Clinical and laboratory data obtained from admission to one week after delivery were used for analysis. First, we put laboratory data obtained before delivery into univariate analysis, differences in laboratory data between the survivors and non-survivors have been summarised in Table 1. Univariate analysis showed that mortality was significantly related to HCT and lactate.

Table 1 Univariate analysis of laboratory variables influencing maternal mortality on admission

To further detect prognostic factors for outcome and explore patients’ reaction to delivery and treatment, we put the data obtained at 72 h postpartum into univariate analysis. Differences in clinical and laboratory data between the survivors and non-survivors have been summarised in Table 2. All variables in cox regression analysis were confirmed to proportional hazards assume. Univariate analysis showed that mortality was significantly related to INR (P = 0.014), TBIL (P = 0.035), Scr (P = 0.018), lactate (P = 0.035), PLT (P = 0.024) and hepatic encephalopathy (P = 0.035). Then we put INR, TBIL, Scr, PLT and hepatic encephalopathy into multivariate Cox regression analysis. Factors such as postpartum haemorrhage, ALS, twin pregnancy, parity, gestational age, perinatal death, which were not detected in the univariate analysis but thought to be important in clinical, were also included in multivariate analysis. INR (HR = 3.446; 95% CI, 1.324–8.970, P = 0.011), TBIL (HR = 1.005; 95% CI, 1.000-1.010, P = 0.064), Scr (HR = 1.007; 95% CI, 1.001–1.013, P = 0.017), PLT (HR = 0.964; 95% CI, 0.931–0.997, P = 0.035) and ALS (HR = 0.123; 95% CI, 0.012–1.254, P = 0.077) were revealed as independent prognostic factors for poor AFLP outcomes. (Table 2) In our study, a total of 55 patients were identified as severe cases, out of which 16 patients were selected for ALS treatment while the remaining patients underwent conservative treatment. During both univariate and multivariate analysis, ALS was exclusively analyzed within these 55 patients. Among the 16 cases who received ALS treatment, 6 underwent plasma exchange, 3 received hemodialysis and continuous renal replacement therapy, another 3 underwent molecular adsorbent recirculating system (MARS) treatment, and finally, 4 cases received a combination of plasma exchange and MARS. Unfortunately, among the group that underwent ALS treatment, there were two patient fatalities.

Table 2 Univariate and multivariate analysis of factors influencing maternal mortality at 72 h postpartum

The cox regression model was h (t) = h0(t) exp (1.237×INR + 0.005×TBIL + 0.007×Scr-0.037×PLT-2.096×ALS) and fitted well with likelihood χ2 = 21.0, P = 0.001. The prognosis index (PI) for AFLP (PI-AFLP) was calculated as follows: PI = 1.237×INR + 0.005×TBIL + 0.007×Scr-0.037×PLT-2.096×(ALS: 0 without ALS treatment, 1 with ALS treatment). Patients were in high risk of death with a high PI score. Predictive accuracy of PI-AFLP, as measured using the receiver operating characteristic (ROC) curve, was 0.874. A cut-off PI-AFLP score of 2.98 yielded 72.7% sensitivity and 89.1% specificity. Inconsistency rate of PI-AFLP was 11.54% (9/78).

Then, to further study the efficacy and accuracy of the new prognostic model, we compared PI-AFLP model with MELD, MELD-Na, iMELD and PSLD models in predicting outcomes of AFLP, using ROC curve. The AUCs of the PI-AFLP, MELD, MELD-Na, iMELD and PSLD models were 0.874,0.781,0.774,0.744 and 0.643, respectively (Table 3; Fig. 1), and the inconsistency rate were 11.54%, 26.92%, 26.92%, 33.33%, 30.77%, respectively (Table 3).

Table 3 Comparison of accuracy and inconsistency of 5 models in predicting prognosis of patients with AFLP
Fig. 1
figure 1

Comparison of accuracy of five models to predict three-month mortality of AFLP. Note: The area under the receiver operating characteristic curve (AUC) was showed on the receiver operating characteristic (ROC) curve. The PI-AFLP (red) had the largest AUC, followed by the MELD (green), MELD-Na (blue), iMELD (purple) and M-PSLD (yellow)

Finally, we depicted recovery of INR, TBIL, Scr and PLT from delivery to one week after delivery in survivors and non-survivors. In survivors, INR began to decline 3–4 days after delivery, TBIL started to decrease 4–5 days after delivery, Scr started to decrease 2–3 days after delivery, and PLT began to recover 5–6 days after delivery. While in non-survivors, INR, TBIL, Scr and PLT delayed restoration or non-restoration.

Discussion

We adopted both clinical and laboratory criteria and Swansea criteria to confirm diagnosis of cases included in this study. Meanwhile, all cases were reviewed by at least three professors to determine the diagnosis. The clinical and laboratory criteria, proposed by Ze-yi CAO [15], was widely regarded as an objective and sensitive method for diagnosis of AFLP in China. The Swansea criteria, proposed by Ch’ng CL [16], had been validated as a good screening tool with positive and negative predictive value of 85% and 100% for hepatic microvesicular steatosis tested by liver biopsy [17]. Since almost 90% of the AFLP patients were associated with coagulation dysfunction and liver biopsy was contraindicated, Swansea criteria combined with clinical assessment was the most common and practical method for diagnosis of AFLP. In addition to this, there is one more thing to note that confirmation by known mutations like fetal homozygous mutation (1528G > C) in the gene that encodes for mitochondrial long-chain hydroxy acyl-CoA dehydrogenase (LCHAD) can also confirm the diagnosis in part of the patients, although often no known mutation is found [18].

Previous studies on the prognosis of AFLP have mostly analysed prenatal laboratory data [19, 20], and there are rare studies about correlation between postnatal laboratory indicators and AFLP prognosis. This paper also planned to study antenatal laboratory index on the prognosis of AFLP at the beginning, and results showed that HCT and lactate levels were associated with poor prognosis. However, the results differed greatly from previous studies, and we reflected on that this may be due to the prenatal laboratory data tested by different laboratories and laboratory standards of 25 patients who were transferred to our hospitals after delivered from local hospitals. Because the reference and test method were not consistent, bias may be produced. Therefore, we further studied the prognosis of AFLP using laboratory data tested 72 h after delivery in the same laboratory method to check the results of the study. Interestingly, we found that INR, TBIL, Scr, PLT, lactate and hepatic encephalopathy were highly related to mortality of AFLP. INR, TBIL and Scr had been revealed to be prognostic factors for both end stage liver disease and PSLD [4,5,6,7,8,9,10,11], and also appear to be useful in AFLP. Thrombocytopenia and Scr were found to be independent risk factors for postpartum complications occurred in pre-eclampsia and HELLP syndrome such as infection, disseminated intravascular coagulation, acute renal failure and needing for blood transfusion [21]. These complications can be also occurred in AFLP patients. Studies have shown that lactate was an independent risk factor for poor prognosis in patients with liver failure and cirrhosis [22, 23]. Lactate combined with hepatic encephalopathy had been proved to have a 90% sensitivity to predict which patients with acute liver failure in pregnancy are likely to die or require transplantation [24]. Regrettably, a further study about lactate and prognosis was limited as lactate was not routinely monitored in AFLP patients in our centers. Only 23 of 78 patients were monitored blood lactate level in our study.

Then, multivariate Cox regression analysis further revealed INR, TBIL, Scr, PLT and ALS as significant variables. ALS is an external mechanical, chemical or biological device to temporarily or partially replace liver function, so as to assist in the treatment of liver insufficiency, liver failure or related diseases. It was proved to be an effective method to improve outcome of AFLP [25, 26]. The commonly used artificial liver treatment methods include plasma exchange, hemofiltration, hemoperfusion, plasma perfusion, MARS, etc. Timely application of this treatment in the early phase of severe AFLP patients can effectively halt and reverse disease progression. Fitting time of ALS can protect liver cells from reducing mitochondrial damage due to oxidative stress and gain chance to improve patients’ prognosis [25]. PI-AFLP, calculated with INR, TBIL, Scr, PLT and ALS, was found to be able to accurately identify AFLP patients’ poor outcomes with an AUC of 87.4%, a sensitivity of 72.7% and a specificity of 89.1%. Our results highlighted the role of elevated INR, TBIL, Scr, and low PLT as significant factors for mortality of AFLP. However, due to the limited outcome event (death) in this study, the prediction of mortality by this model needs to be verified with a larger sample size and outcome event size (death) in a future study. In order to further determine the predictive effect of these factors on AFLP mortality, we tracked the changes of corresponding factors after delivery, and found that INR, TBIL, Scr and PLT counts delayed restoration or non-restoration in non-survivors. This further confirmed our hypothesis.

Furthermore, we compared prognostic efficiency of PI-AFLP with MELD, MELD-Na, iMELD and PSLD models in predicting AFLP outcomes. PI-AFLP has the highest efficiency for predicting mortality of AFLP. That’s because this PI was developed specifically to predict mortality among AFLP patients. In contrast, MELD, MELD-Na and iMELD were developed to predict mortality in end-stage liver disease and didn’t consider the influence of pregnancy. PSLD model was developed for multiple liver diseases during pregnancy and may not be pertinent to AFLP.

We also assessed inconsistencies between predicted and actual outcome for PI-AFLP model. The inconsistency rate was 11.54% (9/78). Among them, 7 cases were wrongly judged to be dead, and 2 cases were wrongly judged to be alive. It showed that the new prediction model was biased towards heavy judgment, which was helpful for early identification of critical patients and taking timely intervention to improve patients’ prognosis.

However, our study has some limitations. First, some patients had different clinical conditions. 25 patients were transferred to our hospitals after delivery, treatment among different rank of hospitals may result in different intervene. Second, because of the retrospective design, we could have missed some data. This limitation may be balanced by relatively large cases. Third, small sample size and limited event (death) may lead to instability of the prediction model. Future studies should consider increasing large numbers of cases or conducting multi-center studies to address this limitation. What’s more, external validation of the model is required in patients around the world before generalisability can be assumed.

Conclusions

In summary, our study demonstrates that INR, TBIL, Scr and PLT at 72 h postpartum are significant variables in predicting mortality of AFLP. ALS is an effective measure to improve severe patients’ prognosis. PI-AFLP calculated by TBIL, INR, Scr, PLT and ALS was a sensitive and specific model to predict mortality of AFLP. Further prospective studies across all geographical regions are needed to develop a more accurate and unique model for prognosis of AFLP.