Introduction

Liver test abnormalities (LTAs) are frequently observed in patients with inflammatory bowel disease (IBD), affecting between 3 and 50% of patients [1, 2]. In IBD patients, LTAs are most frequently ascribed to the use of medication or the presence of hepatic steatosis [3]. For instance, Gisbert et al. followed 786 IBD patients for 5 years and found LTA in 15% of patients, of which 43% were ascribed to thiopurines and hepatic steatosis was considered the cause in 41% of patients [3].

Whether liver test abnormalities have any clinical significance has recently been studied. Morsy and Hasanain found a relation between clinical and endoscopic indices of severity and levels of alkaline phosphatase (AP), alanine aminotransferase (ALT), and aspartate aminotransferase (AST) [15]. Mendes et al. demonstrated that inflammatory bowel disease patients with any LTA had a worse survival compared to patients without LTA [2]. A similar relation has been observed in the general population [4]. However, whether the presence of LTA has any prognostic value with regard to disease behaviour in patients with Crohn’s disease (CD) has not been studied. Recently, a study in patients with newly diagnosed coeliac disease demonstrated a relationship between the severity of mucosal inflammation and the presence of liver test abnormalities (AST, ALT, and gamma-glutamyl transpeptidase (GGT)) [5]. Although the pathogenesis of inflammation in coeliac disease has a different pathogenesis form of the inflammation observed in CD, there are also similarities, such as an increased bowel permeability. In CD, prognosis is related to the severity of mucosa inflammation [6]. Therefore, we hypothesized that in newly diagnosed CD, the presence of LTA has a prognostic value with regard to the development of complicated disease behaviour. We therefore retrospectively studied the relationship of LTA with complicated disease behaviour in a cohort of all CD patients newly diagnosed between 2004 and 2011 in one tertiary referral hospital and two large teaching hospitals.

Materials and methods

Study population

A retrospective study was performed in CD patients newly diagnosed between 1 January 2002 and 31 December 2011 at the Leiden University Medical Centre (Leiden), a tertiary IBD referral centre, and the Haga Hospital (The Hague) and the Haaglanden Medical Centre (The Hague), two large teaching hospitals. Patient charts were obtained from an electronic patient database, using (CD) diagnosis code “601”.

In order to be included, patients had to be newly diagnosed with CD between 1 January 2002 and 31 December 2011, the diagnoses must have been made in one of the three hospitals, and laboratory evaluation including ALT, AST, AP, GGT, and C-reactive protein (CRP) had to be available for analysis.

Patient characteristics such as demographics (age, gender, weight, height), medical history, medication, intoxications, and medical examinations (laboratory evaluation, ultrasound, viral evaluation, auto-immune evaluation) were retrieved from the medical charts. Data on the disease course were collected from medical charts. These included the use of steroids, 5-ASA, immunosuppressive treatment, anti-TNF, hospitalization, complications, surgery, liver and biliary diseases, and mortality during the disease course of CD. CD was classified using the Montreal classification [7].

Timing of lab tests

Although in most cases multiple laboratory test results were available, only the results of the first visit to the outpatient clinic or emergency department were used. As these lab tests were taken prior to the IBD diagnosis, or coincided with the moment of diagnosis, these results were all obtained prior to the initiation of any IBD therapy.

Definitions

The diagnosis of Crohn’s disease was made on the basis of standard clinical, endoscopic, radiological, and pathological criteria [8]. The relevant liver tests were AP, GGT, AST, and ALT. Liver test abnormalities were defined as an elevation of any of these liver tests above the upper limit of normal. During the study period, different upper limits of normal were used for the different liver tests at different moments in the participating hospitals. We therefore expressed all liver test values in this paper as the quotient of the test result divided by the upper limit of normal at that moment. An R value was calculated to assess whether liver test abnormalities were predominantly cholestatic, hepatocellular, or mixed [9]. The R value was defined as the serum alanine aminotransferase/upper limit of normal (ULN) divided by the serum alkaline phosphatase/ULN. The value R ≥ 5 is labelled as hepatocellular, R < 2 is labelled as cholestatic, and 2 < R < 5 is labelled as “mixed”.

Study outcome

The primary endpoint of this study was complicated disease behaviour, defined as the occurrence of a stenosis, abscess, fistula, or perforation, during the first 5 years of follow-up after diagnosis. Other endpoints were hospitalizations and surgery. In patients with complicated disease behaviour at diagnosis (B2 and/or B3, according to the Montreal classification), follow-up ended at the next occurrence of a complication. Hospitalization was defined as admission to the hospital specifically due to an exacerbation and/or complaints of CD. Surgery was defined as the need for surgical intervention due to complications or severe complaints of CD.

Statistical methods

Data were analysed using SPSS 20.0. To compare baseline patient characteristics of the groups with and without LTA at diagnosis, a chi-square test was used for categorical variables, whereas an independent samples t test was used for continuous variables. If the p value was ≤0.05, the results were considered statistically significant.

Survival analysis was performed using the Kaplan-Meier analysis with log-rank test comparing patients with and without liver test abnormalities. Other variables associated with a complicated disease course were found using univariate Cox regression analysis. Variables with p value ≤0.1 in univariate analysis were used for multivariate Cox regression analysis, in which hazard ratios (HRs) were calculated.

Results

Study population

A total of 383 patients were included in the study (Table 1). Of these patients, 62% were female. One patient with chronic liver disease (small duct Primary Sclerosing Cholangitis (PSC)) was excluded. Of the 383 patients included in this study, 131 had LTA (34.1%). Patients with LTA were older than those without LTA and had a higher CRP. Groups were similar with regard to the Montreal classification.

Table 1 Patient characteristics

When patients were divided into three groups based on their CRP, the prevalence of LTA was highest (48.6%) in the group with the highest CRP (>58 mg/L) compared to 39.6% in the 16–58 mg/L group and 22.4% in the <16 mg/L group (p < 0.0001). Patients with an elevated level of AP or GGT most often had a CRP >16 mg/L (for AP, 54/66 (p = 0.012); for GGT, 62/74 (p = 0.001)).

In general, LTAs were mild (Suppl. Table 4), and in 54% of patients with LTA, only a single liver test was elevated (Suppl. Table 5). GGT was elevated most often (Suppl. Table 6). The R ratio was calculated for all patients with LTA: in 79 patients (60%), the R ratio was <2 (cholestatic); in 39 patients (29.8%), the R ratio was between 2 and 5 (mixed); and in 3 patients (2.3%), the R ratio was >5 (hepatocellular).

Thirty-two percent of patients had any comorbid disease, and 6% had an additional auto-immune disorder; these proportions did not differ between patients with and without LTA. No difference was observed with regard to the presence of overweight, diabetes, or other features of the metabolic syndrome. There was also no relationship between the R ratio and the presence of overweight, diabetes, or other features of the metabolic syndrome.

Sixty-four percent of patients underwent abdominal ultrasound (Suppl. Table 4; 64.8% in the LTA group vs. 64.7% in the group without LTA), and the prevalence of steatosis was similar between groups. Similarly, no significant differences were found between the prevalence of biliary stones, sludge, hepatomegaly, or gallbladder polyps or adenomyomatosis. Given the mild LTA, a full analysis was not performed in all cases, but in the patients with evaluation for viral or auto-immune liver disease, this evaluation was negative (Suppl. Table 4).

Interestingly, 28 out of the 39 patients with perianal disease at diagnosis were started on anti-TNF compared to 165 out of the 344 patients without perianal disease (72 vs. 48%, p = 0.005).

Relation between the presence of LTA at diagnosis and the disease course of Crohn’s disease

In total, 27% of patients developed complicated disease behaviour during 5 years of follow-up. Twenty-three percent had surgery for CD, and 42% of patients were hospitalized during the first 5 years of follow-up after diagnosis.

In patients with LTA, 33.6% developed complicated disease behaviour, compared to 14.7% of the patients without LTA (p < 0.001; Table 1). Similarly, patients with LTA were significantly more often hospitalized. Although the number of patients undergoing surgery did not differ between groups, the type of surgery did differ between groups: more often, a small bowel resection was performed in LTA patients.

Suppl. Table 4 shows the relationship between each liver test and the risk factor for developing complicated disease behaviour, hospitalization, and surgery. Whereas GGT was elevated most often, an elevated level of AP was associated with an increased risk of complicated disease or hospitalization. In Suppl. Table 5, we show that the risk for patients with two abnormal liver tests of developing complicated disease behaviour was higher than that for patients with one abnormal test. However, the presence of more than two abnormal tests did not further increase the risk.

As shown in Fig. 1a, both complication-free survival and hospitalization-free survival were significantly longer in patients without LTA compared to patients with LTA (log rank: p < 0.0001; log rank: p = 0.005, respectively; Fig. 1b). When all patients were included, the difference for surgery-free survival was not significant (log rank: p = 0.057, Fig. 1c), but when only patients with B1 behaviour at diagnosis were analysed, patients with LTA had a shorter surgery-free survival (log rank: p = 0.017; Fig. 2).

Fig. 1
figure 1

Occurrence of complications, hospitalization, and surgery, when compared between patients with and without liver test abnormalities. a Complication-free survival; p < 0.0001 between patients with and without LTA (log rank). b Hospitalization-free survival; p = 0.005 between patients with and without LTA (log rank). c Surgery-free survival; p = 0.057 between patients with and without LTA (log rank). LTA liver test abnormalities

Fig. 2
figure 2

Need for surgery, based on the presence of complicated disease behaviour at diagnosis. a Patients without complicated disease at diagnosis; p = 0.017 between patients with and without LTA (log rank). b Patients with complicated disease at diagnosis; p = 0.56 between patients with and without LTA (log rank)

As CRP was higher in LTA patients, we evaluated the interplay between the elevated CRP and the presence of LTA (Fig. 3a–c). In patients with CRP <16 and 16–58 mg/L, the presence of LTA was associated with a higher prevalence of complicated disease behaviour compared to those without LTA (CRP < 16 mg/L, log rank: p < 0.0001; CRP 16–58 mg/L, log rank: p = 0.019; CRP > 58 mg/L, log rank: p = 0.101).

Fig. 3
figure 3

Development of complicated disease, when compared between patients with and without liver test abnormalities. a Patients with CRP <16 mg/L; p < 0.0001 between patients with and without LTA (log rank). b Patients with CRP 16–58 mg/L; p = 0.019 between patients with and without LTA (log rank). c Patients with CRP >58 mg/L; p = 0.101 between patients with and without LTA (log rank)

Univariate and multivariate analyses

To assess whether LTA is an independent risk factor for developing complicated disease behaviour, we performed univariate (Table 2) and multivariate analyses. The univariate analysis identified the presence of LTA, location of disease, and CRP as risk factors for developing complicated disease behaviour.

Table 2 Univariate analysis for the LTA and development of complicated disease behaviour and hospitalizations

The multivariate analysis showed that the presence of LTA was associated with a higher risk of developing complicated disease behaviour, as were ileal location and isolated upper disease, whereas CRP at diagnosis was not (Table 3).

Table 3 Multivariate analysis for the LTA and development of complicated disease behaviour

Using the multivariate analysis, the presence of LTA (HR 1.8, p = 0.008, 95% CI 1.2–2.7) and the presence of B2 behaviour (HR 1.9, p = 0.028, 95% CI 1.1–3.1) were found to increase the risk of hospitalizations. Patient age at diagnosis >17 years reduced the risk of hospitalization (A2: p = 0.033, HR 0.32, 95% CI 0.24–0.94; A3: p = 0.005, HR 0.32, 95% CI 0.14–0.70). B3 behaviour (p = 0.09), sex (p = 0.16), and BMI (p = 0.52) did not influence the risk of hospitalization.

For surgery, when all patients were analysed, the presence of complicated disease behaviour at diagnosis (B2: HR 3.6, 95% CI 2.3–5.8, p < 0.0001; B3: HR 5.5, 95% CI 2.0–10.7, p < 0.0001) was identified as a risk factor.

However, when patients with complicated disease behaviour at diagnosis were excluded, the presence of LTA (p = 0.015, HR 2.3, 95% CI 1.17–4.32) and either ileal (p = 0.019, HR 2.99, 95% CI 1.2–7.45) or upper GI location (p = 0.037, HR 9.57, 95% CI 1.148–79.3) were associated with a higher risk of surgery.

Sensitivity analysis

A number of sensitivity analyses were performed: when only patients with a normal CRP (below 5 mg/L; n = 27) were included, the result of the Kaplan-Meier (KM) analysis for the presence of LTA was similar to that of the analysis with all patients included. When only patients with B1 behaviour were included (inflammatory disease only; n = 281), the results of the KM LR and Cox regression analyses were similar to those of the entire cohort. When only patients from the HAGA hospital were analysed (n = 225), the results for the KM analysis and Cox regression analysis were similar to those of the entire cohort. When CRP was added to the Cox multivariate model divided into three equal groups instead of as a continuous variable, this did not alter the results (LTA: p < 0.0001, HR 2.4; locations L1 and L4: p = 0.022, HR 2.42, and p = 0.032, HR 5.5, respectively).

Discussion

In this study of 383 newly diagnosed Crohn’s disease patients, patients with liver test abnormalities more often developed complicated disease behaviour and more often were in need of hospitalization or surgery within 5 years of diagnosis than patients without LTA.

LTAs were present in 34.1% of patients. Most LTAs were mild (less than two times the ULN), and in most instances, only one liver test was elevated. The prevalence of 34% corresponds to the 29% found by Mendes et al. [2] with the prevalence in the literature varying between 3 and 50% [10]. As liver test abnormalities are often ascribed to medication (such as thiopurines) [3], we only included patients with IBD prior to the initiation of medication. The prevalence of chronic liver disease was low: only one patient was excluded due to small duct PSC, and the prevalence of steatosis diagnosed by ultrasound was 6%, similar to a recent paper demonstrating a prevalence of 8.2% in IBD patients [11].

In coeliac disease, a relationship was found between the presence of liver test abnormalities and disease activity. For instance, Zanini et al. found a correlation between the degree of mucosal inflammation, graded according to the Marsh-Oberhuber classification, and elevation of AST, ALT, and GGT, whereas AP was not reported in [5]. In our study of patients with active disease, the prevalence of LTA was highest in patients with the highest CRP (48% when CRP >58 mg/L) and this is in line with what is observed in patients with coeliac disease.

In Crohn’s disease, the relationship between inflammation and development of complications is well established. For instance, Allez et al. demonstrated that patients with more severe inflammation had a higher risk of fistulising disease [6]. In our study, 27% of CD patients developed a new episode of complicated disease behaviour within the first 5 years after diagnosis: 33% in patients with LTA and 14.6% in patients without LTA (p < 0.0001). Using the KM log-rank survival analysis, we showed that the presence of LTA was associated with an increased prevalence of complicated disease behaviour, as well as with increased need for hospitalizations.

As the mean CRP was higher in patients with LTA, we analysed whether the increased prevalence of complications in LTA patients was only reflective of the higher CRP concentration in this group. However, as shown in Fig. 3a, patients with a CRP <16 mg/L but with LTA had a higher risk of developing complicated disease compared to those without LTA. This demonstrates that the presence of LTA does not merely reflect a higher CRP concentration but may be a more sensitive indicator of an increased risk of complicated disease behaviour than CRP.

Additional evidence for the independent role of LTA as a risk factor for complicated disease behaviour comes from the multivariate analysis. In the Cox regression analysis, the presence of LTA, as well as isolated ileal disease, and the presence of upper GI disease are independent risk factors for complicated disease behaviour. Additionally, the presence of LTA was independently associated with an increased risk of hospitalizations (HR 1.7, p = 0.023) as well as surgery (HR 2.3, p = 0.015). For both these outcomes, a relationship with degree of inflammation has been suggested [12, 13].

Although the relationship between inflammation and liver test abnormalities has previously been found in ulcerative colitis [1, 14, 15], this has, to our knowledge, not been demonstrated previously in CD patients. In UC patients, Riegler et al. found a higher prevalence of LTA predominantly in those patients with a higher level of symptoms [14] and Morsy and Hasanain found a relation between a higher endoscopic index of severity and an elevated level of AP, ALT, and AST [15]. In these papers, no follow-up data was available and the mechanism behind this relationship was not explored in these papers.

Why the presence of LTA is associated with complicated disease behaviour is uncertain. But, as gut-liver crosstalk has received much attention recently [16], a number of theories have been suggested: increased intestinal permeability in active CD may lead to exposure of the liver to toxins or antigens of intestinal origin [5]. In coeliac disease, a correlation has been observed between transaminase level and the intestinal permeability index [17]. Coeliac disease patients with more severe inflammation have a higher intestinal permeability, and this is reflected in the higher prevalence of LTA. This may also be the case in CD patients as increased permeability is a hallmark of inflammation in CD and is associated with a more severe disease course [5].

Secondly, enterohepatic circulation of lymphocytes may play a role, in which memory cells from the gut mucosa are recruited to the liver in response to aberrantly expressed gut-homing molecules [18]. When these lymphocytes encounter an intestinal antigen in the liver, immune activation occurs leading to inflammation [18]. The combination of increased permeability and the presence of gut-derived lymphocytes may cause liver test abnormalities in patients with active Crohn’s disease.

We rejected hesitation to use immunomodulatory or biologicals in LTA patients as a potential explanation, as these medications may also induce liver test abnormalities, given the fact that the use of these drugs was similar in LTA and non-LTA patients.

The presence of perianal disease at diagnosis was not related to complicated disease behaviour, in contrast to a number of studies [19, 20]. However, 72% of patients with perianal disease at diagnosis were started on anti-TNF compared to 48% patients without perianal disease (p = 0.005). Recently, Magro et al. demonstrated that early treatment with either a biological or an immunomodulator can prevent progression to complicated disease behaviour [21]. The high frequency of anti-TNF usage in patients with perianal disease may have prevented progression to complicated luminal disease. Upper or isolated ileal location of disease was associated with a higher risk of surgery, as demonstrated previously [22, 23]. When all patients were analysed, the patient with complicated disease at diagnosis were also at an increased risk of surgery (stricturing disease: HR 3.6; penetrating: HR 5.5), confirming earlier reports [22, 23]. The presence of LTA was associated with an increased risk of surgery only in those patients without complicated disease behaviour at diagnosis.

In addition to their clinical relevance, our findings appear to underline the importance of the gut-liver axis and emphasize the need for a better understanding of its molecular and genetic components. The limitations of our study are its retrospective nature and the risk of referral bias. We sought to minimize the latter by including only newly diagnosed CD patients and patients from both a tertiary and two large teaching hospitals. The retrospective nature means that the work-up of patients with LTA was different in each case. However, in patients with persistent LTA or in those with higher elevations of liver tests, a more thorough evaluation was always performed, minimizing the chance of missed chronic liver disease in those patients.

In conclusion, the presence of LTA was an independent risk factor for developing complicated disease, need for hospitalization, and need for surgery in this retrospective cohort study. Liver tests may be a widely available, low-cost instrument that helps in identifying high-risk patients who could benefit from a top-down therapeutic regimen.