Introduction

Azathioprine (AZA) is a cornerstone immunosuppressive therapy for patients with inflammatory bowel disease (IBD) [1, 2]. It improves important outcomes such as hospitalization, surgery, steroid use and colorectal cancer risk [3,4,5,6,7]. Moreover, AZA has been used for over 50 years in millions of patients worldwide and hence common as well as rare adverse events are well documented. As IBD most commonly affects patients during their reproductive years, the well-established safety of AZA during pregnancy and breastfeeding is an important advantage, as compared to newer biologics and small-molecule drugs [8,9,10].

Unfortunately, AZA is limited by adverse events with the tolerated dose often being well below the effective dose [11, 12], resulting in 55% of patients discontinuing therapy within 5 years due to intolerance or ineffectiveness [13]. Consequently, a shift towards early therapy with biologics has occurred in those in more affluent countries. Whilst biologics can be transformative, they do have limitations: poor long-term effectiveness (related to antibody formation) [14]; parenteral administration, which affects quality of life [15]; infusion reactions (occasionally life-threatening) [16]; and high costs [17]. To address some of these concerns, new small-molecule drugs are being developed. However, the potential to optimize established small-molecule drugs like the thiopurines, and subsequently reduce the number of patients escalated to treatments like biologics, is often overlooked. Since thiopurines are affordable this strategy also reduces healthcare costs, which is particularly pertinent for developing countries and uninsured patients in higher-income countries [18].

One such optimization strategy for patients with IBD is to combine low-dose thiopurine with allopurinol, which was first described in 2005 [19]. Despite the fact that switching AZA monotherapy (AZAm) to low-dose AZA with allopurinol co-therapy (LDAA) improves tolerability and efficacy [19,20,21,22,23,24,25,26], LDAA remains underreported in current literature [27], not mentioned in most IBD guidelines [1, 2] and hence underutilized. Moreover, most studies restricted LDAA to patients both failing AZA (due to intolerance or inefficacy) and showing aberrant metabolism i.e. hypermethylation (production of toxic 6-MMPs at the expense of therapeutic 6-TGNs). Conversely, several studies have demonstrated a poor predictive value of metabolites with regard to clinical outcomes. For instance, 52% of the patients resistant [28] and up to 50% of the patients intolerant to AZAm [26, 29] were non-hypermethylators. Of those with high 6-MMP levels, nearly 90% did not show hepatotoxicity [29, 30]. Only using LDAA in patients showing hypermethylation thus excludes a large number of patients who could benefit from LDAA. Furthermore, this approach hinders LDAA use in less affluent institutions/countries without access to metabolite measurements.

Apart from being a valuable option for all patients with IBD who have failed AZAm, LDAA might be beneficial as first-line immunosuppressive therapy [24]. Using LDAA as first-line therapy in daily practice (rather than AZAm) could reduce delay in reaching an effective treatment, and could eliminate complex drug dosing and extra monitoring typically needed with AZAm (LDAA can be initiated and maintained at the target/effective dose in most patients). This would improve both patient and clinician experience.

Hence, we sought to evaluate the long-term effectiveness and safety of first-line LDAA therapy in a large IBD cohort undirected by 6-TGNs/6-MMPs, and to compare outcomes with a similar AZAm cohort.

Methods

Study Design

Medical records of a single centre (East Surrey Hospital, UK) were retrospectively reviewed. Subjects were identified by cross-referencing hospital pharmacy dispensing records from 01/2014 to 05/2019 with a prospective patient registry. Patients with Crohn’s disease, ulcerative colitis or IBD-unclassified and treated with either LDAA or AZAm as first thiopurine therapy, were included. Patients below 18 years, without complete follow-up or with commencement of therapy before 2014 (to avoid overlap with previously published data [24]) were excluded. Patients were followed until thiopurine discontinuation or therapy escalation, or May 2019.

Throughout the study period, patients were allocated to either LDAA or AZAm based on patient’s preference and/or clinician’s experience/preference with regard to these treatments. AZAm patients were initiated on 50 mg/day, slowly increased to a target dose of 2.5 mg/kg/day or 1.25 mg/kg/day in case of wild-type or heterozygous TPMT activity, respectively. LDAA patients received a target dose from the beginning, i.e. 25% of the equivalent AZAm dose. Allopurinol was commenced at 100 mg/day.

Hematological, biochemical and inflammatory parameters were monitored every 2 weeks for the first 12 weeks and then every 3 months. 6-TGNs and 6-MMPs were not routinely measured.

This study was considered an audit of routine clinical practice, hence was qualified as being exempt from Research Ethical Committee review. The study was conducted in accordance with the Declaration of Helsinki.

Effectiveness

Primary outcome was clinical benefit, which was defined as ongoing therapy without initiation of corticosteroids, biologics or bowel resection, and thus incorporated both efficacy and tolerability. Clinical benefit was assessed at 6 and 12 months. Moreover, sustained clinical benefit was assigned when the criterion was met during at least 12 months and maintained until final follow-up. Secondary outcomes included IBD activity scores, CRP and withdrawal of concomitant IBD medication.

Safety

Adverse events and reasons for withdrawal were recorded. Liver abnormalities were categorized into hepatotoxicity (ALT > twofold the upper limit of normal; > 82 U/L) and elevated liver enzymes without hepatotoxicity (ALT 41-82 U/L). Leukopenia was defined as leukocytes < 3.5 × 109/L and thrombocytopenia as platelets < 150 × 109/L.

Statistics

Effectiveness analyses were performed according to the per-protocol principle. Patients requiring steroids/biologics during the first three months were excluded from the effectiveness analysis, considering that AZA requires three months to become fully effective [31].

Chi-square and Mann–Whitney U tests were used to compare outcomes between groups. A Kaplan–Meier plot visualized survival characteristics (event: non-clinical benefit); the curves were compared using Mantel-Cox test and a Cox-regression yielded the hazard ratio.

Patient characteristics associated with clinical benefit in the LDAA cohort based on a univariable Cox-regression (p < 0.20) were included in a multivariable model to identify independent associated factors. A univariate logistic regression examined possible correlations between intolerance to LDAA and gender, age, IBD-type, TPMT activity.

Statistical analyses were performed using IBM SPSS Statistics version 25. A p-value < 0.05 was considered statistically significant.

Results

Patient and Therapy Characteristics

In total, 166 LDAA and 118 AZAm patients were included (Fig. 1), with a median follow-up of 25 and 27 months, respectively. Collected baseline characteristics were very similar in both cohorts (Table 1). Primary indication for thiopurines was ‘active disease’ (LDAA n = 149 [90%]; AZAm n = 109 [92%]). All patients were thiopurine-naïve and most patients were immunosuppressant-naïve (150 LDAA [90%] and 113 AZAm [97%]).

Fig. 1
figure 1

Flowchart showing the selection of the study population

Table 1 Patient and disease characteristics, n (%) or median (IQR)

Only 14 (8%) LDAA and 3 (3%) AZAm patients used concurrent biologic. In most cases (11/14 LDAA and 2/3 AZAm patients) thiopurine and biologic therapy were commenced simultaneously because of a top-down therapeutic approach. In those with normal TPMT activity (~90%), median AZA dose was 0.54 mg/kg/day (LDAA) and 1.83 mg/kg/day (AZAm).

Effectiveness

Clinical benefit rates are summarized in Figs. 2 and 3, and Table 2. Clinical benefit was more frequently observed in LDAA than AZAm patients at 6 months (116/156 [74%] vs. 59/111 [53%], p = 0.0003) and 12 months (74/138 [54%] vs. 38/103 [37%], p = 0.01). Furthermore, 51/138 (37%) LDAA compared to 25/103 (24%) AZAm subjects demonstrated sustained clinical benefit during a median of 36 (IQR 20–44) and 33 (IQR 22–44) months, respectively (p = 0.04).

Fig. 2
figure 2

Clinical benefit (CB) in patients receiving low-dose azathioprine with allopurinol (LDAA)

Fig. 3
figure 3

Clinical benefit (CB) in patients receiving azathioprine monotherapy (AZAm)

Table 2 Primary outcome: clinical benefit, n (%)

A Kaplan–Meier curve (Fig. 4) demonstrates that more LDAA than AZAm patients were having clinical benefit during the entire study period (p = 0.003). Throughout follow-up, AZAm individuals were 60% more likely to fail therapy (95%-CI 1.16–2.14). Moreover, median duration of beneficial response from commencement of therapy was 17 months in the LDAA cohort (95%-CI 9–25) compared to 6 months in the AZAm cohort (95%-CI 1–11).

Fig. 4
figure 4

Kaplan Meier survival curve of clinical benefit (CB) in patients treated with azathioprine monotherapy (AZAm) and low-dose azathioprine with allopurinol (LDAA). A statistical difference between the curves was found (p = 0.003)

A multivariable analysis on the association between patient characteristics and clinical benefit in LDAA-treated patients (Table S1, Supplementary Material) demonstrated that stricturing Crohn’s disease was inversely correlated with clinical benefit (hazard ratio 2.09, 95%-CI 1.02–4.28). No difference in response to LDAA was found between Crohn’s disease and ulcerative colitis patients, nor between patients with and without concurrent biologic therapy.

Secondary Outcomes

Secondary outcomes are summarized in Table 3. Steroid withdrawal within 6 months occurred in approximately 90% of both cohorts. In the LDAA cohort, 65% (69/106) demonstrated clinical remission (SCCAI ≤ 2 or HBI ≤ 4) at last review, compared to 58% (45/78) in the AZAm (p = 0.31). Biochemical remission (CRP ≤ 10 mg/L) was observed in 93/158 (59%) and 59/94 (63%) of LDAA and AZAm patients, respectively (p = 0.54).

Table 3 Secondary outcomes, n (%)

Adverse Events

Sixty-eight of the 116 AZAm (58%) and 80/166 (48%) LDAA patients experienced adverse events during a median follow-up of 27 and 25 months, respectively (p = 0.12, Table 4). Adverse events were the main reason for discontinuing either LDAA or AZAm. However, a greater proportion of AZAm patients discontinued due to adverse events (AZAm: 53/118 [45%]; LDAA: 43/166 [26%]; p = 0.001). LDAA withdrawal due to intolerance did not depend on gender, age, IBD-type or TPMT level.

Table 4 Adverse events, n (% of entire cohort)

No malignancies were reported in either cohort and the incidence of pancreatitis was low.

Liver Enzyme Abnormalities

Liver enzyme abnormalities occurred equally in both cohorts: 11 (7%) patients reported hepatotoxicity and 13 (8%) elevated liver function tests (LFTs) without hepatotoxicity in the LDAA cohort, compared to 8 (7%) and 10 (8%) AZAm patients, respectively. Upon more detailed review of LDAA patients, 4/11 reporting hepatotoxicity and 5/13 reporting elevated LFTs without hepatotoxicity had liver disease prior to LDAA initiation (fatty liver n = 7; focal nodular hyperplasia n = 1; primary sclerosing cholangitis n = 1). In contrast, 1/18 AZAm patients with liver enzyme abnormalities had prior liver disease (fatty liver).

Hepatotoxicity led to LDAA discontinuation in two (1%) patients as did elevated LFTs without hepatotoxicity in one patient (0.6%). In six other patients with hepatotoxicity, allopurinol dose was increased to 200–300 mg/day. Consequently, the abnormal LFTs resolved in 4/6, improved in 1/6 and persisted in the remaining patient (who had primary sclerosing cholangitis). All six patients continued LDAA until final follow-up and did not experience further side effects on the escalated allopurinol dose.

AZAm was discontinued due to hepatotoxicity and elevated LFTs without hepatotoxicity in three (3%) and two (2%) patients, respectively.

Myelotoxicity

Incidence of myelotoxicity was similar in both cohorts (LDAA: 18/166 [11%]; AZAm: 11/118 [9%], p = 0.17). One patient in each cohort required hospital admission related to myelotoxicity (LDAA: respiratory tract infection, AZAm: CMV reactivation).

Regarding the LDAA cohort, 2/21 (10%) with heterozygous TPMT activity developed myelotoxicity, whereas 16/137 (12%) patients with wild-type TPMT (p = 0.77). Additionally, myelotoxicity was not related to gender, age or IBD-type.

Upon minor AZA dose reduction in 5/12 LDAA patients with leukopenia, leukocyte count normalized in three patients. All three were able to continue therapy until final follow-up, without loss of response. Myelotoxicity resulted in permanent LDAA withdrawal in 3/166 (2%) patients.

None of the AZAm patients had a dose reduction after myelotoxicity was detected and 5/118 (4%) permanently discontinued AZAm due to myelotoxicity.

Discussion

This is the largest study with the longest follow-up evaluating first-line LDAA for IBD to date, and the first study comparing safety and effectiveness of LDAA (n = 166) with AZAm (n = 118) in two very similar cohorts. Ongoing therapy without need for steroids, biologics or surgery was more frequently observed in LDAA compared to AZAm patients at 6 months (74% vs. 53%), 12 months (54% vs. 37%) and in the long-term (median 36 months, 37% vs. 24%). Throughout follow-up, AZAm patients were 60% more likely to fail therapy, due to a higher intolerance rate (45% vs. 26%).

Direct comparison of our results with others is challenging since most studies have evaluated either effectiveness or tolerability of thiopurines, whereas our primary outcome assessed both elements. Nevertheless, high success rates have been reported for low-dose thiopurine with allopurinol therapy, but not for AZAm (see next paragraphs).

Two well-known randomized, double-blind trials evaluated the efficacy of 2.5 mg/kg/day AZAm: steroid-free remission was seen in 30% (51/170) of Crohn’s patients at 6 months [32] and 24% (18/76) of ulcerative colitis patients at 4 months [33]. The lower response rates compared to our study are likely due to the fact that endpoints of a randomized trial are more difficult to achieve, and that intention-to-treat analysis was used (while we used per-protocol analysis). Recently, long-term effective thiopurine monotherapy was observed in 4995/11928 (42%) subjects, but analysis did not include initiation of steroids [6].

The high incidence of intolerance to AZAm in our study is in keeping with other studies, in which 30–40% of patients withdrew AZAm due to adverse events [11, 12, 34, 35]. Additionally, it is worth noting in these studies that tolerability came at the expense of a sub-therapeutic AZA dose. In our study inability to tolerate an effective AZAm dose (i.e. 2.5 mg/kg/day for patients with normal TPMT activity [36]) was also observed: only a median dose of 1.83 mg/kg/day was achieved, which is 73% of the effective dose. Conversely, LDAA could be initiated and maintained at the target dose in most patients, with a better response rate. Hence, using LDAA as first-line therapy in clinical practice (instead of AZAm) would likely reduce periods of active disease, improve patient experience and eliminate the time and costs associated with managing poor treatment effectiveness and intolerance.

In a previous study, positive response to low-dose thiopurine with allopurinol (AZA in > 50% of patients, mercaptopurine in the rest) at 12 months was seen in 62% of patients failing thiopurine monotherapy and/or showing hypermethylation [25]. Another study showed thiopurine-allopurinol continuation in 78% (n = 145) and 66% (n = 83) of patients at 12 months and 2 years, respectively [37]. The possible explanation for the greater effectiveness in their study might be a twofold lower incidence of patients with active disease at initiation, and that initiation of steroids, biologics or surgery was not considered treatment failure. In a recent randomized trial, 73% of patients on thiopurine and allopurinol (AZA in 50%) were in steroid-free remission at 6 months according to per-protocol analysis [20]. Lastly, another study showed clinical benefit during at least 12 months in 38% of the optimized thiopurine cohort (co-administration of allopurinol in 40%) [23]. The results of these studies are in line with our outcomes.

To date, effectiveness of thiopurine-allopurinol co-therapy in thiopurine-naive patients has only been assessed by our study group [24]. An improvement in IBD activity scores in the absence of steroid use was observed in 79/110 (72%) patients after a median treatment duration of 12 months [24].

Our study has limitations which need to be acknowledged. First, allocation to LDAA or AZAm was not randomized. However, collected baseline characteristics in both cohorts were very similar. Another limitation is the retrospective design, which could have led to information bias. The influence of this was minimized by comparing two treatment groups, with a single researcher collecting all of the data. A further limitation concerns sampling bias, as the cohorts were restricted to patients in a single centre. Finally, we were not able to comment on mucosal response or draw firm conclusions regarding CRP level and IBD activity scores due to incomplete data.

Although myelotoxicity is a common concern during LDAA therapy, only 12 (7%) LDAA patients developed leukopenia during a median follow-up of 25 months. This incidence is comparable to our AZAm cohort, to other thiopurine monotherapy studies [12, 38], and also to two large studies using low doses of AZA during co-therapy [26, 37]. The lack of correlation between myelotoxicity and heterozygous TPMT activity in our study can be explained by dose adjustment for weight and TPMT activity.

Elevated LFTs occurred in 14% (24/166) of LDAA and 15% (18/118) of AZAm patients during our study, of which 7% of both cohorts had hepatotoxicity. Nine of twenty-four patients with elevated LFTs in the LDAA cohort had a pre-existing liver disorder, which is the likely explanation in these patients. In contrast, the liver injury in the AZAm patients was more likely to be drug-induced as only one patient had a pre-existing liver condition. Previous studies described hepatotoxicity in 5% of LDAA patients [24] and de-novo elevated LFTs in 14% [37], which is in line with our results. Interestingly, we showed that increasing the allopurinol dose can be a beneficial strategy to treat hepatotoxicity. On increasing allopurinol to 200–300 mg/day, LFTs fell significantly in 5/6 patients and stabilized in a patient with primary sclerosing cholangitis. These findings are in concordance with our previous study, in which an escalated allopurinol dose of 300 mg/day led to normalization of LFTs in 8/11 patients, with the remaining three patients having primary sclerosing cholangitis [24]. The biochemical mechanism for this phenomenon may be that allopurinol reduces methylated metabolites and reactive oxidative free radicals, both of which are thought to be hepatotoxic [19, 29, 39].

Although myelotoxicity and elevated LFTs were relatively common in our study, they only resulted in LDAA withdrawal in 2% for either abnormality. Recently, Kreijne et al. reported the same phenomenon in a cohort of 221 patients: myelotoxicity and elevated LFTs led to LDAA cessation in only 4% and 1%, respectively [37].

In conclusion, this large study demonstrates that optimization of AZA is needed, as poor outcomes were observed in our AZAm cohort. We have shown that first-line co-administration of allopurinol is a long-term effective and safe optimization strategy, even without metabolite monitoring. Although patients may still experience mild laboratory abnormality, we confirmed that myelotoxicity and elevated liver enzymes rarely necessitate LDAA withdrawal if closely monitored and adequately treated. Using LDAA as first-line therapy for IBD (rather than AZAm) would likely reduce periods of active disease, improve patient experience and reduce the number of patients escalated to more expensive treatments like biologics. Hence, LDAA may be considered as standard first-line immunosuppressive.