Introduction

Ulcerative colitis (UC) and Crohn’s disease (CD) are two major forms of chronic inflammatory bowel disease [1]. Although their etiology remains unknown, immunological, genetic, and environmental factors are considered to play important roles in their pathogenesis [2, 3]. Tumor necrosis factor-alpha (TNF-α) is one of the cytokines reported to be increased in both UC and CD [4, 5], and TNF-α blockade has emerged as an effective therapeutic strategy for both diseases. The first use of the anti-TNF-α antibody infliximab (IFX) for human inflammatory bowel disease was reported in a pediatric patient with severe CD in 1997 [6]. Efficacy in CD was confirmed in the subsequent ACCENT I and II randomized controlled trials [7]. Efficacy for UC was studied in the 54-week ACT 1 and 30-week ACT 2 trials, in which the IFX groups showed statistically significant increases in the proportion of patients with a clinical response, clinical remission, and mucosal healing (MH) compared with placebo [8].

Intravenous corticosteroid administration remains the first-line therapy in UC patients with acute moderate-to-severe disease and is reported to be effective in more than half of them [9]. However, a certain proportion of patients is refractory to corticosteroid treatment. Moreover, even among initial responders, approximately one-third of patients cannot withdraw from corticosteroids without a symptom flare, or experience symptoms after steroid discontinuation, and are therefore considered steroid-dependent. While IFX is a treatment of choice in steroid-refractory and steroid-dependent patients, other options that can be used for these patients as a rescue therapy include anti-TNF-α monoclonal antibodies (adalimumab and golimumab), calcineurin inhibitors (tacrolimus and cyclosporine), anti-α4β7 integrin (vedolizumab), and cytapheresis [10, 11]. No consensus on a strategy for corticosteroid-refractory or corticosteroid-dependent moderate-to-severe UC patients has been reached, since evidence directly comparing these options does not exist, except for a head-to-head comparison showing no difference between IFX and cyclosporine [12]. In addition, no definitive predictive factor has been reported to help choose a given treatment option.

Monitoring serum concentrations of IFX has been considered useful in evaluating the mechanism of loss of response to IFX. Afif et al. reported that low drug levels drawn 4 or 8 weeks after a scheduled infusion every 8 weeks predict loss of response after initial success with IFX [13]. A therapeutic algorithm for patients with secondary failure based on serum IFX trough levels (TLs) and the development of antibodies (ATIs) against IFX has been suggested to help determine an appropriate treatment intervention [13, 14].

Acute severe flares are one of the most critical features of UC. They may require hospitalization and surgery. Because decisions on whether a treatment should be continued or switched to the next option are often made with a lack of data about possible outcomes, and may result in necessary surgery being delayed, the earliest possible evaluation of the response to treatment is crucial. In clinical practice, judging short- and long-term efficacy soon after the initial infusion of IFX is always challenging: some patients gradually respond over time after multiple infusions, whereas others demonstrate a transient response followed by a subsequent loss of efficacy. Thus, the availability of an early and accurate predictor of outcome would be of particular value.

Here, as part of a phase 3 trial of IFX efficacy and safety in UC, we examined the accuracy of TL measurements as a predictive factor of therapeutic response early in IFX treatment.

Methods

Patients

This randomized, double-blind, placebo-controlled study (Japic CTI-060298) was conducted between July 2006 and December 2008 among 208 patients at 67 medical institutions in Japan. The protocol was approved by the institutional review board at each medical institution. All patients gave written informed consent. The study was conducted in accordance with the Declaration of Helsinki and Good Clinical Practice.

All eligible patients had an established diagnosis of UC and had been screened for tuberculosis by the skin test and chest radiography. Endoscopy was performed during screening to confirm disease activity. Eligible patients had no evidence of tuberculosis and had active UC with a Mayo score [15] of 6–12 points and an endoscopic subscore of ≥2. Documentation of one or more previous treatment failures was required at enrollment, namely, we enrolled patients with no response to immunomodulators (azathioprine and 6-mercaptopurine) that were started at least 12 weeks before enrollment and maintained at a stable dose from at least 4 weeks before enrollment or that were administered for at least 12 weeks within the preceding 5 years; patients who could not tolerate immunomodulators within the preceding 5 years; patients who had had no response to oral corticosteroids started at least 2 weeks before enrollment at 20 mg/day or more and maintained at a stable dose (corticosteroid-refractory); and patients who showed no response to corticosteroids (equivalent prednisolone 40 mg/day or more, for at least 2 weeks orally, or for at least a week intravenously), worsening of UC with tapering of corticosteroids (corticosteroid-dependent), or corticosteroid intolerance within the previous 18 months.

We excluded patients with a history of recent bowel surgery; bowel complications such as stricture, fistula, or dysplasia; or treatment with other biologics, methotrexate, calcineurin inhibitors, or cytapheresis within the previous 18 months because of the possible influence of these treatments on the efficacy of IFX. Patients with serious medical conditions such as chronic heart failure or latent infectious diseases (hepatitis C, human immunodeficiency virus infection, or any other chronic infectious disease) were also excluded.

Study design

Eligible patients were randomly assigned in a 1:1 ratio to receive an intravenous infusion of IFX at a dose of 5 mg/kg or placebo at weeks 0, 2, and 6 (Fig. 1a). Randomization was performed centrally with the use of computer-generated randomization schedules stratified according to the investigational site and concomitant use or nonuse of corticosteroids (prednisolone equivalent of 0, <20, or ≥20 mg/day). Patients with a significantly lower Mayo score at week 8 (defined as a decrease in the total Mayo baseline score of at least 3 points and at least 30 %, with an accompanying decrease in the rectal bleeding subscore of at least 1 point or an absolute subscore of 0 or 1, i.e., 8-week responders) then received IFX or placebo at weeks 14 and 22. Conversely, 8-week non-responders were discontinued from IFX or placebo treatment. Eight-week responders were followed through week 38. Eight-week non-responders and other patients who discontinued treatment with IFX or placebo were followed until 16 weeks after the last administration.

Fig. 1
figure 1

Study schema (a), patient disposition (b). IFX infliximab; UC ulcerative colitis; IC informed consent

Permitted concomitant medications for UC included corticosteroids (started at least 2 weeks before sigmoidoscopy at enrollment and maintained at a stable dose), oral aminosalicylates (started at least 3 weeks before enrollment and maintained at a stable dose), azathioprine and 6-mercaptopurine (started at least 12 weeks before enrollment and maintained at a stable dose from at least 4 weeks before enrollment). Doses of concomitant medications remained constant throughout the study except for corticosteroids, which were tapered by 5 mg/week after week 8 until a dose of 20 mg/day was reached and then by 2.5 mg/week.

End points

The primary end point was a clinical response at week 8. Secondary end points were clinical remission or MH response at week 8 as well as clinical response or clinical remission at week 30. The Mayo score was determined at weeks 0, 8, and 30. Clinical remission was defined as a total Mayo score of 2 points or lower, with no individual subscore exceeding 1 point. MH was defined as an endoscopic subscore of 0 or 1. The clinical activity index (CAI) [16] was determined at all visits. CAI remission was defined as a CAI score of 4 or lower.

TL at weeks 0, 2, 6, and every 8 weeks thereafter and ATI at weeks 0, 14, and 30 were measured using an enzyme-linked immunosorbent assay [17] (Mitsubishi Tanabe Pharma Corp., Osaka, Japan).

Statistical analysis

Efficacy was assessed in the full analysis set. Patients who took prohibited medication because of worsening UC (lack of efficacy or loss of response to the study medication), who discontinued the study medication because of worsening UC, including 8-week non-responders, or who underwent colectomy or colostomy were not considered to have had a clinical response, clinical remission, or MH, and their post-procedure CAI score was used as the baseline value from the time of the procedure onward. For other patients who withdrew prematurely, the last observation was carried forward. Statistical tests were two-sided, with p ≤ 0.05 considered to indicate statistical significance.

Demographic and baseline characteristics were compared using Fisher’s exact test, Student’s t test, or the Wilcoxon rank test. Proportions of patients with a clinical response, clinical remission, and MH were compared using logistic regression analysis [explanatory variables: treatment group, corticosteroid use (except for analysis in severe cases)].

Post hoc analysis

We conducted post hoc analysis in 82 patients who were treated with IFX at weeks 0, 2, and 6 and evaluated until week 14. The ratio of TL to CAI (TL/CAI) was calculated using the CAI value plus 1, because CAI scores include 0. Between-group differences in the TL and CAI remission rates were calculated with the Wilcoxon rank test and Fisher’s exact test, respectively. Receiver-operating characteristic (ROC) curve analysis was performed to assess the performance of the week 2 TL or TL/CAI for predicting CAI remission at week 14. Predictive factors in baseline characteristics as well as clinical and laboratory parameters identifying patients with CAI remission at week 14 were analyzed with logistic regression analysis. The cutoff values for age, body weight, TL, and TL/CAI used median values, while those for other end points used their reference values. Multiple logistic regression analysis was performed to identify independent predictors of 14-week CAI remission. The explanatory variables used in these multiple analyses were sex, age <39.5 years, weight <56.0 kg, smoker, colonic area involved, oral corticosteroid use, oral 5-aminosalicylate use, immunomodulator use, corticosteroid-refractory disease, corticosteroid-dependent disease, CAI remission at week 2, the week 2 TL (first TL) ≥ the median value, and the first TL/CAI ≥ the median value. Variables were chosen using stepwise backward selection to minimize the Bayesian information criterion.

Mitsubishi Tanabe Pharma Corp. sponsored this clinical trial and was responsible for the collection and analysis of data.

Results

Clinical outcome

Patient disposition is shown in Fig. 1b. There were no significant differences in backgrounds between the IFX and placebo groups (Table 1). At week 8, the clinical response rate, the primary end point, was significantly higher in patients who received IFX than in those who received placebo (p = 0.005; Table 2). The clinical remission rate showed marginal significance (p = 0.054), whereas MH was significantly more frequent with IFX than placebo (p = 0.006). Results of clinical response, clinical remission, and MH at week 30 are shown in Table 2. Response rate at week 8 in acute severe cases (n = 18, Mayo score ≥11) was similar to that in the entire study population, although it did not reach statistical significance because of the limited number of patients [54.5 % (6/11) in the IFX group versus 28.6 % (2/7) in the placebo group, p = 0.287].

Table 1 Baseline characteristics of patients
Table 2 Summary of efficacy results

Since this study was conducted as part of a nationwide phase 3 trial in Japan, safety profiles were also carefully calculated for all patients. There was no difference in the incidence of any adverse events, including serious infections and infusion reactions, during the short-term as well as the entire follow-up period in this trial (Table 3).

Table 3 Safety profiles

Clinical course of patients treated with IFX

We next assessed the clinical course of 82 patients who received the initial three infusions at weeks 0, 2, and 6 and evaluated them until week 14 to focus on the initial response to induction therapy with IFX, using CAI determined at all visits. Among these 82 patients, 37 (45 %) achieved CAI remission at week 2, but 8 (22 %) lost their response by week 14 (Fig. 2a). In contrast, 17 (38 %) patients who were not in CAI remission at week 2 improved and achieved CAI remission at week 14. Interestingly, most patients who achieved CAI remission at week 14 remained in remission until week 30, more frequently than the week 8 CAI remitters. Week 14 therefore appeared to be an appropriate time point to define the primary clinical response for the following analyses. TL was significantly higher in patients who were in CAI remission than in those who were not at each time point after the second infusion at week 6 (Fig. 2b).

Fig. 2
figure 2

Clinical course and TL of patients treated with IFX. a Clinical course of patients who achieved CAI remission. b TL according to clinical efficacy. Results are shown as median (interquartile range). Statistical differences were calculated with the Wilcoxon rank test. TL serum infliximab (IFX) trough level; CAI clinical activity index

Assessment of TL

Early prediction of clinical efficacy is especially important for the treatment of acutely ill patients. Therefore, we conducted a post hoc analysis to examine the potential usefulness of TL at the early time point for prediction of future outcome.

Interestingly, an association between week 2 TL and clinical efficacy showed a tendency to be stronger at the later time point (Fig. 3a) [the optimal cutoff value for predicting 14-week CAI remission: 21.3 μg/ml; 0.61 sensitivity, 0.69 specificity, 0.72 positive predictive value, and 0.58 negative predictive value; area under the curve: 0.65, 95 % confidence interval (CI): 0.52–0.77]. The correlation of TL with MH was also evaluated, since MH is considered an important predictive factor for long-term outcome. Similar to clinical efficacy, the first TL showed a significant correlation with MH at week 30 but not at week 8 (Fig. 3b).

Fig. 3
figure 3

TL at week 2 according to clinical efficacy at weeks 2, 6, 14, and 30 (a), MH at weeks 8 and 30 (b), 2-week TL/CAI according to 14-week CAI remission (c), and 30-week MH (d). Results are shown as median (solid line within box or bar) and interquartile range (upper and lower box boundaries). Statistical differences were calculated with the Wilcoxon rank test. TL serum infliximab trough level; CAI clinical activity index; MH mucosal healing

Usefulness of the first TL in predicting clinical outcome

We screened for the predictive values of multiple clinical parameters, including background and 2-week clinical parameters. Results showed that there was no patient background factor, including the severity and laboratory data, that demonstrated a significant association with clinical outcome at week 14 (Table 4). However, CAI remission at 2 weeks was significantly associated with the 14-week CAI remission, in addition to the first TL. This suggests that patients already in remission at week 2 are likely to remain in remission at week 14. However, this is not useful in identifying patients who gain/lose their response to IFX after week 2. We therefore evaluated the first TL/CAI and found that this ratio also correlated with 14-week CAI remission (Table 4) (the optimal cutoff value for predicting 14-week CAI remission: 4.3; 0.61 sensitivity, 0.81 specificity, 0.80 positive predictive value, and 0.62 negative predictive value; area under the curve: 0.76, 95 % CI: 0.65–0.86). Accordingly, although it did not reach statistical significance because of the limited sample size, the week 14 CAI remission rate in 2-week non-remitters (subgroup of delayed response) with the first TL/CAI of ≥4.3 tended to be higher than in those with the first TL/CAI <4.3 (50 % versus 36 %). Furthermore, the week 14 CAI non-remission rate in 2-week remitters (subgroup of loss of response) with the first TL/CAI of ≥4.3 tended to be lower than in those with the first TL/CAI <4.3 (14 % versus 44 %). In multiple logistic analysis, the first TL/CAI (odds ratio 8.07, 95 % confidence interval 2.84–27.07, p < 0.001) and 5-aminosalicylate use (odds ratio 3.98, 95 % confidence interval 1.29–14.26, p = 0.016) were identified as independent factors that correlated with 14-week CAI remission. The first TL/CAI was significantly associated with 30-week MH as well as with 14-week CAI remission (Fig. 3c, d).

Table 4 Predictive factors associated with 14-week CAI remission (univariate analysis)

Discussion

We assessed the predictive value of the first TL at week 2 for clinical outcome as part of a phase 3 randomized controlled trial of IFX in UC. Results showed the utility of the week 2 first TL as a predictive factor for both short- and long-term outcomes in these patients.

Previous studies reported that low serum albumin, absence of concomitant immunomodulator use, and severe disease were associated with poor response to IFX [1821]. However, in our study population, neither these nor any other baseline characteristics were associated with clinical outcome, suggesting that patient background, including severity and laboratory data, is not useful in predicting clinical outcome before the start of therapy. Therefore, early detection of response to IFX is critical for the management of UC, especially in acute severe patients (e.g., Mayo score ≥11 or CAI ≥12), whose clinical outcome was comparable with that in the moderately active patients. It has been reported that TL is associated with the clinical status at each time point in patients under maintenance therapy [22, 23], but its predictive role for future outcomes during induction therapy has not been studied. Therefore, we hypothesized that the first TL may be useful and found that the first TL significantly correlates with 14-week primary response. Interestingly, the correlation between the first TL and clinical outcome was seen even at week 30, suggesting that the 30-week clinical efficacy can be predicted after as little as 2 weeks. Furthermore, we also found that the first TL correlated with the achievement of MH at 30 weeks. Post hoc analysis was conducted in 82 patients who received treatment at weeks 0, 2, and 6 and were evaluated until week 14. However, even after including the 11 patients who discontinued the therapy because of worsening UC or lack of efficacy by week 14, the first TL showed significant correlations with 14-week remission and 30-week MH (data not shown). Although we did not conduct a long-term follow-up in this study, our results suggest that the first TL may predict not only the short-term, but also long-term outcome of subsequent IFX maintenance therapy, since MH is reported to be associated with a favorable long-term outcome [24]. This may be of major benefit in helping both patients and physicians to decide whether or not to continue IFX after 2 weeks.

On further investigation, we found that the first TL/CAI is an independent predictive factor for early clinical efficacy and significantly associated with subsequent MH. This seems clinically relevant since the TL/CAI may reflect the drug level required to treat residual inflammation in each patient. Therefore, TL measurement in combination with re-evaluation of clinical severity at week 2 appears to be a reasonable strategy for predicting both short- and long-term treatment outcomes with IFX while still in the initial stage of treatment.

One clinical question arising from our results is whether early dose escalation or elective switching could improve the outcome in patients whose first TL or TL/CAI appears low. In clinical practice, there are some acutely ill patients who do not improve initially but who then respond to dose escalation or acceleration compared to standard dosing. Consistent with this, a recent report demonstrated that an accelerated induction regimen improved the colectomy rate in acute severe UC [25]. Immediate switching to other induction options such as tacrolimus or vedolizumab might also be alternatives in patients with low TL or low TL/CAI who are predicted to be non-responders. Our results suggest that the first TL or TL/CAI may be a useful objective marker leading to an individualized induction regimen for acute severe UC. However, further prospective clinical studies are needed to elucidate the advantage of the trough-oriented induction strategy in acutely ill patients based on measurement of the first TL.

Development of ATI alters the pharmacokinetics of IFX [26] and is recognized as an important cause of loss of response in patients treated with maintenance or episodic IFX [27]. However, our method measures ATI values only when IFX is absent in sera [17], although we measured ATI at weeks 0, 14, and 30. No patients were positive for ATI at week 14 in our study, while 67 % of patients with rheumatoid arthritis have detectable ATI within 14 weeks after IFX initiation by the new and sensitive assay [28]. The possibility that the absence of ATI-positive patients was because of technical limitations cannot be excluded, however, the role of ATI in the primary non-responder is still unclear. Clearance of IFX is also influenced by other factors such as sex, body mass index, concomitant use of immunomodulators, disease type, albumin, and inflammatory burden [18, 26].

In this multicenter randomized controlled trial, we confirmed that the efficacy of IFX was comparable to that reported in previous trials, including ACT 1 and 2 [8]. This is the first placebo-controlled study with a sufficient sample size from Asia and suggests that IFX is as effective for Asian UC patients as it is for Western populations. Nevertheless, a number of differences between Western and Asian IBD patients are known, including the absence of NOD2 mutant CD and a different gender ratio in UC among Asians [29]. Therefore, it is still important to confirm the reproducibility of clinical trials in the Asian population.

In summary, IFX was confirmed to be effective and safe as therapy for UC. In addition, this study is the first to suggest that the first TL can be used to predict the future treatment outcome. Furthermore, the first TL/CAI was an independent predictor of primary response and was associated with subsequent MH. Our findings may provide critical knowledge for decision making in treatment strategies for acute severe UC. Further prospective studies will be needed to examine the usefulness of the first TL-guided induction dosing strategy in acute severe UC.