Introduction

Rheumatoid arthritis (RA) is a global public health challenge, with increasing rates of age-standardized point prevalence and annual incidence [1]. If inadequately treated, RA may cause joint damage, disability, and other sequelae that impact the quality of life and lead to economic losses [2]. Early diagnosis and treatment are important to reduce this disease burden in patients with RA, and a treat-to-target (TTT) approach is recommended to achieve clinical remission or low disease activity (LDA). Over the years, this has become a realistic goal with the advent of effective medications, including conventional synthetic (cs), biologic (b), and targeted synthetic (ts) disease-modifying anti-rheumatic drugs (DMARDs) [3, 4].

At present, methotrexate (MTX), a csDMARD is considered an integral part of the first-line treatment strategy in patients with RA [3]. Further optimization of MTX dosage or addition of b/tsDMARDs is driven by the TTT approach. If a b/tsDMARD fails, switching to a b/tsDMARD of a different class is recommended to achieve the target; however, there are limited data to support the choice of drug class for this approach [3, 4].

Two of the currently approved b/tsDMARD therapy classes act via inhibition of tumor necrosis factor-alpha (TNF) or interleukin-6 receptor (IL-6R), thereby playing a vital role in RA via their anti-inflammatory effects [5, 6]. Although 10% to 50% of RA patients achieve remission in 6 to 12 months with the use of the first b/tsDMARD (such as TNF inhibitor [TNFi] or IL-6R inhibitor [IL-6Ri]), either as monotherapy or in combination with csDMARDs, a meaningful proportion of patients have active disease and progression of disability [7, 8]. Thus, patients require switching between b/tsDMARDs to achieve the target of remission [3, 4], and this decision can be informed by research on the comparative effectiveness of these therapies.

A randomized Phase 3 head-to-head (H2H) trial (MONARCH) in biologic-naïve RA patients, with high disease activity and intolerance or inadequate response to MTX, showed sarilumab (an IL-6Ri) monotherapy to be superior to adalimumab (a TNFi) monotherapy for reducing disease activity and signs/symptoms of RA [9]. Tocilizumab (another IL-6Ri) also demonstrated superior clinical response than adalimumab in a Phase 4 randomized controlled trial (RCT) as monotherapy in a similar population [10]. However, there is limited research comparing the relative effectiveness of TNFi vs IL-6Ri as monotherapy or in combination with csDMARDs in RA patients with moderate or high disease activity, who have previously been treated with b/tsDMARDs (i.e., b/ts-experienced patients), in a real-world patient population.

The CorEvitas RA registry (formerly known as Corrona) is a prospective, multicenter, real-world registry, launched in the United States (US), and collects data (at the time of a clinical encounter) of clinical outcomes and patient-reported outcomes (PROs) from both physicians and patients. At present, the registry includes information on > 56,000 patients with RA from 857 rheumatologists across 42 states in the US [11]. Based on the CorEvitas RA registry, a retrospective examination of prospectively collected data was conducted to assess the clinical outcomes in b/ts-experienced RA patients, who received TNFi or IL-6Ri as monotherapy or in combination with csDMARDs. The objective of the study was to compare the effectiveness of second- and third-line TNFi vs IL-6Ri (as mono- or combination therapy) in treating RA patients with moderate or high disease activity.

Materials and methods

Study design and patient population

This was a retrospective, observational study in which data from adult RA patients, within the US CorEvitas RA registry [11], were evaluated. The study was conducted in accordance with the Declaration of Helsinki, and all participating investigators obtained full ethics or institutional review board (IRB) approval (central IRB: New England Independent Review Board [NEIRB] number: 120160610 and/or individual approvals at sites). All registry patients were required to provide written informed consent prior to participation.

Adult patients with RA (≥ 18 years) who initiated a TNFi (adalimumab, certolizumab pegol, etanercept, golimumab, or infliximab) or IL-6Ri (sarilumab or tocilizumab) during or after January 2010 (until May 2020) were included. The study period was selected based on the approval and clinical availability of IL-6Ri and TNFi classes of therapeutics. Patients were included in the study if they had a history of one or two b/tsDMARDs prior to initiation, moderate (Clinical Disease Activity Index [CDAI]: 10 to ≤ 22) or high (CDAI: > 22) disease activity at initiation [12], and recorded a follow-up visit at 6 (± 3) months after therapy initiation. Patients who were not eligible to participate in the registry included those who: (i) had a diagnosis of juvenile idiopathic arthritis, psoriatic arthritis, spondylarthritis, ankylosing spondylitis, systemic lupus erythematosus, or any other form of autoimmune inflammatory arthritis; (ii) were only on csDMARDs; or (iii) were participating/planning to participate in any RA clinical trial.

Study treatments

All RA patients with moderate or high disease activity were categorized as either TNFi or IL-6Ri initiators, both with subgroups initiating as monotherapy or combination therapy. Monotherapy initiators stopped the prior csDMARDs at the time of initiating TNFi or IL-6Ri or anytime earlier, and combination therapy initiators received MTX with or without other csDMARDs, in addition to TNFi or IL-6Ri.

Study assessments

At the baseline visit (i.e., visit at which TNFi/IL-6Ri was started), the following variables were recorded for each patient: demographic characteristics, lifestyle status, history of comorbidities, medication use, and disease severity measures including the CDAI and PROs. Clinical outcomes collected were as follows:

Clinical disease activity index

Mean change and achievement of low disease activity (LDA; CDAI: ≤ 10); achievement of minimal clinically important difference (MCID, i.e., improvement by ≥ 6 [for moderate disease activity at baseline] or ≥ 12 [for high disease activity at baseline] units) in the CDAI from baseline to follow-up; and achievement of remission (CDAI: ≤ 2.8) [12, 13].

Patient-reported outcomes

Health Assessment Questionnaire-Disability Index (HAQ-DI) (mean change and achievement of improvement in the HAQ-DI of ≥ 0.22 or ≥ 0.30 units) [14, 15]; EuroQol-5 Dimension (EQ-5D) score (mean change); pain visual analog scale (VAS, 0–100) (mean change and achievement of improvement by ≥ 10 units); patient global assessment (VAS, 0–100) (mean change and achievement of improvement by ≥ 10 units) [16]; and fatigue (single-item VAS, 0–100) (mean change and achievement of improvement by ≥ 10 units) [17].

Disproportionate pain (DP) [18, 19]

Presence or absence of DP1 at 6 months among patients with DP1 at baseline; and presence or absence of DP2 at 6 months among patients with DP2 at baseline, for which:

Presence of DP1 is defined as:

$$\mathrm{TJC}-\mathrm{SJC }\ge 7$$

Presence of DP2 (among those with TJC > 0) is defined as:

$$\frac{\mathrm{SJC}}{\mathrm{TJC}}<0.5$$

where, TJC is 28-tender joint count and SJC is 28-swollen joint count.

The following outcomes were measured as exploratory analyses: response to prior TNFi therapies (using duration of previous TNFi) to investigate whether response/non-response to prior TNFi therapy channeled patients to different subsequent treatments (TNFi vs IL-6Ri). Also, change in the prednisone dose from baseline to 6 months (using baseline dose of prednisone) was evaluated for all patients.

No safety outcomes were assessed in the present study.

Statistical analyses

Both TNFi and IL-6Ri initiators (as mono- or combination therapy) were compared at baseline and at the follow-up visit. Descriptive statistics were measured for each variable at baseline. Continuous variables were summarized using mean and standard deviation (SD), while categorical variables were reported as total number and proportion of each category. Univariate comparisons between therapy groups were performed using t-tests for continuous variables and chi-square tests for categorical variables.

Since the same patient can potentially contribute to multiple observations for repeated initiations within the same drug class or across different drug classes, mixed-effects regression models with random intercept for patient were used to account for the potential correlation among separate observations from the same patient.

For mean change in outcomes, the difference from baseline to six-month follow-up was calculated for each patient and used as the dependent variable in mixed-effect linear regression models. For binary outcomes, an indicator variable was created, measuring whether a patient achieved the outcome from baseline to follow-up or not. These indicator variables were then used as dependent variables in mixed-effect logistic regression models to predict the achievement of each outcome. Unadjusted and covariate-adjusted effects (mean change in effect [β, 95% confidence interval {CI}] for linear regressions and odds ratio [OR, 95% CI] for logistic regressions) were reported. The independent variables in all models included treatment group (TNFi as reference), the baseline value of the outcome variable, and a set of additional covariates (confounders) determined a priori to be likely to influence the outcome measures. Covariates also included those characteristics which were found to be significantly different at the baseline; For monotherapy initiators, covariates included biologic line of therapy, age, duration of RA, gender, work status, history of cardiovascular disease (CVD), CDAI, and morning stiffness; for combination therapy initiators, these were biologic line of therapy, history of CVD, CDAI, patient reported pain, prior use of csDMARDs, and opioids use. These analyses were replicated for monotherapy and combination therapy initiators.

Differences in duration of prior exposure to TNFi were also investigated among patients who had received TNFi earlier. Duration of prior TNFi therapy was used as a proxy for primary and secondary non-response, which may be associated with the effectiveness of subsequent TNFi [20, 21]. The last prior TNFi was identified among patients with a history of at least one prior TNFi, and the proportion of the population was reported with the available information as well as the mean (SD) and median (25th percentile and 75th percentile) duration of therapy. Also, the proportions of the population that persisted on therapy for at least 6 and 12 months were reported. This information was presented for all eligible initiators and by line of therapy. It was assumed that therapy discontinued prior to 6 months would be more likely to be associated with primary non-response.

Lastly, prednisone use was categorized as no use, dose < 10 mg, and dose ≥ 10 mg, and summarized at baseline and at 6 months for TNFi and IL-6Ri monotherapy and combination therapy initiators.

Sensitivity analyses were conducted for mean change in outcomes and prednisone use, where outcomes were reanalyzed by considering binary response outcomes as “non-responders” and imputing continuous outcomes with last observation carried forward (LOCF) for patients who discontinued a biologic prior to the six-months follow-up.

All analyses were performed using Stata 15 and/or SAS 9.4.

Results

Patient disposition and baseline characteristics

Out of 9682 patients (with moderate or high disease activity) initiating TNFi, 737 and 1315 patients were included in the study as monotherapy and combination therapy initiators, respectively. Similarly, out of 3008 patients (with moderate or high disease activity) initiating IL-6Ri, 286 and 401 patients were included in the study as monotherapy and combination therapy initiators, respectively (Fig. 1).

Fig. 1
figure 1

Patient disposition

Patients initiating IL-6Ri (n = 286) vs TNFi monotherapy (n = 737) were older (60.0 vs 55.4 years; P < 0.001), had a longer history of RA (12.2 vs 10.0 years; P = 0.001), higher CDAI at baseline (26.9 vs 24.9; P = 0.02), and were more likely to initiate as third-line therapy (57.0% vs 30.9%; P < 0.001). Further, patients initiating IL-6Ri (n = 401) vs TNFi (n = 1315) combination therapy had higher CDAI at baseline (26.7 vs 24.8; P = 0.007) and were more likely to initiate as third-line therapy (56.4% vs 28.7%; P < 0.001). The detailed baseline demographic and clinical characteristics are described in Table 1 and Table 2, respectively.

Table 1 Baseline demographic characteristics in monotherapy and combination therapy initiators by therapy class
Table 2 Baseline clinical characteristics in monotherapy and combination therapy initiators by therapy class

Outcome assessments

In unadjusted as well as adjusted analyses, no clinically or statistically significant differences were noted for disease activity measures, PROs, and DP between TNFi and IL-6Ri initiators, both as mono- or combination therapy, although there was one exception. In the unadjusted analyses of DP1 (all initiators) among the combination therapy group, higher odds of DP1 presence were noted at 6 months for IL-6Ri when compared with TNFi (17.8% vs 12.6%; OR: 1.64 [1.10, 2.45]; P = 0.015); however, this difference was not seen in the adjusted analyses. One-third of the TNFi and IL-6Ri monotherapy (37.0% vs 32.7%; adjusted OR [aOR]: 0.99 [0.59, 1.67]; Table 3) and combination therapy initiators (36.7% vs 31.2%; aOR: 0.96 [0.66, 1.38]; Table 4) achieved LDA.

Table 3 Results from mixed models evaluating the impact of treatment class on disease burden, disproportionate pain, and disease activity among monotherapy initiators
Table 4 Results from mixed models evaluating the impact of treatment class on disease burden, disproportionate pain, and disease activity among combination therapy initiators

In sensitivity analyses, no clinically meaningful differences were noted, with exception of the patient global assessment in the monotherapy initiators; IL-6Ri monotherapy initiators reported higher odds of achieving patient global assessment compared to TNFi monotherapy initiators (OR = 1.62; 1.12–2.35) (Online Supplementary Table 1 and Online Supplementary Table 2).

Among monotherapy (TNFi, n = 319; IL-6Ri, n = 115) and combination therapy (TNFi, n = 617; IL-6Ri, n = 173) initiators with available data on immediate prior TNFi therapy, no meaningful differences were noted between TNFi vs IL 6Ri for the duration of prior TNFi (Table 5).

Table 5 Duration of last prior TNFi, by mono/combination therapy and therapy class groups

Further, no meaningful differences were found between TNFi and IL-6Ri initiators (as mono- or combination therapy) for the use of prednisone, with majority of the patients continuing either at their baseline dose or switching to a low dose/no use of prednisone after 6 months of treatment (Online Supplementary Table 3 and Online Supplementary Table 4).

Discussion

In this retrospective real-world evaluation of a b/ts-experienced cohort, TNFi and IL-6Ri initiators had similar clinical outcomes (i.e., disease activity, PROs, and DP), regardless of whether they initiated as monotherapy or combination therapy.

Due to the limited number of real-world biologic-naïve patients initiating IL-6Ri monotherapy in this large retrospective registry, the present study did not evaluate biologic-naïve patients such as the ones included in the H2H trials, systematic literature reviews, and network meta-analysis, which have shown improved clinical outcomes with IL-6Ri when compared with TNFi as monotherapy [9, 10, 22,23,24,25,26]. Although evidence exists for similar clinical outcomes of TNFi and IL-6Ri as combination therapy in biologic-naïve patients [22, 23], no H2H trials in combination with a csDMARD have been conducted so far.

Compared with prior H2H trials [9, 10], patients in the present study were 4–6 years older, and had 2–4 years longer disease duration and half the disease activity (in terms of CDAI) at baseline. The H2H trials compared the efficacy of IL-6Ri with adalimumab in b/ts-naïve RA patients while the present study included the b/ts-experienced patients on IL-6Ri and various TNFi drugs (not limited to adalimumab). Further, the dose of IL-6Ri (tocilizumab) used in the trial [10] was higher as compared with the approved starting dose in clinical practice and the real-world studies, where tocilizumab (subcutaneous or intravenous) was initiated either at low doses or escalated over time as per the patient’s disease activity [27,28,29]. Lastly, the present study included those patients who either had prior use of csDMARDs and/or were on combination therapy with csDMARDs, while the patients in H2H trials were those considered inappropriate candidates for the continued treatment with MTX. All these differences with the populations included in the previous H2H trials may have contributed to a reduced difference in effectiveness between the two treatments in this study [30, 31].

In line with the recently published definition of “difficult-to-treat RA” by the European Alliance of Associations for Rheumatology (i.e., patients who have failed ≥ 2 b/tsDMARD therapies), there were approximately 30% TNFi and 57% IL-6Ri initiators in this study who would fall in this category and thus, would have been classified as refractory to the treatment [32]. Therefore, it is possible that the larger fraction of “difficult-to-treat patients” in the IL-6Ri cohort may have influenced the results in favor of the TNFi cohort. While most of the components included in the “difficult-to-treat RA” definition were adjusted in the present study, some of the factors (such as radiographic progression) were not adjusted.

There was a high proportion of patients who had prior TNFi exposure (88.8%–92.1%) in our study. Literature suggests that a better treatment response would occur when switching from a TNFi to an alternative mechanism of action therapy [33,34,35,36]. Since many patients initiating a TNFi had already failed another TNFi in our b/ts-experienced cohort, we investigated for a potential selection bias in patients who received a follow-on TNFi. Patients with secondary non-response to a TNFi may be more likely to respond to another TNFi than patients with primary non-response [20, 21]. Thus, duration of prior TNFi therapy was used as a proxy for primary and secondary non-response, assuming therapy discontinued within 6 months after initiation would be more likely to be associated with primary non-response. The distribution of prior TNFi discontinuation within or after 6 months of therapy did not differ between the TNFi and IL-6Ri cohorts. However, time on prior TNFi may not have been a good surrogate for primary and secondary non-response [36, 37].

The current analysis had some important differences compared with other studies, which had suggested better outcomes when patients were switched to a different class of biologics after a TNFi failure rather than rechallenged with another TNFi (i.e., cycling) [33,34,35,36]. In our study, CDAI was the primary outcome, whereas in some similar analyses, persistence was used as an outcome [33, 34, 36]; there were only 6 months of follow-up; and approximately one-third of the TNFi initiators and more than half of the IL-6Ri initiators were on their second treatment switch.

Though similar efficacy has been reported for TNFi and other biologics (with different mechanisms of action) in RA, there may be patient subsets with differences noted in clinical outcomes. For example, the AMPLE trial reported similar efficacy for abatacept and adalimumab in all patients with RA [38], while its exploratory analysis showed an association between seropositivity (anti-cyclic citrullinated peptide antibodies [ACPA] and/or rheumatoid factor) and better clinical response with abatacept than adalimumab [39]. Similarly, various subsets of RA patients have shown better treatment responses with IL-6Ri compared with TNFi [40,41,42,43]. Recently, machine learning was used to identify a rule to predict the treatment response to sarilumab and suggested that the subset of RA patients with ACPA and CRP > 12.3 mg/L might respond better to sarilumab than to adalimumab [40]; this finding was also validated in a real-world setting [41].

The present study was designed to better inform clinicians about treatment options in patients who have failed prior b/tsDMARDs. The major strengths of this study were its observational real-world nature (reflective of current clinical practices in the US) and the large number of enrolled patients [44]. Further, our methodology was based on logistic regression to control confounders, which is known to yield similar results as propensity score methods in observational studies [45, 46]. However, as with every retrospective observational study, there are limitations. Patients and physicians were unblinded to treatment, and there could be unidentified selection or channeling biases (such as factors affecting adherence to therapy, monitoring requirements, and beliefs and preferences of patients/physicians) that may influence the choice as well as outcomes of a therapy in clinical practice [37, 44]. Also, the visits occurred every 6 months in the study and thus, possible dose changes for the treatments that occurred in between visits might not have been captured accurately in the registry, especially if there were multiple changes. Clinical trials, on the other hand, frequently use pre-determined dose escalation schemas. This difference may have accounted for some of the discrepancy in the study findings. Further, there was no assessment done for the relationship between prednisone and/or non-prescription medications (such as non-steroidal anti-inflammatory drugs) with b/tsDMARDs, which was outside the scope of current analyses, and may have an impact on the effect of b/tsDMARD therapies. Lastly, the findings for DP1 and DP2 need to be validated in future studies.

In conclusion, no significant differences were noted in clinical outcomes for TNFi vs IL-6Ri initiators (as mono- or combination therapy) in b/ts-experienced patients in this observational study. Results from RCTs have shown that IL-6Ri therapy in biologic-naïve patients is more efficacious than TNFi therapy. This inconsistency may be explained by the fact that the present study included real-world b/ts-experienced patients. Further analyses may help understand the reasons for this inconsistency and optimize the clinical outcomes for patients with RA.