FormalPara Key Summary Points

Why carry out this study?

Head-to-head comparisons of biologic/targeted synthetic DMARDsare needed to help guide rheumatology provider  prescribing.

To evaluate the effectiveness of abatacept versus tofacitinib in CCP+ (a surrogate for anti-cyclic citrullinated peptide [ACPA]) patients with rheumatoid arthritis.

What was learned from this study?

In adjusted analyses, CCP+ patients with RA initiating treatment with abatacept did not show a statistically significant difference in reducing disease activity or improving patient-reported outcomes compared with CCP+ patients initiating treatment with tofacitinib.

To achieve a highly individualized and tailored approach to patient management, physicians will need to consider factors including comorbidity burden, patient preference, disease progression, and the risk–benefit ratio for each agent when selecting medications.

Introduction

Rheumatoid arthritis (RA) is a chronic inflammatory arthritis associated with joint inflammation, impaired quality of life, and increased rates of disability [1]. Over the last two decades, advances in our knowledge of RA have increased significantly. Knowledge such as recognizing that RA starts early and damage can develop before clinical symptoms present has led to early diagnosis and instituting prompt treatment within the window of opportunity [1, 2]. Another notable advancement is a better understanding of the disease pathogenesis, which has led to the development of targeted therapeutics that allow for reasonable control of signs and symptoms, thus improving the overall outcome of the disease [3]. To further help patients achieve sustained remission or low disease activity (LDA), the identification of specific serum biomarkers indicative of factors such as poor prognosis, worse outcomes, erosive disease, and less responsiveness to therapies would be beneficial when developing a personalized treatment approach for patients [4].

Rheumatoid arthritis patients may respond differently to medications with different modes of action based on their underlying serology. In recent years, significant advancements have helped clarify the mechanistic basis of the clinical associations of autoantibodies in RA [5]. One of the hallmarks of RA is the production of autoantibodies, including anti-citrullinated protein antibodies (ACPA). ACPAs are included in the 2010 American College of Rheumatology (ACR)/European League Against Rheumatism (EULAR) diagnostic criteria) [6]. ACPAs are highly specific serum biomarkers often presenting in serum before the onset of disease [7]. The presence of ACPAs suggests a poor prognosis including rapid disease progression, more severe joint damage, and increased disability [8, 9]. Likewise, RA patients who are positive for anti-cyclic citrullinated peptide (CCP+, a surrogate for ACPA) antibodies are more likely to develop severe, erosive disease than those who are anti-CCP-negative (CCP–) [8,9,10].

The wide array of biologic disease-modifying antirheumatic drugs (bDMARDs) has led to tremendous advancements in the treatment of RA. Although there are well-established prognostic factors at the group level (i.e., high disease activity, the early presence of erosive changes, extra-articular disease, previous biologic failure), they do not constitute specific predictors of response to a particular drug; thus, we are still unable to predict outcomes at the patient level by treatment. Hence, health care providers are left to implement the trial-and-error approach within the permitted treatments of insurance coverage to prescribing advanced therapies [11].

Prior work has demonstrated that response to RA therapies may vary based on CCP± status. Specifically, patients who were CCP+ had a more significant clinical response to treatment with abatacept than those who were CCP–. However, this relationship was not seen with tumor necrosis factor α inhibitors (TNFi) [12, 13]. A better understanding of the effectiveness of well-established b/tsDMARDs s for treating RA in particular subgroups of patients based on serologic biomarkers will enable more individualized care. To that end, in this current study, we used propensity score matching (PSM) to compare the effectiveness of abatacept versus tofacitinib in a large real-world cohort of CCP+ patients.

Methods

Study Sample

Data Source

The CorEvitas (formerly known as Corrona) RA Registry is an independent, prospective, national, observational cohort. Patients were recruited from 207 private practices and academic sites with 880 participating rheumatologists across 42 states in the US. Treatment and outcomes data for patients with RA are collected and analyzed [14]. As of March 31, 2022, the registry included 217 private and academic active clinical sites with 931 physicians throughout 42 states in the US. Data on 458,982 patient visits and approximately 228,871 patient-years of follow-up observation time have been collected. The mean duration of patient follow-up was 4.8 years (median 3.4 years).

Population

This study selected patients from the CorEvitas Registry who met the following inclusion criteria: RA confirmed by a rheumatologist; ≥ 18 years of age; initiation of abatacept or tofacitinib between December 2012 (when the FDA approved tofacitinib) and October 2019; CCP + (titer ≥ 20 U/ml) any time before starting or at the start of treatment; had a follow-up visit at 6 months (± 3 months) after starting treatment; a Clinical Disease Activity Index (CDAI) score at initiation or prior to treatment initiation and the 6-month follow-up visit; was not in remission based on CDAI at treatment initiation. There were 369 CCP+ abatacept and 475 CCP+ tofacitinib patients who had a 6-month follow-up visit; CDAI at initiation and at 6 months; and were not in remission at the index visit.

Propensity Score Matching

This study performed 1:1 frequency matching by the number of prior biologics (0, 1, or 2+ prior biologics) between abatacept and tofacitinib CCP+ patients. Once matched based on prior biologic use, we assessed the level of imbalance between the abatacept and tofacitinib CCP+ patients using standardized differences.

Standardized differences provide a measure of clinically important difference even if there is no statistically significant difference between the treatment groups [15, 16]. Therefore, we used standardized differences > 0.10 as suggesting meaningful differences between groups. Variables with standardized differences > 0.10 between the groups were considered to have remained potentially unbalanced. After considering standardized differences based on the frequency matching, a propensity score model for treatment (abatacept versus tofacitinib) [16] was fit using the following variables in the final propensity score model: gender, duration of RA, college education, number of prior conventional synthetic (cs) DMARDs (csDMARDs), number of prior bDMARDs, targeted synthetic (ts) DMARDs, follow-up months, baseline CDAI, history of cancer, history of cardiovascular disease (CVD), body mass index (BMI), monotherapy, and prednisone dose > 7.5 at index visit.

We matched the treatment groups and compared characteristics between matched groups using the propensity score. For patients who switched agents during the 6 months of follow-up, we used the outcome measure at the time of switch (rather than at 6 months) for continuous outcomes. For dichotomous outcomes, we imputed them as non-responders (e.g., did not achieve remission for patients that switched prior to their 6-month follow-up visit). For patients who discontinued their medication prior to the follow-up visit without switching to another b/tsDMARD, their outcomes at the follow-up visit were used.

Measures and Data Collection

Data were collected during the study period from physician assessment and patient questionnaires completed during routine clinical visits. These forms were used to gather information on disease severity and activity (including serologic markers) and components of ACR response criteria); comorbidities; use of medications including steroids, csDMARDs, tsDMARDs and bDMARDs; and adverse events. As a strictly observational registry that reflects typical clinical practice, the CorEvitas registry does not mandate that laboratory data, including serologic markers and acute-phase reactants, be performed, however asks participating investigators to submit these results if they were performed.

Data on demographics, insurance status, comorbid conditions, RA disease characteristics, and RA medications were available for > 97% of patients. Data disease activity elements collected included: CDAI; swollen joint count in 28 joints; tender joint count in 28 joints; Physician Global Assessment (PGA) and Patient Global Assessment (PtGA); modified ACR 20, 50, and 70% response criteria (mACR20, mACR50, and mACR70). The mACR is based on achieving the swollen and tender joint responses, as well as two out of four additional measures, including the modified Health Assessment Questionnaire (mHAQ) to assess physical function, patient pain, patient fatigue, and the Patient Global Assessment (PtGA); it does not include erythrocyte sedimentation rate (ESR) or C-reactive protein (CRP).

Study Outcomes

The primary outcome was mean improvement in CDAI score over 6 months (± 3 months) following treatment initiation among anti-CCP+ patients treated with abatacept versus tofacitinib. The secondary outcomes compared the response to therapy among CCP+ patients who were treated with abatacept versus tofacitinib at 6 months based on the following outcomes: the achievement of CDAI remission (CDAI score ≤ 2.8); the achievement of CDAI LDA (CDAI score ≤ 10) among those not in remission or LDA at initiation; and achievement of modified ACR20, ACR50, ACR70 responses which included changes from baseline for the modified HAQ (mHAQ), patient pain, patient fatigue, and PtGA. The proportion of patients in each group who discontinued or switched therapies over the study period was assessed.

Statistical Analysis

Patient demographic and disease characteristics at initiation for the two treatment groups were reported with calculated standardized differences. The outcome of response at 6 months was modeled using mixed-effects models. The primary fixed effect of interest was the treatment group (using tofacitinib initiators as the reference group), and site identification was the random effect. After PSM, remaining unbalanced variables, based on standardized differences > 0.10, were included as covariates in the adjusted multivariable models. Covariates included in the multivariable model were age, gender, race, work status, Medicare, blood pressure (systolic), history of non-TNFi DMARD use, baseline CDAI, and mHAQ at index visit.

Compliance with Ethics Guidelines

The study was performed in accordance with the Declaration of Helsinki and the Guidelines for Good Pharmacoepidemiology Practice (GPP). All participating investigators were required to obtain full board approval for conducting noninterventional research involving human subjects with a limited dataset. Sponsor approval and continuing review was obtained through a central Institutional Review Board (IRB), the New England Independent Review Board (NEIRB; no. 120160610). For academic investigative sites that did not receive authorization to use the central IRB, full board approval was obtained from their respective governing IRBs, and documentation of approval was submitted to CorEvitas, LLC before the site’s participation and initiation of any study procedures. All patients in the registry were required to provide written informed consent and authorization before participating.

Results

A total of 369 CCP+ abatacept patients and 475 CCP+ tofacitinib patients met the inclusion criteria. After 1:1 frequency matching based on the number of previous biologics, there were 369 patients in each group (Fig. 1). The baseline characteristics in the frequency-matched cohort before PSM showed that the two treatment groups differed (absolute value of the standardized differences > 0.10) in terms of gender, race, education, BMI, working status, comorbidities, duration of RA, physician global assessment, and prior treatment of RA. After considering standardized differences based on the frequency matching, a propensity score model for treatment (abatacept versus tofacitinib) was fit. After PSM, the overall population included 291 patients in each group (Fig. 1). In the overall PS matched population, most patients were female, 61 years old, with a RA duration of 12 years, and the age of RA onset at 48 years (Table 1). Age, gender, race, and baseline CDAI were specified a priori to be included as covariates in adjusted models. Additional baseline characteristics that remained unbalanced (standardized difference > 0.10) between treatments and were included as covariates in adjusted models included: systolic blood pressure, work status, Medicare, mHAQ, and history of non-TNFi DMARD use. While the mean change from baseline CDAI to 6-month CDAI was numerically greater in the abatacept group than in the tofacitinib group, the difference between the groups was not significant.

Fig. 1
figure 1

Patient flow chart. *Propensity score matching was based on gender, duration of RA, college education, number of prior cDMARDs, number of prior b/ts DMARDs, follow-up months, baseline CDAI, history of cancer, history of CVD, BMI, monotherapy, and prednisone dose > 7.5 at index visit. cDMARD conventional disease-modifying antirheumatic drugs; b/ts DMARD biologic/targeted synthetic DMARD; CCP cyclic citrullinated peptide

Table 1 Patient demographic and clinical characteristics of CCP + abatacept (ABA) initiators compared with CCP+ tofacitinib (TOFA) initiators (after propensity score matching, aN = 291 in each group)

After adjusting for covariates that were unbalanced between the abatacept and tofacitinib anti-CCP+ patients after PSM, there was no significant difference in change from baseline CDAI to 6-month CDAI between the two groups (abatacept 4.59 [95% CI 3.62, 5.55] vs. tofacitinib 4.10 [95% CI: 3.07, 5.13]; [P = 0.936]) (Table 2). There was also no significant difference between the percent of patients who remained on the drug at the 6-month visit; percent who discontinued initiated drug and did not start another biologic at/before 6 months; and the percent who switched initiated drug to another biologic at/before the 6-month visit (P = 0.265) (Table 2). In exploratory analyses, there were no significant differences when looking separately by line of therapy (0, 1, and ≥ 2 prior b/tsDMARDs) (Supplementary Table 1).

Table 2 Primary outcome among CCP+ initiators of abatacept (ABA) vs. tofacitinib (TOFA)

After adjusting for remaining unbalanced covariates between the abatacept CCP+ and tofacitinib CCP+ patients after PSM, there was a numerical advantage in terms of improvement of patient-reported outcomes with treatment with abatacept as compared to tofacitinib, but none were significant: change from baseline to 6-month patient pain (P = 0.682), change from baseline to 6-month patient fatigue (P = 0.531), or change from baseline to 6-month PtGA (0.726) [Fig. 2]. There was no difference with respect to change from baseline to 6-month mHAQ (P = 0.817) between drug groups. In terms of the adjusted odds ratios for binary secondary outcomes, there was a numerical advantage for abatacept with respect to achievement of remission and mACR20 and a numerical advantage for tofacitinib with respect to achievement of LDA, amACR50, mACR70). Overall, there were no significant differences between abatacept and tofacitinib CCP+ patients for these binary secondary outcomes [Fig. 3].

Fig. 2
figure 2

Secondary continuous outcomes measured at 6 months in CCP+ initiators of abatacept vs. tofacitinib. aAdjusted for covariates since CCP+ initiators of abatacept vs. tofacitinib were unbalanced after propensity score matching. Covariates include age, gender, race, work status, Medicare, blood pressure (systolic), history of non-TNFi DMARD use, baseline CDAI and modified Health Assessment Questionnaire (mHAQ) at index visit. bThe P value tests the difference between the CCP+ abatacept and CCP+ tofacitinib initiators according to the secondary outcome

Fig. 3
figure 3

Secondary binary outcomes measured at 6 months in CCP+ initiators of abatacept vs. tofacitinib. aAdjusted for covariates since CCP+ initiators of abatacept vs. tofacitinib were unbalanced after propensity score matching. Covariates include age, gender, race, work status, Medicare, blood pressure (systolic), history of non-TNFi DMARD use, baseline CDAI and modified Health Assessment Questionnaire (mHAQ) at index visit. bThe P value tests the difference between the CCP+ abatacept and CCP+ tofacitinib initiators according to the secondary outcome. cCDAI ≤ 10 among those with moderate or high disease activity. dCDAI ≤ 2.8 among those with low disease activity or more severe. eModified ACR: based on two out of four measures (not using erythrocyte sedimentation rate [ESR] or c-reactive protein [CRP])

Discussion

Using data from CorEvitas, a large US-based RA registry, we conducted a comparative effectiveness study comparing the effectiveness of abatacept to tofacitinib in a population of patients who were CCP+ . In adjusted analyses, CCP+ patients with RA initiating treatment with abatacept did not show a statistically significant difference in reducing disease activity or patient-reported outcomes compared with patients initiating treatment with tofacitinib.

Findings from clinical trials and real-world studies have also found no differences in abatacept compared with tofacitinib. Vieira et al. [17] compared tofacitinib versus abatacept, golimumab, rituximab, and tocilizumab in a network meta-analysis consisting of five trials based on 2136 patients. When comparing tofacitinib and abatacept efficacy outcomes (e.g., ACR20, ACR50, and ACR70) at weeks 12 and 24, risk estimates of ACR response rates for tofacitinib 5 mg BID were comparable to abatacept. In a real-world observational study of tofacitinib using prospectively collected data from an Israeli registry, Shouval et al. [18] focused on event-free survival (EFS), defined as the time from treatment initiation to treatment failure from any cause. They found that EFS with tofacitinib was similar to abatacept in a univariate (hazard ratio [HZ] = 1.05, 95% CI 0.74–1.49, P value: 0.791) and multivariable analysis (HZ = 1.24, 95% CI 0.85–1.81, P value: 0.269). Hirose et al. [19] used propensity score matching (PSM) in a multi-center, retrospective, observational study in 12 hospitals and clinics for RA in Japan. Patients included 70 PS matched pairs of tofacitinib-abatacept patients. At week 24, there were no significant differences in improvement in DAS28-ESR, CDAI, and SDAI and achievement of DAS-ESR remission in the two treatment groups.

We designed this observational study as a head-to-head comparison using PSM among CCP+ patients so each therapy has a fair and equal chance to succeed. Our prior work has demonstrated that response to RA therapies may vary based on CCP+ status. Specifically, in preliminary analyses, we found that patients who were CCP+ had a more significant clinical response to treatment with abatacept than those who were CCP–. We did not find a relationship between CCP+ and response to tofacitinib [12]. However, a post hoc pooled analysis of five phase III randomized clinical trials and a pooled analysis of long-term extension studies found that patients who were CCP+ /rheumatoid factor (RF)+ were more likely to achieve response with tofacitinib compared with those who were CCP−/RF− [20, 21]. For this reason, our comparison was in the CCP+ populations of abatacept and tofacitinib initiators.

This study has several strengths. The CorEvitas RA Registry is a US-based registry that includes a large cohort of patients with RA with physician-validated outcome measures. Previous analyses have compared Medicare patients with RA enrolled within the RA Registry to those not enrolled and found similar demographic and comorbidity characteristics, supporting the generalizability of data from the CorEvitas RA Registry [22]. The study is methodologically robust to address channeling of medications as PSM was used to identify patients with similar characteristics within each line of therapy to offset potential selection bias. Furthermore, the clinician can benefit from this study, as it provides real-world clinically relevant evidence that may assist them in making more informed treatment decisions. This study also has limitations. In real-world settings, patients are not randomized to treatment, but physicians select medications based on specific individual patient profiles and thus can lead to bias. We performed PSM to try an offset selection bias by matching patients with similar characteristics, including disease activity, severity, and line of therapy. The sample size was small, and the duration of follow-up (6 months) was short for both treatments. Given the person-time available for the patient groups included in the manuscript, we were not powered to adequately compare safety in the two groups. In addition, our patient cohort was a majority White population with health insurance and thus results may not be generalizable to other patient populations.

Conclusions

In adjusted analyses, CCP+ patients with RA initiating treatment with abatacept did not show a statistically significant difference in reducing disease activity or improving patient-reported outcomes compared with CCP+ patients initiating treatment with tofacitinib. Additional studies with longer follow-up times would be of interest to explore these findings in a larger population, particularly in b/tsDMARD-naive patients. According to EULAR treatment guidelines, the goal of RA treatment is to achieve sustained remission or low disease activity [23]. When selecting medications, physicians will need to consider factors including comorbidity burden, patient preference, disease progression, and the risk–benefit ratio for each agent in order to achieve a highly individualized and tailored approach to patient management.