Background

Treatment options for rheumatoid arthritis (RA) have progressed over the past two decades. Methotrexate (MTX), biologic disease-modifying antirheumatic drugs (bDMARDs), and targeted synthetic antirheumatic drugs (tsDMARDs) such as Janus kinase inhibitors (JAKis) have significantly improved disease activity, functional disability, and joint prognosis of RA patients [1, 2]. This clinical evidence is primarily based on well-designed randomized controlled trials; however, they cannot assess long-term efficacy and safety outside of the study period or advocate evidence in the groups of patients who met exclusion criteria due to age or comorbidities [3, 4]. Real-world data obtained from observational cohort studies could not only answer these questions but also describe how patient backgrounds and therapeutic methods affected treatment outcomes in the observational period [5].

The Kyoto University Rheumatoid Arthritis Management Alliance (KURAMA) cohort is an open-label, single-center, observational cohort study of RA and rheumatic diseases [6,7,8]. It was established in 2011, and a total of 4418 patients were registered until 2022. We recorded longitudinal treatment data at every clinical visit, including disease activity, functional disability, and adverse events. We also have conducted the “annual RA survey” that features radiographic examinations on joint destruction, testing for osteoporosis, and patient surveys regarding frailty, sarcopenia, mental status, and more every year since 2012 [9,10,11,12]. The KURAMA cohort also has a biobank of plasma, synovium, and other RA-related specimens [13,14,15].

Between 2012 and 2021, the Japanese Ministry of Health, Labour and Welfare approved two bDMARDs (certolizumab pegol, 2013; sarilumab, 2018) and five JAKis (tofacitinib, 2013; baricitinib, 2017; upadacitinib; peficitinib; and filgotinib, 2021) [16]. This study aimed to investigate how treatment strategies and outcomes evolved alongside advances in therapeutic strategies by reviewing the 10-year experience of the KURAMA cohort from 2012 to 2021.

Materials and methods

Patients

To observe the annual trends in RA management, we gathered clinical data from RA patients, including disease activity, functional disability, the usage of MTX, glucocorticoids (GCs), and b/tsDMARDs, along with demographic and anthropometric details, from the annual RA surveys conducted from 2012 to 2021. For patients treated with b/tsDMARDs, we recorded disease activity; functional disability before and 1, 2, 3, 6, and 12 months after initiation; and the date of initiation and discontinuation. Disease activity was assessed using the Disease Activity Score (DAS) 28-CRP and Clinical Disease Activity Index (CDAI), and functional disability was evaluated by the Health Assessment Questionnaire Disability Index (HAQ) [17, 18]. All patients met the 1987 American College of Rheumatology (ACR) classification criteria or the 2010 ACR/European League Against Rheumatism classification criteria for RA diagnosis. The KURAMA cohort study was approved by the Kyoto University Graduate School of Medicine Medical Ethics Committee (R0357), and informed consent was obtained from all patients.

Data analysis

Since the annual cross-sectional dataset had relatively low missing value rates, with the highest rate at 8.4% for DAS28-CRP, data with missing values were excluded from subsequent statistical analyses.

We employed propensity score matching to address potential confounding factors that could impact the effectiveness of b/tsDMARDs. The reference group consisted of patients treated with CTLA4-Ig (abatacept). The following variables were used to account for potential confounding factors: age, sex, CDAI, DAS28-CRP, HAQ, disease stage, disease class, methotrexate (MTX) and glucocorticoid (GC) dosages, rheumatoid factor (RF), and anti-cyclic citrullinated peptide antibody (ACPA) titers at baseline. One-to-one matching without replacement was performed using the nearest neighbor match on the logit of the propensity score with the caliper width set to the standard deviation of the logit of the propensity score. Student’s t-test and the chi-square test were used to analyze continuous variables and categorical variables, respectively.

For the time-series analysis, we fitted mixed-effects models to evaluate the trends in disease activity and functional disability measures [19]. All models were fitted with patient-specific random intercepts to account for inter-patient variations. Box-Cox transformation and square root transformation were applied to CDAI and HAQ, respectively, to fix skewness and gain a better fitting [20, 21]. Age, sex, BMI, RF, ACPA, Steinbrocker’s stage, class, dose of MTX, and dose of GCs were used as covariates.

Statistical analyses were performed using scipy v1.7.1, statsmodels v0.12.2, and scikit-learn v0.24.2. Propensity score matching was performed using psmpy v0.3.13. Kaplan‒Meier survival estimation was performed using lifelines v0.27.3 running on Python v3.9.7. We used ggalluvial v0.12.5 for the Sankey diagram, car v3.1–2 for Box-Cox transformation, and glmmTMB v1.1.7 for a mixed-effect model, running on R v4.2.0. Graphs were drawn using matplotlib v3.4.3, seaborn v0.11.2, and ggplot2 v3.4.2.

Results

Changes in RA patient background and RA treatment trends from 2012 to 2021

From 2012 to 2021, a total of 1156 patients and a cumulative total of 5070 patients participated in annual RA surveys. The mean age of RA patients increased from 62.9 in 2012 to 65.9 in 2021, and the percentage of patients younger than 60 years old was less than 20% in 2021 (Fig. 1A, B, and Supplementary Table S1). This trend may reflect the aging society of Japan. Regarding treatment, the percentage of patients treated with MTX was over 60% in all periods and slightly decreased between 2012 and 2021 from 70.8 to 64.3% (Fig. 1C). GC usage decreased (40.5–18.6%), whereas b/tsDMARD usage (29.5–53.2%) increased, indicating that more intensive treatment has been administered to RA patients in the past decade (Fig. 1C). The patients’ demographics and treatment methods during the observation period are summarized in Supplementary Table S1.

Fig. 1
figure 1

Change in patients’ demographics and treatment from 2012 to 2021. A Box plots showing the change in the mean patient age. B Change in the proportion of age groups. C Percentage of patients treated with MTX, GCs, and b/tsDMARDs

The transition of disease activity and functional disability in 10 years

We next investigated how disease activity and functional disability changed over 10 years. The average DAS28-CRP sharply decreased from 2012 to 2015, during which the number of patients treated with b/tsDMARDs increased (Fig. 1C) and then remained stable thereafter (Fig. 2A and Supplementary Table S2). The percentage of patients who achieved DAS28-CRP remission gradually increased and reached 79.7% in 2021 (Fig. 2B). Similarly, CDAI decreased, and the rate of CDAI remission also increased from 25.1 to 48.1% in 10 years (Fig. 2C, D, and Supplementary Table S2). Moreover, the median HAQ scores decreased over time from 0.69 to 0.25 (Fig. 2E and Supplementary Table S2).

Fig. 2
figure 2

Evolution of disease activity and functional disability over 10 years. A Change in DAS28-CRP. B Proportion of disease activity categories defined by DAS28-CRP. C Change in CDAI. D Proportion of disease activity categories defined by CDAI. E Change in HAQ

Treatment outcomes of b/tsDMARDs in the KURAMA cohort

Because an increasing number of patients in the KURAMA cohort were treated with b/tsDMARDs and achieved good clinical outcomes from 2012 to 2021, in which several new b/tsDMARDs were approved, we next analyzed the real-world treatment outcomes of b/tsDMARDs using longitudinal observational data on the treatment responses to b/tsDMARDs. We found 1816 new prescriptions (820 for b/tsDMARD-naïve patients and 996 for b/tsDMARD-switching patients) between 2012 and 2021. Baseline demographic data are shown in Supplementary Table S3; the patients in this dataset partially overlap with those in Supplementary Tables S1 and S2. Supplementary Fig. S1 shows the trends in the proportion of initiated b/tsDMARDs categorized by mode of action (MOA) from 2012 to 2021. The proportion of IL-6 receptor inhibitors (IL6Ri; tocilizumab and sarilumab) was almost consistent over 10 years, accounting for approximately 10% of naïve cases and 20% of switch cases. The initiation of TNF inhibitors (TNFi; infliximab, etanercept, adalimumab, golimumab, and certolizumab pegol, including their biosimilars) and CTLA4-Ig decreased in both types of cases from 2012 to 2021. The prescription of JAKis increased from 2014 in switch cases first, and JAKis were more recently initiated in over 20% of naïve patients (Supplementary Fig. S1). Concomitant usage of MTX in b/tsDMARD-naïve patients and switched patients was 74.5% and 63.3% in TNFi, 40.7% and 56.6% in IL6Ri, 47.3% and 48.5% in CTLA4-Ig, and 62.0% and 47.0% in JAKi, respectively (Supplementary Table S3).

We next compared the effectiveness of b/tsDMARDs, assessing the transition of DAS28-CRP, CDAI, and HAQ as depicted in Supplementary Fig. S2. Each MOA achieved low disease activity or remission according to the averaged DAS28-CRP and CDAI at 6 months (Supplementary Fig. S2). Similarly, in switched cases, all MOAs achieved CDAI levels below the threshold for low disease activity at 12 months (Supplementary Fig. S2). To address potential confounding factors, we applied propensity score matching, with the CTLA-4 group as the reference group, to evaluate the drug effectiveness in comparison to CTLA-4. Notably, we excluded JAKis from b/tsDMARDs-naïve cases, as the number of patients treated with JAKis as a first-line therapy was fewer than other MOAs (Supplementary Table S3). In b/tsDMARDs-naïve patients, TNFi showed a statistically better improvement in CDAI and DAS28-CRP at 1 and 2 months, and IL6Ri significantly ameliorated CDAI at 2, 6, and 12 months, and DAS28-CRP at 1, 2, 6, and 12 months compared to CTLA4-Ig (Table 1). All MOAs exhibited gradual improvements in HAQ, with no statistical differences (Table 1). In switched cases, all measures of disease activity and functional disability were comparable across all comparisons (Table 1).

Table 1 P-values computed to compare the effectiveness of b/tsDMARDs in the propensity score-matched groups

The retention of b/tsDMARDs is another indicator of successful treatment [22,23,24,25]. For Kaplan‒Meier analysis, we defined an event as when the prescription of b/tsDAMRDs ceased due to inefficacy and adverse events, and discontinuation due to remission, socioeconomic issues, or the patient’s decision was defined as a censored case. There was no significant difference in the retention rate in b/tsDMARD-naïve patients (Fig. 3A). Moreover, IL6Ri had a significantly better retention rate (p < 0.005) in switched patients (Fig. 3B), which agrees with a previous report [23]. The MOA and continuation/discontinuation status of the first b/tsDMARDs in naïve patients and the choice of MOA when switching are summarized in the Sankey diagram (Supplementary Fig. S3). Different MOAs tended to be chosen when switching from IL6Ri; however, the same MOA was selected in approximately half of the cases when switching from JAKi and TNFi (Supplementary Fig. S3). These results show treatment outcomes of b/tsDMARDs between different MOAs in a real-world setting.

Fig. 3
figure 3

Drug retention rate of b/tsDMARDs. A, B Kaplan–Meier survival curves showing drug retention comparing different MOAs. The X-axis shows the number of days after b/tsDMARDs initiation, and the Y-axis shows the retention probability. Statistics used: log-rank test. A Drug retention in b/tsDMARD-naïve patients. B Drug retention in switched patients

Statistical implications of the annual trends in disease activity and functional disability

To know the statistical implications of observed trends in DAS28-CRP, CDAI, and HAQ from 2012 to 2021 (Fig. 2A–D), we used a mixed-effect model for adjusting patient-specific characteristics [19]. The annual changes in CDAI and HAQ were statistically significant, but not DAS28-CRP (Fig. 4A–C and Table 2). Patients’ demographic data such as age, stage, and class significantly affected HAQ, and b/tsDMARDs, MTX, and GC use were significantly associated with DAS28-CRP and CDAI. b/tsDMARDs use was also associated with HAQ (Table 2).

Fig. 4
figure 4

Mixed-effect model fitted to the annual change in disease activity and functional disability. AC The dots denote the observed median values, and the lines represent the regression line fitted using mixed-effect models. A DAS28-CRP. B CDAI. C HAQ

Table 2 Results of the mixed-effect model

Discussions

RA treatment strategies have evolved over the past 20 years. Two bDMARDs and five tsDMARDs were launched in the last decade, and several drugs with novel MOAs are in clinical trials. Therefore, tracking the RA management of individual patients and analyzing real-world data is crucial for addressing whether the management of RA has improved in the real-world setting. In this study, we analyzed the trends and outcomes of RA treatments in the KURAMA cohort between 2012 and 2021; we showed that (1) the percentage of patients who were treated with GCs and b/tsDMARDs changed significantly, (2) treatment responses and drug retentions differed between different MOAs, and (3) metrics of disease activity and functional disability improved over time. Here, we demonstrated the disparity in b/tsDMARD effectiveness and yearly improvements in the overall disease activity and functional disability using propensity score matching and mixed-effects modeling to account for patient-specific factors.

We found that GC use consistently decreased while b/tsDMARD use increased over a decade, which is consistent with other studies that observed real-world trends of RA medications [5, 26]. However, some reported that the proportion of patients treated with b/tsDMARDs or GCs was stable over the observation period [16, 27]. Regarding treatment outcomes, studies have shown a stable decrease in disease activity and functional disability indices from the 1990s or early 2000s to the early 2010s [5, 27,28,29,30,31,32]. In contrast, some studies showed that the average DAS28 or DAS28 remission rate remained the same after the early 2010s, which is consistent with our data [5, 28]. We found that CDAI decreased, and the CDAI remission rate continued to increase from 2012 to 2021. This discordance could be attributed to the difference between DAS28-CRP and CDAI. CDAI criteria are reportedly more stringent than DAS28-CRP criteria, and DAS28-CRP does not correlate with CDAI when patients achieve DAS28 remission [33,34,35,36]. Aletaha et al. reported that the sensitivity of CDAI to subtle changes in disease activity (ACR < 20) is higher than DAS28 and Simplified Disease Activity Index (SDAI) [37]. In the KURAMA cohort, approximately 80% of patients achieved DAS28-CRP remission in 2021; thus, DAS28-CRP might not have correlated with CDAI in the current study.

Various treatment outcomes of b/tsDMARDs have been observed in randomized control trials and real-world studies [38,39,40,41,42,43]. In this study, we analyzed the treatment outcomes in b/tsDAMRD-naïve patients and switched patients and found the following: (1) in naïve patients, TNFi most effectively improved disease activity and functional disability after 3 months, though there were no significant differences in disease activity, functional disability, or drug retention rate between MOAs at 12 months, and (2) in switched patients, IL6Ri yielded significantly lower DAS28-CRP at 12 months and higher retention rate compared to reagents with other MOAs. These findings agree with a recent systematic review where non-TNFi drugs showed better retention rates, and there were no significant differences between bDMARDs in bDMARD-naïve patients [44]. We also observed lower concomitant usage of MTX in IL6Ri users. This trend may reflect the recommendations for RA treatment [45, 46]. The lower concomitant use of MTX and higher concomitant use of GCs in the CTLA4-Ig group may reflect the differences in patient characteristics such as age, comorbidities, and MTX intolerance [47].

Regarding drug persistence, IL6Ri had a better drug retention rate in the KURAMA cohort. Our findings are consistent with previous studies reported by Li et al., who assessed the treatment outcomes of over 8000 patients and observed the superior drug persistence of IL6Ri. Ebina et al. and Jinno et al. also found that IL6Ris had a better retention rate in the ANSWER cohort, a multicenter observational RA cohort where the KURMA cohort participates [23, 48, 49]. Finally, a systematic review by de Castro et al. showed greater persistence of non-TNFi over TNFi [44]. The improvement of disease activity and functional disability in the KURAMA cohort could be attributed to the evolution of RA treatment strategies during the past decade, in which the usage of b/tsDMARDs increased.

The emergence of b/tsDMARDs has altered RA treatment strategies. In this study, we aimed to assess whether the increased treatment options have indeed benefited RA patients in a real-world setting. To the best of our knowledge, this is the first study that revealed annual improvements in disease activity and functional disability using a mixed-effect model. Although it cannot be certain that the yearly trends in CDAI and HAQ in the mixed-effect models represent the increase in treatment options, some studies support that this may be the case. In the study period, RA refractory to multiple b/tsDMARDs was defined as difficult-to-treat RA [50,51,52,53]. Ochi et al. reported that JAKi is preferable for difficult-to-treat RA [54]. The emergence of JAKis and their usage optimization might positively affect RA management. Another factor that could account for the annual decline in disease activity and functional disability is the encouragement of exercise or physical therapy. Exercise is reported to improve functional disability in RA patients [55]. We previously reported that sarcopenia was associated with worse RA management [56]. Advances in treatment and care may have contributed to the annual improvements in disease activity and functional disability.

This study has the following limitations: First, individual patient circumstances, such as comorbidity, allergy history, and socioeconomic status, can affect the choice and outcome of RA treatment, but these factors were not considered here because this study analyzed a relatively large dataset of 5070 RA patients who participated in the annual RA survey and 1816 RA patients who initiated b/tsDMARDs. Given the nature of this real-world observational cohort study, the presence of other, possibly unknown, confounding factors may have influenced the results. Second, the annual RA survey does not include all RA patients who are regularly treated in our hospital.

Conclusions

In summary, we reviewed the real-world transition of therapeutic strategies and their outcomes in the 10-year history of the KURMA cohort. The disease activity and functional disability metrics of patients with RA improved over a decade with increased use of b/tsDMARDs.