Introduction

Rheumatoid arthritis (RA) and spondylarthritis (SpA) (axial or peripheral) are chronic systemic inflammatory arthritides (IA), characterized by pathogenetic and phenotypic heterogeneity and diverse responses to treatments [1, 2]. Although distinct molecular and cellular pathways are involved in the pathogenesis of RA and SpA, inhibition of TNFα has been successfully proved as a common therapeutic target. However, response to treatment with TNFα inhibitors (TNFis) is mostly unpredictable, while patients’ disease course varies.

Recently, methods to characterize groups of patients with different disease activity trajectories while on treatment with biologics have gained popularity. Trajectory modelling identifies patient clusters according to the similarity of their disease progression in time, a preferred method compared to the dichotomous (either a responder or a non-responder) indices measuring response as a snap-shot at a chosen timepoint. However, studies of disease activity trajectories using data from randomised clinical trials (RCTs) [3,4,5,6] or form observational settings [7,8,9,10,11,12] are still sparse and mostly include early RA patients and short-term follow-up. Relevant literature for SpA is more limited and performed in groups of patients with early AxSpA [13, 14] or heterogenous populations (early and established AS) receiving various treatments (NSAIDS or biologics) [15]. Furthermore, studies comparing long-term disease activity trajectories between RA and SpA are not available.

Most of the aforementioned studies analyze patients receiving TNFis as a group. However, differences in clinical efficacy and drug survival of different TNFis in RA or SpA have been shown in previous works [16, 17]. Thus, analyzing patients on a single TNFi may exclude any differential effect of other biologics on long-term treatment outcomes. Etanercept is one of the first TNFα inhibitors approved for the treatment of IA and has been shown to be effective in RA and SpA patients both in early and established disease [18, 19].

In the present analysis, we aimed to compare in the same research setting long-term drug survival and disease activity trajectories of RA and SpA patients treated with the etanercept, thus excluding any differential effect of different biologics on treatment outcomes. Furthermore, we sought to determine predictors of good response and longer etanercept survival in the two diseases and we investigated the impact of belonging to different disease activity trajectories in terms of long-term patient function and the cumulative incidence of serious adverse events.

Methods

Cohort

The present study was based on the University of Crete Rheumatology Clinic Registry (UCRCR), a single center prospective cohort study. Patients ≥ 18 years old with IA are included in UCRCR at the time of initiation of the first biologic or targeted synthetic (b-/ts)-DMARD and they are prospectively followed irrespectively of treatment switches for as long as they receive b-/ts-DMARD therapy. Biological treatment initiation, as well as all treatment decisions (bDMARD selection, co-medication, dosage adjustments/switches) are made by the attending rheumatologists based on clinical assessments, national guidelines and patient’s preferences. According to the national guidelines, patients are considered candidates for biologic treatment if they have active disease (in RA defined as DAS28 > 3.2 and in AxSpA ASDAS > 2.1) and have failed previous treatment with at least one disease modifying drug (in RA and perSpA) or two NSAID courses (in AxSpA). According to the protocol, data on demographics, disease characteristics, comorbidities, disease activity, function and quality of life are collected at bDMARD initiation and every 3–6 months for the first 2 years and yearly thereafter [20]. Treatment discontinuations are registered prospectively and classified by the treating physician as due to treatment inefficacy (primary and secondary), adverse event(s), patient decision, pregnancy, disease remission, or other reasons. In cases of loss of follow-up of a patient for more than one and a half years, the patient is reported as “lost to follow-up” at the date of the last recorded follow-up visit.

In the present study, we included patients who received at least one etanercept subcutaneous injection between 1-2004 and 31–12-2020 for the diagnosis of either RA, axial (AxSpA) or peripheral SpA (PerSpA). Diagnosis was based on the judgment of the treating rheumatologist, but the fulfilment of the classification criteria for the respective diagnosis (ACR/EULAR 2010 RA Classification Criteria, ASAS Axial and Peripheral SpA criteria, or PsA CASPAR criteria) was also recorded. Ninety nine percent of the included RA patients, 93% of AxSpA and 100% of peripheral SpA patients fulfilled the aforementioned classification criteria. No specific exclusion criteria were applied. Patients were followed until discontinuation of etanercept, death, loss of follow-up, or 31 May 2021. All patients provided a written informed consent at inclusionin the registry according to the Declaration of Helsinki.

Outcome variables

Disease activity in RA and SpA patients having peripheral disease was measured by Disease Activity Score in 28 joints based on erythrocyte sedimentation rate (DAS28-ESR) [21], Simplified Disease Activity Index (SDAI) [22] and Clinical Disease Activity Index (CDAI) [23], calculated by the following formulas:

$${\text{DAS}}28 - {\text{ESR }} = \, 0.56 * \surd \left( {28{\text{tender joint count}}} \right) \, + \, 0.28 * \surd \left( {\text{Swollen joint count}} \right) \, + \, 0.70 * \ln \left( {{\text{ESR}}} \right) \, + \, 0.014 * {\text{Patient global assessment}}$$
$${\text{SDAI }} = {\text{Swollen joint count }} + {\text{ Tender joint count }} + {\text{ Physician}}'{\text{s}}\;{\text{global assessment}} + {\text{ Patient's}}\;{\text{global assessment}} + {\text{ C}} - {\text{reactive protein in }}{{{\text{mg}}} \mathord{\left/ {\vphantom {{{\text{mg}}} {{\text{dl}}}}} \right. \kern-\nulldelimiterspace} {{\text{dl}}}}$$
$${\text{CDAI}} = {\text{Swollen joint count}} + {\text{Tender joint count}} + {\text{Physician}}^{\prime}{\text{s}}\;{\text{global assessment}} + {\text{Patient}}^{\prime}{\text{s}}\;{\text{global assessment}}$$

Functional status of these patients was assessed by the modified Health Assessment Questionnaire (mHAQ) [24], a short version of the original 20-question HAQ. Disease activity and function in AxSpA patients was measured using the Ankylosing Spondylitis Disease Activity Score based on C-reactive protein (ASDAS-CRP), Bath Ankylosing Spondylitis Disease Activity Score (BASDAI) and Bath Ankylosing Spondylitis Index (BASFI) [25]. BASDAI is a self-administered patient questionnaire assessing fatigue, axial and peripheral symptoms, enthesopathy and duration and intensity of morning stiffness using visual analogue scales (VAS in centimeters, 0–10) [25]. ASDAS has been developed to improve the objectivity of this index and includes the questions of BASDAI concerning the level of axial and peripheral symptoms and the duration of morning stiffness, but also the level of acute phase reactants – either ESR or CRP—and an overall global assessment in VAS (0–10) [26].

Disease activity status in RA and Axial SpA was defined based on published cut-offs of DAS28 [High disease activity (HDA) > 5.1, moderate disease activity (MDA) 3.2–5.1, low disease activity (LDA) ≥ 2.6 and < 3.2 and remission (REM) < 2.6] and ASDAS (very HDA > 3.5, HDA 2.1—3.5, LDA < 2.1 and ≥ 1.3 and REM < 1.3) respectively [27, 28]. Moreover, information on all adverse events [“Medical Dictionary for Regulatory Activities” (MedDRA)-coded and categorized according to severity and any relation to therapy] were analyzed.

Drug retention was calculated as the time period between the first prescription of etanercept and the date of the first missed dose of the drug, death, or 31/05/2021. Temporary treatment interruptions of < 6 months (e.g. due to adverse events, surgeries, loss of insurance, etc.) were allowed. Patients lost to follow-up were censored at their last recorded visit.

Statistical analysis

Data are presented with standard descriptive statistics and differences between groups were analyzed using One-way-Anova (for normally distributed data), the non-parametric Kruskal–Wallis test (for non-normally distributed data) and the Pearson’s chi-square test as appropriate. Post-hoc comparisons were performed using the Tuckey-Kramer test (for normally distributed data), Dunn’s test with Benjamini–Hochberg stepwise adjustment (for non-normally distributed data) [29] and Pairwise Z-test with Bonferroni correction. Kaplan–Meier plots with log-rank tests were used to explore differences in drug survival between the three different diagnoses. For the analysis of time to treatment discontinuation due to ineffectiveness, discontinuations due to other reasons were treated as censored observations. Multivariate Cox and logistic regression models were employed to assess for factors associated with etanercept discontinuation and patient response to therapy, including baseline demographics, disease and patient characteristics. Baseline variables with < 30% missing values were first tested in univariable Cox regression analyses and non-collinear factors with a p-value < 0.15 were included in the multivariable model. Variables with least significance were then excluded stepwise until only variables with a p value < 0.15 remained in the model.

Distinct trajectories of disease activities were identified using latent class growth models (LCGM). For this analysis we used the DAS28-ESR scores (for RA patients) and ASDAS-CRP (for AxSpA patients) and time polynomials (linear, quadratic, cubic) as covariates. For RA patients the time period used was from 0 to 48 months and for AxSpA patients was from 0 to 24 months, due to low number of disease activity scores measured thereafter. PerSpA patients were excluded in this analysis due to the low number of patients. The adjusted Bayesian information Criterion (BIC) was used in order to identify the best fitting model with smaller values indicating a better model fit. Four-group cubic models were identified at the ones with the best model fit. Additional analysis of screen plots for the curves generated from the within sum of squares (WSS) and the η2 coefficient, which is quite similar to the R2, or the proportional reduction of error (PRE) coefficient was performed [30]. Such analysis is typically used in k-means cluster analyses when the number of clusters is unknown. Our analysis indicated that adding more than 4 groups offers little in the reduction of WSS values or the respective increase of the η2 value (Supplementary Figure). We also performed univariate analysis in order to identify potential differences in baseline characteristics between the trajectory groups. For RA patients we additionally performed a linear mixed model plot of HAQ values over time using the trajectory groups produced above as a covariate. Trajectory analyses and plots were performed using the “Traj” plugin for STATA [29, 31]. Intention-to-treat analysis was carried throughout. All analyses were performed using STATA version 16 and Statistical Package for Social Sciences version 22 (SPSS, SPSS Inc). P-values of 0.05 (two- tailed) were considered statistically significant.

Results

Baseline patients’ characteristics

We analyzed 711 patients (RA = 450, AxSpA = 178 and PerSpA = 83) starting etanercept and prospectively followed for 1371 patient-years. The median (interquartile range) follow-up [12 (5.2–32) months] was comparable between the different diagnoses. As expected, there was a heterogeneity in baseline demographics, disease characteristics and co-administered treatments between the three diagnosis groups (Table 1). RA patients were older, had more comorbidities and received etanercept less often as monotherapy, while disease activity indices (DAS28-ESR and ASDAS-CRP) as well as patients’ perception of disease activity status (VAS global) were high and comparable between RA and SpA. Of note, PerSpA patients had lower disease activity status and better function as compared to the two other groups.

Table 1 Baseline patient demographics, disease characteristics and treatments

Long-tern etanercept survival and baseline predictors

Overall, 466 (65.5%) patients discontinued etanercept. Approximately half of the stops (55.8%) occurred within the first year of therapy. Treatment inefficacy was the most frequent cause of discontinuation (70.2% of cases), while stops due to adverse events were infrequent (19%) across all diagnoses (Supplementary Table 1 and Supplementary Table 2 for a description of all adverse events during follow-up). The overall estimated 1-year retention rates of etanercept in RA, AxSpA and PerSpA were 57.5%, 67% and 56.8% respectively, while the respective 5-year retention rates were 22%, 30% and 21% (log-rank p = 0.204 and p = 0.006 for the 1-year and 5-year comparisons respectively). When considering only discontinuations due to inefficacy, 1-year (5-year) estimated retention rates were 64.8% (32%), 77.5% (48%) and 74.5% (42%) respectively (log-rank p < 0.05 for both 1-year and 5-year comparisons) (Supplementary Fig. 1). However, in the multivariable Cox regression analyses of the whole cohort, the clinical diagnosis was not associated with treatment retention for neither inefficacy nor safety discontinuations (Table 2A).

Table 2 Baseline factors associated with Etanercept discontinuation (multivariable Cox regression analyses)

Since inefficacy was the major cause of discontinuations, we performed adjusted analysis for factors predicting etanercept discontinuation due to treatment failure. Both in RA and SpA, treatment cessation was significantly associated with female gender and obesity (BMI > 30). Additionally, higher RDCI, and higher baseline swollen joint count predicted inefficacy discontinuations in RA, while co-administration of csDMARD(s), presumably due to co-existence of peripheral arthritis, predicted treatment failure in SpA patients (Table 2B). Significant predictors for safety-related discontinuations in the whole group were older age and no MTX co-therapy (Table 2A).

RA patients’ long-term disease activity trajectories

We furthermore aimed to categorize RA patients into discrete groups based on the long-term (4-year) course of disease activity assessed by DAS28. LCGM analysis identified 4 distinct groups of patients with similar disease activity trajectories (RATraj 1–4), clearly differentiated as early as from the 6th month of etanercept initiation (Fig. 1). Patients in RATraj 1 (n = 28, 6.3% of total) improved from baseline moderate disease activity (MDA) to inactive disease, while patients in RATraj 2 (n = 123, 27.4%) and 3 (n = 182, 40.6%) improved from baseline high disease activity (HDA) to low disease activity (LDA) and MDA respectively. Improvements were gradual and plateaued after the first year of treatment. In contrast, patients in RATraj 4 (n = 116, 25.7%) remained having HDA throughout the observational period.

Fig. 1
figure 1

A Trajectory plot of disease activity levels of RA patients receiving etanercept. B Linear mixed models plot of mHAQ scores of RATraj groups

Univariate analysis of baseline characteristics between the four RATraj groups indicated several important differences (Table 3). In RATraj 1–2, males were overrepresented, patients had shorter disease duration, better functional status and lower total comorbidity count and RCDI index. On the contrary, RATraj 3–4 were more often obese, with higher baseline disease activity and worse physicians’ and patients’ reported scores.

Table 3 Between-groups univariate comparisons of baseline characteristics of RA patients classified to trajectory groups (RA-Traj 1–4) according to LCGM analysis (n = 449 included in the analysis)

Disease activity significantly contributes to patient’s functional status. Thus, we assessed whether the four trajectory groups presented above, correspond to distinct long-term courses of function (based on mHAQ values). Indeed, results from the linear mixed models indicated statistically significant differences in mHAQ scores between the four RATraj groups since baseline which were preserved over time. RATraj groups 1–3 presented a small yet steady improvement in mHAQ scores, contrary to RATraj group 4 in which functional status worsened over time (Fig. 1).

Furthermore, we investigated the impact of the different disease activity trajectory groups on the incidence of serious adverse events. Interestingly, patients in RATraj 3 and 4 experienced significantly more serious adverse events (SAEs) during follow-up compared to RATraj 1 and 2 (Incidence Rate (IR) for SAEs 11.5/100 patient-years vs 1.91/100 patient-years in RATraj 4 and RATraj 1 respectively, p < 0.001) (Table 3).

Distinct long-term disease activity trajectories for AxSpA patients

We also assessed for 2-year trajectories of AxSpA patients based on disease activity quantified by the ASDAS-CRP. Applying LCGM, we found that patients were grouped in four distinct trajectories (SpATraj 1–4), again as early as from 6th month of etanercept treatment (Fig. 2). Approximately 74% of the patients were grouped in SpATraj 1 (n = 111, 62.4%) and 3 (n = 21, 11.8%) experiencing fast improvement in their disease activity from HDA or very HDA to LDA and inactive disease respectively and steadily remained at these levels for the total follow-up period. One out of four patients were grouped in either SpATraj 2 (n = 40; 22.5%), in which there was a small clinical improvement, but residual high disease activity remained, or in SpATraj 4 (n = 6, 3.4%), in which patients were absolutely non-responders.

Fig. 2
figure 2

Trajectory plot of disease activity levels of AxSpA patients receiving etanercept

Several baseline patient parameters were different across the four AxSpA trajectories groups (Table 4). Univariate analysis between SpATraj groups’ baseline characteristics indicated that patients in SpATraj groups 1&3 (responders) were more frequently males, less often obese, with fewer comorbidities and lower RCDI. Patients in SpATraj group 1 had shorter disease duration, more often received etanercept as monotherapy and fewer of them received prednisolone. Furthermore, participants classified in SpATraj group 1 had lower baseline disease activity levels (ASDAS, VAS global, VAS pain) and lower levels of acute phase reactants (ESR, CRP). SpATraj group 3 patients, who also responded to treatment, had significantly higher CRP values than patients in other groups.

Table 4 Between-groups univariate comparisons of baseline characteristics of AxSpA patients classified to trajectory groups (SpA-Traj 1–4) according to LCGM analysis (n = 178)

Notably, patients responding to etanercept treatment (SpATraj 1 and 3) experienced also less SAEs during follow-up than patients in SpA Traj 2 and 4 (non-responders) (IR for SAEs 5.5/100 patient-years vs 1.6/100 patient-years in SpATraj 2 and SpATraj 1 respectively, p < 0.001; Table 4).

Discussion

RA and AxSpA are heterogenous diseases concerning pathophysiology, phenotypes and response to treatment-prognosis [32,33,34,35]. Most studies have characterized treatment responses of biologic therapies at the group level, unifying all patients and reporting on average disease activity evolution. In the present study we identified 4 long-term disease activity trajectories in both RA and AxSpA patients initiating etanercept therapy in real-world clinical practice and we investigated the baseline characteristics of the patients who follow different disease activity courses. We also compared etanercept retention in patients with either RA and spondyloarthritis and we identified predictors for long-term etanercept survival in the whole cohort and separately for RA and AxSpA.

The major strength of the present study is the large cohort of unselected patients starting etanercept. Patients were prospectively monitored using the same protocol in a tertiary rheumatology center. Disease activity trajectories, as well as treatment response and survival may differ by treatment class and stage of disease. As all patients in this cohort had established disease and received the same main treatment, we consider that trajectories found in the present analysis could represent “true” disease-related courses in a cohort of patients treated with etanercept.

In RA patients, we were able to define four distinct latent disease activity-related subgroups during the course of treatment. Patients in traj. groups 1&2 (33.7% of total) showed a significant clinical improvement from the first 6 months of therapy and remained in remission or low disease activity levels for up-to 48 months of treatment. In 40.6% of long-term etanercept treated patients (RA-Traj group 3) there was only a partial response to therapy and patients had moderate disease activity throughout observation time, while patients in RA-Traj group 4 (25.7% of total) had only limited, if any, response. In this latter group, women were overrepresented, patients were older, had experienced more bDMARDs failures and had more comorbidities. Patients on RA-Traj groups 1 & 2 had shorter disease duration, lower comorbidities count and started etanercept treatment with lower disease activity. These factors are similar to predictors of treatment response in other long-term prospective studies and national registries [36, 37].

Most of previous studies have also shown distinct disease activity trajectories in early as well as in established RA under therapy with different biologics [10, 11, 38]. Differences in described trajectories between the aforementioned studies can be explained by methodological differences, heterogenous populations (early/established RA) and different background therapies. Our data together with the previous reports characterizing heterogenous subgroups of inflammatory burden of RA [10, 11], further corroborate “molecular” heterogenicity of the disease depicted by RNAseq profiling of synovium and peripheral blood, which predicted clinical responses and radiological outcome [33, 39].

It has been previously shown that RA patients on persistent moderate disease activity had more serious adverse events as compared to patients in lower disease activity levels [40], a finding confirmed in the present analysis. Indeed, patients in RA-Traj groups 1&2 with a better long-term control of disease activity experienced less SAEs as compared to RATraj 3 & 4. We consider that patients with persistently higher inflammatory burden are predisposed to a higher frequency of serious infections (a major contributor to SAEs) as well as cardiovascular SAEs, or medication side effects. An additional explanation could be that responders are generally a more “healthy” group according to their baseline characteristics: they are younger, with better functional status, less frequently obese, with lower total number of comorbidities and receive less frequently corticosteroids. On the other hand, function, as measured with HAQ, improved in all groups except in RA-Traj group 4 patients, who remained having HDA and had worsening function course over time (Fig. 1).

Regarding AxSpA, disease activity “endotypes” have been even less characterized. Herein, we have identified 4 distinct disease activity trajectories, based on ASDAS-CRP evolution over 2 years (Fig. 2). We consider that subgroups identified represent valid AxSpA disease activity grouping, since 74% of the patients (AxSpA-Traj groups 1&3) showed significant clinical responses maintained for the 2 years of follow-up, a percentage rather comparable to TNFis retention rate reported by others [41]. Subgroups defined by our analysis in an established AxSpA cohort treated by the same TNFi, seem to be at least partially comparable to those described in a mixed AxSpA population of early and established cohorts on different background therapies [15]. Unfortunately, data assessing functional status in our AxSpA cohort were limited, and thus we could not correlate disease activity trajectories to long-term functional outcomes. Our data further support that females, obesity, disease duration, baseline disease activity and comorbidities’ accumulation are associated with an adverse inflammatory burden course (Table 4). Interestingly most of these parameters were also associated to an adverse outcome in the study of Imkamp et al. [15] and in large registry-based studies [42]. Female gender has been associated with higher incidence of fibromyalgia and higher PROs, thus resulting to less response in several studies [43]. Similarly, disease duration may be associated with accrual of damage and thus worse outcomes while accumulation of comorbidities may affect perception of pain, limit drug options and restrict function. Nevertheless, causality studies or mechanistic studies assessing how the aforementioned factors affect clinical responses are lacking.

Latent class trajectory modeling has been successfully applied before, revealing the existence of several distinct trajectory groups [11]. Literature has pointed out the possible existence of 3–7 distinct groups according to the methodology and outcome variables used in each case. We repeated our trajectory analysis for various number of groups (ranging from two to seven groups) and we chose the best fitting model (which consisted of four trajectory groups) based on the lowest value of BIC. Although results from trajectory analyses should always be interpreted with caution [44], our findings seem to be in accordance with the results produced from drug-survival analysis in terms of the overall drug performance.

Finally, we compared long-term etanercept survival between the three diagnoses, RA, AxSpA and PerSpA. Although in crude analysis patients with RA showed a lower drug survival, in the multivariate analysis after adjustments for several different baseline factors, diagnosis was not a predictor of etanercept discontinuation. This finding, which has been also shown by other groups for golimumab and certolizumab [45, 46], may be considered contradictory with the common perception that survival of TNFis is higher in SpA than in RA patients; nevertheless, it further supports the validity of proper statistical analysis and comparisons between groups in the same study as compared to crude disease-based analysis. Approximately 57.5% (22%), 67% (30%) and 56.8% (21%) of RA, AxSpA and PerSpA patients respectively remained on etanercept at 1 year (5 years). These drug survival rates are lower than what is reported by other registries [47]. However, considering that etanercept was the ≥ 2nd bDMARD in approximately half of the patients in our study, these data are comparable to those reported by other groups both for RA and for SpA [48, 49].

Owing to the observational design of the study, some co-variates were missing and some patients were lost to follow-up. To address this limitation we tried to triangulate our results using both survival analysis, which is a robust method for right-censored data with “non-informative” censorship [50], and latent class growth models. We further included only baseline variables with < 30% missing values in the univariate and multivariate Cox-regression models, as use of completer-analysis may have lowered the generalizability of our results. Furthermore, despite the relatively large overall size of our cohort, the number of patients with SpA—especially peripheral SpA—was low therefore our findings should be interpreted with caution. Moreover, AxSpA patients with available data for the longitudinal analysis were rather limited, and trajectories analysis in this group was performed only for the first two years of follow-up.

DAS28 index was used in our analyses, although SDAI and CDAI may better reflect disease activity in RA. However, CRP and physician’s global assessment needed for these indices is missing in several cases in our dataset, resulting in a lower number of available patients for analysis. A sensitivity analysis performed to model disease activity trajectories based on CDAI identified 3 trajectory groups, while the baseline variables differentiating the groups at baseline were comparable to those shown in the analysis based on DAS28 (data not shown).

Concluding, the present study is the first trajectory modelling analysis of etanercept-treated patients with RA/AxSpA in the same prospective research setting. We identified patients with different responsesand prognosis over time in respect to function and serious adverse events. Male gender, lower disease activity and fewer comorbidities, were found to predict generally a more favorable outcome. These factors could assist rheumatologists for a more personalized treatment approach.