Introduction

Axial spondyloarthritis (axSpA) is a chronic inflammatory disorder that predominantly affects the axial skeleton and the sacroiliac joints (SIJs) [1, 2]. Patients with axSpA are characterized by inflammation at the joints and entheses, destruction, and repair, as well as new bone formation [3]. The pathogenesis of axSpA is not clearly elucidated [4], but tumor necrosis factor α (TNF-α) plays an important role as a pro-inflammatory cytokine in the disease, resulting in joint inflammation [5]. Therefore, treatment with TNF-α inhibitors, such as adalimumab (ADA), is recommended after treatment failure to non-steroidal anti-inflammatory drugs (NSAIDs) [6]. TNF-α inhibitors have been shown to reduce disease activity within 12 weeks [7]. However, it may be detected much faster if assessed by blood-based biochemical markers, specifically pharmacodynamic biomarkers, which may assess the treatment response [8].

Tissue damage is a central pathological feature in axSpA [8], and several studies have shown that there is extracellular matrix (ECM) remodeling involved in the disease [9]. The ECM comprises a complex network of structural proteins and is essential for tissue function, structure, and homeostasis [10]. The main structural element of the ECM is collagens, and type I, II, III, IV, VI, and X collagen are the most abundant collagens of the joint tissue. During joint tissue turnover, the ECM is remodeled, generating fragments that are subsequently released into the circulation and can be quantified in blood using immunoassays to detect serological biomarkers [4, 8]. ECM biomarkers reflecting inflammation, bone, and connective tissue turnover have been associated with disease activity and progression when compared with clinical outcomes [11,12,13].

Blood-based biomarkers of metalloproteinases (MMP)-degraded type I (C1M), II (C2M), III (C3M), IV (C4M), and VI (C6M) collagen have shown higher levels in patients with ankylosing spondylitis (AS) compared to healthy controls [11, 14], and they have also presented elevated levels in patients with axSpA and psoriatic arthritis (PsA) compared to healthy [13, 15, 16]. Another MMP-degraded biomarker of type II collagen, T2CM, has reflected cartilage degradation in patients with osteoarthritis [17]. Prolargin and vimentin are found in connective tissue and cartilage and tendon, respectively [12, 18]. Degradation fragments of C-reactive protein (CRP) and prolargin (CRPM and PROM, respectively) have been associated with disease activity in axSpA and together with the citrullinated vimentin neoepitope biomarker, VICM, could separate AS patients from non-radiographic axSpA patients [12]. Biomarkers measuring type II, III, and VI collagen formation (PRO-C2, PRO-C3, and PRO-C6, respectively) have been found elevated in PsA compared to healthy individuals [16], and only PRO-C2 has also presented higher levels in axSpA [19]. PRO-C4, measuring ECM basement membrane turnover, has shown increased levels of axSpA compared to healthy controls [13]. Even though it has not previously been tested in axSpA, human neutrophil elastase (HNE)-mediated degradation of calprotectin (CPa9-HNE) measures neutrophil activity and has been highly associated with patients suffering from inflammatory bowel disease [20]. C6M, VICM, and PRO-C3 have also been associated with ASDAS (AS Disease Activity Score based on C-reactive protein) after TNF-α treatment [19]. Type X collagen degradation biomarkers (C10C) have shown lower levels in patients with axSpA in TNF-α treatment [21]. Nevertheless, only a few randomized placebo-controlled studies have investigated the association of biomarkers related to ECM turnover, such as MMP-3, tissue inhibitor of MMP (TIMP)-1, osteocalcin, type I collagen N-telopeptides (NTX), C-terminal of type II collagen neoepitope (C2C), and human cartilage glycoprotein-39 (YKL-40) [3, 22,23,24,25] and clinical measures of disease activity and treatment efficacy in patients with axSpA. Therefore, ECM biomarkers showing key features of axSpA should be further explored as potential pharmacodynamic biomarkers to aid in predicting and monitoring the treatment response of TNF-α inhibitors in patients with axSpA.

In the present study, we investigated a panel of 15 novel blood-based ECM biomarkers, reflecting MMP-driven inflammation and ECM tissue degradation (C1M, C2M, T2CM, C3M, C4M, C6M, C10C, CRPM, PROM, VICM), neutrophil activity (CPa9-HNE), fibrosis (PRO-C3 and PRO-C6), cartilage turnover (PRO-C2), and basement membrane turnover (PRO-C4) before and during treatment with the TNF-α inhibitor ADA based on two placebo-controlled axSpA studies. The aims were to investigate the potential of the ECM biomarkers to show the pharmacodynamic effect (i.e., changes during placebo vs active treatment) and to reflect treatment response to ADA based on the Bath AS Disease Activity Index (BASDAI) and ASDAS criteria.

Methods

Study design and population

Data from two randomized, double-blind, placebo-controlled investigator-initiated trials of axSpA patients were included in this study: the Danish Multicenter Study of Adalimumab in Spondyloarthritis (DANISH, NCT00477893) [26] and the Adalimumab in Axial Spondyloarthritis study (ASIM, NCT01029847) [27], see Fig. 1. The DANISH study included 52 patients with axSpA randomized 1:1 to receive subcutaneous (SC) injections of ADA 40 mg or placebo every other week from week 0 to week 12, followed by SC ADA 40 mg every other week to week 48. The ASIM study comprised 49 patients with axSpA randomized 1:1 to receive either SC ADA 40 mg or placebo every other week for 6 weeks, followed by SC ADA 40 mg every other week from week 6 to week 48. For both studies, the inclusion and exclusion criteria, study procedures, and main results for clinical outcomes and magnetic resonance imaging (MRI) have been described previously [26, 27].

Fig. 1
figure 1

Study design for the DANISH and ASIM studies. Red triangles represent the visits where serum was collected. For the DANISH study, when analyzing the response to treatment, data of the placebo group at week 12 was considered week 0, and data from week 24 was considered week 12

Demographic and clinical data were acquired from all participants. In brief, the conventional patient-reported outcomes (PROs), clinical examinations, blood samples, and MRI scans of the SIJs and spine were performed at week 0 (before treatment initiation) for both studies and at weeks 6 (ASIM), 12 (DANISH), and 24 and 48 (DANISH and ASIM, respectively). Semi-coronal T1-weighted (T1W) and short tau inversion recovery (STIR) MRI sequences of the SIJs, and sagittal T1W and STIR sequences of the spine were evaluated according to the Spondyloarthritis Research Consortium of Canada (SPARCC) inflammation [28] and structural SIJ scores (SSS) for fat, erosion, backfill, and ankylosis [29]. MRI of the spine was evaluated according to the SPARCC spine inflammation [28] and Canada-Denmark (CanDen) Spine scores for inflammation, fat, erosion, and new bone formation [30, 31]. In addition, the Modified New York Criteria were applied on radiographs of the SIJs and the Modified Stoke AS Spine Score (mSASSS) on lateral lumbar and cervical spine radiographs.

Biomarker assays

Serum samples were available from 49 and 45 patients in DANISH and ASIM, respectively. The samples had been stored at − 80 °C until biomarker analysis. A panel of ECM turnover biomarkers was measured in the serum using either manual solid-phase competitive enzyme-linked immunosorbent assay (ELISA) or the Immunodiagnostic Systems robotic platform (IDS-i10; Immunodiagnostic Systems (IDS), Bolden, Tyne & Wear, UK). The assessed biomarkers can be found in Table 1.

Table 1 The panel of biomarkers evaluated in this study

Statistical analysis

Baseline characteristics of the patients are described as number (frequency) for categorical variables and as mean ± standard deviation (SD) for continuous variables. The Kruskal–Wallis test, Mann–Whitney test, or chi-square test were used when appropriate to examine the baseline differences between the groups.

To investigate the pharmacodynamic effect, the average percentage change in biomarker concentrations from baseline to week 12 or week 6 (DANISH and ASIM, respectively) between the placebo and ADA groups was evaluated by linear mixed models. The average percentage change was included as the dependent variable, group and visit as fixed effects, and patient-specific intercepts as random effects (to account for correlated measurements within patients).

To explore the treatment response, biomarker levels were compared between the groups defined by the BASDAI and ASDAS response criteria, where ASDAS was calculated based on CRP. A BASDAI response was defined as ≥ 50% reduction in the BASDAI index from baseline (BASDAI50 responder), and a BASDAI non-response was defined as < 50% reduction in BASDAI index from baseline (BASDAI50 non-responder). For ASDAS, we applied the cutoffs for treatment response defined by the Assessment of SpondyloArthritis International Society (ASAS): no/low improvement (NI) (ΔASDAS < 1.1), clinically important improvement (CII) (1.1 ≤ ΔASDAS < 2.0), and major improvement (MI) (ΔASDAS ≥ 2.0). Furthermore, another classification was done for ASDAS, where we pooled the groups who had at least a CII (≥ CII) (ΔASDAS ≥ 1.1, i.e., including CII and MI) as compared to NI (ΔASDAS < 1.1). To increase the number of patients when evaluating the response to treatment, in the DANISH study, data from the placebo group at week 24 (i.e., the visit after 12 weeks of active treatment with SC ADA) was pooled with the data from the ADA group at week 12 (i.e., for this group also the visit after 12 weeks of active treatment). In the ASIM study, the data were not pooled because the period where the patients received active treatment in both groups was different (i.e., at the visit at week 24, the placebo group had received 18 weeks of active treatment whereas the ADA group had received 24 weeks of active treatment).

The average level of biomarkers between response categories was evaluated by linear mixed models. Biomarker data was log-transformed and used as the dependent variable, with treatment response and visit included as fixed effects and patient-specific intercepts as random effects. For this analysis, estimates are shown back transformed, i.e., on the original scale of the markers. Contrast on the original scale represents the ratios of the geometric mean of serum levels for responders vs the geometric mean for the non-responders (BASDAI50 responder vs non-responder, ASDAS MI vs NI, ASDAS CI vs NI, and ASDAS ≥ CII vs NI).

To explore the correlations between serological biomarkers and clinical scores, Spearman’s correlations were calculated. Only correlations with a rho (ρ) value ≥ 0.3 or ≤  − 0.3 and a p-value < 0.01 were deemed important, as correlations below that threshold were considered very weak. Correlations between 0.30 and 0.39 were regarded as weak, 0.40 and 0.59 as moderate, and 0.60 and 0.79 as strong.

A significance level of 5% was used throughout the analyses, and false discovery rate correction for multiple testing was applied where indicated (targeting a false discovery rate of 5%). Data analyses were performed using R studio version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org; 2020). Graphical illustrations were created using GraphPad Prism version 9.00 for Windows (GraphPad Software, GraphPad Software, San Diego, CA, USA, www.graphpad.com).

Results

Baseline demographics

Baseline patient characteristics can be found in Table 2. In the DANISH and ASIM studies, 24 and 23 patients were randomized to receive ADA treatment and 25 and 22 patients to placebo, respectively.

Table 2 Baseline clinical characteristics of the patients in the DANISH and ASIM studies*

When comparing the two studies, patients enrolled in the DANISH study had a higher fraction of male participants, mean BASDAI, percent of the fulfillment of the Modified New York Criteria, CanDen spine inflammation score, CanDen spine erosion score, and CanDen spine fat compared to the ASIM study (Table 2, all p-value < 0.02).

Baseline characteristics of the patients based on the three classifications of response to ADA can be found in the supplementary material (Additional file 1: Table S1, Table S2, and Table S3).

Pharmacodynamic effects of ADA treatment on serum biomarkers of ECM degradation and inflammation

In the DANISH study, the percentage of change from baseline to week 12 of type I (C1M), type III (C3M), type IV (C4M), and type VI (C6M) collagen degradation and CRP was significantly different between the ADA-treated group and the placebo group (p < 0.001, p < 0.05, p < 0.05, p < 0.05, p < 0.01, respectively; Fig. 2A.1–E.1). For all these biomarkers, the percentage of change was negative in the ADA group, reflecting a decline in the biomarker levels. In the ASIM study, the percentage of change from baseline to week 6 of type I (C1M), type III (C3M), type IV (C4M), and type VI (C6M) collagen degradation; basement membrane turnover (PRO-C4); neutrophil activity marker (CPa9-HNE); and CRP was significantly different between the ADA-treated group and the placebo group (all p < 0.001, Fig. 2.A.2–G.2). For all these biomarkers, the percentage of change was negative in the ADA group, showing a decrease in the biomarker levels. Furthermore, when the placebo group of both studies started receiving ADA treatment (i.e., after the crossover), the levels of the biomarkers also showed a negative percentage of change, and most of the biomarkers approached the levels of the ADA-treated group during the observation time (Fig. 2).

Fig. 2
figure 2

The pharmacodynamic effect of ADA treatment in the DANISH and ASIM studies. Percentage (%) change from baseline of the placebo group vs the ADA group of the selected ECM biomarkers. A.1G.1 DANISH study. A.2G.2 ASIM study. p-values at weeks 12 and 6 were adjusted for multiple comparisons by the false rate discovery method, and a significant difference is displayed by the following asterisks: *p < 0.05, **p < 0.01, ***p < 0.001. Dashed points determine the crossover from placebo to active ADA treatment. Data are presented as mean ± standard error of the mean. Only biomarkers that obtained significance in the analyses are shown here; the rest are provided in Additional file 1: Fig. S1. Abbreviations: C1M, metalloproteinase (MMP)-2/9/13-degraded type I collagen; C3M, MMP-degraded type III collagen; C4M, MMP (multiple)-degraded type IV collagen; C6M, MMP-2/9-degraded type VI collagen; PRO-C4, type IV 7S domain collagen; CPa9-HNE, HNE-mediated degradation of calprotectin

The remaining investigated biomarkers, CRPM, PROM, VICM, C2M, T2CM, C10C, PRO-C3, PRO-C6, and PRO-C2, did not show any significant percentage of change between the ADA and placebo groups in either DANISH or ASIM (Additional file 1: Fig. S1).

Serum ECM biomarkers of inflammation and degradation in BASDAI and ASDAS responders vs non-responders

In the DANISH study, we observed that BASDAI50 responders and patients who achieved ASDAS CII and ASDAS MI tended to have higher biomarker levels at baseline compared to BASDAI non-responders and patients with ASDAS NI, respectively. Specifically, patients with ASDAS MI had significantly higher levels of type I (C1M), III (C3M), IV (C4M), and VI (C6M) collagen degradation and CRP at baseline compared to patients with ASDAS NI (p < 0.001, p < 0.05, p < 0.001, p < 0.01, and p < 0.001, Table 3).

Table 3 Comparison of biomarker levels based on a 50% reduction in the BASDAI index and ASDAS criteria from baseline to week 12 or week 24 in the DANISH and ASIM studies, respectively

Furthermore, patients achieving at least an ASDAS clinically important (≥ CII) also had significantly higher levels of type I (C1M), III (C3M), IV (C4M), and VI (C6M) collagen degradation and CRP compared to ASDAS NI at baseline (all p < 0.05, Table 3). In the ASIM study, we observed that BASDAI responders, patients with ASDAS CII, ASDAS MI, and patients with an ASDAS ≥ CII tended to have numerically higher biomarker levels at baseline compared to BASDAI50 non-responders or patients with ASDAS NI, respectively. Only PRO-C4 reached statistical significance in the group of patients with an ASDAS ≥ CII as compared to ASDAS NI (p < 0.05, Table 3).

After 12 weeks of treatment in the DANISH study, the biomarker levels showed a numerical trend to be lower or equal in BASDAI50 responders vs non-responders, in patients with ASDAS ≥ CII, ASDAS CII, or ASDAS MI vs ASDAS NI, respectively. After 24 weeks of treatment in the ASIM study, biomarker levels also tended to be lower or equal in BASDAI50 responders vs non-responders, patients with an ASDAS ≥ CII, ASDAS CII, or ASDAS MI vs ASDAS NI, respectively, but none reached statistical significance.

ECM remodeling biomarkers’ correlation with clinical assessment of disease activity or severity of axSpA

We further examined the correlations between circulating ECM metabolites and clinical parameters. In the DANISH study, moderate correlations were observed between ASDAS and the biomarkers C1M C3M, C4M, C6M, VICM, and CPa9-HNE (ρ = 0.68 [0.47, 0.82], 0.44 [0.15, 0.66], 0.53 [0.27, 0.72], 0.63 [0.40, 0.79], 0.22 [− 0.09, 0.50], and 0.64 [0.42, 0.79]) and a weak correlation between mSASSS and PROM (ρ = 0.33 [0.02, 0.58], Additional file 1: Table S4). In the ASIM study, weak correlations were observed between age and CPa9-HNE (ρ =  − 0.34 [− 0.58, − 0.05]). Moderate to strong correlations were observed between ASDAS and C1M, C3M, C4M, C6M, CPa9-HNE, and PRO-C4 (ρ = 0.70 [0.51, 0.83], 0.36 [0.07, 0.59], 0.51 [0.25, 0.70], 0.48 [0.21, 0.68], and 0.51 [0.25, 0.70], Additional file 1: Table S5). Weak correlations were found between SPARCC inflammation score and PRO-C6 (ρ = 0.31 [0.01, 0.55]), SPARCC SSS fat score, and C10C (ρ =  − 0.31 [− 0.55, − 0.01]). Moderate and weak correlations were observed between SPARCC SSS erosion score and CRPM and PRO-C6, respectively (ρ = 0.44 [0.17, 0.65] and 0.36 [0.07, 0.60]). SPARCC SSS ankylosis score showed a moderate correlation with PRO-C6 (ρ =  − 0.43 [− 0.65, − 0.16]) while mSASSS correlated with PRO-C3 (ρ =  − 0.31 [− 0.56, − 0.02], Additional file 1: Table S5).

Discussion

In this study, we evaluate the capacity of ECM biomarkers to reflect the pharmacodynamic effects and response to the TNF-α inhibitor ADA in two axSpA randomized double-blind placebo-controlled studies. To our knowledge, this is the first study where an extensive panel of ECM serological biomarkers has been measured in a randomized controlled study of the effect of a TNF-α inhibitor in patients with axSpA. The main findings were as follows: (i) C1M, C3M, C4M, C6M, and CRP were decreased over time in patients receiving ADA compared to the placebo group; (ii) C1M, C3M, C4M, and C6M could discriminate patients with ASDAS MI and patients achieving at least an ASDAS ≥ CII vs ASDAS NI at baseline; (iii) PRO-C4 could discriminate between patients with an ASDAS ≥ CII and patients with ASDAS NI at baseline; and (iv) mild-moderate correlations were observed between ECM biomarkers and clinical scores.

Determining the optimal treatment for axSpA in a timely and non-invasive manner remains a challenge in clinical practice [44]. Blood-based biomarkers might ameliorate this unmet need as they are non-invasive, more accessible, and less expensive tools compared to other available resources. MRI is a useful tool for evaluating the status and change in axial inflammation before and during TNF-α inhibitor therapy, but it is unclear how early the benefit can be detected [9]. Furthermore, MRI techniques are costly and require experienced professionals to interpret the findings. The early identification of patients who are most likely to exhibit improvement after TNF-α inhibitors using biomarkers might assist clinicians in their treatment decisions.

In this study, we also demonstrated the discriminant capacity and sensitivity to change of ECM biomarkers when patients are treated with ADA, representing an effective treatment in this patient group of two well-characterized studies, DANISH and ASIM, the former presenting patients with higher disease activity. It is well known that axSpA is characterized by inflammation [45]. Yet the nature of the inflammation is not fully elucidated, changes in the ECM, specifically collagen turnover, have been associated with inflammation in patients with axSpA [9, 13, 19]. We found that biomarkers related to MMP-driven inflammation and ECM tissue degradation, C1M, C3M, C4M, and C6M, reflected the pharmacodynamic effect of the TNF-α inhibitor ADA. In agreement with our findings, Holm Nielsen et al. [19] assessed C3M, C4M, C6M, and VICM in patients with axSpA treated with TNF-α inhibitor and observed that C6M and VICM were significantly decreased after two weeks of treatment. Schett et al. [16] found reduced levels of C1M, C3M, C4M, and C6M in patients with PsA after treatment with guselkumab, targeting the interleukin 23 p19 subunit, which is involved in the inflammation processes in SpA. However, Visvanathan et al. [46] observed that a C-terminal cross-linking telopeptide of type I collagen marker (CTX-I) did not show any differences between infliximab (TNF-α inhibitor) and placebo-treated patients with axSpA. On the other hand, collagen formation biomarkers, PRO-C3 and PRO-C6, were not altered in response to ADA, in agreement with Holm Nielsen et al. [19], suggesting that it may mainly suppress inflammation-driven tissue degradation and not the development of fibrosis. The latter is also in agreement with MRI studies, including the DANISH study, showing progression in structural damage during the first year after initiation of TNF-α inhibitor [26, 47, 48]. We did not detect any changes in either type II collagen degradation or formation, C2M and PRO-C2, respectively. These results are similar to those reported by Kim et al. [49], who observed no differences in serum biomarkers of type II collagen formation as a result of infliximab therapy in patients with axSpA. In contrast, it has been found that a type II collagen degradation fragment level (C2C) was reduced in etanercept-treated patients compared to placebo-treated patients [25], and a urinary type II collagen C-telopeptide marker (CTX-II) was suppressed after ADA treatment [24]. These results might implicate that C2M could not evaluate the cartilage turnover in this study. In only one of the two studies investigated, we observed that PRO-C4, a basement membrane turnover marker, and CPa9-HNE, a neutrophil activity marker, were also decreased after ADA treatment, which suggests that ECM turnover and immune-related activity might also be inhibited after TNF-α inhibitor treatment. The additional assessed biomarkers did not show any changes between ADA and placebo in the pharmacodynamic effect of ADA.

Additionally, we investigated the discriminative ability of the ECM biomarkers to differentiate clinical responders vs non-responders to ADA based on the BASDAI50 and ASDAS response criteria. From our knowledge, this is the first study that investigates the association of an extensive ECM biomarker panel with the ASDAS criteria in a randomized placebo-controlled study design. No significant differences were observed between BASDAI50 responders and non-responders at any time point. However, we found that patients achieving at least an ASDAS change > CII had higher levels of C1M, C3M, C4M, C6M, CRP, and PRO-C4 at baseline. These results emphasize the association of these biomarkers with clinical response. Specifically, increased inflammation-driven ECM degradation, reflected by these biomarkers, may lead to a better response to ADA treatment in patients with axSpA when assessed by the ASDAS criteria. On the other hand, no differences were observed in type II, III, and IV collagen formation markers in patients fulfilling vs not fulfilling neither the BASDAI nor ASDAS response criteria. Specifically for type III collagen formation, Holm Nielsen et al. [19] observed that patients with no improvement in ASDAS (after 22 weeks of TNF-α inhibitor treatment) had higher baseline levels of PRO-C3 than patients with CI and MI in ASDAS. Further research in this area could contribute to the development of personalized treatment strategies for axSpA patients, considering their unique disease characteristics and ECM remodeling patterns.

As a complementary analysis, we assessed the correlations between the biomarker data and the clinical scores. We observed weak-moderate correlations between ASDAS and C1M, C3M, C4M, C6M, VICM, and CPa9-HNE. In a previous work [13], we found mild-moderate correlations with the SPARCC SSS ankylosis score and C3M, C4M, C6M, and PROM. However, in this study, we observed significant weak-moderate correlations between SPARCC SSS (inflammation, fat, and erosion) and C10C, CRPM, and PRO-C6. Further studies are therefore needed to clarify the relation between the biomarkers and structural damage in patients with axSpA.

The strength of this study was the extensive panel of ECM biomarkers to evaluate the ADA effect in two independent well-characterized axSpA cohorts. The results of this study emphasize the great potential of ECM biomarkers for the evaluation of pharmacodynamic effects in randomized placebo-controlled studies, even in studies with small sample sizes (< 50) (Table 2). Moreover, in subsequent studies, by combining information from ECM remodeling biomarkers with other clinical scores, clinicians may potentially obtain a more comprehensive picture of the patient’s disease status and response to treatment.

There are several limitations to this work. Studies with a bigger sample size are needed to validate the role of the biomarkers in patients with axSpA and to explore the utility of combining the results of different biomarker analyses to predict treatment response. We only evaluated blood-based biomarkers for exploring ECM turnover; further research is needed to understand the ECM turnover in the affected tissues by tissue-based methods.

Conclusion

In conclusion, this study demonstrates that ECM metabolites are associated with the pharmacodynamic effect and response to TNF-α inhibitor treatment in patients with axSpA. Therefore, selected biomarkers of ECM degradation and inflammation (C1M, C3M, C4M, and C6M), individually or together with clinical evaluation and/or MRI, shows promise as future biomarkers to assess disease activity after TNF-α inhibitor therapy.