Introduction

Systemic sclerosis (SSc) is a generalized auto-immune disease characterized by inflammation, micro-vasculopathy, and fibrosis, affecting skin and internal organs. The primary cause of death in patients with SSc is interstitial lung disease (ILD) [1]. The incidence of ILD in SSc varies from 25 to 90% depending on the method used to identify ILD [2], with a 10-year survival rate in SSc-ILD patients of 70% from onset of SSc-ILD [3]. Although the pathogenesis of SSc-ILD has not been elucidated, it is hypothesized that chronic inflammation of the alveoli leads to progressive lung tissue damage and increasing fibrosis [4,5,6,7,8].

According to EULAR recommendations, cyclophosphamide (CYC) is the first choice of therapy for treating SSc-ILD [9]. Cyclophosphamide acts as a cytotoxic immunosuppressive agent through modulation of lymphocyte function that leads to depression of the inflammatory response and less fibrosis [10, 11]. Several uncontrolled studies showed a positive effect on pulmonary function in patients receiving cyclophosphamide [12,13,14,15,16,17,18,19]. Based on these studies, two double-blind placebo-controlled randomized controlled trials were subsequently carried out [20, 21]. The first, the scleroderma lung study (SLSI), investigated oral CYC and showed a small positive effect on pulmonary function and quality of life after 12 months compared to placebo, which did not sustain after 24 months. The second, the UK lung study, showed a trend towards improvement of forced vital capacity (FVC) after six monthly intravenous (IV) CYC pulses during 12 months compared to placebo. A third study (SLSII) compared CYC to mycophenolate mofetil (MMF) in a randomized controlled setting. Both CYC and MMF were proven to be equally effective in SSc-ILD after 24 months. These results show that CYC can have an effect, albeit small, on pulmonary function in SSc-ILD and quality of life.

The small effect of these studies might be caused by differences in dosage, treatment duration, and administration route as well as a longer SSc disease duration. In the Radboud University Medical Center, patients with SSc-ILD are treated with monthly CYC pulses according to a protocol in which patients are given monthly iv CYC (750 mg/m2) for 12 months as soon as the ILD is diagnosed. We hypothesized that the effects of CYC in our cohort of SSc-ILD patients would be equal to or larger than in the aforementioned trials, because of this upfront, high-dose treatment protocol. We also hypothesized that, based on what is known about the mechanism of action of CYC in SSc-ILD [22], CYC may have a larger effect in an “active” inflammatory phase of SSc-ILD than in a later fibrotic phase.

Therefore, in this study, we analyzed whether the extent of inflammation, as assessed by the proportion of ground glass compared to fibrosis, SSc disease duration, the extent of ILD, or baseline diffusion capacity of the lungs (DLCO) < 60%, modifies the effect of intravenous CYC pulse therapy (750 mg/m2) on pulmonary function (as measured by FVC, DLCO) in SSc-ILD patients, after 12, 24, and 36 months.

Methods

Design

This retrospective study was designed as an observational, long-term cohort including prospective data collection and intention-to-treat analysis. Since 2003, intravenous, monthly CYC pulse therapy is the first of choice therapy for SSc-ILD and/or rapid progressive diffuse SSc at our Department of Rheumatology of the Radboudumc. All patients since then diagnosed with SSc-ILD who started treatment with monthly iv CYC pulses (dose 750 mg/m2) were analyzed in this study. Pulmonary function tests (FVC and DLCO) obtained during scheduled follow-up were used to analyze the treatment effect of CYC at 12, 24, and 36 months after start of CYC pulses. Variables that were chosen to reflect the concept of an “active” inflammatory phase of SSc-ILD included the percentage of ground glass, SSc disease duration, extent of ILD by Goh’s classification criteria, and baseline DLCO. These variables were used as inflammatory proxies to analyze the effect on pulmonary function test results over time. Occurrence of adverse events (AE) and survival were also analyzed.

Due to the observational nature of this cohort, no ethical review was needed according to Dutch law and regulations.

Patients

All patients with SSc-ILD as indicated by high-resolution computed tomography (HRCT) who started intravenous CYC pulse therapy from 2003 in the Radboudumc were included in this study. Patients who received cyclophosphamide for skin involvement were not analyzed in this study. The dosage of CYC was 750 mg/m2. Presence of SSc-ILD was defined as presence of ground glass opacity (GGO) or honeycombing or pulmonary fibrosis on HRCT. Patients were followed from the start of CYC treatment up to 3 years. Most patients in our center received mycophenolate mofetil following CYC infusions, during at least 3 years. Before 2013, patients received MMF with an increasing rate; other patients received azathioprine or no follow-up treatment; from 2013 onwards, follow-up treatment with MMF is the standard.

Assessments

Baseline demographic and clinical data and results from pulmonary function tests (PFTs) and HRCT were collected. Occurrence of the first non-Raynaud (NR) symptom was appointed as disease onset of SSc. Pulmonary function testing was performed at 6, 12, 24, and 36 months after the start of CYC pulses. Spirometry (FVC) and carbon monoxide DLCO were performed according the ATS/ERS guidelines [23]. Predicted normal values were derived from the European Community for Steel and Coal [24].

The presence and extent of GGO and fibrosis on HRCT was assessed according to the criteria by Goh for limited and extensive ILD. HRCT images were scored at five levels: (1) origin of great vessels, (2) main carina, (3) pulmonary venous confluence, (4) halfway between the third and fifth sections, (5) immediately above the right hemi-diaphragm. The total extent of interstitial lung disease was estimated to the nearest 5 % in each of the five sections, with global extent of disease on HRCT computed as the mean of the scores [25].

Two independent physicians were blinded for patient characteristics including PFT results and blinded for purpose of the study performed this assessment, respectively. Cases in which there was disagreement were discussed in a consensus meeting and consensus was reached in every case.

Outcomes

The primary endpoint in this study was the FVC after 12 months since the first CYC infusion. Secondary outcomes in pulmonary function were DLCO after 12 months, FVC and DLCO after 24 and 36 months, and therapy response according to the American Thoracic Society [26]. A good response to therapy was defined as an increase in FVC of > 10% or an increase in DLCO > 15% compared to baseline (treatment start) [5, 17, 19, 26, 27]. Stabilization was defined as a change in FVC of < 10% and DLCO of < 15%, and a decrease of > 10% in FVC or > 15% in DLCO as worsened SSc-ILD [28]. Further, secondary outcomes were serious adverse events (SAEs) and death related to ILD.

Effect modifiers

Effect modifiers of interest, reflecting inflammatory activity of ILD by proxy, were the presence of more GGO than fibrosis, SSc disease duration < 3 years, the extent of ILD according to Goh, and DLCO < 60% at baseline. The method of Goh was used to determine whether limited or extensive ILD was present [25]. This staging system consists of a combination of HRCT scan and PFT data and provides discriminatory prognostic information on SSc-ILD [25]. In short, the disease extent on HRCT scan (more or less than 20% of the surface of the lungs) is combined with FVC % predicted (higher or lower than 70%) to stage the ILD as limited or extensive disease [25].

Statistical analysis

All analyses of FVC and DLCO were conducted on all patients with SSc-ILD starting a first pulse with CYC. Missing outcome data due to dropout were replaced by the last observation carried forward (LOCF), except in the case of death. Per-protocol analysis was also done, including all patients finishing a complete course of 12 pulses [29, 30].

The effect of the predefined variables, chosen to reflect the concept of an “active” inflammatory phase of SSc-ILD, on the effect of CYC on pulmonary function was analyzed. Linear regression analysis was performed with ΔFVC and with ΔDLCO as the dependent variable. For these analyses, potential confounders were regarded as actual confounders if their inclusion led to a change of > 10% of the regression coefficient of the group effect. Analyses were done for every time point: the difference in FVC and DLCO between 0 vs. 12 months, 0 vs. 24 months, and 0 vs. 36 months. All four probable effect modifiers were analyzed separately.

Further, it was explored whether there were baseline differences between responders and poor responders using Student’s t test, Mann–Whitney U test, chi-square test, or Fisher’s exact test, as appropriate. Occurrence of SAEs was tabulated; survival was analyzed using a Kaplan-Meier plot. A p value < 0.05 was deemed to notify statistical significance. Analyses were done using SPSS version 22.0.

Results

A total of 91 SSc patients were treated with CYC pulses for SSc-ILD; 16 patients were excluded due to the lack of a baseline HRCT (n = 15) or PFT (n = 1). Baseline characteristics of the 75 eligible patients are summarized in Table 1. Patients with diffuse cutaneous SSc (dcSSc) were more often male with a shorter disease duration. Fibrosis on HRCT was the most prevalent abnormality at baseline and was found in 95% of the patients; GGO was found in 76% of the patients. Both fibrosis and GGO were most often present in the lower lung fields.

Table 1 Baseline characteristics

Course over time

PFT results after 12, 24, and 36 months are shown in Fig. 1. Both FVC and DLCO showed a non-significant increase. Table 2 displays the change in FVC and DLCO between baseline and 12, 24, and 36 months, sequentially. Results are shown for the raw data, data adjusted with LOCF and the per-protocol analysis (PPA) of all patients who completed 12 cycles of CYC (n = 44). On average, forced vital capacity after 36 months was 1% higher than baseline (95% CI, − 5.5–7.5%). Similarly, there was a non-significant increase of 1.6% (95% CI, − 2.5–5.7) in DLCO after 36 months compared to baseline. Adjusting results for lost-to-follow-up and per-protocol did not lead to significant changes.

Fig. 1
figure 1

Pulmonary function test results at baseline and during follow-up at 12, 24, and 36 months

Table 2 Pulmonary function test during follow-up

Effect modifiers

To investigate whether the treatment effect was more pronounced in the early inflammatory phase, we identified four proxies for active, early inflammation: GGO, disease duration, extent of ILD, and baseline DLCO (see “Methods”). In Table 3, “GGO > fibrosis” indicates the difference between GGO > fibrosis and GGO < fibrosis. This is similar for the analyses of the variables “disease duration < 3 years,” “limited vs. extensive ILD,” and “baseline DLCO < 60%.”

Table 3 Linear regression analysis

None of these proxies for early, active inflammation were associated with a higher treatment effect. For example, the change in percentage of predicted FVC values between 0 and 12 months differed 0.2% between GGO > fibrosis and GGO < fibrosis, which was a non-significant difference (p = 0.92). The other probable effect modifiers also showed no significant differences in both FVC and DLCO at all follow-up time points. In the per-protocol analysis in the 44 patients who completed 12 cycles of CYC pulse therapy, similar results were seen (not shown).

Responders

Out of the 75 patients who started treatment with CYC pulses, 14 patients (18.7%) fulfilled the good response criteria during follow-up. A total of 41 patients (54.7%) stabilized and 20 patients (26.7%) worsened during follow-up.

There was a significant difference (p = 0.008) between the different response groups in SSc disease duration: the good responders had a median disease duration of 10 months (p25–p75: 4–29); the stabilization group had a median of 25 months (10–98) and in the worsened group the median was 19 (9–34). Baseline FVC was significantly (p = 0.04.) lower in the good responders and stabilized patients (median 75 (p25–75: 71–92) and 83 (70–99)) compared to the poor responders (93 (84–113)). A per-protocol analysis in the 44 patients who completed 12 cycles of CYC pulse therapy showed similar results.

Concomitant medication

A total of 51 patients (68%) received treatment with mycophenolate mofetil during follow-up. There were no baseline differences between the users and non-users of mycophenolate mofetil. Also, there were no differences in FVC and DLCO, nor in the change of FVC and DLCO, during follow-up between patients receiving mycophenolate mofetil and patients not receiving mycophenolate mofetil.

Treatment discontinuation

A total of 31 patients (41%) withdrew before the completion of 12 monthly cycles of CYC. During the monthly cycles of CYC pulses, four patients died (see section Survival). The reasons for treatment discontinuation in the other 27 patients were progression of ILD (13), pneumonia (3), excessive burden on the patient (2), renal impairment (1), leucopenia/thrombopenia (1), allergic exanthema (1), and leukocytoclastic vasculitis (1). Additionally, there were four patients who completed only six CYC pulses, due to a different choice for the treatment strategy, for example to preserve fertility. In one patient, the discontinuation reason was unclear. There were no significant differences regarding treatment discontinuation between the probable effect modifiers: limited and extensive ILD, more GGO than fibrosis, disease duration, or baseline DLCO < 60%. Although not significant (p = 0.35), in 11 out of 59 patients (18.6%) with a baseline DLCO < 60%, the reason for treatment discontinuation was treatment inefficacy, compared to 0 out of 10 patients (0%) who had a baseline DLCO > 60%.

Adverse events

Adverse events during treatment with CYC occurred in 17 cases (23%): pneumonia (7), anemia (4), leucopenia (2), and hematuria, renal insufficiency, leukocytoclastic vasculitis, and allergic exanthema (1).

No significant differences were seen in occurrence of adverse events between limited and extensive ILD, more GGO than fibrosis, disease duration, or baseline DLCO < 60%.

Survival

A total of 14 patients died during the CYC pulses (4) or during the 3-year follow-up period (10).

Of the four patients who died during the CYC pulses, one patient died after a complication of vascular surgery, and one died of respiratory insufficiency following congestive heart failure due to progression of ILD and pulmonary hypertension (PH). Two patients died after a probable pneumonia complicating their ILD. There were another 10 patients who died during the 3-year follow-up period: progressive ILD/PH (n = 4), progressive ILD (n = 2), metastasized colon carcinoma (n = 1), sepsis (n = 1), acute cardiac asthma (n = 1), and SSc-related myocarditis (n = 1).

In Fig. 2, the Kaplan–Meier survival curve is displayed, showing that most patients died in the first year after the start of CYC pulses. The overall 3-year survival rate in the 75 patients who started CYC pulses was 81%. Survival in the lcSSc patients was 76% compared to 86% in dcSSc patients; this was a non-significant difference (log-rank 0.30). The effect modifiers (more GGO than fibrosis, disease duration, limited ILD, or baseline DLCO < 60%) showed no significant difference in survival (not shown).

Fig. 2
figure 2

Kaplan–Meier curve. Survival in SSc-ILD treated with monthly CYC pulses

Discussion

The present study shows that high-dose treatment with CYC leads to a sustained stabilization of pulmonary function in patients with ILD associated with SSc. We found that both vital capacity and carbon monoxide capacity of the lungs remained stable for 3 years after treatment with CYC. The treatment effect seemed to be independent of the inflammatory phase of ILD; proxies for disease activity did not significantly alter the treatment effect.

According to the EULAR treatment recommendations, CYC is the first choice treatment for SSc-ILD [9]. The effect of CYC on pulmonary function as demonstrated in other studies is confirmed by the results of the current study [13,14,15,16, 19,20,21, 31]. In this study, we also were able to show long-term stabilization of pulmonary function during 3 years of follow-up, as well as an overall 3-year survival rate of 81%. Besides, this study has identified a subgroup of good responders. On the opposite, treatment with CYC also has its downsides, with adverse events occurring in almost one in four patients.

As stated in the “Introduction,” we hypothesized that CYC might be more effective if SSc-ILD still has a large inflammatory component while fibrosis is less present. We have collected four proxies of inflammatory activity and inflammation to approach this as closely as possible (the proportion of ground glass, SSc disease duration, extent of ILD according to Goh’s criteria, and baseline DLCO < 60%).

Contrary to our hypothesis, none of these proxies showed could predict a good treatment response in our cohort of SSc-ILD patients. This might have several explanations. First, the treatment effect of CYC might be independent of the extent of ILD. Second, the proxies we chose might have reflected the early inflammatory phase inadequately. Third, inflammation might not be the driving factor for the treatment effect of CYC. Thus, there is a lack of proper proxies for treatment effect in SSc-ILD. The underlying issue is the lack of a gold standard for inflammation in interstitial lung disease. In order to gain further insight, future research should focus on the pathophysiology of SSc-ILD to identify relevant markers that show good respond to CYC treatment.

Predictors of a good response on CYC therapy are pivotal. In our study, 19% had a predefined good response, 55% of the patients had stabilized pulmonary function test results, and 27% worsened during follow-up. Although we could not find a robust treatment effect of our proxies, there was some evidence that patients with a short disease duration respond better. Baseline FVC in the good responders was remarkably significantly lower than that in the poor responder group, but this might be a result of less loss of FVC in the poor responder group: “less loss, less to regain.” The failure of our proxies to identify relevant subgroups of patients that might respond better to cyclophosphamide treatment underlines the need for better treatment surrogates in future trials, especially in light of the rate of serious adverse events seen in the present study.

The use of MMF in SSc-ILD has increased over the last decade. In our center, MMF is often described after CYC infusions. We found no differences in MMF users and non-users in both baseline and follow-up results. Therefore, we could not prove an additional effect of MMF following CYC treatment as was found in the study performed by Tashkin et al. [31].

There are some limitations to this study. The most important limitation probably is our choice of the four surrogate markers for an “early” inflammatory phase in SSc-ILD. There is no gold standard available to measure the amount of inflammation in ILD. We have tried to choose several components which might reflect the early, inflammatory, reversible phase of SSc-ILD. Ground glass opacities seen on HRCT indicate the presence of alveolitis, which is still reversible. Secondly, 75 out of 91 patients who started with CYC pulses because of SSc-ILD were analyzed in this study. The 16 patients all dropped out due to missing baseline HRCT (n = 15) or PFT (n = 1), not because of specific disease-related causes. Therefore, the probability of bias is small.

In conclusion, overall pulmonary function in SSc-related ILD measured by FVC and DLCO shows stabilization after intravenous CYC pulses during a 3-year follow-up period. One fifth of the patients improved, more than half stabilized, and a quarter worsened. A significant part of the patients died during the follow-up period. The different inflammatory activity proxies, proportion of ground glass, SSc disease duration, extent of ILD, and baseline DLCO do not influence the effect of CYC on pulmonary function. Therefore, based on the results of both previous RCTs and the results of this study, CYC appears to be an appropriate therapy for a large proportion of patients with SSc-ILD; subgroups of patients with good or no response could not be identified. Future studies should focus on identifying relevant markers of inflammation to better select those patients with SSc-ILD that will have the best response to this intensive treatment.