Background

Systemic sclerosis (SSc) is a severe inflammatory disease of the interstitial tissue with clinical manifestations ranging from limited skin involvement to life-threatening effects on the heart, kidneys and lungs. SSc is a rare disease with an annual incidence in Europe of about 2 cases per 100,000 population, and a prevalence of about 10–25 per 100,000 [1, 2]. According to international registry studies [3], a high proportion of patients with SSc have interstitial lung disease (ILD), with or without pulmonary hypertension (PH), cardiac and gastrointestinal involvement. Cardiac, pulmonary and renal manifestations of SSc lead to an elevated disease-specific mortality [4,5,6]. Despite therapeutic progress, the mortality of patients with SSc is 3.5-fold higher than that of the general population – this factor has been stable over several decades [7].

Involvement of internal organs and joints typically results in impairment of exercise capacity, as measured by the 6-min-walk test (6-MWT) or cardiopulmonary exercise testing (CPET). In particular, CPET provides an important insight into exercise physiology, and has shown patients with SSc to have a lower cardiopulmonary exercise capacity, measured as peak oxygen uptake (peakVO2) [8] and as the relationship between ventilation and carbon dioxide output (VE/VCO2-slope) [9], compared with control individuals. Recent studies suggest that CPET can be used to determine whether the primary cause of exercise capacity limitation is cardiac or pulmonary in origin [10, 11]. Prognosis in SSc has not previously been assessed using CPET. However, studies in PH [12] and pulmonary arterial hypertension (PAH) [13] that included patients with SSc as a subgroup have suggested that CPET parameters may have prognostic value.

Against this background, we retrospectively assessed CPET parameters which could potentially predict survival. Analysis of a representative number of patients with SSc was made possible through the collaboration of multiple centres. Patients with SSc were subdivided into groups with and without interstitial pulmonary manifestations. We hypothesised that in addition to established prognostic factors – age, PH and ILD – CPET parameters, particularly peakVO2 and VE/VCO2-slope, can predict survival in patients with SSc.

Methods

Study design and participants

This study was a retrospective analysis of patients with SSc from a prevalent cohort. The patients were treated in four university hospitals (Greifswald, Regensburg, Dresden and Graz) and two expert centres (Missio Clinic Würzburg, and the Leipzig Pulmonary Study Center). All patients fulfilled the criteria of SSc or CREST syndrome (Calcinosis, Raynaud’s syndrome, Oesophageal dysmotility, Sclerodactyly, Telangiectasia; a subgroup of SSc with limited cutaneous manifestation [lcSSc]) according to current guidelines [14].

Patients without CPET data were excluded from the analysis, as were those with pulmonary diseases other than SSc (e.g. bronchial asthma, previous pulmonary surgery, or pulmonary emphysema visible in high-resolution computed tomography [HR-CT]). Patients with impaired systolic left ventricular function or relevant valvular disease other than tricuspid regurgitation (TR) were also excluded.

Patients with SSc were divided into two groups. Group 1 comprised patients with diffuse cutaneous SSc (dcSSc, n = 88). Group 2 (lcSSc, n = 122) included patients with lcSSc (including a subgroup presenting as CREST syndrome, n = 51). Pulmonary manifestation was assessed by HR-CT and pulmonary function testing as defined by the American College of Rheumatology/European League Against Rheumatism criteria [15]. Parenchyma involvement < 20% was considered to represent a limited manifestation. Extensive manifestation was defined as ≥20% parenchyma involvement. Patients with an uncertain extent of manifestation according to HR-CT were classified as extensive manifestation if forced vital capacity (FVC; as percentage of predicted [%predicted]) was < 70% of normal [16]. Co-morbidity was assessed using the Charlson index [17].

Follow-up and survival of all patients was documented from the first visit until June 30, 2016 (December 31, 2014 at Graz). Patients whose survival could not be documented at these dates were censored at the last day of contact. We defined three different follow-up times: 1) at time of diagnosis for the comparison between dcSSc and lcSSc (groups 1 and 2) and for demographic data such as age and gender; 2) at time of CPET for all other analyses except right heart catheterization (RHC) data; and 3) at time of RHC for analysis of the prognostic value of systolic right ventricular pressure (RVsys).

Echocardiography

Resting echocardiography was performed by experienced physicians according to relevant guidelines [18, 19]. TR was classified according to American College of Cardiology/‌European Society of Cardiology (ESC) recommendations, and RVsys was estimated by simplified Bernoulli equation via TR velocity (v) as RVsys (mmHg) = 4v2, with the addition of 5 mmHg if the inferior vena cava was not dilated and there was visible respiratory variability, and 10 mmHg if the inferior vena cava was dilated or without respiratory variability.

Pulmonary function and diffusion capacity

All centres assessed pulmonary function by spirometry, body plethysmography and measurement of diffusion capacity according to current standards [20,21,22]. Obstructive pulmonary disease was defined by forced expiratory volume in 1 second (FEV1)/FVC < 70%; restrictive pulmonary disease by total lung capacity (TLC) < 80%; and clinically relevant diffusion impairment by diffusion capacity of carbon monoxide (DLCO) < 60% of normal. Normal values for FEV1, FVC and TLC were calculated by the formulas published by our working group [23,24,25], and normal values for DLCO were taken from European Respiratory Society (ERS) formulas [26].

Cardiopulmonary exercise testing

CPET was performed on a bicycle ergometer as a symptom-limited test. Performance and analysis methods have been described in detail previously [23, 27]. All centres started the test with a 3-min resting phase and unloaded cycling of 1–3 min (no unloaded phase was used at Graz), followed by a ramp protocol with 10–12.5 W∙min− 1 in two centres and a step-increment protocol with 12.5–16 W∙min− 1 in the other centres. All values were recorded as absolute values and percentage of normal, based on our reference values [23].

The 6-MWT was performed according to current American Thoracic Society guidelines [28].

Right heart catheterisation

RHC was performed according to the guidelines of the ESC and the ERS [29] if clinical symptoms and echocardiographic criteria suggested possible PH. We applied the criteria defined in an expert consensus [30], which are based on clinical findings (progressive or unexplained dyspnoea, signs of right heart failure), echocardiography (RVsys > 45 mmHg, right ventricular dilation) and DLCO (< 50%). All centres used the mid-thoracic level as the zero-pressure point. PH was defined according to ESC and ERS guidelines as mean pulmonary artery pressure (PAPmean) ≥25 mmHg, and PAH was defined as PAPmean ≥ 25 mmHg, pulmonary artery wedge pressure (PAWP) ≤15 mmHg and pulmonary vascular resistance (PVR) > 3 Wood units (> 240 dyn∙s∙cm− 5) [31].

Statistical analysis

Continuous variables, stratified by group status, are reported as median and interquartile range (IQR, in brackets). Categorical variables are reported as absolute numbers and percentages. Differences among groups were verified by Wilcoxon (continuous data) and χ2-tests (categorical data). Potential associations of group status and parameters from pulmonary function testing and CPET with mortality were tested using Cox regression models adjusted for age and gender. For group status the follow-up time was calculated based on the time of diagnosis; for the other variables the time of first examination defined the starting point.

Prediction models were determined using Cox regression models with age, gender, body mass index (BMI), and all parameters from pulmonary function testing and CPET as explanatory variables. For the final model, we eliminated variables by a backward selection procedure using a cut-off p-value of 0.1. The discrimination of these models was reported by Harrell’s C-statistic. Based on logistic regression models with the outcome “death: yes/no” we conducted receiver operating characteristic (ROC) analyses for selected variables. Kaplan–Meier curves were plotted for selected variables – for continuous variables, cut-off values were defined as the point which maximised the Youden index for the outcome “death”. The Youden index is defined as sensitivity + specificity − 1.

All analyses were carried out with Stata 14.1 (Stata Corporation, College Station, TX, USA).

Ethical approval

The study was approved by the ethics committee of Greifswald University (No. 043/13a, study protocol and amendment of May 5th, 2015).

Results

The study included 210 patients with SSc – demographic and clinical data are shown in Table 1. The majority of patients were women in both SSc groups, with group 2 (dcSSc) having a significantly lower proportion of women (73.9%) than group 1 (lcSSc, 86.1%; p = 0.03). The proportion of active smokers was < 20% in both SSc groups. There were no significant differences between SSc groups in co-morbidity status (Charlson index: 2 [IQR, 1–2] in both groups; p = 0.65) or in the proportion of patients with TR, assessed by echocardiography (80.3 vs 89.7%; p = 0.63). A significantly higher proportion of patients in group 1 had extensive ILD, compared with group 2 (27.1% vs 8.2%; p < 0.001).

Table 1 Demographic parameters

Pulmonary function parameters were significantly different between SSc groups, particularly with regard to FEV1%predicted (group 1, 90% [IQR, 77–104%]; group 2, 95% [IQR, 84–107%; p = 0.002]), and the proportion of patients with impaired FVC (< 70% of normal, 20.0% vs 8.6%; p = 0.02). There were no significant differences in diffusion parameters (DLCO %predicted and DLCO per alveolar volume [Krogh factor; KCO] %predicted; Table 2), or the proportion of patients with DLCO %predicted ≤60% (50.6% vs 37.8%; p = 0.08).

Table 2 Hemodynamic, pulmonary function and CPET parameters

6-min-walking distance (6-MWD) was documented in 96 of 210 patients with SSc, with no significant difference between groups (p = 0.8). All CPET parameters tested were similar in the two SSc groups (e.g. peakVO2, 72.2% vs 75.2% of predicted; p = 0.3 and VE/VCO2-slope, 31.6 vs 33.6; p = 0.1). The overall correlation of 6-MWD and peakVO2 was weak (r = 0.2).

Subgroup with right heart catheterisation

RHC data were available for 136 patients, of whom 52 had PH, including a subgroup of 38 patients with PAH. Patients with lcSSC more frequently underwent RHC (73.8% in group 1 vs 55.7% in group 2; p = 0.006). There were no significant differences between SSc groups in the proportion of patients with PH (42.6 vs 36.0; p = 0.45) or PAH (27.7% vs 28.7%; p = 0.9), or in haemodynamic parameters (Table 2). The subgroup with RHC had higher proportions of patients with extensive ILD and TR, higher mean estimated RVsys, and lower mean DLCO, FVC and 6-MWD. Most CPET parameters in the RHC group were worse compared with the non-RHC group (e.g. VE/VCO2-slope, 35 [IQR, 29–47] vs 29 (IQR, 26–33); peakVO2, 1087 (IQR, 824–1380) vs 1270 (IQR, 1097–1292) mL∙min− 1; both p < 0.001; see Additional file 1: Table S1).

Subgroup with interstitial lung disease

All 195 patients with interpretable HR-CT were included in the subgroup analysis of pulmonary manifestation; of these, 191 patients had a complete pulmonary function test. The proportion of women was lower among patients with ILD (104 of 121; 86%) than among those without ILD (52 of 74; 74%, p < 0.01). Compared with patients without ILD, those with ILD had worse results in all pulmonary restriction and diffusion parameters, and more frequently underwent RHC. In addition, a higher proportion of patients with ILD had pulmonary limitation at exercise (defined as VE/MVV > 80%). There were no significant differences in co-morbidity or echocardiography, or in most haemodynamic and CPET parameters. A detailed comparison between patients with and without ILD is shown in Additional file 2: Table S2.

Mortality

The median follow-up after first diagnosis of SSc was 7.7 years, with a total of 1970 patient-years analysed. From first diagnosis, 5-year survival was 93.8%, and 10-year survival was 86.9% (Fig. 1a). There was no significant difference in survival between SSc groups (p = 0.3; Fig. 1b). In addition, there was no significant difference in survival between patients without ILD and those with extensive ILD (p = 0.1) or limited ILD (p = 0.25). In the subgroup of patients with RHC (n = 139), for whom analysis of PH was possible, a diagnosis of PH was associated with a significantly worse prognosis (p = 0.007, Fig. 1d).

Fig. 1
figure 1

Survival of patients after first diagnosis of SSc (Kaplan–Meier analyses). a Overall. b According to limited or disseminated disease. Bold line: group 1 (dcSSc, n=88); dashed line: group 2 (n=122) comprising lcSSc (n=71) and CREST-syndrome (n=51). c Divided by 6-MWD, Youden index defining best cut-off at 413 m. d Divided by pulmonary hypertension. Bold line: PAPmean ≥25mmHg, dashed line: PAPmean <25mmHg. 6-MWD: 6 minute walking distance; CREST: Calcinosis, Raynaud´s syndrome, Oesophageal dysmotility, Sclerodactyly, Telangiectasia; dcSSc: disseminated cutaneous manifestation; lcSSc: limited cutaneous manifestation; PAPmean: mean pulmonary arterial pressure

Prognostic factors

Cox regression analysis adjusted for age and gender determined that a number of factors were significantly associated with mortality (Table 3). Prognostic value was identified for age, Charlson index, body weight, BMI, extensive ILD, echocardiographic RVsys, and various haemodynamic parameters, pulmonary function and CPET. Moreover, 6-MWD was significantly associated with survival, with a walking distance of 413 m discriminating best (p = 0.003; Fig. 1c) between a favourable and a poor prognosis.

Table 3 Cox regression adjusted for age and gender

In a further step, the model was adjusted for BMI, age and gender and used to analyse all parameters of pulmonary function and CPET that had a significant association with survival (Table 4, model 1). In addition to age, in this model FVC, KCO and peakVO2 in mL∙kg− 1∙min− 1 were significantly linked to survival (Harrel’s C, 0.96). Exclusion of peakVO2 impaired the predictive value of the model (Harrel’s C, 0.84). In a calculation restricted to KCO, TLC and peakVO2, only peakVO2 remained associated with survival. A second model used peakVO2%predicted as a variable instead of peakVO2 in mL∙kg− 1∙min− 1: in this model, age, VE/VCO2-slope, KCO, FVC, and peakVO2%predicted had a significant association with survival (Table 4, model 2).

Table 4 Two different models for the calculation of predictive variables for survival

Finally, ROC analyses were conducted for the parameters peakVO2 and VE/VCO2-slope, and cut-off values were calculated (Fig. 2d). A peakVO2 of 15.6 mL∙kg− 1∙min− 1 (64.5% of predicted) and a VE/VCO2-slope of 34.9 had the highest discriminative value between favourable and poor prognoses (Fig. 2a-c).

Fig. 2
figure 2

Survival and CPET parameters, Kaplan–Meier analysis (a-c), receiver operation characteristic. d. a peakVO2 in mL∙kg-1∙min-1. b peakVO2 as % of predicted normal value. c VE/VCO2-slope. d Receiver operation characteristic for selected parameters. FVC: forced vital capacity in % predicted (area under curve [AUC]=0.73; best cut-off [cut]=80%, Youden Index [Y]=0.30); KCO: Krogh factor (DLCO per alveolar volume in % predicted; AUC= 0.80, cut=62%, Y=0.54); peakVO2: peak oxygen uptake in mL∙kg-1∙min-1 (AUC=0.8, cut=15.6, Y=0.59); VE/VCO2-slope: slope of the relationship between ventilation and carbon dioxide output (AUC=0.8, cut=35, Y=0.57)

Discussion

The results of this study demonstrate for the first time in a large cohort of patients with SSc that CPET parameters (peakVO2, VE/VCO2-slope) and 6-MWD can predict survival.

Although there is some variation among previous studies (as detailed in Additional file 3: Table S3), these have in general found that peakVO2, oxygen uptake at the anaerobic threshold (VO2@AT) and the ratio of oxygen uptake to heart rate (VO2/HR) are lower in patients with SSc than reference or matched control values, while the ratio of ventilation to carbon dioxide output at the anaerobic threshold (VE/VCO2@AT) is higher [8, 9, 11, 32,33,34,35,36,37]. Our study confirmed these differences from reference values for pulmonary function, diffusion and CPET parameters.

The 5-year and 10-year survival rates from first diagnosis in our retrospective group of 210 patients with SSc were 93.8 and 86.9%, respectively. Overall, patients in group 1 (dcSSc) and group 2 (lcSSC) had similar 10-year survival rates (87% in both groups). This is consistent with results reported in the recent literature, with published 10-year survival rates of 93% in a Spanish study [4], 82% in a Canadian study [38], and 88% in an Italian study [39]. Earlier studies reported poorer 10-year survival rates, of 55% [40] and 54–67% [41].

In a Kaplan–Meier-analysis of our cohort according to pulmonary involvement, there was no significant difference for survival in patients with extensive or limited ILD compared with patients without ILD. However, Cox regression demonstrated a significantly higher risk of mortality in patients with extensive disease, compared with those without ILD (hazard ratio = 2.5; p = 0.04). This is in line with other published studies, which have shown significantly better survival rates in patients with moderate interstitial disease [16, 42] than in those with more extensive lung involvement, and with a meta-analysis that found the degree of interstitial changes to be an independent prognostic variable for mortality in SSc [43]. A recent study differentiated among subforms of ILD and showed that manifestation as usual interstitial pneumonia (UIP) has a 2.3-fold risk of mortality compared to manifestation as non-specific interstitial pneumonia (NSIP) [44]. Moreover, new drugs – rituximab [45, 46], mycophenolate [47], their combination [48], and nintedanib [49] – have the potential to provide an effective therapy for ILD. These therapies have been shown to improve parameters of pulmonary function that are related to prognosis, such as DLCO, DLCO/FVC and TLC [45, 50, 51], but to date no study has actually demonstrated improved survival in patients treated with immunosuppressive agents. Hence, there is a need for new parameters that better predict long time survival under immunosuppression [52].

Our analyses of subgroups as ILD/non-ILD and RHC/non-RHC found no relevant prognostic differences regarding CPET parameters. This might be caused by the heterogeneity of these groups or by a pre-selection bias. All study centres assessed CPET parameters as indication criteria for the performance of the RHC, and therefore nearly all CPET parameters were worse in the RHC group than in the non-RHC group (e.g. lower peakVO2 and higher VE/VCO2-slope). Similarly, the proportion of RHC in ILD was 84%, compared with 54% in non-ILD patients, preventing an evaluation of prognosis in these subgroups.

In accordance with the literature [53, 54] survival in our study was worse in patients with PH than among patients without PH. Multiple studies have shown that the prognosis of patients with ILD in addition to PH is even worse than in patients with PH alone (see Additional file 3: Table S3) [55,56,57,58,59]. It is notable that patients with PH who have SSc do not often suffer from PAH, but rather from PH due to left heart disease or PH due to lung disease (groups 1, 2 and 3 of the Nice classification, respectively) [31].

Our study has confirmed the prognostic significance of age, gender and pulmonary function parameters (vital capacity, TLC, FVC, FEV1, KCO, DLCO and quotient FVC/DLCO). Studies of these prognostic parameters, as well as meta-analyses describing patients with SSc with and without PH, have been reported previously [43, 60]. In particular, impaired DLCO and increased FVC/DLCO have a high sensitivity for predicting PH (particularly PAH) and have been included in several screening algorithms for PH in SSc [61,62,63].

In addition to these established parameters, our study showed a significant relationship between 6-MWD and survival in SSc. To our knowledge, this relationship has not previously been reported. The 6-MWD predicts prognosis in PAH [64], but has several limitations [65, 66]. In general, the use of 6-MWD in studies assessing pulmonary haemodynamics in patients with SSc has been recommended [67], but CPET is regarded as an alternative [68]. The weak correlation between 6-MWD and peakVO2 in our study may indicate that these two parameters identify different patients at risk. Previous 6-MWD studies have assessed subgroups of SSc. A recent meta-analysis of 6-MWD showed differences in walking distances between groups with or without PH or ILD [69]. For the subgroup of patients with SSc and ILD, the 6-MWD has been included in an algorithm for calculating mortality risk [70]. Similarly, in a meta-analysis of patients with SSc with PH, a shorter 6-MWD was associated with a worse prognosis [53], alongside age, gender, pericardial effusion, increased right atrial pressure, increased PAPmean, and reduced cardiac output. In contrast to our results, a retrospective study by Le Pavec et al. found no relationship between 6-MWD and survival in 70 patients with SSc with ILD and PH [71]. However, Zhao et al. found a 6-MWD of < 380 m to be an independent predictor of mortality in 190 patients with PH associated with various collagenoses [72]. This is consistent with our observations, and the difference from our cut-off value of < 430 m may result from our restricting the population to patients diagnosed with SSc, with or without PH.

The most important insight from our study may be the high prognostic relevance of CPET parameters for the survival of patients with SSc. The results confirm our hypothesis that peakVO2 and VE/VCO2-slope can predict survival. In addition, our study found this prognostic relationship in a cohort of patients of whom only a minority had PH or PAH. This is in contrast to previous studies, which have shown a prognostic relevance for CPET parameters only in patients with SSc who have PH or PAH [12, 13]. Multiple studies, including two analyses of patients with idiopathic PAH from our study group, have found peakVO2 and VE/VCO2-slope, among other parameters, to be related to survival [12, 13, 73, 74]. In a recent study of 226 patients with idiopathic PAH, peakVO2, VO2@AT, VO2/heart rate, petCO2@rest, petCO2@AT, VE/VCO2-slope and VE/VCO2@rest were related to survival in a univariate analysis (in a multivariate analysis only peakVO2 and VE/VCO2@rest were retained) [74]. Interestingly, CPET parameters can be sensitive in cases of pulmonary vasculopathy without manifested PH or PAH [10, 75, 76], because in these cases the integration of different cardiac, muscle and pulmonary pathologies in CPET parameters allows prognostication. Moreover, CPET can differentiate between predominantly cardiac and predominantly pulmonary manifestation, and increase the pre-test probability for PH [77]. In this way CPET may suggest specific therapeutic options.

Limitations

Our retrospective study analysed a prevalent cohort of patients with SSc. The cohort was heterogeneous with respect to pulmonary pressure, ILD, and co-morbidities, which previous studies have found to affect the magnitude of changes in CPET parameters [32, 77, 78]. Although combined from six centres, the number of patients in our study was not high enough to separately analyse patients with PH and PAH. A slightly different CPET protocol was used in one centre, but this did not change the relevant CPET parameters [79]. However, despite substantial heterogeneity, we were able to identify highly significant prognosticators of survival which suggests robust results.

Conclusions

Our study has demonstrated the prognostic value of the CPET parameters peakVO2 and VE/VCO2-slope in a large cohort of patients with SSc. Cut-off values of peakVO2 < 15.6 mL∙kg− 1∙min− 1 (< 64.5% of predicted) and VE/VCO2-slope > 35 predict worse survival. Further work is needed to determine whether the poor prognosis in these groups reflects the development of PH. If so, this would be of clinical importance, because while there is no specific SSc therapy, there are therapeutic options for the subgroup with PH. Therefore, peakVO2 or VE/VCO2-slope may increase the pre-test probability for PH, meaning that CPET results may suggest specific treatment.