Applicability of radiomics in interstitial lung disease associated with systemic sclerosis: proof of concept

Martini, K.; Baessler, B.; Bogowicz, M.; Blüthgen, C.; Mannil, M.; Tanadini-Lang, S.; Schniering, J.; Maurer, B.; Frauenfelder, T.

doi:10.1007/s00330-020-07293-8

Applicability of radiomics in interstitial lung disease associated with systemic sclerosis: proof of concept

Chest
Open access
Published: 06 October 2020

Volume 31, pages 1987–1998, (2021)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

Applicability of radiomics in interstitial lung disease associated with systemic sclerosis: proof of concept

Download PDF

K. Martini ORCID: orcid.org/0000-0002-2638-6832¹,
B. Baessler¹,
M. Bogowicz²,
C. Blüthgen¹,
M. Mannil¹,
S. Tanadini-Lang²,
J. Schniering³,
B. Maurer³ &
…
T. Frauenfelder¹

3991 Accesses
22 Citations
5 Altmetric
Explore all metrics

Abstract

Objective

To retrospectively evaluate if texture-based radiomics features are able to detect interstitial lung disease (ILD) and to distinguish between the different disease stages in patients with systemic sclerosis (SSc) in comparison with mere visual analysis of high-resolution computed tomography (HRCT).

Methods

Sixty patients (46 females, median age 56 years) with SSc who underwent HRCT of the thorax were retrospectively analyzed. Visual analysis was performed by two radiologists for the presence of ILD features. Gender, age, and pulmonary function (GAP) stage was calculated from clinical data (gender, age, pulmonary function test). Data augmentation was performed and the balanced dataset was split into a training (70%) and a testing dataset (30%). For selecting variables that allow classification of the GAP stage, single and multiple logistic regression models were fitted and compared by using the Akaike information criterion (AIC). Diagnostic accuracy was evaluated from the area under the curve (AUC) from receiver operating characteristic (ROC) analyses, and diagnostic sensitivity and specificity were calculated.

Results

Values for some radiomics features were significantly lower (p < 0.05) and those of other radiomics features were significantly higher (p = 0.001) in patients with GAP2 compared with those in patients with GAP1. The combination of two specific radiomics features in a multivariable model resulted in the lowest AIC of 10.73 with an AUC of 0.96, 84% sensitivity, and 99% specificity. Visual assessment of fibrosis was inferior in predicting individual GAP stages (AUC 0.86; 83% sensitivity; 74% specificity).

Conclusion

The correlation of radiomics with GAP stage, but not with the visually defined features of ILD-HRCT, implies that radiomics might capture features indicating severity of SSc-ILD on HRCT, which are not recognized by visual analysis.

Key Points

• Radiomics features can predict GAP stage with a sensitivity of 84% and a specificity of almost 100%.

• Extent of fibrosis on HRCT and a combined model of different visual HRCT-ILD features perform worse in predicting GAP stage.

• The correlation of radiomics with GAP stage, but not with the visually defined features of ILD-HRCT, implies that radiomics might capture features on HRCT, which are not recognized by visual analysis.

Association of quantitative computed tomography ındices with lung function and extent of pulmonary fibrosis in patients with systemic sclerosis

Article 16 September 2021

Quantitative computed tomography assessment for systemic sclerosis–related interstitial lung disease: comparison of different methods

Article 19 March 2020

Performance of a new quantitative computed tomography index for interstitial lung disease assessment in systemic sclerosis

Article Open access 01 July 2019

Introduction

Interstitial lung disease (ILD) is a well-known complication of systemic sclerosis (SSc) affecting over 60% of patients [1,2,3] and represents the leading cause of disease-related death [4].

The detection of SSc-ILD is crucial because an early diagnosis of SSc-ILD has important prognostic and therapeutic implications. Novel imaging approaches such as quantitative computed tomography (CT) [5, 6], magnetic resonance imaging (MRI) [7, 8], and nuclear imaging [9, 10] are applied in ILD to provide prognostic, functional, and metabolic information [11]. So far, high-resolution CT (HRCT), a non-invasive, cost-effective, and sensitive technique, remains the gold standard for ILD diagnosis because it is able to detect lung involvement prior to appearance of clinical symptoms and provides prognostic information [12,13,14]. However, there are many features to determine the presence of ILD and inter-reader variability, especially in unexperienced readers, is an issue.

Most patients with SSc-ILD have mild or stable disease, which does not warrant treatment, only surveillance [15]. However, the high morbidity and mortality of progressive SSc-ILD define the need for early detection for therapeutic intervention. Such a screening modality should combine both high sensitivity and reproducibility.

Radiomics, defined as the conversion of medical images to higher-dimensional data, is a novel research area. Feature extraction is a crucial step in radiomics and comprises the computation of texture, density, and shape from predefined regions of interest (ROIs). Radiomics offers the advantage of an objective quantification of tissue characteristics and enables the detection of abnormalities in radiological images not depicted by routine visual analysis [16,17,18,19]. Due to the high objectivity and reliability of data, radiomics shows great potential as support for clinical decision-making [20]. Radiomics has attracted increased attention in recent years, and several studies show that radiomics can be of benefit in terms of prognosis and diagnosis of multiple diseases, especially malignancies [21,22,23]. In SSc-ILD, to the best of our knowledge, radiomics analyses have not yet been performed.

Currently, no validated single tools are established for staging in SSc-ILD although in clinical practice, a 70% threshold of percentage predicted forced vital capacity (FVC [%]) and extent of fibrosis on HRCT with a threshold of 20% are routinely used [13, 24]. Although most commonly employed, pulmonary function tests as “stand-alone” examination are inferior for diagnostic purposes than HRCT [2]. To overcome the limitations of single factors, several composite scores have been proposed: One of them is the so-called gender, age, and pulmonary function (GAP) score and staging system, developed by Ley et al in 2012 [25]. The system uses four variables: gender (G), age (A), and two pulmonary physiological parameters (P)—FVC [%] and percentage predicted diffusion capacity of the lungs for carbon monoxide (DL_CO [%]). The score has been validated in the USA, Italy, and South Korea and showed robust predictive power in patients with chronic ILD [26, 27]. GAP stage is not routinely calculated in SSc-ILD, and visual analysis of ILD criteria on HRCT does not, or not sufficiently, reflect prognosis. We hypothesize that radiomics features might provide important information on disease extent and could potentially influence individual patient management.

In this retrospective pilot study, we aim to evaluate if texture-based radiomics features are able to detect ILD and to distinguish between the different disease stages in patients with SSc-ILD in comparison with mere visual analysis of HRCT.

Methods

Patients

Sixty patients (46 females, median age 56 years), who were part of the SSc cohort at the University Hospital Zurich, Switzerland (EUSTAR online Database [pre-BASEC-EK-839, BASEC KEK-Nr.-2016-01515] and VEDOSS online Database [BASEC-Nr.2010-158/5]), fulfilled the ACR/EULAR classification criteria [28], and underwent HRCT (Table 1) between January 2012 and October 2015 with signs of ILD, were retrospectively included in the study. The corresponding image analysis was done retrospectively. Demographic and clinical data, as well as values for pulmonary function tests (PFT), were acquired for each patient (Table 1). The PFT indices included the actual values and the percentage predicted values of a certain age, height, and gender group (%predicted) of forced expiratory volume in 1 s (FEV₁), forced vital capacity (FVC), total lung capacity (TLC), and diffusion capacity of carbon monoxide (DCLO). In order to make results comparable throughout the study population, %predicted values were used for statistical evaluation. GAP stage was calculated according to Mango et al [25, 29]. Patient characteristics are summarized in Table 1. The retrospective study has been approved by the institutional review board (BASEC-Nr. 2018-02165), and written informed consent was sought from all patients.

Table 1 Main patient characteristics. n number of patients, f/m female/male, y/n yes/no, SD standard deviation, mRSS modified Rodnan skin score, ILD interstitial lung disease, HRCT high-resolution computed tomography. The PFT indices included the percentage predicted values of a certain age, height, and gender group (%predicted) of forced expiratory volume in 1 s (FEV₁), forced vital capacity (FVC), total lung capacity (TLC), and diffusion capacity of carbon monoxide (DCLO). Percentage of fibrosis per lung (fibrosis > 20%). *Antibodies comprised anti-centromere antibodies, anti-nuclear antibodies, anti-topoisomerase I antibodies, anti-RNA-polymerase III antibodies, and anti-U1nRNP antibodies. **Immunosuppressive therapy included prednisone, cyclophosphamide, methotrexate, azathioprine, mycophenolate mofetil, d-penicillamine, rituximab, imatinib, and anti-TNF (tumor necrosis factor alpha) inhibitors. ***Expert opinion by echocardiography

Full size table

HRCT protocol

All HRCT images were acquired in prone position in full inspiration. HRCT scans were obtained with a 64-slice CT scanner (Somatom Definition AS, Siemens Healthineers). The CT protocol included a topogram and one series in prone position in full inspiration. The following parameters were used for the standard HRCT: tube voltage 120 kV, tube current 30 mAs (reference dose, care dose: on), slice thickness: 1 mm, increment: 0.8 mm, kernel B70. The standard HRCT was reconstructed with iterative reconstruction (SAFIRE) strength 3 [30].

ILD features on HRCT

The readout was performed by two radiologists (T.F. 16 and K.M. 5 years of experience in thoracic imaging) by consensus: If there was disagreement between the two readers, whether an HRCT feature was present or not, re-assessment was performed until consensus was reached. Images where evaluated for the presence of characteristic visual ILD features (yes/no) including pulmonary emphysema, honeycombing, subpleural lines, pleural margins, bronchiectasis, ground-glass opacities, and reticular changes (Fig. 1). A case-by-case evaluation was performed.

Image analysis was performed on a standard picture archiving and communication system workstation (Impax, Version 6.5.5.1033; Agfa-Gevaert) and a high-definition liquid crystal display monitor (BARCO; Medical Imaging Systems).

Visual assessment of lung fibrosis severity

Extent of lung fibrosis

According to Goh et al [13], estimation of disease extent defined as definitely less than 20% (mild disease extent) or definitely more than 20% (severe disease extent) was performed. All sections from the lung apex to the hemidiaphragm were evaluated. In order to keep results specific for visual analysis, we did not include the FVC threshold of 70% proposed by Goh et al [13] in cases with an indeterminate extent of disease on HRCT.

Pulmonary fibrosis was defined as presence of reticular changes, honeycombing, or both.

Coarseness of lung fibrosis

The most extensive parenchymal pattern in each lobe was recorded as categorical coarseness grade 0, normal lung; grade 1, ground-glass opacity; grade 2, fine reticulation; grade 3, coarse reticulation; and grade 4, honeycombing. The primary coarseness score represented the sum of coarseness grades (grade 0–4). To remove the effect of pattern extent and prevent the underestimation of coarseness severity in patients, in whom some lobes had no parenchymal abnormality, the score was adjusted proportionally to a six-lobe score [31]:

$$ \mathrm{CS}={\sum}_{1-n}^n\left(\mathrm{CG}\right)/{\mathrm{L}}_{\mathrm{ILD}}\ast 6 $$

where n is the number of lobes, CS is the coarseness score, CG is the coarseness grade, and L_ILD is the number of lobes with ILD.

Radiomics

3D lung segmentation

We chose to segment only the right lung, since the presence of the heart on the left side potentially makes lung segmentation more difficult and may lead to alteration of results. The right lung of each patient was segmented semi-automatically with dedicated software MIM (Version 6.0, MIM Software Inc.) by setting the Hounsfield unit (HU) values from − 950 to − 150 HU. Where automatically registered borders did not correspond with lung borders, manual corrections were made. The hilar vessels were carefully excluded.

Extraction of texture features

Prior to analysis, all images were resampled to isotropic voxels of 2 mm, using linear interpolation [32]. In total, 1116 features were extracted with two bin sizes (10 and 35 HU) corresponding to the following feature classes [33]:

4 shape features
19 intensity features
105 texture features (52 from the gray-level co-occurrence matrix, 5 from the neighborhood gray-tone difference matrix, 32 from the gray-level run length matrix, and 16 from the gray-level size zone matrix)
976 wavelet features (coiflet filtering)

Feature descriptions and mathematical definitions were used as described (see the Supplemental Material).

Data augmentation

Data augmentation was performed using the imbalance package in R (version 3.4.0; R Foundation for Statistical Computing) and applying a majority weighted minority oversampling technique (MWMOTE) (details can be found in the Supplemental Material). After applying the MWMOTE technique, the dataset consisted of an equal number of GAP1 (n = 54) and GAP2 (n = 54) stage patients. An example of data oversampling and resulting feature values is shown in the Supplemental Material.

Splitting of the dataset into training and testing datasets

In order to ensure the generalizability of the trained statistical models, the balanced dataset was then randomly split into separate training (n = 76 patients, n = 38 GAP1 and n = 38 GAP2) and testing dataset (n = 32 patients, n = 16 GAP1 and n = 16 GAP2) using a ratio of 0.7:0.3. The entire dimension reduction and feature selection process as further described in the “Results” section was performed only on the training dataset.

Statistical analysis

Statistical analysis was performed in R (version 3.4.0; R Foundation for Statistical Computing) with RStudio (version 1.0.136; RStudio). R packages used for statistical analysis are described in the Supplemental Material. All continuous data are given as means ± standard deviation. Categorical variables are expressed as frequencies or percentages. A two-tailed p value of < 0.05 was considered to indicate statistical significance. Testing for group differences was performed by using Wilcoxon’s signed-rank tests and Friedman’s test after assessing normal distribution of the data. The chi-squared test was used to compare categorical parameters.

For selecting variables that allow classification of GAP stages 1 and 2, single and multiple logistic regression models were fitted and compared by using the Akaike information criterion (AIC). The misclassification rate of these models was assessed by using 10-fold cross-validation. The diagnostic accuracy of optimal predictive parameters was evaluated from the area under the curve (AUC) from receiver operating characteristic (ROC) analyses, and diagnostic sensitivity and specificity were calculated.

Similarly, predictive value of ILD-HRCT features for the GAP stage was tested.

Results

Visual assessment of HRCT

In 17 out of the 60 cases, readers disagreed about the presence of ILD features. In these cases, disagreement was resolved in consensus reading.

Eight patients showed pulmonary emphysema (13%), eight honeycombing (13%), 52 subpleural lines (87%), 24 bronchiectasis (40%), 29 ground-glass opacities (48%), 52 reticular changes (87%), 36 pleural margins (60%), and 11 fibrosis involving more than 20% of the lung parenchyma (18%). Mean coarseness score was 12.3 (SD ± 3.6).

For detailed information and distribution of the features among GAP stages, please refer to Table 1 and Fig. 2.

Highest AUC could be obtained when combining honeycombing, emphysema, and bronchiectasis in a model, which resulted in an AUC of 0.86 with a sensitivity of 100% and a specificity of 63%. When performing ROC analysis, the AUC for predicting GAP stage with extent of fibrosis (fibrosis > 20%) is 0.606 (95% confidence interval 0.543–0.791, p = 0.145) with a sensitivity of 50% and a specificity of 85.2%.

When performing ROC analysis for coarseness score of fibrosis, the AUC for predicting GAP stage reached 0.863 (95% confidence interval 0.703–1.000, p = 0.004) with a sensitivity of 83% and a specificity of 74%. Differences between predicting ROC curves with extent of fibrosis versus coarseness of fibrosis were not statistically significant (p = 0.057).

Radiomics

Dimension reduction and radiomics feature selection for classification of GAP1 versus GAP2 stage

Radiomics feature selection and dimension reduction were performed on the augmented training dataset. After normalization of all numeric features using z-score standardization, features were fed into the Boruta dimension reduction and feature elimination algorithm as described previously [25, 26], resulting in the selection of 73 features, which were considered most important for classification accuracy. Since the Boruta algorithm does not account for collinearity in the data, a correlation matrix was calculated in a next step in order to detect clusters of highly correlated features (defined as Pearson’s r ≥ .60; Fig. 3). After visualization of each single parameter in box and whisker plots and random forest models fitted separately on each of the six detected correlation clusters, only one feature from each cluster with the highest Gini index and visually the best separation between the two groups (“GAP1” and “GAP2” stage) was selected for further analysis. At the end of the multistep dimension reduction process, the six most important and independent features were selected for further statistical analyses: M_homogenity_n.LHL, neighContrast.LHL, fractal_dim.LLL, M_correlation.HLL, M_correlation.HHL, and sizeVar_n.LLH.

Training of statistical models for classification of GAP1 versus GAP2 stage

In the original non-augmented dataset, values for M_homogenity_n.LHL, M_correlation.HLL, and sizeVar_n.LLH were significantly lower in patients with a GAP stage of 2 when compared with those in patients with a GAP stage of 1 (p = 0.003, 0.001, and 0.007, respectively; Fig. 4 and Table 2). In contrast, values for neighContrast.LHL were significantly higher in patients with a GAP stage of 2 (p = 0.001). No significant differences were observed for fractal_dim.LLL and correlation.HHL, although the difference for fractal_dim.LLL reached statistical significance in the augmented dataset.

Table 2 Results of radiomics. GAP stage gender, age, and pulmonary function stage, n number of patients

Full size table

Single and multiple logistic regression models were fitted on the training dataset and compared according to their AIC. In single logistic regression models, M_homogenity_n.LHL and neighContrast.LHL showed the lowest AIC with 21.13 and 23.81, respectively (fractal_dim.LLL: 98.37, M_correlation.HLL: 41.50, correlation.HHL: 107.16, and sizeVar_n.LLH: 75.59). Results of the corresponding ROC analyses for the training, testing, and the original (non-augmented) datasets are shown in Table 3.

Table 3 Diagnostic performance of radiomics features and visual assessment of HRCT features. GAP stage gender, age, and pulmonary function stage, n number of patients, AUC area under the curve with bootstrapped 95% confidence intervals (CI)

Full size table

Combining M_homogenity_n.LHL and neighContrast.LHL in a model resulted in a higher AIC (21.94) and showed collinearity of the two features without significant improvement of diagnostic sensitivity and specificity. The combination of neighContrast.LHL and M_correlation.HLL in a multivariable model finally resulted in the lowest AIC of 10.73 with an AUC of 1.00, 100% sensitivity, and 97% specificity in the training dataset; an AUC of 0.92, 100% sensitivity, and 88% specificity in the test dataset; and an AUC of 0.96, 84% sensitivity, and 99% specificity in the original dataset (Fig. 5 and Table 3).

Ten-fold cross-validation of this model in the independent test dataset resulted in a cross-validation estimate of an accuracy of 0.88 (95% confidence interval 0.71–0.97).

Discussion

HRCT imaging together with PFT is currently the gold standard for a cost-effective, non-invasive assessment of ILD [34]. However, features to determine the presence of ILD are manifold and inter-reader variability, especially in unexperienced readers, is an issue. Radiomics, in contrast, is an objective imaging-based tool that enables a more detailed and reliable quantitative assessment of lesion characteristics, which is not hampered by subjective image interpretation and experience of the reader as in visual analysis.

In this study, we were able to show that radiomics features can predict GAP stage with a sensitivity of 84% and a specificity of almost 100%. Extent of fibrosis on HRCT and a combined model of different visual HRCT-ILD features performed worse in predicting GAP stage. We believe that this is due to the high inter-reader variability, even in expert radiologists, in determining the presence and severity of ILD features.

Since the dataset in our patient cohort was imbalanced regarding the distribution of the two classes with 54 patients in GAP1 stage, but only six patients in GAP2 stage (imbalanced ratio: 0.11)—thereby reflecting the prevalence of GAP1 versus GAP2 stages in our cohort of SSc patients—we performed a data augmentation step in order to achieve better class balance and to avoid model overfitting before further evaluation. This data augmentation technique does not affect the reliability of the statistical evaluation, and results have been additionally tested on the original dataset.

Extracted radiomics features can be divided into four groups, namely (1) first-order histogram-based features, (2) co-occurrence matrix-based features, (3) multiscale features, and (4) other features [35, 36]. The latter are part of a specific group of features that are related to neighborhood gray-tone difference matrix (GTDM) [35, 37, 38]. The GTDM is based on measuring the difference between the intensity level between each voxel and its neighboring voxels, resulting in features to resemble the human perception of the image. Homogeneity (M_homogenity_n.LHL) reflects the homogeneity of image textures and scales the local changes of image texture. High values of homogeneity denote the absence of intra-regional changes and locally homogenous distribution in image textures [39]. Fractal features (fractal_dim.LLL) provide important spatial information. Contrast (neighContrast.LHL) and correlation (M_correlation.HLL and correlation.HHL) rely on perceptual attributes of texture in terms of spatial changes in intensity or dynamic range of intensity [35, 37, 38]. In our study, the combination of neighContrast.LHL and M_correlation.HLL in a multivariable model resulted in an AUC of 0.92, 100% sensitivity, and 88% specificity in the test dataset and an AUC of 0.96, 84% sensitivity, and 99% specificity in the original dataset. AUC of the ROC curve for percentage of fibrosis was significantly worse in predicting GAP stage, and also, a model combining different HRCT-ILD features performed less well than radiomics did. These findings raise the question, if radiomics is able to capture features on HRCT which are not perceptible by the radiologist with the naked eye?

Radiomics has attracted increased attention in recent years, and several studies show that radiomics can be of benefit in terms of prognosis and diagnosis of multiple diseases, especially malignancies [21,22,23]. These studies have shown that radiomics features show great potential to serve as surrogate imaging markers for tissue biopsies [40] and reliably predict outcome [41,42,43,44] and drug response [45, 46]. Currently, there are different approaches for the evaluation of HRCT, namely (1) visual analysis, (2) semiquantitative analysis, and (3) quantitative analysis or automated approaches using artificial intelligence. While sheer visual analysis suffers from a relatively high inter-observer variability [47, 48], semiquantitative and quantitative analyses (such as densitometric analysis) have the potential to overcome the drawbacks of subjective visual assessment of CT images and have also been shown to correlate with therapeutic response outperforming qualitative analysis [48].

In the past decade, radiomics gained importance in medical imaging. Unlike computer-aided detection (CAD) systems, which are directed toward delivering a single answer (i.e., presence of a lesion or cancer), radiomics is a process designed to extract a large number of quantitative features from digital images, which are subsequently mined for hypothesis generation and testing. Recent data from non-malignant lung diseases suggest that the texture-based analysis of CT data might outperform the currently used visual and/or histogram measures for diagnosis and staging [49,50,51]. The process of radiomics-based stratification of data provides a far more detailed characterization of phenotypes than current criteria can.

Compared with other studies [52], we did not train the algorithm to recognize specific patterns or features, such as honeycombing or bronchiectasis. We trained the system to find an algorithm to differentiate between the different GAP stages. With this approach, we omitted to use pattern-based classifications coming from known guidelines for pulmonary fibrosis, as this might not reflect the activity of the disease and might narrow the diagnosis. By just providing lung function, age and gender as input parameters, the validation of the algorithm is quite open and thus, best-discriminating radiomics features might come from feature groups that are not per se visible or quantifiable by the radiologist.

At present, data on radiomics in ILD are limited. The accumulating results, however, are promising and underline the great potential of radiomics in HRCT for detection and staging. In the future, the use of radiomics in SSc-ILD management could be expanded to support treatment decisions. Future studies integrating both radiomics and tissue-based molecular information, however, will be needed to assess whether radiomics reflect the underlying pathophysiology and thereby allow distinguishing inflammatory from fibrotic processes. This would be the prerequisite for treatment guidance toward anti-inflammatory or anti-fibrotic drugs in the individual patient.

The limitations of this study include as follows: firstly, the GAP staging system consists of three stages (low, intermediate, high). We only have patients with GAP stages 1 and 2 in our cohort and the percentage of patients with GAP stage 2 is relatively small, thereby reflecting the prevalence of GAP1 versus GAP2 stage in our clinical population. We performed a data augmentation step in order to achieve better class balance and to avoid model overfitting. Secondly, we only evaluated data from one institution acquired with one CT scanner. Since differences in scanning parameters such as type of CT scanner, tube voltage, tube current, reconstruction kernel, and contrast agent may influence the results of quantitative analysis, our approach might need to be adapted for future use with other scanners and protocols. Further studies with higher patient numbers, on other scanners, are needed to validate our findings and to investigate potential outcome predictors in a longitudinal study setting. Thirdly, we chose the right lung for image evaluation. Even though evaluation of the left site in our patient population (see Supplemental material) showed comparable results between the two sides, we prefer to use the right lung for image evaluation, since the left lung, due to the proximity of the left lower lobe and lingula to the heart, might be more prone to motion artifacts due to cardiac pulsation and might therefore deliver less robust results. We acknowledge that in cases with asymmetrical lung involvement, this approach might alter the results. Finally, lung segmentation was performed semiautomatically. This approach gave us the opportunity to correct datasets, where automatically registered borders did not correspond with lung borders.

Conclusion

The correlation of radiomics with GAP stage, yet not with the visually defined features of ILD-HRCT, implies that radiomics might capture features indicating severity of SSc-ILD on HRCT, which are not recognized by visual analysis.

The texture-based radiomics features identified in this pilot study will pave the way for the assessment whether texture-based radiomics signatures may be valuable tools for computer-aided decision-making in imaging.

Abbreviations

GAP stage:: Gender, age, and pulmonary function stage
HRCT:: High-resolution computed tomography
ILD:: Interstitial lung disease
SSc:: Systemic sclerosis

References

Schurawitzki H, Stiglbauer R, Graninger W et al (1990) Interstitial lung disease in progressive systemic sclerosis: high-resolution CT versus radiography. Radiology. 176:755–759
CAS PubMed Google Scholar
Suliman YA, Dobrota R, Huscher D et al (2015) Brief report: pulmonary function tests: high rate of false-negative results in the early detection and screening of scleroderma-related interstitial lung disease. Arthritis Rheumatol 67:3256–3261
CAS PubMed Google Scholar
Hoffmann-Vold AM, Fretheim H, Halse AK et al (2019) Tracking impact of interstitial lung disease in systemic sclerosis in a complete nationwide cohort. Am J Respir Crit Care Med 200(10):1258–1266
Steen VD, Medsger TA (2007) Changes in causes of death in systemic sclerosis, 1972-2002. Ann Rheum Dis 66:940–944
PubMed PubMed Central Google Scholar
Wu X, Kim GH, Salisbury ML et al (2019) Computed tomographic biomarkers in idiopathic pulmonary fibrosis. The future of quantitative analysis. Am J Respir Crit Care Med 199:12–21
PubMed Google Scholar
Kim HJ, Brown MS, Elashoff R et al (2011) Quantitative texture-based assessment of one-year changes in fibrotic reticular patterns on HRCT in scleroderma lung disease treated with oral cyclophosphamide. Eur Radiol 21:2455–2465
PubMed Google Scholar
Mirsadraee S, Tse M, Kershaw L et al (2016) T1 characteristics of interstitial pulmonary fibrosis on 3T MRI-a predictor of early interstitial change? Quant Imaging Med Surg 6:42–49
PubMed PubMed Central Google Scholar
Pinal-Fernandez I, Pineda-Sanchez V, Pallisa-Nunez E et al (2016) Fast 1.5 T chest MRI for the assessment of interstitial lung disease extent secondary to systemic sclerosis. Clin Rheumatol 35:2339–2345
PubMed Google Scholar
Fanti S, De Fabritiis A, Aloisi D et al (1994) Early pulmonary involvement in systemic sclerosis assessed by technetium-99m-DTPA clearance rate. J Nucl Med 35:1933–1936
CAS PubMed Google Scholar
Wells AU, Hansell DM, Harrison NK, Lawrence R, Black CM, du Bois RM (1993) Clearance of inhaled 99mTc-DTPA predicts the clinical course of fibrosing alveolitis. Eur Respir J 6:797–802
CAS PubMed Google Scholar
Montesi SB, Caravan P (2019) Novel imaging approaches in systemic sclerosis-associated interstitial lung disease. Curr Rheumatol Rep 21:25
PubMed PubMed Central Google Scholar
Strollo D, Goldin J (2010) Imaging lung disease in systemic sclerosis. Curr Rheumatol Rep 12:156–161
PubMed PubMed Central Google Scholar
Goh NS, Desai SR, Veeraraghavan S et al (2008) Interstitial lung disease in systemic sclerosis: a simple staging system. Am J Respir Crit Care Med 177:1248–1254
PubMed Google Scholar
Kaloudi O, Miniati I, Alari S, Matucci-Cerinic M (2007) Interstitial lung disease in systemic sclerosis. Intern Emerg Med 2:250–255
CAS PubMed Google Scholar
Kim GHJ, Tashkin DP, Lo P et al (2012) Using transitional changes on HRCT to monitor the impact of cyclophosphamide or mycophenolate on systemic sclerosis-related interstitial lung disease. Arthritis Rheumatol 72(2):316–325
Google Scholar
Mannil M, von Spiczak J, Manka R, Alkadhi H (2018) Texture analysis and machine learning for detecting myocardial infarction in noncontrast low-dose computed tomography: unveiling the invisible. Invest Radiol 53:338–343
PubMed Google Scholar
Yasaka K, Akai H, Nojima M et al (2017) Quantitative computed tomography texture analysis for estimating histological subtypes of thymic epithelial tumors. Eur J Radiol 92:84–92
PubMed Google Scholar
Beckers RCJ, Lambregts DMJ, Schnerr RS et al (2017) Whole liver CT texture analysis to predict the development of colorectal liver metastases-a multicentre study. Eur J Radiol 92:64–71
PubMed Google Scholar
Mannil M, von Spiczak J, Hermanns T, Poyet C, Alkadhi H, Fankhauser CD (2018) Three-dimensional texture analysis with machine learning provides incremental predictive information for successful shock wave lithotripsy in patients with kidney stones. J Urol 200:829–836
PubMed Google Scholar
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology. 278:563–577
PubMed Google Scholar
Aerts HJ, Velazquez ER, Leijenaar RT et al (2014) Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 5:4006
CAS PubMed PubMed Central Google Scholar
Scrivener M, de Jong EEC, van Timmeren JE, Pieters T, Ghaye B, Geets X (2016) Radiomics applied to lung cancer: a review. Transl Cancer Res 2016 5:398–409
Bogowicz M, Vuong D, Huellner MW et al (2019) CT radiomics and PET radiomics: ready for clinical implementation? Q J Nucl Med Mol Imaging 63(4):355–370
PubMed Google Scholar
Nihtyanova SI, Schreiber BE, Ong VH et al (2014) Prediction of pulmonary complications and long-term survival in systemic sclerosis. Arthritis Rheumatol 66:1625–1635
PubMed Google Scholar
Ley B, Ryerson CJ, Vittinghoff E et al (2012) A multidimensional index and staging system for idiopathic pulmonary fibrosis. Ann Intern Med 156:684–691
PubMed Google Scholar
Ryerson CJ, Vittinghoff E, Ley B et al (2014) Predicting survival across chronic interstitial lung disease: the ILD-GAP model. Chest. 145:723–728
PubMed Google Scholar
Lee SH, Kim DS, Kim YW et al (2015) Association between occupational dust exposure and prognosis of idiopathic pulmonary fibrosis: a Korean national survey. Chest. 147:465–474
PubMed Google Scholar
(1980) Preliminary criteria for the classification of systemic sclerosis (scleroderma). Subcommittee for scleroderma criteria of the American Rheumatism Association Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum 23:581–590
Mango RL, Matteson EL, Crowson CS, Ryu JH, Makol A (2018) Assessing mortality models in systemic sclerosis-related interstitial lung disease. Lung. 196:409–416
CAS PubMed Google Scholar
Nguyen-Kim TDL, Maurer B, Suliman YA, Morsbach F, Distler O, Frauenfelder T (2018) The impact of slice-reduced computed tomography on histogram-based densitometry assessment of lung fibrosis in patients with systemic sclerosis. J Thorac Dis 10:2142–2152
PubMed PubMed Central Google Scholar
Walsh SL, Sverzellati N, Devaraj A, Wells AU, Hansell DM (2012) Chronic hypersensitivity pneumonitis: high resolution computed tomography patterns and pulmonary function indices as prognostic determinants. Eur Radiol 22:1672–1679
PubMed Google Scholar
Shafiq-Ul-Hassan M, Zhang GG, Latifi K et al (2017) Intrinsic dependencies of CT radiomic features on voxel size and number of gray levels. Med Phys 44:1050–1062
CAS PubMed PubMed Central Google Scholar
Leijenaar RT, Nalbantov G, Carvalho S et al (2015) The effect of SUV discretization in quantitative FDG-PET radiomics: the need for standardized methodology in tumor texture analysis. Sci Rep 5:11075
CAS PubMed PubMed Central Google Scholar
Hansell DM, Goldin JG, King TE Jr, Lynch DA, Richeldi L, Wells AU (2015) CT staging and monitoring of fibrotic interstitial lung diseases in clinical practice and treatment trials: a position paper from the Fleischner society. Lancet Respir Med 3(6):483–496
PubMed Google Scholar
Amadasun M, King R (1989) Textural features corresponding to textural properties. IEEE Trans Syst Man Cybern 19:1264–1274
Google Scholar
Larue RT, Defraene G, De Ruysscher D, Lambin P, van Elmpt W (2017) Quantitative radiomics studies for tissue characterization: a review of technology and methodological procedures. Br J Radiol 90:20160665
PubMed PubMed Central Google Scholar
Chen CC, Daponte JS, Fox MD (1989) Fractal feature analysis and classification in medical imaging. IEEE Trans Med Imaging 8:133–142
CAS PubMed Google Scholar
Cohen L (1989) Time-frequency distributions-a review. Proc IEEE 77:941–981
Google Scholar
Zhao Q, Shi C-Z, Luo L-P (2014) Role of the texture features of images in the diagnosis of solitary pulmonary nodules in different sizes. Chin J Cancer Res 26:451–458
PubMed PubMed Central Google Scholar
Bogowicz M, Tanadini-Lang S, Guckenberger M, Riesterer O (2019) Combined CT radiomics of primary tumor and metastatic lymph nodes improves prediction of loco-regional control in head and neck cancer. Sci Rep 9:15198
PubMed PubMed Central Google Scholar
Zhao B, Tan Y, Tsai WY et al (2016) Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci Rep 6:23428
CAS PubMed PubMed Central Google Scholar
Bogowicz M, Riesterer O, Bundschuh RA et al (2016) Stability of radiomic features in CT perfusion maps. Phys Med Biol 61:8736–8749
CAS PubMed Google Scholar
Parmar C, Leijenaar RT, Grossmann P et al (2015) Radiomic feature clusters and prognostic signatures specific for Lung and Head & Neck cancer. Sci Rep 5:11044
PubMed PubMed Central Google Scholar
Mattonen SA, Palma DA, Johnson C et al (2016) Detection of local cancer recurrence after stereotactic ablative radiation therapy for lung cancer: physician performance versus radiomic assessment. Int J Radiat Oncol Biol Phys 94:1121–1128
PubMed Google Scholar
Grossmann P, Narayan V, Chang K et al (2017) Quantitative imaging biomarkers for risk stratification of patients with recurrent glioblastoma treated with bevacizumab. Neuro Oncol 19:1688–1697
CAS PubMed PubMed Central Google Scholar
Aerts HJ, Grossmann P, Tan Y et al (2016) Defining a radiomic response phenotype: a pilot study using targeted therapy in NSCLC. Sci Rep 6:33860
CAS PubMed PubMed Central Google Scholar
Ariani A, Lumetti F, Silva M et al (2014) Systemic sclerosis interstitial lung disease evaluation: comparison between semiquantitative and quantitative computed tomography assessments. Biol Regul Homeost Agents 28:507–513
CAS Google Scholar
Yabuuchi H, Matsuo Y, Tsukamoto H et al (2014) Evaluation of the extent of ground-glass opacity on high-resolution CT in patients with interstitial pneumonia associated with systemic sclerosis: comparison between quantitative and qualitative analysis. Clin Radiol 69:758–764
CAS PubMed Google Scholar
Sorensen L, Nielsen M, Lo P, Ashraf H, Pedersen JH, de Bruijne M (2012) Texture-based analysis of COPD: a data-driven approach. IEEE Trans Med Imaging 31:70–78
PubMed Google Scholar
Cunliffe AR, Armato SG 3rd, Straus C, Malik R, Al-Hallaq HA (2014) Lung texture in serial thoracic CT scans: correlation with radiologist-defined severity of acute changes following radiation therapy. Phys Med Biol 59:5387–5398
PubMed PubMed Central Google Scholar
Kloth C, Blum AC, Thaiss WM et al (2017) Differences in texture analysis parameters between active alveolitis and lung fibrosis in chest CT of patients with systemic sclerosis: a feasibility study. Acad Radiol 24:1596–1603
PubMed Google Scholar
Walsh SLF, Calandriello L, Silva M, Sverzellati N (2018) Deep learning for classifying fibrotic lung disease on high-resolution computed tomography: a case-cohort study. Lancet Respir Med 6:837–845
PubMed Google Scholar

Download references

Acknowledgments

The work was funded by the Lunge Zürich (Switzerland) and the Schweizer Gesellschaft für Radiologie (SSR).

Funding

Open access funding provided by University of Zurich. This study has received funding by the Lunge Zürich (Switzerland) and the Schweizer Gesellschaft für Radiologie (SSR).

Author information

Authors and Affiliations

Institute of Diagnostic and Interventional Radiology, University Hospital Zurich, University of Zurich, Rämistrase 100, 8091, Zurich, Switzerland
K. Martini, B. Baessler, C. Blüthgen, M. Mannil & T. Frauenfelder
Department of Radiation Oncology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
M. Bogowicz & S. Tanadini-Lang
Center of Experimental Rheumatology, Department of Rheumatology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
J. Schniering & B. Maurer

Authors

K. Martini
View author publications
You can also search for this author in PubMed Google Scholar
B. Baessler
View author publications
You can also search for this author in PubMed Google Scholar
M. Bogowicz
View author publications
You can also search for this author in PubMed Google Scholar
C. Blüthgen
View author publications
You can also search for this author in PubMed Google Scholar
M. Mannil
View author publications
You can also search for this author in PubMed Google Scholar
S. Tanadini-Lang
View author publications
You can also search for this author in PubMed Google Scholar
J. Schniering
View author publications
You can also search for this author in PubMed Google Scholar
B. Maurer
View author publications
You can also search for this author in PubMed Google Scholar
T. Frauenfelder
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Martini.

Ethics declarations

The authors confirm that the study has been carried out in compliance with ethical standards.

Guarantor

The scientific guarantor of this publication is Prof. Thomas Frauenfelder.

Conflict of interest

The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.

Statistics and biometry

No complex statistical methods were necessary for this paper.

Informed consent

Written informed consent was obtained from all subjects (patients) in this study.

Ethical approval

The retrospective study has been approved by the institutional review board (BASEC-Nr. 2018-02165) and Institutional Review Board approval was obtained.

Methodology

• retrospective

• diagnostic or prognostic study

• performed at one institution

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(DOCX 194 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Martini, K., Baessler, B., Bogowicz, M. et al. Applicability of radiomics in interstitial lung disease associated with systemic sclerosis: proof of concept. Eur Radiol 31, 1987–1998 (2021). https://doi.org/10.1007/s00330-020-07293-8

Download citation

Received: 19 March 2020
Revised: 30 July 2020
Accepted: 14 September 2020
Published: 06 October 2020
Issue Date: April 2021
DOI: https://doi.org/10.1007/s00330-020-07293-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Applicability of radiomics in interstitial lung disease associated with systemic sclerosis: proof of concept

Abstract

Objective

Methods

Results

Conclusion

Key Points

Similar content being viewed by others

Association of quantitative computed tomography ındices with lung function and extent of pulmonary fibrosis in patients with systemic sclerosis

Quantitative computed tomography assessment for systemic sclerosis–related interstitial lung disease: comparison of different methods

Performance of a new quantitative computed tomography index for interstitial lung disease assessment in systemic sclerosis

Introduction

Methods

Patients

HRCT protocol

ILD features on HRCT

Visual assessment of lung fibrosis severity

Extent of lung fibrosis

Coarseness of lung fibrosis

Radiomics

3D lung segmentation

Extraction of texture features

Data augmentation

Splitting of the dataset into training and testing datasets

Statistical analysis

Results

Visual assessment of HRCT

Radiomics

Dimension reduction and radiomics feature selection for classification of GAP1 versus GAP2 stage

Training of statistical models for classification of GAP1 versus GAP2 stage

Discussion

Conclusion

Abbreviations

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Guarantor

Conflict of interest

Statistics and biometry

Informed consent

Ethical approval

Methodology

Additional information

Publisher’s note

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation