Introduction

Increasing use of chest CT, such as for lung cancer screening, and widespread reduction of slice thickness have led to a higher incidence of small pulmonary nodules in clinical routine. Their management remains challenging for clinicians since benign and malignant nodules have ambiguous radiographic features [1]. A previous study suggested that substantial increases in chest imaging and nodule detection produced more false-positive results but at the same time failed to identify more cases of lung cancer [2]. Apart from long-term imaging controls to detect growth, histological work-up, or PET/CT, the probability of lung cancer in incidental pulmonary nodules > 8 mm can be estimated non-invasively by statistical prediction models in clinical routine [3, 4]. Two of the most commonly used models are the Mayo Clinic Model for incidental nodules and the Brock University Model for screening-detected nodules [5, 6]. Both models have been widely validated in several studies based on various populations with both types of nodules [7,8,9,10,11]. However, a recent study based on a large cohort of 23’789 participants with incidental pulmonary nodules reported only an “acceptable” predictive value of both models with a tendency toward lung cancer overestimation [12]. Some groups tried improving risk and outcome prediction by taking the peritumoral environment into account [13,14,15,16]. For example, Lee et al reported that combining intratumoral radiomics with peritumoral radiomics improved the predictive value regarding the outcome prediction in NSCLC patients [16]. Therefore, it is tempting to speculate whether a whole-lung approach could add further value to the existing models.

Quantitative computed tomography (QCT) has long been established as an objective method to assess lung parenchymal and airway abnormalities in various diseases such as COPD and interstitial lung disease (ILD) [17,18,19,20]. Since COPD and ILD are associated with an increased lung cancer risk, we hypothesized that QCT may yield predictive value concerning malignancy; however, its potential has hardly been explored to date. Thus, the aim of the present study was to explore potential QCT metrics to explain malignancy in incidental pulmonary nodules in 251 subjects in a population at risk using a whole-lung as well as a lobe-based approach. Consecutively, the most promising metrics could be further investigated in larger datasets and potentially be integrated into the Mayo Clinic Model and the Brock University Model.

Materials and methods

This study was approved by the local ethics committee and conducted in accordance with the principles of the declaration of Helsinki.

Population

For this retrospective exploratory study, patients with chest CT scans between 01/2010 and 12/2021 containing pulmonary nodules were screened. After the strict application of the exclusion criteria, 251 subjects were eligible for the final analysis (Fig. 1). Of note, only one exam per patient was permitted and only one predefined index nodule, the most suspicious or the largest lesion, was analyzed. Clinical information was obtained from the electronic medical records.

Fig. 1
figure 1

Patient flowchart

Chest computed tomography parameters

All patients were examined on a 64-row CT scanner (Definition AS64, Siemens, Siemens Medical Solutions) in full inspiration supine position and without intravenous contrast administration. The following acquisition parameters were used: 100–120 kV and 70 mAs reference with dose modulation (Caredose 4D, Siemens), collimation 0.6 mm, reconstructed slice thickness 1.0 mm, and increment 0.8 mm in an iterative medium-soft kernel (I40f, SAFIRE level 3, Siemens).

Quantitative post-processing

The previously well-evaluated in-house software YACTA (version 2.9.4.16) analyzed the chest CT images fully automated. The airway tree and lung parenchyma were segmented and QCT parameters were calculated for the whole lung as well as individually for each lobe (Table 1) as previously described [21,22,23,24,25]. The overall segmented lung voxels are subdivided into 50% central and 50% peripheral lung zones.

Table 1 Quantitative CT (QCT) parameters computed by YACTA

Statistical analysis

All statistical analyses were performed using SPSS Statistics version 25.0. (IBM Corp. 2017) and GraphPad Prism (GraphPad Software, Inc., version 8) under the guidance of a professional statistician. Continuous parameters are reported as median and interquartile range (IQR), if normal distributions are not expected, else mean with standard deviation (SD) is given. Comparisons between the groups were performed using the Mann-Whitney U test. Categorical variables are reported as absolute numbers and percentages, and comparisons between the groups were performed using the chi-square test. Additionally to the description, those QCT variables with a descriptive difference (based on median and Mann-Whitney U test) were investigated using a univariate logistic regression. Due to the small number of malignant tumor patients (n = 51), a backward variable selection based on likelihood ratio tests starting with these QCT variables was then conducted to not endanger the stability of the model. The logistic regression models are evaluated using the areas under the receiver-operating-characteristic curve (AUC) based on all available data. The different models were compared descriptively with a likelihood ratio test. For the nodule-bearing lobe analysis, the parameters were weighted according to their relative share in total lung volume and then compared with the rest of the lung using the Wilcoxon test for paired samples. This is an exploratory analysis. Hence, all p values are of descriptive nature and no formal sample size calculation was conducted.

Results

After the strict application of the eligibility criteria, 251 patients with one defined index lesion each were consecutively included (Fig. 1).

The median [IQR] age of the cohort was 65 [57–73] years, and 37% (n = 92) were females. Sixty-three percent were current or former smokers, and the proportion of these ever-smokers was slightly different between the benign and the malignant nodule group (61 vs. 71%).

The malignant nodules all had a histological work-up; the dignity of the benign nodules was proven by either histology (n = 11/200) or at least 1-year follow-up (n = 189/200). During the follow-up, 13% of the benign nodules had decreased in size or were entirely gone. The mean [SD] nodule diameter was 9.5 [6.9] mm for all patients and differed between the benign vs. the malignant nodule group (7.9 [3.3] vs. 16.0 [11.8] mm) (Table 2). Regarding the applied effective doses, there was no difference regarding the median [IQR] size-specific dose estimation between the benign and the malignant nodule group (5.1 [4.4–5.8] vs. 5.0 [4.1–5.9] mSv).

Table 2 Patient demographics and nodule characteristics. Continuous parameters as median (interquartile range), categorical parameters as absolute numbers (percentages)

Association of whole-lung QCT and nodule malignancy

The group comparison showed differences regarding multiple whole-lung QCT parameters (Table 3). Patients with malignant nodules for example had lower MLD, higher EI, and greater lung volumes (Fig. 2). However, they had a lower FIBI and GGOI. There were no differences observed between the groups regarding the airway parameters.

Table 3 Quantitative CT parameters (whole lung). Parameters as median [interquartile range]
Fig. 2
figure 2

Exemplary YACTA analyses. Top row (a): A 75-year-old male with severe, mainly left-sided emphysema (marked in yellow) with an 8 mm nodule in the right upper lobe (red circle). The nodule turned out to be a primary pulmonary adenocarcinoma. The patient had suffered of recurrent pneumothorax caused by this severe one-sided bullous emphysema of unknown etiology since childhood. The 3D volume image on the right depicts the airway segmentation and the large clusters of emphysema in the left lung (whole lung parameters: MLD, −814.7 HU; EI, 10.1%; tumor-bearing lobe parameters: MLD, −824.3 HU; EI, 2.6%). Bottom row (b): A 73-year-old male COPD patient (50 py) with a 5 mm nodule in the right upper lobe (red circle). The nodule disappeared during follow-up and therefore was classified as benign. The 3D volume image on the right depicts the airway segmentation and the small clusters of emphysema distributed over both lungs (whole lung parameters: MLD, −827.0 HU; EI, 17.3%; tumor-bearing lobe parameters: MLD, −831.4 HU; EI, 15.6%)

MLD and pEI are associated with an increased risk of malignancy

Univariate logistic regression analysis showed that multiple parameters are influential variables for pulmonary nodule malignancy. These results are consistent with the description of the QCT variables grouped by malignancy.

MLD, Perc15, GGOI, or FIBI showed a protective association with malignancy (OR < 1). EI, pEI, or BI were associated with an increased malignancy risk (OR > 1). In order to elaborate a multivariate QCT parameter model, a backward variable selection was performed, starting with all parameters with univariate p values < 0.05. It resulted in a model consisting of MLD and pEI (Table 4). A ROC-curve based on all available data was constructed for this model, the corresponding AUC was 0.69 (95%-CI, 0.61–0.76).

Table 4 Results of the logistic regression analysis.

Addition of MLD and pEI potentially improves the Mayo Clinic Model and the Brock Model

To evaluate a potential benefit of the QCT parameters for the Mayo Clinic Model or the (parsimonious) Brock University Model, MLD and pEI were integrated into these models. In both models, the AUC based on all available data increased slightly after the addition of MLD and pEI: The AUC of the Mayo Clinic Model increased from 0.91 (95%-CI, 0.86–0.96) to 0.93 (95%-CI, 0.88–0.97, p = 0.02) and of the parsimonious Brock Model from 0.88 (95%-CI, 0.84–0.93) to 0.91 (95%-CI, 0.88–0.95; p < 0.001) (Fig. 3).

Fig. 3
figure 3

a The ROC curves of the Mayo Clinic Model without (red curve) and with (blue curve) the QCT parameters MLD and pEI. b The respective ROC curves of the parsimonious Brock Model with (blue curve) and without (red curve) the QCT parameters MLD and pEI

Nodule-bearing lobes show less lung disease than the rest of the lung

To explore the features of the nodule-bearing lobes, a lobe-specific analysis was performed and the QCT parameters of the respective lobes were compared to the rest of the lung per subject. While the MLD was similar, parameters indicating emphysema were lower in the nodule-bearing lobe compared to the rest of the lung similarly in subjects with malignant and benign nodules. In both groups, FIBI and the airway WP were higher in the nodule-bearing lobes compared to the rest of the lung (Table 5).

Table 5 Parameters presented as median [interquartile range]

Discussion

In this retrospective exploratory study, we found that whole-lung QCT metrics may support in identifying malignancy in incidental pulmonary nodules. Furthermore, mean lung density and the emphysema index of the peripheral zone might improve the performance of two established radio-clinical risk models. We additionally observed that in both groups the nodule-bearing lobes had less emphysema, more fibrosis, and more airway wall thickening than the rest of the lung within the same individual. This analysis can serve as a starting point for further research on the explanatory characteristics of QCT variables in pulmonary nodule malignancy.

The fact that lung cancer and emphysema are linked, is well appreciated [26, 27] as they share smoking as an underlying trigger, and our findings agree with the literature, as the whole-lung EI was higher in the malignant compared to the benign nodule group. By using bulla shape-based features, we could furthermore show that the malignant nodule group had a higher proportion of peripheral zone emphysema and larger clusters of emphysema than the benign nodule group.

The whole-lung approach used in this study has several advantages over the widespread tumoral or peritumoral approach: First, it does not have the limitation of region-of-interest (ROI) selection and the connected inter-observer variability. Second, it is highly standardized and therefore not only effort- and time-saving but also enables optimal comparability with follow-up studies. Although the existing literature on lung cancer prediction using whole-lung radiomics is rather sparse, our findings are somewhat consistent with similar studies. For example, Liang et al analyzed a smaller cohort of idiopathic pulmonary fibrosis patients (n = 116) and reported that the histogram-based whole-lung radiomics features kurtosis and energy have a significant predictive value regarding lung cancer development, especially if combined with traditional risk factors such as smoking status or age [28]. In the current exploratory study, nine out of thirteen analyzed parameters were different between the benign and the malignant nodule group, and six of them showed a predictive value in the logistic regression analysis. In contrast to Liang et al, the current study not only relied on histogram-based radiomics but also included shape-based radiomics such as the bulla index. As reported by Wiemker et al, these features have the advantage of not only depending on a fixed HU threshold and may also account for irregular shapes and overlapping or open contours of the bullae [29].

One of the relevant advantages of QCT parameters is that they are well-evaluated and generated from routine imaging. Thus, neither additional query, investigation, invasive or otherwise bothering interventions nor costly analyses are necessary.

Regarding the comparison to the standard prediction models, Vachani and colleagues have already pointed out that both, the Mayo Clinic and the Brock University Model, may overestimate cancer probabilities in certain cohorts [12]. In their study, the Mayo Clinic Model had an AUC of 0.75, which was slightly better than the Brock Model, with an AUC of 0.71.

In this study, both models showed an excellent performance; as expected, the Mayo Clinic Model performed slightly better than the Brock Model (AUC, 0.91 vs. 0.88) since it was initially designed for incidental pulmonary nodules.

The derived model from the backward variable selection based on QCT parameters alone already had an AUC of 0.69 on all available data. In consequence, the integration of QCT parameters into the two standard models increased the AUCs to 0.93 and 0.91, respectively. The AUCs were all calculated using the complete dataset, which means that the model is evaluated with the data it was created with. The reason for this unconventional approach is that the amount of data and the number of malignant tumor patients were rather low. Of note, a separation into training, validation, and test sets would have resulted in a small test set with an expected 10 malignant patients (20% of the data) which would not have allowed a robust evaluation of the derived model. The interpretation is therefore strictly descriptive and no definite conclusions about the predictive value of the QCT variables can be drawn. Additionally, it must be noted that the AUCs for the here proposed models are very likely estimated too high, since an evaluation based on the training data results in overfitting.

The lobe-specific analysis of QCT parameters revealed that amongst others, the nodule-bearing lobes had lower EI, lower EI in the peripheral zone, and smaller BI compared with the rest of the lungs. Interestingly, the benign and the malignant nodule groups did not show any differences in the lobar approach. One possible explanation for this finding is that there is less functional lung parenchyma and less blood supply in areas of emphysema and that thus neoplasms of any kind may be less probable in lobes with fewer grounds to grow on. Literature on the relationship between emphysema and lung cancer remains controversial. For example, Hohberger et al stated that higher regional emphysema scores are associated with the presence of lung cancer in a cohort containing 624 malignant nodules using a semiquantitative approach [30]. In contrast to the current study, this approach is based on a subjective estimate rather than objectively measured values. Other studies using a quantitative approach failed to show a link between emphysema and lung cancer at all [31, 32]. In the lobe-specific analysis, we observed that the WP was higher in the nodule-bearing lobes compared with the rest of the lungs. Similar to the emphysema-associated parameters, this accounted for both, the benign and the malignant nodule group and is most probably based on the underlying inflammatory or malignant process. However, this finding needs to be elaborated in future studies, e.g. to evaluate the predictive value of different types of bronchial wall thickening regarding malignancy.

This study has several limitations. The most relevant one is the lack of external validation regarding the proposed QCT- and the expanded models because they were so far only evaluated on the same data used to derive the models. Therefore, further validation of the models on an independent dataset is warranted. Furthermore, the algorithm could not segment and therefore not exclude the nodules from the analysis, which might have affected the QCT parameters.

However, this effect would only account for the density-based parameters MLD, EI, and FIBI. It can be assumed that the current results of the whole lung analysis would have been even more significant after the exclusion of the nodules, since the malignant nodules were larger and their voxels contribute to the higher HU values in the lungs. This leads, for example, to the fact that the differences for MLD, FIBI, and EI between groups would become even larger. Regarding the lobar approach, the relative contribution of a nodule to the lobe-based CT parameters could be greater than in the whole-lung approach. Thus, an exclusion of the nodules indeed might affect the density-based QCT parameters (MLD, EI, and FIBI) of the respective lobe and could lead to a lower MLD, higher EI, or lower FIBI in the respective lobe. However, the other emphysema-related parameters (pEI, EIcc120, Perc15, and BI) can be assumed to be robust to this effect, since they rather describe the distribution and clustering of the emphysema instead of being solely based on the voxel density.

Then, the recruitment occurred in a dedicated chest hospital with frequent referrals of patients with severe ILD and COPD, presenting with incidental nodules. Thus, our results may not be readily transferable to screening populations. The sample size was relatively small and only contained 51 malignant nodules. However, our cohort is clinically well characterized and CT protocols were strictly standardized, which was necessary to ensure comparability regarding the QCT parameters [33, 34]. This led to the exclusion of a great number of cases. Lastly, depending on the CT scanner and its specific settings, the quantitative results may vary, however, they should be consistent within the same institution. Future endeavors should focus on the harmonization of imaging protocols facilitating the advance of QCT analysis and fostering cross-institutional standardization and reproducibility.

In conclusion, the present study demonstrates that QCT parameters of the whole lung may be considered for malignancy risk assessment in incidental pulmonary nodules. QCT might add value in combination with the established Mayo Clinic and Brock University Model.