Introduction

Lung cancer is one of the most common and serious causes of cancer-related deaths worldwide [1]. With the development of high-resolution computed tomography (CT) and the popularization of low-dose CT screening, the incidence of pure ground-glass nodules (pGGNs) is rapidly increasing [2, 3]. A pGGN can be defined as a nodule with a hazy attenuation increase in the lung window, without a solid component when viewed with mediastinal window settings, and without vessel and bronchial structure obscuring [4, 5]. According to the new classification proposed by the International Association for the Study of Lung Cancer, the American Thoracic Society, and the European Respiratory Society, lung adenocarcinomas include atypical adenomatous hyperplasia (AAH), adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), and invasive adenocarcinoma (IAC) [6]. Persistent pGGNs lasting for more than three months have been proven to be associated with early stage lung adenocarcinoma including AAH, AIS, MIA and IAC [7,8,9,10,11]. Most pure GGNs are preinvasive lesions; however, recent histologic studies have shown that approximately 20% to 50% have invasive components [12,13,14,15]. In recent study, Ye et al. found that 10.8% of pure ground-glass lung adenocarcinoma nodules were of the IAC subtype among 988 pulmonary nodules [16].

Lobectomy is the standard surgical treatment for IACs; however, AIS and MIA may be candidates for sublobar resection [17]. Previous studies have demonstrated that the 5-year disease-free survival (DFS) of patients with AIS and MIA can reach 100% or almost 100%, while the DFS of patients with IAC is 40–85% [18]. Therefore, distinguishing IACs from preinvasive lesions and MIAs before surgery is crucial for clinical management and prognosis prediction in patients with pGGNs.

Clinical features including age, sex, and smoking history have been found as predictors of nodule growth and pathologic diagnosis [8, 19, 20]. Adenocarcinoma also accounts for a large percentage of lung tumors in female patients [21]. Huang et al. found that non-smoking female patients with lung cancer were more likely to have adenocarcinoma [22]. IAC occurred more often in older patients; Hu et al. found that aged ≥ 60 years was one of independent predictors of pGGNs histologic invasiveness [23].

CT is the most commonly used technique for the detection and differentiation of pGGN invasiveness [24]. Previous studies have shown that IAC differentiation is mainly based on radiographic features, including morphological features (margin, shape, vessel change, bubble sign, and pleural indentation) and quantitative features (lesion size, CT value, and volume) [14, 15, 19, 20, 25, 26]. While morphological features widely depend on the experience of observers and quantitative features are affected by scanning parameters (e.g., consistency of measurement), some features in pGGNs overlap; this is especially true in small nodules with a size of < 6 mm [27].

Radiomics that can extract high-throughput data from medical images and analyses with numerous quantitative descriptors in order to investigate the associations between imaging features and various endpoints have a promising potential for the evaluation of pGGN invasion [28,29,30]. Most previous studies have only extracted texture features from nonenhanced CT images and have included a small number of lesions. It would be very helpful in deciding on the optimal treatment plan if using clinical, radiographic and radiomics features for the differentiation of IAC from MIA and preinvasion (AAH and AIS) in evaluating pGGNs.

Therefore, the authors of the present study hypothesized that the combination of clinical, radiographic and radiomics features can improve the diagnostic ability to determine the histological invasiveness of adenocarcinomas appearing as pGGNs. The purpose of this study is to develop a combined prediction model in order to help guide an individualized preoperative design of surgical procedures.

Materials and methods

Study population

The present retrospective study was approved by the institutional research board, and the requirement for informed consent was waived.

The records and images of patients with pGGNs who underwent a preoperative chest CT were retrospectively reviewed. All nodules were pathologically confirmed as lung adenocarcinomas (AAH, AIS, MIA, and IAC) via surgical resection between January 2017 and December 2020 (Fig. 1 shows the flowchart of study population), lobectomy resection for patients with IAC, whereas limited resections for patients with AAH, AIS or MIA.

Fig. 1
figure 1

The flowchart of study population

Inclusion criteria: (1) Patients with a single pGGN on thin-slice (≤ 1.50 mm) CT lung window images (window width: 1200 HU and −600 HU); (2) patients with a nodule diameter of < 30 mm; and (3) patients who underwent surgical resection (including lobectomy resection and limited resections) within one week of receiving a CT scan.

Exclusion criteria: (1) Patients who received preoperative treatment or biopsy which hemorrhage or exudates occurred around pGGNs; (2) patients with a history of malignant tumors; (3) patients who had preoperative CT images with obvious artifacts affecting further analysis, the decision was made by an experienced radiologist before grouping; and (4) patients with multiple GGN (mGGN).

Finally, 382 patients aged 35–73 years (mean age = 56 ± 10 years) with a total of 382 pGGNs were enrolled in the present study. The patients comprised 123 males aged 35–67 years (mean age = 54 ± 11 years) and 259 females aged 38–73 years (mean age = 58 ± 13 years). Of these patients, 231 were assigned to the training cohort and 99 were assigned to the test cohort randomly assigned to the training and testing cohorts with a ratio of 7:3; meanwhile, 52 patients scanned at the third CT were assigned to the external validation cohort. According to the pathological results (AAH, AIS, MIA, and IAC), preinvasive adenocarcinoma and MIA were considered noninvasive lesions. All included pGGNs were divided into two groups: the non-IAC group (n = 275; 72%) and the IAC group (n = 107; 28%).

Acquisition of CT imaging

All patients received a contrast-enhanced chest CT using the Siemens SOMATOM Definition Flash, Siemens SOMATOM Sensation Open CT or GE Revolution. The detailed scanning parameters are listed in Table 1. The contrast-enhanced CT was performed after 25 s of intravenous administration of iodinated contrast material (2 mL/kg) at a rate of 3 mL/sec. The CT scans were acquired from all patients in the supine position at full inspiration. Scan coverage was from the lung base to the thoracic inlet.

Table 1 CT scanning parameters

Image preprocessing

To minimize noise and processing artifacts, resampling of image data was avoided wherever possible, as previously described in similar investigations [31]. First, linear interpolate was applied to re-sampled the CT images and make voxel isotropic of 1 × 1 × 1 mm. Second, the image was discretized in gray scale and the binwidth was set to 25. Third, a Laplacian of Gaussian convolution kernel filter (σ = 3, 5, and 7) and wavelet transform was employed to decrease noise and enhance features at different spatial scales.

Clinical characteristics and CT image evaluation

According to the research literature related to lung cancer [32], the clinical characteristics, including age, gender, smoking status, and clinical symptoms, were derived from medical records. Two thoracic radiologists with 5 and 10 years of experience, respectively, reviewed the CT images of each patient and identified the radiographic features including morphological and quantitative features without knowledge of the patient’s pathological results. Decisions regarding the CT features were reached by consensus. Morphological features included lobulation, spiculation, pleural indentation, and the vacuole sign. Quantitative features included the lesion maximum diameter and mean CT value. The CT images were read with a lung window setting (width: 1200 HU; level: −600 HU).

Reproducibility analysis

To ensure reader reproducibility, 50 patients were randomly selected for a reproducibility analysis. For the evaluation of the interobserver agreement of the radiomics features, two radiologists with 8 years (Reader 1) and 10 years (Reader 2) of experience in chest CT interpretation, respectively, completed the region of interest (ROI) delineation without any information regarding the patients. Next, Reader 1 repeated the ROI delineation with an interval of one week for the assessment of the interobserver agreement. The interobserver and interobserver agreement of the radiomics feature extraction were evaluated using interclass and intraclass correlation coefficients (ICC). Features with an ICC of > 0.8 were regarded as an acceptable agreement and were included in the subsequent analyses.

Segmentation of pulmonary nodules

Segmentation of pulmonary nodules was performed on the thin-slice enhanced images with a lung window setting (width, 1200 HU; level, −600 HU) using the ITK-SNAP software (version 3.8.0, https://www.itksnap.org). The radiologist delineated the nodule length, and the software automatically drew regions of interest covering the entire range of the tumor on the axial CT images. The radiologist manually adjusted and identified the boundary regions on each section. The involved pleura, blood vessels, and bronchi at the edge of the nodules were excluded from nodule segmentation.

Radiomics feature extraction

Radiomics feature extraction was performed using the PyRadiomics. A total of 1130 radiomics features were extracted from the contrast-enhanced CT image for each patient [33]. The extracted features comprised tumor shape features, first-order statistic features, and texture features, such as the gray-level co-occurrence matrix (GLCM), gray-level run length matrix, gray-level dependence matrix, gray-level zone matrix (GLSZM), and neighborhood gray tone difference matrix, to reflect internal heterogeneity as previously described [34].

clinical-radiographic-feature-based model

Relevant clinical features, including sex, age, smoking history, and clinical symptoms, and radiographic features including morphological and quantitative features were analyzed to build clinical-radiographic-based model.

The clinical feature selection was performed in three steps: (1) The Shapiro–Wilk test was used to test the normality of the data sets; (2) the Student t test or Wilcoxon rank sum test was used for the continuous variables (mean CT value and maximum diameter), and the \({\chi }^{2}\) or Fisher exact test was applied for the categorical variables (sex, smoking history, and clinical symptoms); and (3) the stepwise multivariable logistic regression analysis was applied to obtain the independent clinical risk factors and build a clinical-feature-based model.

Radiomics-feature-based model

The radiomics-feature-based model was built in four steps: (1) The correlation analysis was performed to identify the redundant features (features with a correlation coefficient of > 0.8 were eliminated) [28]; (2) the Mann–Whitney U-test was performed to compare the differences between each radiomics feature in the two groups (radiomics features with a p value of < 0.05 were kept); (3) the least absolute shrinkage and selection operator (LASSO) was performed for feature selection [35,36,37,38]; and (4) the stepwise multivariate logistic regression analysis based on the Akaike information criteria was performed to identify the optimal radiomics features for the differentiation of noninvasive lesions and IACs. Then, a combined predictive model was built with the use of selected conventional and radiomics features.

The performance of the three models in the training, testing, and validation cohorts was evaluated by the receiver operating characteristics (ROC) curves; the area under the curve (AUC), accuracy, sensitivity, and specificity were calculated, respectively. A radiomics signature-based score (Rad-score) for each outcome was then obtained from the final model. The Rad-score cut-off values for differentiation of the non-IAC group and the IAC group were chosen according to the Youden index criteria. To estimate model goodness-of-fit, calibration curves were performed, and the Hosmer–Lemeshow test was used to assess model consistency [39, 40]. Decision curve analyses (DCAs) were used to evaluate the potential net benefit based on the clinical diagnosis, radiomics, and the combined model in the different cohorts.

Results

Patient demographic characteristics

A total of 382 patients with pGGNs were enrolled in the present retrospective study according to the inclusion and exclusion criteria. Among them, 275 were diagnosed with noninvasive lesions (AAH, AIS, and MIA) and 107 were diagnosed with IAC. The training cohort comprised 330 patients (330 pGGNs); these patients were divided into the training cohort and the testing cohort according to the ratio of 7:3 with stratified sampling. A total of 52 patients (52 pGGNs) were assigned to the validation cohort. The demographic characteristics of three cohorts of the 382 patients are listed in Table 2.

Table 2 Demographic characteristics

Feature selection

According to the univariate analysis results, three clinical features (gender, age, and smoking history), six radiographic features including four morphological features (burr, lobe, vacuole sign, and pleural involvement), and two quantitative features (mean CT value and axis max length diameter) were found to be significant in the differentiation between noninvasive lesions and IPAs in the training cohort; only one quantitative feature (mean CT value) was found to be significant between noninvasive lesion and IPAs in the external validation cohort.

According to multivariable logistic regression, age, gender, and the mean CT value were selected to build clinical-radiographic-based model. After LASSO and the stepwise logistic regression analysis were conducted, three radiomics features were ultimately selected to build a radiomics-feature-based model. These features included the Log.5.0_glszm_SmallAreaHighGrayLevelEmphasis, wavelet.LHL_glcm_MCC, and wavelet.LLL_glcm_SumAverage.

Performance of clinical-radiographic, radiomics, and combined models

The ROCs and AUCs of the three cohorts are shown in Fig. 2A–C. The best model was the combined model in the training and testing cohorts, with an AUC of 0.856 and 0.859, in the validation cohort, the radiomics model and the combined model performed better than clinical model, with an AUC of 0.814, 0.765 and 0.692, respectively. The radiomics models showed an excellent predictive performance in the discrimination of IACs from noninvasive lesions in the three cohorts. The radiomics signature yielded an AUC of 0.854, 0.846, and 0.814 in the training, testing, and validation cohorts, respectively (Table 3), and the sensitivity was 0.883, 0.840 and 0.955, specificity was 0.663, 0.644 and 0.633 in three cohorts.

Fig. 2
figure 2

The ROCs of the three models in the training cohort (a), testing cohort (b), and validation cohort (c). The predictive performance for an invasive pGGN lesion was better in the combined model than in the clinical and radiomics models in the training and testing cohorts. In the validation cohort, the radiomics model and combined model performed better than clinical model

Table 3 Diagnostic performance of three models for the prediction of pGGN invasiveness

The Rad-score was calculated from the selected radiomics features. The corresponding regression coefficients and distribution in the training, testing, and validation cohorts are presented in Fig. 3. The Rad-score cut-off value was −1.66.

Fig. 3
figure 3

The box plots show the distribution between noninvasive and invasive lesions for GGNs in the training cohort (a), testing cohort (b), and validation cohort (c). The Wilcoxon test obtained p values. The Rad-score was higher in the invasive group than in the noninvasive group

The calibration curves showed good predictability between prediction and observation in the three cohorts (Fig. 4). Compared with the results of the validation cohort, the correspondence between actual and ideal predictions suggested good calibration of the clinical, radiomics, and combined models in the training and testing cohorts.

Fig. 4
figure 4

Calibration curves for the prediction of pGGN invasiveness based on the three models in the training cohort (a), testing cohort (b), and validation cohort (c). The x-axis represents the predicted probability of IACs based on the clinical, radiomics, and combined models, and the y-axis represents the actual probability of pGGN invasiveness. The 45° diagonal line represents ideal prediction, and the red, green, and blue lines represent the predictive performance of the nomogram. The closer the line was to the ideal line, the better the predictive nomogram performance

The decision curve analysis for the three models is presented in Fig. 5. The decision curve indicated that the use of a combined model for the prediction of invasive lesions added more net benefit than the use of clinical features or radiomics features alone in the differentiation of IACs from noninvasive lesions; this was especially true in the training and testing cohorts. The DCA in the external validation cohort showed that the combined and radiomics models achieved higher average precision scores than the clinical-radiographic model.

Fig. 5
figure 5

Decision curve analysis for the combined model (c) compared with the clinical model (a) and radiomics models (b) alone. The x-axis shows the threshold probability, and the y-axis measures the net benefit. The black line represents the hypothesis that all patients with pGGNs had noninvasive lesions, and the gray line represents the hypothesis that all patients with pGGNs had invasive lesions

Clinical-radiographic features, including gender, age, mean CT value, and radiomics features (Log.5.0_glszm_SmallAreaHighGrayLevelEmphasis, wavelet.LHL_glcm_MCC, and wavelet.LLL_glcm_SumAverage), were significantly associated with an increased risk of IACs. The highest odds ratio (OR) was detected for gender in the clinical model (OR = 2.466), for wavelet.LLL_glcm_SumAverage in the radiomics model (OR = 2.287), and for wavelet.LLL_glcm_SumAverage in the combined model (OR = 2.471) (Table 4).

Table 4 Associations between features and invasiveness of pGGNs

Discussion

Radiomics are a rising field of quantitative imaging that can extract high-throughput data features and describe tumor phenotype characteristics. Radiomics analysis provides a noninvasive and powerful alternative for disease diagnosis, differentiation, clinical treatment, and assessment [34]. Recent studies have succeeded in radiomics analyses in the field of oncology, showing the diagnostic and prognostic values [34, 41, 42].

Therefore, in the present study, the radiomics features and evaluated clinical features were extracted from CT images for the prediction of pGGN invasiveness; they were then investigated, and the performance of the clinical, radiomics, and combined models for classifying noninvasive lesions and IACs was compared.

The results of the present study showed that the combined model performed better than the radiomics and clinical models, with a higher AUC in the training and testing cohorts (0.856, 0.859, respectively), suggesting that it is a noninvasive tool for differentiating IACs from noninvasive lesions.

Some studies have suggested that radiomics features can be used to differentiate pGGN invasiveness. Previous studies reported that in clinical practice, lesion size, CT value, and morphological characteristics were associated with pGGN invasiveness [43,44,45]. Recent studies have highlighted the combined model, which incorporated clinical features and radiomics features in the diagnosis of lung cancer.

Sun et al. and Liu et al. [46, 47] showed that the predictive model for IAC constructed by integrating the clinical and radiomics features based on the radiomics nomogram exhibited excellent accuracy in the differentiation of noninvasive lesions from IACs (AUC 0.831; 95% CI: 0.765–0.897). Although morphological characteristics, such as lobulation, burr, vacuole sign, and pleural involvement sign, are helpful in the identification of nodule invasiveness, they were not included in the final clinical model construction of this study; this is mainly because the morphological characteristics of early stage lung cancer are usually atypical, especially in IACs with a smaller diameter than 1 cm. Furthermore, identification of the morphological features depended on the radiologist’s diagnostic experience.

Former studies have found that the mean nodule CT value is a significant predictor in the differentiation of pGGN invasiveness. Previous study has shown that pGGNs with a mean CT value higher than −600 HU indicated invasive adenocarcinoma [23].

Wu et al. [48] evaluated CT and histopathologic features of lung adenocarcinoma with pGGNs that were ≤ 10 mm in diameter, and the results showed no statistically significant difference in the CT value between noninvasive lesions and IACs.

In the present study, the multivariate logistic regression analysis revealed that the mean CT value was useful in histopathologic subtype differentiation; IACs reflected a higher mean CT value than noninvasive lesions. The study also showed that the AUC for the clinical model established by using gender, age, and the mean CT value was lower than the AUC for the radiomics model in the three cohorts (AUC = 0.729, 0.652, and 0.692, respectively).

Radiomics show the ability to serve as a bridge between medical imaging and precise medicine [49,50,51]. Yang et al. [52] utilized 14 radiomics features of lung adenocarcinoma to distinguish IACs and noninvasive lesions; an AUC of 0.77 was achieved. The radiomics signatures performed better than the most commonly used clinical features, such as the mean CT value. Weng et al. [53] identified that the nomogram, which integrated morphology characteristics and radiomics features, showed a high performance in the classification of IACs and MIAs (AUC = 0.888).

In the present study, the radiomics features performed a good differentiation ability in three cohorts (AUC = 0.854, 0,846, and 0.814, respectively); this ability was significantly better at predicting IACs than in the clinical model. The findings of the present study are consistent with the findings of previous studies, which have found various texture and shape features to be significant predictors of IACs.

Furthermore, radiomics features comprising tumor shape features, first-order statistic features, and texture features in a noninvasive, three-dimensional manner may allow for more precise and personalized treatment than traditional modality of these patients with pGGN detected by CT.

The clinical and radiomics models were then combined to improve the diagnosis accuracy; the combined model had a better performance than clinical and radiomics model alone and achieved a satisfactory result in the external validation cohort. The ROC and DCA also indicated that using the combined model to predict invasive lesions added more net benefit than using clinical features or radiomics features alone in differentiating IACs from noninvasive lesions, especially in the training and testing cohorts.

The present study has several limitations. First, it is a single-institutional retrospective study, and the sample size is small; large and multi-institutional cohorts will be recruited in future research. Second, the CT images were acquired from different scanners, and standardization of scanning and reconstruction parameters is required for further study. Third, although an external validation cohort was constructed, the number of patients assigned to it was relatively small; thus, it is necessary to increase the number of cases in this group to identify the performance of the model and usefulness of radiomics signatures.

Conclusion

In conclusion, the radiomics signature from the CT images provided a noninvasive modality for IAC prediction. Radiomics signatures combined with clinical features yielded a better performance than using alone in differentiating IACs from noninvasive lesions appearing as pGGNs on thin-slice CT; this may facilitate clinical diagnosis and treatment in further work.