Introduction

In 2015, the World Health Organization (WHO) formally defined STAS (spread through air spaces), that is, tumor cells appear in the lung tissue surrounding the primary tumor in the form of micropapillary cell clusters, solid cancer nests, or single tumor cells, while STAS was identified as the fourth type of invasion mode of lung adenocarcinoma [1]. Regarding STAS-positive patients with clinical or pathological stage IA lung cancer, lobectomy can achieve a superior clinical prognosis compared to sublobectomy and lower the risk of postoperative tumor recurrence and metastasis [2,3,4]. At present, the gold standard for the diagnosis of STAS is postoperative histopathological examination; consequently, it is challenging to develop appropriate preoperative surgical strategies. Moreover, preoperative puncture and intraoperative frozen pathological examination have drawbacks such as low sensitivity, limited tissue samples, and short diagnosis time [5, 6].

Therefore, scholars attempted to explore the STAS status of lung cancer patients based on preoperative CT images, and the results demonstrated that some radiological features of lung cancer were correlated with STAS status, such as tumor size, solid component size, ratio of solid component size to total tumor size (consolidation-to-tumor ratio, CTR), air bronchogram sign, vacuole sign, spiculation sign, and lobulation sign [7,8,9,10,11,12]. Additionally, Kim et al. [11] found that STAS was absent in pure ground-glass nodule (pGGN). Based on different inclusion criteria and different multiple regression models, the prediction performance of the model varied. In general, the area under curve (AUC) values of the model ranged from 0.726 to 0.803.

As a characterization algorithm, radiomics offers a non-invasive method to characterize the biological behavior of lesions through high-throughput extraction and analysis of a large number of quantitative image features [13]. Indeed, earlier studies have showcased that intratumoral and peritumoral radiomics techniques have tremendous potential in predicting lymphovascular invasion, lymph node metastasis, and clinical prognosis of lung cancer [14,15,16]. Both Zhuo et al. [17] and Liao et al. [18] constructed models based on intratumoral and peritumoral radiomics features of clinical T1 stage lung adenocarcinoma including pGGN, but the peritumoral volume of interest (VOI) acquisition methods were different in the two studies, and the conclusions were also varied. In Zhuo’s study [17], a spherical shape was fitted based on the tumor center point, and the spherical peritumoral scope was obtained by extending uniformly to the periphery by 5 mm, 10 mm, and 15 mm, and the results showed that the peritumoral radiomics models were not well fitted. Liao’s study obtained four types of peritumoral segmentation volume (PTV) by extending the range of 5 mm, 10 mm, 15 mm and 20 mm to the periphery along the segmented tumor edge, and the results showed that the gross radiomic signature (GRS) model which combined tumor radiomic signature (TRS) and peritumoral region of 15 mm radiomic signature (PRS-15 mm) achieved the highest values of AUC [18]. Although, the two studies constructed combined models based on the best radiomics signatures, CT morphological features, and clinical information, all of which have achieved good discriminative accuracy based on internal cohort with AUC values of 0.99 and 0.869 in the internal validation cohort, respectively. However, the transportability and generalizability of the models’ predictive efficacy have not been verified through external cohort. Furthermore, previous studies described above only included histologic adenocarcinoma tumors, but STAS was reportedly associated with poor prognosis of other types of lung cancer, such as lung squamous cell carcinoma, lung pleomorphic carcinoma, and lung neuroendocrine tumors [19,20,21].

In this study, we focused on clinical stage IA NSCLC and excluded pGGN which was clearly STAS-negative. We constructed the radiomics model based on the independent segmented VOI of gross tumor volume (GTV), four types of peritumoral volume (PTV) (5 mm, 10 mm, 15 mm, and 20 mm around the tumor), and their corresponding four types of gross peritumoral and tumor volume (GPTV). We aimed to explore its value in predicting STAS status of clinical stage IA NSCLC and to explore whether it can further improve the diagnostic efficiency combined with relevant radiological features and valuable clinical information in both the internal and external cohorts.

Materials and Methods

Patients

The data of NSCLC patients who underwent chest CT examinations and postoperative pathological assessment of STAS status in our hospital and the other two hospitals from September 2019 to September 2022 were retrospectively analyzed. We collected 290 lung cancers presenting as pGGNs, none of which was positive for STAS. As reported in references [11], we excluded pGGNs.

The inclusion criteria were as follows: (i) thin-slice chest CT with slice thickness ≤ 1.5 mm and no artifacts within 1 week before surgery; (ii) complete clinical and pathological data; (iii) clinical stage IA NSCLC (cT1N0M0, the maximum tumor diameter ≤ 3 cm); (iv) solid or mixed ground glass nodules (mGGNs). The exclusion criteria were as follows: (i) poor image quality; (ii) incomplete clinical and pathological data; (iii) maximum diameter of tumor > 3 cm; (iv) tumors with lymph node or distant metastasis; (v) the pathological type was not NSCLC; (vi) preoperative neoadjuvant and chemotherapy; (vii) pGGNs. A total of 336 patients were included from our hospital (hospital 1) and randomly divided into the training cohort (n = 236) and the internal verification cohort (n = 100) at a ratio of 7:3. Furthermore, 69 cases from the other two hospitals were used as the external validation cohort, including 30 patients in hospital 2 and 39 patients in hospital 3. The training cohort was used to train the designed group model, and the validation cohort was used to evaluate the accuracy of the model. The detailed patient inclusion procedure is shown in Fig. 1. The training cohort was used to develop the prediction model, the internal validation cohort was used to test the reproducibility of the model development process, and the external validation cohort was used to evaluate the transportability and generalizability of the model in data from different hospitals.

Fig. 1
figure 1

The flow chart for patient selection

If multiple lesions in the same patient were surgically removed and met the inclusion criteria, we referred to Dercle et al. [22] to select a representative lesion with the largest tumor size for analysis. Therefore, independent and different patient data were randomly assigned to the training cohort and the internal validation cohort.

This study was approved by the ethics committee of our hospital (hospital 1, decision number: CZ-20220712–03), hospital 2 (decision number: GYLLPJ-20220714–27), and hospital 3 (decision number: CYLLPJ-20220719–08). Due to the retrospective nature of this study, informed consent was waived.

Equipment and Parameters

Patients in our hospital (hospital 1) underwent preoperative chest CT examinations with four types of CT machines from three vendors, including the Toshiba Aquilion16 row, GE Light Speed VCT64 row, Philips Ingenuity 64 row, and Brilliance iCT 128 row CT machines. In the external cohort, patients from hospital 2 were assessed with American Light Speed 16, Light Speed VCT64 row, and Dutch Philips iCT 256-row CT machines. Hospital 3 utilized the SOMATOM Definition Flash and SOMATOM Drive 64-row CT machines from Germany. The patients were instructed to lie in the supine position during the scan, which covered the entire lung field. The parameters were set as follows: tube voltage was set to 120 kVp, with a tube current of 150–250 mAs or automatic tube current regulation. The scanning slice thickness and slice increment were 5 mm, while the reconstruction slice thickness and slice increment were 1 mm in hospital 1 and hospital 2 and 1.3 mm in hospital 3. The lung algorithm or standard algorithm reconstruction was selected, and non-contrast enhanced images were used for analysis.

Clinicopathological and Radiological Data

Patient data were collected, including sex, age, clinical symptoms, smoking status, family history of lung cancer, history of malignancy, history of multiple primary lung cancer, CEA levels, surgical type, and pathological type. The DICOM images of patients were imported into software (RadiAnt DICOM Viewer 4.2.1, Medixant, Poland) and analyzed by two independent radiologists with 2 years and 10 years of experience who were blinded to the pathological information. The lung window (width 1500 HU, level − 500 HU), mediastinal window (width 300 HU, level 50 HU), multiplanar reformation (MPR), and maximal intensity projection (MIP) were used to analyze the lesion. For quantitative measures, the average measurements of two independent radiologists were used as the final data. For qualitative indicators, disagreements were discussed until a consensus was reached.

First, the longest diameter of the whole tumor and the consolidation part were measured at the lung window on the MPR images, and the CTR was calculated [7]. Clinical T staging was performed according to the maximum diameter of the solid components of the tumor [23].

Second, we assessed the following qualitative radiological features: tumor location, density type (solid, mGGN), shape (round, irregular), tumor-lung interface (well-defined, ill-defined), margin (lobulation, spiculation), internal characteristics (vacuole sign, cavity/cystic airspace), and external characteristics (vascular convergence, air bronchogram, pleural tags, pleural indentation, ill-defined peripheral opacity, satellite lesions, distal ribbon sign, combined with emphysema).

The definitions of radiological features are described in Supplementary Table S1, and graphical figures of radiological features are shown in Supplementary Fig. S1-S7. Most of the definitions of these features of pulmonary nodules have been previously reported [11, 24,25,26,27]

Image Processing and Model Construction

Standardized image resampling and grayscale discretization were performed on the CT images. ITK-SNAP 3.8.0 software (www.itksnap.org) was used to outline the total volume of the tumor slice by slice along the tumor boundaries, and the GTV was determined, which was used as VOI. GTV was defined as the whole tumor area that was identified within the visible tumor boundary. During segmentation, blood vessels, bronchi, surrounding pleura, and atelectatic lung tissue were avoided as much as feasible. Differences in opinion were resolved by discussion and reaching a consensus. Thirty lesions were randomly selected, and two radiologists with 2 years and 10 years of experience who were blinded to the pathological information independently segmented the tumor to evaluate inter-observer repeatability. One month later, the radiologist with 2 years of working experience performed secondary segmentation on thirty lesions to evaluate intra-observer repeatability. The remaining lesions were segmented by a radiologist with 2 years of working experience. About the definition of peritumoral extent, previous study quantified the histopathologically proven distance between tumor surface and farthest STAS from the tumor edge was 17 mm [28]. Based on the above research, in order to cover all potential STAS, the peripheral extension distance of lung cancer in this study was increased to 20 mm, and four different gradients were set in 5 mm units to make an exploratory study. Python 3.1.1 (https://www.python.org) was used to write the expansion algorithm program to capture the range of 5 mm, 10 mm, 15 mm, and 20 mm peritumoral areas based on the segmented GTV to get the VOIs of GPTV (labeled as GPTV5, GPTV10, GPTV15, and GPTV20, respectively), and pixel filtering was performed on peritumoral non-lung tissues (blood vessels, chest wall, ribs, neck, mediastinum, abdominal cavity) according to the pixel value threshold. Then, the intratumoral mask was subtracted from the GPTV masks to obtain peritumoral areas from the tumor surface, which was the VOIs of PTV (labeled as PTV5, PTV10, PTV15, and PTV20, respectively).

The PyRadiomics open-source software (version 3.0.1, https://pyradiomics.readthedocs.io/en/latest/changes.html) was used to extract radiomics data from images, including 14 morphological features, 18 first-order statistical features, and 68 texture features (22 Gy co-occurrence matrix GLCM, 14 Gy dependence matrix GLDM, 16 Gy size area matrix GLSZM, and 16 Gy run matrix GLRLM). A total of 100 original features were obtained (Supplementary Table S2). In order to acquire high throughput features, the image voxels were transformed by non-linear intensity (square, square root, logarithm, and exponent). Gaussian Laplacian (LoG) conversion was performed with sigma values of 1 mm, 2 mm, 3 mm, 4 mm, and 5 mm, and eight wavelet transforms (LLL, LLH, LHL, LHH, HLL, HLH, HHL, and HHH) were carried out for first-order statistical features and texture features, yielding in a total of 1218 radiomics features (Supplementary Table S3). The definition of each radiomics feature is provided in the Supplementary Table S4. Considering that the CT images of the cases included in this study were collected from multiple hospitals and CT protocols, the intensities of all radiomics features were normalized by the ComBat compensation method (Combat Tool is available here: https://forlhac.shinyapps.io/Shiny_ComBat) and z-score (z = x − μ/σ) transformation [29,30,31].

Intraclass correlation coefficient (ICC) was used to evaluate intra-observer and inter-observer consistency between the segmented intratumoral and peritumoral radiomics features; the “psych” package in R language was used to test the consistency of the radiomics features. Firstly, in order to mitigate overfitting, the maximal redundancy minimal relevance (mRMR) algorithm and the least absolute shrinkage and selection operator (LASSO) logistic regression method were applied to features with a good consistency (ICC > 0.80) in the training cohort to limit the dimension of the features [32]. Secondly, tenfold cross-validation was used to select the optimal regularization parameter λ value. Under the optimal λ value, features whose coefficients were not equal to 0 were used as the features to construct the radiomics model. Finally, radscore was calculated based on the linear model by selecting the optimal radiomics features, and the Wilcoxon test was used to compare differences between the STAS-positive group and the STAS-negative group. Overall, nine radiomics models were constructed, and their diagnostic efficiency was evaluated. The model with the highest AUC value in the external validation cohort was considered the best radiomics model.

In order to avoid missing meaningful variables, variables with p < 0.1 in univariate analysis were involved in multivariate analysis of logistic regression, and backward step-wise selection was applied by using the likelihood ratio test with Akaike’s information criterion (AIC) as the stopping rule to select the best combination of variables to build the clinical prediction model in the training cohort [33].

Following this, the radscore of the best radiomics model and clinical predictors were utilized to construct a combined model and design a nomogram, and its predictive efficacy was evaluated centrally in internal and external validation, as illustrated in Fig. 2.

Fig. 2
figure 2

Overall design flow chart of this study

Pathological Diagnosis

Pathologic diagnosis of each patient included in our study was established by two pathologists, respectively, a junior pathologist and a senior pathologist with more than 10 years of work experience, according to the 2015 WHO definition of STAS [1]. The classification of lung cancer was based on the WHO classification of lung cancer (2015 edition) [1], and the clinical and pathological staging was based on the TNM staging standard of lung cancer (8th edition) [23]. It is important to note that the pathologic diagnoses were determined as part of routine clinical practice, and the specimens were not reviewed specifically for this study.

Statistical Analysis

SPSS 20.0 software and R statistical software (R version 4.2.2) were used for statistical analysis. Pearson’s chi-squared test, Yate’s correction for continuity, or Fisher’s exact test was used for categorical variables analysis. Regarding the clinical-radiological features selected by univariate and multivariate logistic regression analysis, P < 0.05 was considered statistically significant. The ROC curve and AUC value were used to evaluate the diagnostic performance of the model, the DeLong test was used to analyze the difference in AUC value between the models, the Hosmer–Lemeshow test and calibration curve were used to examine the goodness of fit of the model, and decision curve analysis (DCA) was used to analyze the clinical applicability of the model. Inter-observer and intra-observer consistency tests were performed using the “psych” package of R language, the “rms” package of R software was employed for multivariate logistic regression analysis and constructing the nomogram and calibration curves, the “pROC” software package was used for ROC curve analysis, and the “rms” package was applied for internal and external validation. The “dca.R” package was used to analyze the decision curve. The kappa coefficient and ICC were used to evaluate the consistency of qualitative and quantitative parameters among observers, respectively.

Results

Clinicopathological Characteristics and Radiological Features

Of the 405 patients with NSCLC, 118 were STAS-positive, and 287 were STAS-negative. Statistically significant differences in sex were observed in the training cohort and the external validation cohort (P < 0.05). The difference in the surgical method was statistically significant in the training and internal validation cohorts (P < 0.05), whereas the difference in smoking status, carcinoembryonic antigen (CEA) levels, and pathological type was statistically significant exclusively in the training cohort (P < 0.05). On the other hand, significant differences were identified in clinical symptoms in the external validation cohort (P < 0.05). Concerning radiological features, good consistency was observed in terms of quantitative parameters between the two observers (ICC 0.934–0.935), with strong consistency in qualitative indicators (Kappa 0.852–1.000); the interobserver agreement assessment results of each index are shown in Supplementary Table S5.

Model Development and Evaluation

The optimal combinations of variables selected by multivariate logistic regression analysis consisted of sex, CEA level, CTR, density type, and distal ribbon sign, among which density type (OR = 6.738, 95% CI 3.107 ~ 15.18, P < 0.001) and distal ribbon sign (OR = 5.141, 95% CI 2.272 ~ 12.00, P < 0.001) were independent risk factors for STAS (Tables 1 and 2), and there was no multicollinearity (Supplementary Table S6). The AUC values of the clinical model constructed based on the aforementioned variables in the three cohorts were 0.874, 0.822, and 0.810, respectively (Table 3).

Table 1 Clinicopathological and radiological characteristics of patients in the training and two validation cohorts
Table 2 Univariable and multivariable logistic regression analysis of factors in the training cohor
Table 3 The predictive efficacy of GTV, PTV, GPTV radiomics model in three cohorts

Among the 1218 radiomics features extracted from each VOI, the proportion of radiomics features with inter-observer and intra-observer ICC greater than 0.8 ranged from 86.0 to 99.8% (Supplementary Table S7). Among the features with ICC > 0.80, the mRMR algorithm was first used to eliminate redundant and irrelevant features, and 30 features were retained in each group. Then, LASSO logistic regression method was used to select the optimized feature subset to establish the final model, and tenfold cross-validation was used to select the values of the optimal hyperparameter λ, which were identified to be 0.0006 (GTV), 0.0033 (PTV5), 0.0226 (PTV10), 0.0139 (PTV15), 0.0071 (PTV20), 0.0143 (GPTV5), 0.0135 (GPTV10), 0.0054 (GPTV15), and 0.0955 (GPTV20), respectively. With the optimal λ values, 19, 16, 9, 15, 16, 11, 10, 12, and 2 features were selected to construct the 9 radiomics models. The features used for radiomics model construction and their ICC details are shown in Supplementary Table S8. The group with the highest AUC value in the external validation cohort was considered the best radiomics model. The results demonstrated that the GPTV10 radiomics model had the best prediction performance, with AUC values of 0.887, 0.876, and 0.868, in the three cohorts (Table 3). Besides, the DeLong test demonstrated that the GPTV10 model significantly outperformed PTV10 and GPTV20 in the training cohort, PTV (5, 10, 15, 20) in the internal validation cohort, and PTV (5, 10, 15, 20) and GPTV15 in the external validation cohort (P < 0.05). Detailed results of DeLong tests between radiomics models are shown in Supplementary Table S9-S11.

Based on GPTV10 radscore and the selected clinical-radiological predictors, a combined model was constructed, and a nomogram was developed, as delineated in Fig. 3. The combined model formula is as follows: Nomoscore = (Intercept) ×  − 1.56 + sex × 0.60 + CEA × 1.46 + CTR × 0.56 + density type × 0.57 + distal ribbon sign × 1.01 + GPTV10 radscore × 0.85, and its AUC values in the three cohorts were 0.901, 0.875, and 0.878 (Table 4). ROC curves of the clinical model, GPTV10 radiomics model, and combined model in the three cohorts are presented in Fig. 4. The DeLong test showed that the combined model was superior to the clinical model in the three cohorts (Z = 2.480, 2.068, 2.388, P < 0.05), detailed results of DeLong tests between models in three cohorts are shown in Supplementary Table S12. Meanwhile, the Hosmer–Lemeshow test showed that the combined model was well-fitted in all three cohorts (P = 0.473, 0.496, 0.246), and the calibration curve portrayed that the predicted probability value of the combined model was in good agreement with the actual situation, as shown in Fig. 5. Lastly, the DCA illustrated that the combined model had superior clinical application value compared to the clinical model or the GPTV10 radiomics model alone, as illustrated in Fig. 6.

Fig. 3
figure 3

Nomogram for preoperative prediction of STAS status based on intratumoral and peritumoral radiomics and clinical-radiological features in clinical stage IA NSCLC

Table 4 The predictive efficacy of clinical model, GPTV10 radiomics model, and combined model in three cohorts
Fig. 4
figure 4

ROC curve analysis of the clinical model, GPTV10 radiomics model, and combined model in three cohorts. a The training cohort; b the internal validation cohort; c the external validation cohort

Fig. 5
figure 5

The calibration curves of combined model in the three cohorts. a The training cohort; b the internal validation cohort; c the external validation cohort

Fig. 6
figure 6

The decision curve shows that the combined model has better clinical application value than the clinical model and GPTV10 radiomics model in the three cohorts. a The training cohort; b the internal validation cohort; c the external validation cohort

Discussion

STAS, as an important invasive mode of lung cancer, affects the postoperative recurrence-free survival (RFS) and overall survival (OS) of patients [2]. Accurate diagnosis of STAS status prior to surgery is conducive to the clinical selection of the optimal surgical method, thereby prolonging survival and enhancing the prognosis of patients [3, 4]. Herein, a combined model was constructed based on GPTV10 radscore and clinical-radiological features to predict STAS status in patients with stage IA non-small cell lung cancer. The results showed that the combined model had high diagnostic efficacy for STAS, with AUC values of 0.901, 0.875, and 0.878, in the three cohorts. Moreover, the accuracy was 83.90%, 80.00%, and 82.61%; the sensitivity was 84.75%, 82.76%, and 100%; and the specificity was 83.62%, 78.87%, and 69.23%. In addition, the created nomogram can transform complex regression equations of the model into a visual graph, which is intuitive and easily interpreted, as well as facilitate the preoperative assessment of STAS status.

In this study, spiculation sign, pleural indentation sign, and vascular convergence sign were more common in the STAS-positive group, which was consistent with previous literature reports [34, 35] and might be related to its pathological mechanism. Spiculation is associated with tumor cell infiltration into adjacent blood and lymphatic vessels, suggesting lung cancer is more aggressive [36], while pleural indentation arises from intratumor reactive fibrous hyperplasia, pulling the adjacent pleura and causing the deviation of the pleura from its original position [37]. Vascular convergence sign is also caused by the reactive fibrous hyperplasia of the tumor, which pulls the adjacent pulmonary vessels, causing them to converge toward the tumor [38]. Notably, a higher degree of tumor infiltration is associated with a higher level of internal reactive fibroplasia and a higher probability of pleural indentation and vascular convergence, indicating a greater risk of STAS. The results of this study revealed that tumor density was an independent risk factor for STAS. Among them, the incidence of STAS in patients with solid nodules on CT images was 63.4% (85/134), while that in patients with mGGNs was 12.1% (33/271), which was consistent with the results of previous studies [11, 39, 40]. Solid components typically represent the more aggressive section of the tumor. It is worthwhile emphasizing that prior studies have pointed out that CTR was positively correlated with STAS while negatively correlated with ground-glass opacity (GGO) positivity. Higher CTR values were associated with more aggressive tumors; the more likely STAS positivity is, the worse the prognosis of patients [40]. In this study, the STAS-positive rate was 36.6% (115/314) in patients with CTR ≥ 50%. Conversely, the STAS-positive rate was merely 3.3% (3/91) in patients with CTR < 50%, signaling a higher risk of STAS in cases with higher solid components on CT images, which was in agreement with the observation of previous research. This study also found that the distal ribbon sign was an independent risk factor for STAS. This might be attributed to tumor cells escaping from the primary lesion, redistributing through the airway, and proliferating along the surrounding alveolar wall, resulting in parenchymal obstruction of the surrounding lung or obstruction of the terminal bronchioles, thereby reducing the gas content in the alveoli. This phenomenon is similar to the ground glass ribbons discovered by Qi et al. [35]. In general, a higher degree of tumor invasion is associated with a higher incidence of STAS, which is reflected by a higher proportion of tumor solid components on CT images and more malignant radiological features. Lobectomy is recommended for this type of early lung cancer.

In the present study, on the premise of accurate tumor segmentation, four peritumoral regions with different gradient ranges were automatically expanded to construct nine radiomics models in order to explore the most efficient radiomics model to predict STAS status, and a multi-center study was conducted to evaluate the generalizability of the model. The results showed that the GPTV10 radiomics model had the highest predictive efficiency. Kadota et al. [28] found that, in 97% (151/155) of the STAS-positive cases, the distance between STAS and the edge of the primary tumor lesion ranged from 0.3 to 10.5 mm, and the peritumoral extension range of 10 mm may accurately cover various high-order features related to the heterogeneity of lung cancer. Based on the VOI of GPTV10, 10 best radiomics features were selected, including the two first-order features, which was correlated with the CT value of the tumor. The higher the CT value, the denser the tumor cells, and the higher the degree of tumor invasion. The eight texture features are Small Area Low Gray Level Emphasis, Gray Level Variance, Zone Entropy, Cluster Shade, Cluster Prominence, Imc2, Large Dependence Low Gray Level Emphasis, and Large dependence High Gray Level Emphasis. Small Area Low Gray Level Emphasis, Gray Level Variance, and Zone Entropy are parameters of GLSZM and principally provide information on the uniform area size of each gray level on the 3D image. Cluster Shade, Cluster Prominence, and Imc2 are parameters of GLCM that chiefly evaluate the spatial relationship between pixels and describe the frequency of appearance of specific pixel combinations in the image. Large Dependence Low Gray Level Emphasis and Large Dependence High Gray Level Emphasis are parameters of GLDM that mainly reflect the grayscale relationship between the central pixel and its neighborhood. More specifically, the larger the values of LDLGLE, GLCM, and GLDM, the more uneven the image texture distribution and the more irregular the gray change, indicating a higher spatial heterogeneity of tumor [16, 41] and reflecting its strong aggressiveness and a greater possibility of STAS. This study also noted that the GPTV radiomics model was superior to the GTV and PTV models; the GTV radiomics model was superior to the PTV model, inferring that the occurrence of STAS was predominantly related to the aggressiveness of the tumor itself and that the peritumoral region also partly reflected the aggressive behavior of the tumor. As a quantitative method, radiomics techniques can be used to quantify internal tumor heterogeneity and differences in the peritumoral microenvironment [42].

In this study, a combined model was constructed based on GPTV10 radscore and clinical-radiological features to predict STAS status. The DeLong test results demonstrated that the predictive value of the combined model in the three cohorts was superior to that of the clinical model. Hosmer–Lemeshow test and calibration curve showed that the combined model fitted well in all three cohorts. DCA validated that the combined model had better clinical application value than the clinical model, which can be ascribed to the excellent predictive efficiency of the GPTV10 radiomics model. The evaluation of traditional CT features of tumors mostly depends on the experience of radiologists, which is subjective to a certain extent, and different conclusions can be obtained from different clinical levels [43]. Radiomics can extract massive high-dimensional features from segmented images, including gray level changes and voxel spatial relationships, and then achieve accurate diagnosis and prognosis assessment of diseases through feature selection and model establishment [44]. It can not only avoid the subjectivity of the observer’s interpretation of the CT morphological features, but also deeply excavate and integrate a large number of digital information in the image which cannot be recognized and distinguished by human eyes [44]. At present, only one study has been conducted on the prediction of lung cancer STAS based on deep learning, Tao et al. [45] constructed a three-dimensional (3D) convolutional neural network (CNN) model based on 203 cases of any stage of NSCLC patients with preoperative enhanced thin-slice CT; the results showed that the 3D CNN model yielded superior performance with AUC values of 0.93 and 0.80 in the training and validation cohorts. With high repeatability and reliability, radiomics and deep learning techniques may become a non-invasive precision diagnostic tool reflecting tumor biological behavior for clinical application in the future [44, 45].

This study has several limitations. Firstly, the sample size of this study was relatively small, but through strict case selection, and excluding pGGN that was clearly STAS negative, the cases were more representative; we will continue to collect more cases and construct prediction models based on deep learning technology to explore whether the prediction performance can be further improved. Secondly, the study was retrospective, resulting in inevitable selection bias; we will prospectively and continuously collect large samples of external center cases and take pathological results as the gold standard to further verify the repeatability of the nomogram constructed in this study. Third, due to the lack of detailed follow-up data, whether the nomogram developed in this study can further accurately predict the prognosis of patients, so as to make prognostic risk stratification, we will conduct this study in the future when more follow-up data are collected.

Conclusions

In conclusion, the GPTV10 radiomics model performs better than the GTV, all PTV, and the other three GPTV radiomics models in predicting the STAS status of clinical stage IA NSCLC preoperatively. Additionally, the nomogram based on GPTV10 radiomics features and relevant clinical-radiological predictors can further improve predictive efficiency, which will assist in timely providing guidance and aiding in the development of personalized treatment strategies for early lung cancer.