Abstract
Objectives
To develop and validate a preoperative CT-based nomogram combined with radiomic and clinical–radiological signatures to distinguish preinvasive lesions from pulmonary invasive lesions.
Methods
This was a retrospective, diagnostic study conducted from August 1, 2018, to May 1, 2020, at three centers. Patients with a solitary pulmonary nodule were enrolled at the GDPH center and were randomly divided (7:3) into two groups: development (n = 149) and internal validation (n = 54). The SYSMH center and the ZSLC center formed an external validation cohort of 170 patients. The least absolute shrinkage and selection operator (LASSO) algorithm and logistic regression analysis were used to select features, build the signatures, and construct the models.
Results
The study comprised 373 individuals from three independent centers (female: 225/373, 60.3%; median [IQR] age, 57.0 [48.0–65.0] years). The AUCs for the combined radiomic signature selected from the nodular area and the perinodular area were 0.93, 0.91, and 0.90 in the three cohorts. The nomogram combining the clinical and combined radiomic signatures could accurately predict interstitial invasion in patients with a solitary pulmonary nodule (AUC, 0.94, 0.90, and 0.92 in the three cohorts, respectively). The radiomic nomogram outperformed any clinical or radiomic signature in terms of clinical predictive abilities, according to a decision curve analysis and the Akaike information criterion.
Conclusions
This study demonstrated that a nomogram constructed from the identified clinical–radiological signatures and combined radiomic signatures has the potential to precisely predict pathological invasiveness.
Key Points
• The radiomic signature from the perinodular area has the potential to predict pathology invasiveness of the solitary pulmonary nodule.
• The new radiomic nomogram was useful in clinical decision-making associated with personalized surgical intervention and therapeutic regimen selection in patients with early-stage non-small-cell lung cancer.
Introduction
Low-dose computed tomography (LDCT) screening has been shown to reduce lung cancer mortality in a high-risk group [1]. In lung cancer screening trials, the nodule prevalence is 33% on average, while only 1.4% of the detected nodules are diagnosed as lung cancer. How to select patients with malignant pulmonary nodules for timely intervention has become a major challenge.
Numerous follow-up protocols are used to manage the pulmonary nodules detected by CT screening. For indeterminate nodules, the Fleischner Society guidelines [2] and the Lung CT Screening Reporting and Data System (Lung-RADS) prescribe a CT screening after a particular time interval based on nodule size. However, the British Thoracic Society (BTS) guidelines [3] reduce the need for follow-up imaging for patients with nodules of < 5 mm diameter or < 80 mm³ and shorten the follow-up period to 1 year for solid pulmonary nodules. Moreover, the awareness of recommendations and the management choices in clinical practice are heterogeneous between radiologists and pulmonologists [4].
Differentiating pathology types between a pulmonary precancerous lesion (i.e., atypical adenomatous hyperplasia and adenocarcinoma in situ (AAH/AIS)) and early-stage invasive adenocarcinoma (IAC) leads to vastly divergent prognoses after standard thoracic surgery [5,6,7]. Minimally invasive adenocarcinoma (MIA) is a small, solitary adenocarcinoma with mostly lepidic growth and an invasion smaller than 5 mm at its largest dimension at any one point, whereas invasive adenocarcinoma involves growths larger than 5 mm [5]. It is challenging to identify the pathological nature of suspected malignant pulmonary nodules through visual assessment with CT scans because of the considerable overlap in morphologic features between them, such as pleural tags, spiculation, and lobulation [8].
Models including machine learning and artificial neural networks have been applied in lung cancer diagnosis [9,10,11,12], and excellent identification efficiency and accuracy have been achieved according to internal data. However, these tools suffer from limited external validity, overfitting, and unexplainable results [13]. Radiomics provides a noninvasive approach and is more promising in its sensitivity, selectivity, and experimental feasibility for disease diagnosis, tumor staging, and patient prognosis [14,15,16,17].
Tumor-infiltrating lymphocytes and tumor-associated macrophages were observed to be distributed at the edge of the invasion lesions (ILs) in the pathological map [18] and be associated with the likelihood of metastasis [19]. The perinodular parenchyma may be considered to represent the tumor microenvironment and has biological importance in defining tumor behavior, including cell migration, stromal inflammation, immune infiltration, and vascularization [20, 21]. We assumed that radiomic signatures from perinodular areas might provide a preoperative reference for the accurate prediction of pathological invasiveness in solitary pulmonary nodules and for guiding surgical methods and the extent of resection.
Herein, we developed and validated a nomogram based on clinical–radiological and radiomic signatures from nodular and perinodular areas for preoperative prediction of pathological invasiveness in patients with a solitary pulmonary nodule using data from a multicenter study.
Methods
Study design and patients
In this multicenter, retrospective, diagnostic study, patients with a solitary pulmonary nodule were recruited from three independent centers (Guangdong Provincial People’s Hospital, Guangdong Province, China, named as the GDPH center; Sun Yat-sen Memorial Hospital of Sun Yat-sen University, Guangdong Province, China, named as the SYSMH center; Zhoushan Lung Cancer Institution, Zhejiang Province, China, named as the ZSLC center) during the period of August 1, 2018, to May 1, 2020. Information about the three institutions that participated in this study is shown in eTable 1.
The inclusion and exclusion criteria were applied at the three centers, and 373 patients from the 571 recruited patients were finally enrolled after the application of the exclusion criteria. The patients (N = 203) enrolled at the GDPH center from March 1, 2015, to December 31, 2019, were divided into two cohorts: The development cohort comprised 149 patients (73.4%) randomly selected by a computer algorithm in a ratio of 7:3, and the validation cohort comprised 54 patients (26.6%). The SYSMH center (N = 63) and the ZSLC center (N = 107) cooperatively formed an external validation cohort of 170 patients from December 18, 2012, to July 30, 2019, and January 1, 2019, to December 30, 2019. Figure 1 presents the exclusion criteria and the patient recruitment process.
The following were the criteria for inclusion: (I) patients ≥ 18 years of age who underwent CT screening and were diagnosed with SPN for the first time, (II) patients who underwent preoperative enhanced chest CT scans (within 3 months), (III) pathologically confirmed precancerous lesions (AAH/AIS) or early-stage lung adenocarcinoma (MIA/IAC), and (IV) lesions smaller than 30 mm without distant metastases, or lymph node involvement.
The exclusion criteria were (I) preoperative therapy (neoadjuvant chemotherapy or radiotherapy), (II) a history of previous lung tumor diseases or (III) past/present history of other malignant tumors, and (IV) incomplete clinical information or unavailable standard enhanced chest imaging data.
Because of the retrospective nature of this study, the institutional review board waived informed patient consent. The study protocol was approved by academic ethics committees and conducted according to the Declaration of Helsinki and Good Clinical Practice guidelines and was registered with ClinicalTrials.gov (registration number NCT04452058).
Image review and feature extraction
The Picture Archiving and Communication System (PACS) was used to retrieve preoperative CT images from three centers, and all researchers assessed the initial screening of image data. The CT protocol is described in detail in eTable 2.
The regions of interest (ROIs) in the pulmonary nodular area (ROI-1) were all manually refined by one researcher (W.H.L.) slice by slice in three orthogonal planes (axial, coronal, and sagittal) under the guidance of two senior radiologists with 13 years (S.Y.W.) and 17 years (G.Y.W.) of experience in chest CT interpretation. Other irrelevant components, such as air, peripheral vessels, normal tissue, ribs, pleura, and surrounding organs, were removed by the researchers to avoid interference. The 3D Slicer program (https://www.slicer.org/, version 4.10.2) [22] was used to semi-automatically segment the perinodular area (ROI-2, comprising the perinodular parenchyma within a 5-mm outward extension). Disagreements were resolved by discussion among senior researchers, including two radiologists and three thoracic surgeons (Q.G.B., Z.H.Y., and Z.D.K.).
All assessors were blinded to the final pathological diagnoses, which were reviewed by a senior pathologist (S.Y.) using the 2017 8th TNM staging system and the 2011 International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society (IASLC/ATS/ERS) classification for pathological staging and pathological grading after thoracic surgery, respectively [5, 23].
After the ROI-1 and ROI-2 were segmented and reconstructed, the volume of interest (VOI-1 and VOI-2) images (DICOM format) were transferred to the SlicerRadiomics code using an in-house texture extraction platform based on the Python package PyRadiomics.
In all, 1722 quantitative radiomic features were extracted from the two segmented regions (VOI-1 and VOI-2), including first-order statistics, shape, gray-level co-occurrence matrix (GLCM), gray-level size zone matrix (GLSZM), gray-level dependence matrix (GLDM), and neighborhood gray-tone difference matrix (NGTDM) features. These features were used for further analysis and regression modeling. The same image segmentation and feature extraction process was repeated on 30 SPNs in the cohorts after 3 months. More information about the standard radiomic workflow and model construction is shown in Fig. 2.
Development of the radiomic signatures
Features were selected from the high-dimensional imaging data of the two VOIs using the LASSO algorithm (eFigure 1 in the Supplementary materials). The most predictive features were linearly combined to create two radiomic signatures (RS1 for VOI-1 and RS2 for VOI-2).
The final radiomic signature was obtained by combining the two radiomic signatures using logistic regression. Tenfold cross-validation was implemented to avoid overfitting. Based on the combined radiomic signature (RS-C), the radiomic score was calculated and presented in the development and two validation cohorts.
Development of the clinical–radiological signature and nomogram
Baseline clinical data were obtained from medical records. The researchers also recorded several radiological feature descriptors of each pulmonary nodule, such as the size, number, location, border, and internal characteristics (e.g., density and consolidation tumor ratio (CTR)), and any disagreement was resolved through consultation. The densities of pulmonary nodules were described using terminology derived from the BTS guidelines [3].
After the analysis, significant risk factors were used to build a clinical–radiological signature. This signature was combined with the final radiomic signature (RS-C) to form a nomogram using logistic regression.
Statistical analysis
Normalization was performed on radiomic features using a z-score transformation. To investigate differences in categorical variables, the chi-square test was used. The differences in continuous variables between PILs and early-stage pulmonary interstitium ILs were compared using a two-sample t test.
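The z-score transformation mentioned above can be sketched in a few lines of Python. This is an illustration only, not the authors' R code: the function names and toy values are hypothetical, and estimating the mean and standard deviation on the development cohort before applying them to the validation cohorts is an assumption, since the text does not state how the normalization parameters were obtained.

```python
from statistics import mean, stdev

def zscore_fit(values):
    """Estimate the mean and sample SD of one radiomic feature."""
    return mean(values), stdev(values)

def zscore_apply(values, mu, sd):
    """Standardize values with previously estimated parameters."""
    return [(v - mu) / sd for v in values]

# Toy feature column (hypothetical numbers, not study data).
development = [2.0, 4.0, 6.0, 8.0]
mu, sd = zscore_fit(development)

development_z = zscore_apply(development, mu, sd)
# Assumption: validation data reuse the development-cohort parameters.
validation_z = zscore_apply([5.0, 11.0], mu, sd)
```

Reusing the development-cohort parameters keeps the validation data out of the fitting step, which is one common way to avoid information leakage.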
Univariate logistic regression analysis was used to select the independent clinical and radiological prognostic factors in the internal cohort. Significant risk factors were then introduced into stepwise logistic regression analyses to build a clinical–radiological signature. To visualize the results of the multivariable logistic regression analysis for risk stratification of pathological invasiveness, a nomogram based on both the clinical–radiological signature and the combined radiomic signature was created. Intrarater agreement in radiomic features between the two rounds of ROI segmentation was assessed using the two-way random ICC model.
To evaluate the performance of the models, a receiver operating characteristic (ROC) analysis was performed, and the accuracy, sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV) were calculated. The DeLong test was used to compare the area under the ROC curve (AUC) of the nomogram with those of the other models in the development cohort.
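The performance measures listed above can be illustrated with a minimal Python sketch. The function names, the 0.5 cut-off, and the toy data are assumptions made for illustration. The AUC is computed through its rank interpretation (the probability that a randomly chosen invasive case scores higher than a randomly chosen preinvasive case), and the DeLong comparison is not reproduced here.

```python
def auc(y_true, y_score):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def confusion_metrics(y_true, y_score, threshold=0.5):
    """Accuracy, sensitivity, specificity, PPV, and NPV at a given cut-off."""
    tp = fp = tn = fn = 0
    for y, s in zip(y_true, y_score):
        predicted = 1 if s >= threshold else 0
        if predicted == 1 and y == 1:
            tp += 1
        elif predicted == 1 and y == 0:
            fp += 1
        elif predicted == 0 and y == 0:
            tn += 1
        else:
            fn += 1

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }

# Toy labels (1 = invasive lesion) and predicted risks; hypothetical values.
labels = [1, 1, 1, 0, 0, 0]
risks = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2]
toy_auc = auc(labels, risks)
toy_metrics = confusion_metrics(labels, risks)
```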
The Akaike information criterion (AIC) [24] was used to compare and rank multiple competing models and emphasize the comparison of the goodness of fit of the competing models while considering the principle of parsimony. We chose the model with the lowest AIC value (representing the “best-approximating model”) in this study.
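For a fitted logistic model, the AIC is 2k − 2 ln(L), where k is the number of estimated parameters and L is the maximized likelihood. The following Python sketch shows the calculation under the assumption that predicted probabilities are already available; the function names and toy numbers are hypothetical and do not reproduce the study's models.

```python
import math

def log_likelihood(y_true, y_prob):
    """Bernoulli log-likelihood of binary outcomes given predicted probabilities."""
    return sum(
        math.log(p) if y == 1 else math.log(1.0 - p)
        for y, p in zip(y_true, y_prob)
    )

def aic(y_true, y_prob, n_params):
    """AIC = 2k - 2 ln(L); lower values indicate a better fit/parsimony trade-off."""
    return 2 * n_params - 2 * log_likelihood(y_true, y_prob)

# Toy comparison of two hypothetical models on the same four cases.
outcomes = [1, 1, 0, 0]
model_a = aic(outcomes, [0.9, 0.8, 0.2, 0.1], n_params=3)   # sharper predictions, more parameters
model_b = aic(outcomes, [0.6, 0.6, 0.4, 0.4], n_params=2)   # weaker predictions, fewer parameters
preferred = "A" if model_a < model_b else "B"
```

The penalty term 2k is what lets a model with fewer parameters win when the gain in likelihood from extra parameters is small.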
The utility and clinical value of models can be evaluated using decision curve analysis (DCA) [25], which determines the net benefit for patients at each threshold probability. The calibration of the nomogram was assessed using the Hosmer–Lemeshow test and calibration curves in the three cohorts. Two-sided p values < 0.05 indicated statistical significance. The glmnet package was used, and statistical analysis was performed using R software (version 3.6.2; http://www.Rproject.org).
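The net benefit plotted in a decision curve is commonly defined as TP/n − (FP/n) × pt/(1 − pt), where pt is the threshold probability. A minimal Python sketch of that definition follows; the function names and toy data are hypothetical, and the code illustrates the general DCA formula rather than the R implementation used in the study.

```python
def net_benefit(y_true, y_prob, pt):
    """Net benefit of treating patients whose predicted risk is at least pt."""
    if not 0.0 < pt < 1.0:
        raise ValueError("threshold probability must lie strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= pt and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= pt and y == 0)
    return tp / n - (fp / n) * pt / (1.0 - pt)

def net_benefit_treat_all(y_true, pt):
    """Reference strategy: treat every patient regardless of predicted risk."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * pt / (1.0 - pt)

# Toy labels (1 = invasive lesion) and predicted risks; hypothetical values.
labels = [1, 1, 1, 0, 0, 0]
risks = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2]

thresholds = [0.2, 0.5, 0.8]
curve = [(pt, net_benefit(labels, risks, pt)) for pt in thresholds]
```

A model is considered useful at a given threshold when its net benefit exceeds both the treat-all strategy and the treat-none strategy (net benefit of zero).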
Results
Participants
The imaging of 373 preoperative patients with a solitary pulmonary nodule was collected from three independent institutions in China. The development cohort included 149 patients from the GDPH center (female, 61.7%; median [IQR] age, 59.0 [49.0 to 66.0] years), and the internal validation cohort included 54 patients (female, 57.4%; median [IQR] age, 54.7 [46.0 to 63.8] years). The external validation cohort from the SYSMH and ZSLC centers included 170 patients (female, 60.0%; median [IQR] age, 57 [48.3 to 65.0] years).
PILs (AAH/AIS) were diagnosed in 35.6% (53/149) of the patients in the development cohort, 38.9% (21/54) in the internal validation cohort, and 37.6% (64/170) in the external validation cohort. Table 1 shows the baseline characteristics of the patients in the development and two validation cohorts.
Validation of the radiomic signatures
In total, 1722 radiomic features were extracted from the two VOIs, and the LASSO algorithm selected four features for VOI-1 (RS1) and eight features for VOI-2 (RS2). Moreover, RS1 and RS2 were combined into a final radiomic signature (RS-C) using logistic regression, and the radiomic score calculation formula is presented in eTable 3 in the Supplementary materials.
The radiomic score for each patient was significantly different between the PIL group (AAH/AIS) and IL group (MIA/IAC) in three cohorts (p < 0.001 for eFigure 2A; p = 0.003 for eFigure 2B; p < 0.001 for eFigure 2C, eTable 4 in the Supplementary materials). The mean value of the radiomic score for patients in the IL group (MIA/IAC) was significantly higher in both the development and two validation cohorts (7.28, 7.45, and 8.34, respectively) compared with the patients in the PIL group (AAH/AIS) (− 1.24, − 1.06, and − 2.59, respectively).
The AUC for the RS1 was 0.83 (95% CI, 0.76 to 0.89) in the development cohort, 0.85 (95% CI, 0.71 to 0.93) in the internal validation cohort, and 0.88 (95% CI, 0.83 to 0.93) in the external validation cohort (eFigure 3A in the Supplementary materials). In the development cohort, the AUC for the RS2 was 0.92 (95% CI, 0.86 to 0.95), and in the internal and external validation cohorts, it was 0.89 (95% CI, 0.80 to 0.98) and 0.89 (95% CI, 0.84 to 0.94), respectively (eFigure 3B in the Supplementary materials).
Among all radiomic-related signatures in the development and two validation cohorts, the RS-C had the greatest AUCs of 0.93 (95% CI, 0.89 to 0.97), 0.91 (95% CI, 0.83 to 0.98), and 0.90 (95% CI, 0.85 to 0.94) (eFigure 3C in the Supplementary materials). The two-way random ICC model was applied to measure the reliability of the radiomic features between the two rounds of image segmentation and feature extraction. The agreement levels are defined by ICC values: excellent (ICC ≥ 0.81), good (0.61 < ICC < 0.8), moderate (0.41 < ICC < 0.60), and poor (ICC ≤ 0.40). eTable 5 summarizes the results of the intrarater agreement analysis. Radiomic features in the RS1 showed excellent intrarater reliability (ICC = 0.92 to 0.99) between the two rounds, and the ICC values for the radiomic features in the RS2 ranged from good (ICC = 0.74; 95% CI, 0.521 to 0.872) to excellent (ICC = 0.98; 95% CI, 0.949 to 0.989).
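A two-way random-effects ICC can be computed from the mean squares of a two-way ANOVA. The Python sketch below implements the single-measurement, absolute-agreement form (often written ICC(2,1)); the choice of this particular form is an assumption, because the text names only the "two-way random ICC model", and the function name and toy values are hypothetical.

```python
def icc_two_way_random(rows):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    rows: one list per nodule, holding the feature value from each segmentation round.
    """
    n = len(rows)          # number of nodules
    k = len(rows[0])       # number of segmentation rounds
    grand = sum(sum(r) for r in rows) / (n * k)
    row_means = [sum(r) / k for r in rows]
    col_means = [sum(r[j] for r in rows) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for r in rows for x in r)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )

# Toy example: one feature measured on five nodules in two rounds (hypothetical values).
feature_rounds = [[1.0, 1.1], [2.0, 1.9], [3.0, 3.2], [4.0, 3.9], [5.0, 5.1]]
toy_icc = icc_two_way_random(feature_rounds)
```

In a repeatability analysis such as the one described above, this function would be applied once per radiomic feature, with the two columns holding the values from the first and second segmentation.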
Validation of the clinical–radiological signature
Clinical–radiological characteristics, including density (part-solid nodule (PSN)/solid nodule), pleural retraction, irregular shape, lobulated borders, CTR ≥ 0.5, and blurred margins, were significantly associated with pathology invasiveness after the univariate analysis (p < 0.05; eTable 6 in the Supplementary materials), and four of these characteristics (PSN/solid nodule, irregular shape, pleural retraction, and blurred margins) were selected using the stepwise logistic regression model to form the clinical–radiological signature (eTable 3 in the Supplementary materials).
Based on the ROC analysis, the AUCs for the clinical–radiological signature were 0.79 (95% CI, 0.72 to 0.89), 0.79 (95% CI, 0.67 to 0.92), and 0.88 (95% CI, 0.83 to 0.93) (eFigure 3D in the Supplementary materials) in the development, internal, and external validation cohorts, respectively.
Validation, calibration, and discrimination of the nomogram
To develop a clinically applicable approach that could predict pathological invasiveness in patients with a solitary pulmonary nodule, we constructed a radiomic nomogram that considers the clinical–radiological and radiomic signatures (Fig. 3). The multivariate logistic regression analysis showed that the clinical–radiological signature (odds ratio (OR) = 1.60; 95% CI, 1.03 to 2.59; p = 0.04) and the radiomic signature (odds ratio (OR) = 2.43; 95% CI, 1.76 to 3.66; p < 0.001) represented independent predictors in the nomogram (eTable 7 in the Supplementary materials).
As shown in the nomogram (Fig. 3), when compared to the clinical–radiological signature, the radiomic signature accounted for the most significant proportion, making it the cardinal biomarker for distinguishing PILs from early-stage ILs. Based on the obtained features, the clinical–radiological signature and combined radiomic signature could be calculated using the formula. The value assigned to each signature was scored on a point scale from 0 to 10. By adding the scores for each signature, one can obtain a total score. The risk of this solitary pulmonary nodule having pulmonary interstitial invasion can be predicted by projecting the score to the bottom risk axis.
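Reading the nomogram is equivalent to evaluating the underlying logistic model. As a rough illustration in Python, the coefficients below are taken as the natural logarithms of the odds ratios reported above (1.60 for the clinical–radiological signature and 2.43 for the combined radiomic signature). The intercept is not given in this section, so the value used here is a hypothetical placeholder, and the resulting probabilities are not the study's predictions.

```python
import math

# Coefficients derived from the reported odds ratios (beta = ln(OR)).
BETA_CLINICAL_RADIOLOGICAL = math.log(1.60)
BETA_RADIOMIC = math.log(2.43)
# Hypothetical placeholder: the fitted intercept is not reported in this section.
INTERCEPT = 0.0

def predicted_risk(cr_signature, radiomic_signature, intercept=INTERCEPT):
    """Logistic-model probability of invasiveness for given signature values."""
    linear_predictor = (
        intercept
        + BETA_CLINICAL_RADIOLOGICAL * cr_signature
        + BETA_RADIOMIC * radiomic_signature
    )
    return 1.0 / (1.0 + math.exp(-linear_predictor))

# With the placeholder intercept, a one-unit increase in the radiomic signature
# multiplies the odds by 2.43, moving the risk from 0.5 to 2.43 / 3.43.
baseline = predicted_risk(0.0, 0.0)
one_unit_radiomic = predicted_risk(0.0, 1.0)
```

The larger coefficient of the radiomic signature is what gives it the longer axis on the nomogram relative to the clinical–radiological signature.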
The nomogram formed by the clinical–radiological and combined radiomic signatures performed better than any isolated signature. The nomogram achieved an excellent predictive value, with an AUC of 0.94 (95% CI, 0.90 to 0.97) in the development cohort, which represented better discriminatory performance than the radiomic signatures and clinical–radiological signature (Fig. 4b). Similar findings of model comparisons were also observed in the two validation cohorts (Fig. 4c, d). In the two validation cohorts, the nomogram also yielded high AUCs of 0.90 (95% CI, 0.81 to 0.98) and 0.92 (95% CI, 0.88 to 0.92) (Fig. 4a). Moreover, the accuracy, specificity, and PPV of the nomogram were higher than 80.0% in the three cohorts (Table 2).
In order to prove that the nomogram model also has a good discriminatory performance among the SPNs with different densities (pure ground-glass nodule (pGGN), PSN, solid), we conducted a subgroup analysis in this research. In the subgroup of pGGN, the nomogram showed high AUCs of 0.87 (95% CI, 0.79 to 0.95), 0.92 (95% CI, 0.84 to 0.99), and 0.89 (95% CI, 0.83 to 0.95) (eFigure 4 in the Supplementary materials) in the internal, external, and total cohorts, respectively. eTable 8 shows the performance evaluation of the nomogram in subgroup analysis. Subgroup analyses also detected high discrimination of the nomogram in the PSN subgroups (AUC = 0.93, 0.81, and 0.89) and the solid group (AUC = 0.96, 0.93, and 0.94) in cohorts.
The DeLong test was used to compare the AUCs of the five models in the development cohort. The differences were statistically significant between the nomogram and RS1 and between the nomogram and the clinical–radiological signature, with p = 0.005 and p < 0.001, respectively (eTable 9 in the Supplementary materials). The clinical–radiological signature had the highest AIC value (159.36) among all prediction models in eTable 9. The AIC of the nomogram (121.68) was similar to that of the radiomic signature (121.0) and RS2 (117.62), and all three were lower than the AIC of RS1 (149.83). Based on the overall consideration of the AIC and ROC curves, the nomogram model proved to have excellent goodness of fit and parsimony.
Moreover, the nomogram calibration curves in the three cohorts indicated a good agreement between the nomogram prediction and actual observation. The Hosmer–Lemeshow test revealed that the nomogram was well fitting, with a nonsignificant difference (p > 0.05) (eFigure 5 in the Supplementary materials).
DCAs (Fig. 5) were used to assess the utility of the three predictive models by calculating the net benefit at various probability thresholds. According to the decision curves, the radiomic signature showed more benefit than the clinical–radiological signature in predicting the risk of the interstitial invasion when the probability threshold in the clinical decision of a patient or physician was above 0.2 in the development cohort. The nomogram line achieved the highest clinical net benefit across the entire range of threshold probabilities in three cohorts, which indicated that the nomogram was a reliable clinical tool to predict pathology invasiveness.
Discussion
In this multicenter study, we built and validated a radiomic nomogram to distinguish PILs from early-stage pulmonary interstitium ILs preoperatively. The nomogram incorporated radiomic signatures selected from the nodular and perinodular areas and the clinical–radiological signature and performed well in the development and validation cohorts. The low AIC in the nomogram demonstrated the good quality of this available tool. The DCAs indicated that the nomogram is a reliable clinical treatment decision support tool to predict pulmonary interstitial invasion for patients with a solitary pulmonary nodule.
This research describes some important radiological characteristics that contribute to the differential diagnosis of SPN. Nodules with part-solid/solid density, pleural retraction, irregular shape, and blurred margins had a higher risk for malignancy, consistent with the radiologists’ experience. Previous studies [26, 27] have used nodule size and CTR to distinguish PILs from ILs. However, they were not included in the final predictive nomogram in our study.
We found considerable reliability of the radiomic features in the repeatability study. Overall, more than 83% (10/12) of the radiomic features in the RS-C achieved excellent intrarater reliability (ICC ≥ 0.81). To a certain extent, this reflects the stability and generalizability of the nomogram, which is mainly composed of the combined radiomic signature. First-order statistics describe, through commonly used and simple metrics, the distribution of voxel intensities inside the image region defined by the mask. GLCM is a statistical texture analysis method that evaluates the spatial relationship between pixels and determines how frequently a particular combination of pixels appears in an image. The radiomic features, including the sum average in the GLCM category and uniformity in the first-order category, were also reported to differentiate invasive pulmonary adenocarcinoma from PILs in a previous study [28]. The GLSZM provides information on the size of homogeneous zones for each gray level in three dimensions. Nearly half (5/12) of the features in the combined radiomic signature belonged to the GLCM and GLSZM categories, which are stable under changes in the ROIs [29]. Although the difference was not statistically significant according to the DeLong test, RS2 distinguished PILs from early-stage pulmonary interstitium ILs and performed numerically better than RS1 in all cohorts. Moreover, the lower AIC of RS2 indicates that its model quality is better than that of RS1. These findings may be because features in the focal area are less stable than perinodular ones [29].
We constructed the first radiomic model combined with a 5-mm perinodular radiomic signature for the early diagnosis of pathological invasiveness. The predictive reproducibility of the models was evaluated in this multicentric study. Previous studies classified MIA as a benign or preinvasive pulmonary nodule because, like AIS, it has a good prognosis after surgical treatment [5]. Unlike the studies conducted by She et al [30] and Xu et al [31], we prefer to regard MIA as an early-stage pulmonary malignant lesion because the difference between AIS and MIA lies in the microenvironment [30], especially in the expression level of laminin-5 [32,33,34] and the frequency of tumor protein p53 gene (TP53) mutations [35,36,37].
It is difficult for radiologists or thoracic surgeons to differentiate PILs from ILs in pGGNs and PSNs. On the one hand, the surgical treatment strategies for patients with high-risk pulmonary nodules remain a massive challenge as the histopathologic definition is difficult to make before an operation. On the other hand, the entire histologic sampling of the tumor is required to diagnose the AIS or MIA, which may prolong the procedure time and lead to inappropriate surgical decision-making. In our study, the subgroup analysis shows good discrimination of the nomogram in the pGGNs and PSNs. A nomogram may determine whether surgical treatment or conservative surveillance is required and recommend management strategies for patients with nodules diagnosed as ILs.
The relevance of biological importance to the distribution of immune cells in the perinodular area has already been demonstrated [18]. The combination of intratumoral and peritumoral features proved useful in predicting the complete pathological response and lymph node metastasis, and identifying molecular subtypes [38,39,40]. Several studies demonstrated the added value of using radiomic features from perinodular parenchyma to differentiate nodules in terms of potential malignancy, and the definition of perinodular area differs between studies [41, 42]. Beig et al conducted a study [19] that used 30-mm perinodular radiomic features to distinguish IAC from benign granulomas, and the most predictive features were within a 5-mm perinodular area. The same distance of the perinodular area was also used in the study conducted by Wu et al [43]. As Wu et al indicated, adding perinodular features did not improve the radiomic model performance. However, the radiomic model achieved a better predictive value in our study after combining with the radiomic signature selected from the perinodular area.
In this study, pathological findings were used as the gold standard rather than the consensus malignancy rating of each nodule used in other studies [44]. This retrospective study was restricted to only suspected malignant nodules (PILs and ILs) and excluded some benign pulmonary lesions (tuberculosis or granulomas) to simulate the most likely clinical situation. Additionally, to improve the reliability of the results and enhance the homogeneity of the population, this multicenter study focused on patients with a solitary pulmonary nodule and excluded multiple pulmonary nodules.
Limitations of the study included its retrospective nature and the variation in the research period among the multiple centers, which prevented some clinical factors from being obtained, and a certain bias and heterogeneity may have existed in the study. Second, the CT acquisition protocol (i.e., image thickness) was not unified among all patients in the three centers. A standard process on radiomic features was performed to alleviate this problem, and the nomogram finally performed well in the external validation group. This finding indicated that the nomogram has good universality and is worthy of clinical application. Third, owing to the limitation of data, we did not have transcriptomics and mutation-sequencing data. Therefore, we could not further explain the relevant mechanism between radiomics and the tumor microenvironment. In the future, prospective, high-quality research with a larger population is still required to verify our results further.
Conclusion
The perinodular radiomic signature improved the distinction between pulmonary interstitium ILs and PILs when combined with the nodular radiomic signature. This study demonstrated that a nomogram constructed by identifying the clinical–radiological signature and the combined radiomic signature has the potential to be an easy-to-use, non-invasive preoperative biomarker to precisely predict pathological invasiveness and add diagnostic value to clinical decisions for optimal intervention benefit.
Abbreviations
- AIC: Akaike information criterion
- AUC: Area under the receiver operating characteristics curve
- CI: Confidence interval
- C-R: Clinical–radiological
- DCA: Decision curve analysis
- GLCM: Gray-level co-occurrence matrix
- GLDM: Gray-level dependence matrix
- GLSZM: Gray-level size zone matrix
- ILs: Invasion lesions
- PILs: Pre-invasive lesions
- RS1: Radiomic signature selected from the nodular area
- RS2: Radiomic signature selected from the perinodular area
- RS-C: Combined radiomic signature selected from the nodular area and perinodular area
References
The National Lung Screening Trial Research Team (2011) Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med 365(5):395–409. https://doi.org/10.1056/NEJMoa1102873
MacMahon H, Naidich DP, Goo JM et al (2017) Guidelines for management of incidental pulmonary nodules detected on CT images: from the Fleischner Society 2017. Radiology 284(1):228–243. https://doi.org/10.1148/radiol.2017161659
Callister MEJ, Baldwin DR, Akram AR et al (2015) British Thoracic Society guidelines for the investigation and management of pulmonary nodules: accredited by NICE. Thorax 70(Suppl 2):ii1–ii54. https://doi.org/10.1136/thoraxjnl-2015-207168
Mets OM, de Jong PA, Chung K, Lammers J-WJ, van Ginneken B, Schaefer-Prokop CM (2016) Fleischner recommendations for the management of subsolid pulmonary nodules: high awareness but limited conformance – a survey study. Eur Radiol 26(11):3840–3849. https://doi.org/10.1007/s00330-016-4249-y
Travis WD, Brambilla E, Noguchi M et al (2011) International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society: International Multidisciplinary Classification of Lung Adenocarcinoma: executive summary. Proc Am Thorac Soc 8(5):381–385. https://doi.org/10.1513/pats.201107-042ST
Borczuk AC, Qian F, Kazeros A et al (2009) Invasive size is an independent predictor of survival in pulmonary adenocarcinoma. Am J Surg Pathol 33(3):462–469. https://doi.org/10.1097/PAS.0b013e318190157c
Zhang J, Wu J, Tan Q, Zhu L, Gao W (2013) Why do pathological stage IA lung adenocarcinomas vary from prognosis?: a clinicopathologic study of 176 patients with pathological stage IA lung adenocarcinoma based on the IASLC/ATS/ERS classification. J Thorac Oncol 8(9):1196–1202. https://doi.org/10.1097/JTO.0b013e31829f09a7
Ost D, Fein A (2000) Evaluation and management of the solitary pulmonary nodule. Am J Respir Crit Care Med 162(3 Pt 1):782–787. https://doi.org/10.1164/ajrccm.162.3.9812152
Causey JL, Zhang J, Ma S et al (2018) Highly accurate model for prediction of lung nodule malignancy with CT scans. Sci Rep 8(1):9286. https://doi.org/10.1038/s41598-018-27569-w
Chae H-D, Park CM, Park SJ, Lee SM, Kim KG, Goo JM (2014) Computerized texture analysis of persistent part-solid ground-glass nodules: differentiation of preinvasive lesions from invasive pulmonary adenocarcinomas. Radiology 273(1):285–293. https://doi.org/10.1148/radiol.14132187
Bi WL, Hosny A, Schabath MB et al (2019) Artificial intelligence in cancer imaging: clinical challenges and applications. CA Cancer J Clin 69(2):127–157. https://doi.org/10.3322/caac.21552
Feng B, Chen X, Chen Y et al (2020) Solitary solid pulmonary nodules: a CT-based deep learning nomogram helps differentiate tuberculosis granulomas from lung adenocarcinomas. Eur Radiol 30(12):6497–6507. https://doi.org/10.1007/s00330-020-07024-z
Chen S, Qin J, Ji X et al (2017) Automatic scoring of multiple semantic attributes with multi-task feature leverage: a study on pulmonary nodules in CT images. IEEE Trans Med Imaging 36(3):802–814. https://doi.org/10.1109/TMI.2016.2629462
Lambin P, Rios-Velazquez E, Leijenaar R et al (2012) Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer 48(4):441–446. https://doi.org/10.1016/j.ejca.2011.11.036
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278(2):563–577. https://doi.org/10.1148/radiol.2015151169
Yang L, Yang J, Zhou X et al (2019) Development of a radiomics nomogram based on the 2D and 3D CT features to predict the survival of non-small cell lung cancer patients. Eur Radiol 29(5):2196–2206. https://doi.org/10.1007/s00330-018-5770-y
Song SH, Ahn JH, Lee HY et al (2016) Prognostic impact of nomogram based on whole tumour size, tumour disappearance ratio on CT and SUVmax on PET in lung adenocarcinoma. Eur Radiol 26(6):1538–1546. https://doi.org/10.1007/s00330-015-4029-0
Beig N, Khorrami M, Alilou M et al (2019) Perinodular and intranodular radiomic features on lung CT images distinguish adenocarcinomas from granulomas. Radiology 290(3):783–792. https://doi.org/10.1148/radiol.2018180910
Banat G-A, Tretyn A, Pullamsetti SS et al (2015) Immune and inflammatory cell composition of human lung cancer stroma. PLoS One 10(9):e0139073. https://doi.org/10.1371/journal.pone.0139073
Nishino M (2019) Perinodular radiomic features to assess nodule microenvironment: does it help to distinguish malignant versus benign lung nodules? Radiology 290(3):793–795. https://doi.org/10.1148/radiol.2018182619
Christiansen A, Detmar M (2011) Lymphangiogenesis and cancer. Genes Cancer 2(12):1146–1158. https://doi.org/10.1177/1947601911423028
van Griethuysen JJM, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77(21):e104–e107. https://doi.org/10.1158/0008-5472.CAN-17-0339
Goldstraw P, Chansky K, Crowley J et al (2016) The IASLC Lung Cancer Staging Project: proposals for revision of the TNM stage groupings in the forthcoming (eighth) edition of the TNM classification for lung cancer. J Thorac Oncol 11(1):39–51. https://doi.org/10.1016/j.jtho.2015.09.009
Akaike H (1998) Information theory and an extension of the maximum likelihood principle. In: Parzen E, Tanabe K, Kitagawa G (eds) Selected papers of Hirotugu Akaike. Springer Series in Statistics (Perspectives in Statistics). Springer, New York, NY. https://doi.org/10.1007/978-1-4612-1694-0_15
Vickers AJ, Elkin EB (2006) Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making 26(6):565–574. https://doi.org/10.1177/0272989X06295361
Luo T, Xu K, Zhang Z et al (2019) Radiomic features from computed tomography to differentiate invasive pulmonary adenocarcinomas from non-invasive pulmonary adenocarcinomas appearing as part-solid ground-glass nodules. Chin J Cancer Res 31(2):329–338. https://doi.org/10.21147/j.issn.1000-9604.2019.02.07
Lee SM, Park CM, Goo JM, Lee H-J, Wi JY, Kang CH (2013) Invasive pulmonary adenocarcinomas versus preinvasive lesions appearing as ground-glass nodules: differentiation by using CT features. Radiology 268(1):265–273. https://doi.org/10.1148/radiol.13120949
Li W, Wang X, Zhang Y et al (2018) Radiomic analysis of pulmonary ground-glass opacity nodules for distinction of preinvasive lesions, invasive pulmonary adenocarcinoma and minimally invasive adenocarcinoma based on quantitative texture analysis of CT. Chin J Cancer Res 30(4):415–424. https://doi.org/10.21147/j.issn.1000-9604.2018.04.04
Tunali I, Hall LO, Napel S et al (2019) Stability and reproducibility of computed tomography radiomic features extracted from peritumoral regions of lung cancer lesions. Med Phys 46(11):5075–5085. https://doi.org/10.1002/mp.13808
She Y, Zhang L, Zhu H et al (2018) The predictive value of CT-based radiomics in differentiating indolent from invasive lung adenocarcinoma in patients with pulmonary nodules. Eur Radiol 28(12):5121–5128. https://doi.org/10.1007/s00330-018-5509-9
Wu L, Gao C, Xiang P, Zheng S, Pang P, Xu M (2020) CT-imaging based analysis of invasive lung adenocarcinoma presenting as ground glass nodules using peri- and intra-nodular radiomic features. Front Oncol 10:838. https://doi.org/10.3389/fonc.2020.00838
Naito M, Aokage K, Saruwatari K et al (2016) Microenvironmental changes in the progression from adenocarcinoma in situ to minimally invasive adenocarcinoma and invasive lepidic predominant adenocarcinoma of the lung. Lung Cancer 100:53–62. https://doi.org/10.1016/j.lungcan.2016.07.024
Patarroyo M, Tryggvason K, Virtanen I (2002) Laminin isoforms in tumor invasion, angiogenesis and metastasis. Semin Cancer Biol 12(3):197–207. https://doi.org/10.1016/S1044-579X(02)00023-8
Moriya Y, Niki T, Yamada T, Matsuno Y, Kondo H, Hirohashi S (2001) Increased expression of laminin-5 and its prognostic significance in lung adenocarcinomas of small size: an immunohistochemical analysis of 102 cases. Cancer 91(6):1129–1141. https://doi.org/10.1002/1097-0142(20010315)91:6%3c1129::AID-CNCR1109%3e3.0.CO;2-C
Zhang C, Zhang J, Xu F-P et al (2019) Genomic landscape and immune microenvironment features of preinvasive and early invasive lung adenocarcinoma. J Thorac Oncol 14(11):1912–1923. https://doi.org/10.1016/j.jtho.2019.07.031
Yim J, Zhu L-C, Chiriboga L, Watson HN, Goldberg JD, Moreira AL (2007) Histologic features are important prognostic indicators in early stages lung adenocarcinomas. Mod Pathol 20(2):233–241. https://doi.org/10.1038/modpathol.3800734
Nakanishi H, Matsumoto S, Iwakawa R et al (2009) Whole genome comparison of allelic imbalance between noninvasive and invasive small-sized lung adenocarcinomas. Cancer Res 69(4):1615–1623. https://doi.org/10.1158/0008-5472.CAN-08-3218
Braman NM, Etesami M, Prasanna P et al (2017) Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res 19(1):1–14. https://doi.org/10.1186/s13058-017-0846-1
Braman N, Prasanna P, Whitney J et al (2019) Association of peritumoral radiomics with tumor biology and pathologic response to preoperative targeted therapy for HER2 (ERBB2) –positive breast cancer. JAMA Netw Open 2(4):e192561. https://doi.org/10.1001/jamanetworkopen.2019.2561
Wang X, Zhao X, Li Q et al (2019) Can peritumoral radiomics increase the efficiency of the prediction for lymph node metastasis in clinical stage T1 lung adenocarcinoma on CT? Eur Radiol 29(11):6049–6058. https://doi.org/10.1007/s00330-019-06084-0
Levman JED, Martel AL (2011) A margin sharpness measurement for the diagnosis of breast cancer from magnetic resonance imaging examinations. Acad Radiol 18(12):1577–1581. https://doi.org/10.1016/j.acra.2011.08.004
Uthoff J, Stephens MJ, Newell JD et al (2019) Machine learning approach for distinguishing malignant and benign lung nodules utilizing standardized perinodular parenchymal features from CT. Med Phys 46(7):3207–3216. https://doi.org/10.1002/mp.13592
Wu G, Woodruff HC, Shen J et al (2020) Diagnosis of invasive lung adenocarcinoma based on chest CT radiomic features of part-solid pulmonary nodules: a multicenter study. Radiology 297(2):451–458. https://doi.org/10.1148/radiol.2020192431
Ferreira JR, Oliveira MC, de Azevedo-Marques PM (2018) Characterization of pulmonary nodules based on features of margin sharpness and texture. J Digit Imaging 31(4):451–463. https://doi.org/10.1007/s10278-017-0029-8
Acknowledgements
We are grateful to MEDcentra Technology Limited, Infiniti MINDS Limited, and Tencent AIMIS Open Platform for providing technical support.
Funding
This study was funded by the Guangdong Province Medical Scientific Research Foundation (B2018148); Science and Technology Program of Guangzhou (201903010028, 2017B030314026); Guangdong Provincial People’s Hospital Intermural Program (KJ012019447); Medical Artificial Intelligence Project of Sun Yat-sen Memorial Hospital (YXRGZN201902); Natural Science Foundation of Guangdong (2017A030313828, 2018A0303130113) and China (81572596, 81972471, and U1601223); Key Area Research and Development Program of Guangdong Province, China (No. 2018B010111001); National Key Research and Development Project (2018YFC2000702); and Health and Medical Research Funds (HMRF 02131026, HMRF 16172561).
Author information
Contributions
Prof. Zhou and Dr. Huang had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Huang, Lin, Xie, and Yu are co-first authors.
Concept and design: Zhou, Chan, Huang, Qiao, Yu
Acquisition, analysis, or interpretation of data: Huang, Xie, Lin, Liao, Wu, L. Yao, He, Li
Drafting of the manuscript: Huang, Yu, Chan, Lin, Z. Zhang
Critical revision of the manuscript for important intellectual content: Huang, Yu, Chan, Zhou
Statistical analysis: Xie, Huang, Lin, D. Chen
Obtained funding: Zhou, Chan
Administrative support: Qiao, D. Zhang
Technical support: Cho, Chan, G. Wang, S. Wang, Cao, M. Wang, Z. Wang, S. Yao
Supervision: Zhou, Chan, Qiao
Ethics declarations
Guarantor:
The scientific guarantor of this publication is Prof. Zhou.
Conflict of Interest:
The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article. The funding organizations had no role in the design and conduct of the study; the collection, management, analysis, and interpretation of the data; the preparation, review, or approval of the manuscript; or the decision to submit the manuscript for publication.
Statistics and Biometry:
One of the authors (D. Chen) has significant statistical expertise.
Informed Consent:
Because of the retrospective design of this study, the requirement for written informed consent was waived by the institutional review board.
Ethical Approval:
Institutional review board approval was obtained. The study protocol was approved by academic ethics committees and conducted according to the Declaration of Helsinki and good clinical practice guidelines. The present study was registered with ClinicalTrials.gov (registration number NCT04452058).
Methodology
• Retrospective
• Diagnostic study
• Multicenter study
• Machine Learning Approach
Additional information
Luyu Huang and Weihuan Lin appear as equal contributors in this article.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Huang, L., Lin, W., Xie, D. et al. Development and validation of a preoperative CT-based radiomic nomogram to predict pathology invasiveness in patients with a solitary pulmonary nodule: a machine learning approach, multicenter, diagnostic study. Eur Radiol 32, 1983–1996 (2022). https://doi.org/10.1007/s00330-021-08268-z
DOI: https://doi.org/10.1007/s00330-021-08268-z