Introduction

Recent advancements in lung cancer screening have led to the detection of an increasing number of early lung adenocarcinomas, often indicated by ground glass nodules(GGNs) [1]. In 2021, the World Health Organization categorized lung adenocarcinomas into three stages: adenocarcinoma in situ(AIS), minimally invasive adenocarcinoma(MIA), and invasive adenocarcinoma(IAC), based on their progression [2]. AIS and MIA are called preinvasive lesions(PIL).GGNs are classified into pure ground glass nodules(pGGNs) and part solid nodules(PSNs) depending on the presence of solid components [3]. While pGGNs are typically perceived as less aggressive, with pathology usually corresponding to AIS and MIA, approximately 16-27% of these nodules still present with IAC pathology [4].

Complete resection typically results in a cure for patients with AIS and MIA. However, those with IAC face a recurrence risk even after complete resection [5, 6]. The approach to lymph node management also varies between these types. Lymph node treatment is not necessary for AIS and MIA, but lymph node dissection or sampling is essential for IAC [7]. Therefore, accurately identifying IAC is crucial in treating pGGNs.

Intraoperative freezing exhibits limitations in determining the invasiveness of pGGNs [8]. In comparison, imaging features offer more advantages in assessing the invasiveness of these nodules [9]. Several studies have indicated the effectiveness of imaging characteristics in identifying the invasiveness of pGGNs [10,11,12]. Given the variability in findings across different researchers, a meta-analysis was undertaken to examine the relationship between imaging features and the invasiveness of pure ground glass nodules. This analysis led to the development of a predictive model, aiming to guide clinical management of these nodules.

Methods

Study registration

The protocol was registered in the International PROSPERO with the registration number CRD42022362400.

Study population characteristics

Derivation cohort

Participants

The patient data for the derivation cohort was sourced from a systematic review and meta-analysis. Two researchers independently explored a range of databases, such as PubMed, Embase, Web of Science, Cochrane Library, Scopus, Wanfang, CNKI, VIP, and CBM, up to December 20, 2023. They focused on finding relevant studies published in either Chinese or English. The search involved medical subject heading terms like computed tomography, ground-glass nodules, ground-glass opacities, adenocarcinoma in situ, micro-invasive adenocarcinoma, and invasive adenocarcinoma. Additionally, the bibliographies of chosen articles were reviewed to uncover more pertinent literature. In instances of disagreement, a third researcher was consulted to resolve the differences. Studies were included according to specific inclusion and exclusion criteria.Criteria for Inclusion: (1) Research exploring the link between computed tomography (CT) features and the invasiveness of pGGNs, published both domestically and internationally. (2) Utilization of surgery as the definitive method for pathological diagnosis. (3) Inclusion of both retrospective and prospective studies. (4) Studies providing raw data directly or through calculation. Criteria for Exclusion: (1) Exclusion of abstracts, reviews, and case reports. (2) Rejection of studies with duplicated data. (3) Disregarding studies with incomplete data or when specific data are unobtainable, even after author consultation. (4) Non-consideration of literature published in languages other than Chinese and English.

Data accessibility and quality evaluation

Initially, the evaluation of articles retrieved was conducted separately by two researchers, adhering to the established criteria for both inclusion and exclusion. This was followed by a comprehensive review together to ensure a consistent approach in their assessments. In instances of differing opinions, the lead author provided the decisive input, resolving these discrepancies through a collaborative dialogue involving all contributing authors. The methodology for data extraction from the selected papers involved key details such as the identification of the principal author, the publication year, the title of the study, the participant count, the maximum observed diameter, and a spectrum of characteristics observable in CT scans. These characteristics encompassed indicators like vascular convergence, the presence of spiculation, air bronchogram, vacuole sign, lobulation, the computed mean CT value, and signs of pleural traction. A separate, unbiased evaluation of each paper’s quality was carried out using the Newcastle-Ottawa Scale (NOS). This scale examines three critical domains: selection of the study (rated from 0 to 4 points), comparability among subjects (0 to 2 points), and the precision in assessing outcomes (0 to 3 points). Literature that achieved a NOS score of 5 or more was classified as high-caliber research.

Validation cohort

Participants

Our study evaluated 688 patients with pGGNs from the Third Affiliated Hospital of Kunming Medical University, admitted between September 2020 and September 2023. The patient selection followed specific inclusion and exclusion criteria. Inclusion criteria included: (1) Pre-surgical CT imaging data within two weeks showing one or more pGGNs; (2) Pathological diagnosis of lung adenocarcinoma (AIS, MIA, IAC) post-surgical resection of pGGNs; (3) Surgical intervention for one or more pGGNs; (4) No prior anti-tumor treatments like radiotherapy or chemotherapy; (5) Age 18 years or older. Exclusion criteria involved: (1) Patients with incomplete imaging data or medical records; (2) Lung infections that could affect image analysis; (3) Significant respiratory movement artifacts in images impairing imaging analysis; (4) Inconsistent locations of GGNs in postoperative pathology reports and preoperative CT images.

CT acquisition and image analysis

CT scans were performed using a Siemens spiral CT scanner, featuring 64 rows and 128 slices. The scanning covered the entire lung, from the apex to the base. The technical parameters for the scan were set as follows: the tube voltage at 120 kV, the tube current at 100 mAs, a pitch setting of 1.0, and the slices had a thickness of 1 mm. In terms of reconstructing the images, a matrix of 512 × 512 was employed, along with an algorithm specifically designed for high-resolution lung imaging. Settings for the lung window were adjusted to a width ranging between 1200 and 1500 HU and a level between − 600 and − 700 HU. For the mediastinal window, settings were established with a width from 400 to 500 HU and a level set at 40 to 50 HU. These parameters were derived from plain CT scan images. Two experienced chest radiologists, each with over 15 years in the field, independently reviewed the images, blinded to the patients’ clinical and pathological data. Discrepancies in their evaluations were resolved through discussion. HRCT characteristics assessed included (1) the spiculation sign, characterized by spinous protrusions at the nodule edges, extending into the surrounding lung tissue(Fig. 1A); (2) pleural traction sign, identified by linear or tent-shaped shadows connecting the lesion to the pleura, sometimes presenting as a star-shaped shadow(Fig. 1B); (3) maximum diameter, measured on axial CT images [13]; and (4) mean CT value, calculated using a region of interest (ROI) cursor at the largest cross-section of the nodule, avoiding large bronchi, blood vessels, and any vacuoles/cavities [14].

Fig. 1
figure 1

A Spiculation sign; B Pleura traction sign

Histopathological evaluation

In the training dataset, the classification of pGGNs was based on histopathological analysis of surgically removed specimens. These lesions were categorized into AIS, MIA, or IAC as per the 2021 classification criteria set by the World Health Organization [2]. For the diagnosis of these nodules, two seasoned pathologists, each with over 15 years of experience at the Pathology Department of the Third Affiliated Hospital of Kunming Medical University, conducted a joint review and confirmation.

Statistical analysis

Meta-analysis

Odds Radio(OR) and their 95% confidence interval(CI) were systematically compiled from the selected cohort studies. The identification of risk factors was based on the degree of variability across studies, determined using both the Q-test and the I² statistic. In cases where significant variability was observed, indicated by a P-value of less than 0.10 or an I² exceeding 50%, the synthesis of the pooled OR and 95% CI was conducted using a model that assumes random effects. Conversely, in the absence of notable heterogeneity, a fixed-effects model was employed. The robustness of the findings was assessed through sensitivity analyses, which involved the sequential omission of individual studies. The potential for publication bias within these studies was appraised utilizing Funnel plots and Egger tests and Begg test; a P-value greater than 0.05 was interpreted as a lack of substantial bias.

Model development and validation

A mathematical model was developed using significant CT characteristics identified by meta-analysis, with each characteristic’s impact quantified by multiplying it with its corresponding regression coefficient (β), where β equals the natural logarithm of the OR. The imaging features of the patients were included at Third Affiliated Hospital of Kunming Medical University were analyzed. Continuous variables adhering to a normal distribution were presented as mean ± standard deviation, while those with skewed distributions were expressed as median (interquartile range). Frequency and proportion were used to represent categorical variables. The analysis of the ROC curve included the calculation of sensitivity and specificity, along with the area under the curve(AUC) and the ideal threshold. The AUC values, which vary between 0.5 and 1.0, are indicative of the model’s predictive precision, where larger numbers suggest enhanced accuracy. Calibration curves evaluated the model’s calibration, and decision curve analysis(DCA) assessed its clinical applicability.

Results

Description of the cohorts

Derivation cohort

In this meta-analysis, initial searches identified 2586 relevant publications, but ultimately only 17 studies were selected after excluding duplicates, irrelevant titles and abstracts, animal research, and articles requiring full-text review. These studies were all retrospective, comprising 7 in English and 10 in Chinese(Fig. 2). Quality assessment of these publications was conducted using the NOS, with 8 articles scoring 5 points, 7 articles scoring 6 points, and 2 articles scoring 7 points. All selected studies had NOS scores above 5, signifying their high quality, as illustrated in Table 1.

Fig. 2
figure 2

Flowchart of literature search of screening

Table 1 Basic characteristics and quality evaluation results of the included literature

Validation cohort

The study encompassed 512 individuals with pure ground glass nodules (Fig. 3), among whom 372 were female, representing 73% of the total cohort. Non-smokers comprised 412 of the participants, making up 81% of the group. Additionally, 410 patients were under the age of 60, accounting for 80% of the total (Table 2).

Fig. 3
figure 3

Process for the selection of patients in the validation cohort

Table 2 The clinical and CT characteristic in the validation data set

Model development and validation

In the analyzed meta-study, four out of nine evaluated risk factors were linked to the onset of IAC. The study identified several factors with notable ORs: the largest tumor diameter (OR = 3.178, 95% CI: 2.132–4.737), average CT values (OR = 1.006, 95% CI: 1.002–1.011), pleural traction (OR = 1.94, 95% CI: 1.24–3.03) and spiculation (OR = 1.55, 95% CI: 1.05–2.29). Conversely, variables like vacuole and bronchial air signs, uniform tumor shape, lobulation, and vascular convergence showed no substantial correlation with IAC (P > 0.05), as depicted in Fig. 4. When subjected to sensitivity analysis, factors like maximum diameter, regular shape, average CT value, air bronchogram, and vascular convergence sign demonstrated consistency, whereas the lobulation sign exhibited variability as detailed in Fig. 5. Furthermore, The funnel plot shows no publication bias in the maximum diameter (provided in Supplementary Fig. 1) and Egger test and Begg test confirmed the absence of publication bias across all risk factors (P > 0.05).

Fig. 4
figure 4

Pooled OR (95% CI) and heterogeneity test of the risk factors for IAC in pure ground-glass nodule

Fig. 5
figure 5

The sensitivity analysis of CT-based features for the included studies. A) maximum diameter; B) mean CT value; C) lobulation; D) air bronchogram sign; E) regular shape; F) vascular convergence sign

The meta-analysis determined β values for several factors: 1.16 for maximum diameter, 0.01 for mean CT value, 0.66 for the presence of pleural traction, and 0.44 for spiculation sign. The derived mathematical formula is: 1.16×the largest lesion diameter + 0.01 × the average CT value + 0.66 × pleural traction + 0.44 × spiculation. Within the validation group, the ROC curve’s AUC was 0.88 (95% CI 0.794–0.868)(Fig. 6).The model exhibited an 81% sensitivity and an 80% specificity in predicting IAC. Furthermore, the analyses of the calibration curve (Fig. 7) and DCA (Fig. 8) for the validation group demonstrated the model’s accurate calibration and its clinical applicability.

Fig. 6
figure 6

ROC curve of risk prediction model for IAC with pure ground-glass nodule in validation set

Fig. 7
figure 7

Calibration curve of risk prediction model for IAC with pure ground-glass nodule in validation set

Fig. 8
figure 8

Decision analysis curve of risk prediction model for IAC with pure ground-glass nodule in validation set

Discussion

The increasing use of HRCT has led to the detection of a higher number of ground glass nodules [30]. These nodules, often pure ground glass in nature, are typically slow-growing tumors, presenting challenges in their management [31]. Identifying invasive adenocarcinoma (IAC) in nodules that appear as pure ground glass is crucial, given the necessity for more comprehensive surgical approaches and timely intervention [6].

Recent research indicates that various CT imaging characteristics correlate with the development of IAC in pure ground glass nodules, though findings vary across studies [10, 19]. Meta-analysis, being the pinnacle of evidence-based medicine, offers more reliable predictive models than individual studies [32, 33]. This approach was utilized to amalgamate data from relevant cohort studies, identifying IAC risk factors and formulating a predictive model. This model, with an AUC of 0.88, demonstrates 81% sensitivity and 80% specificity in predicting IAC, showcasing high calibration accuracy and clinical utility. Such findings are instrumental in guiding the timing and approach of surgical interventions for pure ground glass nodules.

In this meta-analysis, We extended the search time again.17 articles were reviewed, identifying four IAC risk factors: maximum diameter, mean CT value, pleural traction sign, and spiculation sign. Compared to previous study [34]. we re-extracted the OR of the maximum diameter and average CT values in each study to obtain a new forest map of combined results((provided in Supplementary Figs. 2, 3). It was noted that IAC risk escalates with an increase in the maximum diameter of pure ground glass nodules, possibly linked to heightened tumor aggressiveness and structural changes. A higher mean CT value, observed in IAC cases, might result from the myofibroblast matrix thickening due to tumor cell invasion into normal pulmonary tissue. Contrarily, Fu et al [19] reported that mean CT value was not a reliable identifier of IAC, possibly attributed to minimal group differences stemming from sample selection. The spiculation sign, indicative of tumor cell proliferation and local infiltration, increases with tumor invasion, aligning with our meta-analysis findings. Several studies have indicated no correlation between the pleural depression sign and IAC [15, 35], but their limited sample sizes may affect these conclusions. Our meta-analysis, however, found an independent association between IAC and the pleural traction sign.

We re-performed sensitivity analysis for all risk factors exhibiting significant heterogeneity. Results remained stable for all except lobulation, which demonstrated instability upon sensitivity analysis. By omitting the study by Yu et al [21], a statistically significant difference was observed in the incidence of lobulation between IAC and PIL in pGGNs. This variation could be attributed to disparities in clinical features or patient demographics. In addition, publication bias for all CT features was reassessed, which made our results more robust.

In our validation cohort, the prevalence of pure ground glass nodules was notably higher among women, non-smokers, and individuals under 60 years, aligning with other researchers observations [36, 37]. The reasons behind this pattern remain uncertain and warrant further investigation.

Previous study [34] have identified risk factors for differentiating PIL from IAC in pGGNs. However, Dai et al [38] found that using a single CT feature qualitatively has limited diagnostic value for differentiating PIL from IAC in pGGNs. A combination of both quantitative and qualitative CT features has shown effective in determining the invasiveness of pGGNs. Predictive models for IAC in pure ground glass nodules have been developed by several researchers. Hu et al. [11] suggested a model integrating multiple imaging characteristics for predicting invasiveness. Similarly, Min et al. [18] and Liu et al [39] emphasized the efficacy of combining quantitative and qualitative features in their diagnostic models. Yet, these models, primarily based on small, single-cohort studies, lack comprehensive validation, reducing their clinical applicability. Based on the high-level evidence of meta-analysis, we for the first time use regression coefficient combined with the results of meta-analysis to establish a prediction model. For validation, 512 patients with pure ground glass nodules from The Third Affiliated Hospital of Kunming Medical University were included. This is a large sample validation compared to previous studies, making the results more robust. Compared to previous studies [22, 26], our model shows a better predictive performance, with an AUC of 0.88 (95% CI: 0.852–0.909), with 81% sensitivity and 80% specificity. Additionally, calibration curves and decision analysis were conducted, yielding favorable results. This model can effectively identify IAC in pure ground glass nodules, thus aiding treatment decisions.

Our study has certain limitations. Firstly, due to variations in study design and CT scan parameters, heterogeneity across the literature is unavoidable. Secondly, the inclusion of only retrospective studies might introduce selection bias. Thirdly, as all studies were conducted in China, this may restrict the broader applicability of our findings. Lastly, the validation cohort was derived from a single center, limiting the model’s predictive efficacy testing. Multi-center patient testing is necessary for further validation.

Conclusion

The meta-analysis conducted led to the creation of a reliable and effective predictive model for assessing the risk of IAC in cases of pure ground glass nodules. This model integrates several key parameters, including the largest diameter, average CT values, indications of spiculation, and signs of pleural traction. It offers substantial utility in assisting clinicians in making informed decisions about the treatment of pure ground glass nodules.