Introduction

Lung cancer is the most common cause of cancer-related death worldwide, causing 69,410 male deaths and 62,470 female deaths in the United States alone in 2021 [1]. As a prominent type of lung cancer, non-small cell lung cancer (NSCLC) accounts for approximately 85% of all types of lung cancer, with lung adenocarcinoma and lung squamous cell carcinoma (SCC) accounting for 60 and 15% of histological subtypes, respectively [2]. With the advent of the new era of targeted therapy and immunotherapy, the overall survival (OS) of patients with NSCLC has considerably increased for each tumor stage [3]. Despite these novel treatments, lung surgery remains the most substantial and supportive tool for treating NSCLC. For patients with locally advanced NSCLC, neoadjuvant therapy plays a crucial role in downstaging lung cancer and providing an opportunity for surgery, which effectively improves prognosis [4]. Traditional neoadjuvant therapy includes chemotherapy and chemoradiation, and molecular-targeted therapy and immunotherapy are evolving as revolutionary neoadjuvant treatments for NSCLC [5]. However, tools and predictive models for predicting the prognosis of patients receiving neoadjuvant therapy followed by lung surgery are limited.

The American Joint Committee on Cancer (AJCC) TNM staging system is the most commonly used tool for predicting recurrence and survival [6]. For the N descriptor, the lymph node (LN) is based on the lymphatic region involved without any information of the number of dissected LNs (NDLN) and the number of positive LNs (NPLN) [7]. The log odds of positive LNs (LODDS) is a novel LN descriptor that has advantages over the N stating descriptor of the TNM system in many malignancies, including rectal cancer [8], gallbladder cancer [9], gastric adenocarcinoma [10], cervical cancer [11], and esophageal carcinoma [12]. LODDS is calculated using the following formula: ln([NPLN + 0.5]/[NDLN-NPLN + 0.5]). Therefore, it is usually a negative number. The higher the LODDS, the higher the NPLN, and the worse the prognosis. The LN ratio (LNR) is another N descriptor that represents the NPLN/NDLN ratio. Wang et al. reported that the nomogram combining TNM staging with LODDS+LNR performed better than the AJCC 8th TNM staging in clinical practicability [13]. Yu et al. found that LODDS exhibited better predictive power than the N, NPLN, and LNR staging systems [14]. However, no previous reports have assessed the application of LODDS in predicting the prognosis of patients receiving neoadjuvant therapy followed by lung surgery. Thus, in this study, we screened suitable cases from the SEER database and compared the value of LODDS and TNM N descriptors. Finally, we constructed a model combining LODDS with clinicopathological characteristics for better prediction. This study was conducted according to the TRIPOD reporting checklist [15].

Materials and methods

Patient selection

All patients were selected from the SEER database (http://seer.cancer.gov/). Eighteen population-based cancer registries were selected from the SEER database, and the SEER*Stat program (v. 8.3.9) was used to extract information of patients with lung cancer. The extraction terms were as follows: “the location of the disease: lung and bronchus” and “diagnosis year: 2004–2015.” In this study, we enrolled patients with primary lung cancer who received neoadjuvant therapy and lung surgery between 2004 and 2015 (only neoadjuvant chemotherapy or radiotherapy, without any patients receiving immune checkpoint inhibitors and tyrosine kinase inhibitors). Figure 1 shows a flowchart of patient selection. The following variables were extracted: “Age recode with <1 year olds,” “Race recode (White, Black, Other),” “Sex,” “Marital status,” “Derived AJCC T, 6th ed (2004-2015),” “Derived AJCC M, 6th ed (2004-2015),” “Primary Site – labeled,” “Histologic Type ICD-O-3,” “RX Summ--Surg Prim Site (1998+),” “CS tumor size (2004-2015),” “CS Tumor Size/Ext Eval (2004-2015),” “Grade (thru 2017),” “Survival months,” “Vital status recode (study cut-off used),” “Regional nodes positive (1988+),” “Regional nodes examined (1988+),” “CS Reg Node Eval (2004-2015),” “First malignant primary indicator.” The AJCC TNM staging system was updated to the 8th version. Variables of “CS Tumor Size/Ext Eval (2004-2015)” and “CS Reg Node Eval (2004-2015)” were used to identify patients who underwent neoadjuvant therapy. The following patients were excluded: (a) patients with metastatic disease; (b) patients who did not undergo lung surgery; (c) patients in whom lung cancer was not the only primary tumor; (d) patients not receiving neoadjuvant therapy; (e) patients without information about the number of retrieved and positive LNs; and (f) patients with unknown race, marital status, tumor site, laterality, grade, T stage, and N stage.

Fig. 1
figure 1

Flowchart of patient selection

Ethical statement

Informed consent was waived, and ethical approval from the institutional review board was not needed because SEER is a public database and the SEER data contained no personal identifying information. This study was conducted according to the Declaration of Helsinki and the Harmonized Tripartite Guideline for Good Clinical Practice of the International Conference on Harmonization.

LODDS calculation

LODDS was calculated using the following formula: lg([NPLN + 0.5]/[NDLN-NPLN + 0.5]), where NPLN is the number of positive LNs and NDLN is the number of dissected LNs. X-Tile software (version 3.6.1; Yale University School of Medicine, New Haven, CT, USA) was used to identify the optimal LODDS cutoff value with the maximal survival difference or highest log-rank χ2 value among the three groups [16]. As the X-tile software presented − 1.07 and − 0.27 as the LODDS cutoff value for the included patients, LODDS was divided into three ranges: LODDS<− 1.07, − 1.07 ≤ LODDS<-0.27, and LODDS≥-0.27.

Statistical analysis

R software (version 4.0.2) was used for statistical analysis. Statistical significance was set at p values < 0.05. Categorical variables are presented as proportions. Chi-square tests or Fisher’s precision probability tests were performed for different evaluations of categorical variables. Univariate and multivariate Cox regression analyses were conducted to screen risk factors for OS when variables with P values < 0.05 were finally incorporated into the risk model.

Kaplan–Meier survival curves and log-rank tests were used to compare the OS of patients with different LODDS ranges and N classifications. Receiver operating characteristic (ROC) curves were used to evaluate the predictive value of the N classification, LODDS, and multivariate model for patients’ long-term outcomes. Weighted mean rank statistics were used to compare the area under the curve (AUC) of the N classification, LODDS, and multivariate model [17]. To better balance the baseline of patients with different LODDS ranges, propensity scores were determined using generalized boosted models, and inverse probability of treatment weighting (IPTW) was used to adjust the Cox regression analyses [18]. In addition, prediction accuracy was compared by calculating the integrated discrimination improvement (IDI) and net reclassification improvement (NRI) between the LODDS, N classification, and multivariate model.

Results

Demographic and clinicopathological characteristics

In Table 1, we compared demographic and clinicopathological characteristics of patients with different LODDS ranges. A total of 2059 patients from the SEER database were enrolled in this study and divided into three groups: LODDS<− 1.07, − 1.07 ≤ LODDS<-0.27, and LODDS≥-0.27. There was no significant difference among the groups in terms of age, marital status, surgery type, and radiotherapy (all P > 0.05). However, the variables of sex, race, laterality, primary site, histologic type, differentiation, and chemotherapy were significantly different among the three groups (all P < 0.05). Patients with LODDS≥-0.27 had higher proportions of females, right laterality, primary site of the lower lobe, adenocarcinoma, low differentiation grade, low T1 stage, and chemotherapy. Since LODDS was calculated using NDLN and NPLN, patients with LODDS≥-0.27 had a higher N stage, more regional nodes examined, and positive. We conducted IPTW to eliminate demographic and clinicopathological characteristics of patients with different LODDS ranges. As shown in Fig. S1, the absolute standardized differences in variables decreased under 0.2 and mostly under 0.1, indicating that the three groups were well matched after IPTW.

Table 1 Baseline characteristics of patients with NSCLC who received neoadjuvant therapy

Univariate and multivariate cox regression analyses

We conducted univariate and multivariate Cox regression analyses to confirm independent risk factors for patient survival, as shown in Tables 2 and 3. Before IPTW, univariate analysis demonstrated that LODDS, age, sex, T stage, N stage, and radiotherapy were significantly associated with OS (all P < 0.05). However, multivariate analysis showed that LODDS, age, sex, T stage, and radiotherapy were independent risk factors for patient survival (all P < 0.05), with N stage excluded.

Table 2 Cox regression analysis of patients with NSCLC who received neoadjuvant therapy before IPTW
Table 3 Cox regression analysis of patients with NSCLC who received neoadjuvant therapy after IPTW

After IPTW, the results of the univariate analysis were similar to previous results, showing that LODDS, age, sex, T stage, N stage, and radiotherapy were statistically significant variables, whereas race, marital status, primary site, histologic type, differentiation, and surgery type were newly added variables (all P < 0.05). Furthermore, multivariate regression analysis indicated that LODDS, age, sex, race, marital status, primary site, differentiation, and T stage were independent risk factors for patient survival (all P < 0.05), with N stage excluded. With or without IPTW, LODDS was an independent risk factor for the prognosis of patients receiving neoadjuvant therapy followed by lung surgery.

We also conducted subgroup analysis to further validate the significance of LODDS. We further compared the relative risks of different LODDS ranges by dividing patients into different subgroups based on the variable. We found that a higher LODDS was associated with a higher risk in most subgroups, as shown in Table 4. However, there was no statistical significance among the different LODDS ranges with respect to middle lobe, overlapping primary site, grade I differentiation, grade IV differentiation, and N3 stage, which could be because of the relatively small sample size.

Table 4 Multivariable Cox regression analysis of subgroups of patients with NSCLC who received neoadjuvant therapy

Survival analysis

We compared the long-term survival of patients with different N classifications (Fig. 2A). Although patients with different N stages presented different survival curves with P values of 0.036, the curve was not separate and mostly overlapped. Nevertheless, when we divided patients into three groups based on LODDS ranges, we found that the curve was much more distinct (Fig. 2B). Patients with LODDS<-1.07 had the best survival status compared to patients in the other two groups, while patients in the middle group (− 1.07 ≤ LODDS<-0.27) had better OS than those with LODDS≥-0.27 (P < 0.0001). Even after IPTW, the survival curve remained significant among the three groups (P < 0.0001), as shown in Fig. 3.

Fig. 2
figure 2

Kaplan–Meier estimates of OS of patients with NSCLC who received neoadjuvant therapy stratified by N classification (A) and LODDS (B) before IPTW. OS, overall survival; NSCLC, non-small cell lung cancer; LODDS, log odds of positive lymph node; IPTW, inverse probability of treatment weighting

Fig. 3
figure 3

Kaplan–Meier estimates of OS of patients with NSCLC who received neoadjuvant therapy stratified by LODDS after IPTW. OS, overall survival; NSCLC, non-small cell lung cancer; LODDS, log odds of positive lymph node; IPTW, inverse probability of treatment weighting

ROC curve analysis

We compared the accuracy and prognostic value of the N classification, LODDS, and multivariate model using ROC curves and AUC comparisons. We used a multivariate model with five variables that were independent prognostic indicators in the multivariate analysis in Table 2: LODDS, age, sex, T stage, and radiotherapy. As shown in Fig. 4, LODDS had a significantly higher AUC than the N classification for 1-year (P = 0.008), 3-year (P = 0.007), and 5-year OS (P = 0.010) but not at 10-year OS (P = 0.228). However, the multivariate model had a significantly higher AUC than LODDS and N classification for 1-, 3-, 5-, and 10-year OS (all P < 0.001). We also compared the IDI and NRI of the N classification, LODDS, and multivariate model, as shown in Table 5. On considering LODDS as a reference, we found that the IDI and NRI of the N classification were negative. At the same time, those of the multivariate model were positive, suggesting that the LODDS had significantly higher predictive accuracy than the N classification but had lower predictive accuracy than the multivariate model (P < 0.05).

Fig. 4
figure 4

ROC curves for a multivariable model (including LODDS, age, sex, T stage, and radiotherapy), LODDS, and N classification predicting 1-year (A), 3-year (B), 5-year (C), and 10-year (D) OS of patients with NSCLC who received neoadjuvant therapy. ROC, receiver operating characteristic; LODDS, log odds of positive lymph node; OS, overall survival; NSCLC, non-small cell lung cancer

Table 5 Comparison of predictive performance between LODDS and other models

Discussion

Controversies regarding the nodal status of the 8th TNM staging system have existed for several years. In summary, there are four commonly used nodal classifications for lung cancer: N classification, NPLN, LNR, and LODDS [19]. The N classification in the TNM staging system is the most commonly used prognostic tool for patients with lung cancer. The N classification for lung cancer is easy to understand and remember; it categorizes no metastasis to LNs as N0, metastasis to ipsilateral peribronchial and/or hilar nodes and intrapulmonary nodes as N1, metastasis to ipsilateral mediastinal and/or subcarinal nodes as N2, and metastasis to contralateral mediastinal and/or hilar nodes and any supraclavicular LNs as N3 [20]. The TNM staging system helps clinicians determine treatment and predict prognosis. However, the N classification is based on the anatomic position of positive nodes, without any quantitative information, leading to inaccuracy and low discrimination power [21]. In this study, we found that the AUCs of the N classification were 0.493 (95% CI 0.461–0.526), 0.538 (95% CI 0.513–0.563), 0.549 (95% CI 0.522–0.577), and 0.603 (95% CI 0.554–0.651) for 1-, 3-, 5-, and 10-year survival, respectively. The low discriminative power of the N classification of the TNM staging calls for a more accurate nodal status assessment tool.

For patients undergoing radical lung cancer resection, systematic LN dissection (SND) is the standard procedure for surgical treatment of NSCLC [22], especially for patients receiving neoadjuvant therapy who are usually diagnosed with stage II–III NSCLC, when systematic LN dissection is necessary. In this study, 78.1% of patients underwent lobectomy and 19.3% underwent pneumonectomy, with only 2.7% of patients undergoing sublobectomy. Mun et al. reported that lobe-specific mediastinal LN dissection is vital for patients with pN1, whereas SND contributes to survival in patients with pN1 after recurrence [23]. The LNs retrieved during surgery provide sufficient knowledge about nodal status with quantitative information. NPLN represents the number of positive LNs requiring retrieval of LNs during surgery [14]. However, NPLN can be significantly affected by the surgical technique and number of examined LNs because the pathological results are dependent on LN dissection. Using a Chinese multi-institutional registry and the US SEER database for stage I–IIIA resected NSCLC, Liang et al. recommended that 16 LNs should be examined for prognostic stratification [24].

Ratio-based nodal evaluation methods are also used and do not require information of the number of examined LNs, including LNR and LODDS. LNR is calculated by dividing NPLN with NDLN. LODDS is calculated using the formula: log (NPLN+ 0.50)/(NDLN−NPLN+ 0.50). Therefore, LODDS is the only indicator that includes the numbers of dissected, positive, and negative LNs. The controversy regarding the comparison between LNR and LODDS is that they demonstrate advantages in different situations [25, 26]. However, LODDS was superior to LNR for lung cancer in most studies. Yu et al. demonstrated that LODDS showed better predictive performance than the N classification, NPLN, and LNR in patients with node-positive SCC after surgery [14]. Deng et al. found that LODDS and LNR performed slightly differently in patients with different resected LNs. They proved that LODDS was slightly better than LNR for patients with < 10 resected LNs, whereas LNR was slightly better than LODDS for patients with ≥10 resected LNs [27]. When combined, LODDS and LNR had the highest predictive accuracy compared with other models for cancer-specific survival and OS of patients with lung adenocarcinoma after surgery [13]. However, there are no previous reports on the predictive ability and accuracy of LODDS in patients receiving neoadjuvant therapy and surgery. In this study, we found that LODDS could effectively differentiate patients’ prognoses. In addition, LODDS demonstrated a much higher AUC than N classification for 1-, 3-, and 5-year OS prediction but not for 10-year OS prediction. Univariate and multivariate Cox regression analyses demonstrated that LODDS was an independent risk factor for patients’ OS. Subgroup analyses confirmed the results in the different subgroups.

We noticed that baseline characteristics and demographic data of patients with different LODDS ranges were significantly different. To eliminate the bias caused by this difference, we applied IPTW to balance the baseline characteristics and demographic data. With or without IPTW, LODDS showed statistical significance in the Kaplan–Meier curve and regression analyses. Because of its excellent predictive ability, LODDS was incorporated into the multivariate model to construct a nomogram. Wang’s nomogram included LODDS+LNR as the nodal status factor and showed excellent predictive ability with a high C-index (0.7222 for the CSS nomogram, 0.6920 for the OS nomogram) for patients with T1-4N0-2M0 lung adenocarcinoma after surgery [13]. This study used a multivariate model with five critical factors: LODDS, age, sex, T stage, and radiotherapy. The model showed a higher AUC than the N classification and LODDS. The multivariate model’s predictive performance indicators, IDI and NRI, were also higher than those of the N classification and LODDS, which proved that LODDS is an independent and compatible factor for LN staging and could be incorporated into the risk assessment model well.

Compared with the N descriptor, NPLN, LNR, and LODDS had an unignorable shortcoming. They depended on the dissection of LNs and pathological results, while the N descriptor could be determined using PET-CT and LN biopsy. Therefore, the N stage can directly decide the TNM stage and the following treatment approach before surgery; however, NPLN, LNR, and LODDS can only be adopted as tools to predict recurrence and prognosis after surgery. This study had several limitations. On the one hand, many important data are absent in the SEER database, including smoking history, sequence of surgery and chemotherapy, and novel treatments with tyrosine kinase inhibitors and immune checkpoint inhibitors. Missing data may lead to a worse predictive effect of the nomogram. We attempted to construct a nomogram based on our findings but failed in this study because the C-index was very low. We suspected that the low C-index of the nomogram was because of the heterogeneity of patients who received very different treatment regimens. On the other hand, the new era of tyrosine kinase inhibitors and immune checkpoint inhibitors brings a paradigm shift for neoadjuvant therapy for patients with NSCLC, which challenges LODDS and other nodal status indicators.

Conclusions

For patients with NSCLC receiving neoadjuvant therapy followed by surgery, LODDS had better predictive ability than the AJCC N staging descriptor. A multivariate clinicopathological model with LODDS demonstrated excellent performance in predicting prognosis. LODDS provides clinicians with more accurate nodal status information, while nomograms and external validation are required in future studies.