Introduction

Lung cancer is the first-rank cause of cancer-related mortality worldwide, and it mainly includes two histological types, non-small cell lung cancer and small cell lung cancer (SCLC) [1]. Most lung cancer is non-small cell lung cancer, which accounts for over 83.0%, and its 5-year overall survival rate reaches about 23% [2]. SCLC has a much worse survival than non-small cell lung cancer, of which the 5-year overall survival rate is approximately 6%, although it is only a small part of lung cancer (about 15%) [2]. As an important treatment approach, surgery provides survival benefits for limited-stage SCLC patients, confirmed by previous studies and recommended for patients with clinical classification T1-2N0M0 [3,4,5]. Thus, limited-stage SCLC patients with lung resection have better survival outcomes than those without surgery [3, 4, 6]. However, the prognosis of those patients remains heterogeneous [1, 7]. Differences in patient prognosis may imply different treatment strategies; therefore, it is important to assess the patient’s prognosis accurately. The tumor, nodal, and metastasis (TNM) classification system is the leading evaluation approach to predicting the prognosis [8]. The current 8th edition TNM category system was launched in 2017, which had a more accurate predictive ability to assess the patient prognosis than the previous edition [9]. However, the pathological nodal (pN) classification is still the same in those two editions [10]. The status of lymph nodes indicates whether SCLC patients with a clinical classification of T1-2N0M0 receive adjuvant radiotherapy [5]. Accordingly, accurate lymph node status estimates are the key to providing prognostic information and determining the appropriate therapy strategies.

However, the current pN classification still has some drawbacks and is limited by the level of lymph node dissection. Some studies suggested that the number of examined lymph nodes (ELNs) could affect patient survival [11, 12]. In addition, the status of pN classification does not reflect the tumor burden in the lymph node enough. Some researchers confirmed that lymph node ratio (LNR) might be a more accurate tool than the current pT classification to predict prognosis and guide the plan for adjuvant therapy [13,14,15]. As a consequence, ELNs and LNR may play important roles in the modification of pN classification.

Previous studies have assessed patient prognosis by ELN or LNR; however, the ELN or LNR usually appears alone [16,17,18]. For example, in one report, the LNR was primarily evaluated as an indication of patient prognosis, but the harm caused by insufficient lymph node dissections was not reflected. Although another study evaluated both ELN and LNR, this study included an analysis of patients with metastases of lymph nodes and did not include SCLC [15]. Also, most studies utilized publicly available data sets and did not validate each other with data from their hospitals [15,16,17].

Moreover, there is a lack of studies including ELNs and LNR in SCLC patients with pathological classification T1-2N0-2. Therefore, in the present study, we analyzed the cutoff points for the number of ELNs and LNR in SCLC patients with pathological classification T1-2N0-2M0 using data from the multicenter database (MCDB) in China (included four hospitals) and Surveillance, Epidemiology, and End Results (SEER) database. In addition, we further compared predictive ability between the current pN classification and a new pN classification that combined ELNs with LNR in patient prognosis. This study aimed to elucidate which pathological nodal classification is the best: the current pN classification or the new pN classification that combined ELNs with LNR.

Materials and methods

Patient selection

A total of 1,156 SCLC patients with pathological classification T1-2N0-2M0 diagnosed between 2004 and 2018 were included in the present study. Eight hundred forty-nine cases were selected from the SEER database by SEER*Stat software (version 8.4.0), and 307 cases were collected from four hospitals in China (Shanghai Pulmonary Hospital, Sun Yat-sen University Cancer Center, Zhongda Hospital, and The Affiliated People’s Hospital of Ningbo University). Eligible patients for main analyses met the following criteria: (1) pathologically diagnosed as SCLC; (2) age ≥ 18 years old; (3) tumor size ≤ 5.0 cm; (4) diagnosed as classification M0 and N0-2; (5) known surgical approach; (6) known information about ELNs and positive lymph nodes. Patients were excluded if they: (1) dead within one month after surgery; (2) pN classification and lymph node records were inconsistent. Detailed information about patient selection standards is shown in Fig. 1. The work has been reported in line with the STROCSS criteria [19].

Fig. 1
figure 1

The standards of case selection in the SEER database (A) and MCDB from China (B)

Follow-up and data collection

The follow-up information on the SEER database and MCDB cohort from China was updated in November 2021 and March 2022, respectively. We used telephone and outpatient visit records for follow-up updates. The median follow-up time was 44.0 months (from 2 to 151 months). Patients after the operation visited the outpatient clinic at 3- or 6-month intervals for the first three years and 12-month intervals after that. The time interval between the diagnosis of the SCLC and the cancer-caused mortality was defined as cancer-specific survival (CSS). Cases were censored at the end of the follow-up. CSS was considered best concerning clinical relevance.

The patients’ clinical-pathological information was obtained from the SEER database: sex, age, tumor location, tumor grade, tumor size, surgical approaches, the number of ELNs, positive lymph nodes, radiotherapy record, chemotherapy record, and pN classification.

Statistical analyses

The X-tile software was used to determine the optimal cutoff points of ELNs and LNR in the pN0-2 cohort and pN1-2 cohort, respectively [20]. X-tile was tested statistically by enumeration, using different values as the truncation value groups, and the test result with the smallest p-value was recognized as the best truncation value for the data set. According to the results of the X-tile software, we finally classified patients into three new pathological N categories, new pN#1 (pN0-#ELNs > 3), new pN#2 (pN0-#ELNs ≤ 3 or pN1-2-#LNR ≤ 0.14), and new pN#3 (N1-2-#LNR > 0.14). Survival curves were generated through the Kaplan-Meier method and compared by the Log-rank test. The nomograms were conducted to show the weight of all variables visually. Univariable and multivariable Cox regression analyses were performed to calculate the hazard ratio (HR) and 95% confidence interval (CI) of the variables for cancer-specific mortality. A two-sided P < 0.05 was defined as statistically significant. The − 2LogLikelihood value, Akaike information criterion (AIC), and Bayesian Information Criterion (BIC) were calculated according to Cox regression. The abovementioned three indicators mean a more accurate model when their values are smaller [21]. The concordance index (C-index) and the area under the receiver operating characteristic (ROC) curve (AUC) were used to compare the prognostic, predictive ability between the current pN classification and N modification. The standard error (SE) was performed to evaluate the stability of the C-index. C-index and SE were calculated by “compareC” packages and nomograms constructed by “survival” and “rms” packages using R 4.1.2 software (https://www.r-project.org/), then other analyses were performed using software SPSS 25.0 (IBM SPSS, Inc., Armonk, IL, USA).

Results

Patient characteristics

In the SEER database, 849 patients with a median survival time of 31.0 months (range 1-190 months) were identified as 533 pN0 cases (62.8%), 174 pN1 cases (20.5%), and 142 pN2 cases (16.7%). Females outnumbered men, constituting 56.5% of the patients. 369 (43.5%) patients were aged 65 and below, whereas 480 (56.5%) were over 65 years old. The optimal cutoff value of ELNs in the entire cohort was 3. Thus, 23.0% and 77.0% of patients were categorized into the ELNs ≤ 3 and ELNs > 3 (Supplementary Fig. 1), respectively. Of note, 89 patients with ELNs ≤ 3 were in the cohort with lobectomy/ pneumonectomy; 88 patients with ELNs > 3 were in the cases with sub-lobectomy. LNR’s optimal cutoff point was 0.14 in the cohort with pN1-2 (Supplementary Fig. 2). Therefore, pN1-2 patients with LNR ≤ 0.14 and pN0 patients were combined into one group (n = 628, 74.0%), and patients with LNR > 0.14 were classified into another group (n = 221, 26.0%). According to the prognostic performance of ELNs and LNR, we combined those two factors to develop a new indicator, N modification. Overall, 402, 226, and 221 cases were classified as new pN#1 (pN0-#ELNs > 3), new pN#2 (pN0-#ELNs ≤ 3 or pN1-2-#LNR ≤ 0.14), and new pN#3 (N1-2-#LNR > 0.14), respectively.

As for MCDB from China, 307 cases with a median survival time of 33.0 months (range 2-152 months) included 149 pN0 diseases (48.5%), 51 pN1 diseases (16.6%), and 107 pN2 diseases (34.9%), respectively. There were 19 patients with ELNs > 3 in the group of sub-lobectomy and 23 patients with ELNs ≤ 3 in the category of lobectomy/ pneumonectomy. Overall, 30.4%, 28.0%, and 31.6% of cases were classified as N modification 1, N modification 2, and N modification 3. The patients’ other baseline characteristics are presented in Table 1.

Table 1 The characteristics of small cell lung cancer patients

Survival effect of ELNs and LNR

The median values of ELNs were 8 (range 1–51) and 11 (range 0–50) in the SEER database and MCDB from China, respectively. LNR’s median values were 0 and 0.04 in the SEER database and MCDB, respectively. The distribution of ELNs and LNR in the abovementioned database is shown in Fig. 2A-B. Patients’ survival curves were stratified successfully according to the optimal points of ELNs and LNR generated from the SEER database (Fig. 2C-F). Besides, the same cutoff points also performed prognostic significance in the MCDB cohort (Fig. 3A-D). In short, patients with ELNs > 3 or LNR ≤ 0.14 had s better survival than those with ELNs ≤ 3 or LNR > 0.14.

Fig. 2
figure 2

The distribution of ELNs and LNR in the SEER database (A) and MCDB from China (B). The survival curves based on ELNs were in the SEER database (C) and the pN0 patients of SEER database (D). The survival curves based on LNR were in the SEER database (E) and the pN1-2 patients of SEER database (F)

Fig. 3
figure 3

The survival curves based on ELNs were in the MCDB (A) and the pN0 patients of MCDB (B). The survival curves based on LNR were in the MCDB (C) and the pN1-2 patients of MCDB (D). The nomograms of the pN0 patients (E) and the pN1-2 patients (F) were constructed in the SEER database

Then, the nomograms were constructed to visually evaluate the weight of ELNs and LNR to affect survival in the pN0 cohort and the pN1-2 cohort in the SEER database, respectively (Fig. 3E-F). In the nomogram of the pN0 cohort, ELNs play the most important role in affecting survival; LNR’s weight was larger than the pN classification in the pN1-2 cohort. Therefore, we classified patients into four categories, N0-#ELNs > 3, N0-#ELNs ≤ 3, N1-2-#LNR ≤ 0.14, and N1-2-#LNR > 0.14. The survival curves based on the abovementioned four categories were drawn (Fig. 4A). The prognosis of N0-#ELNs ≤ 3 patients was similar to those with N1-2-#LNR ≤ 0.14 (Log-rank; P = 0.733). To further develop a new pN classification, we combined cases with N0-#ELNs ≤ 3 with those with N1-2-#LNR ≤ 0.14.

Fig. 4
figure 4

The survival curves were based on ELNs and LNR (A). The survival curves based on the N modification were in the SEER database (B) and the MCDB from China (C). The survival curves based on the current pN modification were in the SEER database (D) and the MCDB from China (E)

Prognostic significance of N modification

The new pN category included new pN#1 (pN0-#ELNs > 3), new pN#2 (pN0-#ELNs ≤ 3 or pN1-2-#LNR ≤ 0.14), and new pN#3 (N1-2-#LNR > 0.14). Overall, patients with new pN#1 had the best survival than the other two classifications (Fig. 4B). The 3-year CSS rates were 73.0%, 54.0%, and 29.0% in the cases of new pN#1, new pN#2, and new pN#3, respectively. The multivariable Cox regression identified the new pN classification as an independent prognostic indicator (all P < 0.001, Table 2). In addition, there was descreasing survival trend by category of the N modification (all P < 0.001, Fig. 4B).

Table 2 Univariable and multivariable analyses for cancer-specific mortality in the SEER database

To further validate the performance of the new pN classification, we used the same classification in the MCDB cohort. Similarly, in the MCDB cohort, patients’ prognoses showed a decreasing trend by the new pN classification (all P < 0.001, Fig. 4C). The median survival time of patients with the new pN#1, new pN#2, and new pN#3 were 98 months, 50 months, and 22 months, respectively. Besides, the new pN classification was confirmed as an independent prognosticator (Table 3).

Table 3 Univariable and multivariable analyses for cancer-specific mortality in the multi-center database from China

Comparison between the new pN and the current pN classifications

Survival curves based on the current pN classification were drawn in order to compare further the predictive performance between the new pN and current pN classifications. The stratified effect of survival curves originating from the current pN classification was not as distinct as those derived from the new pN category (Fig. 4D-E). The values of -2LogLikelyhood, AIC, and BIC were smaller in the multivariable Cox regression model that included the new pN category than those in the model that included the current pN classification (Tables 2 and 3). Thus, the new pN classification had a better ability to indicate prognosis than the current pN classification.

Besides, the C-index was also calculated to compare the predictive effect between the current pN and the new pN classifications (SEER: 0.617 vs. 0.654, P < 0.001, Table 2; MCDB: 0.622 vs. 0.679, P < 0.001, Table 3). We further drew the ROC curves and calculated the AUC to validate the performance of the new pN category. The results of the analysis for the SEER database showed that the new pN classification had higher values of AUC in predicting 1-year CSS (0.682 vs. 0.637), 2-year CSS (0.692 vs. 0.649), and 3-year CSS (0.686 vs. 0.644) than the current pN classification, respectively. Similar results were also presented in the analysis of the MCDB cohort (Fig. 5).

Fig. 5
figure 5

The ROCs of 1-year, 2-year, and 3-year CSS were in the SEER database (A-C) and the MCDB from China (D-F).

Discussion

SCLC is a small part of lung cancer that has received less attention from investigators than non-small cell lung cancer. In thoracic surgical oncology, surgeons seem more inclined to discuss which combined stages of the disease are appropriate to undergo surgical resection. The National Comprehensive Cancer Network guidelines recommend that SCLC patients with clinical stage I-IIA (classification T1-2N0M0) perform a surgical operation [5]. Thus, recent studies on the surgical treatment of SCLC have explored the role of surgery in stage III patients [3, 6, 22, 23]. Based on their results, surgery was considered a potential treatment for stage III SCLC patients, which provided patients with an optional therapeutic approach, though their research results were not consistent enough. However, there are few studies on lymph node dissection and pN classification in the limited-stage SCLC patients. The current pN classification was based on the idea that lymph node metastasis follows the direction of lymphatic drainage from intrapulmonary lymph nodes (pN1) to mediastinal lymph nodes (pN2) and then to distant lymph nodes (pN3). However, in some studies and our clinical practice, it has been found that lymph node metastases can skip the N1 station and jump directly to the N2 station (skip-N2). Moreover, patients with the N1 station and N2 station metastases are also classified as N2. However, although both are N2, the prognosis of patients with skip-N2 metastases is much better than those with N2 metastases that include N1 stations [24,25,26]. In other words, the current pN classification is likely to have some flaws in evaluating the prognosis of SCLC patients after surgery. Accordingly, some researchers used LNR and the number of positive lymph nodes to supplement the pN classification in order to strengthen the predictive ability [15, 27]. Regrettably, both of their studies only collected non-small cell lung cancer patients. Therefore, to the best of our knowledge, there is still information lacking regarding improving pN classification in SCLC.

In the present study, we used a large cohort from the SEER and the multicenter database in China to develop and validate a new pathological nodal classification. ELNs and LNR reflected the status of lymph node dissection and lymph node metastasis and reached optimal cutoff points by X-tile software. We develop a new pN classification. The new pN classification had a better predictive ability than the current pN classification. Therefore, we propose that the new pN classification that combines LNR and ELNs has a better performance than the current pN classification in predicting SCLC patients’ prognoses.

A sufficient lymph node dissection is key to categorizing the pathological nodal classification and evaluating the prognosis precisely. Previous studies confirmed that an insufficient lymph node dissection was associated with a poor prognosis in the field of non-small cell lung cancer [12, 16]. Besides, the survival benefit of lobectomy was unable to be accurately assessed because of lack of lymph node dissection information compared with sub-lobectomy, according to a recent report [28]. The studies mentioned above show that adequate lymph node dissection plays a vital role in thoracic surgical oncology. Nevertheless, the current guidelines about SCLC do not indicate a consistent number of lymph nodes necessary [5]. Analysis of data from 1051 limited-stage SCLC patients indicated that survival was improved s when the ELNs were over 7 [29]. The threshold for the number of ELNs from their study was more extensive than ours. The reason might be that they excluded patients with sub-lobectomy; however, we did not. We found that patients with sub-lobectomy also performed lymph node dissection when we processed the data; therefore, we decided to reserve this part of the data. Besides, another study performed the same operation of data as the present study [17]. Regardless of the difference in the number of ELNs, all demonstrated similar results that inadequate lymph node dissections decreased the survival time of limited-stage SCLC patients after surgery.

LNR is a simple and effective tool for postoperative prognostic assessment for SCLC patients with classifications T1-2N0-2M0. However, it is undeniable that the calculation of LNR is still based on lymph node dissection. The present study showed that patients with N0-#ELNs ≤ 3 had similar survival to those with N1-2-#LNR ≤ 0.14. Those results might suggest that patients with inadequate lymph node dissection were accompanied by the diseases of lymph node metastasis. The low number of ELNs masked the actual metastatic status of the lymph nodes. Accordingly, the N modification that combined ELNs and LNR might reflect the actual nodal situation, which showed a better predictive ability than the current pN classification in the present study. Similarly, the study from Li F at el. also demonstrated that the pN classification combined with the number of metastatic lymph nodes or LNR had a more accurate performance than the current pN classification alone in the non-small cell lung cancer patients with pN1-2 classification [15]. On the one hand, these results illustrated the shortcomings of the current pN classification and, on the other hand, suggested that the combination of LNR improved the prognostic, predictive accuracy of the pN classification. Thus, we suggest that the next 9th edition TNM staging system should consider the information about lymph nodes except the anatomical location of lymph nodes, such as ELNs, LNR, and the number of metastatic lymph nodes.

There were still some limitations in this study. First, the present research was a retrospective study; therefore, some confounders were not included in the study; selection bias was still possible. Second, the treatment sequence of chemotherapy and radiotherapy in the SEER database was unknown. Thus, the cases with neoadjuvant therapy were not excluded from the SEER database. However, we excluded cases with neoadjuvant therapy in the multicenter database from China and reached similar results after analyses for the MCDB to validate the performance of the N modification. Third, the distribution of baseline characteristics still had some differences between the MCDB and SEER databases, though the sample size of MCDB was large. Finally, the N modification was confirmed to have a better performance than the current pN classification, but this does not mean it could replace the traditional TNM staging system. We need more studies to confirm our findings.

Conclusions

In this multicenter cohort we developed and validated a new nodal classification for SCLC patients with pathological category T1-2N0-2M0 after surgery. Besides, we propose that an adequate lymph node dissection is essential; surgeons should perform and consider the situation of ELNs and LNR when they evaluate postoperative prognoses of SCLC patients.