Introduction

The expansion of screening programs has allowed an increasing number of lung cancer patients to undergo surgery [1, 2]. In patients without lymph node metastasis, such as adenocarcinoma in situ (AIS) and minimally invasive adenocarcinoma (MIA), lung cancer could be practically cured [3, 4]. Once the lymph nodes are involved, however, the risk of postoperative recurrence is significantly increased [5, 6]. The standard procedure for non-small cell lung cancer (NSCLC) surgery is lobectomy plus lymph node dissection [7]. However, lymph nodal sampling is widely performed due to its convenience in the real-world settings. The non-inferiority of lymph nodal sampling compared to lymph node dissection on postoperative outcome is still controversial. Sufficient number of lymph nodes needs to be sampled to allow accurate disease staging and selection of appropriate postoperative treatment [8]. Several studies supported station-based sampling [9, 10] or lobe-specific systematic nodal dissection based on tumor histology [11], but others suggested a need for a higher and more precise number of dissected lymph nodes [12,13,14,15]. This issue is particularly important for clinical stage IA lung adenocarcinoma since the rate of nodal involvement varies substantially (0-23.7%) [16]. We conducted a retrospective analysis to examine the relationship between the number of dissected lymph nodes and recurrence-free survival (RFS) in patients with clinical stage IA pure solid lung adenocarcinoma and to clarify the number of resected lymph nodes which did not lead to worse outcome.

Methods

Study design and patients

We retrospectively screened all adult patients aged 18 years or older who underwent lobectomy for pure solid cT1N0M0 lung adenocarcinoma at the Fourth Hospital of Hebei Medical University during a period from January 1, 2015 to December 31, 2017. The flowchart of the subject enrollment is provided in Fig. 1. The study protocol was approved by the Institutional Ethics Review Board (2018mec160). Based on the retrospective design, informed consent was waived.

Fig. 1
figure 1

A flowchart of patient selection

For inclusion in the final analysis, a diagnosis of primary lung adenocarcinoma must be confirmed by postoperative pathology, and preoperative computed tomography (CT) of the chest must show pure solid nodules. Patients with AIS or MIA were excluded from the analysis. Patients with part-solid nodules were excluded from the study due to the significant disparity in the rate of lymph node metastasis versus solid nodules [17].

Preoperative staging

Clinical staging was performed according to the 8th edition of the TNM classification for lung cancer [18] and based on thin-section CT on a 128-slice spiral system (Siemens, Definition, Flash). The following parameters were used during scanning: slice thickness, 5 mm; slice interval, 5 mm; voltage, 120 kVp; scanning matrix, 512 × 512; lung window width, 1,200 HU; lung window level, -600 HU; mediastinal window width, 350 HU; mediastinal window level, 40 HU.

Two thoracic surgeons (RY and NH) independently reviewed CT scans to determine the location of pulmonary nodules, their maximum diameter and density, and the presence or absence of enlarged lymph nodes. Disagreements were resolved by a senior thoracic surgeon with 40 years of experience (JL). Pure solid nodules were defined as opacification that completely obscured the underlying vascular pattern [19]. cT1 was defined as maximum diameter at < 30 mm on chest CT. cN0 was defined as absence of non-calcified lymph nodes with a short-axis diameter of > 1 cm in the mediastinum or hilum, parabronchial or intrapulmonary parenchyma [20]. Although PET-CT can identify positive lymph nodes more accurately than CT, routine preoperative PET-CT does not seem to be widely available in various hospitals in most cities in China [21]. Therefore, we still chose CT as the screening method for enrolled patients.

Postoperative staging and the number of dissected lymph nodes

The number of dissected lymph nodes was the sum of the mediastinal and intrapulmonary lymph nodes. Additional pulmonary lymph nodes identified during postoperative pathological examination were added to this sum at the discretion of the pathologists. The number of intact lymph nodes in the region that were clustered together after formalin fixation was calculated by the pathologist.

The pT and pN staging was defined according to the 8th edition of TNM staging system.

Follow-up

The routine follow-up schedule at authors’ institute was every 6 months during the first 2 years after surgery and every 12 months thereafter. Each follow-up visit included a clinical examination, CT of the chest and abdomen, and a panel of serum tumor markers that consisted of carcinoembryonic antigen, squamous cell carcinoma antigen, neuron-specific enolase, carbohydrate antigen 125, and cytokeratin 19 fragment antigen 21 − 1.

If recurrence was suspected, additional examinations were performed as appropriate, such as contrast-enhanced cranial magnetic resonance imaging and radioisotope skeletal scanning. Local recurrence was defined as recurrence of cancer in the ipsilateral lung or mediastinal/hilar lymph nodes. Distant metastasis was defined as recurrence in all other part than local recurrence. RFS was calculated from the time of surgery to recurrence (either locoregional of distant) or death, whichever came first. The last follow-up was conducted in July, 2022.

Statistical analysis

Data were analyzed using R 4.1.2 (The R Foundation, www.r-project.org) and Empower (X&Y Solutions, Boston, MA, USA). P < 0.05 (2-sided) was considered statistically significant. Categorical variables were analyzed by chi-squared or Fisher’s exact test and reported as number and percentage. Continuous variables following normal distribution were analyzed by Student’s t-test and reported as mean ± standard deviation. Continuous variables not following normal distribution were analyzed by Mann–Whitney U test and reported as median (interquartile range; IQR).

Cox proportional hazards regression was performed to identify independent predictors of poor RFS. Independent variables (in addition to the number of dissected lymph nodes) were selected based on previous studies and > 10% change in effect estimates [22]. The relationship between RFS and the number of dissected lymph nodes was assessed using restricted cubic spline analysis and fitted using a model based on the log-likelihood ratio. A recursive algorithm was used to identify the inflection point in the number of dissected lymph nodes. Kaplan-Meier curves of RFS were compared between the patient subgroups using the log-rank test.

Sensitivity analyses were conducted by: (1) limiting the number of dissected lymph nodes in single-station to a maximum of 5. According to a previous randomized control study, the largest numbers dissected lymph nodes in a single station were resected from stations 4R and 7, with a median of four and three nodes, all less than 5, respectively [23]. Moreover, the number of lymph nodes fragment, which could not be counted exactly, was recounted to 5. This may help minimize the interference of lymph node fragments on the count results of the number of dissected lymph nodes; (2) treating occurrence and death as competing events using the Fine-Gray subdistribution hazard regression.

Results

Patient characteristics

Of the 3,966 patients screened,458 (241 women) were included in the final study, with a mean age of 60.3 ± 8.1 years) (Fig. 1) (Table 1). Disease stage was IA in 238 (51.96%), IB in 94 (20.52%), IIB in 56 (12.23%), and IIIA in 70 (15.28%) patients. The most frequent histological subtype was acinar predominant, followed by solid predominant, papillary predominant, lepidic predominant and micropapillary predominant. The median number of dissected lymph nodes was 15 (range, 0–39; IQR, 11–19). Lymph node metastasis was reported in 126 (27.51%) patients. Within the median follow-up of 60.20 (IQR, 54.87–66.03) months, recurrence occurred in 118 (25.76%) patients, and 93 (20.31%) patients died.

Table 1 Clinicodemographic characteristics of the 458 patients in the study

Relationship between the number of dissected lymph nodes and RFS

The restricted cubic spline analysis showed an L-shape relationship between the number of dissected lymph nodes and RFS (Fig. 2). The relationship was non-linear (P for non-linearity = 0.033). P for the log-likelihood ratio test for a two-line model of RFS was 0.001, indicating the presence of an inflection point. In recursive algorithm analysis, the inflection point was 9.

Fig. 2
figure 2

Association between the number of lymph nodes examined and recurrence-free survival (RFS)

Adjusted for age, sex, age-adjusted Charlson Comorbidity Index, surgical year, tumor site, family history of malignancy, histopathological subtype, spread through air space, pT, pN and Postoperative adjuvant therapy

The following variables were included in the Cox regression as covariates: sex (male vs. female), age (≤ 65 vs. > 65 year), tumor site (RUL vs. RML/RLL/LUL/LLL), age-adjusted Charlson comorbidity index, year of surgery (2015 vs. 2016/2017), family history of malignant tumor (yes vs. no), histopathological subtype (micropapillary or solid predominant subtype vs. others), spread through air space (yes vs. no), pT stage (T1a vs. T1b/T1c/T2a), pN stage (N0 vs. N1/N2) and postoperative adjuvant therapy (yes vs. no). Within a range from 0 to 9 dissected lymph nodes, RFS increased linearly with increasing number of dissected lymph nodes (Table 2).

Table 2 Association between the number of dissected lymph nodes and recurrence-free survival according to piece-wise regression models

Association of the number of dissected lymph nodes with RFS and OS

RFS was significantly longer in patients with ≥ 9 vs. <9 dissected lymph nodes (log-rank test P = 0.022; Fig. 3A). Such a difference was observed in the subgroup analysis that only included 332 patients without lymph node metastasis (log-rank test P = 0.031; Fig. 3B) as well as the analysis that only included 126 patients with lymph node metastasis (log-rank test P = 0.022; Fig. 3C).

Fig. 3
figure 3

Subgroup analysis of recurrence-free survival (RFS) in (A) all patients, (B) patients without lymph node metastasis (N0) and (C) patients with lymph node metastasis (N1 or N2)

In the multivariate Cox regression, the following factors were independently associated with RFS: the number of dissected lymph nodes, aCCI, family history of malignancy, pN stage, and postoperative adjuvant therapy (Table 3). The association between ≥ 9 dissected lymph nodes and improved RFS was apparent in all sensitivity analyses (Table 4).

Table 3 Multivariate Cox proportional hazard regression to identify independent predictors of survival
Table 4 Sensitivity analyses to assess the robustness of the association between number of dissected lymph nodes and Recurrence-free survival

Discussion

Results from the current study suggest that 9 is the appropriate minimum number of lymph nodes that should be examined in patients with clinical stage IA pure solid lung adenocarcinoma. RFS increased in proportion to increasing number of dissected lymph nodes, but only up to 9. There was no further improvement beyond nodes. The association between improved RFS with ≥ 9 dissected lymph nodes was apparent in the subgroup analysis that not only included patients with no lymph node involvement, but also in patients with lymph node metastasis, adding support to the validity of the findings.

The issue of adequate lymph node sampling is universal for practically all cancers. For example, current guidelines recommend a minimum of 15 lymph nodes for gastric cancer [24], 8–20 lymph nodes for breast cancer depending on the T state [25], and 10–42 lymph nodes for esophageal cancer depending on the T state [26,27,28]. A prospective cohort study of lung cancer found that inadequate detection of intrapulmonary (N1) lymph nodes could lead to occult metastases of lymph nodes [29]. The consequence of erroneous labeling of N1 as N0 could lead to wrong decision with regards to adjuvant treatment [30, 31]. Our results showed that in patients without lymph node metastasis in postoperative pathology, the risk of recurrence decreased significantly in the group with a higher number of dissected lymph node. This may be because of the possibility that occult lymph node metastasis decreased as the number of dissected lymph node increased, and the true pathological status of the patient was more accurately assessed. Significantly, the aforementioned association remained consistent in the subgroup analysis of patients with positive lymph nodes in the current study. This finding may be attributed to the lower residual tumor burden in patient with higher number of dissected lymph nodes since more or even all metastatic lymph nodes have been resected. Patients with low number of dissected lymph nodes, in contrast, may have higher residual tumor burden after the surgery. Alternatively, detection of lymph node metastasis with a smaller number of dissected lymph node tends to suggest overall higher percentage/number of the lymph nodes with metastasis, which in turn is associated with poorer prognosis. A cohort study of 16,393 subjects with NSCLC by David and colleagues also supported the significance of higher number of lymph nodes sample in improved long-term prognosis in patients with stage I disease [32]. Unfortunately, no suggestions have been provided on the number of dissected lymph nodes in their study. A previous study of 442 patients with stage I NSCLC reported that a minimum of 6 lymph nodes needed to be examined to define lymph nodal staging and improve patient outcome [33]. The major reason for this discrepancy between this study and our findings is that their study enrolled some patients who underwent surgery before establishing rules for systematic lymph node sampling and about one-third of the enrolled patients admitted for surgery at community hospitals that lacked strict surgical quality control. The larger number in the current study (9) likely reflects the fact that the current study was conducted more recently in a more contemporary setting with improved quality control. A study based on SEER database plus a multi-center retrospective study in patients with NSCLC suggest that increasing the number of examined lymph node to 16 can result in superior overall survival [34]. Patients with stage IIIB or IV disease were excluded in their study and there was no consideration of treatment following tumor recurrence. Subsequent treatments, especially targeted therapy and immunotherapy, may have potential effects on overall survival [35, 36]. To minimize this influence, we chose recurrence-free survival as the primary outcome of our study. Different primary outcomes may be a reason for the different sampling thresholds for lymph nodes. Another reason may be the presence of T4 staging patients recruited by investigator, who may have developed more lymphatic metastasis and therefore may have contributed to the increase in numbers at the time of sampling. A study of patients with pure solid nodules with cT1a-2bN0-1M0 reported that approximately 16–17 lymph nodes were the optimal cut point for RFS [37]. This study also considered the imaging characteristics of patients when recruiting patients. However, the population enrolled in this study was NSCLC. Studies have clearly pointed to differences in the risk of recurrence between adenocarcinoma and other types of NSCLC [38]. In addition, different histologic subtypes of adenocarcinoma (especially micropapillary/solid subtypes) had a significant impact on survival outcomes [39, 40]. In patients with pure solid nodules, the high-risk subtype of adenocarcinoma has a higher chance of recurrence [41]. It is important to note the current study excluded patients with AIS and MIA, two subpopulations in which lymph node metastasis rarely occurs. Also, this study only included patients with adenocarcinoma, and the results may not apply to cases in which intraoperative pathology (wedge-resection or needle aspiration) is not performed prior to lobectomy in patients with preoperative histologic diagnosis.

The minimum number of dissected lymph nodes as derived from the current study (9) is similar to a previous study based on SEER database plus a Chinese multi-center retrospective study in patients with NSCLC [42]. The fact that we obtained the similar number despite adjusting for other variables (e.g., age-adjusted Charlson comorbidity index, year of surgery, histopathological subtype, spread through air space) adds to the robustness of the findings. Our results may be more reliable partly because we only recruited patients with cT1 (tumor diameter ≤ 3 cm), and the effect of T stage on outcomes was greatly reduced, and on the other hand, we adjusted for covariates more comprehensively. In addition, our study also incorporated the prognostic potential of imaging, which was not included in the above studies [17].

Findings from the current study also suggest that “more is better, but only to a certain extent”. Consistent with the conclusion of a large trial of the American College of Surgeons Oncology Group (ACOSOG) Z0030 [23], prognosis did not improve with higher number of dissected lymph nodes at a range beyond 9. Indeed, higher number of dissected lymph nodes has been associated with longer operation time and increased bleeding [43]. The reliability of our findings is supported by their robustness to subgroup and sensitivity analyses.

Our findings should be interpreted with caution in light of several limitations. First, the retrospective design increases the risk that our analysis is confounded by other factors for which we did not control. Second, it remains unclear whether our results can be generalized to patients whose histological subtype of adenocarcinoma cannot be determined before or during surgery. Third, this study was conducted at a single-center, and therefore must be considered as preliminary.

Conclusion

In patients receiving lobectomy of stage IA pure solid lung adenocarcinoma, longer RFS was associated with higher number of dissected lymph nodes up to and not beyond 9 dissected lymph nodes.