Background

Gastric cancer (GC) is the fourth most commonly diagnosed cancer on the upper digestive system and remains the second most common cause of cancer-related deaths worldwide [1]. Cancer staging is the cornerstone of successful tumor therapy after surgery. Today, the American Joint Committee on Cancer (AJCC) tumor-node-metastasis (TNM) classification is the most widely used staging system for GC [2,3,4]. Lymph node staging system, also defined as N category, is an essential factor for predicting prognosis and selecting the appropriate treatment [5]. In the AJCC staging system, only the number of positive regional nodes is taken into account and the number of the total examined regional nodes does not get enough attention [6, 7]. However, GC patients with same number of regional positive nodes accompanying with more total regional nodes examined tend to have a higher level of radical gastrectomy and lower risk with the residual tumor [8, 9]. New attempt and exploring an evaluating criterion of the lymph node staging system is needed.

Several studies have demonstrated that GC patients with more examined regional nodes were more likely to have better survival because they tended to have a higher level of radical gastrectomy [10,11,12]. With the development of surgical techniques, obtaining and examining an increasing number of total regional nodes is no longer difficult for an experienced surgeon. Lymph node density (LND), also called lymph node ratio (LNR), is calculated by the ratio of the number of positive nodes to the total examined nodes [13, 14]. This parameter has been proven as a potential prognostic factor in several cancers, including GC [15, 16]. From the 7th edition (the year 2010) of the AJCC staging system, resection of at least 16 regional lymph nodes was introduced into the guideline of GC surgery [17]. The number of lymph node dissection before 2010 was small, usually less than 15, and previous studies of LND in GC were limited because they enrolled the patients both before and after 2010 [18]. The prognostic value of LND needs to be reassessed by only using data after 2010, which is same as the current lymph node dissection standard.

In the present study, we determined the impact of LND on cancer-specific survival (CSS) and overall survival (OS) in GC patients based on data after 2010 in the Surveillance, Epidemiology, and End Results (SEER) database. We hypothesize that LND is an independent prognostic factor in GC. The interaction between LND and current lymph node staging system was detected to provide evidence for the development of a novel accurate lymph node staging strategy.

Methods

Patients and data

This study had a retrospective design and included data from patients with histologically confirmed GC from the SEER database. Data were recorded according to the third edition of the International Classification of Diseases for Oncology. Enrollment occurred between 2010 and 2015, during which period the 7th AJCC staging strategies were applied and the follow-up and treatment information was complete. Patients were included if they were older than 18 years and had undergone gastrectomy. Patients with neoadjuvant radiotherapy were removed from the analysis, and only those with a histological type of adenocarcinoma were enrolled. Patients with incomplete data regarding tumor location or staging information were also excluded from the analysis. The primary variable in this study was lymph node data and patients lacking information regarding the exact number of nodes examined or positive nodes were not included in the analysis. In the 7th edition (2010) of the AJCC staging system for gastric cancer, the cutoff levels of N category were as follows: N0 = 0 node; N1 = 1–2 nodes; N2 = 3–6 nodes; N3 = more than 6 nodes. The patient screening process is shown in Fig. 1. Finally, a total number of 4281 patients with GC were included for analysis in the present study.

Fig. 1
figure 1

Flow chart illustrating the screening process in this study

The following available information on pathological and clinical appearance was collected from data-mining: age at diagnosis, sex, year of diagnosis, race, marital status at diagnosis, insurance status, primary site, 7th AJCC stage group information, grade, chemotherapy, lymph node data, cause-specific death classification, vital status recode, and survival months. Overall survival (OS) was determined with the vital status recode and survival months, while cancer-specific survival (CSS) was calculated with the cause-specific death classification and survival months. Informed consent was not obtained for the present study because of the open-source nature of the SEER database. This study was conducted in accordance with the Declaration of Helsinki.

Statistical analyses

LND was calculated by the ratio between the number of positive nodes and the number of examined nodes. Means were calculated for continuous variables and proportions for categorical variables. To identify the ideal cutoff point of LND in predicting survival, the X-Tile software (3.6.1 version, Yale University) was used [19]. The chi-square test (categorical variables), one-way analysis of variance (normal distribution), or the Mann-Whitney U test (skewed distribution) was used to test for differences among different LND groups. The relationship between specific variables and survival were analyzed by the Kaplan-Meier log-rank test and Cox proportional hazards regression. Univariate analyses of individual pathological and clinical features and CSS or OS were performed. To identify any independent association between LND and survival, we constructed three models: non-adjusted model, where no covariates were adjusted; adjusted I model, where only sociodemographic data of age, sex, and race were adjusted and adjusted II model, where covariates in the adjusted I model and other elected covariates were adjusted. If the matched odds ratio changed at least 10% as the result of adding covariates, the variables were considered to be added in the adjusted II model, as described in a previous study [20]. To estimate the robustness of LND in predicting survival, sensitivity analyses were stratified according to different AJCC N stages. Tests for effect modification for those of subgroup indicators were followed by the likelihood ratio test. All the analyses were performed with the statistical software package R (http://www.R-project.org, The R Foundation) and EmpowerStats (http://www.empowerstats.com, X&Y Solutions, Inc., Boston, MA). P values less than 0.05 (two-sided) were considered statistically significant.

Results

Demographic characteristics and relationship between LND and clinical features

After screening by inclusion and exclusion criteria, a total of 4281 participants diagnosed with GC were selected from the SEER database for the final data analysis. Table 1 provides an overview of patient characteristics and clinicopathologic features. The average age at diagnosis was 69.73 years for the whole population enrolled, and almost two-thirds of the participants (63.78%) were male. The number of enrolled patients was evenly distributed throughout the enrollment period from 2010 to 2015. Other baseline demographic data and information on pathologic staging and grading are shown in Table 1. The X-Tile software was used to identify the ideal cutoff point for LND in predicting survival. As shown in Supplemental Figure S1 and S2, cutoff points of 0.1 and 0.4 yielded the maximum chi-square of log-rank tests for CSS and OS, and LND was divided into three levels: LND1, LND2, and LND3. The survival difference was significant in the three groups both for CSS and OS when divided by the best cutoff points. The relationship between different groups of LND and clinical features is shown in Table 1. Insured patients tended to have a lower level of LND than that the uninsured or unknown categories, indicating that the insured population may have easier access to physical examinations and thus receive an earlier diagnosis. Tumors located on the upper stomach tended to have a lower level of LND and tumors with overlapping lesions tended to have a higher level of LND. A high rate of LND in advanced AJCC staging and grading is clearly shown in Table 1. This result is somewhat counterintuitive and requires further investigation since patients in the middle level of LND were more likely to receive chemotherapy.

Table 1 Patient characteristics and correction between LND and clinicopathologic variables

Association between clinical variables and CSS and OS

Table 2 shows the prognostic factors affecting CSS and OS in GC patients based on univariate analysis. Older age was significantly associated with reduced CSS (HR = 1.01, 95% CI 1.01–1.01, P < 0.001) and OS (HR = 1.03, 95% CI 1.02–1.03, P < 0.001). Null associations of sex were observed with CSS and OS. The mortality of GC patients was reduced with an increase in the year of diagnosis, although this result was just short of being statistically significant. The effect of race was also assessed, with Asian or Pacific islander populations showing a lower risk of mortality than white patients in CSS (HR = 0.79, 95% CI 0.68–0.92, P = 0.002) and OS (HR = 0.71, 95% CI 0.63–0.80, P < 0.001). Marital status at diagnosis was associated with GC survival and being widowed and single was associated with a dismal prognosis in CSS and OS analysis. Uninsured patients or those with unknown insurance status were at high risk of death in CSS (HR = 1.21, 95% CI 1.08–1.35, P = 0.001) and OS (HR = 1.13, 95% CI 1.03–1.23, P = 0.007). Compared to upper GC, primary tumors located at overlapping lesions significantly increased the risk of mortality both in CSS analysis (HR = 1.48, 95% CI 1.21–1.82, P = 0.001) and OS (HR = 1.36, 95% CI 1.16–1.60, P = 0.001). There was a significant positive correlation between advanced AJCC staging and survival both in CSS and OS (all P values < 0.001), which was in accordance with our clinical experience. In addition, patients with poorly or undifferentiated GC had an approximately doubled mortality risk compared to those with well-differentiated tumors. Patients receiving chemotherapy had a 1.33-fold increase in mortality risk in CSS analysis than those without chemotherapy or unknown chemotherapy status (HR = 1.33, 95% CI 1.19–1.50, P < 0.001), but the association was not significant in OS analysis. The primary variable in this study was LND and its survival predicting effects were fully investigated. Patients in LND2 or LND3 had 3.98 (95% CI 3.45–4.60, P < 0.001), 8.57 (95% CI 7.48–9.82, P < 0.001) folds increase in mortality risk in CSS, respectively, than those in LND1. Differences in mortality rate of OS were also shown for patients in the LND2 or LND3 which had 2.88 (95% CI 2.59–3.20, P < 0.001) and 5.54 (95% CI 4.99–6.15, P < 0.001) fold increase in mortality risk compared to those with the lower level. Kaplan–Meier survival curves for those patients according to LND status are presented in Fig. 2. Significant differences were observed among LND subgroups in predicting CSS (Fig. 2A, P < 0.001) and OS (Fig. 2B, P < 0.001).

Table 2 Association between clinical variables and gastric cancer-specific survival or overall survival
Fig. 2
figure 2

Kaplan–Meier survival curves of cancer-specific survival (A) and overall survival (B) for gastric cancer patients according to LND status

Independent prognostic effect of LND and stratified effect analysis in current lymph node staging systems

To further elucidate the prognostic impact of LND on GC patients who underwent gastrectomy, Cox multivariate analysis was performed (Table 3). We constructed three models to analyze the independent effects of LND on survival including a crude (non-adjusted) model, minimally adjusted (adjusted I) model and fully adjusted (adjusted II) model. As previously shown in the results of multivariate analysis, CSS and OS of GC patients decreased with an increase in LND in different models. In the fully adjusted model, compared to the lower LND group, patients with a middle and higher level of LND had 2.43 (HR = 2.43, 95% CI 2.09–2.84, P < 0.001) and 4.69 (HR = 4.69, 95% CI 4.02–5.48, P < 0.001) fold increase in mortality in CSS, respectively. Similar results were found in the evaluation of OS which showed that patients with a middle and higher level of LND had 2.04 (HR = 2.04, 95% CI 1.81–2.29, P < 0.001) and 3.61 (HR = 3.61, 95% CI 3.20–4.07, P < 0.001) fold increase in mortality, respectively. Taken together, these data suggest that LND had independent prognostic effects in patients with GC. To further elucidate whether LND had independent prognostic effects in the current nodal category strategies, subgroup analysis was performed. Each positive N category (N1-3) was stratified into different LND subgroups. In the N1 and N2 stages, patients with GC were divided into three groups according to the cutoff points of LND. As shown in Table 3, CSS and OS of GC patients significantly decreased with the increase in LND grade in different models (all P values < 0.05). For patients in the N3 stage, there were only two patients in LND1 group (Table 1) and thus for patients in the N3 category the cutoff point was 0.4. Results repeatedly showed that a high level of LND was associated with worse survival in CSS and OS in different models (all P values < 0.05). In addition, survival curves were plotted using the Kaplan-Meier method to determine the independent prognostic effect of LND in subgroups of N categories. As presented in Fig. 3, patients in each positive N category were found to contain subgroups divided by LND, with significantly heterogeneous CSS and OS (all P values < 0.001).

Table 3 Multiple regression analysis to assess the independent effect in CSS and OS and stratified effect analysis in different N stages
Fig. 3
figure 3

Stratified effect analysis of LND in positive N stages. Kaplan–Meier survival curves of cancer-specific survival in N1 (A), N2 (C), and N3 (E) subgroups. Kaplan–Meier survival curves of overall survival in N1 (B), N2 (D), and N3 (F) subgroups

Discussion

Accurate staging is the cornerstone of optimal cancer care and therapy. The current staging system of N categories, the AJCC TNM system, for GC patients is based on the number of lymph nodes with metastases and the number of examined lymph nodes is not taken into account. The prognostic role of LND, calculated by the ratio of the number of positive nodes to the number of examined nodes, in GC staging was analyzed in the present study. Optimal cutoff values of LND were calculated as 0.1 and 0.4, and LND was divided into three levels: LND1 (< 0.1), LND2 (> = 0.1, < 0.4), and LND3 (> = 0.4). Significant differences in survival were observed among LND subgroups and in predicting CSS and OS by univariate analysis. We constructed three models to analyze the independent effects of LND on survival and results showed that LND had independent prognostic effects in patients with GC. To evaluate the performance of LND in the staging system, each N category was stratified by different levels of LND. Each N category was found to contain subgroups of GC patients with significantly heterogeneous CSS and OS.

Currently, the AJCC TNM system is the most commonly used staging system for GC. From the 7th edition AJCC staging manual, the N category was recommended to include resection of at least 16 regional lymph nodes [17, 21]. A considerable amount of literature has been published on the topic of LND or LNR from 2010, when the 7th edition AJCC staging manual firstly came into efforts [22, 23]. Many studies have demonstrated that regional positive nodes did not or insufficiently reflect the survival of patients with GC [14, 24, 25]. For patients who were detected with the same number of regional positive nodes, those with more regional nodes examined were more likely to have better survival because those patients tended to be with a higher level of radical gastrectomy and have lower risk with the residual tumor. Validation analysis from Taiwan University Hospital Cancer Registry also found the advantages of a staging system using a lymph node ratio using a patient cohort from Eastern medical centers [26]. One study from China showed that the lymph node ratio staging system is a reliable classification for GC patients after neoadjuvant radiotherapy [27].

In the present study, we demonstrated that LND was an independent predictor of patients’ survival and to have stratified effects in different N subgroups. Although these findings have already been reported in previous studies [18, 22, 25, 26, 28], and some of these also used data from the SEER database [18, 25], our study has distinct characteristics that make it differ from previous studies. The time of enrolment in the present study was from 2010 to 2015, during which period the 7th AJCC staging strategies were applied and the follow-up and treatment information was complete. Most previous studies enrolled GC patients across a long-time span, and patients before 2010 were also included for analysis. Another consideration is that, before the application of the 7th AJCC staging system, the number of examined lymph nodes may have been lower. A recent study by Yang et al. included GC patients from the SEER database between 1988 and 2015 and 61.31% of patients had less than 15 examined lymph nodes [18, 29]. LND or lymph node ratio is determined by the number of positive regional nodes and the number of examined regional nodes. With the development of surgical techniques, an increasing number of regional nodes can be resected and examined. Data before 2010 when the lower number of examined regional nodes was common may inevitably affect the lymph ratio and result in unstable evidence. In the present study, the median regional nodes examined were 17.96, which was higher than the 16 in the 7th edition AJCC staging system, thus providing a relatively higher level of evidence. Another distinct characteristic of our study is that we only enrolled patients with gastric adenocarcinoma according to histologic subtype codes of the SEER database. The significance of lymph node metastasis is different in different histologic types [30, 31]. Gastric adenocarcinoma comprises nearly 90% of the total number of gastric malignancies and thus focusing on adenocarcinoma helps to better understand the characteristics of this histologic subtype and improve diagnosis and treatment [32].

Although the present study is a relatively large population-based study, it inevitably had several limitations. First, the SEER database is a public open-access cancer registry data and does not include certain information regarding neoadjuvant chemotherapy and surgical complications which may influence the survival of GC patients. Second, the retrospective nature of the SEER database does not allow to extrapolate the results. Third, only 4281 patients were analyzed from a total of 17,988 gastric adenocarcinomas, due to incomplete data. Therefore, the conclusion of this paper is limited to gastric adenocarcinoma. Fourth, in this study, we used data from SEER database only, and lacked independent datasets from our center or other centers. Fifth, because the SEER database lacks information such as the extent of lymph node dissection (D0-2) and the curability (R0-2) of participants, these two factors were not fully considered in this study. Finally, the 8th edition of the AJCC TNM system has been recommended in clinical practice since 2017, but data from the SEER database from after 2017 cannot be used because of the limitation of lack of follow-up information.

In conclusion, by analyzing the SEER database from 2010, when the 7th edition of the AJCC TNM system came into efforts, the present study demonstrates that LND is an indicator of CSS an OS in GC. We calculated two optimal cutoff values of LND as 0.1 and 0.4. Significant differences in survival were observed among LND subgroups in predicting CSS and OS. Patients in each positive N category were found to contain subgroups divided by LND with significantly heterogeneous CSS and OS. Development of a new LND staging strategy may help improve patient care and to better informed and precise treatment decisions. However, due to the retrospective nature of data from the SEER database, further prospective cohort studies are needed.

Conclusion

Through analysis of a population-based SEER database, we found that LND is an independent prognostic factor for GC without distal metastasis. In the current lymph node staging system, LND has potential value in further accurately classifying the GC patients without distal metastasis.