Introduction

Surgery is the only effective intervention for cure or long-term survival among patients with gastric adenocarcinoma (GC). In areas without screening (i.e., North America and Europe), patients often present late and have a high frequency of nodal involvement [1], with an estimated rate of lymph node (LN) involvement of 3–5 % for T1a; 11–25 % for T1b; 50 % for T2; and 83 % for T3 tumors [2, 3]. Nodal status is one of the most important independent predictors of patient survival [4, 5].

There is international variation in standards for nodal resection in GC. The Japanese Gastric Cancer Association (JGCA) has published detailed guidelines for the pathologic assessment and staging of GC, describing 16 nodal compartments classified as N1, N2, and N3 groups. While there is strong support in Asia and some European centers for the more extensive D2 LN dissection, which removes both the N1 and N2 nodal groupings, several randomized control trials have failed to demonstrate a survival benefit for this extended surgery, leading to less acceptance of this resection in North America [6, 7]. For example, in the United States, the Intergroup 0116 trial found that only 10 % of patients had the recommended D2 LN dissection; 36 % had a D1 LN dissection; and 54 % had an inadequate D0 LN dissection [8].

An alternative method to determine the extent of LN resection for a given patient is the “Maruyama Index of Unresected Disease” (MI). Researchers at the National Cancer Center Hospital in Tokyo have documented the disease spread in patients treated by D2 or greater lymphadenectomy in a searchable computerized database, referred to as the “Maruyama Program” [911]. This program calculates the probability of positive LNs in the 16 stations, determined by seven variables: age, sex, Borrmann type of tumor, greatest dimension of the tumor as measured on the luminal surface, location of the tumor, tumor depth, and histology. The MI is the sum of the probability of nodal involvement for the nodal stations that were not dissected. An MI of 0 can be achieved in all gastric resections [12] if surgical removal of all ‘at risk’ nodal stations is performed. A strong correlation between a lower MI and an increase in overall survival (OS) has been demonstrated in American, European, and Asian patients [8, 13, 14]. This has led researchers to assert that better local–regional therapy can favorably affect survival in GC, and that the MI may be a better indicator of adequacy of surgery than a particular lymphadenectomy [13, 14]. Additionally, these analyses show that findings from Asian studies have strong implications for patients in the West, despite the assertion that biologic differences explain the improved survival of GC in the East [2]. Regardless of which resection technique is selected, it appears that there is a complex relationship between the LN dissection performed, clinical staging, and ultimate patient outcome.

In 1997 the Union International Contre le Cancer (UICC)/American Joint Committee on Cancer (AJCC) recommended, in the 5th edition of their staging manual, that a minimum of 15 LNs be assessed per patient [15]. The recommendation was based on the nodal classification system of N1 = 1–6 positive LNs; N2 = 7–15 positive LNs; and N3 = more than 15 positive LNs. The cut-off points were derived from retrospective databases [16]. Subsequent examinations [17, 18] have shown superior predictive ability of LN staging based on number of nodes involved rather than the location of nodal involvement. [19]. Part of this superior predictive ability has been attributed to the category N3 (>15 LNs positive), a group with extremely poor prognosis. The 7th edition of the UICC/AJCC staging manual (2010) revised the nodal classification system such that N1 = 1–2 positive LNs; N2 = 3–6 positive LNs; N3a = 7–15 positive LNs; and N3b = 16 or more positive LNs [20]. As such, the UICC/AJCC now recommends that at least 16 LNs be assessed per patient [20]. The staging changes attempt to minimize the impact of surgical dissection on GC staging, and to improve the prognostic ability of N-staging compared to that in the 5th/6th editions [21]. Unfortunately, despite the changes to simplify staging, the number of LNs assessed in each GC case varies, and in many cases, the number reported per specimen is less than current recommendations [15, 22]. In the United States, a median of only 10 nodes per patient were assessed, and 9 % had no LN assessment at all. Only 29 % of patients had >15 LNs assessed [2325].

An LN ratio (LNR) has also been proposed and many international groups have examined the utility of an LNR for the prognosis of GC, finding this more predictive than simple analysis of the number of positive LNs [2628]. However, most studies have excluded patients with fewer than 15 LNs examined, and no uniform cut-off points have been identified to date. Although an LNR does mitigate the effects of stage migration somewhat, its utility in patients in whom fewer than 15 LNs were assessed has not been proven.

Studies have shown that stage migration occurs in patients with a lower number of LNs examined, creating inaccuracies in survival predictions [18, 2931]. In addition to the importance of nodal status as a prognostic factor in GC, it has been suggested that inadequate LN assessment directly affects patient survival for the worse [32, 33]. Therefore, the purpose of this review was to identify and synthesize findings from all articles evaluating both predictors of nodal harvest and long-term outcomes based on the number of LNs assessed.

Methods

Data sources

Electronic literature searches were conducted using Medline and Embase from January 1, 1998, to December 31, 2010 according to the search algorithm presented in electronic Appendix A. Search terms included: exp Stomach Cancer/or [((gastric or stomach) adj1 cancer$) or ((gastric or stomach) adj1 carcinoma) or ((gastric or stomach) adj1 adenocarcinoma) or ((gastric or stomach) adj1 neoplasm$)).mp.] and [number of lymph nodes or lymph node assessment or lymph node examination or total lymph node count] and [((negative or resection) adj2 margin$).mp. or exp frozen section/or exp GASTRECTOMY/or ((gastric or stomach) adj2 resect$).mp. [mp = title, abstract, subject headings, heading word, drug trade name, original title, device manufacturer, drug manufacturer name] or omentectom$.mp. or multivisceral resection$.mp.] and [clinical trial/or controlled clinical trial/or exp comparative study/or meta analysis/or multicenter study/or exp practice guideline/or randomized controlled trial/] not [Case Report/or review]. A separate search of the Cochrane Central Register of Controlled Trials (1985–2010) was performed using the search term “gastric cancer”. No attempt was made to locate unpublished material.

Study selection and review process

To be eligible, studies had to meet the following criteria: (1) investigated number of LNs examined in newly (not recurrent) diagnosed patients with histopathology-confirmed gastric adenocarcinoma; (2) reported long-term outcomes and clinicopathological factors based on the number of LNs assessed; (3) patients had a pathological examination of the surgical specimens and LNs dissected; (4) involved human patients with a minimum of 30 patients; and (5) published in peer reviewed journals in English. Studies were excluded according to the following criteria: (1) studies that did not provide short- or long-term trial outcomes; (2) involved animals and/or ex vivo samples; (3) involved patients with mixed cancer with no separate analysis of GC subjects; and (4) review articles, meta-analyses, abstracts, conference proceedings, editorials/letters, and case reports. All electronic search titles, selected abstracts, and full-text articles were independently reviewed by a minimum of two reviewers (N.C., R.S., R.C.). Disagreements on study inclusion/exclusion were resolved with a consensus meeting.

Data extraction and analysis

A systematic approach to data extraction was used to produce a descriptive summary of participants, interventions, and study findings. Data abstracted included the study type, number of patients, patient characteristics, resection margin status, number of LNs removed, number of positive LNs, and follow-up time. Data on associations with clinicopathological factors (resection type, tumor location, tumor size, TNM stage, and clinical stage) and the number of assessed LNs were extracted. Disease-free survival, recurrence rate, overall survival, and survival by subgroups (TNM and clinical stage) relating to the number of LNs assessed were extracted. The first reviewer (A.B.) independently extracted the data and a second reviewer (R.S.) checked the data extraction. No attempt was made to contact authors for additional information.

Results

Literature search

A total of 3,608 titles/abstracts were identified from the electronic searches and reference lists for preliminary review. After the removal of duplicates and screening for relevant titles and abstracts, a total of 52 articles were identified as pertaining to the examination of LN number and long-term outcomes, and these were submitted for a full-text review. A total of 25 retrospective articles [5, 18, 2325, 2931, 3450] involving 74,228 patients were included in this review (Fig. 1). The descriptive characteristics of each included study are presented in Table 1.

Fig. 1
figure 1

Article selection flow

Table 1 Characteristics of studies evaluating the number of lymph nodes assessed

Clinicopathological factors and the number of LNs assessed

Details on the various clinicopathological factors associated with the number of LNs assessed are presented in electronic Appendices B–D. Five studies [5, 24, 31, 35, 38] reported significant associations between increased surgical resection (extended dissection or total gastrectomy) and an increased number of LNs assessed (Table 2). Three studies found an association between tumor location (distal tumors in two studies, and middle or upper-third tumors in one study) and an increased number of assessed LNs [35, 38, 43], whereas four studies did not [5, 31, 44, 48] (Table 2). Increased tumor size was significantly associated with a greater number of assessed LNs in 3 studies [35, 38, 48], while one study reported no significant relationship [31]. Four studies found a significant relationship between advanced T stage and increased number of LNs assessed [24, 29, 35, 44]; six studies found a significant association between advanced N stage and increased number of assessed LNs [5, 23, 31, 43, 44, 48]; and two studies found a significant association between advanced clinical stage and increased number of LNs assessed [31, 44].

Table 2 Clinicopathological factors associated with the number of LNs assessed

Recurrence and disease-free survival and the number of lymph nodes assessed

Disease-free survival (DFS) and/or recurrence rate was reported in 3 studies [37, 40, 47] (Table 3). In two studies, DFS was significantly longer in patients with a greater number of assessed LNs [37, 47], regardless of whether the patient was node-negative or node-positive [47]. Local recurrence was significantly lower in patients with a higher number of LNs assessed [47]. The recurrence rate was also lower among patients with more LNs assessed according to T stage [40]. If recurrence was observed, median overall survival (OS) with recurrence was significantly longer in patients with at least 15 LNs assessed [37]. In patients with positive nodes, DFS was still significantly longer when more than 25 LNs were assessed [47].

Table 3 Disease Free Survival and Recurrence Rate associated with the number of LN assessed

Overall survival and the number of LNs assessed

OS was reported in 18 studies [5, 24, 25, 31, 34, 3640, 4247, 49, 50] (Tables 4, 5, 6). Improved OS was significantly associated with an increased number of LNs assessed in just over half of the studies (10 of 18) [24, 25, 34, 36, 37, 39, 40, 42, 46, 47]. Studies investigating OS in only N0 (Table 3) or N+ (Table 6) cohorts are shown in separate Tables, as stage migration is a factor in these cohorts. Four studies [31, 39, 40, 50] reported OS by T stage for number of LNs assessed (Table 7). One study found a significant relationship with improved OS for all T stages with an increased number of assessed LNs [40], while other studies only found significant associations for T2 patients [31], or T4 patients [39]. One study found no significant association between the number of LNs assessed and survival rates according to T stage [50].

Table 4 Overall survival rates associated with the number LN assessed for mixed N-stage cohorts
Table 5 Overall survival rates associated with the number LN assessed for N0 cohorts
Table 6 Overall survival rates associated with the number LN assessed for N+ cohorts
Table 7 T-stage survival rates associated with the number LN assessed

Eleven studies [18, 24, 30, 31, 38, 39, 43, 45, 47, 48, 50] reported survival rates by N stage (Table 8), and found a significant relationship with improved survival among N0 patients [24, 30, 31, 38, 45, 50], N1 patients [18, 31, 39, 45, 50], and N2 patients [18, 31, 39, 45, 48, 50], with an increased number of LNs assessed. Only one study found this relationship to be significant among N3 patients [39]. Scartozzi et al. [47] found this relationship significant for all node-positive patients, whereas Liu et al. [43] did not find any significant associations between number of LNs assessed and survival according to N stage.

Table 8 N-stage survival rates associated with the number LN assessed

Among the three studies [5, 23, 25] investigating OS according to combined T and N stage (Table 9), two found a significant relationship between improved OS and increased number of LNs assessed irrespective of T and N stage [23, 25], whereas one study found this association significant only at specific stages (T2N1, T3N0, T2N2, and T4N0) [5]. OS by clinical stage was reported in 10 studies [5, 18, 24, 29, 30, 35, 39, 41, 44, 49] (electronic Appendix E). Two studies found a significant relationship with improved OS among patients with stage I GC and increased number of LNs assessed [24, 30]; four studies found this association significant among stage II patients [5, 24, 39, 49]; seven studies found this relationship significant among stage III patients [5, 18, 24, 29, 30, 39, 44]; and three studies found this association significant among stage IV patients [18, 24, 39]. An increased relative risk of death was reported among patients with fewer LNs assessed in a combined analysis of stage I and II patients [35]. Although one study did not report P values, a visible trend was found for improved OS among patients with increased numbers of assessed LNs among all 4 stages [41].

Table 9 Combined TN stage survival rates associated with the number of LNs assessed

Discussion

Nodal status is one of the most important prognostic factors in GC [5]. To ensure that patients are accurately staged and provided the optimal treatment, it is important to assess an adequate number of LNs. Currently the UICC/AJCC recommends that at least 16 LNs be assessed per patient [22]. However, the number of LNs assessed in each GC case varies; in the United States, the median number assessed is 10 [24, 51], while in Asia, it is generally three to four times higher [29, 39, 40, 48]. This significant variation may affect both staging accuracy and patient survival. A significant problem with the current staging system, as well as with the 5th and 6th UICC/AJCC editions, is stage migration [15, 21, 22]. If an inadequate number of LNs are assessed, a patient may be inappropriately considered “node-negative” or designated a lower N stage and therefore classified as a lower overall stage; however, these patients have a worse survival than patients who were classified appropriately through a thorough LN assessment. Additionally, a classification of N3 (using the 5th or 6th UICC/AJCC staging editions [15, 22]) could not be given unless 16 LNs were positive. In the current 7th UICC/AJCC staging edition [20], the minimum recommendation has now been increased to 16, which addresses the above issue and also allows for the classification of the new N3b category (16 or more positive LNs) [20]. However, the likelihood of a patient being staged as N3b, if only the minimum number of 16 LNs are assessed, is obviously lower than that if more than 16 LNs are assessed. Studies of surgical dissections have shown that a mean of 26 LNs (range 8–55) were removed with a D1 LN dissection, whereas 37.4 LNs (15–72) were included in a D2 LN dissection [52]. Roukos et al. [52] also reported that the number of nodes found at each station had high variation in nodal yields, and many stations contained no LNs despite adequate resection and thorough pathological examination. Thus, it is difficult to define an ideal number of LNs in a surgical specimen, and the type of LN dissection may significantly affect the number of LNs assessed.

Stage migration confounds comparisons of survival across centers and countries in the evaluation of surgical techniques, adjuvant treatments, and the conduct of trials. However, over and above stage migration, authors suggest there may be an actual survival benefit from an extensive lymphadenectomy [5, 44, 47, 49]. Thus, we performed a systematic review to examine clinicopathological factors associated with the assessment of higher numbers of LNs, as well as the outcomes of these assessments based on the number of LNs assessed. While other overviews have been published, to our knowledge, this is the first systematic review to address this issue.

Clinicopathological factors associated with number of LNs assessed

The results of the present systematic review show that patients with a larger tumor size, more advanced disease, or those who had a more extensive lymphadenectomy were likely to have more LNs assessed (Table 1, Appendix C–E). This can either be attributed to the surgeon performing a more extensive primary resection which would include more LNs, or a more thorough examination of the specimen by the pathology team for more advanced cancers. Additionally, with more advanced cancers, LNs may be grossly positive, and thus more easily identified during specimen processing than microscopically involved or uninvolved LNs. For example, as only 43 % of pathologists in a recent survey reported using fat-clearing solutions, such as acetic acid, LNs that are not grossly involved and thus smaller may be missed [53]. Although some studies found an association with tumor location and the number of LNs assessed, this relationship was unclear, as two articles reported that patients with tumors in the distal stomach were likely to have more LNs assessed [38, 43], while another article reported that tumors in the middle or upper-third of the stomach allowed for a greater number of LNs to be assessed [35]. Heterogeneity in surgical techniques among these studies may confound these results. Furthermore, these findings contradict studies showing more LNs are assessed in patients having a more extensive surgical resection (total vs subtotal gastrectomy) [5, 24, 31, 35, 38].

Recurrence and disease-free survival and the number of lymph nodes assessed

The three articles reporting DFS in the context of LN assessment [37, 40, 47] showed longer times to recurrence when more LNs were assessed. Scartozzi et al. [47] reported a remarkable decrease in local recurrence, from 23 % in patients with ≤25 LNs assessed compared with 4.7 % in patients with >25 LNs assessed. It has been hypothesized that the removal of additional LNs results in better locoregional control. For example, in the 15-year analysis of the Dutch D1/D2 randomized controlled trial (RCT), local recurrence was 22 % in the D1 group, compared with 12 % in the D2 group; while regional recurrence was 19 % in the D1 group, and 13 % in the D2 group (P = 0.015) [54]. However, because Huang et al. [40] explored node- negative patients only, and Scartozzi et al. [47] explored T2–3 N1–3 and T3N0 patients only, it is likely that the main effect they measured was that of stage migration.

Stage migration

Studies in which survival is stratified based upon the determination of N stage will suffer from stage migration as more LNs are assessed (Tables 7, 8, 9). Additionally, if a study has limited the cohort under examination to node-negative or node-positive patients only, stage migration will occur [25, 36, 37, 39, 40]. With the exception of the article by Liu et al. [43], which examined T2b cancers only, all of the studies which examined survival by N stage found a significant difference when more LNs were assessed, likely the effect of stage migration [18, 24, 30, 31, 38, 39, 45, 47, 48, 50].

The effect of stage migration is most striking when fewer than 10 LNs are assessed [23, 35], but continues with even greater numbers of LNs examined [23, 55]. For example, 45 % of nodes were found to be positive when only 10 or fewer were examined compared with 17 % positive if more than 40 LNs were examined [55]. Bouvier et al. [55] estimate that the risk of misclassification is 47.1 % when fewer than 10 LNs are examined, while Bando et al. report that 45 % of patients with LN involvement would have been understaged if a D1 LN dissection had been performed [56].

A linear regression analysis by Smith et al. [23] showed a statistically significant observation in which one positive node was found for each additional five nodes that were examined. Furthermore, using a model-predicted 5-year survival with only one LN examined as a baseline, Smith et al. [23] found that for every 10 extra LNs dissected, the calculated overall survival improved by 7.6 % for T1/2N0 patients, 5.7 % for T1/2N1 patients, 11 % for T3N0 patients, and 7 % for T3N1 patients. Schwarz et al. [25] performed a similar analysis using a model-predicted 3-year survival with a baseline of 0 LNs examined, and found that for every 10 extra LNs assessed, the calculated overall survival (OS) increased by 5.7 % for T2b–3N2 patients, 4.6 % for T2b–3N3 patients, and 5.9 % for the entire cohort, with differences found for up to 40 LNs assessed. However, as the analysis was performed based upon the N stage of the patients, stage migration may explain much of these findings.

It is not surprising that many studies found no statistically significant survival effects for N3 cancers (Table 7), as dissection of more LNs will not put the patient into a higher staging category, and, in order to achieve an N3 stage (by the 5th/6th edition of the UICC/AJCC staging manual), a minimum of 16 LNs were assessed. Some authors have postulated that improved survival for N3 patients should not occur through stage migration, as there is no worse stage from which the patients can migrate. Huang et al. [39] found an apparent survival benefit from more LNs dissected and assessed, and attributed this to the removal of additional cancer burden. However, a patient found to be N3b by virtue of having 16 positive nodes of 16 LNs assessed may have many more positive nodes and thus poorer prognosis than a patient with 16 positive nodes out of 50 nodes assessed. Therefore, the stage migration effect is possible even within this group of poor prognosis patients.

Survival

Many studies show a difference in OS based upon the number of LNs examined. In univariate analysis, six of eleven cohorts found no survival difference according to the number of LNs assessed [5, 31, 4345, 50]; while others showed that there was a significantly improved survival when >10 [34], >15 [49], or >25 LNs [47] were assessed (Table 3). Volpe et al. [47, 49]and Scartozzi et al. [47, 49] attributed the survival benefit to the close association of more LNs assessed with a D2 LN dissection [47, 49]. However, these studies also found that the survival benefit was highly stage-specific (stage 2 [5, 49]; stage 3 [44]); thus, the effect of stage migration cannot be ignored. If there is a true survival effect based upon the removal of additional LNs, this could be explained by the additional removal of disease. It is therefore reasonable that no significant survival difference was found in the T1 cancers, as these cancers have a significantly lower chance of LN involvement than higher T-stage cancers [31, 50].

In order to account for the effect of stage migration and to determine whether the assessment of more LNs yields a survival benefit, many authors have utilized multivariate survival modeling, incorporating both N stage and number of LNs assessed. In most of these models, survival is associated with both variables (Table 3). An increased number of LNs assessed was found to have a hazard ratio (HR) of death of 0.635 (95 % confidence interval [CI] 0.597–0.676) [24], 0.81 (95 % CI 0.76–0.87) [42], 0.76 (95 % CI 0.61–0.95) [46], and 0.59 (95 % CI 0.39–0.89) [47], and an odds ratio (OR) of survival of 3.7 (95 % CI 1.5–8.8) [34]. However, Giuliani et al. [38, 43] and Liu et al. [38, 43] found no difference in survival outcomes by number of LNs assessed. These conflicting findings fuel the debate of whether there is a true, independent positive effect of greater number of LNs removed versus the possibility that a confounding factor exists, such as extended LN dissection, that is associated with both the increased number of LNs removed and improved survival outcomes. An additional challenge for these analyses is making the assumption that the variables N stage and number of LNs assessed are fully independent, while in reality there may be significant interactions between these two variables, which may not be fully accounted for. Of note, none of the multivariable models to date reported testing for an interaction between the number of LNs assessed and N stage. While these studies do show a statistically significant association between survival and more LNs assessed, they cannot confirm causation.

The appropriate number of LNs to assess

While many studies found a significant difference in survival when >15 LNs were assessed, most only analyzed the data utilizing the cut-off point of 15 LNs based upon the 5th/6th edition of the UICC/AJCC staging manual. Many groups have suggested that the number of LNs assessed should vary according to stage and should be fewer than 15 LNs for early gastric cancer (EGC), and greater than 15 for more advanced cancers [30, 34, 50]. Lee et al. [29] found no significant stage migration for the stage Ia patients in their cohort based upon the number of LNs assessed. In comparison, in their analysis of EGC, Borie et al. [34] found that patients with <10 LNs assessed had significantly lower 5- and 7-year OS rates compared with patients with ≥10 LNs assessed (Table 3). Siewert et al. [5] and Volpe et al. [49] found stage migration to be most prominent in stage II patients, while Karpeh et al. [18] and Liu et al. [44] found migration in stage III patients (Table 9), suggesting that a higher cut-off point of LNs assessed should be used for accurate staging in more advanced disease in an effort to avoid stage migration.

Smith et al. [23] and Schwarz et al. [25] have stated that the AJCC/UICC minimum goal of 15 LNs appears insufficient in the context of advanced-stage cancers [23, 25]. They recommend that the minimum goal should be set to at least ten LNs above the number of positive nodes, which would require at least 25 LNs for patients with advanced nodal staging. Although other groups have suggested numbers as high as 40 LNs assessed, a goal of 25 seems reasonable, even in the context of surgery performed in North America. Paradoxically, in Western countries it may be necessary to set the recommended number of LNs for dissection and assessment higher than 15 due to the advanced presentation of disease.

Limitations

There are many limitations to the data included in this review. All of these studies were retrospective; thus, there was no standard surgical or pathology protocol. There may be inherent differences in the patients receiving an extensive versus a limited LN dissection that are also associated with survival (i.e., co-morbid status, palliative resections, positive margins, institutional factors), and therefore the potential for unreported confounders continues to exist. The evaluation of the specimen for LNs, the histological evaluation of the node and methods to detect disease are also unaccounted for and introduce further potential bias to the studies being evaluated. Finally, a bias for publication of positive results may exist, decreasing the number of statistically non-significant results reported in the literature.

Conclusion

The extent of nodal involvement continues to be one of the most important prognostic factors for GC survival. Thus, an adequate number of LNs must be assessed to ensure patients are accurately staged and optimal treatment is prescribed. Population-based studies have indicated that the number of LNs assessed in each GC case varies, and in many cases is sub-optimal. From our systematic literature review of the data, we found that increased resection, tumor location, tumor size, and TNM staging information were all associated with a greater number of LNs assessed. With respect to long-term outcomes, patients with an increased number of assessed LNs have a better prognosis; however, much of this appears to be the effect of stage migration. Although current guidelines support a minimum number of 16 LNs to be assessed for patients with GC, it is apparent that extensive dissections, nodal harvest, and pathological identification of more LNs limit the effects of stage migration. Therefore, a goal of more than 16 LNs appears to be an appropriate target for surgery and pathological analysis.