Gastric cancer is the fourth most common malignant tumor in the world, with an estimated 1 million new cases every year.1 More new cases of gastric cancer are diagnosed in China each year than in any other country.2 Although current practice includes incorporating chemotherapy or radiation into the treatment protocol, surgical resection may remain the only chance for curing this disease.3 Complete surgical resection with radical lymphadenectomy also provides information for accurate staging, which is very important in prognosis prediction and further therapy decisions.

The 6th edition American Joint Committee on Cancer (AJCC)/International Union against Cancer (UICC) tumor–node–metastasis (TNM) staging system was published in 2002.4 In recent years, many modifications to the 6th edition UICC TNM system have been proposed. This year, the UICC published the 7th edition TNM classification of malignant tumors for gastric cancer.5 In this edition, the T classification was changed to contain five subdivisions, namely T1, T2, T3, T4a, and T4b. Furthermore, subdivision of the N classification based on the number of metastatic lymph nodes was also changed. Finally, stage IV classification was redefined based on the presence of distant metastases, and the T4N+/TanyN3M0 classification, which was defined as stage IV in the 6th edition, was eliminated.

In this study, we aim to evaluate the efficacy and predictive ability of the 7th edition UICC TNM staging system and to compare the 6th and 7th editions of the UICC TNM staging system in a cohort of patients who underwent primary surgical resection for gastric cancer.

Patients and Methods

Patients

From January 1994 to December 2006, clinicopathological data from 1,503 cases of gastric cancer patients who underwent surgical resection at Sun Yat-Sen University Cancer Center were analyzed retrospectively. Eligibility criteria were: (1) gastric adenocarcinoma identified by histopathological examination, (2) application of gastrectomy plus lymphadenectomy (limited or extended), (3) complete follow-up data, (4) no history of familial malignancy or other synchronous malignancy [such as gastrointestinal stromal tumor (GIST), esophageal cancer, colorectal cancer, etc.], (5) no recurrent gastric cancer and remnant gastric cancer, and (6) no death in the perioperative period. The principles of tumor resection and D2 lymphadenectomy by experienced surgeons were similar in all radically resected patients.

Follow-Up

Postoperative follow-up included clinical and laboratory examinations every 3 months for the first 2 years at our outpatient department, every 6 months from the third to fifth years, and annually thereafter until at least 5 years after the operation or until the patient died, whichever came first. Overall patient survival, defined as time from operation to death or last follow-up, was used as a measure of prognosis.

Statistics

All patients were restaged using the 6th and 7th editions of the UICC TNM staging systems. Median overall survival (OS) was determined. Survival curves were plotted using the Kaplan–Meier method, and the log-rank test was used to determine significance. Factors that were deemed of potential importance on univariate analysis were included in multivariate analyses. Multivariate analysis of OS was performed by means of the Cox proportional-hazards model using the forward logistic regression (LR) stepwise procedure for variable selection. Also, since we thought that we needed to directly compare the 7th UICC TNM staging system with the 6th edition system, the likelihood ratio χ2 test related to the Cox regression model was used to measure homogeneity. The discriminatory ability and monotonicity of gradient assessments were measured with the linear trend χ2 test. To assess potential bias in comparing prognostic systems with different numbers of stages, the Akaike information criterion (AIC) within the Cox proportional-hazard regression model was used. The AIC statistic was defined by AIC = −2 log maximum likelihood + 2 × number of parameters in the model. A smaller AIC value indicated a better model for predicting outcome.6,7 Hazard ratios (HR) and 95% confidence intervals (95% CI) were generated. All calculations were performed using SPSS 18.0 software, and a P value of less than 0.05 was considered significant.

Results

Median follow-up for the entire cohort was 50 months (range 3–197 months). Mean age of the 1,503 patients was 56.21 (range 21–85) years; 999 were male, and 504 were female. The overall 5-year survival rate for all patients was 50.6%, and 741 patients were alive when our follow-up was completed. The total number of lymph nodes harvested from all patients in this study was 22,244 (14.80 ± 10.12), of which 6,773 (4.51 ± 6.34) were positive. The incidence of nodal involvement was 68.1% overall (1,024 of 1,503 patients). One hundred sixty (10.6%) patients were incidentally found to have distant metastases (abdominopelvic cavity, ovary, liver, or lung) during the operation and were classified as M1. No patients died during the initial hospital stay or first month after surgery.

Univariate and Multivariate Survival Analysis

In all 1,503 patients, 14 factors were found to have statistically significant associations with overall survival on univariate analysis. They were age, tumor location, tumor size, histological type, type of gastrectomy, radical resection, lymphatic/venous invasion, 6th and 7th UICC T stage, 6th and 7th UICC N stage, M stage, number of retrieved lymph nodes, and chemotherapy (Table 1). All of the aforementioned 14 variables were included in a multivariate Cox proportional-hazards model to adjust for effects of covariates. In our study, the T and N classifications in the 6th and 7th edition system were highly correlated. Therefore, two separate multivariate models, one with 7th and the other with 6th edition system, were run to avoid problems with presence of multicollinearity. In that model, we demonstrated that age, tumor size, histological grade, lymphatic/venous invasion, T, N, and M stage, and retrieved lymph nodes remained independent prognostic factors (Table 2). Tumor location, type of gastrectomy, radical resection, and chemotherapy—all of which were significant prognostic factors on univariate analysis—did not show significant influence on multivariate analysis. Concerning radical resection, we believe its prognostic value was overlapped by the M stage, and thus it did not show significant influence on multivariate analysis. Concerning chemotherapy, although patients in the adjuvant treatment group showed better 5-year survival rates on univariate analysis (P = 0.013), these values did not reach significance on multivariate analysis. This finding probably reflects selection bias, because adjuvant therapy was offered to more advanced-stage patients.

Table 1 Univariate survival analysis of clinicopathologic variables in 1,503 cases of gastric cancer patients
Table 2 Multivariate survival analysis results

In our database, the survival curves in each T subgroup in the 7th edition staging system had excellent discriminatory abilities (P = 0.000, 0.003, 0.000, and 0.000, respectively), which was also observed for subgroups in the 6th edition (P = 0.000, 0.000, and 0.000, respectively). In the 6th edition staging system, N stage was categorized as N0 (no regional lymph node metastasis), N1 (metastasis in 1–6 nodes), N2 (metastasis in 7–15 nodes), and N3 (metastasis in 15 or more nodes). By contrast, according to the 7th edition TNM classification system, N stage was reclassified as N0 (no regional lymph node metastasis), N1 (metastasis in 1–2 nodes), N2 (metastasis in 3–6 nodes), and N3 (metastasis in 7 or more nodes). The Kaplan–Meier plot for the 6th edition showed discriminative survival curves for N0 versus N1 and N1 versus N2 (P = 0.000 and 0.000, respectively) but not for N2 versus N3 (P = 0.780). In the 7th edition, the Kaplan–Meier plot showed discriminatory ability among the subgroups of N0 versus N1, N1 versus N2, and N2 versus N3 (P = 0.000, 0.000, and 0.000, respectively) (Fig. 1).

Fig. 1
figure 1

Comparison of survival curves according to T between the 6th (a) and 7th edition (b), and N classifications between the 6th (c) and 7th edition (d)

Figure 2 shows the patient distribution and stage-specific survival rates. In the 6th edition staging system, the Kaplan–Meier plot showed no overlapping survival curves among the six subgroups. In the 7th edition, among all eight substages, there were similar survival curves between stages IB and IIA (P = 0.261), while the Kaplan–Meier plot showed good discriminatory ability among other substages. However, when classified as four major stages, the Kaplan–Meier plot showed good discriminatory ability among stages I through IV in both systems.

Fig. 2
figure 2

Comparison of survival curves according to 6th and 7th editions. a, b classified as four major stages in the 6th and 7th editions, c classified as six substages in the 6th edition, and d classified as eight substages in the 7th edition (P = 0.000, respectively)

The performance of the 6th and 7th edition staging systems as assessed by the linear trend χ2, likelihood ratio χ2, and the AIC tests is presented in Table 3. Compared with the 6th edition system, the 7th edition system had better homogeneity (higher likelihood ratio χ2 score), discriminatory ability, and monotonicity of gradients (higher linear trend χ2 score). Furthermore, the 7th edition system had a smaller AIC value, representing optimum prognostic stratification.

Table 3 Comparison of the performance of the 6th and 7th editions of the AJCC/UICC TNM staging system

To obtain the best classification of substages for prognosis evaluation of gastric cancer in our statistical analysis, we used stratified analysis to introduce the factor of tumor size into substages of 92 cases of IB and 150 cases of IIA. We found that, in both subgroups of tumor size ≤5 and >5 cm, substage IB patients had significantly longer survival than substage IIA patients. In the subgroup of tumor size ≤5 cm, the 5-year survival rate of stage IB and IIA patients was 95.5 and 84.2%, respectively (P = 0.042), and in the subgroup of tumor size >5 cm, the 5-year survival rate of stage IB and IIA patients was 73.9 and 56.8%, respectively (P = 0.043).

In our study, among 331 cases of stage IV disease in the 6th edition system, 160 cases (48.3%) remained in stage IV while 130 cases (39.3%) were downstaged to IIIC, 40 cases (12.1%) to IIIB, and 1 case (0.3%) to IIIA in the 7th edition system. Therefore, we compared the prognosis of patients who remained in stage IV and those who were downstaged to stage III. We found that patients downstaged to stage III had better prognosis than those who remained in stage IV, with 5-year survival rates of 28.6% and 5.6%, respectively (P = 0.000). Even when divided into three subgroups (group 1, stage IIIB: T3N3bM0, T4bN1M0; group 2, stage IIIC: T4bN2-3bM0, T4a-4bN3bM0; and group 3, stage IV: T[any]N[any]M1) according to the 7th edition system, the patients had distinguishable prognoses. Subgroup 1 had a significantly better overall survival rate than subgroups 2 and 3 (P = 0.039 and 0.000, respectively) (Fig. 3).

Fig. 3
figure 3

Comparison of survival curves between patients downstaged to stage IIIB and IIIC and those remaining in stage IV in the 7th edition system (P = 0.039 and 0.000, respectively)

Discussion

The TNM classification system attempts to account for most basic parameters of cancer, and it has utility for determining extent of disease, providing guidance for treatment planning, and predicting outcome. The most substantial major changes for the 7th edition UICC TNM staging system are described below.

Firstly, concerning T classification, the T2 stage in the 6th edition system was subdivided into T2 and T3 in the 7th edition system, and directly resulted in two additional substages in the 7th edition system in addition to the six substages in the 6th edition system. Although substages IB and IIA had no significant difference in survival rates, we suppose that this finding was caused by differences of composition and the relatively small number of cases in these two substages. In our database, subgroups T2 and T3 had significantly different survival curves. Therefore, we believe the indistinguishable survival curves between substages IB and IIA were caused by the N status in these two substages. Substage IB was composed of 9 cases of T1N1M0 and 83 cases of T2N0M0, and substage IIA was composed of 8 cases of T1N2M0, 42 cases of T2N1M0, and 100 cases of T3N0M0. However, when we incorporated tumor size to subclassify stage IB and IIA in the 7th edition system, the role of tumor size in gastric cancer patient survival became important. On univariate and Cox regression multivariate analyses, we determined tumor size to be an independent prognostic factor. On stratified analysis, we found that, in both subgroups of tumor size ≤5 cm and >5 cm, patients in substage IB had better prognoses than did those in substage IIA. Park and colleagues found subclassification of pT2 gastric cancers into pT2a or pT2b to be necessary to demonstrate their different prognoses according to the 6th edition system based on 1,118 cases of gastric cancer.8 Liu et al. found tumor size to be an independent prognostic factor for patients with T3 gastric cancer in the 6th edition system, and the 5-year survival rates were 29.5 and 42.7% in the large-size (LSG, tumor >6 cm) and small-size groups (SSG, tumor ≤6 cm), respectively (P = 0.045).9

Secondly, the subdivision of N classification based on the number of metastatic lymph nodes was changed. The N1 substage in the old system was divided into N1 and N2 in the new system, and N2 and N3 substages were merged into N3. Nodal status is a singularly important prognostic factor in gastric cancer. Actually, many studies had demonstrated N stage in the 6th edition system to be not suitable for prognostic prediction in clinical analysis, because of the inappropriate cutoffs of that stage. Deng et al. determined the most appropriate cutoffs of metastatic lymph nodes to be 0, 1–4, 5–8, and ≥9 in 308 gastric cancer cases with curative resection classified by the 6th edition system.10 In a group of multicenter retrospective studies in Italy based on 652 resected early gastric cancer cases classified by the 6th edition system, Roviello et al. found the 10-year survival rate in early gastric cancer patients to be 92% for patients with negative nodes, 82% for patients with 1–3 positive nodes, 73% for 4–6 positive nodes, and 27% for >6 positive nodes.11 Similarly, using multivariate analysis including the number of positive nodes with cutoffs of 0, 1–4, 5–8, and ≥9, Wu et al.12 reported that the category of positive nodes was the most important prognostic indicator for 510 gastric cancer cases. In accordance with Deng et al., N2 and N3 patients in our study using the 6th edition system (equivalent to N3a and N3b in the 7th edition) had similar survival.13 The studies mentioned above show that the 6th edition UICC N stage less accurately reflected patient prognosis. Therefore, our results indicated that identifying lymph node metastases and classifying patients as N3a and N3b are unnecessary.

Thirdly, partial cases of stage IV disease due to T4 or N3 in the 6th edition system were downstaged to IIIC, IIIB, and IIIA in the 7th edition system. An et al. reported each subgroup of stage IV gastric cancer in the 6th edition system to have different clinical outcomes, and thus subclassification of stage IV gastric cancer into IVA (T1-3N3M0), IVB (T4N1-3M0), and IVM (T[any]N[any]M1) might be useful for more accurate prediction of prognosis.14 Ma et al.15 also predicted that subclassification of stage IV gastric cancer into IVA (T1-2N3M0), IVB (T3N3M0 and T4N1-2M0), and IVC (T4N3M0, TanyNanyM1) may be helpful to predict the outcome for patients with stage IV gastric cancer as classified by the 6th edition system. Additionally, Ahn et al. determined that survival for pN3 gastric cancer patients after curative gastrectomy was significantly longer in the pN3/M0 group as compared with that of the pN3/M1 group, and advanced T stage was a predictor of poor prognosis for pN3 patients based on 467 cases of gastric cancer patients classified by the 6th edition system.16 Park et al.17 found survival of patients with T4N3M0 and M1 stage disease to be significantly shorter than that of patients with T1-3N3M0 and T4N1-2M0 stage disease (P = 0.000) in a 422-patient study. They suggested that subclassification of stage IV gastric cancer into IVa (T1-3N3M0, T4N1-2M0) and IVb (T4N3M0, T[any] N[any]M1) may help predict the outcome of patients with stage IV gastric cancer as classified by the 6th edition system. In our study, we also demonstrated that patients with stage IV in the 6th edition system could be divided into two subgroups. Patients downstaged to stage III in the 7th edition system had significantly better prognosis than did those who remained in stage IV. Even when we divided these patients into three substages (stage IIIB, IIIC, and IV) according to the 7th edition system, the patients still had distinguishable prognoses. In our database, only one case was categorized as having stage IIIA disease in the 7th edition system, so we excluded this patient from the survival analysis because the number of patients enrolled was too small. This phenomenon may be explained by the fact that the three subgroups have different biologic behaviors of relapse or metastasis models, which still merits further research.

According to the theory mentioned by Ueno and colleagues, the criteria for evaluating the performance of the staging systems were as follows: (1) homogeneity within subgroups (small differences in survival among patients within same stage), (2) discriminatory ability between different groups (greater differences in survival among patients in different stages), and (3) monotonicity of gradients shown in the association between stages and survival rates (patients with earlier stages have longer survival than those in later stages).18 In the 6th edition staging system, the survival rate of the six substages had significant differences, while in the 7th edition system, the discriminatory ability between the eight substages appeared more powerful. As shown above, the new system had better homogeneity, discriminatory ability, and monotonicity of gradients. Additionally, the new system had a smaller AIC value, representing optimum prognostic stratification.

Conclusions

Although our sample population was from a single-institution experience and is relatively small compared with the worldwide gastric cancer collaboration database, the surgical procedures, pathologic examinations, and patient follow-up were highly uniform throughout the entire study period. Strengths of the 7th edition UICC TNM staging system include redefinition of T classification, redescription for N classification based on the number of positive lymph nodes, and reclassification of stage IV disease. Based on analysis of 1,503 gastric cancer patients who underwent primary surgical resection, we provided an external validation of the new UICC TNM staging system. Furthermore, we found that several clinicopathological variables, especially tumor size, were significant prognostic factors in our database worthy of further research. Overall, the 7th edition UICC TNM staging system is superior to the 6th edition system in terms of homogeneity, discriminatory, and monotonicity of gradients.