Introduction

Radical gastrectomy with D2 lymph node dissection has become increasingly regarded as the standard surgical procedure for most patients with operable gastric cancer. Even after R0 resection, the long-term outcome of gastric cancer remains poor. Although the prognosis of gastric cancer after surgery is mainly associated with tumor stage and other clinicopathological features, there is a growing consensus in the surgical literature that enhanced surgical technique and rich experience can improve outcomes for patients with curatively resected gastrointestinal cancer. Previous volume-outcome studies of patients with resectable gastric cancer have shown a positive association of surgical outcome with the hospital volume [14]. Compared with hospital volume, surgeon specialization can better predict surgical outcome in a given center. Many studies have affirmed the survival benefit for cancer patients treated by specialized surgeons [510]. With regard to gastric cancer, few studies focus on surgeon specialization.

We therefore used data of 967 patients with stage Ia–IIIc gastric cancer in a large-volume center to explore the impact of surgeon specialization on long-term outcome. Patients were categorized into two groups based on surgeon specialization, the specialized group (SG) and nonspecialized group (NSG). A one-to-one propensity score matching method was used to overcome bias due to the different distributions of covariates for the two groups. Clinicolpathological features, related prognostic factors and recurrence were analyzed.

Materials and methods

Patients and surgeons

A total of 1,695 patients with GC who underwent surgical resection at Tianjin Medical University Cancer Institute and Hospital between January 2003 and December 2007 were eligible for this study. Eligibility criteria included: (1) adenocarcinoma of the stomach, (2) patients with stage I–III disease who underwent gastrectomy with curative intent, (3) no history of gastrectomy or other malignancy and (4) no history of neoadjuvant chemotherapy. Ultimately, 976 patients were included in this study.

The 976 patients including 710 males (72.7 %) and 266 females (27.3 %). The ages ranged from 20 to 89 years, and the median age was 62 years. All patients were categorized into two groups according to the specialization of their surgeons: the specialized group (SG) and nonspecialized group (NSG). As a result, 261 patients were assigned to the SG and 715 patients to the NSG. Regarding surgeon specialization, specialized surgeons were defined as those: (1) from the Department of Gastric Cancer, (2) who had undergone professional training in gastrectomy and lymph node dissection in Japan, (3) whose annual gastric cancer surgeon volume was more than 50 and (4) with proven experience in D2 lymph node dissection confirmed by intraoperative photographs. As a result, only two surgeons met these conditions. The other 23 surgeons from nonspecialized units (including 8 from the Department of Hepatobiliary Cancer, 6 from the Department of Colorectal Cancer, 5 from the Department of Thoracic Cancer and 4 from the Department of Pancreatic Cancer) were defined as nonspecialized surgeons.

Evaluation of clinicopathological variables and survival

Clinicopathological features studied included gender, age, body mass index (BMI), tumor location, tumor size, Borrmann type, histology, depth of invasion, lymph node metastasis, metastatic lymph node ratio (rN), TNM stage, postoperative chemotherapy, surgical margin status, extent of lymphadenectomy, type of gastrectomy, total number of lymph nodes (LNs) dissected, postoperative complications and death. As patients in the two groups had different extents of node dissection and different node counts, which may result in stage migration, the metastatic lymph node ratio, which was not influenced by these factors and is reported to be a good indication of stage migration, was implemented in propensity score analysis [11, 12]. Using the log-rank test, we identified 0.1 and 0.5 as the best cutoff values for rN (rN0 = 0, 0 < rN1 ≤ 0.1, 0.1 < rN2 ≤ 0.5, rN3 > 0.5). Negative resection margin was defined as microscopic complete resection without residual cancer cells in the margin. Positive resection margin was defined as microscopically determined residual cancer cells in the margin. Postoperative complications during hospitalization only included those directly associated with surgery, such as hemorrhage, wound dehiscence, anastomotic leak, pancreatic fistula, lymphatic fistula, and abdominal or wound infection. Postoperative mortality was defined as any deaths that occurred with 30 days of surgery.

The tumors were staged according to the 7th edition of the Union for International Cancer Control TNM Classification System, whereas lymphadenectomy and lymph node stations were defined according to the 3rd English Edition of the Japanese Classification of Gastric Carcinoma [13]. Tumors were classified into two groups based on histology: differentiated type, including papillary, well or moderately differentiated adenocarcinoma, and undifferentiated type, including poorly differentiated or undifferentiated adenocarcinoma, signet ring cell carcinoma and mucinous carcinoma.

Follow-up

The patients were followed up every 3 months up to 2 years after surgery, then every 6 months up to 5 years, then every year or until death. Physical examination, laboratory tests, chest X-ray and abdominal ultrasound (US) were performed at each visit, while endoscopy and abdominal computed tomography (CT) were obtained every 6 months. The overall survival rate was calculated from the day of surgery until time of death or final follow-up. The median follow-up was 31 months (range 0–103) and 33 months (range 0–103) for the whole study series and the matched pair, respectively. The date of final follow-up was 30 December 2012.

Statistical analysis

For continuous variables, parametric analysis was performed using Student’s t test. Categorical variables were analyzed by means of the chi-square or Fisher’s exact test. Overall survival curves were calculated using the Kaplan-Meier method based on the length of time between primary surgical treatment and final follow-up or death. The log-rank test was used to assess statistical differences between curves. Independent prognostic factors were identified by the Cox proportional hazard regression model. To overcome bias due to the different distribution of covariates not associated with surgery for the two groups, a one-to-one match was applied using propensity score analysis. Variables involved in the propensity model were sex, age, BMI, tumor location, tumor size, Borrmann type, histology, depth of invasion, N stage, rN, TNM stage and postoperative chemotherapy. Operative factors including surgical margin status, extent of gastrectomy, type of gastrectomy, total number of lymph nodes (LNs) dissected, postoperative complications and death were not entered in the propensity model. p < 0.050 was considered statistically significant. The statistical analysis was performed using the statistical analysis program package SPSS 18.0 (SPSS, Chicago, IL, USA).

Results

Clinicopathological features and postoperative outcome of the whole study series

Of the 976 GC patients who underwent gastrectomy, 261 were operated on by specialized surgeons in comparison to 715 patients operated on by nonspecialized surgeons. Nine hundred five patients had a R0 resection, and 71 patients ended up with an R1 resection. Four hundred eleven patients underwent D2 or greater lymph node dissection, and 559 patients received postoperative adjuvant chemotherapy with 5-fluorouracil, leucovorin and oxaliplatin.

The patients were categorized into two groups according to their surgeon type. Clinicopathologic variables were compared between the two groups as shown in Table 1. There were no statistical differences in sex, BMI, tumor size, Borrmann type, histology, depth of invasion, N stage, TNM stage, chemotherapy, surgical margin status, postoperative morbidity and mortality between the two groups, whereas patients in the SG had a larger proportion of LN retrieval >15 (75.9 vs. 31.6 %, p < 0.001), greater number of lymph node retrieval (22.7 ± 12.4 vs. 12.9 ± 8.3, p < 0.001), higher ratio of ≥D2 lymph node dissection (83.1 vs. 27.1 %, p < 0.001) and higher rate of total gastrectomy (33.7 vs. 20.0 %, p < 0.001), but lower incidence of tumors invading the upper third of the stomach (20.3 vs. 34.4 %, p < 0.001), younger mean age (59.3 ± 11.6 vs. 61.9 ± 11.5, p = 0.002) than in the NSG.

Table 1 Clinicopathological features of GC patients who underwent gastrectomy with curative intent grouped by surgeon specialization: data are reported for the whole study series and for one-to-one propensity score matched pairs

In the entire study population, patients in the SG had a significantly higher OS rate than those in the NSG (5-year OS: 50.7 vs. 37.2 %, p = 0.001) (Fig. 1a). The results of the univariate and multivariate survival analysis are shown in Table 2. A total of eight factors evaluated in the univariate analysis had a significant effect on survival: age (≥70), tumor location, tumor size (<5 vs. ≥5 cm), Borrmann type (type III and IV), histology, TNM stage, postoperative chemotherapy and surgeon specialization. In multivariate analysis, age (≥70), tumor location (invading 2/3 or more of the stomach), tumor size, Borrmann type (III and IV), histology, TNM stage and surgeon specialization (hazard ratio was 1.489 for– NSG, p < 0.001) were found to be independent prognostic factors for OS. With the strata analysis, significant prognostic differences between the two groups were only observed in patients at stage II, IIIa and IIIb or N0–2 or rN0–2 (Table 3). The numbers of lymph nodes retrieval in the ≥ D2 resection performed in the SG and NSG were 24.2 ± 12.2 and 16.7 ± 9.7, respectively (p = 0.026). D2 or greater than D2 lymph node dissection performed by specialized surgeons demonstrated a better prognosis than that performed by nonspecialized surgeons (5-year OS: 53.7 vs. 44.8 %, p = 0.039) (Fig. 1b).

Fig. 1
figure 1

Prognosis of the whole study series who underwent surgery for GC. Patients were categorized into two groups based on their surgeon type: the specialized group (SG) and nonspecialized group (NSG). a Survival curve for all patients; the 5-year OS rate was 50.7 and 37.2 % for the SG and NSG, respectively (p = 0.001). b Survival curve for patients receiving ≥ D2 dissection; the 5-year OS rate was 53.7 and 44.8 % for the SG and NSG, respectively (p = 0.039)

Table 2 Univariate and multivariate survival analysis of GC patients in the whole study series
Table 3 Strata survival analysis of the GC patients according to the surgeon specialization: data are reported for the whole study series and for one-to-one propensity score matched pairs

Results after propensity score matching

We analyzed 261 patients in each group, who were selected by one-to-one matching using propensity scores. Characteristics after the propensity score analysis are shown in Table 1. Two hundred sixty-one of the 715 patients in the NSG were matched with 261 patients in the SG after covariate adjustment. All covariates that possibly affected OS but were not associated with surgery in the whole study series were equally distributed over the two matched groups. The rN stage was also entered in matching to overcome the stage migration resulting from the different extents of node dissection and different node counts. Matched patients in the SG had a similar sex ratio, mean age at surgery, BMI, tumor location, tumor size, Borrmann type, histology, depth of invasion, N stage, rN, TNM stage and chemotherapy as those of the matched patients in the NSG. Although the covariates associated with surgery were not entered in the matching, there were no significant differences in the surgical margin status, postoperative morbidity or mortality between the two groups. However, even after matching, patients in the SG still had a larger proportion of LN retrieval >15 (75.9 vs. 34.9 %, p < 0.001), higher ratio of ≥ D2 lymph node dissection (83.1 vs. 29.5 %, p < 0.001), higher rate of total gastrectomy (37.5 vs. 21.5 %, p = 0.002) and more lymph node retrieval (22.7 ± 12.4 vs. 13.3 ± 8.4, p < 0.001).

In the matched study series, the OS rate was also significantly higher in the SG than in the NSG. The 5-year OS rates were 50.7 % in the SG and 39.8 % in the NSG, respectively (p = 0.029) (Fig. 2). With the strata analysis, patients in the SG had a significantly higher 5-year OS than those in the NSG at stage IIIa–b, N1–2 or rN1–2. Similar to the whole study series, for patients receiving ≥D2 lymph node dissection, those in the SG had a significantly higher OS than those in the NSG (5-year OS: 53.7 vs. 46.1 %, p = 0.041).

Fig. 2
figure 2

Prognosis of the matched study series of 522 patients who underwent surgery for gastric cancer. The 5-year OS rate was 50.7 % for patients operated on by specialized surgeons and 39.8 % for those by nonspecialized surgeons, respectively (p = 0.029)

Recurrence in the whole study series

The patterns and incidence of recurrence in the whole study series are shown in Table 4. Although there were no significant differences in peritoneal, hematogenous and combined recurrence between the two groups, patients in the NSG had a significantly higher overall recurrence rate than those in the SG (55.7 vs. 44.4 %, p = 0.002). The proportion of locorregional recurrence was greater in the NSG (17.6 vs. 9.6 %, p = 0.002). Locoregional recurrence was the main recurrence type followed by peritoneal and hematogenous recurrence for patients in the NSG. For patients with locoregional recurrence, lymph node recurrence was greater in the NSG (7.6 vs. 3.8 %, p = 0.038), while other recurrence types, including gastric stump, anastomosis and gastric bed recurrence, showed no significant differences between the two groups.

Table 4 Patterns of initial recurrence in the whole study series

Discussion

This study demonstrates that GC patients operated on by a specialized surgeon who comes from a gastric oncology unit and has undergone professional training in gastrectomy and lymph node dissection were associated with improved survival and lower locoregional recurrence rates compared to those operated on by a nonspecialized surgeon; surgeon specialization was an independent prognostic factor for GC patients after surgery on multivariate analysis.

Previous studies have affirmed the positive influence of surgeon specialty on survival for various types of malignancies. Dooley et al. [8] reported the OS and disease-free survival of breast cancer were significantly improved if the surgery was performed by surgical oncologists rather than general surgeons. Read et al. [9] demonstrated that surgeon specialty contributed to prolonged disease-free survival and a higher rate of local control for colorectal cancer, and the background of the surgeon (colorectal or noncolorectal) was an independent predictor of disease-free survival. Barbas et al. [10] found the OS of colon cancer patients was significantly improved if the operation was performed by a specialty-trained surgeon, and surgeon specialization was an independent predictor of improved OS. They further discovered worse survival for stage II patients was associated with inadequate lymphadenectomy performed by non-specialty-trained surgeon. All of these studies highlighted the importance of specialty treatment in cancer surgery. Compared with these carcinomas, the D2 dissection procedure in gastric cancer surgery is more complex and requires a significant degree of training and expertise. The NCCN (National Comprehensive Cancer Network) guidelines emphasize that D2 dissection should be performed by experienced surgeons in high volume centers [14]. However, until now. few studies have focused on professional treatment in gastric cancer.

Several studies suggested that GC surgery should be performed in specialized centers as they found the rate of mortality and morbidity was significantly lower if GC patients were operated on by surgeons with high caseloads or coming from specialized units [1517]. Our result was different from these studies. We found no significant differences in postoperative complications and deaths between the SG and NSG, even after propensity score matching. This was consistent with the research conducted by Yu et al. [18], which revealed no statistical differences in surgical mortality between patients who underwent surgery by a specialized surgeon and those by a general surgeon. In the present study, the ratio of ≥ D2 dissection was merely 27.1 % in the NSG compared to 83.1 % in the SG. Nonspecialized surgeons were usually reluctant to performed D2 dissection as it was associated with higher postoperative mortality and morbidity, proved by previous studies [19, 20]. We think the low rate of ≥ D2 dissection in the NSG may account for the low incidence of mortality and morbidity after surgery.

With regard to the influence of surgeon specialization on the long-term outcome of GC, these studies did not give further illustration. So far, there is no clear definition of a specialized GC surgeon. Only Yu et al. [18] defined a specialized GC surgeon as one who had carried out 2 or more consecutive years of surgical practice, with caseloads of more than 50 per year. They demonstrated the 5-year OS was significantly improved for GC patients operated on by high-volume surgeons with 2 or more consecutive years of surgical practice, and the 5-year OS rate was 66.8 and 58.7 % for high-volume and low-volume surgeons, respectively (p = 0.0004). Bachmann et al. [21] reported that the risk of death decreased by 7 % for each increase of ten patients in hospitals’ annual caseloads (p = 0.009). In the present study, a specialized surgeon was defined as one from a gastric oncology unit who had undergone professional training in D2 lymphadenctomy and with gastric cancer caseloads of no less than 50 per year, and surgeon specialization was found to be an independent prognostic factor on multivariate analysis (hazard ratio 1.489, 95 % CI: 1.207–1.837, p < 0.001). We further discovered that patients in the SG had a significantly higher 5-year OS rate than those in the NSG even after propensity score matching (5-year OS: 50.7 vs. 39.8 %, p = 0.029). In the stratified analysis, a significant improvement in survival was seen for IIIa–b, N1–2 or rN1–2 stages with respect to NSG, but this effect disappeared for stage I and IIIc patients in both the whole study series and the matched pairs. Schwarz et al. [22] found that patients who had more than 15 N2 nodes and more than 20 N3 nodes examined had the best long-term survival outcomes. A study from the USA also suggested the positive impact of total lymph node count on survival after gastrectomy for GC [23]. From this point of view, the worse survival of the NSG may be due to inadequate lymphadenectomy as the average lymph node retrieval was merely 12.9 ± 8.3 and < D2 dissection accounted for 72.9 % in the operations performed by nonspecialized surgeons. A more interesting result in the present study is that for patients receiving ≥ D2 dissection, those operated on by specialized surgeons still demonstrated significantly better survival than those operated on by nonspecialized surgeons after matching (5-year OS: 53.7 vs. 46.1 %, p = 0.041). D2 dissection in gastric cancer surgery is considered to be a complex procedure and requires a significant degree of training and expertise. Nonspecialized surgeons without specialized training are not familiar with the technique and have not reached the plateau of the learning curve for D2 lymphadenectomy. The mean number of lymph nodes harvested in ≥ D2 resection performed in the NSG was merely 16.7 ± 9.7. Therefore, these D2 dissections could not have been as complete as those performed by specialized surgeons; some locoregional lymph nodes were more or less left without being removed. This is possibly the main reason for the poorer outcome of D2 dissection performed by nonspecialized surgeons.

To verify the incompleteness of lymph node dissection performed by nonspecialized surgeons, the recurrence data of the whole study series were also analyzed. We found that the major recurrence pattern for NSG was locoregional, while for SG it was peritoneal and hematogenous, and patients in the NSG had a significantly higher lymph node recurrence rate than those in the SG (7.6 vs. 3.8 %, p = 0.038). The 15-year follow-up results of the randomized nationwide Dutch D1D2 trial suggested that a D2 dissection decreased locoregional recurrence compared with limited D1 surgery (25 vs. 41 %) [24]. Our result was somewhat consistent with the trial as in the SG 84.1 % of patients underwent a ≥ D2 dissection and in the NSG 72.9 % of patients received a < D2 surgery. We considered that incomplete lymphadenectomy resulted in residual locoregional lymph nodes, and thus a higher rate of locoregional recurrence, especially lymph node recurrence.

In conclusion, GC patients operated on by a specialized surgeon tended to have a higher OS and lower locoregional recurrence rate with acceptable postoperative mortality and morbidity than those operated on by a nonspecialized surgeon. Surgeon specialization was an independent prognostic factor for GC after surgery. It is suggested that GC patients should be treated by specialized surgeons in a large-volume center. Professional training, especially standard lymphadenectomy, was necessary for GC surgeons.