Introduction

According to GLOBOCAN 2012 (http://globocan.iarc.fr), gastric cancer is the third most common cause of cancer deaths worldwide, resulting in 723,000 deaths in 2012. Surgery remains the most essential part of multimodal therapy for patients with this type of cancer. Total gastrectomy is mandatory for curative resection of cancer that involves the upper part of the stomach, despite its detrimental effects on health and quality of life (QOL). Roux-en-Y (RY), the most common type of reconstruction performed after total gastrectomy, is simple, safe, and associated with a low incidence of reflux symptoms [1]. However, this method often fails to prevent various other postgastrectomy symptoms that adversely affect the QOL of patients, and eventually leads to malnutrition. Numerous other reconstruction methods have been proposed and explored, but the optimal method that is clinically beneficial, technically simple, and cost-effective remains elusive.

Several studies have evaluated the effect of creating a jejunal pouch to compensate for the loss of reservoir capacity [29]. The construction of a pouch was once considered complex and time-consuming, but these disadvantages are no longer a concern in the era of mechanical staplers. However, pouch reconstruction is sometimes associated with food stasis and excessive pouch dilatation, and the optimal size and site of the pouch remain a source of controversy. More recently, aboral pouch reconstruction (AP), in which an anisoperistaltic jejunal pouch is created in the Y limb of the RY reconstruction [10], has emerged as a relatively inexpensive option that is feasible, even using a laparoscopic approach. In addition, this is the only pouch-reconstruction method that can be reasonably employed after transhiatal extended gastrectomy in junctional tumors. In this method, the pouch not only endows reservoir capacity but also allows food and digestive fluids from the Roux limb to mix well, thereby theoretically enhancing digestion and nutrient absorption. In a randomized trial with RY as a control arm, AP was found to be superior in terms of postoperative serum cholesterol levels, lipid absorption, and QOL [10]. However, this study was a small single-institutional attempt with no predetermined endpoints or sample size calculations based on statistical considerations.

Therefore, we conducted a prospective randomized controlled trial comparing AP and RY, in which the postoperative physical functioning scores on the European Organization for Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire-Core 30 (QLQ-C30), an established instrument for the assessment of QOL [11], were evaluated as the primary endpoint. The secondary endpoints included other QLQ-C30 subscales, body composition parameters, and safety.

Methods

Patient eligibility

CCOG1101, a multi-institutional prospective randomized trial, was designed to test the hypothesis that AP results in a superior physical functioning score. Patients were randomized during surgery to receive either RY or AP after total gastrectomy for gastric cancer. Comparisons of patient-reported outcomes, body composition, nutritional status, and safety were made between the two types of reconstructions.

The eligibility criteria for preoperative registration were as follows: (1) histologically proven gastric adenocarcinoma; (2) total gastrectomy required for R0 resection; (3) performance status of 0 or 1; (4) age of between 20 and 80 years; (5) esophageal invasion of <3 cm and no need for thoracotomy; (6) no history of upper abdominal surgery with the exception of laparoscopic cholecystectomy; (7) no pretreatment with chemotherapy or chemoradiation; (8) the ability to understand and respond to the questionnaires; (9) no severe organ malfunctions that excluded the patient from being indicated for standard surgical treatment; and (10) the provision of written informed consent. The exclusion criteria were as follows: (1) cancer of the gastric remnant; (2) linitis plastica type gastric cancer; (3) the need for co-resection of other organs with the exception of cholecystectomy; (4) currently under treatment or being followed up for other malignancies; (5) body mass index of >28 kg/m2; (6) uncontrollable hypertension or diabetes mellitus; and (7) regular steroid use.

The patients were registered by fax or telephone for consideration as candidates for the trial and were randomized during surgery after the surgical team confirmed that the following criteria were fulfilled: (1) macroscopic diagnosis of up to T4a, N2 and M0; (2) total gastrectomy required for curative resection with adequate surgical margins; and (3) no need for thoracotomy. Randomization was performed by the minimization method and was stratified by center, intraoperative estimation of tumor depth (T1a and T1b/T2 and T3/T4a), and surgical approach (open/laparoscopic). The results were conveyed to the surgeons either by fax or telephone.

The patients were asked to complete the EORTC QLQ-C30 and stomach module (STO22) questionnaires before surgery and at 1 month, 6 months, and 12 months after surgery. Additionally, they underwent blood tests and measurements of body composition by the bioimpedance method using a TANITA DC-320 body analyzer (TANITA Corp., Tokyo, Japan) at the baseline, 1 month, 6 months, and 12 months after surgery. Follow-up evaluation either by abdominal computed tomography or abdominal ultrasound was conducted every 6 months.

All patients were informed of the nature of the study and provided written consent. The study was approved by the Institutional Review Boards of all 12 participating institutions. The trial was registered in the University Hospital Medical Information Network’s Clinical Trials Registry (UMIN-CTR; number 000006155).

Surgical treatment and postoperative adjuvant chemotherapy

Standard surgery was performed for all patients in accordance with the Japanese Gastric Cancer Treatment Guidelines [12], except that the laparoscopic approach (which is not considered a standard in the guidelines) was allowed for the patients with a preoperative diagnosis of stage I cancer. Figure 1 summarizes the characteristics of each method of reconstruction. The Y limb in RY was constructed by hand sewing, or by using a linear stapler where the length of anastomosis had to be less than 4 cm. The Y limb in AP was created by anisoperistaltic side-to-side anastomosis using two 75-mm linear staplers. In addition to organizing a series of meetings among the participants of the trial to discuss the surgical techniques and study design, a video of an AP reconstruction exemplarily conducted by one of the participants (Y. I.) was distributed to ensure that the procedure would be performed identically by all of the participants. The decision of whether to use the antecolic or retrocolic route was left to the preference of each surgeon.

Fig. 1
figure 1

The Y limb in the Roux-Y group was constructed either by hand sewing or using a linear stapler where the length of anastomosis had to be less than 4 cm. The Y limb in the aboral pouch group was created by anisoperistaltic side-to-side anastomosis using two 75-mm linear staplers

D2 lymph node dissection conforming to the Japanese Gastric Cancer Treatment Guidelines was performed for the patients with a preoperative and intraoperative diagnosis of T2–T4a gastric cancer, while those with T1 disease received a D1+ dissection. After pathological examination of the resected specimens, the patients with confirmed stage II–III disease were treated with postoperative S-1 monotherapy for 12 months.

Assessment of QOL

The EORTC QLQ-C30 questionnaire is composed of 30 items for assessing the QOL of cancer patients. These items are categorized into the global health status/QOL scale, 5 functioning scales (physical, role, cognitive, emotional, and social), 3 symptom scales (fatigue, pain, nausea and vomiting), and 6 single items (dyspnea, sleep disturbance, appetite loss, constipation, diarrhea, and financial difficulties). The EORTC STO22 is a questionnaire tailored to gastric cancer patients that is composed of 22 items [13, 14]. The 22 items are categorized into 5 symptom scales (dysphagia, abdominal pain, reflux, eating restriction, and anxiety) and 4 single items (dry mouth, taste problems, body image, and hair loss). Missing data were processed according to the EORTC scoring manuals [15], and all scales were linearly transformed to scores of 0 to 100. Consequently, high scores denoted favorable outcomes regarding global health status and the functioning scales, whereas low scores on the symptom scales and items indicated favorable outcomes.

In the current study, the validated Japanese language versions of the EORTC QLQ-C30 and STO22 were used to assess the patient-reported outcomes [16, 17]. The patients were provided the questionnaire sheets by their attending physician and were asked to complete them prior to surgery and to send them by mail to the independent data center run by the Chubu Clinical Oncology Group (CCOG). Further questionnaire sheets were sent directly from the data center to the patients at one, six, and twelve months after surgery to be completed and returned by mail.

Sample size calculation and statistical considerations

The primary endpoint was the physical functioning score, as measured with the EORTC QLQ-C30 at 12 months after surgery. The secondary endpoints were the other subscales of the QLQ-C30 and STO22, body composition, nutritional status as evaluated by blood testing, operation time, blood loss during surgery, and morbidity. An intergroup difference of 10 points in the QLQ-C30 physical functioning score obtained at 12 months after surgery was expected out of a scale of 100. Assuming a common standard deviation of 15, 37 patients were required per group for analysis given an alpha error on both sides of 5 % and a detectability of 80 %. A sample size of 90 patients was used, taking into consideration missing QOL survey data and the emergence of ineligible patients for other reasons. The chi-squared test was used to compare categorical data, and Student’s t test was used for continuous variables. A p value of less than 0.05 was considered significant. The JMP statistical package, version 10 (SAS, Cary, NC, USA), was used for statistical analysis.

Results

Patient demographics

A total of 121 patients were registered preoperatively from September 2011 to November 2012. Of them, 101 were deemed eligible during surgery and were enrolled and randomly allocated to either of the treatment arms. Peritoneal deposits were found in one patient after being randomized and allocated to the RY arm. This patient was subsequently considered unresectable. Thus, 100 patients were eligible for comparison, of which 51 were allocated to the RY group and 49 to the AP group (Fig. 2). The patient demographics are presented in Table 1. No significant differences were observed between the two groups in terms of age, gender, preoperative weight, preoperative body mass index, tumor diameter, depth of invasion, or stage distribution.

Fig. 2
figure 2

CONSORT flow diagram. Of 101 randomized patients, 52 were allocated to receive Roux-en-Y reconstruction and 49 to undergo aboral pouch reconstruction. The numbers of patients available for analysis of the primary endpoint were 41 and 42 in each group, respectively

Table 1 Patient demographics

Safety analysis

Short-term outcomes stratified by the operative procedure are presented in Table 2. No significant differences were observed between the two groups in terms of operation time, blood loss, postoperative hospital stay, or morbidity. No in-hospital mortality was observed. Postoperative complications of grade 2 or higher according to CTCAE ver. 4.0 are shown in Table 2. There were no surgical complications that were considered to be specific to AP. Seventeen (33 %) patients in the RY group and 15 (31 %) patients in the AP group underwent postoperative adjuvant chemotherapy according to the Japanese Gastric Cancer Treatment Guidelines. Recurrence was observed within 1 year in 5 (9.8 %) patients in the RY group and 4 (8.2 %) patients in the AP group.

Table 2 Type of surgery and short-term outcome of the patients enrolled

Patient-reported outcomes

The questionnaires were completed and returned by 91, 94, 93, and 88 % of the patients at the baseline and at 1 month, 6 months, and 12 months postoperatively. The scores for the global health status and the five functioning scales decreased in both groups at 1 month after surgery and then gradually recovered to the baseline level at 12 months postoperatively (Fig. 3). No significant difference was observed in the physical functioning score (the primary endpoint) at 12 months postoperatively. Among the QLQ-C30 and STO 22 symptom scores, the nausea and vomiting score showed significant improvement in the AP group (p = 0.041) at 1 year after surgery. The reflux score was significantly better in the AP group at 1 month after surgery (p = 0.036). Additionally, the scores for abdominal pain, dry mouth, taste problems, and body image showed improvements in the AP group, although these differences did not reach statistical significance (Fig. 4).

Fig. 3
figure 3

Global health status and functioning scores determined using the EORTC QLQ-C30 and stratified by the method of reconstruction. The scores were generally the lowest at 1 month postoperatively but recovered to the baseline by 12 months after surgery. No significant differences were observed between the groups in any of the functioning scores at any of the time points, including the physical functioning score at 12 months, which was the primary endpoint

Fig. 4
figure 4

Symptom scores determined using the EORTC QLQ-C30 and STO22 and stratified by the method of reconstruction. The scores were the lowest at 1 month postoperatively and often had not recovered to the baseline at 12 months after surgery. Several of the symptom scores tended to be lower for the aboral pouch group, but the majority of these differences were not statistically significant

Body composition and nutritional consequences

Changes in serum nutritional and body composition parameters in both groups are presented in Table 3. Nutritional status was evaluated by blood testing, but no significant difference was noted between the two groups. The body weight loss at 12 months after surgery was approximately 9 kg for both reconstruction methods. No significant differences were observed between the two groups in total body weight, fat mass, or lean body mass.

Table 3 Comparison of changes in serum nutritional and body composition parameters between the R-Y group and the AP group

Discussion

Various reconstruction methods after total gastrectomy have been reported, but no consensus has been reached regarding a standard method. Due to the prolongation of survival through early diagnosis and the need for tolerance to evidence-based postoperative adjuvant therapies for gastric cancer that have been reported in these patients [18, 19], a reconstruction method that adequately maintains their nutritional status and QOL after gastrectomy has become a matter of paramount importance. Consequently, various trials examining jejunal pouch reconstruction, mainly at the esophagojejunal anastomosis, as a means of maintaining reservoir capacity after total gastrectomy have been conducted. Gertler et al. conducted meta-analysis, concluding that pouch reconstruction mainly at the esophagojejunal anastomosis may offer advantages [20], but no definitive conclusion has been reached due to the small size of each of the trials.

The current randomized trial was conducted in a multi-institutional setting because all of the surgeons at the participating institutions considered RY to be a tentative standard due to the lack of evidence for selecting more complex procedures; however, an agreement was reached on the theoretical advantage of AP and its technical simplicity compared with other types of pouch reconstruction. This allowed for the enrollment of a generous number of patients compared with the previous trials conducted during the 2 years of patient accrual, and the safety and feasibility of AP was demonstrated despite the fact that most of the surgeons lacked a training period. The AP technique using linear staplers was found to be easily performed by surgeons with sufficient experience in gastric cancer surgery.

However, differences in the physical functioning scores (the primary endpoint) were disappointingly scarce. The QLQ-C30 and STO22 were employed in the current study because they have been established as globally applicable instruments for measuring patient-reported outcomes. We previously performed a pilot study comparing postoperative QOL among patients treated by open total gastrectomy with RY, open distal gastrectomy, and laparoscopy-assisted distal gastrectomy [21]. The physical functioning score at 12 months postoperatively was found to be outstandingly poor in that study after total gastrectomy compared with distal gastrectomy, and this finding led to its selection as a primary endpoint in the current study. However, our results indicated that the differences in this parameter as a consequence of adding a jejunal pouch (if any) should be considered trivial compared with those caused by the presence or absence of the remnant stomach. Additionally, the sample size of the trial was insufficient to detect significant differences in favor of AP for a majority of the items included in the QLQ-C30 and STO22, with the exception of the incidence of reflux at 1 month and nausea and vomiting at 12 months after surgery. Since these items were not the primary endpoint basis for the calculation of the sample size of the study, these consolatory results will have to be interpreted rather sparingly at this time. There are possibilities, however, that more distinct differences in some items, including these two, could in future be detected through longer follow-up. No trend was observed in terms of differences in other clinically relevant quality indicators, including body weight, lean body mass, and nutritional status, as evaluated by blood testing. Overall, this trial did not robustly demonstrate a superiority of AP reconstruction over RY reconstruction.

In the original trial that showed a superiority of AP in terms of some nutritional parameters, Horvath et al. created pouches with lengths of 15 cm for AP [10]; in contrast, the pouches constructed in the present study had a median length of 12 cm (range: 8–15 cm) due to the regulation that two 75-mm linear staplers should be used to minimize cost. In addition, the authors sought advice from investigators who are enthusiastic advocates of pouch reconstruction both at the esophagojejeunal anastomosis [22] and in the Y limb [23], and were warned that excessive pouch dilatation, one of the adverse events associated with pouch reconstruction, is more likely to occur in patients with large pouches. Although the optimal jejunal pouch length for AP remains to be elucidated, we considered it unlikely that the patient-reported outcomes in the current study could have been influenced by the subtle differences in the capacities of the reservoirs when compared with the original report of a case series involving more heavily built Caucasian patients [10].

As a result of a pivotal phase III trial [18], postoperative adjuvant chemotherapy has become the standard therapy for gastric cancer patients who have undergone stage II/III curative resection in Japan. Consequently, S-1 adjuvant chemotherapy was performed for 12 months for more than 30 % of the patients in both groups. Chemotherapy may have affected the QOL of these subgroups and nullified the benefits of AP. However, no significant differences in the various relevant parameters were observed, even when analysis was limited to the patients who had not undergone postoperative adjuvant chemotherapy (data not shown). Additionally, the investigators had originally expected that this subset would actually benefit from AP through improved nutritional status, which would minimize the adverse effects of chemotherapy [19].

The combination of the QLQ-C30 with the STO22 was used to assess the patient-reported outcomes among gastric cancer patients who had undergone a wide array of anticancer therapies, and was not specifically compiled to address the issue of postgastrectomy symptoms. Several instruments that more specifically address patient-reported outcomes following gastrectomy are now available, albeit in Japanese [23, 24], and studies using these instruments that have shown some noteworthy results are now starting to be published [2527]. Furthermore, pouch reconstruction in general has been proposed to improve long-term QOL when evaluated after as long as 30 months post-surgery [1]. This finding suggests that the 12-month observation period used in the current study may have been too short. Further comparison after a longer follow-up period and the use of another questionnaire with a greater focus on postgastrectomy symptoms [23] may reveal some advantages or disadvantages of AP. A protocol amendment in this regard is currently in progress, and future analysis based on the results of this study might generate some new findings.

Conclusion

AP after total gastrectomy is safe and feasible. However, the benefit of this method over the conventional RY method in terms of QOL and nutritional status was not shown as of 12 months postoperatively in this randomized trial.