Introduction

Gastric cancer remains the second leading cause of cancer death in the world and is the most frequent malignancy in Japan, South America, and Eastern Europe [1, 2]. Long-term survivors after radical gastrectomy have been increasing as the result of better early detection and improved surgical techniques [35]. The better surgical outcome has led to greater interest in the quality of life (QOL) of gastrectomized patients. For prevalence of postgastrectomy disorder, the procedures used in gastrectomy for early gastric cancer are designed as function-preserving operations or various reconstructions to restore postoperative QOL [6]. Although the postgastrectomy disorders greatly influence the living condition (QOL) of gastrectomized patients, there are limits to evaluation of outpatients because of the difficulty in measuring subjective and physical symptoms. In recent years, questionnaires have been developed to create objective rating systems for QOL [711]. The Japan Postgastrectomy Syndrome Working Party was founded in order to investigate symptoms and lifestyle changes among patients who have undergone gastrectomy. This Working Party collaboratively developed a questionnaire to evaluate the symptoms, i.e., living status and QOL, among gastrectomized patients. Using this questionnaire, a nationwide, multi-institution surveillance study was performed.

The frequency of cancers in the upper third of the stomach and gastroesophageal junction has been increasing in both Western and Asian countries [1215]. Total gastrectomy (TG) and proximal gastrectomy (PG) are operative options for proximal gastric cancer. In PG, the gastric fundic gland region is kept, and gastric-acid secretion and Castle intrinsic factor are maintained, but patients often suffer from reflux or gastric stasis. The choice of TG or PG has been discussed from the viewpoint of postoperative disorders, especially reflux esophagitis and nutrition. By using the newly developed Postgastrectomy Syndrome Assessment Scale (PGSAS-45), QOL after TG and PG for gastric cancer was compared.

Methods

Patients

Fifty-two institutions participated in this study. The PGSAS-45 questionnaire was distributed to 2,922 patients between July 2009 and December 2010. Of these forms, 2,520 (86.2 %) were retrieved, of which 152 were deemed ineligible because of patient age >75 years (n = 90), postoperative period <1 year (n = 29), co-resection of other organs (n = 8), and other factors (n = 25). As a result, 2,368 questionnaires (81 %) were decided as eligible for inclusion in various analyses related to the PGSAS-45. Of these, 393 patients who had undergone TG and 193 who had undergone PG were identified and retrieved for the current study (Fig. 1).

Fig. 1
figure 1

Outline of the study

Patient eligibility criteria

Patient eligibility criteria were: (1) pathologically confirmed stage IA or IB gastric cancer; (2) first-time gastrectomy; (3) age ≥ 20 and ≤ 75 years; (4) no history of chemotherapy; (5) no known recurrence or distant metastasis; (6) gastrectomy conducted one or more years prior to the enrollment date; (7) performance status (PS) ≤1 on the Eastern Cooperative Oncology Group (ECOG) scale; (8) full capacity to understand and respond to the questionnaire; (9) no history of other diseases or operations that might influence the responses to the questionnaire; (10) no organ failure or mental illness; and (11) provision of written informed consent. Patients with dual malignancy or concomitant resection of other organs (with co-resection equivalent to cholecystectomy being the exception) were excluded.

QOL assessment

The PGSAS-45 is a newly developed, multidimensional QOL questionnaire (QLQ) based on the Short-Form Health Survey (SF-8) [16] and the Gastrointestinal Symptom Rating Scale (GSRS) [1720]. The PGSAS-45 questionnaire consists of 45 questions, with eight items from the SF-8, 15 from the GSRS, and 22 clinically important items selected by the Japan Postgastrectomy Syndrome Working Party (Table 1). The PGSAS-45 questionnaire includes 23 items pertaining to postoperative symptoms (items 9–33), including 15 items from the GSRS and eight newly selected items. In addition, 12 questionnaire items pertaining to dietary intake, work, and level of satisfaction for daily life are included. Dietary intake items include five about the amount of food ingested (items 34–37 and 41) and three about the quality of ingestion (items 38–40). One questionnaire item pertains to work (item 42), while three address the level of satisfaction for daily life (items 42–45). For the 23 symptom items, a seven-grade (1–7) Likert scale is used. A five-grade (1–5) Likert scale is used for all other items except 1, 4, 29, 32, and 34–37. For items 1–8, 34, 35 and 38–40, higher scores indicate better conditions. For items 9–28, 30, 31, 33, and 41–45, higher scores indicate worse conditions. The main outcome measures were refined through consolidation and selection. Twenty-three symptom items were consolidated into seven symptom subscales by factor analysis, as listed in Tables 1 and 2. Assessment data include total symptom score, quality of ingestion subscale, level of satisfaction for daily life, physical component summary (PCS), and mental component summary (MCS) of the SF-8 as main outcome measures. In addition, the following results were selected as main outcome measures: changes in body weight, amount of food ingested per meal, necessity for additional meals, ability to work, dissatisfaction with symptoms, dissatisfaction at the meal, and dissatisfaction at working. Each subscale score is calculated as the mean of composed items, and the total symptom score is calculated as the mean of seven symptom subscales (Table 2).

Table 1 Structure of PGSAS-45
Table 2 Domains and main outcome measures

Study methods

This study utilized continuous sampling from a central registration system for participant enrollment. The questionnaire was distributed to all eligible patients as they presented to participating clinics. Patients were instructed to return completed forms to the data center. All QOL data from questionnaires were matched with individual patient data collected via case report forms.

This study was registered with the University Hospital Medical Information Network’s Clinical Trials Registry (UMIN-CTR; registration number 000002116). It was approved by the ethics committees at all institutions. Written informed consent was obtained from all enrolled patients.

Statistics

In comparing patient QOLs after TG and PG, statistical methods included the t test and Chi square test. All outcome measures that exhibited significant difference in univariate analysis were further analyzed using multiple regression analysis. p < 0.05 was considered statistically significant. In the case of p < 0.1 by univariate analysis, Cohen’s d was calculated. In the case of p < 0.1 in multiple regression analysis, standardization coefficient of regression (β), a decision coefficient (R 2), and the p value were calculated and shown in a table. Cohen’s d, β, and R 2 measure effect sizes. Interpretation of effect sizes were 0.2 ≤ small, 0.5 ≤ medium, and 0.8 ≤ large in Cohen’s d; 0.1 ≤ small, 0.3 ≤ medium, and 0.5 ≤ large in β; and 0.02 ≤ small, 0.13 ≤ medium, and 0.26 ≤ large in R 2.

StatView software for Windows Ver. 5.0 (SAS Institute Inc.) was used for all statistical analyses.

Results

Patient characteristics

Background data of both groups of patients are shown in Table 3. Reconstruction procedures were not regulated by the protocol, and depended on the principle of the institution or discretion of each surgeon. Consequently, whereas all patients treated by TG (393 patients) underwent Roux en Y reconstruction, the reconstruction after PG (193 patients) was varied and consisted of gastro-esophagostomy (115 patients), jejunal interposition (34 patients), and jejunal pouch interposition (44 patients).

Table 3 Patient background and operative features

In the PG group, the mean postoperative period was significantly longer (TG 35.0 ± 24.6 months vs. PG 40.5 ± 28.1 months; p = 0.0163), and the rates of celiac and pyloric branch preservation were significantly higher, while the rates of laparoscopic approaches, D2 lymph node dissection, and combined resections were significantly lower than in the TG group.

QOL assessments

The results of the main outcome measures by univariate analysis are shown in Table 4. The body weight loss (TG 13.8 % vs. PG 10.9 %; p = 0.0001; Cohen’s d = 0.35), diarrhea subscale (TG 2.3 vs. PG 2.0; p = 0.0016; Cohen’s d = 0.29), and dumping subscale (TG 2.3 vs. PG 2.0; p = 0.0118; Cohen’s d = 0.24) in the PG group were significantly lower than those in the TG group.

Table 4 Main outcome measures by univariate analysis

The necessity for additional meals was significantly lower in the PG group than in the TG group (TG 2.4 vs. PG 2.0; p < 0.001; Cohen’s d = 0.40), which indicates a better status in the PG group. However, the constipation subscale value of the PG group was significantly higher than that of the TG group (TG 2.1 vs. PG 2.3; p = 0.0145; Cohen’s d = 0.21), and the quality of ingestion subscale value of the PG group was significantly lower than that of the TG group (TG 3.8 vs. PG 3.6; p = 0.0281; Cohen’s d = 0.20), both of which indicate worse status of the PG group.

The physical and mental component summaries were not different in the two groups.

To eliminate confounding factors, multiple regression analysis was performed by adding postoperative period, age, sex, surgical approach, and celiac branch of vagal nerve preservation as explanatory variables (Table 5). Although the effect size of the advantages in PG over TG is relatively small, comparing the type of gastrectomy, the PG group was better than the TG group in body weight loss (β = 0.148; p = 0.003), diarrhea (β = 0.097; p = 0.048), dumping (β = 0.106; p = 0.043), and necessity for additional meals (β = 0.192; p < 0.001). Constipation and quality of ingestion, which were worse in the PG group by univariate analysis, showed no difference by multivariate analysis.

Table 5 Main outcome measures by multivariate analysis

Multiple regression analysis revealed that the postoperative period influenced the extent of body weight loss (β = 0.097; p = 0.030), diarrhea (β = −0.076; p = 0.078), and quality of ingestion (β = 0.092; p = 0.0365). This means that as the postoperative period lengthens, body weight loss and diarrhea improve.

The age influenced the constipation subscale (β = 0.147; p = 0.001), dumping (β = −0.114; p = 0.010), and the quality of ingestion (β = −0.126; p = 0.034). At older ages, although dumping decreased, constipation increased.

Diarrhea was often found in men (β = 0.137; p = 0.001), and surgical approach and celiac branch preservation had little influence on any of the main outcome measures by multiple regression analysis.

Discussion

Optimal evaluation methods for postgastrectomy disorders are important for selecting and improving the operative procedures and maintaining the high QOL for gastric cancer patients [2123]. The Japan Postgastrectomy Syndrome Working Party developed a questionnaire to evaluate general features; i.e., symptoms, living status, and QOL, among gastrectomized patients. Using this questionnaire, a nationwide, multi-institution surveillance study was performed. This was the first nationwide survey of its type and involved 52 medical institutions throughout Japan. The necessary QOL data were collected from 2,520 patients, and the final sample size, following exclusion and participant selection, was sufficient for statistical validity of this type of study.

In recent years, a tendency to increasing numbers of proximal gastric cancers has been reported, and early detection and potentially curative operations by PG for upper-third gastric cancers have been increasing [24, 25].

In this study, the effect of tumor progression was removed by constraining patient selection to those with pathologic Stage IA/IB disease, and it is thought that accurate QOL comparison between operative procedures is possible under these circumstances. Although QOL scores usually depend on the time after surgery, Kobayashi et al. [11] reported that the QOL after gastrectomy was impaired during a few months after surgery, but more or less stabilized at around 6 months after surgery. This is the reason that, in this nationwide survey, we chose to evaluate patients who had lived for 12 months or more after surgery. In addition, we used multiple regression analysis with time relapse after surgery as one of variables so as to adjust this problem.

Whereas the reconstruction for TG was only by the Roux-en-Y method, the reconstructions of PG could be by esophagogastrostomy, jejunal interposition, and jejunal pouch interposition [6]. Because the best reconstruction for PG has not yet been established, various procedures are performed. However, as the gastric fundic gland region is preserved in PG, gastric-acid secretion and production of Castle intrinsic factor and ghrelin, a gut hormone known increase to appetite, are maintained [26, 27].

In the PG group, the rates of celiac and pyloric branch vagal nerve preservation were significantly higher, and the rates of laparoscopic approaches, D2 lymph node dissection, and combined resection were significantly lower than in the TG group. Standard TG is composed of more D1b dissection and sacrifice of the vagal nerve, often with combined resection, such as of the spleen and gallbladder [6, 28]. On the other hand, PG, which is a function-preserving operation, usually consists of less than D1b dissection and preservation of the vagal nerve [6]. The differences in the surgical background are caused by the procedure itself. Therefore, there seems to be no problem in comparing the QOL scores of these two groups.

From the results of the main outcome measures by univariate and multivariate analysis, body weight loss, diarrhea, dumping, and necessity for additional meals were significantly lower in the PG than in the TG group. Although esophageal reflux is common after PG [29, 30], various reconstruction methods have recently been described that reduce this problem [31, 32]. In this study, there was no difference in the esophageal reflux subscale values between the groups. This result suggests that PG is not necessarily disadvantageous with regard to reflux.

As three types of reconstruction with various modifications were performed with PG reconstruction, it is necessary to compare the three procedures in future studies. Dumping symptoms, such as early dumping with systemic symptoms, early dumping with abdominal symptoms, and late dumping, were examined in detail. Late dumping was significantly less common in the PG than in the TG group. Also, a tendency toward less early dumping with abdominal symptoms was seen in the PG group (data not shown). As a result, PG performed well on the dumping subscale. Although PG reflected the storage capacity and pylorus-preserving function, in TG, solid food is passed rapidly to the jejunum because of no storage ability [33].

Although the constipation subscale results and quality of ingestion subscale values were worse with PG than with TG by univariate analysis, multivariable regression analysis revealed that there were no statistical differences in these subscales as the result of the type of gastrectomy. Body weight loss and quality of ingestion subscale improved if the postoperative period was long. This means that gastrectomized patients adapt in some ways to the anatomic changes over time, even after more than 1 year following gastrectomy.

Multivariable regression analysis showed that dumping decreased and constipation increased with advancing age. This result may reflect the known intestinal peristaltic decrease in older patients [3437].

By multivariable regression analysis, men were more likely to have diarrhea than women. This may be a consequence of the fact that the intestinal transit time is longer in women than in men at equivalent ages [3739]. As for the effect of the surgical approaches and celiac branch preservation, no differences were found by multivariable regression analysis.

There were no statistical differences between the groups with regard to ability to work, dissatisfaction with symptoms, dissatisfaction at working, dissatisfaction for daily life subscale, PCS, or MCS. It is suggested that daily life is largely unchanged and that statistically different postgastrectomy disorders do not have a major effect on adaptation.

In conclusion, although the effect size of the advantages of PG over TG is relatively small, our results indicate that PG is useful as a function-preserving procedure for early upper-gastric cancer. Although this study is limited in that it is retrospective and examines a single time point, it suggests the value of PG, use of which should be encouraged. To confirm this conclusion, a randomized study to determine the most desirable reconstruction for PG to achieve a good long-term QOL will have to be conducted using the PGSAS-45 questionnaire and successive endoscopic examinations.