Background

Colorectal cancer (CRC) is the most frequently diagnosed cancer and the second leading cause of cancer-related death, accounting for > 50,000 cases in Japan in 2018 [1]. Clinical guidelines from the Japanese Society for Cancer of the Colon and Rectum advise against routine administration of adjuvant chemotherapy to all patients with stage II CRC,1 but the American Society of Clinical Oncology, the National Comprehensive Cancer Network, and the European Society for Medical Oncology suggest that adjuvant chemotherapy is beneficial in patients with stage II colon cancer (CC) with high-risk factors for recurrence [2,3,4,5]. However, unlike the proven benefits of adjuvant chemotherapy for patients with stage III CC, the benefits for patients with stage II CC remain unclear because of an absence of conclusive data from randomized controlled clinical trials [6, 7]. In addition, risk factors for stage II CC have not been fully established [2, 5, 8,9,10,11,12].

Large-scale randomized comparative studies are challenging, especially when they include a surgery-alone arm, because risk factors for stage II CC have not been established and effective evidence-based adjuvant chemotherapies are lacking. To overcome these issues, we conducted a multi-center prospective two-arm trial, including a non-randomized arm, in which patients chose whether to undergo surgery alone or surgery followed by oral uracil and tegafur plus leucovorin (UFT/LV) with propensity score (PS) matching [13, 14] to minimize confounding biases in between-group comparisons, and a randomized arm, in which patients chose to be assigned randomly to undergo either surgery alone or surgery followed by UFT/LV. Oncologic outcomes and safety were evaluated after adjusting for previously reported clinicopathological risk factors using PS matching and inverse probability of treatment weighting (IPTW) in the non-randomized arm.

Methods

Details of the study protocol can be found in our previous report [15].

Patients

Eligible patients were aged 20 to 80 years with histologically confirmed stage II colon adenocarcinoma with at least one of the following conditions: T4 disease, perforation/penetration, poorly differentiated adenocarcinoma/mucinous carcinoma, or < 12 dissected lymph nodes. Eligible patients had undergone R0 resection, had an Eastern Cooperative Oncology Group performance status of 0/1, could take the study drugs orally, and could begin the study treatment within 8 weeks after surgery.

Allocation to treatment

This prospective observational study was based on patients’ self-selected treatments: the non-randomized arm comprised patients who chose to undergo surgery alone (NR-Group S) or surgery followed by UFT/LV (NR-Group U), while the randomized arm comprised patients who were assigned to undergo surgery alone (R-Group S) or surgery followed by UFT/LV (R-Group U). Patients in the randomized arm were allocated (1:1) using the web-based minimization method and stratified by depth of tumor invasion, histology, number of lymph nodes examined, age, sex, and institution.

Protocol treatment

Patients in NR-Group S and R-Group S underwent a blood test every 3 months and chest and abdominal computed tomography examinations every 6 months for up to 5 years or until confirmation of recurrence, occurrence of other malignancies, or death. Colonoscopy was also performed 1 and 3 years after surgery. Adverse events (AEs) were monitored for 6 months after registration.

Patients in NR-Group U and R-Group U started treatment with oral UFT (300 mg/m2/day) and LV (75 mg/day), three doses per day (every 8 h) within 8 weeks after surgery under one of the following two regimens: daily for 28 days followed by a 7-day rest or daily for 5 days followed by a 2-day rest. One course lasted for 5 weeks, and a total of five courses were given. If the criteria for starting/continuing treatment were not fulfilled, treatment was postponed or suspended until AEs improved sufficiently to meet the criteria. Depending on AE severity, the UFT dose was reduced according to the protocol. Physical examinations and blood tests were performed during the first week of each course, and medication compliance was monitored. Concomitant use of anti-tumor drugs other than UFT/LV, radiation therapy, and immunotherapy for the target disease were prohibited until confirmation of recurrence, double cancer, or multiple primary CRC. After completion of UFT/LV therapy, patients were followed up according to the same schedule used in NR-Group S and R-Group S, except that AEs were monitored from the start of UFT/LV therapy until 28 days after the final dose.

Endpoints

The primary endpoint of this study was disease-free survival (DFS), which was defined as the time from the date of registration to the date of occurrence of secondary cancer, recurrence, or death of any cause, whichever occurred first. The secondary endpoints were overall survival (OS), which was defined as the time from the date of registration to the date of death of any cause, and the incidences of AEs by severity. Expression of carcinoembryonic antigen mRNA 24 h after surgery was also measured as a secondary endpoint, and the results will be presented in a separate report.

Statistical analysis

The target sample size in the non-randomized arm with a sample size ratio of 1:2 between NR-Group S and NR-Group U was estimated at 1,715, and the target sample size in the randomized arm was 1,100 (550 patients each in R-Group S and R-Group U) [15].

The efficacy endpoints were evaluated in the full-analysis set, which included all enrolled patients, except those with major protocol violations in the non-randomized and randomized arms. The efficacy endpoints were also evaluated in the PS-matched population, which excluded patients who did not match, in the non-randomized arm. Safety was analyzed in the safety-analysis set, which included all non-randomized or randomized patients, except those who did not receive UFT/LV.

For the analysis of the non-randomized population, 1:2 PS matching was used to minimize confounding biases in comparisons between NR-Group S and NR-Group U [13, 14]. Potential confounding factors for estimating the PS were prespecified as follows: age (≥ 70 or < 70 years), sex, number of dissected lymph nodes (≥ 12 or < 12), T4 disease, bowel perforation, poorly differentiated adenocarcinoma, mucinous carcinoma, and number of participating patients at each institution (≥ 5 or < 5). The caliper for PS matching was determined before the analysis so that the standardized difference of all confounding factors was < 0.1. Differences before and after matching were assessed using the χ2 test. A stratified log-rank test accounting for matched pairs was used to compare groups using a two-sided significance level of 5%. Hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated using a stratified Cox proportional hazards model. The Kaplan–Meier method was used to estimate the 3-year DFS and OS rates in each group, and Greenwood’s formula was used to calculate the 95% CIs.

Analyses were also conducted with IPTW using estimated PSs [16]. For the IPTW analysis, standard errors were estimated using the robust sandwich variance estimator [17].

For the analysis of efficacy in the randomized arm, the same analyses were performed with the same stratification factors, except for institution. The incidence of each AE was calculated and the severity was evaluated using the Japanese version of the Common Terminology Criteria for Adverse Events v4.0. All statistical analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC, USA).

Results

Patient dispositions and characteristics before and after PS matching

A total of 1,938 patients were enrolled from 321 institutions in Japan from May 2012 to April 2016, including 1,902 patients (98%) in the non-randomized arm and 36 (2%) in the randomized arm (Fig. 1). The number of patients in the randomized arm was too small and, they were therefore excluded from the following analysis.

Fig. 1
figure 1

Patient flow diagram. Patients in NR-Group S and R-Group S underwent surgery alone; patients in NR-Group U and R-Group U underwent surgery followed by UFT/LV. Abbreviations: IPTW, inverse probability of treatment weighting; NR, non-randomized; PS, propensity score; R, randomized; UFT/LV, uracil and tegafur plus leucovorin

The baseline characteristics of the 1,880 eligible patients in the non-randomized arm (641 patients in NR-Group S and 1,239 patients in NR-Group U) are summarized in Table 1.

Table 1 Baseline characteristics of patients before and after propensity score matching in the non-randomized arm

After 1:2 PS matching, 402 patients in NR-Group S and 804 patients in NR-Group U were matched, with no significant differences in the eight confounding factors between the groups (Table 1). The standardized difference for all confounding factors was < 0.08.

Survival outcomes in the PS-matched population

As of the data cut-off date on December 11, 2019, the median follow-up time for DFS in NR-Groups S and U was 59.0 months (interquartile range [IQR], 47.2–60.6 months) in the PS-matched population. The 3-year DFS rate was significantly higher in NR-Group U (80.9% [77.9%–83.4%]) than in NR-Group S (74.0% [69.3%–78.0%]) (HR, 0.64 [0.50–0.83]; P = 0.0006) (Fig. 2a and Supplementary Table S1). The median follow-up time for OS in NR-Group S and U was 59.8 months (IQR, 47.5–61.2 months) in the PS-matched population. No significant difference was observed in the 3-year OS rate between NR-Group S and NR-Group U (P > 0.2) (Fig. 2b and Supplementary Table S1).

Fig. 2
figure 2

Disease-free and overall survival in non-randomized patients with resected high-risk stage II colon cancer. a Disease-free and b overall survival in the propensity score-matched groups. c Disease-free and d overall survival in the inverse probability of treatment weighting groups. NR-Group S (blue): surgery alone; NR-Group U (green): surgery followed by UFT/LV treatment. Abbreviations: CI, confidence interval; DFS, disease-free survival; HR, hazard ratio; NR, non-randomized; OS, overall survival; UFT/LV, uracil and tegafur plus leucovorin. P values obtained by log-rank test

Events in the PS-matched population

The median time to recurrence was significantly longer in NR-Group U (17.2 months; range, 1.8–59.8 months) than in NR-Group S (8.7 months; range, 2.0–47.1 months) (P < 0.0001). There were 115 (28.6%) and 181 (22.5%) DFS events in NR-Group S and NR-Group U, respectively. Recurrence of primary CC occurred in 73 patients (18.2%) in NR-Group S (lung and liver were the most common recurrence sites) and in 123 patients (15.3%) in NR-Group U (peritoneum and liver were the most common recurrence sites). There were 39 (9.7%) and 68 (8.5%) deaths in NR-Group S and NR-Group U, respectively (Table 2).

Table 2 Summary of disease-free survival and overall survival events

Survival outcomes in the IPTW population

The patients’ background characteristics were well balanced between NR-Group S (n = 641) and NR-Group U (n = 1,239) (Table 1). The median follow-up time for DFS was 58.9 months (IQR, 47.2–60.6 months) and the 3-year DFS rate (95% CI) was significantly higher in NR-Group U than in NR-Group S (P = 0.0013) (Fig. 2c and Supplementary Table S1). The median follow-up time for OS was 59.8 months (IQR, 47.5–61.2 months) and the 3-year OS rate was significantly higher in NR-Group U than in NR-Group S (P = 0.0315) (Fig. 2d and Supplementary Table S1).

Treatment with UFT/LV and safety

Of 1,239 patients in NR-Group U in the IPTW population, 890 (71.8%) completed UFT/LV treatment, 318 (25.7%) discontinued treatment, and 31 (2.5%) did not receive treatment. Treatment discontinuation occurred most frequently during the first course (140/1,208, 11.6%), and the most common reason for discontinuation was AEs (209/318, 65.7%). Of 1,208 patients in NR-Group U, 1,040 (86.1%) were treated without decreasing the doses of the study drugs.

Grade ≥ 3 AEs that occurred with a frequency of ≥ 2% in NR-Group U were diarrhea (3.9%), elevated alanine aminotransferase (3.1%), and elevated aspartate aminotransferase (2.2%). No other notable AEs with respect to incidence or severity occurred in any group (Supplementary Table S2).

Discussion

Studies involving patients with stage II/III or III CC have shown that 5-fluorouracil with or without oxaliplatin or LV, UFT/LV, and capecitabine are effective adjuvant chemotherapies. However, the efficacies of these therapies in patients with stage II CC are inconsistent [4, 18,19,20,21,22]. UFT/LV is widely used in Japan because it can be administered orally and has shown similar efficacy and safety to intravenous 5-fluorouracil/LV in patients with stage III disease [18, 23].

Consistent with our previous study demonstrating that 6 months of oral UFT/LV was sufficient in patients with stage IIB/III CC [24], the current prospective observational study demonstrated a significant 3-year DFS benefit following 6 months of oral UFT/LV compared with surgery alone in patients with resected high-risk stage II CC, adjusted using PS matching.

A previous study showed that the time to recurrence after curative resection of CC was significantly longer in patients treated with adjuvant chemotherapy than in patients treated with surgery alone [25]. When the incidence rate of DFS events was presented in 3-month intervals, the incidence of DFS events was highest at 6 to 9 months after registration in patients who underwent surgery alone; however, this peak level was lower in patients treated with UFT/LV, indicating that 6 months of UFT/LV effectively suppressed the peak of DFS events (Supplementary Fig. S1).

We found no significant difference in OS between NR-Group S and NR-Group U. However, using IPTW, UFT/LV therapy significantly improved OS. This improvement may have been detected because of the higher statistical power provided by the inclusion of all patients (n = 1,880). However, patients in the PS-matched population require long-term observation (5–10 years) before conclusions can be drawn regarding the survival benefits of UFT/LV.

This study included non-randomized and randomized arms because of potential difficulties in obtaining sufficient numbers of patients willing to participate in a randomized study without reliable measures to estimate the risk of recurrence, and because of the use of a therapy with no evidence-based effectiveness or safety for stage II disease. As predicted during preparation of the study protocol, 98% of the 1,938 registered patients self-selected their treatment. Among the patients who chose their treatment, 34% selected surgery alone and 66% chose surgery followed by adjuvant chemotherapy. Our results are consistent with a previous study in which > 60% of patients with CC answered a questionnaire saying that they would undergo adjuvant chemotherapy if the treatment reduced the 5-year recurrence rate by 1% to 2% [26].

The incidence rates of AEs in the present study were lower than those previously reported, and the completion rate of UFT/LV treatment was similar to equivalent and similar regimens available for patients with the same stage of and/or more-advanced CC [25, 27]. Six months of oral UFT/LV therapy thus appeared to be safe and well-tolerated as a postoperative adjuvant chemotherapy.

In this study, we applied PS matching to the non-randomized arm to adjust for confounding variables to assess the effects of UFT/LV. Increasing numbers of studies have used this analysis, and we showed that it was indeed a useful statistical method for achieving similar distributions of observed baseline covariates. We therefore expect that the data obtained from meta-analyses and non-randomized controlled studies can be analyzed using PS matching to provide statistically and clinically meaningful information.

This study had some limitations. The main results were obtained from a non-randomized population. Although PS matching and IPTW were used to adjust for risk factors, we cannot exclude other unknown confounding factors that might have caused a certain degree of bias. In addition, some risk factors, such as microsatellite instability [28, 29], were not included. We also excluded lymphatic, vascular, or perineural invasion as a risk factor because its evaluation was likely to be inconsistent among the participating institutions. Further studies are therefore required to validate the present results with respect to the risk factors used for adjustment.

Conclusions

The present results suggest that 6 months of adjuvant chemotherapy with oral UFT/LV has significant survival benefits in patients with stage II CC characterized by at least one of the following conditions: T4 disease, perforation/penetration, poorly differentiated adenocarcinoma/mucinous carcinoma, or < 12 dissected lymph nodes.