Introduction

Performing HRQOL assessments in clinical practice has been demonstrated to not only increase health provider and patient awareness of treatment-related symptoms and enhance patient–physician communication, but also to improve patients’ overall survival [1]. Several studies have described the use of HRQOL assessments among Japanese patients that were treated for bladder cancer [2]. However, most of these HRQOL studies were performed using generic QOL instruments, such as the Medical Outcomes Short Form-36 (SF-36) and European Organization for Research and Treatment of Cancer Quality of Life Core Questionnaire (QLQ-C30) [3, 4]. Although these instruments can be used to assess HRQOL related to physical well-being and functional, emotional, and social factors after cancer treatment, they cannot necessarily be used to evaluate bladder cancer-specific side effects. Therefore, bladder cancer-specific HRQOL assessment instruments, such as the Functional Assessment of Cancer Therapy (FACT)-BL, have been used to evaluate HRQOL differences across various types of treatments [5, 6]. The main limitation of the FACT-BL is its inability to determine which symptoms are the most troublesome for a patient, because it does not break its questions into separate domains (e.g., into urinary, bowel, and sexual symptoms).

The Bladder Cancer Index (BCI) is bladder cancer-specific HRQOL instrument, which was designed as a comprehensive instrument for assessing HRQOL in bladder cancer patients who still possess their own bladders after surgery or who have undergone radical cystectomy [7]. The main advantage of the BCI is that it can be used to assess symptom magnitude and impairment in the urinary, bowel, and sexual health domains. Recently, the BCI was translated into Hungarian, Spanish, French, Dutch, and Arabian, and each of these versions has been validated [8,9,10,11,12]. To meet the demand for a bladder cancer-specific HRQOL questionnaire in Japan, we developed a preliminary Japanese version of the BCI through a multistep process, which involved forward-translation, back-translation, and discussion with the original developer. This Japanese BCI showed demonstrable face validity and reliability in the pilot study and can be used to estimate HRQOL among Japanese patients with bladder cancer and for cross-cultural comparisons [13]. This study was performed to validate the newly developed Japanese BCI using a large population of bladder cancer patients. Furthermore, we aimed to evaluate bladder cancer patients that had undergone various surgical procedures using the Japanese BCI via a multicenter cross-sectional survey conducted in Japan.

Materials and methods

The validation cross-sectional cohort included bladder cancer patients with various disease stages (non-muscle-invasive and muscle-invasive cancer) that had undergone a range of surgical procedures (endoscopic treatment, intravesical therapy, or radical cystectomy). A total of 450 bladder cancer patients that were treated between 1999 and 2017 were enrolled from 7 institutions in Japan. Responses were obtained from 397 patients (88.2%). The study protocol was approved by each institutional review board, and the approval number for Hokkaido University Hospital was 015-0504. All patients were at least 20 years old and were required to have an Eastern Cooperative Oncology Group performance status (ECOG-PS) of 0 or 1. All patients provided written informed consent and responded to the relevant questionnaires, including the Japanese BCI, Medical Outcomes Short Form-12 (SF-12) [14], FACT-G [15], FACT-BL, and EQ-5D-5L [16]. All questionnaires were self-administered and sent back by mail or submitted to an outpatient clinic in a sealed envelope, so that privacy was maintained. The questionnaires were scored centrally by trained assessors.

Item analysis, factor analysis, reliability, and validity procedures

The item analysis was performed using the data collected in the present study. The percentage of missing responses was calculated for each item. When ≥ 15% of the responses for an item were missing, the profiles of the responders and non-responders were compared using the Student’s t test or Pearson’s Chi square test. To identify discrete domains, we performed factor analysis using the principal component method with promax rotation. Each Likert scale item was categorized into one of 3 domains (urinary, bowel, or sexual) and 2 subdomains (urinary function/bother, bowel function/bother, and sexual function/bother). Each domain score was comprised of the mean of the standardized items. Higher scores indicate a better health status. The score for a domain was regarded as missing if more than 20% of the items in the domain were not answered. Other than that, the domain score was calculated from the answered questions. Cronbach’s alpha coefficient was used as a reliability coefficient for each subscale. To assess criterion-related validity, we analyzed the correlations between the scores for the BCI subscales and the scores for other relevant validated instruments (SF-12, FACT-G, FACT-BL, and EQ-5D-5L). Furthermore, to evaluate known-groups validity, the subjects were divided into the following 5 groups: (1) patients with native bladders that underwent transurethral resection alone, (2) patients with native bladders that underwent transurethral resection and intravesical therapy, (3) patients that underwent cystectomy involving ileal conduit diversion (IC), (4) patients that underwent cystectomy involving ileal neobladder diversion (NB), and (5) patients that underwent cystectomy involving cutaneous ureterostomy (CU). Statistical comparisons among the 5 groups were performed using the Kruskal–Wallis rank-sum test and the general linear model, followed by Tukey’s HSD test. Regarding intergroup comparisons, least-squares means (LSM) estimates, adjusted for age, sex, and ECOG-PS, were calculated. JMP version 14 (SAS Institute Inc., Cary, NC, USA) was used for the statistical analyses, and p values of < 0.05 were considered to indicate a statistically significant difference.

Results

The median age, sex ratio, ECOG-PS, disease stage, and median disease duration of the validation cohort are shown in Table 1. Two hundred and twenty-one of the enrolled patients were treated endoscopically, including patients who underwent transurethral tumor resection with or without intravesical therapy. Of these patients, 104 had a history of one or more transurethral surgical procedures. Among those who received intravesical therapy, 73 patients and 30 patients were treated with intravesical BCG and intravesical chemotherapy (e.g., mitomycin C, epirubicin, pirarubicin), respectively. Of the other patients who underwent cystectomy, 101 (57.4% of those who were treated with cystectomy, 25.4% of the entire cohort) underwent IC, 49 (27.8% and 12.3%, respectively) underwent NB, and 26 (14.8% and 6.5%, respectively) underwent CU. The median disease duration for the native bladder without intravesical therapy group, native bladder with intravesical therapy group, cystectomy with IC group, cystectomy with NB group, and cystectomy with CU group were 22.5 months, 25 months, 35 months, 75 months, and 20.5 months, respectively. The percentage of missing items ranged from 2.3 to 33.0%. Eighteen of the 36 items exhibited a missing response rate of ≥ 15% (Supplementary Table 1). Among the 3 domains, the number of items with missing response rates of ≥ 15% was higher in the sexual domain (55.6%) than in the urinary domain (27.8%) or bowel domain (16.7%). Fifteen of the 18 missing items were presented in a table-style questionnaire, which matched the original version of the BCI; i.e., the missing items were presented in a table-style questionnaire format significantly more often than in an independent-style questionnaire format (p < 0.01). A higher number of missing responses for the 18 above-mentioned items was found to be associated with higher age, female sex, and a single marital status (Supplementary Table 1). The percentage of missing items ranged from 1.5% to 6.5% for SF-12, ranged from 2.0% to 2.5% for EQ-5D-5L, ranged from 5.0% to 48.6% for FACT-G, and ranged from 6.0% to 38.6% for FACT-BL, respectively.

Table 1 Clinical characteristics in 397 patients in Japanese version of the BCI validation cohort

For all 3 domain scales and six domain subscales, Cronbach’s alpha coefficient was high (0.78–0.93), except for the bowel bother domain (0.64) (Table 2), indicating that each of the domain and subdomain items coherently measured urinary, bowel, or sexual factors. The percentage of patients reporting maximum scores was relatively high across all domains, indicating a moderate ceiling effect (Table 2). The ceiling effect was especially marked in the urinary and bowel domains, and the sexual bother domain. More than two-thirds of the patients who recorded the maximum score for each domain were endoscopically treated. Actually, the maximum score percentage was 100% for 5 out of 6 subdomains (Table 2). On the other hand, no floor effect was found, except for the urinary function (1.0%) and sexual function (35.0%) subdomains. Factor analysis supported the idea that most of the items could be assigned to urinary, bowel, or sexual health domains and their subdomains (their function and bother subscale), although there were two outliers (item numbers: 42 and 52) (Supplementary Table 2).

Table 2 BCI domain-specific summary and subscale characteristics

The results of the assessments of the correlations between the domain scales and subscales are shown in Table 3. These correlations were evaluated to validate the convergence/divergence between function scores and bother scores. In each domain, the function scores and the bother scores exhibited moderate correlations (urinary and bowel subdomains: r = 0.42 and 0.47, respectively), indicating that bother subscale of each domain quantified similar disabilities to those measured by the respective function subscale. In the sexual domain, the function and the bother score exhibited a negative correlation, indicating that they were inversely correlated (r = − 0.19). In contrast, the correlations between the function subscale and the bother subscales of different domains were weak. Regarding concurrent validity, the SF-12 physical summary score and mental summary score were inversely correlated with the sexual bother and sexual function score, respectively (r = − 0.06 and − 0.11, respectively). In addition, the SF-12 role summary score and EQ-5D score were moderately correlated with the urinary bother score (r = 0.38 and r = 0.33, respectively). Despite moderate-to-strong correlations being detected between most of the BCI subdomain scores and the FACT-BL score (r = 0.32–0.48), the sexual bother subdomain score demonstrated a weak relationship with the FACT-BL score (r = 0.01). Table 4 shows that the urinary, bowel, and sexual domain scores differed significantly among the 5 treatment groups (p < 0.05), except for the sexual bother subdomain score (p = 0.23). However, regarding other HRQOL instruments, only the SF-12 role summary score (p < 0.01) and FACT-BL score (p < 0.01) differed significantly among the five groups.

Table 3 Interscale correlation between BCI function and bother subscales, and other HRQOL instrument summary scores
Table 4 Age, sex, and ECOG-PS adjusted Mean BCI domain scores by treatment group

The mean BCI subscale scores for the 5 treatment groups, which were adjusted according to age, sex, and ECOG-PS, are shown in Fig. 1. Interestingly, the urinary and bowel function scores of the patients in the NB group were significantly lower than those of the patients in the other groups. In addition, the bowel bother score in patients that underwent cystectomy with IC or NB were significantly lower than those of the patients with native bladders (regardless of whether intravesical therapy was performed). Although the urinary bother scores and sexual function scores of the patients in the three cystectomy groups were significantly lower than those of the patients with native bladders, the sexual bother scores of the five treatment groups did not differ significantly. Regarding the FACT-BL score, it only differed significantly between the patients in the native bladder without intravesical therapy group and those in the radical cystectomy with IC group. The SF-12 physical summary scores of the patients in the CU group were significantly lower than those of the patients in the other cystectomy groups and the native bladder without intravesical therapy group (Supplementary Fig. 1). The SF-12 role summary scores of the patients in the IC group and CU groups were significantly lower than those of the patients with native bladders (Supplementary Fig. 1). Among the patients with native bladders, neither the number of prior transurethral surgical procedures nor the type of intravesical therapy significantly affected their BCI subdomain scores. In addition, the five treatment groups were divided into two groups by the median follow-up period, respectively: (i) the short-term follow-up group and (ii) the long-term follow-up group. The sexual bother score for the native bladder without intravesical group, the sexual function score for the NB group, and the bowel bother score for the IC group were significantly different between the short-term follow-up group and the long-term follow-up group (Supplementary Table 3). The QOL score for the long-term follow-up group was significantly higher compared with that of the short-term follow-up group in all the three domains.

Fig. 1
figure 1

The least-squares means (LSM) estimation adjusted by age, sex, and ECOG-PS and standard error by treatment group (n = 371, Japanese cohort). Higher scores indicate better health status. (**indicates p < 0.05* and indicates p < 0.05.)

Discussion

In this study, we confirmed that the Japanese BCI can be used to comprehensively evaluate the postoperative symptom burdens of bladder cancer patients who undergo transurethral surgery or radical cystectomy. The Japanese BCI exhibited similar levels of reliability and validity to the original BCI. Substantial differences between patients that underwent different surgical treatments for bladder cancer were identified. In particular, we revealed HRQOL differences in the urinary, bowel, and sexual function and bother subdomains between patients that retained their native bladders and patients that underwent cystectomy using the Japanese BCI. To the best of our knowledge, this is the largest cross-sectional study to compare HRQOL among patients who have undergone surgical treatment for bladder cancer in Japan.

The current study highlights some important points that were not reported in the validation study of the original BCI. For 18 items of the BCI, a relatively high percentage of missing values (≥ 15%) was found to be associated with age, sex, and marital status. Therefore, BCI data obtained from elderly, female, and single patients should be interpreted carefully. Interestingly, the items that were presented in a table-style questionnaire format exhibited a high missing response rate. It is suggested that when completing table-style questionnaires elderly patients might find it difficult to see responses in the upper row when they answer questions in the lower rows. Consequently, we will revise the format of the table-style items in future. In addition, Japanese males have been reported to be reluctant to consult physicians about sexual issues, and therefore, might hesitate about answering items relating to sexual issues [17].

The relatively low internal consistency coefficient obtained for the bowel problem subdomain (Cronbach’s alpha coefficient: 0.64) in the present study is another issue. In a study involving an American population, the original BCI indicated that Cronbach’s alpha coefficient for the bowel problems subdomain was as high as 0.87; however, another international validation study reported lower Cronbach’s alpha coefficients for the bowel bother subdomain (range: 0.69–0.77) [9, 10]. One of the reasons for this is the weak correlations between the severity of constipation and other bowel bother items (e.g., pelvic pain, bloody stools, and bowel movement urgency). In addition, a high ceiling effect was detected in the patients who retained their native bladders. These findings were comparable to those obtained in the previous studies [10, 11], but suggested that the questionnaires should be modified, so that they can assess patients with good function more precisely. The conceptual independence among the urinary, bowel, and sexual domains was supported by the low inter-scale correlation coefficients for these relationships (range 0.05–0.32), which were very similar to those reported for the original BCI (range 0.17–0.39). The low correlations seen between the BCI and the SF-12 suggest that the BCI captures additional information that is not covered by generic instruments.

Mixed HRQOL results have been obtained regarding about whether continent diversion is superior or equivalent to incontinent diversion. Although several studies have reported that continent diversion provides marginal HRQOL advantages over incontinent diversion, most studies have shown no such differences [2, 18]. We found that patients who had undergone cystectomy with NB exhibited significantly lower HRQOL scores for the urinary function and bowel function subdomains than those who were treated with the other two incontinent diversion procedures. This suggests that urine leakage occurred more often and bowel movement frequency was increased in the patients that underwent NB than in those that underwent the other diversion procedures. However, the scores for urinary and bowel bother did not differ significantly among the cystectomy groups. In a longitudinal study, Hedgepeth also reported that patients that underwent NB exhibited significant functional urinary impairment with little difference in urinary problems [19].

Interestingly, the incontinent CU group and the incontinent IC group displayed comparable scores for most of the BCI subdomains. The previous studies also showed that the HRQOL scores of the patients that underwent CU did not differ significantly from those of the patients that underwent IC [4, 20]. Although CU is employed less frequently than other forms of urinary diversion due to the risk of stomal stenosis, it was reported that a high ureteral patency rate (82–89%) was achieved with CU using the Toyoda method [21, 22]. In addition, CU can be performed simply and safely and involves a shorter operating time and a lower bowel complications rate [20]. This procedure has generally been considered for elderly and frail patients with limited life expectancy. Possessing a single kidney is a good indication for CU, as we found that most of the patients (91%) who underwent cystectomy with simultaneous nephroureterectomy chose CU. CU is a reasonable alternative option for patients who are elderly or frail, based on the risk of perioperative complications and from the viewpoint of HRQOL.

The negative correlation between the sexual function and bother subscales deserves a mention (correlation coefficient: -0.19). One of the reasons for this negative correlation is due to about one-third of the patients who reported no sexual function without sexual bother. Although the Japanese patients in the current study displayed relatively low scores for the sexual function subdomain, they accepted their impairments rather than regarding them as bother. As was reported by Hisasue et al., satisfactory sexual function, such as erectile rigidity, does not always equate to a satisfactory sexual life in Japanese couples. These cultural factors might have influenced the discrepancy between the sexual function and sexual bother subdomains among Japanese patients. While a moderate correlation (correlation coefficient: 0.56) was detected between the sexual function and sexual bother subdomains in the original BCI study, weak relationships were detected between these subdomains in validation studies of the Spanish (correlation coefficient: 0.15) and Hungarian (correlation coefficient: 0.26) versions of the BCI [9, 11]. It is speculated that ethnicity and cultural differences might play significant roles in the perception of sexual impairment.

This study had several potential limitations. First, it lacked a baseline assessment, as it was a cross-sectional study, and data about how the patients’ HRQOL scores changed over time. However, this cross-sectional design also provides the potential advantage, because the patients with variety of follow-up periods can be analyzed. The analysis of such patients with the diversity of the follow-up periods enabled us to appropriately validate the Japanese BCI from various points of view. Second, our study did not involve any test–retest reliability testing; however, the observed high internal consistency of the BCI indicates that the results are reliable. The high proportion of missing values in the sexual domain was another limitation. In addition, other critical HRQOL factors, including patients’ body image perceptions, were not evaluated in this study. Finally, the cohorts were not random samples, and selection bias might have been unavoidable. The longer disease duration for the NB group compared with that of other four treatment groups should be considered when interpreting the HRQOL outcome of the NB group. Nevertheless, to the best of our knowledge, this is the first validation study of the Japanese BCI and the largest study to have compared HRQOL among patients that underwent surgical treatment for bladder cancer in Japan. We consider that the present Japanese BCI would contribute to evaluations of HRQOL in Japanese bladder cancer patients and provide considerable amounts of information that could be used to compare the outcomes of different surgical procedures.

The newly developed Japanese BCI was validated in bladder cancer patients that underwent surgery. Although it has several limitations, it will be a valuable tool for assessing the HRQOL of bladder cancer patients across a range of new treatment options, not only in clinical practice, but also during research. Further HRQOL-related research using the Japanese BCI, involving larger longitudinal bladder-cancer cohorts, would provide critical information and help clinicians to better understand the outcomes experienced by bladder cancer survivors.