Background

Despite improvements, many treatments for brain tumors are not curative and these patients have a poor prognosis. In addition, such treatments may be neurotoxic and, thus, negatively affect patient health-related quality of life (HRQOL). Therefore, the use of well-validated instruments when assessing the HRQOL of patients with brain tumors is particularly important. Most HRQOL instruments were developed in English for a predominantly English-speaking population. The European Organization for Research and Treatment of Cancer brain cancer module (EORTC QLQ-BN20) has previously been tested in English-speaking patients and was shown to have adequate psychometric properties [1]. The QLQ-BN20 was translated into Korean by the EORTC group [2], and its reliability and validity have been evaluated, although only in a pilot study. The standard Korean version of EORTC Quality of Life Core Questionnaire (QLQ-C30) is, overall, a valid instrument for the assessment of HRQOL in Korean patients with breast, stomach, colon, and rectal cancers [3], although its reliability and validity have not yet been specifically evaluated in patients with brain tumors. Hence, the purpose of the current study was to examine the validity and reliability of the Korean version of the QLQ-BN20 and QLQ- C30 in patients with brain tumors.

Methods

Sample and setting

This study used a prospective, descriptive cross-sectional design in which the convenience sampling was performed at a tertiary-care university hospital in Seoul, Korea. To be included in the study, patients had to be aged >18 years, fluent in Korean, and diagnosed with a histologically verified brain tumor. Patients who had not undergone surgery or those with metastatic brain tumors were excluded. The study was approved by the institutional review board of Asan Medical Center. Written informed consent was obtained from all patients or their legally authorized representative.

Instruments

The EORTC QLQ-BN20 is a 20-item questionnaire grouped into 4 domains and 7 single items [1], while the QLQ-C30 comprises a 30-item questionnaire. Raw scores were standardized by linear transformation so that the final scores were in the range of 0 –100. Higher scores on the global QOL and functional scales represent a better QOL, whereas high symptom scale scores indicate significant symptoms or greater difficulty.

Statistical analysis

Multi-trait scaling was employed to examine the scale structure of the instruments. Item-scale correlations ≥ 0.40 were considered to validate item convergent. Item-scale discriminant validity was examined by comparing the correlation of each item with its own scale versus that with other scales. We expected a high correlation between items (2 standard errors) with their own scale rather than with other scales. The internal consistency reliability of the instruments was estimated using Cronbach’s alpha coefficient (α). Values of α ≥ 0.70 were considered acceptable for group comparisons.

The known group validities were tested by comparing the score of the Korean version of the QLQ-BN20 and QLQ-C30 between patient groups. First, we hypothesized that patients with a high performance status (Karnofsky performance scale [KPS] score > 70) would report lower levels of symptoms and a better QOL than patients with a low performance status (KPS score ≤ 70). Second, patients with glioma were expected to report a higher level of symptoms than patients with meningioma. Finally, patients who received adjuvant therapy (chemotherapy, radiation therapy, or both) were expected to report a higher level of future uncertainty than patients who underwent surgery alone.

Construct validity was then examined by calculating the correlations between the multi-item scales of the QLQ-BN20 and QLQ-C30. We expected moderate correlations between future uncertainty and emotional functioning, visual disorder and cognitive functioning, motor dysfunction and physical functioning, and communication deficit and social functioning (r > 0.40). We used SPSS 20.0 (SPSS Inc., Chicago, IL, USA) for data analysis. Values of p < 0.05 were considered statistically significant. Missing data were excluded from the analysis.

Results

Of 350 invited patients, 317 (90.6%) agreed to participate, and a total of 307 (87.7%) useful questionnaires were analysed. The mean age was 47.95 ± 13.64 years (range, 18–81 years). Fifty-seven percent of the patients were women. Most patients were married (72%), and 37.5% had completed college or graduate school. Gliomas (39.7%) constituted the most frequent tumor type. More than half of the patients underwent surgery alone (Table 1).

Table 1 Socio-demographic and clinical characteristics (N=307)

Scale reliability and scale structures

As shown Table 2, with the exception of the cognitive functioning (0.60) and nausea and vomiting scales (0.64), most of the multi-item scales of the QLQ-C30 and the all scales of the QLQ-BN20 met the minimal standard of reliability (α > 0.70).

Table 2 Scale reliability and structures

The all-item scale of the QLQ-C30 correlated with its own scale with a value of r ≥ 0.50, corrected for overlap, and met the recommended psychometric standards. Item-scale correlations of the QLQ-BN20 indicated that each item had a stronger significant correlation with its own scale (range, 0.72–0.92) than with other scales (range, 0.18–0.50).

Clinical validity

As hypothesized, patients with higher KPS scores had significantly better functioning and lower symptom scores on all of the QLQ-C30 and QLQ-BN20 multi-item scales than patients with lower KPS scores (p < 0.001–0.005, Table 3). Patients with glioma had significantly lower physical, cognitive, and social functioning scores as well as higher future uncertainty, motor dysfunction, and communication deficits than patients with meningioma (p < 0.001–0.02). Patients who underwent surgery plus adjuvant therapy reported lower functioning (p < 0.001–0.03) and poor QOL (p = 0.01), but higher future uncertainty (p = 0.02) and great communication deficits (p = 0.03) than those who underwent surgery alone. The 4 scales of the QLQ-BN20 were moderately correlated with some of the QLQ-C30 scales as hypothesized (r = −.410 – -.642, Table 4).

Table 3 Mean scores (standard deviation) of clinically distinct groups
Table 4 Correlations between the Korean versions of the QLQ-C30 and QLQ-BN20

Discussion

The validity and reliability of the EORTC QLQ-BN20 and QLQ-C30 have been shown in studies across various countries [4, 5]. Here we present the results of a study using the Korean version of the QLQ-BN20 and QLQ-C30 in patients with brain tumors.

The internal consistency reliability coefficients of the QLQ-BN20 were high, although our subjects had various types of brain tumors. These reliability coefficients are higher than those reported in studies of the English [1] and other language versions [4, 6]. Our results suggest that the QLQ-BN20 can be used to assess the HRQOL in patients with other brain tumors as well as in those with brain cancer.

Most of the item-subscale correlation coefficients in the QLQ-C30 met the required convergent and discriminant validity standards. This has not always been the case; for example, the structures of the Mexican-Spanish and Greek versions of the QLQ-C30 had weaknesses with respect to both their assessment of the cognitive functioning scale and the nausea and vomiting item [710]. In this study, we confirmed that the cognitive functioning scale and the nausea and vomiting item were improved as shown in an earlier study of Korean patients with cancer [3].

Most scores of the 2 instruments could clearly distinguish between patient groups according to performance status as shown in earlier studies [4, 6, 10, 11]. The second hypothesis, that the 2 instruments could distinguish patients with glioma from those with meningioma, was partially supported. A Chinese group [12] reported that most scores of the EORTC-C30 were able to distinguish among patients with different brain tumor types. However, they did not compare differences of functioning, QOL and symptoms among specific tumor types (e.g. glioma vs meningioma) using a post-hoc test.

The third hypothesis, that the adjuvant treatment group had greater future uncertainty than the group who underwent surgery alone, was supported. This may have been because the patients who received adjuvant therapy after surgery perceived themselves to have more severe disease than those who underwent surgery alone.

Finally, we confirmed that the construction validities of the QLQ-BN20 and QLQ-C30 were acceptable, with correlation coefficients values of r > 0.40. These results are similar to those of the previous validation studies of the QLQ-BN20 and QLQ-C30 conducted for patients speaking English [1], Persian [6], and 15 other languages [4].

The present study had some limitations. First, there might have been sampling bias since all participants were recruited from a single hospital and accurate sample size for this study was not calculated. Second, we did not include responsiveness analysis because of the lack of follow-up data. Therefore, these results should be carefully interpreted and implemented.

In conclusion, our findings suggest that the Korean versions of the EORTC QLQ-BN20 and QLQ-C30 are clinically valid and reliable for measuring HRQOL and are suitable for use in both clinical practice and clinical studies, involving Korean patients with brain tumors.