Introduction

Malignant primary brain tumors (PBTs) are responsible for 2.7% of cancer deaths worldwide [1]. In the United States of America (USA), the incidence rate of PBTs is approximately 14.8/100,000/year, and morality rate is greater in males (5.6/100,000) as compared to females (3.7/100,000) [2]. In developing countries, however, both the incidence and mortality rates of PBTs is lower than those seen in developed countries [2]. In Pakistan, which is a lower-middle-income country (LMIC) in South Asia, malignant PBTs comprise around 3.6% of all malignancies [3]. However, the levels of distress experienced by patients with malignant PBTs is higher than that experienced by patients suffering from most other types of malignancies [4].

Quality of life (QoL) is a broad, multi-faceted concept that encompasses functionality and well-being in the physical, emotional, and psychosocial domains [5]. It is an increasingly important outcome in clinical neuro-oncology [5]. The vast majority of patients with PBTs face varying levels of physical, emotional, or cognitive distress. This distress may be attributed to factors such as physical disability, disfigurement, sensorimotor deficits, losses of individual freedoms, employment, and income, and social stigma [4]. Moreover, mental health outcomes, particularly depression, are strongly associated with poorer QoL in patients with PBTs [6]. Resilience, which is the capacity of individuals to maintain stable physical and cognitive functionality despite the many challenges of cancer, may help protect against adverse mental health outcomes and improve QoL in patients with PBTs [7,8,9]. Amongst the several tools designed to assess QoL amongst patients with PBT, the EORTC QLQ-C30 (European Organization for Research and Treatment of Cancer Quality of Life Questionnaire) and its brain tumor-specific module EORTC QLQ-BN20 (EORTC QLQ-Brain Neoplasms 20) have proved to be brief, reliable, and valid assessment measures [10, 11].

Differences in languages and cultures across the world have led to the translation of the EORTC QLQ-C30 and QLQ-BN20 into many different languages. The EORTC QLQ-C30 has been used in over 3000 studies to date and has been translated and approved in over 100 languages [12]. Although this includes Urdu, the national and official language of Pakistan, the validation of the Urdu version of the EORTC QLQ-C30 in a Pakistani population has to the best of our knowledge only been carried out in a cohort of 70 patients with hematologic malignancies [13]. The EORTC QLQ-BN20 has been less widely translated and validated, and never before in Urdu in a Pakistani population of patients with PBT [12]. There is an increasing need to assess the QoL experienced by patients with PBTs in Pakistan using proven tools such as the EORTC QLQ-C30 and QLQ-BN20. Due to cultural differences and a sizeable percentage of patients with PBT being from lower socioeconomic and less educated backgrounds, the English versions of the EORTC QLQ-C30 and QLQ-BN20 are of limited utility in a Pakistani setting. Moreover, although more than 10 different languages are spoken by the different cultural groups and ethnicities in Pakistan, Urdu is spoken and understood throughout the country. Thus, this study aimed to formulate and validate an appropriately translated Urdu version of the EORTC QLQ-C30 and QLQ-BN20 amongst patients with PBT in Pakistan. In addition, this study also aimed to assess the correlation of QoL with resilience, depression, and anxiety.

Methods

Study tools

The two tools validated in this study were the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) and the EORTC QLQ-Brain Neoplasms 20 (EORTC QLQ-BN20). Permission was obtained from the EORTC for the translation and validation of both tools. In addition, to explore the correlation of QoL with resilience, depression, and anxiety, two additional tools were included in the survey: Wagnild and Young’s Resilience Scale (RS-14) and Hospital Anxiety and Depression Scale (HADS). Thus, the final survey instrument consisted of a section on demographic and clinical characteristics, followed by the EORTC QLQ-C30, EORTC QLQ-BN20, RS-14 and HADS:

  • EORTC QLQ-C30: A 30-item QoL measure for patients with cancer. The tool comprises five multi-item functional scales (physical, role, cognitive, emotional, and social), three symptom scales (fatigue, pain, and nausea and vomiting), a global health and QoL scale, and single items for measurement of other symptoms frequently experienced by cancer patients (such as dyspnea, appetite loss, sleep disturbance, constipation, and diarrhea), in addition to the perceived financial impact of the disease and treatment [10]. All items are scored using a 4-point Likert scale (1: ‘not at all’; to 4: ‘very much’), except for two items in the global health/QoL scale which instead employ modified 7-point linear analog scales [14]. The functioning and global QoL subscales are scored ranging from 0 to 100, where higher scores imply favorable conditions. However, while symptom subscales are also scored ranging from 0 to 100, higher scores in these subscales imply greater symptoms i.e., unfavorable conditions.

  • EORTC QLQ-BN20: A 20-item QoL measure specifically for patients with primary brain neoplasms [11]. The tool comprises four domains all relevant to the disease (future uncertainty, visual disorder, motor dysfunction, and communication deficit), in addition to seven single items (headaches, seizures, drowsiness, hair loss, itchy skin, weakness of legs, bladder control). All items are scored using a 4-point Likert scale (1: ‘not at all’; to 4: ‘very much’) and are then linearly converted to a 0–100 scale, where a higher score implies unfavorable conditions.

  • RS-14: A 14-item measure of five core characteristics of resilience (purposeful life, perseverance, equanimity, self-reliance and existential loneliness) that uses a 7-point Likert Scale to calculate an aggregate score for resilience [15]. The higher the score on the RS-14, the higher the resilience. The validated Urdu version of RS-14 was used, which has an acceptable Cronbach’s alpha of 0.763 [16]. We re-verified the internal consistency of the translated RS-14 and found a Cronbach’s alpha of 0.903, demonstrating excellent internal consistency of the RS-14 in the current population of brain tumor patients.

  • Hospital Anxiety and Depression Scale (HADS): A 14-item tool using a 4-point ordinal scale to measure depression and anxiety. The lower the score on the HADS, the more favorable the outcome. The Urdu version of the HADS [17] was used. The Urdu version of the HADS has been validated in pregnant females, and has been found to have an overall Cronbach’s alpha of 0.84 [18]. The Cronbach’s alpha for the depression and anxiety subscales were 0.64 and 0.82, respectively [18]. We re-evaluated the internal consistency for our sample. While the tool's overall internal consistency (Cronbach’s alpha: 0.89) and that of the anxiety subscale (Cronbach’s alpha: 0.81) were comparable to the previously reported values, the Cronbach’s alpha for the depression scale amongst our sample (Cronbach’s alpha: 0.86) was higher than previously reported.

Translation of study tools and pilot testing

For purpose of validation in Urdu, the EORTC QLQ-C30 and QLQ-BN20 underwent a translation process in accordance with EORTC standards [19] and COSMIN Study Design Checklist for Patient-reported Outcome Measure Instruments [20]. Two translators initially translated the English versions of the EORTC QLQ-C30 and QLQ-BN20 into Urdu independently. Both translators involved in the forward translation were bilingual, with native proficiency in Urdu and full professional proficiency in English, and with more than 7 years of experience in translation of healthcare-related surveys. To reduce the risks of bias and to identify subtle discrepancies, one of the translators was aware of the constructs that the tools were intended to measure, while the other translator was naïve to the intended purpose of the tools. One consolidated Urdu version was then produced, which subsequently underwent backwards translation to English by two translators independently. Both translators involved in the backwards translation were native Urdu speakers with full professional proficiency in English, and were naïve to the intended purpose of the tools. A single consolidated backwards-translated English version of each tool was created and reviewed by the research team for consistency with the original English tools. Differences in the translation were then reviewed and settled in the presence of a third independent translator who was aware of the constructs measured by the tools. The difficulties encountered during the translation process are described in the Additional files 1 and 2. The preliminary Urdu translated versions were pilot tested on 25 (i.e., 10% of calculated minimum required sample size) brain tumor patients who were native Urdu speakers. This pilot testing, a form of linguistic validation of the comprehensibility of the tools, was performed in accordance with the guidelines published by the EORTC [21]. Prior to the administration of the tool via individual interviews, patients participating in the pilot testing were instructed to comment on whether any questions in the tool were difficult to understand, difficult to answer, confusing, upsetting, or offensive. Patients were also asked to rate the comprehensibility of each item in the tools using a Likert scale of 1–4 (see “Content validity index”). No major areas of improvement were identified by participants during the pilot testing, and only a few minor revisions were effected to produce a final Urdu translation of both tools. These final forms were included in the survey instrument of the current study.

Study setting

This survey was conducted over the period November 2019 to May 2020 at the Aga Khan University Hospital (AKUH), which is a Joint Commission International Accreditation (JCIA-accredited) hospital in Karachi, Pakistan.

The institutional review board of AKUH granted ethical approval for this study (Reference Number: 5154-Sur-ERC-17). The complete protocol of this study has been published by the authors [8].

Study subjects and sampling

Our target population was adult (≥ 18 years) patients treated for primary malignant brain tumors at AKUH. Patients were included if they were currently ≥ 4 weeks post-initiation of treatment, provided written informed consent for participation, and were residing in Pakistan for at least the past 3 months. The lattermost criterion was to ensure the validation of the EORTC QLQ-C30 and QLQ-B20 tool was achieved while measuring QoL in the context and setting of Pakistan, as patients residing abroad may experience QoL different to their counterparts residing in Pakistan.

Exclusion criteria included patients with history of psychiatric illness or on prescription psychiatric medications, or with debilitating comorbidities such as stroke or renal failure. However, patients with comorbid hypertension (HTN), type 2 diabetes mellitus (T2DM) or chronic obstructive pulmonary disease (COPD) were not excluded from the sample, as the prevalence of these comorbidities is high amongst the Pakistani population [22]. Moreover, these comorbidities are commonly seen in patients with brain tumors [23]. Thus, including such patients ensured a representative population.

Non-probability consecutive sampling was used for recruiting participants. Trained research assistants approached brain tumor patients visiting AKUH as per their scheduled appointments at the surgical/oncology clinics. Potential participants were screened for eligibility by the research assistants. After providing their informed consent, they were administered the EORTC QLQ-C30 and QLQ-BN20 tools as part of a survey conducted as an interview by the research assistants. Although both the EORTC QLQ-C30 and QLQ-BN20 tools are generally self-administered, we opted to routinely administer them via interviews so as to include patients within our setting who lacked the literacy to read (similar to Montazeri et al. in their Iranian translation and administration [24]). A minimum sample size of 250 was required. This was calculated using the one population mean formula, a standard deviation (SD) of 20, 5% level of significance with precision of 2.5, and by adjusting the sample size for 10% rate of incomplete responses [8].

Content validity index

Content validation of the EORTC QLQ-C30 and QLQ-BN20 tool was performed with accordance to the COSMIN Study Design Checklist for Patient-reported Outcome Measure Instruments [20]. Five experts (a psychologist, epidemiologist, biostatistician, neurosurgeon, and mental health researcher) were asked to rate the relevance of all items of both tools using a Likert scale of 1–4. In addition, patients participating in the pilot testing were requested to rate the relevance and clarity (comprehensibility) of each item in the tools using a Likert scale of 1–4. However, the comprehensiveness of EORTC QLQ-C30 and QLQ-BN20 was not assessed from the professionals’ or patients’ perspective. Both tools used were pre-constructed and have been used in their current form in studies globally, and the comprehensiveness of their content is well established. Content validity index (CVI) scores for clarity and relevance of the tools were calculated as described below in the Statistical Analysis section.

Statistical analysis

Data was analyzed using SPSS Statistics for Windows version 23.0 (IBM Corp., Armonk, N.Y., USA). Two members of the research team were involved in data analysis. Categorical variables were represented as frequencies and percentages. Numerical variables were represented by mean and standard deviation or median and interquartile range. The Pearson correlation coefficients were calculated to determine the construct validity between EORCT QLQ-C30 and QLQ-BN20 with RS-14 and HADS. Reliability was estimated using the Cronbach’s alpha coefficient, with a value ≥ 0.70 considered acceptable. Content validation index (CVI) was reported to determine the relevance and clarity of the content of the tool. For each item, the score suggested by the raters was summed up and divided by the total number of raters to obtain an average score for each item. The sum of the average score of each item was further divided by the total number of items to obtain a CVI score ranging from 0 to 1 (1 = perfect agreement; and 0 = no agreement). Interscale correlations were also calculated for both the EORCT QLQ-C30 and QLQ-BN20. A p value of < 0.05 was considered significant.

Results

Sample characteristics

A total of 255 patients with PBTs were approached for inclusion in this survey. However, five amongst these were ineligible (1 patient no longer resided in Pakistan and 4 patients had pre-existing psychiatric disorders), leaving a total of 250 participants in the final sample. Their mean age was 44 ± 0.83 years, and 68% were male. The most common mother tongue was Urdu (30.8%), with others including Sindhi (18.8%), Punjabi (14.8%) and Pushto (10.4%). However, every patient was able to understand and fluently converse in Urdu, with no language barriers to administering the survey. The diagnoses of PBTs included glioma (46.8%) and meningioma (21.2%). Most participants had undergone tumor biopsy (78%). 9.6% patients reported receiving radiotherapy, 4.4% chemotherapy, and 25.2% combination therapy, while 60.8% reported no adjuvant therapy (Table 1).

Table 1 Participant demographic characteristics and disease-related factors

Internal consistency or reliability

Table 2 depicts the reliability of the EORTC QLQ-C30 tool and QLQ-BN20 tool. The Cronbach alphas for the 30 items of the QLQ-C30 and the 20 items of the QLQ-BN20 were 0.860 and 0.880, respectively, indicating good internal consistency of both tools. The internal consistency for the global health status scale of the QLQ-C30 was also good (Cronbach’s alpha = 0.800; p < 0.001). The Cronbach’s alpha for the 15 items of the functional scale of the QLQ-30 tool was 0.74 (range: 0.630–0.830), indicating acceptable to good internal consistency (p < 0.001). The overall internal consistency for the 11 items of the symptom scale of the QLQ-C30 was also good (Cronbach’s alpha 9 = 0.82 (range: 0.800–0.86); p < 0.001). Lastly, the consistency of the four domains of the QLQ-BN20 demonstrated acceptable-to-good consistency (Cronbach’s alpha range: 0.703–0.868).

Table 2 Internal consistency of EORTC QLQ-C30 (1A) and QLQ-BN20 (1B)

Content validity

The expert-reported CVI scores for relevance of the Urdu version of the EORTC QLQ-C30 and QLQ-BN20 tool were 0.96 and 0.95, respectively, indicating excellent agreement among the five experts. The patient-reported CVI scores for clarity and relevance of the Urdu version of the EORTC QLQ-C30 were 0.92 and 0.93, respectively. Similarly, the patient-reported CVI scores for clarity and relevance of the Urdu version of the QLQ-BN20 tool were 0.80 and 0.93, respectively. These results indicate good-to-excellent agreement among the 25 patients for clarity and relevance of both tools.

Construct validity

The correlation of QoL (as measured by EORTC QLQ-C30 and QLQ-BN20) with resilience (RS-14) was assessed using Pearson correlation coefficients (Tables 3, 4). We observed a significant moderate positive correlation between global status of EORTC QLQ-C30 and resilience (r = 0.422; p value < 0.001). Similarly, there was a significant moderate positive correlation between the 5 domains of the functional scale of EORTC QLQ-C30 and resilience (r ranging from 0.462 to 0.570; p value < 0.001). Lastly, there was a significant moderate negative correlation between future uncertainty and resilience (r = −0.473, p < 0.001).

Table 3 Correlation between QLQ-C30 with resilience, depression and anxiety (construct validity)
Table 4 Correlation between QLQ-BN20 with resilience, depression and anxiety (construct validity)

When assessing the correlation between the EORCT QLQ-C30 and the QLQ-BN20 tool with depression and anxiety as measured using the HADS, we observed a significant moderate negative correlation between global status of EORTC QLQ-C30 with depression (r = −0.541; p < 0.001) and anxiety (r = −0.502; p < 0.001). Additionally, there was a significant moderate-to-strong negative correlation between the 5 domains of the functional scales of the EORTC QLQ-C30 i.e., with both depression (r ranging from −0.467 to −0.688; p < 0.001) and anxiety (r ranging from −0.276 to −0.704; p < 0.001). There was a strong positive correlation between future uncertainty and depression (r = 0.614, p < 0.001) and moderate correlation between future uncertainty and anxiety (r = 0.514, p < 0.001). The Pearson’s correlation coefficients for the EORTC QLQ-C30 and QLQ-BN20 with the HADS are shown in Tables 3 and 4, respectively.

Inter-scale correlation of EORTC QLQ-C30 and EORTC QLQ-BN20

When assessing inter-scale correlations of EORTC QLQ-30, we observed a significant weak-to-moderate positive correlation between global QoL and the 5 functional domains (r ranging from 0.384 to 0.561; p < 0.001). We observed a significant moderate-to-strong positive correlation (r ranging from 0.40 to 0.68; p < 0.01) within the 5 functional domains of EORCT QLQ-30, with strongest correlation between physical and role functioning (r = 0.600; p < 0.001), physical and social functioning (r = 0.631; p=0.001), role and social functioning (r = 0.680; p < 0.001).

Moreover, a significant weak-to-strong negative correlation was observed between 5 functional domains and 8 symptom domains of EORCT QLQ-30 (r ranging from −0.172 to −0.690; p < 0.01), with strongest correlations between physical functioning and insomnia (r = −0.690; p < 0.001) and role functioning and insomnia (r = −0.641; p < 0.001). In addition, within the symptom domains, the strong correlation was between fatigue and pain (r =0.602; p < 0.001).

When assessing inter-scale correlations of EORTC QLQ-BN20, we observed a significant weak-to-strong positive correlation of future uncertainty and 9 symptom domains (r ranging from 0.245 to 0.628; p < 0.001), with the strongest correlation between future uncertainty and drowsiness (r = 0.628; p < 0.001). The strongest correlations within the symptom domains were between motor dysfunction and drowsiness (r = 0.605; p < 0.001) and motor dysfunction and weakness in leg (r = 0.683; p < 0.001).

The interscale Pearson’s correlation coefficients for EORTC QLQ-C30 and QLQ-BN20 are shown in Tables 5 and 6, respectively.

Table 5 Interscale correlation of EORTC QLQ-C30
Table 6 Interscale correlations of EORCT QLQ-BN20

Discussion

Though health related QoL is an increasingly important outcome in the management of patients with malignant PBTs, it is challenging to measure QoL of patients with PBTs in Pakistan due to the lack of specific tools available in Urdu. In this regard, we attempted to translate and validate the EORTC QLQ-C30 and QLQ-BN20 in Urdu, the national and official language of Pakistan, to provide a valuable tool for the measurement of QoL of patients with PBTs in a clinical setting. With overall high internal consistency (Cronbach’s alpha: 0.86 and 0.88, respectively), and content validity index scores for patient-reported clarity (0.92 and 0.80, respectively) and expert-reported relevance (0.96 and 0.95, respectively), the Urdu versions of the EORTC QLQ-C30 and QLQ-BN20 tools displayed good validity in patients with PBTs in our study.

The overall experience of the translation and validation process of the EORTC QLQ-C30 and QLQ-BN20 in Urdu was relatively straightforward. The translations in Urdu were able to accurately convey the intended English equivalents, and none of the patients had any major issues with understanding the final Urdu version of either tool. This was encouraging, as the Urdu translations of both tools displayed high content validity for clarity and relevance. Moreover, as both tools were administered via interview as opposed to self-administration, our results also demonstrate the undiminished value of the EORTC QLQ-C30 and QLQ-BN20 tools in populations of LMICs where literacy may be low [24]. This study was conducted in Karachi, the largest city of the country, and home to all major ethnicities living in Pakistan. In addition, the Aga Khan University Hospital (AKUH) is one of the largest private quaternary care hospitals catering to diverse ethnic and socioeconomic groups within Karachi. The patients with PBTs in our sample represented heterogenous ethnicities and socioeconomic strata from within Pakistan, as evidenced by the diversity of mother tongues (Urdu: 30.8%, Sindhi: 18.8%, Punjabi: 14.8%, Pushto: 10.4%, Saraiki: 4.4%, Balochi 4%, and Hindko: 2.4%) and distribution of monthly family incomes. The diversity in patients’ demographics in our study strengthens the generalizability and utility of the Urdu translation of the EORTC QLQ-C30 and QLQ-BN20 as tools for measuring QoL within the sociocultural context of Pakistan.

The internal consistency of the Urdu version of the EORTC QLQ-C30 was high for the overall tool (Cronbach’s alpha: 0.86) and the global scale (0.80). The only other study reporting translation and validation of the EORTC QLQ-C30 in Urdu that we were able to find also reported a high Cronbach’s alpha of 0.82 for the overall tool [13].

Moreover, internal consistency was within minimum-to-ideal ranges (0.75–0.83) for most functional subscales in the current study. The exception was the subscale for cognitive functioning (0.63). The relatively low internal consistency of the cognitive functioning subscale has been observed in previous translation and validation studies in Korean (0.60) [25], Chinese (0.45) [26] and Japanese (0.63) [27], and also in validation studies of the original version of the EORTC QLQ-C30 by Aaronson et al. (0.54) [10], Bjordal et al. (0.28) [28], Osoba et al. (0.56) [29], and Ringdal et al. (0.65) [30]. Thus, our study also corroborates findings in existing literature which suggest that the cognitive functioning subscale of the EORTC QLQ-C30 has low reliability. This may be because memory and concentration are different aspects of cognitive functioning [26], although it is clinically more useful to measure cognitive functioning as a single construct. Lastly, in our sample, the consistency of the overall tool (0.88) and the four domains of symptoms in the QLQ-BN20 also demonstrated above minimum psychometric consistency (0.703–0.868). These ranges are similar to translation and validation studies conducted in other languages, including Persian (0.740–0.890) [31] and Chinese (0.753–0.869) [32]. Moreover, a review of the internal consistencies of the different translations of the EORTC QLQ-BN20 reviewed an overall range of internal consistency of the four subscales ranging from 0.71–0.90 [33]. Our results thus align with previous evidence and confirm the reliability of the Urdu version of the EORTC QLQ-BN20 in a Pakistani setting.

Lastly, inter-scale correlations for the Urdu version of the EORTC QLQ-C30 and QLQ-BN20 were significant in the expected directions, and similar to those seen previous studies validating the Iranian and Korean versions [24, 25].

The Urdu versions of both the EORTC QLQ-C30 and QLQ-BN20 tools also demonstrated acceptable construct validity, with the global quality of life domain showing significant positive correlations with resilience (r = 0.422), and significant negative correlations with depression (r = −0.541) and anxiety (r = −0.502). Previous studies have also reported similar positive correlations between QoL and resilience [34, 35] and similar negative correlations between QoL and depression and anxiety in cancer patients [36, 37]. In addition, the Hindi translation of the EORTC QLQ-BN20 has also demonstrated significant correlations with depression and anxiety as measured by HADS [38].

The Urdu translations of the EORTC QLQ-C30 and QLQ-BN20 hold considerable clinical relevance for the management of patients with malignant PBTs in Pakistan. Keeping in mind cultural differences in Pakistan and that most patients with PBT belong from less educated backgrounds, it is crucial to have tools that provide valid assessments of QoL in a Pakistani population. Urdu, being the national and official language, is spoken and understood throughout the country. Thus, our translation and validation of the EORTC QLQ-C30 and QLQ-BN20 provides a comprehensive, valid, and reliable method for the measurement of QoL in patients with malignant PBTs in Pakistan. Assessment of QoL may be incorporated into the routine management of patients with malignant PBTs in Pakistan, particularly in holistic goal setting, prognostication, and monitoring the impact of disease and therapy. In addition, the tools also provide a much-needed measure of patient-reported outcomes in clinical cancer research in Pakistan.

Our study has a few limitations. Firstly, we did not perform test-retest analysis to investigate stability. Additionally, the cross-sectional nature of the study does not capture changing relationships between QoL, resilience, depression, and anxiety. However, our study is the first validating the Urdu versions of EORTC QLQ-BN20 and QLQ-C30 in a sample of patients with malignant PBTs in Pakistan. Our results provides the preliminary base for further psychometric evaluation.

Conclusion

Our study performed the translation of the EORTC QLQ-C30 and QLQ-BN20 to Urdu as per EORTC guidelines. The Urdu versions of the EORTC QLQ-C30 and QLQ-BN20 demonstrated good validity amongst patients with primary brain tumors. Thus, our study confirms the EORTC QLQ-C30 and QLQ-BN20 as valuable clinical tools for the measurement of QoL in primary brain tumors patients within the cultural and socioeconomic context of Pakistan.