Japanese value set for the EORTC QLU-C10D: A multi-attribute utility instrument based on the EORTC QLQ-C30 cancer-specific quality-of-life questionnaire

Shiroiwa, T.; King, M. T.; Norman, R.; Müller, F.; Campbell, R.; Kemmler, G.; Murata, T.; Shimozuma, K.; Fukuda, T.

doi:10.1007/s11136-024-03655-7

Japanese value set for the EORTC QLU-C10D: A multi-attribute utility instrument based on the EORTC QLQ-C30 cancer-specific quality-of-life questionnaire

Open access
Published: 09 May 2024

Volume 33, pages 1865–1879, (2024)
Cite this article

Download PDF

You have full access to this open access article

Quality of Life Research Aims and scope Submit manuscript

Japanese value set for the EORTC QLU-C10D: A multi-attribute utility instrument based on the EORTC QLQ-C30 cancer-specific quality-of-life questionnaire

Download PDF

611 Accesses
Explore all metrics

Abstract

Purpose

This study aimed to develop a Japanese value set for the EORTC QLU-C10D, a multi-attribute utility measure derived from the cancer-specific health-related quality-of-life (HRQL) questionnaire, the EORTC QLQ-C30. The QLU-C10D contains ten HRQL dimensions: physical, role, social and emotional functioning, pain, fatigue, sleep, appetite, nausea, and bowel problems.

Methods

Quota sampling of a Japanese online panel was used to achieve representativeness of the Japanese general population by sex and age (≥ 18 years). The valuation method was an online discrete choice experiment. Each participant considered 16 choice pairs, randomly assigned from 960 choice pairs. Each pair included two QLU-C10D health states and life expectancy. Data were analyzed using conditional logistic regression, parameterized to fit the quality-adjusted life-year framework. Preference weights were calculated as the ratio of each dimension-level coefficient to the coefficient for life expectancy.

Results

A total of 2809 eligible panel members consented, 2662/2809 (95%) completed at least one choice pair, and 2435/2662 (91%) completed all choice pairs. Within dimensions, preference weights were generally monotonic. Physical functioning, role functioning, and pain were associated with the largest utility weights. Intermediate utility weights were associated with social functioning and nausea; the remaining symptoms and emotional functioning were associated with smaller utility decrements. The value of the worst health state was − 0.221, lower than that seen in most other existing QLU-C10D country-specific value sets.

Conclusions

The Japan-specific QLU-C10D value set is suitable for evaluating the cost and utility of oncology treatments for Japanese health technology assessment and decision-making.

Australian Utility Weights for the EORTC QLU-C10D, a Multi-Attribute Utility Instrument Derived from the Cancer-Specific Quality of Life Questionnaire, EORTC QLQ-C30

Article Open access 21 December 2017

French Value-Set of the QLU-C10D, a Cancer-Specific Utility Measure Derived from the QLQ-C30

Article 15 June 2020

German value sets for the EORTC QLU-C10D, a cancer-specific utility instrument based on the EORTC QLQ-C30

Article Open access 04 September 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Plain English summary

1. Why is this study needed?

The EORTC QLU-C10D is a preference-based multi-attribute utility instrument (MAUI) derived from the EORTC Quality of Life Questionnaire-Core 30 (QLQ-C30), a health-related quality-of-life (HRQL) questionnaire widely used in cancer clinical trials internationally. The QLU-C10D enables quantification of utility from responses to the QLQ-C30, and hence enables HRQL data to be used in health policy decisions about cancer. Such decisions are typically made within a country, relating to the health budget of that country, or of regional and local health authorities. Therefore, country-specific ‘value sets’ based on the values and preferences of the general population of specific countries are needed.

2. What is the key problem/issue/question this manuscript addresses?

No value sets for the EORTC QLU-C10D existed for Japan prior to this study.

3. What is the main point of your study?

The valuation survey used to develop the Japanese QLU-C10D value set followed the standard protocol developed by the Multi-Attribute Utility in Cancer (MAUCa) Consortium for evaluating the EORTC QLU-C10D in general population samples. The valuation method used was an online discrete choice experiment. The resultant value set enables HRQL data from the EORTC QLQ-C30 to be used in Japanese health policy decisions and health technology assessment.

4. What are your main results and what do they mean?

A Japanese value set for the EORTC QLU-C10D was created. Physical functioning, role functioning, and pain were associated with the largest utility weights. The value of the worst health state was -0.221, lower than that seen in most other existing QLU-C10D country-specific value sets.

Introduction

When economic evaluation of healthcare technologies is performed, quality-adjusted life years (QALYs) are standardly used for outcome measurement. QALYs can be calculated by weighting life years by the utility of the health state[1]. In Japan, since 2019, economic evaluation submissions are required for selected drug and medical device pricing before the Ministry of Health, Labour and Welfare (MHLW) can approve higher prices than for existing drugs[2]. As of December 2022, evaluations of 39 drugs and devices were completed or are in progress. The guideline for submission to the authority [3] indicates that “QALY should be used in principle” and “If Japanese quality-of-life (QOL) scores (utilities) are newly collected for a cost-effectiveness analysis, EQ-5D-5L is recommended as the first choice.” However, it does not preclude the use of alternative utility instruments.

For the purposes of economic evaluation, utility is anchored as 0 = dead and 1 = full health, which is necessary for construction of QALYs. To obtain scores on this utility scale, we typically use a preference-based measure (PBM). Many generic PBMs have been developed to measure utility, for example, the EuroQol 5 Dimensions (EQ-5D) [4, 5], Health Utilities Index (HUI)[6, 7], and Short Form 6 Dimensions (SF-6D) [8]. On the other hand, in clinical studies, disease-specific profile-type instruments are often used to measure patients’ HRQL. However, profile-type measures cannot be used for economic evaluation because they are not preference-based and therefore do not measure utility.

In collaboration with the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group, the Multi-Attribute Utility in Cancer (MAUCa) Consortium has developed the EORTC QLU-C10D, a multi-attribute utility instrument (MAUI) derived from the EORTC Quality of Life Questionnaire-Core 30 (QLQ-C30) [9, 10]. The QLQ-C30 is the most widely used cancer-specific HRQL questionnaire [11]. But because the QLQ-C30 is a profile-type measure, it cannot be used to quality-adjust survival to calculate QALYs. The QLU-C10D was developed to enable quantification of utility from responses to the QLQ-C30. While mapping algorithms are available to derive utilities from QLQ-C30 responses through generic MAUIs [12], the QLU-C10D is potentially theoretically and empirically stronger because it comprises a descriptive system and a valuation method that complies with the Checklist for Reporting Valuation Studies [13], and aims to retain the cancer-specific sensitivity which is part of the QLQ-C30. Five of the 10 QLU-C10D dimensions capture symptoms and impacts of cancer and its treatments that are not explicitly included in generic instruments: nausea, fatigue, loss of appetite, and problems with sleep and bowel function. The other five dimensions are pain and four aspects of functioning (physical, role, social, and emotional). The QLU-C10D is not a stand-alone questionnaire; it is a MAUI that comprises a health state descriptive system plus country-specific preference weighting algorithms. Online Resource 1 shows the QLU-C10D descriptive system and explains how the 10 dimensions can be derived from 13 of the 30 items in the QLQ-C30.

The MAUCa Consortium has developed a standard protocol for evaluating the EORTC QLU-C10D in general population samples, as described. Using this method, value sets have been estimated for 11 countries so far, with more in progress [14,15,16,17,18,19,20,21,22]. This study aimed to apply this valuation method in a Japanese general population sample to produce Japan-specific utility weights and value set for the QLU-C10D, and to compare the Japanese value set to those from other countries.

Methods

A cross-sectional population-based survey was designed to collect QLU-C10D valuation data from a representative sample of the Japanese general population; the study protocol was approved by the Japanese National Institute of Public Health ethics committee (approval number NIPH-IBRA #12272). The methods were consistent with previous QLU-C10D valuation studies [18, 19, 21, 23,24,25,26,27,28].

The survey was implemented by SurveyEngine, a company specialized in online choice experiments. SurveyEngine managed sample recruitment (via a Japanese online panel), survey administration, and data collection. SurveyEngine and its panel provider complied with the International Code on Market, Opinion and Social Research and Data Analytics [29]. The survey opened on 5th February 2021 and closed on 15th March 2021. Online panel members were eligible if they were aged ≥ 18 years and able to read and understand Japanese. Online panelists received an e-mail invitation to participate, including a hyperlink to the study and survey. Panel members who attempted to enter the survey via mobile phones were screened out as the discrete choice experiment (DCE) was too complex for a small screen. Consent was sought from the remainder, who were screened for quota sampling to ensure the age and sex distributions of the sample matched those of the Japanese general population (Table 1). Participants who consented and were within quota proceeded to further survey questions.

Table 1 Self-reported sociodemographic characteristics and health of the Japanese valuation survey sample (n = 2435 participants who completed 16 Discrete Choice Experiment choice sets) compared with the Japanese general population

Full size table

A target sample size of ≥ 2000 respondents was determined to provide acceptable precision for model parameter estimates, based on the MAUCa consortium’s extensive experience with DCE valuation surveys and the number of health state comparisons in the QLU-C10D DCE [31,32,33,34]. This sample size was larger than most similar studies [35], and meets the various rules of thumb outlined by de Bekker-Grob et al. [36].

DCE valuation task

The feasibility of the implemented valuation task was previously established [10]. The valuation task involved choosing between pairs of hypothetical health states from the QLU-C10D; each pair formed a choice set. Online Resource 2 provides an example choice set from the Japanese survey. Each respondent was asked to consider 16 choice sets and indicate which health state they would prefer to live in until death. Each health state was described in terms of the ten dimensions of the QLU-C10D and a specified duration of survival (life years), which could take the values 1, 2, 5, or 10 years. Survival duration allowed the trade-off between QoL and life expectancy to be inferred, and enabled anchoring of utility scores at dead (zero life years) [31, 35]..

The QLU-C10D health state classification system has over a million possible health states (4¹⁰ = 1,048,576). To determine which of these to include in the DCE, we constructed a designed experiment of 960 choice sets that maximized statistical efficiency of the utility model parameter estimation. The DCE contained 12 attributes: 11 attributes for the 10 QLU-C10D dimensions because two attributes were used to represent physical functioning (long and short walk); survival duration was included as the twelfth attribute to enable estimation on a health utilities scale. Twelve attributes is a relatively large number for respondents to consider simultaneously, so we simplified the cognitive task in three ways [10]: (1) we constrained the number of QLU-C10D dimensions that differed between health states in any given choice set to four; (2) we highlighted in yellow the four dimensions that differed within a choice set; (3) for the physical functioning dimension, the descriptors for levels 2 and 3 are conceptually complex, so to aid respondent comprehension, the two items (‘long walk’ and ‘short walk’) were presented separately in the survey but scored as one 4-level dimension in the DCE design. We successfully used this approach in all previous QLU-C10D valuation surveys [18, 19, 21, 23,24,25,26,27,28], confirming feasibility across 8 languages and 11 countries.

The DCE used the same designed experimental as in previous QLU-C10D valuation studies; how it was constructed has been explained previously [18, 19, 21, 23,24,25,26,27,28]. The final DCE experimental design consisted of 960 choice sets, with an estimated D-efficiency of 90.4% relative to the best design with that level of overlap. There were three levels of randomization in the DCE component of the survey: (1) each respondent was randomized to answer 16 of 960 choice sets in the DCE design; (2) which option was presented as Situation A or Situation B was randomized within each choice set to mitigate any ordering bias; (3) the order of QLU-C10D dimensions was randomized for each person to prevent any order effect, with duration always presented as the last attribute.

Statistical analyses

Descriptive statistics summarized sample demographics, self-reported general health, and participant feedback on the DCE valuation task. Sample representativeness was assessed against population reference data for demographics and self-reported general health using chi-square tests and t-tests.

Analysis of the DCE data followed the MAUCa consortium’s standard approach, as described previously for other QLU-C10D country-specific value sets [18, 19, 21, 23,24,25,26, 28]. This yields utility estimates consistent with standard QALY model restrictions by using a functional form we and others have used previously [10, 33, 34, 43, 44]. The QALY model requires all health states have zero utility at dead [45]. This requirement is satisfied by Eq. 1 and 2 because they include the interaction between the QLU-C10D levels and a TIME variable representing survival duration (life years). The designed experiment allowed for all these interactions. In Eqs. 1 and 2, as TIME tends to zero, the systematic component of the utility function tends to zero. Another requirement of the QALY model is constant proportional time trade off, therefore the relationship between utility and TIME (life years) was considered to be linear.

A useful feature of this functional form is that the impact of moving away from Level 1 (no problems) in each HRQL dimension is characterized by the two-factor interaction term between the QLU-C10D levels and TIME. This enables a utility algorithm in which the effect of each level of each dimension is included as a decrement away from full health (which has a value of 1).

We analyzed the DCE data with STATA 13.0 [46] in two ways. The primary analysis used conditional logit models (Eq. 1), in which the utility of option j in choice set s for survey respondent i was assumed to be

$${U}_{isj}=\alpha {TIME}_{isj}+ \beta {X}_{isj}^{\mathrm{^{\prime}}}{TIME}_{isj}+{\varepsilon }_{isj}$$

(1)

i = 1, …, I respondents; j = situations A, B; s = 1, …, 960 choice sets.

Here, α is the utility associated with a life year,${X}_{isj}^{\prime}$ is a vector of dummy variables representing the levels of the QLU-C10D health state presented in option j, and β is the corresponding vector of utility weights associated with each level in each dimension within ${X}_{isj}^{\prime}$, for each life year. The error term ${\varepsilon }_{ isj}$ was assumed to have a Gumbel distribution.

Because each respondent assessed up to 16 choice pairs, we allowed for intra-individual correlation, using a clustered sandwich estimator to adjust the standard errors. We estimated utility decrements for each movement away from Level 1 (no problems) in each QLU-C10D dimension by dividing each β term by α [44], and used the delta method [47] in STATA to estimate standard errors and confidence intervals for these ratios.

We estimated two versions of Eq. 1. Model 1 included every decrement from the best level (i.e., Level 1, no problems) in each dimension within ${X}_{isj}^{\prime}$; thus, ${X}_{isj}^{\prime}$ contained 30 terms (i.e., 10 dimensions x (4-1) levels within each). Model 2 imposed a restriction of monotonicity in the levels of the dimensions of the QLU-C10D health state classification system by combining non-monotonic levels and re-estimating the model. Model 2 therefore included a reduced number of estimates in β (the vector of preference weights).

We conducted unweighted and weighted analyses for all models. In weighted analyses, sampling weights controlled for non-representativeness in measured respondent characteristics using the iterative proportional fitting algorithm (i.e., raking) proposed by Deming and Stephan [48], and implemented in STATA using the ipfweight command. Variance inflation due to weighting was assessed by calculating the percentage increase in the standard errors of the unweighted versus weighted coefficients.

We compared utilities derived from the Japanese QLU-C10D algorithm with those from other countries in two ways. We randomly generated 500 QLU-C10D health states, and scored each according to five country-specific algorithms, then plotted them by country, ordered them according to the Japanese values.

The following three data quality assessment metrics were assessed. We tallied the number of respondents who chose either all As or all Bs across the choice sets, then re-estimated weighted Model 2 with their data excluded. We considered the time respondents took to complete the survey. We divided respondents into deciles based on total survey time, ran a conditional logit on the DCE data in each decile, then graphed the pseudo-R² and the number of statistically significant coefficients for each decile, interpreting low values on either indicator as suggesting relatively low quality data.

Results

Sample characteristics

As Fig. 1 shows, 3513 respondents entered the survey, 2662 (76%) of whom were within sampling quotas, consented and completed at least one choice pair, and 2435 (69%) completed all choice pairs. The data from these 2435 participants were included in analyses to assess representativeness and estimate the Japanese value set.

The sample characteristics (n = 2435) are compared to published Japanese general population characteristics in Table 1. Study participants were representative in terms of sex, age, and paid employment. Our study team discussed the type and degree of non-representativeness of the remaining variables and agreed to include four variables in raking (weighting): household income, education, health status (EQ-5D), and mental health (Kessler 6). The three remaining demographics that were non-representative were not included in raking for the following reasons: Region—the Japanese population is generally homogeneous across regions; Work status—correlated with household income, which was included in raking; Relationship status—differences per category were small (< 3.2%).

Respondents’ perception of the DCE valuation task

Online Resource 3 details respondent perceptions of the DCE valuation task. In summary, 44% rated the health state presentation as ‘unclear’ or ‘very unclear,’ and 23% found it ‘clear’ or ‘very clear.’ Regarding the choice task, 66% found it ‘difficult’ or ‘very difficult’ to choose between pairs of health states, and only 7% found it ‘easy’ or ‘very easy.’ With regard to the strategy participants used to choose between pairs of health states, 32% focused on aspects highlighted in yellow, 26% considered most aspects, and 25% focused on just a few aspects. Of 103 participants provided additional detail on their strategy, length of survival time was considered by 65/103 (51%) when choosing between health states. Acceptability of burden to themselves was cited by 22/103 (17%), and burden to others was cited by 18/103 (14%). Specific symptoms (pain, appetite, sleep) were cited by a small number of respondents (8, 4, and 4, respectively).

Data quality

Online Resource 4 details the data quality findings. When data from the 73 respondents who gave either all As or all Bs across their completed choice sets was excluded, there was little difference (max absolute difference of 0.0042) and no evidence of bias (mean difference of − 0.00054) in coefficient estimates. Median survey completion was 12.5 min, minimum 3.75 min, and maximum 69.33 min. Respondents in all completion time deciles sped up as they became more familiar with the choice task (Figure A). The fastest completion time decile yielded the least statistically significant coefficients (6/31) and the slowest two deciles yielded the most (26/31 and 25/31, respectively) (Figure B). While this suggested slower respondents produced less random data, the pseudo-R² values were similar across deciles.

DCE Results

Conditional logit results for the 2435 respondents who completed all 16 choice pairs are presented in Table 2. In the unweighted Model 1 analysis, all coefficients are negative and increase in absolute terms in progressively higher levels. Dimensions with the largest impact (based on the largest absolute coefficient) are physical functioning, pain, and role functioning. When responses were weighted, some small non-monotonicities were observed in the trouble sleeping dimension. The effect of combining levels to prevent this (Model 2) was small. Figure 2 shows the impact of enforcing monotonic ordering on the coefficients (Panel A) and using weights (Panel B). Both figures report a line of best fit between models with and without these adjustments, as well as a 45 degree line reporting equality. All data points are close to the 45 degree line, illustrating minimal impact of these adjustments, and thus the preference weights are robust to them. The standard errors of Model 1 coefficients in weighted analyses were on average 48% larger (minimum 27%, median 46%, maximum 77%). The combined effect of weighting on coefficient estimates and variance inflation reduced the level of statistical significance of three coefficients from 5% to non-significant (Social L2, Emotional L2, Pain L2), one from 1% to non-significant (Emotional L3), and three from 0.1% to 5% (Role level 2, Trouble Sleeping level 3, Nausea level 2). In one case, it increased statistical significance (Bowel problems from not significant to 5% to 1%).

Table 2 Conditional logit results for Model 1 (unconstrained) and Model 2 (monotonicity imposed), unweighted and weighted analyses (estimated coefficients and robust standard errors (SE)), based on data from respondents who completed all 16 choice pairs in the discrete choice experiment (n = 2435)

Full size table

As a further robustness check, the same analyses were run including all DCE data (n = 2,662 respondents who completed at least one choice set) and the subset who completed all 16 choice pairs and all subsequent demographics (n = 2,312). As shown in Online Resources 5 and 6, results for these subsets were very similar to those in Table 2 (n = 2435), i.e., the same to two decimal places in all cases and to three decimal places in most cases.

We calculated the QLU-C10D preference weights from the unweighted Model 1 results because they were fully monotonic, and because weighting did not change the coefficient estimates much but did increase standard errors considerably. These are plotted in Fig. 3 and tabulated under the graph. As these are derived by dividing through the coefficients in Table 2 by the duration coefficient, the pattern is unchanged, with physical functioning, pain, and role functioning the largest drivers of preference. We recommend these preference weights be used in the Japanese QLU-C10D scoring algorithm, provided in Online Resource 7, including syntax for STATA and SPSS.

Japanese value set compared with other countries

The first comparison was based on four health states representing a range of health from very good to worst possible, with utility scores based on 12 country-specific utility algorithms (Fig. 4). For the best of these health states (with just a little physical functioning impairment and pain, 2111121111), the Japanese utility score ranked 8th of 12. For the health state with a little impairment in all domains (2222222222), the Japanese utility score ranked 11th, and for the health state with quite a bit of impairment in all domains (3333333333), the Japanese utility score was the lowest (rank 12/12). For the worst possible health state (very much impairment in all domains, 4444444444), the Japanese score (− 0.221) was ranked 11th, with only France having a lower value (− 0.44).

The second comparison, based on 500 randomly generated health states, compared the Japanese value set with two English-speaking countries (the United Kingdom (UK) and the United States (US) and two European countries (Spain and France) (Fig. 5). Across these health states, the Japanese values tend to lie above those from France, but below those from Spain, the UK, and the US. This suggests Japanese respondents were generally more willing to give up life expectancy for improved health than respondents in the latter three countries, but less likely than the French respondents. However, the pronounced oscillations in the lines for the four other countries indicate further complexity in the between-country story, due to variations between countries in dimension-specific preference weights.

Discussion

This study provides the Japanese value set for the EORTC QLU-C10D, endorsed by the EORTC Quality of Life Group. The largest utility weights were associated with decrements in physical functioning, role functioning, and pain. Intermediate utility weights were associated with decrements in social functioning and nausea, while the remaining symptoms and emotional functioning were associated with smaller utility decrements. Compared with the QLU-C10D value sets from other countries, the Japanese decrements in social functioning, fatigue, and appetite were the largest of the 12 countries where QLU-C10D value sets have been established. The level 4 decrements in role function and nausea were the second largest among these countries. Generally, the Japanese weights of symptom-related items were larger than the average of the 12 countries, except for pain, where it was among the smaller. In addition, the comparison based on 500 randomly generated health states revealed considerable heterogeneity among countries, reflecting variations in dimension-specific preference weights in country-specific value sets. Different dimensions may play different roles in different cultures, but the observed variations may also be due in part to linguistic non-equivalence between countries; irrespective, these justify the need for country-specific value sets.

The value of the worst health state was -0.221, which was lower than that seen in most other existing QLU-C10D country-specific value sets, excluding France. This is a surprising result compared to the Japanese EQ-5D-5L value set. The worst Japanese EQ-5D-5L index [55555] was − 0.025, [49] which is the highest value in the world. This may be due to a key difference in the valuation methodologies used to generate the two value sets; the EORTC QLU-C10D was created by DCE with the duration method while the EQ-5D-5L was performed by composite time trade-off (cTTO) by the in-person interview. According to the Japanese EQ-5D-5L value set, the Japanese are reluctant to trade health states with death, suggesting a strong risk-aversion to death. By contrast, the international comparison of the EORTC QLU-C10D value set suggests that the Japanese willingly trade life-years with their health state. It may be caused by reflection of the Japanese preference; good health states are more preferable to long life years, but death is less acceptable, compared with Western people. Of course, it is possible that methodological artifacts may have contributed to the inconsistency. One of these is the method of preference elicitation: the QLU-C10D DCE was conducted as a self-complete survey while the EQ-5D-5L was interviewer administered. Another is translation effects in the source instrument (QLQ-C30 and EQ-5D-5L) and/or the preference elicitation DCE questionnaire for the QLU-C10D versus the cTTO for the EQ-5D-5L that somehow differentially distorted the Japanese preference task of the QLU-C10D relative to that of the EQ-5D-5L.

EQ-5D-5L is now a standard instrument for utility assessment in Japan. However, in clinical trial settings, the collection of EQ-5D-5L is sometimes omitted, and some studies include only disease-specific HRQL instruments. The QLQ-C30 and FACT-G are frequently used, particularly in cancer contexts [50]. Before the QLU-C10D was created, data obtained using the QLQ-C30 could not be used directly to calculate the QALY. Therefore, when QLQ-C30 data were used for cost-effectiveness analysis, mapping from the QLQ-C30 to PBMs was sometimes used. Although a mapping algorithm from the QLQ-C30 to the EQ-5D-5L has been established in Japan [51] mapping is not necessarily recommended for estimating utility, as there is considerable uncertainty around such calculations particularly at the extremes of the utility scale. In the Japanese HTA guidelines, mapping is only allowed if utility data cannot be obtained by other methods. As the QLU-C10D is now an established PBM, its use is more acceptable than that of the mapping algorithm. Also, because the EQ-5D-5L is a generic PBM, the QLQ-C30 is likely to be more sensitive to changes and differences in the health states of patients with cancer, and may therefore capture the utility of patients with cancer more appropriately and calculate the cost per QALY more precisely. Finally, many existing clinical trials have collected QLQ-C30 data in Japan. Such accumulated data can now be converted to utilities using the scoring algorithm generated by our research, and thereby provide HRQL weighting for QALY calculation. Given these advantages the Japanese value set for the EORTC QLU-C10D has many benefits for academics and the Japanese HTA system.

This study has several strengths. The Japanese value set was established in a large-sample representative of age and sex, enhancing generalisability of results. Second, only one inconsistency was observed in all the weights (the second and the third level of the “Trouble Sleeping” dimension). This is the lowest number of inconsistencies yet for QLU-C10D valuation studies, suggesting the Japanese survey was of high quality. Moreover, as our survey was based on the standard international protocol of the MAUCa Consortium, it facilitated international comparison with other country-specific QLU-C10D value sets. This study also has some limitations. Some respondents may not have engaged as fully in the online choice task as in face-to-face surveys. Also, respondents were not selected by random sampling from the entire Japanese population but by quota sampling from an online panel. Some characteristics of the respondents were statistically different from the Japanese population norms (e.g., region and work); we adjusted for most of these using raking, a form of sample weighting that allows weighting by several variables simultaneously. Further, the survey was conducted during the COVID-19 pandemic which had substantial impact on life in Japan. Other authors have noted that the pandemic did not impact on the ability to conduct online surveys such as this one during the pandemic, and indeed greater use and development of online research occurred during the pandemic [52]. However, it is unknown whether there was an impact on health preferences during the pandemic for the health attributes assessed in our study in comparison to pre- and post-pandemic preferences. There is limited evidence about the impacts of the COVID-19 pandemic on how people value health, and the policy implications of any such effects are unclear [53].

Conclusion

This study employed data from approximately 2,500 Japanese general population respondents who completed a DCE task based on an international valuation protocol developed for the EORTC QLU-C10D by the MAUCa Consortium and the EORTC Quality of Life Group. This produced the EORTC-endorsed Japanese value set for the EORTC QLU-C10D, which has some distinguishing characteristics compared to existing country-specific QLU-C10D value sets. Fundamentally, this study promotes economic evaluations in Japan and the development of HTA systems that produce transparent, consistent and defensible decisions around health and healthcare.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available due to the lack of consent from participants, but are available from the corresponding author upon reasonable request.

References

National Institute for Health and Care Excellence. 2013 NICE Process and Methods Guides. Guide to the Methods of Technology Appraisal 2013. London: National Institute for Health and Care Excellence (NICE).
Shiroiwa, T. (2020). Cost-effectiveness evaluation for pricing medicines and devices: A new value-based price adjustment system in Japan. International journal of technology assessment in health care, 36(3), 270–276.
Article PubMed Google Scholar
Center for Outcomes Research and Economic Evaluation for Health. 2022 Guideline for Preparing Cost-Effectiveness Evaluation to the Central Social Insurance Medical Council. Available from https://c2h.niph.go.jp/tools/guideline/guideline_en.pdf. accessed 10 Jan 2023
1990 EuroQol--a new facility for the measurement of health-related quality of life. Health policy (Amsterdam, Netherlands). 16(3):199–208
Shiroiwa, T., Fukuda, T., Ikeda, S., et al. (2016). Japanese population norms for preference-based measures: EQ-5D-3L, EQ-5D-5L, and SF-6D. Quality of life Research, 25(3), 707–719.
Article PubMed Google Scholar
Feeny, D., Furlong, W., Torrance, G. W., et al. (2002). Multiattribute and single-attribute utility functions for the health utilities index mark 3 system. Medical care., 40(2), 113–128.
Article PubMed Google Scholar
Noto, S., Shiroiwa, T., Kobayashi, M., Murata, T., Ikeda, S., & Fukuda, T. (2020). Development of a multiplicative, multi-attribute utility function and eight single-attribute utility functions for the Health Utilities Index Mark 3 in Japan. Journal of patient-reported outcomes., 4(1), 23.
Article PubMed PubMed Central Google Scholar
Brazier, J. E., Mulhern, B. J., Bjorner, J. B., et al. (2020). Developing a new version of the SF-6D health state classification system from the SF-36v2: SF-6Dv2. Medical care., 58(6), 557–565.
Article PubMed Google Scholar
King, M. T., Costa, D. S., Aaronson, N. K., et al. (2016). QLU-C10D: A health state classification system for a multi-attribute utility measure based on the EORTC QLQ-C30. Quality of life research, 25(3), 625–636.
Article CAS PubMed Google Scholar
Norman, R., Viney, R., Aaronson, N. K., et al. (2016). Using a discrete choice experiment to value the QLU-C10D: Feasibility and sensitivity to presentation format. Quality of life Research, 25(3), 637–649.
Article CAS PubMed Google Scholar
Smith, A. B., Cocks, K., Parry, D., & Taylor, M. (2014). Reporting of health-related quality of life (HRQOL) data in oncology trials: A comparison of the European Organization for Research and Treatment of Cancer Quality of Life (EORTC QLQ-C30) and the Functional Assessment of Cancer Therapy-General (FACT-G). Quality of life Research, 23(3), 971–976.
Article PubMed Google Scholar
McTaggart-Cowan, H., Teckle, P., & Peacock, S. (2013). Mapping utilities from cancer-specific health-related quality of life instruments: A review of the literature. Expert Review of Pharmacoeconomics & Outcomes Research, 13(6), 753–765.
Article Google Scholar
Xie, F., Pickard, A. S., & Krabbe, P. F. (2015). A checklist for reporting valuation studies of multi-attribute utility-based instruments (CREATE). PharmacoEconomics, 33(8), 867–877.
Article PubMed PubMed Central Google Scholar
Finch, A. P., Gamper, E., & Norman, R. (2021). Estimation of an EORTC QLU-C10 value set for spain using a discrete choice experiment. PharmacoEconomics, 39(9), 1085–1098.
Article PubMed PubMed Central Google Scholar
Gamper, E. M., King, M. T., Norman, R., et al. (2020). EORTC QLU-C10D value sets for Austria, Italy, and Poland. Quality of life Research, 29(9), 2485–2495.
Article CAS PubMed PubMed Central Google Scholar
Jansen, F., Verdonck-de Leeuw, I. M., & Gamper, E. (2021). Dutch utility weights for the EORTC cancer-specific utility instrument: The Dutch EORTC QLU-C10D. Quality of life Research, 30(7), 2009–2019.
Article PubMed PubMed Central Google Scholar
Kemmler, G., Gamper, E., & Nerich, V. (2019). German value sets for the EORTC QLU-C10D, a cancer-specific utility instrument based on the EORTC QLQ-C30. Quality of life Research, 28(12), 3197–3211.
Article PubMed PubMed Central Google Scholar
King, M. T., Viney, R., & Simon Pickard, A. (2018). Australian utility weights for the EORTC QLU-C10D, a multi-attribute utility instrument derived from the cancer-specific quality of life questionnaire, EORTC QLQ-C30. PharmacoEconomics, 36(2), 225–238.
Article PubMed Google Scholar
McTaggart-Cowan, H., King, M. T., Norman, R., et al. (2019). The EORTC QLU-C10D: The Canadian valuation study and algorithm to derive cancer-specific utilities from the EORTC QLQ-C30. MDM policy & practice, 4(1), 2381468319842532.
Article Google Scholar
Nerich, V., Gamper, E. M., Norman, R., et al. (2021). French value-set of the QLU-C10D, a cancer-specific utility measure derived from the QLQ-C30. Applied health economics and health policy., 19(2), 191–202.
Article PubMed Google Scholar
Norman, R., Mercieca-Bebber, R., & Rowen, D. (2019). UK utility weights for the EORTC QLU-C10D. Health Economics, 28(12), 1385–401.
Article PubMed Google Scholar
Revicki, D. A., King, M. T., & Viney, R. (2021). United States utility algorithm for the EORTC QLU-C10D, a multiattribute utility instrument based on a cancer-specific quality-of-life instrument. Medical Decision Making, 41(4), 485–501.
Article PubMed Google Scholar
Finch, A., Gamper, E., Norman, R., et al. (2021). Estimation of an EORTC QLU-C10D value set for Spain using a discrete choice experiment. PharmacoEconomics, 39, 1085–1098.
Article PubMed PubMed Central Google Scholar
Gamper, E. M., King, M. T., Norman, R., et al. (2020). EORTC QLU-C10D value sets for Austria, Italy, and Poland. Quality of Life Research, 29(9), 2485–2495.
Article CAS PubMed PubMed Central Google Scholar
Jansen, F., Verdonck-de Leeuw, I., Gamper, E., et al. (2021). Dutch utility weights for the EORTC cancer-specific utility instrument: The Dutch EORTC QLU-C10D. Quality of Life Research., 30, 2009–2019.
Article PubMed PubMed Central Google Scholar
Kemmler, G., King, M., Norman, R., Viney, R., Gamper, E., & Holzner, B. (2019). German value sets for the EORTC QLU-C10D, a cancer-specific utility instrument based on the EORTC QLQ-C30. Quality of Life Research Journal, Quality of Life Research, 28, 3197–3211.
Article PubMed Google Scholar
Nerich, V., Gamper, E. M., Norman, R., et al. (2021). French value-set of the QLU-C10D, a cancer-specific utility measure derived from the QLQ-C30. Applied Health Economics and Health Policy, 19(2), 191–202.
Article PubMed Google Scholar
Revicki, D. A., King, M. T., Viney, R., et al. (2021). United States utility algorithm for the EORTC QLU-C10D, a multiattribute utility instrument based on a cancer-specific quality-of-life instrument. Medical Decision Making, 41(4), 485–501.
Article PubMed Google Scholar
ICC/ESOMAR. The International Code on Market, Opinion and Social Research and Data Analytics 2016 [Available from: https://iccwbo.org/publication/iccesomar-international-code-market-opinion-social-research-data-analytics/.
Shiroiwa, T., Noto, S., & Fukuda, T. (2021). Japanese population norms of EQ-5D-5L and health utilities index mark 3: disutility catalog by disease and symptom in community settings. Value Health, 24(8), 1193–1202.
Article PubMed Google Scholar
Bansback, N., Brazier, J., Tsuchiya, A., & Anis, A. (2012). Using a discrete choice experiment to estimate societal health state utility values. Journal of Health Economics, 31, 306–318.
Article PubMed Google Scholar
Lancsar, E., & Louviere, J. (2008). Conducting discrete choice experiments to inform healthcare decision making: A user’s guide. PharmacoEconomics, 26(8), 661–677.
Article PubMed Google Scholar
Norman, R., Viney, R., & Brazier, J. (2014). Valuing SF-6D health states using a Discrete Choice Experiment. Medical Decision Making., 34(6), 773–786.
Article PubMed Google Scholar
Viney, R., Norman, R., & Brazier, J. E. (2014). An Australian discrete choice experiment to value EQ-5D health states. Health Economics, 23(6), 729–742.
Article PubMed Google Scholar
Mulhern, B., Norman, R., Street, D. J., & Viney, R. (2019). One Method, Many Methodological Choices: A Structured Review of Discrete-Choice Experiments for Health State Valuation. PharmacoEconomics, 37(1), 29–43.
Article PubMed Google Scholar
de Bekker-Grob, E. W., Donkers, B., Jonker, M. F., & Stolk, E. A. (2015). Sample size requirements for discrete-choice experiments in healthcare: A practical guide. Patient, 8(5), 373–384.
Article PubMed PubMed Central Google Scholar
Demirkale, F., Donovan, D., & Street, D. J. (2013). Constructing D-optimal symmetric stated preference discrete choice experiments. Journal of Statistical Planning and Inference, 143, 1380–1391.
Article Google Scholar
Street, D. J., & Burgess, L. (2007). The Construction of Optimal Stated Choice Experiments: Theory and Methods. Wiley.
Book Google Scholar
Ware, J. E., Jr., & Gandek, B. (1998). Overview of the SF-36 health survey and the international quality of life assessment (IQOLA) project. Journal of Clinical Epidemiology, 51(11), 903–912.
Article PubMed Google Scholar
Aaronson, N. K., Ahmedzai, S., Bergman, B., et al. (1993). The European organisation for research and treatment of cancer QLQ-C30: A quality-of-life instrument for use in international clinical trials in oncology. Journal of the National Cancer Institute., 85(5), 365–376.
Article CAS PubMed Google Scholar
Furukawa, T. A., Kawakami, N., Saitoh, M., et al. (2008). The performance of the Japanese version of the K6 and K10 in the World Mental Health Survey Japan. International Journal of Methods in Psychiatric Research, 17(3), 152–158.
Article PubMed PubMed Central Google Scholar
Herdman, M., Gudex, C., Lloyd, A., et al. (2011). Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Quality of Life Research., 20(10), 1727–1736.
Article CAS PubMed PubMed Central Google Scholar
Norman, R., Cronin, P., & Viney, R. (2013). A pilot discrete choice experiment to explore preferences for EQ-5D-5L health states. Applied Health Economics and Health Policy., 11(3), 287–298.
Article PubMed Google Scholar
Bansback, N., Brazier, J. E., Tsuchiya, A., & Anis, A. (2012). Using a discrete choice experiment to estimate societal health state utility values. Journal of Health Economics., 31, 306–318.
Article PubMed Google Scholar
Bleichrodt, N., Wakker, P., & Johannesson, M. (1997). Characterizing QALYs by risk neutrality. Journal of Risk and Uncertainty., 15(2), 107–114.
Article Google Scholar
StataCorp. 2013 Stata Statistical Software: Release 13. College Station, TX: StataCorp LP
Hole, A. R. (2007). A comparison of approaches to estimating confidence intervals for willingness to pay measures. Health Economics, 16(8), 827–840.
Article PubMed Google Scholar
Deming, W. E., & Stephan, F. F. (1940). On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. The Annals of Mathematical Statistics., 11(4), 427–444.
Article Google Scholar
Shiroiwa, T., Ikeda, S., & Noto, S. (2016). Comparison of Value Set Based on DCE and/or TTO Data: Scoring for EQ-5D-5L Health States in Japan. Value Health, 19(5), 648–54.
Article PubMed Google Scholar
Giesinger, J. M., Efficace, F., Aaronson, N., Calvert, M., Kyte, D., & Cottone, F. (2021). Past and Current Practice of Patient-Reported Outcome Measurement in Randomized Cancer Clinical Trials: A Systematic Review. Value Health., 24(4), 585–91.
Article PubMed Google Scholar
Hagiwara, Y., Shiroiwa, T., Taira, N., et al. (2020). Mapping EORTC QLQ-C30 and FACT-G onto EQ-5D-5L index for patients with cancer. Health and quality of life outcomes, 18(1), 354.
Article PubMed PubMed Central Google Scholar
Kaur, M. N., Skolasky, R. L., & Powell, P. A. (2022). Transforming challenges into opportunities: conducting health preference research during the COVID-19 pandemic and beyond. Quality of Life Research, 31(4), 1191–1198.
Article PubMed Google Scholar
Webb, E. J. D., Kind, P., Meads, D., & Martin, A. (2021). Does a health crisis change how we value health? Health Economics, 30(10), 2547–2560.
Article PubMed Google Scholar

Download references

Acknowledgements

We thank the European Organisation for Research and Treatment of Cancer Quality of Life Group (EORTC QLG), specifically the health technology assessment workgroup, for support with the study. The study followed an established methodology that was developed by the Multi-Attribute Utility in Cancer (MAUCa) Consortium and the EORTC QLG. The MAUCa Consortium members include Neil Aaronson, John Brazier, David Cella, Daniel Costa, Peter Fayers, Peter Grimison, Monika Janda, Georg Kemmler, Madeleine King (Chair), Nan Luo, Helen McTaggart-Cowan, Rebecca Mercieca-Bebber, Richard Norman, Dennis Revicki, Stuart Peacock, Simon Pickard, Donna Rowen, Galina Velikova, Rosalie Viney, Deborah Street, and Tracey Young. Margaret-Ann Tait provided valuable production assistance during submission of this manuscript.

Funding

The Japanese valuation study and value set for the European Organisation for Research and Treatment of Cancer (EORTC) QLU-C10D was funded by the Japanese National Institute of Public Health. The development of the EORTC QLU-C10D descriptive system and valuation methodology was funded by an Australian National Health and Medical Research Council Project Grant (PG632662).

Author information

Authors and Affiliations

Center for Outcomes Research and Economic Evaluation for Health (C2H), National Institute of Public Health, Wako, Saitama, Japan
T. Shiroiwa & T. Fukuda
Faculty of Science, School of Psychology, University of Sydney, Sydney, NSW, Australia
M. T. King & R. Campbell
European Organisation for Research and Treatment of Cancer Quality of Life Group, Brussels, Belgium
M. T. King & G. Kemmler
School of Population Health, Curtin University, Perth, WA, Australia
R. Norman
Medical Psychology, Amsterdam UMC Location University of Amsterdam, Meibergdreef 9, Amsterdam, Netherlands
F. Müller
Amsterdam Public Health, Global Health, Amsterdam, Netherlands
F. Müller
Department of Psychiatry, Psychotherapy and Psychosomatics I, Medical University of Innsbruck, Innsbruck, Austria
G. Kemmler
Crecon Medical Assessment Co., Ltd, Tokyo, Japan
T. Murata
College of Life Sciences, Ritsumeikan University, Kusatsu, Japan
K. Shimozuma

Authors

T. Shiroiwa
View author publications
You can also search for this author in PubMed Google Scholar
M. T. King
View author publications
You can also search for this author in PubMed Google Scholar
R. Norman
View author publications
You can also search for this author in PubMed Google Scholar
F. Müller
View author publications
You can also search for this author in PubMed Google Scholar
R. Campbell
View author publications
You can also search for this author in PubMed Google Scholar
G. Kemmler
View author publications
You can also search for this author in PubMed Google Scholar
T. Murata
View author publications
You can also search for this author in PubMed Google Scholar
K. Shimozuma
View author publications
You can also search for this author in PubMed Google Scholar
T. Fukuda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to T. Shiroiwa.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 110 kb)

Supplementary file2 (PDF 120 kb)

Supplementary file3 (PDF 131 kb)

Supplementary file4 (PDF 117 kb)

Supplementary file5 (PDF 103 kb)

Supplementary file6 (PDF 126 kb)

Supplementary file7 (PDF 174 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shiroiwa, T., King, M.T., Norman, R. et al. Japanese value set for the EORTC QLU-C10D: A multi-attribute utility instrument based on the EORTC QLQ-C30 cancer-specific quality-of-life questionnaire. Qual Life Res 33, 1865–1879 (2024). https://doi.org/10.1007/s11136-024-03655-7

Download citation

Accepted: 18 March 2024
Published: 09 May 2024
Issue Date: July 2024
DOI: https://doi.org/10.1007/s11136-024-03655-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Japanese value set for the EORTC QLU-C10D: A multi-attribute utility instrument based on the EORTC QLQ-C30 cancer-specific quality-of-life questionnaire

Abstract

Purpose

Methods

Results

Conclusions

Similar content being viewed by others

Plain English summary

Introduction

Methods

DCE valuation task

Other survey content

Statistical analyses

Results

Sample characteristics

Respondents’ perception of the DCE valuation task

Data quality

DCE Results

Japanese value set compared with other countries

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation