FormalPara Key Points for Decision Makers

Targeted therapy is increasingly applied in cancer treatment, but unmet needs are significant. This review provides the first synthesized evidence on the patient preference for targeted therapy, with novel approaches developed for evidence synthesis of DCE results.

Patients placed higher weights on the outcome (e.g., overall survival) and cost attributes than process attributes. On average, patients were willing to pay $561 and $716 out-of-pocket for a 1-month increase in progression-free survival and overall survival, respectively. Sample representativeness of the studies was rarely assessed.

The results may inform policymakers, physicians, and other stakeholders about patients’ perceptions of novel therapies and their WTP for survival if treated by targeted cancer therapy.

1 Introduction

Cancer is a significant public health problem and one of the leading causes of death worldwide [1]. Molecular aberrations play a causative role in many cancers [2], and understanding the molecular basis of cancer has ushered in the era of precision medicine [3]. The rapid evolution of precision medicine led to the development of a new type of pharmacotherapy called targeted therapy, which has been increasingly applied in clinical practice. By matching patients with drugs based on specific gene and protein biomarkers, targeted therapies minimize drug resistance, prolong survival, and avoid unwanted side effects [4]. Many targeted drugs have been approved as adjuvant therapy or even first-line options for recurrent or metastatic cancers [5]. In many cases, multiple targeted drugs are available for specific cancer or various stages of the disease, with different levels of efficacy, adverse events, and costs. For example, afatinib has a higher response rate in non-small cell lung cancer patients than gefitinib, but it may cause more adverse events [6]. Therefore, assuming that patients are presented with evidence-based treatment choices, choosing targeted drugs may be more complicated than cytotoxic chemotherapies because the choice largely depends on patients’ ability to trade-off between different factors, mainly cost, efficacy, and adverse events.

Patient preference information, especially synthesized preference evidence, could inform clinical decisions and facilitate shared decision-making to improve health outcomes [7]. Some regulatory agencies explicitly require incorporating patient preferences in clinical evaluation [8, 9]. The discrete choice experiment (DCE) is a commonly used approach for eliciting patient preferences based on random utility theory [10,11,12]. In a DCE, respondents are asked to choose a preferred therapeutic option between two or more alternatives that mimic the real-world choices, described by several attributes and associated levels [11, 13,14,15]. Through the analysis of respondents’ discrete choices, a DCE could estimate patients’ latent preferences (known as part-worth utilities) associated with the descriptive attributes and their willingness-to-pay (WTP) for changes to the attribute levels [13, 16]. To better understand patient preferences toward targeted therapy for cancers, there is a need to systematically review DCE studies on this topic.

Some previous reviews have summarized patient preferences for cancer treatments for specific cancers [17,18,19,20,21,22,23], and two studies synthesized evidence for multiple cancers [24, 25]. Despite these efforts, the published reviews either focused on treating a specific type of cancer or did not use the DCE results to derive patients’ WTP for cancer treatments, which would inform the clinical decisions and pharmaceutical regulations. Moreover, the broad coverage of cancer interventions (not only targeted therapy but also radiotherapy, surgery, chemotherapy, the management of cancer therapy adverse events, follow-up services, and support services) in some reviews may have limited the usefulness of their synthesized evidence. To date, no reviews have synthesized patient preferences specific to targeted therapy. Therefore, we aim to review DCEs on targeted therapy in a clearly defined and intrinsically comparable fashion and provide synthesized evidence on (i) the attributes and patient preferences for targeted therapy, (ii) the relative importance of the attributes, (iii) patients’ WTP for survival if treated by targeted therapy, and (iv) the quality of the DCE studies included in this review.

2 Methods

2.1 Literature Search

We designed the study as per the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [26]. Three widely used databases relevant to targeted therapy were searched, including PubMed, EMBASE, and MEDLINE, up to August 16, 2022. We developed search terms using a combination of the following keywords: DCE, patient preference, and cancer. We did not filter the search using terms such as targeted therapy and cancer treatment to broaden the search results. We supplemented the database search by examining the references of the included articles. We also compared with the past reviews to confirm that we did not miss any studies [24, 25]. The search strategy and results are presented in Supplementary Materials Section A.

Studies were included if they were: (i) peer-reviewed articles, (ii) written in English, (iii) DCEs eliciting patient preferences of medical treatments for cancers, and (iv) having targeted therapy as an option. In addition to DCEs, we included articles titled “conjoint analysis” if the article elicited patients’ part-worth utilities based on random utility theory, which implied the article was a DCE [10]. Studies were excluded if they did not elicit patient preferences for targeted therapy but other types of interventions for cancers (e.g., preventive care, pain management). Two authors (RR and PL) independently screened titles and abstracts, followed by screening the full text of the remaining articles to determine the final inclusion. Any disagreements between individual judgments were resolved by two authors (SJ and YG).

2.2 Information Extraction

An author (SJ) developed an initial version of the information extraction form based on a generic systematic review of DCEs [12] and two specific systematic reviews of DCEs on cancer treatments [24, 25]. The form was later revised and confirmed through discussions among all authors. Two authors (RR and PL) conducted the information extraction independently and cross-checked after completion. The lead author (SJ) examined all the retrieved information, which was discussed with a senior author (YG) when needed. Any disagreements were resolved through discussions involving the four authors.

We extracted the characteristics of the selected articles, including their location, publication year, target population, cancer type(s), administration method (e.g., online questionnaire, face-to-face interview), and sample size. We also extracted technical details on their experimental design and statistical analyses. In terms of the study results, we extracted the part-worth utilities (i.e., coefficients of the statistical models) of attribute levels, the relative importance of attributes if available, and the WTP values. All data generated or analyzed during this study are included in the article.

2.3 Evidence Derivation and Synthesis

To facilitate attribute comparison between studies, we followed the approach adopted by Bien et al [24] and categorized the attributes from the selected studies into three groups: outcome, cost, and process. The outcome attributes referred to either treatment efficacy or adverse events. The treatment efficacy was mainly demonstrated by progression-free survival (PFS) and overall survival (OS), which referred to an increase in median survival time or survival probability at a certain landmark time point. The process attributes included the route of administration, mode of administration, frequency of consultation, etc. After the categorization, we calculated the frequencies of the attribute groups.

We also calculated the relative importance of attributes if the authors did not report this information, following the range method recommended by the ISPOR Conjoint Analysis Good Research Practices Task Force [27]. We divided the range of each attribute (part-worth utility of the best level minus that of the worst level within the same attribute) by the sum of the ranges of all attributes and then multiplied the quotient by 100. Furthermore, we calculated the arithmetic average of the relative importance values of the same attribute in different studies. The average value indicated the overall importance of an attribute based on the included studies. By comparing the values, we assessed how important, on average, these attributes were to the patients.

We synthesized WTP values weighted by patient numbers and calculated the WTP values if the authors did not report them. If the study applied a multinomial logit model (MNL), we derived the WTP of an attribute level (non-cost type) by directly dividing the part-worth utility of the attribute level by the part-worth utility of the cost attribute. If the study applied advanced models such as the mixed logit model (MIXL), we used a simulation approach to derive the WTPs [28]. For each non-cost attribute level we were interested in, we drew a random sample 10,000 times according to the distributional information of the attribute level and the cost attribute. We then paired the sampled values and calculated the quotients to obtain the WTP for each pair. To prevent extreme values from distorting the mean estimates, we removed the largest 2% following Hensher et al [29]. If a study applied advanced models but did not report the standard deviation (SD) of the coefficients, we could not derive the simulated WTP (which requires both mean and SD estimates); instead, we calculated the WTP by simply dividing the coefficients, similar to the approach we adopted for the MNL. After calculating WTPs for individual studies, we converted the values to 2022 US dollars (US$) using Purchasing Power Parities for GDP calculated by the International Monetary Fund [30] and presented their average on a monthly basis weighted by the sample size of patients in each DCE. The weighted average implied, on average, the amount patients were willing to pay per month for a change in the non-cost attribute level.

2.4 Heterogeneity

We examined the analysis for unobserved and observed preference heterogeneity in the selected articles. The analysis for unobserved heterogeneity was by advanced models, including MIXL (also known as random-parameter logit model, RPL), Hierarchical Bayesian, latent class, and more advanced models such as generalized multinomial logit model. The observed heterogeneity was analyzed by exploring the interactions between patient characteristics (e.g., age, gender) and attributes. The interaction analysis implied specific preference patterns in some patient groups, which required attention from the stakeholders.

To detect the heterogeneity of patient preferences between cancers, we selected melanoma and prostate cancer as exemplars because they have different clinical characteristics, which may lead to different patient preferences. Melanoma is a severe form of invasive skin cancer with a high risk of death, while prostate cancer is usually a slow-growing non-cutaneous adenocarcinoma with indolent growth characteristics and a relatively favourable prognosis, commonly affecting older males. We selected melanoma and prostate cancer also because they had more studies included in this review than other cancer types, therefore the evidence synthesis for these two types of cancers would be more robust. We calculated the attribute relative importance for each cancer type and graphically illustrated the differences in results in a radar plot, with the results of all cancers as the background. The comparison indicated how cancer type influenced patient preferences in terms of the relative importance of the attribute.

2.5 Quality Assessment

We used the PREFS checklist (Purpose, Respondents, Explanation, Findings, Significance) to evaluate the quality of the included DCE studies [31]. Compared with other checklists, such as the ISPOR checklist [32], the ease of use, practicality, relevance, and wider acceptance made PREFS more frequently used for the quality assessment of DCEs [24, 33,34,35,36,37]. The PREFS checklist assesses the quality of a DCE from five aspects, including Purpose, Respondents, Explanation, Findings, and Significance. Each aspect has two clearly stated questions to assess the appropriateness of a study. The study receives one mark if deemed acceptable in one aspect and zero if not acceptable. The quality assessment was conducted by two authors (SJ and RR). A score of five indicates good quality, while a score lower than three indicates significant problems in the study. We calculated the score for each included study and their arithmetic mean, which implied an average quality of the studies included in our review.

3 Results

Figure 1 illustrates the process of study selection. The details of our search are in Supplementary Materials Section A. A total of 1133 studies were identified, from which 10 duplicates were removed. By screening the titles and abstracts, we excluded 1050 records. Through the full-text assessment, we further excluded 39 studies and then kept 34 studies for data extraction. No additional study was identified through the reference search.

Fig. 1
figure 1

Selection process of relevant studies for the systematic review

3.1 Study Characteristics

While the included studies are from 13 countries, nearly half were conducted in the USA (n = 17, 50%) (Supplementary Table S1). Twenty-two studies were conducted after 2018, implying an increasing trend of using the DCE to elicit cancer patients’ preferences for medical treatments, including the targeted therapy (Fig. 2).

Fig. 2
figure 2

Yearly number of published DCE studies that are relevant with targeted pharmacotherapy for cancers

As indicated in Supplementary Table S1, the most frequently investigated cancer was breast cancer (n = 6, 18%), followed by melanoma (n = 5, 15%), renal cell carcinoma (n = 4, 12%), and prostate cancer (n = 4, 12%). Others included colorectal cancer, multiple myeloma, ovarian cancer, etc. The sample size ranged from 18 [38] to 444 [39] patients, with a median of 148. About 76% of studies (n = 26) had a sample size greater than 100 and 32% (n = 11) larger than 200 (Table 1).

Table 1 Data extraction and quality assessment for included DCE studies

Most studies reported the way in which the DCE was conducted. Twenty-seven studies (79%) conducted online surveys, and five (15%) conducted face-to-face interviews [39,40,41,42,43]. Four studies (12%) applied paper-based questionnaires [38, 44,45,46], and four (12%) used more than one administration method [38, 41, 44, 46]. The face-to-face approach was less frequently applied for preference elicitation than the online approach, probably because of the difficulties in the practice of the face-to-face approach. However, it is more likely to help respondents understand the survey questions correctly than online surveys [28]. One drawback of the online approach was the low response rate [47]. Five such studies (15%) reported a patient response rate of less than 30% [47,48,49,50,51].

3.2 Study Design

Most studies used literature searches and expert opinion to identify the attributes and associated levels of a DCE. About half of the studies (n = 16, 47%) used in-depth or semi-structured interviews. Only two studies (6%) involved patient focus groups in developing the attributes [41, 52]. We found that the attribute identification process was not sufficiently reported, which may have impacted the transparency and validity of some DCEs. Once attributes and levels were identified, most studies (n = 31, 91%) conducted a pilot study to assess and refine the DCE questionnaire. The number of patients involved in the pilot studies varied from 3 to 39 [53, 54].

About half (n = 17, 50%) of the studies applied the D-efficient or D-optimal design to generate choice sets, while the rest used other designs (e.g., orthogonal arrays) or did not introduce their study designs. The number of total DCE choice sets ranged from 18 [39] to 2250 [55]. When the total choice sets were beyond the cognitive capability of respondents, the blocking technique was used to reduce the number of choice tasks in one questionnaire [13]. Seven studies (21%) included opt-out or status quo options [43, 44, 53, 56,57,58,59].

There was no consensus among the included studies when identifying an appropriate sample size. Seven studies (21%) used the rule of thumb for minimum sample size estimation [42, 43, 46, 54, 60,61,62]. Six studies (18%) referred to other DCE studies to justify the sample size [50, 51, 62,63,64,65]. One study (3%) used the parametric approach for sample size calculation proposed by Rose and Bliemer [59]. Twenty-one studies (62%) did not explicitly report how the sample size was determined.

3.3 Study Quality

Table 1 provides a quality assessment of the included studies. PREFS scores of the 34 studies ranged from 2 to 4, with a mean of 3.38 (SD: 0.65). No study scored 5. Most studies scored 3 (n = 15, 44%) or 4 (n = 16, 47%), and three studies (9%) scored 2 [50, 56, 66]. Although most studies clearly stated the research purpose, some did not design the study well or provide a comprehensive analysis. For example, we found that most studies (n = 32, 94%) did not assess the impact of non-responders on their preferences. Sixteen studies (47%) excluded some responses without a clear rationale.

3.4 Attribute Frequency

Figure 3 illustrates how frequently different attributes (i.e., outcome, cost, or process) were used in the studies we reviewed (a complete list is provided in Supplementary Table S2). We found that the administration regimen was the most common attribute, which appeared in 25 studies (74%). The administration regimen included the frequency/route/mode of administration, treatment duration, and food restrictions.

Fig. 3
figure 3

Frequency of different types of attributes in the selected DCE studies (total number = 34). OOP out-of-pocket, OS overall survival, PFS progression-free survival. X-axis represents the number of studies. Y-axis represents attribute category.

Outcome-type attributes were also common, most referring to the risk of adverse events and treatment efficacy. For example, the risk of nausea/vomiting/diarrhea appeared in 15 studies (44%), the risk of fatigue/tiredness in 13 studies (38%), and the risk of skin disorder in 12 studies (35%). The attributes belonging to treatment efficacy, such as OS and PFS, appeared in most studies. For example, PFS was considered in 24 studies (71%) and OS in 11 studies (32%). No DCEs in this review considered health-related quality-of-life (HRQoL) as an outcome attribute.

Cost-type attributes were included in nine studies (26%). This type included out-of-pocket (OOP) cost of treatment (n = 8, 24%) and work loss due to treatment [45] (n = 1, 3%). All studies considering cost attributes indicated that patients preferred lower costs. Other studies did not include the cost in the DCE, likely because the authors wanted to focus on patients’ trade-offs between efficacy and adverse events.

3.5 Attribute Relative Importance

Figure 4 illustrates the relative importance of different types of attributes. We found that cancer patients perceived OS as the most critical attribute relative to other attributes. All efficacy attributes were considered more important than those concerning adverse events and treatment administration. Cost attributes were perceived as important as efficacy attributes. The least essential attributes were those regarding treatment processes, such as administration regimens.

Fig. 4
figure 4

Relative importance of attributes in the selected DCE studies. OS overall survival, PFS progression-free survival. Y-axis represents the relative importance on a 0-100 scale.

Another perspective we examined was related to ways in which cancer patients perceive the attributes by ranking the proportion of one attribute as the most important among the selected DCEs. Figure 5 illustrates the proportions from this perspective. We found that cancer patients perceived OS as the most (or second most) important attribute in about 81% of studies that incorporated OS as a DCE attribute. PFS was perceived as the most (or second most) important attribute in about 58% of studies, and the cost was the most (or second most) important in about 55% of studies. In contrast, the administration regimen was perceived as most important in about 11% of studies, and other process-type attributes were not perceived as most important in any study.

Fig. 5
figure 5

The proportion of different types of attributes perceived as important in the selected DCE studies. Rank 1, perceived as the most important. Rank 2, perceived as the second most important. Rank 3, perceived as the third most important. Rank 4+, perceived as the fourth most important (or even lower). Y-axis represents the proportion of studies. Example interpretation: Overall survival (OS) was perceived as the most important attribute in 45% of studies that incorporated OS as a DCE attribute, the 2nd most important attribute in 36% of studies, the 3rd most important attribute in 0% of studies, and the 4th most important (or below) attribute in 18% of studies. PFS progression-free survival

3.6 Willingness to Pay

Table 2 provides an overview of patients’ WTP for improvements in survival. There were five studies (15%) examining patients’ WTP for PFS improvement [42, 43, 51, 57, 59]. Our calculation indicated that, on average, patients were willing to pay $561 (95% CI $415–$758) OOP per month for a 1-month increase in PFS. The largest number appeared in Gonzales et al, which found that patients were willing to pay $1039 with a 95% CI ranging from $755 to $1338 for a 1-month increase in PFS [51].

Table 2 Willingness-to-pay for 1-month increase in survival

Four studies examined patients’ WTP for OS improvement [44, 51, 57, 58]. Our calculation indicated that the average WTP for a 1-month increase in OS was $716 (95% CI $524–$958) OOP. Although Stenehjem et al reported that the WTP was as low as $88 (95% CI $67–$110), Gonzales et al found that patients’ WTP was as high as $1638 (95% CI $1207–$2102) for a 1-month increase in OS [51].

3.7 Heterogeneity

Among the included studies, 11 (32%) used MIXL or RPL, 10 (29%) used Hierarchical Bayesian models, and 1 (3%) used the latent class model. The results indicated that most studies investigated unobserved preference heterogeneity.

A total of 16 studies investigated observed preference heterogeneity by analyzing the interactions between patients’ characteristics and attributes. Ten studies (29%) reported investigations of interactions between age and attributes [43, 45, 47, 48, 54, 60, 67,68,69,70] with mixed results. Two studies found that older patients placed higher weight on PFS than younger patients [47, 68]. Four studies reported that younger patients placed higher weight on PFS [54, 69] or OS [67, 70] than older patients. The rest found no significant interactions between age and preferences.

Four studies (12%) explored interactions between gender and preferences [43, 50, 67, 68]. One study reported that female patients were more averse to the shortest PFS (3 months of PFS) than male patients [50], implying that females had a stronger preference for longer PFS than males. Another study identified a significant interaction term between females and cost with a negative coefficient (− 0.075) [43], implying that females had a stronger preference for smaller costs than males. The rest found no significant interactions between gender and preferences [67, 68].

Based on the calculation of attribute relative importance, we found the results between melanoma and prostate cancer contrasted significantly. Figure 6 illustrates how the preferences of patients with melanoma and prostate cancer differ from each other. As indicated in the figure, patients with prostate cancer valued efficacy-type attributes the most, compared with patients with other types of attributes. In contrast, melanoma patients perceived cost as the most important attribute. The preferences toward PFS and OS revealed another difference between the two types of patients. Patients with prostate cancer were more concerned about PFS than OS, while melanoma patients cared about OS more than PFS.

Fig. 6
figure 6

Different preferences towards targeted therapy between patients with melanoma and prostate cancer. OS overall survival, PFS progression-free survival

When comparing the preferences of melanoma patients and all cancer patients, we found they had similar preference patterns, except that melanoma patients ranked cost as the most important factor. Patients with prostate cancer did not prioritize cost but valued PFS the most. The characteristics of different cancers might explain the heterogeneity observed by cancer type. Decision-makers might need to consider the preference patterns between different types of cancer patients.

4 Discussion

Although targeted pharmacotherapy has demonstrated promising outcomes in cancer treatment, patient preferences for this therapy remain unclear. Our study represents the first effort to systematically review the DCE literature on this topic and synthesize evidence on patient preferences for targeted pharmacotherapies. This review identified 34 eligible studies from 13 countries, covering 14 types of cancers, such as breast, ovarian, kidney, prostate, and skin cancers. The review reveals a rising trend of DCEs on this topic, as most studies were published after 2018, which warranted the necessity for evidence synthesis that has yet to be undertaken. In this review, we utilized the primary estimates to derive secondary results for evidence synthesis, such as the relative importance of attributes and patients’ WTP for survival. We explicitly focused on targeted cancer therapy, which may help to inform clinical decisions and pharmaceutical regulations on targeted therapy.

4.1 Attributes and Relative Importance

A large number of attributes were identified and categorized into three groups: outcome, process, and cost. We found that patients placed higher weights on the outcome (e.g., OS) and cost attributes than on the attributes concerning treatment processes (e.g., administration regimen) in their choices. In addition, OS was perceived as the most (or second most) important attribute in most DCEs. Considering that the DCEs had different levels of OS and were conducted among cancer patients in heterogeneous contexts, our results indicate that patients generally value OS the most among all the attributes.

Health economists have widely recognized that the HRQoL of patients is crucial for the health outcome assessment because it provides a comprehensive valuation of health interventions beyond clinical outcomes (e.g., PFS and OS) [71,72,73]. Although cancer patients were found to value outcome attributes the most in this review, no studies included in the review considered HRQoL as an attribute. One possible reason is that, although generic HRQoL instruments exist (e.g., EQ-5D and SF-6D), DCE practitioners may find them too generic to best capture the key outcomes of targeted therapy [74]. In the meantime, widely accepted cancer-specific HRQoL instruments are still lacking. Although some instruments have been developed [75, 76], it may take time for them to gain wide popularity among clinicians and DCE practitioners. Nevertheless, including HRQoL as an outcome attribute in future DCEs in this area is warranted. Researchers could use overall scores or qualitative descriptions of selected dimensions as potential attributes. Researchers could also use graphs (e.g., 0–100 scale) to help explain such attributes.

4.2 WTP for Survival

We found patients, on average, were willing to pay about $561 (95% CI $415–$758) from their own pockets (i.e., OOP cost) for a 1-month increase in PFS if they were treated by targeted therapy. The average WTP for a 1-month increase in OS was even higher, up to $716 (95% CI $524–$958). These imply mean WTPs of $6732 and $8592 for a 1-year increase in PFS and OS, respectively.

The level of WTP was associated with several factors. The first factor is the cancer type. As indicated in Table 2, the WTPs of patients with renal or colorectal cancers were higher than those with ovarian or breast cancers. According to cancer epidemiological statistics, the 5-year survival rate of breast and ovarian cancers are higher than that of renal and colorectal cancers [77]. A previous study found that patients with low-survival-rate cancers are willing to pay more than those with high-survival-rate cancers [78].

The second factor is the income level of patients. The economic theory suggests wealthier people have a greater WTP for goods and services. Therefore, a common finding in the health economics literature is that patients with higher incomes are willing to pay more for healthcare services than those with lower incomes [28, 78]. In the case of targeted therapy, the income level of cancer patients is likely to be positively associated with their WTP. However, the DCEs in the current review did not explore this relationship.

In addition, other characteristics of cancer patients, such as age and gender, may impact their WTPs. Less than half of DCEs (n = 16, 47%) in this review investigated interactions between patients’ characteristics and preferences. For example, some studies indicate that older patients place a higher weight on PFS than younger patients [47, 68], while other studies suggest the opposite [54, 67, 69]. Future DCE studies in this area should further explore such associations. Willingness-to-pay values may also be affected by the different attribute levels used to define the survival attributes [79].

Contingent valuation or model-based estimation may derive WTP values much larger than the synthesized WTP from DCEs in this review. For example, Olofsson and colleagues used a contingent method to measure the end-of-life premium in cancer and concluded that the mean value of a QALY was about €528,000 [80]. Diaby and colleagues used a Markov model to measure the mean value of a QALY for metastatic breast cancer patients and obtained values larger than $117,000 ($117,519.8–$185,210.7) per QALY [81]. The WTP values derived by different approaches might not be directly comparable. One explanation is that we derive WTP estimates from only seven DCEs. Most studies do not use OOP cost as an attribute, and thus we were unable to estimate the WTPs in these studies. Another explanation is that the non-DCE studies may have incorporated an end-of-life premium for cancer patients at advanced stages, future medical and nonmedical costs for patients after treatment, and the impact of cancers on their productivity [82, 83]. The third explanation is the presence of potential biases that occurred in implementing the seven DCEs, such as the selection bias [16, 84]. These issues need to be addressed in future research to reconcile the differences between different methods and produce more robust and reliable WTP estimates.

4.3 Non-Response Bias

The non-response bias remains a major concern in this literature since the preferences are elicited from cancer patients. Some patients may die soon after diagnosis, while others at advanced cancer stages may have difficulties participating in DCE studies. Non-response bias can impact the external validity of the DCEs as they may imply un- or under-representative samples. As indicated by our quality assessment (Table 1), no studies addressed the non-response problem or ascertained the extent of non-response bias to which it may lead to an unrepresentative sample. Future DCE studies may need to compare the responders and non-responders in terms of their characteristics that may impact preferences. Alternatively, one may refer to the characteristics of the target population to assess the sample representativeness. If the characteristics of the sample are not consistent with the target population, the practitioners may consider approaches to reweight their sample to match the key statistics of the population.

4.4 Opt-out Option

This review found that few DCEs provided opt-out or status quo options to respondents. Respondents were often forced to choose between two or three therapeutic alternatives. However, in the context of oncology, palliative care and withdrawal of treatment (that may cause extreme economic burden) are realistic options for patients. These options could be represented in choice tasks by opt-out or status quo options, which allow respondents not to choose the therapeutical alternatives in a DCE task [85]. This is important as the restricted choice tasks may bias the analysis results [86]. We therefore recommend the inclusion of the opt-out/status quo options in future DCEs to mimic the real-world context for cancer patients.

4.5 Other Issues

Most studies included in the review have not followed the general best practice in DCE methodology. Almost half of the studies included in the review did not claim the use of qualitative research to identify attributes, making the assessment of the reliability of the selected attributes challenging. Among those using a qualitative phase, the disclosure of details and results on how qualitative findings affected the DCE design was insufficient. Most studies did not report a pilot phase and how the sample size was calculated, likely because they had not been done. While patient recruitment is understandably challenging in this area, researchers should endeavour to follow the best practice for conducting DCE studies [11, 87].

4.6 Strengths and Limitations

This review provides the first synthesized evidence on patient preference for targeted therapy. The literature search process is comprehensive, with clearly defined inclusion and exclusion criteria. We applied rigorous quality control approaches to ensure correctness and preciseness in the literature screening and information retrieval process. For example, two junior authors independently screened the literature, extracted information from the selected studies and cross-checked the results. Two senior authors examined the retrieved information, and the evidence synthesis results. Any disagreements were resolved by discussion involving the four authors.

Although this review followed best practices, it has some limitations. First, we could not calculate the relative importance of attributes for each DCE included in the review as some studies did not report the estimates of preference weights. We could only approximate the weights from other sources, such as the graphs for these studies. Second, the WTP results reported in this review should be interpreted with caution as they are based on a small number of studies and thus have limited representativeness. It might be difficult to draw solid conclusions from the limited number of studies. Instead, the estimates provide a direction, and future research is needed. Third, the PREFS checklist used for the quality assessment may have limitations. It provides five questions for quality assessment and may miss important criteria, e.g., the validity and comprehensiveness of the attribute identification process and analysis of preference heterogeneity. Also, it does not provide sufficient tools to assess the biases in a DCE, such as selection bias and non-response bias. Considering the highly heterogeneous contexts where DCE studies are conducted, we call for establishing a specific checklist in oncology to help DCE researchers and practitioners with their study design and report, as well as future quality assessments in systematic reviews.

5 Conclusions

This review provides synthesized evidence on patient preferences regarding targeted cancer pharmacotherapies. The results on the attribute relative importance may inform policymakers, physicians, and other stakeholders about patients’ perceptions of this novel therapeutic option. The synthesized WTP for survival, although it included only a limited number of studies due to eligibility issues, may provide a direction for estimating patients’ monetary valuation of targeted therapy. We highlight that in order to achieve a better understanding and consensus among stakeholders surrounding targeted therapy, more DCEs or other stated preference studies are warranted for further evidence. To achieve robust evidence and facilitate evidence synthesis, a standardization of the DCE design and implementation is also needed. Future research could follow the recommendations on study design in this review to improve the internal and external validity and provide more robust results in this increasingly important field.