FormalPara Key Points for Decision Makers

Median willingness to pay for a quality-adjusted life-year (QALY) was estimated at $US19,200–32,000 in the Kingdom of Saudi Arabia (KSA).

This range is in line with the per capita gross domestic product of approximately $US20,000 in the KSA.

Decision makers can use this range as a threshold for the cost per incremental QALY in decision making around reimbursement for a drug or healthcare intervention.

1 Introduction

One key piece of data in the process of medication reimbursement is a threshold that reimbursement decision makers, mainly health authorities, call the willingness to pay (WTP) for a quality-adjusted life-year (QALY). In more simple terms, this is the maximum that a plan or authorities would disburse to secure 1 additional year of life with perfect health and therefore reimburse medications or procedures that would yield incremental cost-effectiveness ratios (ICERs) lower than the WTP for a QALY.

Many Western countries that have some sort of formal medication reimbursement process, such as the UK or Canada, apply threshold values for cost effectiveness, below which a medication is considered cost effective or above which a medication is no longer considered attractive to authorities. Cameron et al. [1] identified 17 countries with a formal threshold; most have either an implicit or otherwise unclear threshold.

Some of those countries use thresholds varying from CAD20,000 per QALY in Canada for definitely attractive drugs [2], to $US50,000 in the USA for cost-effective drugs [3,4,5], or about £20,000–36,000 in the UK [6, 7], and €50,000 in Italy [8], Sweden [9], and the Netherlands [10]. One study evaluated Australian Pharmaceutical Benefits Advisory Committee outcomes for submissions from 2005 to 2009; approximately 50% of the positive recommendations accepted for listing had an ICER of > AUD45,000, whereas 33% of the negative “against” recommendations had an ICER range of AUD45,000–75,000 per QALY gained [11].

Although the reported use of a threshold of $US50,000 in North America emerged from a study of renal dialysis, it appears to have been adopted more because it is an easily applied round number [12]. In turn, Rudmik and Drummond [13] reported a convenient range of $US50,000–100,000 for the threshold in the absence of an official value, whereas King et al. [5] questioned the validity of a threshold of $US50,000. For the UK, Claxton et al. [14] reported new approaches to estimating the value of a QALY, as no empirical estimates had previously been available for the threshold applied by the UK National Institute for Health and Care Excellence.

In fact, very few countries have explicit or even implicit threshold values, and even fewer have the value empirically determined. No QALY threshold exists for the Kingdom of Saudi Arabia (KSA). Hence, the primary objective of this study was to empirically determine the monetary value of a QALY in the KSA.

2 Methods

The QALY is regularly used to assess the cost effectiveness of drugs, especially those used to treat diseases and illnesses with a high impact on quality of life, such as most types of cancer, or drugs that significantly affect patient quality of life, such as some chemotherapy protocols. This type of cost-effectiveness analysis is often called cost-utility analysis. Utility, or health state utility, or simply health utility (HU) is the quantitative measure of quality of life in healthcare, with 1 QALY being equal to 1 year of life with a utility of 1 (or 100%); 1 QALY is also equal to 2 years of life with a utility of 0.5 (or 50%), or 4 years of life with a utility of 0.25 (or 25%), etc., whereas utility scores may range from 0 to 1 (or 0–100%).

Soares Santos et al. [15] identified three approaches for calculating thresholds for a QALY: the WTP method, the precedent method, and the opportunity cost method. Our study applied the WTP approach. The present study consisted of a two-part survey that determined the utility scores of participants and their WTP for a QALY. The online survey was conducted between April and December 2017, with help from three trained interviewers. Recruitment was done at the King Faisal Specialist Hospital and Research Center (KFSHRC), a tertiary institute. This study was approved by the ethics review board of KFSHRC, Riyadh, KSA. All study participants signed informed consent prior to starting the survey. Patients with cancer, their visiting relatives, and some KFSHRC personnel were recruited on site by the interviewers. Interviewers logged in on the survey site with a unique user code for each participant, read the questions to the participant, and entered the participant’s answers.

2.1 Subjects

Participants in the study were citizens of the KSA, either patients with a solid tumor diagnosis from the KFSHRC or members of the general public (MGP) recruited at KFSHRC, including family members of patients, healthcare professionals, or members of the larger community. The minimum age for participation was 18 years; no maximum age was applied. Participants also had to understand the Arabic language and provide informed consent. Patients with hematological malignancies or lymphoma were excluded as the study was centered at the solid tumors unit of the hospital. Online calculators indicated we needed a sample of ≥ 385 participants for the survey for a confidence level of 95% and a margin of error of 5% [16, 17].

2.2 Study Approach and Survey Tool

Data collection was carried out in two steps: a time trade-off (TTO) survey followed by a WTP assessment. Elicitation of HU with the TTO method has been described elsewhere [18, 19], including an electronic version of the TTO [20]. The TTO component is summarized and the WTP component and the calculations to obtain the amount for a full QALY are described in detail.

The first step consisted of eliciting the HUs for patients in a given disease condition, i.e., cancer. This first step could be done through two approaches: (1) through direct survey of patients and measurement of their own health-related quality of life or (2) through the general public by presenting a scenario describing a certain cancer-related health state and asking them to rate the quality of life for the given health state. HUs were measured using the TTO technique. In the TTO, the respondent was presented with two choices: (1) to live for a given length of time with the current condition followed by “instant painless death” or (2) to live fewer years but in full health followed by “instant painless death.” The duration of time in full health was adjusted using a “ping-pong” game technique, until the participant was indifferent between the two options. The maximum number of years that could be traded as applied to the current health state was 10 years or 120 months. This measure allowed us to quantify the health-related quality of life and calculate the number of QALYs over a certain period of time. For example, if a participant was willing to give up 72 months (6 years), then we calculated that 72/120 = 0.6 or 60% of the time was traded off, hence 100–60% = 40% of the remaining time in perfect health indicated an HU of 0.4.

The second step consisted of asking both patients and MGP how much they would pay out of pocket to reach perfect health. This step actually consisted of evaluating the subject’s WTP for transitioning from a given health state to perfect health. For patients, it consisted of their own health state; for MGP, it was the health state described in the presented scenario. The WTP was determined using the bidding game technique, as applied in previous studies [21,22,23] again in a “ping-pong” fashion. Once we had a monetary value for transitioning to perfect health over a certain period of time, we calculated the monetary value of a QALY as estimated by the survey respondent, since we had previously calculated the number of QALYs over that same period for the same respondent.

The content, wording and formatting of the survey were evaluated and assessed for face validity by two members of the research team, an expert clinician (SB) and a researcher with expertise in this area of health economics (MI). Arabic translations were carried out at KFSHRC. MGP were presented with a health state describing a particular scenario that may be faced by a patient with stage 3 colon cancer, including all relevant clinical characteristics, standard treatment options, and their potential impact on a patient’s quality of life.

2.3 Determination of the Willingness to Pay Per Quality-Adjusted Life-Year

Participants were asked for an amount they were willing to pay to move from their present state to 1 year of perfect health. This WTP was then applied to 1 QALY. The participants were first asked the monthly WTP in Saudi Arabia riyals (SAR; $US1 = SAR3.75) for a period of 1 year. After the 1-year bidding game was completed, participants were asked what amount they would pay monthly for 5 years, and a new bidding game was applied. Bidding games started randomly with one of the following amounts: SAR500, SAR1500, or SAR2500. These amounts were chosen to provide reasonably affordable starting figures for all participants, as a monthly salary of < SAR3000 is considered among the lowest. A “ping-pong” sequence was applied with a maximum bid of SAR10,000 and a minimum of zero. If a participant answered yes, a new amount was proposed at the midpoint between the maximum and their starting amount. If the answer was no, a new amount was proposed as the midpoint between zero and their starting amount. The sequence continued with amounts proposed at midpoint between their last choice and either the minimum, the maximum, or the starting amount, varying with the last answer received. Minimum increments were SAR100. When the bidding game yielded a certain WTP amount after the “ping-pong” sequence with no further possibilities, participants were asked whether they were willing to pay that amount. If they said yes, it was considered their answer. If they said no, they were asked what amount they were willing to pay and asked to confirm it. Figure 1 represents a schematic illustration of the WTP component of the survey. Figure 2 presents a sample slide from the patient version of the WTP questionnaire. Finally, results were calculated and presented in $US, with $US1 = SAR3.75 (year 2018 values).

Fig. 1
figure 1

Algorithm of the bidding game

Fig. 2
figure 2

Sample slide from the willingness-to-pay questionnaire (patient version)

In a first step, we calculated WTP = x/(1 − y), where x was the amount proposed by the participant and y was the product of 1 year in the present health state multiplied by the HU for that health state. Accordingly, a WTP amount was determined for a full QALY. For example, a participant who offered $US1000 monthly ($US12,000 annually) to move to perfect health had a utility of 0.4, so this WTP was calculated to be $US12,000/(1–0.4) = $US12,000/0.6 QALY or $US20,000/1.0 QALY. The participant’s WTP per full QALY was then determined to be $US20,000.

2.4 Analysis

Demographic statistics were determined both for the overall sample and separately for both patients and MGP. We calculated means, medians, and percentile results for WTP per QALY. Although the WTP survey was conducted with amounts asked in SAR, results are presented in $US, with $US1 = SAR 3.75 (year 2018 values). The median was considered the most adequate result, as extreme outliers can skew results in health economics, whereas the median remains in the middle of the least and the most amount offered. WTP per QALY was determined for the two time horizons: 1 and 5 years. Analyses were carried out to compare results between subgroups defined by sociodemographic characteristics and to determine whether the difference was statistically significant in some of those comparisons. The statistical package used for analyses was SAS 9.4.

2.5 Sensitivity Analyses

A first series of sensitivity analyses were carried out on the results from the HU component. Adjusted results were calculated after eliminating outliers. Outliers for utility scores were considered as the bottom and top 5% of the utility range, hence the adjusted dataset was limited to 0.05 < utility < 0.95. A second, more restrictive, set of analyses were performed, removing both the outliers on the utility results and outliers on the WTP amounts, hence eliminating outliers who fell outside of those with 0.05 < utility < 0.95 and 10% < WTP < 90%, or the participant answers at the top or bottom 5% of the utility range and those at the top or bottom 10% of the WTP range. These sensitivity analyses were deemed important because, in health economics, especially with a bidding game with an open-ended ceiling, data can be skewed mostly toward the upper end. Hence, these sensitivity analyses were performed to adjust, to a certain extent, for that skewness.

Finally, we performed sensitivity analyses to adjust for actual demographic rates from the general population of the KSA [24]. This was done by multiplying age and sex subgroup results by corresponding epidemiologic figures based on the demographics from the general population statistics of the KSA.

3 Results

From 400 survey participants, data from 378 subjects (177 patients; 201 non-patients) were deemed usable for analysis. Data from the other 22 subjects could not be retrieved and were therefore unusable because of technical issues with data transmission to the central database. In total, 134 patients had stage III or IV cancer, with the five most common types being colon, rectal, renal, gastric, or prostate cancer. Table 1 provides the participant demographics for the overall sample and the two subgroups. Most participants were males (73.5%), and the sample was almost equally divided between employed and unemployed. More than half (58.2%) were aged 36–65 years, and less than one in ten (9.5%) were aged > 65 years. The subgroup aged 26–35 years had the most participants (22%), closely followed by the subgroup aged 36–45 years (21.7%). The subgroup of participants with a bachelor’s degree was the largest (41.5%), followed by those who did not complete secondary school (19.6%). Half of the participants had a monthly income of more than SAR12,000 ($US3200), and the largest subgroup (27.5%) was those earning SAR6001–12,000 ($US1600–3200).

Table 1 Demographics of study participants

Table 2 summarizes the HU scores determined in the first part of the survey, i.e., the TTO component. Mean utilities are presented for the overall group as well as separately for the patient and non-patient groups. After two-way adjustments, i.e., eliminating outliers from the utility scores and from the WTP results, the mean utility scores fluctuated round 0.5, with the patient group reporting a lower mean than the non-patients: 0.42 and 0.51, respectively.

Table 2 Summary results of mean utility scores

Tables 3 and 4 present the WTP results for the 1-year and the 5-year horizons, respectively. For the 1-year time horizon, the median WTP per QALY was $US25,600; after adjustments (without outliers), it was $US32,000. For the 5-year time horizon, the results were slightly lower, at $US19,200 and 22,720 per QALY, respectively, before and after adjustments. After adjustment without outliers, we determined the means to be $US58,000 for the 1-year horizon and $US38,600 for the 5-year horizon, figures that are much closer to the medians than the unadjusted means.

Table 3 Comparative quantiles values of WTP by type of respondent, 1-year horizon
Table 4 Comparative quantiles values of WTP by type of respondent, 5-year horizon

Figure 3 is a logarithmic representation of WTP per QALY by utility scores. It is interesting to note a somewhat positive association between WTP and utilities. In turn, Fig. 4 illustrates the rather negative association between WTP and age, with WTP decreasing with increasing age.

Fig. 3
figure 3

Log representation of WTP per quality-adjusted life-year (1 year) by current health utility. WTP willingness to pay

Fig. 4
figure 4

One-way analysis of WTP/QALY, 1-year horizon, by age group. QALY quality-adjusted life-year, WTP willingness to pay

In sensitivity analyses, where we adjusted for age and sex based on official statistical data for the KSA, after projecting WTP amounts from the age and sex subgroups into corresponding subgroups in the KSA, we obtained WTP per QALY median amounts of $US31,909 for the 1-year horizon and $US22,926 for the 5-year WTP results. These results are very close to the main results, adjusted after excluding outliers.

4 Discussion

This study is, to the best of our knowledge, the first to determine the WTP per QALY in the KSA. It provides threshold results that will be useful in determining whether a new drug or intervention is attractive by comparing expected ICERs with these threshold results, in terms of $/QALY gained. Hence, if an economic analysis of a drug yielded an expected ICER of less than the WTP amount determined in our study, i.e., $US25,600, or 32,000 after adjustments, for the 1-year time horizon, the reimbursement price of the drug may be deemed attractive. The WTP results are somewhat lower in the 5-year horizon analyses ($US19,200–22,720).

This study does not provide definitive WTP per QALY thresholds, but it does present a first provisional set of thresholds that could guide future pharmacoeconomic studies, at least until more solid results are obtained in a larger and more representative sample. Nevertheless, when compared with the gross domestic product (GDP) in the KSA, which is approximately $US20,000 per capita, our results come rather close to the GDP figure within the 1-year horizon and narrowly overlap it within the 5-year horizon. Although we are comparing medians and a mean GDP, mean and median should be similar, if not identical, in a large setting such as a whole country. Obviously, to get such results, we used the median rather than the mean and had to eliminate the outliers, i.e., extreme values in both HU scores and WTP amounts. This approach is used in health economics to avoid very wide variations within a sample. In fact, in our sample, results went from zero to over $US38 million. As it is difficult to obtain such a distribution in this type of analysis, we defined the outliers with what we considered acceptable limits: ± 5% for utilities and ± 10% for WTP. It should be noted that medians were lower than means, as reported in our previous WTP studies [21, 22]. The previous studies also found no starting point bias, as no statistically significant differences were reported between groups starting the bidding game with different amounts. We did not perform such an analysis in the present study because this was not one of the study objectives. This approach may be a limitation. Furthermore, the aim of the present study was not to compare results between patients and MGP, although we present results for both, but rather to present a combined estimate.

With regards to adjusting our results over the KSA population, the sensitivity analyses projecting the WTP result over corresponding age and sex demographics of the KSA, we obtained figures very similar to our main results adjusted for outliers. However, there were differences in the results between the patients and the MGP, either for the overall analysis or in comparisons of results between various demographic subgroups. It has been suggested that preferences from the general population, rather than those from patients, should be used in cost-effectiveness analysis [18]. For the overall WTP results, the patient group reported a median of $US11,100, whereas the median for the MGP was $US42,700 for the 1-year horizon, with a significant difference (p < 0.001). For the 5-year horizon analyses, the patient group reported a median of $US10,000, whereas the MGP reported a median of $US31,000 (p < 0.001). WTP results tended to increase with the reported utility measures (Fig. 3). WTP results tended to decrease with age (Fig. 4). There were some significant differences between various other subgroups, such as between “bachelor’s degree” and “secondary diploma” and between “university degree above bachelor’s” and “secondary diploma.” However, no significant difference was found between “bachelor’s degree” and “registered apprenticeship or trade certificate” (data not shown).

Furthermore, other patient-based results using a TTO survey reported in the USA were almost similar, at $US22,700–32,200 per QALY [5]. Bobinac et al. [25] reported WTP per QALY values in the Netherlands of about €13,000 based on visual analog scale valuations and of €24,500 based on EuroQol tariffs. Cameron et al. [1] reported that most of the threshold values, in countries where they were available, fell within a range of one to three times the respective GDP per capita [1]. The present study provides WTP thresholds that are very similar to the GDP of $US20,000 in the KSA, within a range of approximately 1–1.5 times the GDP per capita [26].

Based on the results of the present study, should one be attempting to select boundaries of a range for WTP per QALY thresholds, we suggest expected ICERs < $US20,000 be considered attractive, as that is approximately the calculated median for the 5-year time horizon, whereas ratios > $US60,000 may be considered much less attractive, as it approximates the higher end of the reported means. Ratios falling between these amounts should be assessed individually according to overall and specific conditions and limitations.

One of the major limitations of this study, as mentioned, is the relatively small sample size when taking a country-wide perspective, although it was adequate for this type of study within a delimited segment of the cancer population in the KSA. Also, as no breakdown between family members and other MGP was reported, because it was not part of the study protocol, we could not determine differences in WTP between these two groups. Furthermore, the adjustments made by eliminating the outliers, including those reporting zero utility, further reduced the size of the sample. In addition, the present sample, based on sociodemographic characteristics of participants, especially with a skewed sex distribution, may not be representative of the KSA as a whole. Nonetheless, we believe it may be sufficiently representative, at the very least for the cancer population in the capital. Furthermore, one limitation, which may also be a strength, is that this is the first study of its kind. With all its limitations and lack of overall population representativeness, it remains the study to which future studies will be compared.

5 Conclusion

This study empirically estimated WTP for QALY thresholds of $US19,000–32,000 based on medians and is, to our knowledge, the first study to determine the WTP per QALY in the KSA. Hence, within certain limitations and under certain conditions, we consider it appropriate as a reference until further evidence is determined.