FormalPara Key Points for Decision Makers

Adding cetuximab to FOLFIRI is more cost-effective in patients with metastatic colorectal cancer (mCRC) and a wild-type RAS gene than in the mCRC population at large.

At a £50,000 willingness-to-pay threshold, cetuximab plus FOLFIRI has 63% probability of being cost-effective among patients with the wild-type RAS gene compared with a 2.8% probability among the mCRC population at large.

These results demonstrate potential economic benefits of personalised medicine based on biomarker testing.

1 Introduction

Personalised medicine uses information about specific patient biological attributes and environment to most effectively prevent, diagnose or treat disease. Biomarker testing for a particular biological attribute is one way through which patients may be differentiated [1,2,3,4]. The premise underlying personalised medicine is that therapies demonstrating a particular therapeutic outcome for an overall disease population may show greater effect when targeting a subgroup with a certain set of attributes. Aside from enabling practitioners to more effectively treat patients, personalised medicine may also benefit healthcare systems through enhancing the cost-effectiveness of a particular treatment [5]. In oncology, biomarker testing and stratified medicine may allow development of precision care plans using the most appropriate medication given the biological status of a patient’s tumour [6]. Improved outcomes translate directly into improved cost-effectiveness if the benefit to the patient outweighs the increased costs to the healthcare system, such as from additional drug acquisition and administration and biomarker testing costs [7].

Colorectal cancer (CRC) is one of the most common cancers in the United Kingdom (UK), with an annual incidence of about 40,000 patients [8]. CRC is one of several cancer types associated with overexpression of the epidermal growth factor receptor (EGFR) signalling pathway. Signalling through this pathway results in cell proliferation, inhibition of apoptosis, activation of invasion, metastasis and angiogenesis [9,10,11]. Based on the role of EGFR in cancer, anti-EGFR therapy has been introduced as an approach to reduce intracellular signalling. There are several molecular components downstream of the EGFR pathway that help regulate the effects of EGFR stimulation. The rat sarcoma oncogene (RAS) proteins Kirsten rat sarcoma viral oncogene homolog (KRAS) and neuroblastoma RAS viral oncogene homolog (NRAS) are components of the second-messenger signalling pathway initiated by EGFR, and they help regulate the cell cycle [9,10,11,12]. In some patients, RAS proteins harbour mutations that render these proteins unaffected by any changes induced by anti-EGFR-based treatment [13, 14]. Patients with these mutations can be identified by biomarker testing [12, 15, 16].

Cetuximab is an immunoglobulin G1 monoclonal antibody that targets EGFR, and has been licensed since 2008 in combination with 5-fluorouracil and leucovorin plus oxaliplatin (FOLFOX) and with 5-fluorouracil and leucovorin plus irinotecan (FOLFIRI) chemotherapy for first-line use in patients with metastatic CRC (mCRC). The pivotal CRYSTAL study was an open-label, randomised, controlled, multicentre phase 3 trial that compared FOLFIRI plus cetuximab versus FOLFIRI alone as first-line therapy for EGFR-expressing mCRC [14, 17, 18]. Study results showed significantly longer progression-free survival (PFS) with cetuximab plus FOLFIRI compared with FOLFIRI alone [17]. In the intent-to-treat (ITT) population, comprising all patients randomised to receive treatment, the cetuximab-plus-FOLFIRI arm showed a median overall survival (OS) of 19.9 months compared with 18.6 months in the FOLFIRI-alone arm, corresponding to a 1.3-month benefit with a hazard ratio (HR) of 0.93 (95% CI 0.81–1.07; P = 0.31) [17]. Based on the role of the EGFR pathway in mCRC, investigators examined subpopulations of patients defined by RAS genotype; specifically, KRAS wild-type (wt) patients (tumours had no detectable mutations in codons 12 and 13 of exon 2 of KRAS), and RAS wt patients [meeting KRAS wt criteria plus no detectable mutations in exons 2 (codons 12 and 13) of NRAS, 3 (codons 59 and 61) and 4 (codons 117 and 146) of KRAS and NRAS] [14].

When comparing median OS in the cetuximab-plus-FOLFIRI arm versus the FOLFIRI-alone arm in biomarker-selected subgroups, patients with a KRAS wt genotype demonstrated a 3.5-month benefit in median OS [23.5 vs. 20.0 months, respectively; HR, 0.796 (95% CI 0.670–0.946); P = 0.0093] [18], while patients in the RAS wt subgroup showed an 8.2-month benefit in median OS [28.4 vs. 20.2 months, respectively; HR, 0.69 (95% CI 0.54–0.88); P = 0.0024] [14]. On the basis of these and other data demonstrating enhanced efficacy of EGFR inhibitors in patients with RAS wt tumours, clinical practise guidelines recommend testing for RAS status prior to determining first-line treatment for patients with mCRC, and the indication for cetuximab was updated accordingly [19, 20].

Economic analyses of the cost-effectiveness of cetuximab have been submitted to several UK regulatory bodies for health technology assessments (HTAs), resulting in recommendations for restricted use (Table 1). In the current study, an objective economic evaluation was performed using CRYSTAL trial cohorts to determine if stratified use of cetuximab based on genetic biomarker detection improves cost-effectiveness. This investigation was based on the hypothesis that treating the population(s) that benefit(s) the most should improve the cost-effectiveness of the intervention. This is the first time a common model or platform has been used so that the incremental cost-effectiveness ratios (ICERs) between three populations can be examined objectively in patients with mCRC.

Table 1 Previous cetuximab submissions for UK health technology assessment

2 Methods

Individual patient data (IPD) from the CRYSTAL study were obtained from Merck KGaA (Darmstadt, Germany). All patients in the CRYSTAL study provided written and oral informed consent [17]. Results were categorised into three cohorts: the ITT population, the KRAS wt subgroup and the RAS wt subgroup (defined above) [14], comparing cetuximab plus FOLFIRI versus FOLFIRI alone. FOLFOX was not investigated, because the phase 2 clinical trial that compared the use of cetuximab in combination with FOLFOX versus FOLFOX alone (OPUS) had a smaller patient population, and, thus, greater volatility around the results [26]. Tumour genotypes were determined using a polymerase-chain-reaction technique on DNA extracted from tumour-biopsy specimens. Screening for KRAS mutations in codons 12 and 13 was performed initially and then samples determined as KRAS wt were subsequently screened for the RAS genotype [14, 17].

For the cost-effectiveness comparison, a proprietary, validated cost-utility model was used to compare cetuximab-based first-line treatment regimens with irinotecan-based chemotherapy for patients with mCRC based on PFS, adverse events (AEs) and resource usage. Utilities were obtained from quality-of-life (QOL) data collected during the original CRYSTAL trial [27]. The model was created using a UK National Health Service (NHS) perspective; therefore, only direct costs to the NHS were included. The model was developed in Microsoft Excel with Visual Basic codes to conduct one-way sensitivity analyses (OWSA) and probabilistic sensitivity analyses (PSA) for appropriate pairwise comparisons [28]. The core of the model is a state-transition Markov cohort method developed to simulate patient outcomes and costs for first and subsequent lines of oncology treatment, including long-term survival after a successful curative resection of liver metastases. As opposed to area-under-the-curve analyses, this model is a Markov state and transition model with the probabilities of transitions dependent on time from the beginning of treatment of cohort and on time in state. This means the probability of transition from second-line to third-line therapy is dependent on the time of progression from first-line therapy to second-line therapy. To apply these time-dependent transition probabilities for this sequence of treatments, the model uses tunnel states. This model structure is based on previous models submitted to HTA bodies and subsequently published [29, 30]. To provide a distribution around each parameter, an upper and lower limit was generated by using the 0.025 percentile as a lower bound and the 0.975 percentile as an upper bound of the relevant variable. The specific model used was based on an earlier model developed to demonstrate the cost-effectiveness in the RAS wt cohort alone (see Ref. [29]). The current model was modified so that all three cohorts (ITT, KRAS wt and RAS wt) within the CRYSTAL study could be analysed together. The main adaptive changes within the current model affect a patient’s progression through the first-line heath state, the curative resection rates and the frequency of AEs between cohorts.

A disease-modelling approach (assuming the PFS benefits of first-line cancer treatments translate directly into OS benefits) was selected to examine costs and benefits outside the trial period to compare cohorts over a longer time horizon than the study provided. This model was previously assessed by a National Institute for Health and Care Excellence (NICE) Evidence Review Group (ERG) [22]. The time horizon was set to 10 years, which was considered reasonable to capture all costs and benefits, including those of patients who receive curative surgery, and is consistent with previous HTA submissions (Table 1). The NICE discounting policy was followed, which is to apply a 3.5% discount rate for costs and benefits. We also made no distinction between no resection and unsuccessful resection, because it may only become apparent after initiation of treatment if surgery is an option [31]. The model was updated by constructing the time-to-event data from the IPD for the ITT, KRAS wt and RAS wt cohorts. Model inputs are provided in Table S1. These parameters were then used to generate the transition probability for the Markov cycles. Adverse events (Table S2) and resection rates (Table S3) were also collated. The risk of progression from first-line treatment was applied in the model using the Weibull parametric extrapolations of the time to progression. R and RStudio (version 3.1.2) were used to derive the numerical coefficients for this distribution. The IPD PFS time in the data was converted to weeks to match the transition cycles within the model and uploaded into RStudio. The Weibull parametric model was fitted using ‘survreg()’ [32], then outputs from RStudio created from the model were entered to determine new extrapolation distributions for each of the three cohorts. The proportional hazards assumption was made for the Kaplan–Meier data for the treatment arms from the CRYSTAL trial. Based on experience that the Weibull model was the best fit for this data, further parametric survival analysis was considered to be unnecessary as the objective was to maintain consistency between the core model parameters to compare like with like.

Patients entering the model started in the first-line therapy health state, then either underwent curative resection of liver metastases and entered the postresection health state or progressed to the second-line treatment health state, then to the third-line health state (Fig. 1). Patients received best supportive care as third-line treatment. In the treatment and control arms, second- and third-line progression was not adjusted within the model. It was assumed that the same principles of further progression in the second line, and survival in the third line, were followed for all patients who progressed at different times from the start of first-line treatment. It was also assumed that the survival probability following curative resection was identical for all treatment arms, and survival estimates of curative surgery were based on those reported by Adam et al. [33]. Grade 3/4 AEs from both arms of the three cohorts were sourced and extracted from the CRYSTAL publications [14, 17, 18] and clinical study reports and entered into the model. Treatment costs (acquisition and administration) for all the health states were derived from the 2013 British National Formulary and UK National Reference Costs (Table S4). AE costs depended on the assumption that there would be an associated outpatient or inpatient visit, and included cost of drugs and outpatient visits. For the ITT analysis, costs associated with biomarker testing were removed from the model as this population would not have been stratified before treatment, but costs were included within the other cohorts. Sensitivity and specificity analysis of the biomarker testing was not conducted, as all patients were presumed to be correctly stratified in the post hoc analysis because these biomarker techniques have high technical accuracy [34].

Fig. 1
figure 1

Demonstration of how patients move between the different health states. Patients entered the model in the first-line therapy health state, then either underwent curative resection of liver metastases and entered the postresection health state or progressed to the second-line treatment health state, then to the third-line health state

Cost-effectiveness was measured in terms of costs per life-year (LY) gained and per quality-adjusted life-year (QALY) gained. Both OWSA and PSA were evaluated using standard methods [35]. The PSA was set at 2000 runs and the distributes for the majority of parameters are presented in Table S5. For the OWSA, where the standard error was missing for some variables, it was assumed to be 10% of the mean value. This was held consistent between all cohorts. As an alternative to comparing the resulting ICERs between groups, two willingness-to-pay (WTP) thresholds were also considered in the model at £30,000 and £50,000 (£30,000 based on the upper level of the NICE WTP threshold for the UK, and £50,000 when end-of-life criteria are considered appropriate for a technology) [36].

3 Results

Baseline characteristics were generally consistent between the ITT, RAS wt and KRAS wt populations, as described previously [14, 17, 18]. Results of the cost-effectiveness analysis of the three CRYSTAL cohorts are presented in Table 2. In the deterministic base-case ICER cost per LY gained, results for cetuximab plus FOLFIRI versus FOLFIRI alone were £98,742 for the ITT, £55,242 for the KRAS wt and £34,171 for the RAS wt cohorts, respectively. The results for the deterministic base-case ICER cost per QALY gained were £130,929 for the ITT, £72,053 for the KRAS wt and £44,185 for the RAS wt cohorts. The analysis output demonstrated that the cetuximab arm resulted in an improvement of 0.16 LYs and 0.12 QALYs among the ITT cohort compared with 0.29 LYs and 0.22 QALYs for KRAS wt and 0.45 LYs and 0.35 QALYs for RAS wt. The incremental cost of treatment per patient changed slightly in the ITT (£15,802), KRAS wt (£15,907) and RAS wt (£15,495) groups, which was mainly driven by the increasing proportion of patients who received curative surgery and no longer required second- or third-line chemotherapy. Together these data suggest that the RAS wt group was the most cost-effective of the cohorts.

Table 2 Incremental cost-effectiveness results

The tornado diagrams in Figure 2 show the results of the OWSA on the deterministic base-case ICER for the cetuximab-plus-FOLFIRI versus FOLFIRI-alone comparisons. In all tornado diagrams, the HR of progression from first-line for cetuximab plus FOLFIRI compared with FOLFIRI and the number of months on treatment had the most impact on the model. Resection rates, body surface area and unit cost of cetuximab also had a substantive effect on the ICER. In the ITT cohort, increasing the HR from the base-case value of 0.80 to the upper bound of 1.196 resulted in an ICER of £878,390. Reducing this HR to its lower bound (0.539) generated an ICER of £65,734 (Fig. 2a). Variation in the duration of treatment had a large impact on the ICER. Assuming that the lower bound of this variable (3.51 months) generated an ICER of £78,461, a treatment regimen lasting 8.03 months resulted in an ICER of £176,003. Resection rates also had a considerable effect on the ICER: the lower rate (0.039) of the cetuximab-plus-FOLFIRI ITT cohort increased the ICER to £187,890, and the higher resection rate (0.087) lowered the ICER to £93,904. In the KRAS wt group, increasing the HR from a base-case value of 0.645 to the upper bound of 0.936 resulted in an ICER of £177,050. Reducing the HR to its lower bound (0.444) generated an ICER of £42,153 (Fig. 2b). Variation in the duration of treatment had a large impact on the ICER in this cohort as well, assuming that the lower bound of this variable (3.51 months) generated an ICER of £42,796, while a treatment regimen lasting 8.03 months resulted in an ICER of £98,675. The unit cost of cetuximab and the average body surface area had a slightly greater impact than the resection rates on the deterministic base-case ICER in this group. In the RAS wt group, increasing the HR from the base-case value of 0.564 to the upper bound of 0.782 resulted in an ICER of £83,362. Reducing this HR to its lower bound (0.407) generated an ICER of £27,893 (Fig. 2c). As in the ITT and KRAS wt cohorts, variation in the duration of treatment also had a large impact on the ICER among patients in the RAS wt cohort, assuming that the lower bound of this variable (3.51 months) generated an ICER of £25,626, while a treatment regimen lasting 8.03 months resulted in an ICER of £61,409. Unit cost of cetuximab and the average body surface area had a greater impact than the resection rates on the deterministic base-case ICER (resection rates decreased to sixth place).

Fig. 2
figure 2

One-way sensitivity analysis results. Results of the OWSA on the deterministic base-case ICER for the cetuximab-plus-FOLFIRI versus FOLFIRI-alone comparisons for the a ITT cohort, b KRAS wt cohort and c RAS wt cohort. Values provided for each entry represent the upper and lower limits generated by using the 0.025 percentile as a lower bound and the 0.975 percentile as an upper bound of the relevant variable. BSA body surface area, BSC best supportive care, FOLFIRI 5-fluorouracil and leucovorin plus irinotecan, HR hazard ratio, ICER incremental cost-effectiveness ratio, ITT intent-to-treat, KRAS Kirsten rat sarcoma viral oncogene homolog, OWSA one-way sensitivity analysis, PD progressive disease, PF progression-free, QALY quality-adjusted life-year, RAS rat sarcoma oncogene

Probabilistic sensitivity analysis results are shown on cost-effectiveness scatter plots (Fig. 3) and by cost-effectiveness acceptability curves (CEAC; Fig. 4). To test overall model uncertainty, results for the £30,000 and £50,000 WTP thresholds are shown. The scatter plots indicate that treatment with cetuximab plus FOLFIRI versus FOLFIRI alone is costlier but also more effective (Fig. 3). The least favourable distribution occurred in the ITT cohort (Fig. 3a), where some of the simulations fell into the northwest quartile, where cetuximab plus FOLFIRI would be considered costlier and less effective than FOLFIRI alone. ICER plots shift further east in the quartile as the cohorts move from ITT to KRAS wt (Fig. 3b) and RAS wt (Fig. 3c) based on increased efficacy of cetuximab in these patient groups and result in an increasing proportion of the ICERs below the WTP line. The clusters become more diffuse, indicating more uncertainty as numbers of patients in the sample are reduced. Using a £50,000 WTP threshold, the RAS wt cohort would still have the highest probability of being cost-effective compared with the other cohorts (Fig. 3). Results of CEACs indicate the probability that the treatment strategy of each cohort is cost-effective at different WTP threshold per QALY thresholds. Figure 4 shows that, at the £30,000 WTP threshold, cetuximab plus FOLFIRI in the ITT cohort has a 0% probability of being cost-effective, and, in the KRAS wt cohort, there is only a 5% chance that it is cost-effective. However, in the RAS wt cohort, at the £30,000 limit, there is a 15% probability that cetuximab plus FOLFIRI versus FOLFIRI alone in this group is cost-effective. The probability that the treatment strategy of each population is cost-effective at a WTP threshold of £50,000 is < 5% for the ITT population, 20% for the KRAS wt group and 63% for the RAS wt group.

Fig. 3
figure 3

Probabilistic sensitivity analysis results on cost-effectiveness scatter plots. To test overall model uncertainty, results for the £30,000 (beige dotted line) and £50,000 (green dotted line) WTP thresholds are shown. Point estimates, represented by dots beneath each dotted line, reflect simulations falling under each WTP threshold. Results are presented for the a ITT, b KRAS wt and c RAS wt cohorts. ITT intent-to-treat, KRAS Kirsten rat sarcoma viral oncogene homolog, QALY quality-adjusted life-year, RAS rat sarcoma oncogene, wt wild type, WTP willingness to pay

Fig. 4
figure 4

Cost-effectiveness acceptability curves for the probabilistic sensitivity analysis. Dotted lines show probabilities for each cohort at the £30,000 and £50,000 WTP thresholds. Note, at the £30,000 WTP threshold, cetuximab plus FOLFIRI versus FOLFIRI alone had a 0% probability of being cost-effective for the ITT cohort (red line), 5% for the KRAS wt cohort (green line) and 15% for the RAS wt cohort (blue line). At the £50,000 WTP threshold, the probability that the treatment strategy is cost-effective was < 5% for the ITT, 20% for the KRAS wt and > 60% for the RAS wt cohorts. FOLFIRI 5-fluorouracil and leucovorin plus irinotecan, ITT intent-to-treat, KRAS Kirsten rat sarcoma viral oncogene homolog, QALY quality-adjusted life-year, RAS rat sarcoma oncogene, wt wild type, WTP willingness to pay

4 Discussion

Research to date has shown that the majority of treatments prescribed for various disease conditions are effective in < 60% of treated patients (25–30% in oncology), underscoring the potential (and need) for efficiency gains in healthcare delivery [37]. Stratified medicine is a new approach that may improve medical outcomes for the patient and the healthcare system by matching therapies to specific patient populations using clinical biomarkers and diagnostics. Stratified medicine can enhance patient care through the development and administration of safer and more effective drugs delivered with a greater chance of successful treatment. Additionally, more accurate targeting of patients with the most effective medication will decrease the burden on healthcare systems by more efficient healthcare delivery [37]. Despite the potential for stratified medicine to foster greater success in treatment outcomes, it is currently not widely used, due, in part, to scientific barriers, economic concerns and difficulties in securing coverage and adequate reimbursement. Many currently available tests do not have the sensitivity required to identify clinically meaningful differences between patient populations to facilitate effective stratification [37].

Healthcare interventions with demonstrated clinical efficacy and marketing authorisation may not ultimately be used in the originally intended population, as access to the healthcare system may be decreased due to a high ICER. However, a treatment that is not used cannot offer a benefit to the patient or the healthcare system. The aim of this study was to investigate this question through an economic evaluation of the three cohorts (ITT, KRAS wt and RAS wt) from the CRYSTAL study and determine if the stratification of cetuximab plus FOLFIRI versus FOLFIRI by biomarker status improves its cost-effectiveness. Based on our results, among patients with mCRC treated with anti-eGFR therapy, treatment of patients in the RAS wt cohort demonstrated the most cost-effectiveness of the three cohorts.

Our results are consistent with those of previous studies that support the notion that targeting patients with RAS wt tumours is more cost-effective than treating patients with KRAS wt tumours; however, our study is unique in that it compares three cetuximab populations at once. The FIRE-3 trial [38, 39] compared cetuximab plus FOLFIRI with bevacizumab plus FOLFIRI (not FOLFIRI alone) and compared only the KRAS wt versus the RAS wt populations. In that study, first-line treatment alone and not sequential use of the products likely drove the outcome [40], whereas the CRYSTAL data demonstrated a clear implication that cetuximab drives the improved clinical benefit and cost-effectiveness in targeting therapy. Additionally, Wen et al. [39] discussed the added costs of RAS screening versus KRAS screening and displayed results in quality-adjusted life-months. Kircher et al. [41] found that the increased societal cost of expanded RAS testing versus standard approved KRAS exon 2 testing was inconsequential compared with the savings achieved by not treating the 18% of patients who harbour additional RAS mutations (beyond exon 2) with anti-EGFR therapy. Another study revealed that additional savings may be obtained when testing for mutations in the BRAF gene is added to KRAS screening [42].

In this study, the economic analysis revealed that variation in incremental cost was driven by the changing proportion of patients in the cetuximab-plus-FOLFIRI or the FOLFIRI-alone arms who received curative surgery and no longer required second- or third-line chemotherapy. Even with the costs of biomarker testing considered, the RAS wt cohort had the lowest overall costs of treatment in the cetuximab arm and the lowest incremental cost of treatment of all the groups.

The OWSA model was particularly sensitive to the time spent in first-line progression. It could be determined from the OWSA that the variables around the hazard function in first-line progression (derived from the Weibull distribution) determine the greatest variation around the base-case ICER. This variation was particularly large in the ITT group, as the upper-bound HR crossed above 1.0. The HR of progression and the number of months on treatment were the variables that had the most impact on the model. In the tornado diagrams, acquisition costs associated with cetuximab are among the largest determinants of the ICERs; however, overall, the relative influence of variables on the ICER changes between the groups. Results of our PSA analysis support the OWSA findings. The shift in the ICER pointed toward greater benefit, and the increasing proportion below the WTP thresholds visually demonstrates the improving ICERs as the sample size decreases from the ITT and KRAS wt to RAS wt cohorts due to extended restriction of the population of interest based on biomarkers. In the ITT scatter plot, modelled observations are located in the northwest quadrant, representing a level of unfavourability in the ITT group. In these scenarios, cetuximab treatment would not be beneficial in this small proportion of patients and may cause harm. Additionally, it should be noted that even targeting therapy by treating RAS wt patients still results in a relatively low cost-effectiveness probability of 63%.

Considering the findings of our study, it should be noted that our results may have been limited by the fact that post hoc analysis or retrospective auditing of clinical studies is not generally considered the best source of evidence due to risks associated with bias. Additionally, a disease-modelling approach was used (assuming the PFS benefits of first-line cancer treatments translate directly into OS benefits), and in this evaluation parametric curves were fitted to the PFS data. In the case of the CRYSTAL study and subsequent analysis, the PFS results do transform into OS gains. However, for completion, repeating the analysis using the OS data would be the next logical step. This may have an impact on the final deterministic ICER results, but it should not alter the cost-effective trends between the cohorts and it is likely to result in a similar conclusion.

5 Conclusions

Targeting therapy to patients based on a genetic biomarker can notably decrease the ICER, which demonstrated that the increase in benefit to patients did outweigh the increased costs to the healthcare system. The base-case deterministic ICER results demonstrated that the ICER declined as the patient population was stratified for the RAS biomarker. The improved ICER corresponded with the increase in survival benefit from cetuximab seen for those patients with the tumour biomarker (KRAS or RAS). The RAS wt cohort had the lowest ICER; therefore, cetuximab plus FOLFIRI was the most cost-effective versus FOLFIRI alone in this subgroup at a WTP threshold of £50,000.