FormalPara Key Points

We examined the cost-effectiveness of trastuzumab + chemotherapy as adjuvant chemotherapy (AdjCT) in patients with human epidermal growth factor receptor 2 (HER2)-positive breast cancer (BC) aged over 70 years in Japan, comparing it with trastuzumab monotherapy using the three-state partitioned survival model.

The base-case analysis suggests that AdjCT with trastuzumab + chemotherapy is likely to be a cost-effective choice for patients with HER2-positive BC aged 70 years or older.

In contrast, this result was uncertain because the incremental cost-effectiveness ratios and quality-adjusted life years of trastuzumab + chemotherapy mainly depended on the number of events and long-term efficacy in both arms.

1 Introduction

Breast cancer (BC) remains the most common cancer among Japanese women, with an estimated 106,583 diagnoses in 2018. The proportion of patients over 70 years, who were the target population in this study, was 34.1% of all patients [1]. BC treatment is cardinally selected based on assessments of patient states and prognostic factors. The Japanese Breast Cancer Society’s clinical practice guidelines 2018 and National Comprehensive Cancer Network guidelines recommend surgical resection in the early stage of cancer (stages 0–2). In addition, neoadjuvant or adjuvant chemotherapy (AdjCT) may be administered, depending on the size of the tumor [2, 3].

Medicines used in adjuvant treatment for BC are classified according to the presence of hormone receptors and human epidermal growth factor receptor type 2 (HER2), and include hormonal therapy, molecular targeted drugs, and other approaches. Several studies have reported the efficacy of trastuzumab as an adjuvant with chemotherapy, showing improved disease outcomes in patient with HER2-positive primary BC [4,5,6]. This combination regimen has been the standard treatment for HER2-positive primary BC in Japan [7,8,9].

Additionally, the increase in medical expenditure for cancer treatment is a major social issue in developed countries. Therefore, it is essential to not only assess treatment efficacy and safety, but also the cost of various treatment modalities that are used to treat cancer. Having access to such information is also essential [10]. So far, health economic evaluation has not been used for political decision-making in Japan. However, examining the health economic aspects of expensive medicines used in oncology, including BC, would significantly help to clarify the medical resource allocation guidelines. The benefits of trastuzumab in AdjCT, which we discuss in this study, have been evaluated from the economical aspect, and many studies have reported it to be cost-effective [11,12,13,14,15,16,17,18,19]. However, several safety risks have been reported for trastuzumab with chemotherapy, and its use may be avoided in elderly patients with BC [20,21,22]. Few studies have examined its effectiveness in AdjCT in patients aged over 70 years. Moreover, we have yet to uncover how the improved efficacy and safety from adding chemotherapy to trastuzumab monotherapy will affect the cost-effectiveness of this regimen. The objective of this study was to examine the cost-effectiveness of trastuzumab + chemotherapy as an AdjCT of patients with HER2-positive primary BC over 70 years in Japan, comparing it with trastuzumab monotherapy. The purpose of this study was one of the secondary endpoints in the RESPECT trial, a randomized, open-label clinical trial. The RESPECT trial was registered with ClinicalTrials.gov (identifier: NCT01104935) on 6 November 2009 [23].

2 Methods

2.1 Patient Population and Clinical Trial

The base-case population modeled in this study was Japanese women with HER2-positive primary BC with a median patient age at treatment initiation of 75 years in the RESPECT trial. We chose trastuzumab + chemotherapy as the comparison arm (H + CT arm) and trastuzumab monotherapy as the control arm (H arm) to achieve the study objective.

Briefly, the main inclusion criteria of the RESPECT trial were invasive BC diagnosed as HER2-positive; stage I, IIA, IIB, or IIIA cancer; and left ventricular ejection fraction above 55%. The full analysis set (FAS) of this trial was made up of patients who initiated the intervention during the treatment period. Extrapolation parameters for cost utility analysis in this study were calculated from FAS results in the RESPECT trial. The patient background of FAS and patient flow diagram in the RESPECT trial are described in the Online Supplementary Material (OSM) Table 1 and Fig. 1. In the RESPECT trial, trastuzumab in both arms comprised a loading dose of 8 mg/kg and a maintenance dose of 6 mg/kg every 3 weeks for 1 year. Chemotherapy in the H + CT arm involved one of the prescribed regimens (paclitaxel, docetaxel, doxorubicin + cyclophosphamide, epirubicin, cyclophosphamide + methotrexate + 5-fluorouracil, docetaxel + cyclophosphamide, and docetaxel + carboplatin) for each patient for up to 24 weeks.

Table 1 Input utility parameters
Fig. 1
figure 1

Partitioned survival model with three health states

2.2 Model Structure and Design

A partitioned-survival model was developed to evaluate the cost-effectiveness of the H + CT arm versus the H arm for AdjCT in older patients with HER2-positive primary BC. A partitioned survival model is a common modeling structure in the field of oncology health technology assessment process and is recommended by the NICE Decision Support Unit [24]. Three health states were created in a partitioned survival model, namely stable disease (SD), progressive disease (PD), and death (Fig. 1). In this model, the proportion of patients in each health state in each cycle was estimated by calculating the area under the curve per cycle from the disease-free survival (DFS) and overall survival (OS) curves for each study arm.

Therefore, all patients in this model entry were either on trastuzumab with chemotherapy or trastuzumab only as the SD in the first month. After the second month, the percentage of patients in each health state varied according to the percentage of DFS and OS for each treatment arm. The duration of treatment for trastuzumab and chemotherapy was assumed to follow the protocol rules of the RESPECT trial. Specifically, trastuzumab was administered for 1 year, and other chemotherapies were administered for between 12 and 24 weeks, depending on the regimen. We calculated the total effectiveness and total costs for each study arm over the time horizon of the model. The effectiveness in this study was quantified as quality-adjusted life years (QALYs). These were used to estimate the incremental cost-effectiveness ratios (ICERs).

We analyzed from the payer’s perspective, including the direct medical cost involved in a patient’s treatment. The cycle length was 30 days (1 month) to capture the impact on both arms in detail. The base-case analysis period was 30 years to follow patients throughout their lifetime. All costs and QALYs in the base case were discounted at 2% per annum (in accordance with the guidelines for cost-effectiveness evaluation in Japan) [25]. The partitioned survival models in this study were created using Microsoft Excel (Microsoft Corporation, 365 MSO, 64 bit), and all extrapolation parameters were analyzed using SAS version 9.4 (SAS Institute, Cary, NC, USA) and R version 3.6.3. In reporting the results of this study, we adhered to ISPOR’s good reporting practice guidelines for health economic evaluations (CHEERS checklist) [26].

2.3 Efficacy

We derived the clinical efficacy (DFS and OS durations) of each arm using individual patient data of FAS in the RESPECT trial. Specifically, we estimated parametric curves to model the probability of OS and DFS during and after the RESPECT trial using individual patient data and model functions. Model functions for DFS and OS parametric curves were selected from exponential, Weibull, Gompertz, log-normal, log-logistic, and generalized gamma distributions based on assessments of curve visualization and goodness of fit given by the Akaike Information Criterion (AIC) value and Bayesian information criterion (BIC) [27]. This process of curve estimation with these model functions is the recommended option in the NICE Decision Support Unit [24].

We also confirmed the clinical and biological plausibility of the curve based on the consistency of the estimated DFS and OS parametric curves and natural history in Japan [28, 29]. We estimated a parametric curve for OS using data from the RESPECT trial with up to 5 years of follow-up because the median follow-up of the trial was 4.1 years (range 0.3–8.0 years), and most patients in the trial were censored at the time of follow-up. However, for the curve selection of visualization, we also referred to the Kaplan–Meier curve generated based on the data of the whole period. The upper limit for the estimation of the OS curve was set using life tables for the general population in Japan, based on the "Statistics Japan Report, 2018" [30]. The survival of a few patients with long-term follow-up in the trial was considered to be useful in estimating the survival curve. The monthly proportion of health states in SD was calculated from the selected DFS parametric curve, and the monthly proportion of health states in PD was calculated from the difference between the DFS parametric curve and selected OS parametric curve.

2.4 Health-Related Utilities

Health-related utility values were derived from individual patient data of the FAS population in the RESPECT trial based on EuroQol 5-Dimension 3-Level (EQ-5D-3L) questionnaires developed by the EuroQol Group task force [31]. The EQ-5D-3L index used five levels of severity in the five dimensions (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression) [32]. We calculated the health-related utility value based on the response of individual patients and utility value set in Japan [33]. The EQ-5D-3L questionnaires in this study were administered at baseline, 2 months, 1 year, and 3 years. All 232 patients included in this study (H + CT arm: 113, H arm: 119) were available at baseline of the RESPECT trial (OSM Table 2a).

Table 2 Input cost parameters (Japanese Yen (JPY)/month)

Due to limited patient data, we extrapolated utility values from the study by Lloyd et al. as health-related utility values in the long-term SD and PD states [34] (Table 1).

2.5 Costs

We derived cost parameters from the receipt data of individual patients in the RESPECT trial as monthly direct medical costs based on receipts and out of hospital prescription data.

The direct cost was defined as “all costs due to resource use that are entirely attributable to the use of a medical intervention or disease treatment” based on receipt data and out-of-hospital prescription data [35]. In the RESPECT trial, 30 patients (H + CT arm: 14, H arm: 16) had been randomly selected for receipt data collection (“Cost population,” OSM Table 2b). Specifically, we calculated the costs in the base case for the SD and PD states from the start of follow-up to 2 years after the beginning of the RESPECT trial (Table 2).

To account for variability in individual patient costs in the RESPECT trial, we calculated the costs for each arm at 6, 12, and 24 months. Because no substantial change in cost was expected unless there was disease progression occurred, we assumed that the cost at month 25 and beyond in the SD state would be the same as at month 24. We estimated the cost of the PD state using patient data of both arms for the PD state in the RESPECT trial because very few patients in that trial reached the PD state. All costs were presented in 2019 Japanese yen (JPY) and converted to US dollars (US$, JPY 1 = US$107). In the COST population, we examined the ICER calculations and utility value, which shows the trends of the population. The results of the utility value in the COST population were similar to those of the overall population. Furthermore, when the patient background of the cost population was compared with the overall population, no significant bias in patient background was identified (OSM Table 3). Therefore, we examined the effects of various parameters using sensitivity analysis.

Table 3 Total cost, total QALYs, and ICER for each regimen

2.6 Sensitivity Analysis

We conducted probabilistic sensitivity analysis (PSA) to estimate the uncertainty in this analysis [36]. DFS, OS, utility values, and costs were calculated with the bootstrapping method using individual patient data from the RESPECT trial. In bootstrapping, we repeated the process 2,000 times, randomly selecting the same number of subjects as in the RESPECT trial from individual patient data and calculating the ICERs. The model function to estimate DFS and OS parametric curves were fixed to those of the base-case analysis, since we examined parameter (sampling) uncertainty in this PSA.

We performed the one-way sensitivity analysis (OWSA) to consider variations in the ICERs when the range of parameters changed independently. The cost parameter was varied by up to ± 20% and the utility parameter was varied by up to ± 5%. The range of variation was set from the 95% confidence interval (CI) for each parameter. The OWSA was performed using time horizons of 10 or 15 years and a discount rate of 3%. In addition, we also performed two scenario analyses. We assumed no difference in OS between the two arms or in OS or quality of life (QOL) after 36 months.

3 Results

3.1 Efficacy Results

The Gompertz and generalized gamma distributions were, respectively, found to be the best fits for the DFS and OS data in the RESPECT trial (Fig. 2). These distributions were selected from AIC, BIC as well as estimated curve visualization results (OSM Table 4 and OSM Fig. 2).

Fig. 2
figure 2

Estimated disease-free survival and overall survival curves. H + CT trastuzumab with chemotherapy, H trastuzumab monotherapy

Table 4 One-way sensitivity analysis

3.2 Base-Case Analysis

Over a lifetime horizon of 30 years, the total cost per patient for trastuzumab + chemotherapy was calculated as JPY 14.6 million (US$137,000), compared with JPY 14.2 million (US$132,000) for trastuzumab monotherapy, and the total QALYs were 9.308 and 9.101, respectively (Table 3). The incremental cost of trastuzumab with chemotherapy per patient was JPY 0.56 million (US$5000), and the incremental QALYs of trastuzumab with chemotherapy was 0.206. As a result, the ICERs of trastuzumab with chemotherapy versus trastuzumab monotherapy was JPY 2.7 million/QALY (US$25,200/QALY).

3.3 Probability Sensitivity Analysis

The results from the PSA (2000 bootstrapping simulations) showed that the mean ICER of trastuzumab with chemotherapy versus trastuzumab monotherapy was JPY 2.9 million/QALY, (US$26,900/QALY) (Fig. 3). The cost-effectiveness acceptability curves showed that at a willingness-to-pay threshold of JPY 5.0 million/QALY, the probability of being cost-effective was 51.75% for trastuzumab with chemotherapy (Fig. 4). However, when we excluded the ICERs with negative values, the probability that the ICER would be less than JPY 5.0 million/QALY was 62.2%.

Fig. 3
figure 3

Scatter plot of probabilistic sensitivity analysis results. WTP willingness to pay, QALY quality-adjusted life year

Fig. 4
figure 4

Cost-effectiveness acceptability curves. QALY quality-adjusted life year

3.4 One-Way Sensitivity Analysis

Table 4 shows the results of the seven OWSAs with the effects on the base-case ICER of trastuzumab + chemotherapy against trastuzumab monotherapy. Scenarios with independent variations in cost and QALYs of SD and PD had a large impact on the ICERs. The range of ICERs in cost scenario (Scenarios 1 and 2) was between dominant and JPY 10.3 million/QALY (between dominant and US$96,400/QALY). The range of ICER in utility value scenario (Scenarios 3 and 4) was between JPY 1.01 million/QALY and dominated (between US$9,460/QALY and dominated).

Additionally, the analysis assuming no difference in OS between the two groups (Scenario 8) and assuming no difference in OS and utility value after 36 months (Scenario 9) resulted in an increase in the total cost and total QALYs in the H arm (difference in total cost from H + CT = JPY 0.39 million, difference in total QALY from H + CT = 0.073 ~ 0.107).

4 Discussion

We conducted a cost-effectiveness analysis of the AdjCT of trastuzumab with chemotherapy against trastuzumab monotherapy for patients with HER2-positive primary BC aged over 70 years. The efficacy of both arms was demonstrated in a randomized, open-label clinical trial, known as the RESPECT trial. No characteristic bias was observed in the patient background between the two arms in the RESPECT study. We compared the H + CT arm with the H arm for adjuvant therapy for patients with HER2-positive BC aged 70 years or older. The ICER of the H + CT arm over the H arm was JPY 2.7 milllion/QALY (US$25,200/QALY). Although there is no clear threshold in Japan for the superiority of ICER-based cost-effectiveness analysis results, considering the study by Shiroiwa et al., we could assume that the H + CT arm with an ICER of less than JPY 5 million/QALY was cost-effective based on the results of the base case [37]. However, the PSA results showed an extensive range of variability in the ICERs due to changes in parameters. Additionally, Moreover, because the HR of the OS in the RESPECT trial was 1.07 (95% CI, 0.36 to 3.19) and no significant difference in OS was observed, we set up Scenarios 8 and 9 to examine the degree of uncertainty in OS and long-term utility. We found that the total cost and total QALYs for the H arm were higher than for the H+CT arm. Furthermore, the clinically meaningful HRQoL deterioration rate showed significant differences 1 year (19% vs. 38%; P value = 0.009). The results of Scenario 9 suggest that even though the rate of conversion to PD with greater disutility was higher in the H arm, the H arm with less disutility during SD had a more substantial impact on total QALYs. In other words, the disutility of trastuzumab + chemotherapy as AdjCT in elderly patients with BC might have a greater impact on total QALYs than the utility of trastuzumab + chemotherapy with a slight extension of DFS. Thus, the greater disutility of adding chemotherapy to trastuzumab may have a greater impact on total QALYs than the utility gained by a small extension of DFS. These results suggest a high uncertainty in the ICER based on long-term estimates of efficacy parameters using the current evidence.

There are three main reasons for this uncertainty. First, the probability of occurrence was low in the RESPECT trial due to the characteristics of BC. The 10-year survival rate for BC in Japan is approximately 80% for all patients and over 90% for patients with stage 0 or 1 BC. In the RESPECT trial, more than 80% of patients had stage 1 or 2 BC. Most subjects did not experience events, such as stage progression or death, during the average 4-year follow-up period. However, long-term estimation of efficacy may have an increasing impact on the ICER and is a source of expanding uncertainty of the ICER. To minimize the uncertainty in the long-term estimation of efficacy, we used OS for 5 years after the start of this model. We also evaluated the curve concerning the curve visualization and natural history of DFS, OS, and the general Japanese population to select an appropriate model function. This suggests that further follow-up trials for evaluating the long-term efficacy in both arms are necessary to perform more accurate analyses.

Second, the difference in survival between the two arms is very small. The loss of OS in the H arm versus the H + CT arm over 3 years of follow-up was less than 1 month in the RESPECT trial, and the difference in incremental QALYs between the two arms in this analysis was only 0.206 in a lifetime (30 years). Therefore, the short analysis period affected the ICER. When the time horizon was 15 years, the ICER of the H + CT arm over the H arm was more than JPY 10 million/QALY. Therefore, considering that the average age of the subjects at the start of the RESPECT trial was 74 years and the life expectancy of women in Japan is 87.5 years (2019), it is suggested that it is uncertain whether H + CT is cost-effective for patients with a low life expectancy [38].

Thirdly, the difference in the total cost and QALYs between the two arms is very small. In the base case, the difference in total cost between the two arms is approximately JPY 0.5 million (US$4670), and the difference in total QALY between the two arms is only 0.206. This difference is even smaller in Scenarios 8 and 9, where the OS and long-term utility value differences are eliminated. Therefore, even if the variation in the cost and utility parameters in OWSA is small, the impact on the ICER is large, and the ICER will fluctuate significantly. This suggests that the economic advantages and disadvantages of the two arms are small.

This study has three limitations. First, the RESPECT study has limited patient data for accurate estimation of direct medical costs. In this study, we collected patient receipt data in the RESPECT trial to calculate the cost parameters so as to increase internal validity. Since the cost parameters collected in the RESPECT trial had a large variability, we were concerned about the impact on the ICER. However the patients used to estimate the cost parameters were randomly selected and their backgrounds do not differ significantly from the overall population, we believe that the cost parameters can ensure a certain degree of representativeness in the RESPECT trial. On the other hand, the variability is not small, and the maximum standard deviation in the cost parameter is approximately JPY 189,700 (US$1770). In fact, from Scenarios 1 and 2, the ICER of the H+CT arm varied from dominant to more than JPY 10 million ($93,500). Therefore, for future analysis, a sample size of at least 150 participants per arm should be used.

Second, the long-term utility value is uncertain. Utility values in the RESPECT trial were measured using the EQ-5D-3L up to 36 months. In this trial, the utility value after 37 months was analyzed by extrapolating the utility value of metastatic BC in previous UK studies because there are no Japanese studies on such parameters [34]. The target population for the outfitted utility values included patients with metastatic BC but not those aged over 70 years, which was the target population for this study. The study in Japan has also reported that age is strongly correlated with QOL values, suggesting that utility values decrease with age [39]. Furthermore, we conducted our analysis using utility values of metastatic BC, which is more advanced in stage than our target disease, i.e., postoperative BC. This is because we can conduct the most conservative analysis possible without overestimating the effect of each arm.

Finally, since the RESPECT trial is an open-label trial, there are concerns that it may affect the determination of DFS and the measurement of utility value.

Based on these analyses, because the ICER of the H + CT arm against the H arm in the base-case analysis was below JPY 5.0 million/QALY, the economic advantage of H + CT therapy over H therapy is not necessarily guaranteed. The reason for this is the uncertainty of the long-term parameters of efficacy, cost, and utility value. To resolve these uncertainties, it will be necessary to collect additional information on the long-term prognosis and associated health-care costs and utilities after AdjCT for BC in elderly patients. In addition, the small difference in the total cost and total QALY between the two arms may also be a factor that increases the uncertainty. Furthermore, our sensitivity analysis showed that H therapy might have similar resource consumption and QALY gain as H + CT therapy. Therefore, the benefit of H therapy as AdjCT for BC in elderly patients with deteriorating systemic conditions should be further investigated.

5 Conclusion

The ICER for trastuzumab with chemotherapy as adjuvant chemotherapy for patients with HER2-positive BC aged 70 years or older in Japan was below JPY 5.0 million/QALY. On the other hand, it was suggested that additional investigation of some parameters with the uncertainty that could have a large impact on ICER is needed. Finally, our findings suggest that the resources consumed and QALYs gained by both treatments may be comparable.