Introduction

Personalised medicine is a twenty-first century focus. The concept implies that the drug/treatment choice for a specific patient is based on their biomarker profile. Oncology research has made important contributions within the field; precision cancer medicine aims at providing anti-cancer drugs to those who are likely to respond to the treatment and to avoid such drugs when the opposite can be expected. Indeed, these drugs are often associated with severe adverse reactions which need to be avoided if not clearly counterbalanced by beneficial effects. Furthermore, there may be great heterogeneity between tumours, and treatment may be effective only in a subset of patients. Drug development in oncology has therefore focused on defining genetic and molecular characteristics of the tumour to select patients likely to benefit from treatment [1,2,3,4,5].

Breast cancer, the most common cancer in women and the leading cause of cancer deaths worldwide [6, 7], has been a pioneer target for personalised medicine. The discovery of the oestrogen receptor (ER) and human epidermal growth factor receptor-2 (HER2) has enabled development of blocking therapies. In 2000–2001, the first gene expression profiling data were published, distinguishing subclasses with differences in biology and outcome [8, 9]. Subsequently, gene expression assays were developed to provide prognostic and predictive information to inform chemotherapy decision making. Several assays, covering various tumour genes, are commercially available providing information on the risk of recurrence [10]. To determine the use of this technology in health care, the benefits and risks have to be assessed. Ideally, such assessments should be based on evidence of effects on patients.

About 75% of breast cancers is hormone-sensitive (luminal) and HER2-negative [11]. Treatment decisions are based on the risk of recurrence, determined by tumour stage, and histopathological data and biomarker status as well as menopausal status [12]. For patients at intermediate risk of recurrence, decision making may be particularly difficult; there may be beneficial effects of adjuvant chemotherapy but the risks associated with such treatment are not negligible. Indeed, chemotherapy is associated with fatalities and severe, sometimes persistent, adverse reactions such as neuropathy [13,14,15,16]. Gene expression assays are included in guidelines to identify patients from whom chemotherapy can be withheld [17,18,19]. As far as we are aware, a summarised evidence base is currently lacking regarding patient effects of use of gene expression assays to inform chemotherapy decisions in the subgroup of patients where the clinical risk of recurrence does not suffice for clear-cut decisions. Indeed, previous systematic reviews within the field have had a wider scope [20,21,22,23]. Therefore, we performed this study to assess the evidence on critical patient effects, such as overall survival, recurrence and health-related quality of life (HRQL), of using molecular profiling to inform chemotherapy decisions in this clinically relevant patient group. This evidence is relevant in decision making at both the patient and the health care levels.

Methods

We performed a systematic review according to established routines at the regional health technology assessment (HTA) centre (HTA-centrum) in Region Västra Götaland, Sweden. The aim was defined in two PICOs (Patients, Intervention, Comparison, Outcome; Fig. 1).

Fig. 1.
figure 1

Patients and comparison groups for whom the outcomes overall survival, health-related quality of life (HRQL) and recurrence were evaluated. CT, chemotherapy; GEP, gene expression profile; HR, hormone receptor; HER2, human epidermal growth factor 2; N0-1, with no (N0) or one to three (N1) axillary lymph node metastases

In the first PICO, the evidence for horizontal prioritisations was reflected i.e. the scientific basis for decision making regarding the use of gene expression assays in early breast cancer from an overall health care perspective. Patients (P) were individuals with post-surgical breast cancer at intermediate clinical risk of recurrence i.e. ER-positive, HER2-negative and with up to three axillary lymph node metastases (N0-1). The intervention (I) was a gene expression assay, including the patient management and chemotherapy decision making based on the test results. The comparison (C) was no gene expression assay, including standard patient management and chemotherapy decision making. Outcomes (O) were overall survival, HRQL and recurrence. The outcome HRQL was chosen to capture the experience of adverse effects of chemotherapy.

In the second PICO, evidence for decision making at the patient level, i.e. the scientific basis to be guided in chemotherapy decisions by a gene expression assay, was reflected. The patients were the same as those in the first PICO, namely patients in whom the clinical risk of recurrence did not suffice for clear-cut decisions, with the addition of the tumour being categorised as low/intermediate risk of recurrence based on a gene expression assay. The intervention was to withhold chemotherapy, and the comparison was to provide chemotherapy. The outcomes were the same as for the first PICO: overall survival, HRQL and recurrence.

We included both randomised controlled trials (RCTs) and non-randomised controlled trials (non-RCTs). We restricted the search to English or Scandinavian-language (Swedish, Danish and Norwegian) publications.

Literature search and study selection

Systematic searches during August 2018, with updates in January 2019 and April 2020, covering the period from January 2002, were performed in Medline, Embase, PubMed, the Cochrane Library, PsycINFO and a number of HTA databases. Search strategies are provided in Appendix 1. Reference lists of relevant articles were scrutinised for additional references. To identify ongoing or completed but not yet published studies, we searched Clinicaltrials.gov in December 2018, with an update in April 2020.

Identified abstracts were screened by two persons and those that did not meet the PICO criteria were excluded in a consensus discussion. When there were uncertainties regarding inclusion/exclusion, the full text was retrieved. For articles excluded in consensus, after full-text reading, reasons for exclusions were recorded. The remaining studies were included in the systematic review.

Data extraction and quality assessment

Data were extracted from the studies by one author and were subsequently checked by the other authors. Data extraction included the number of individuals in the intervention and control groups, the type of gene expression assay used and the results. When the number of events in the randomisation groups was not available in the original RCT for poolable results, the corresponding author was contacted to obtain the relevant information.

The studies were critically appraised by all authors, according to checklists from the Swedish Agency for Health Technology Assessment and Assessment of Social Services (SBU) [24]. These include assessment of three domains: directness, risk of bias and precision. The authors discussed the assessments and categorised each study as having no or minor problems (+), some problems (?) or major problems (–) in each domain. Disagreements were resolved by discussion. The certainty of evidence, i.e. the confidence in the effect estimate, was then assessed using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) [25].

Statistics

RCTs were pooled in random-effects meta-analyses using the software Review Manager (RevMan) version 5.3 (The Nordic Cochrane Centre, The Cochrane Collaboration, Copenhagen, Denmark). Heterogeneity was assessed with I2. The individual studies and the pooled estimates were presented in forest plots. Results are presented as risk ratios (RRs) and 95% confidence intervals (CIs).

Results

After removal of duplicates, 2,824 references were identified, 17 of which fulfilled the criteria of either of the PICOs (Fig. 2). Studies excluded after full-text reading are presented in Appendix 2.

Fig. 2
figure 2

Flowchart of studies included in this systematic review

Study characteristics

Three RCTs and 14 non-RCTs were included in the review (Table 1). Four non-RCTs investigated patient outcome with and without a gene expression assay [26,27,28,29], while the remaining three RCTs [30,31,32] and ten non-RCTs [33,34,35,36,37,38,39,40,41,42] investigated withholding versus providing chemotherapy. From two RCTs, only a subset of the patients fulfilled the P criteria of this review [30, 31].

Table 1 Characteristics of studies fulfilling the PICO criteria. The 95% confidence interval (CI) is presented in parentheses

Gene expression assay versus no gene expression assay

No RCT and four non-RCTs [26,27,28,29] reported results regarding overall survival and/or recurrence in patients with intermediate clinical risk of recurrence where a gene expression assay had been, versus had not been, performed. No studies evaluated effects on HRQL.

Overall survival

Three studies reported results regarding overall survival, using a cohort design [27, 29] or a historic control [28], and including a total of 60,286 patients. Results favouring the use of a gene expression assay were reported in one study [29], whereas the other two studies reported no difference [27, 28]. The certainty of evidence was downgraded one step as the characteristics of the compared groups either differed [27, 28] or were not reported [29]. In summary, it is uncertain whether use of, versus no use of, a gene expression assay affects overall survival in breast cancer patients with intermediate clinical risk of recurrence (very low certainty of evidence, GRADE ⊕◯◯◯).

Recurrence

Three studies reported results regarding recurrence, using a cohort design [26, 27] or a historic control [28], and including a total of 2,756 patients. One study reported prolonged time to recurrence when a gene expression assay had been used [27], whereas the other two studies reported no difference [26, 28]. The certainty of evidence was downgraded one step as the characteristics of the compared groups differed, and there was some uncertainty about the directness and the precision. In summary, it is uncertain whether use of, versus no use of, a gene expression assay affects recurrence in breast cancer patients with intermediate clinical risk of recurrence (very low certainty of evidence, GRADE ⊕◯◯◯).

Withholding versus providing chemotherapy

Three RCTs and ten cohort studies investigated withholding versus providing chemotherapy regarding the overall survival and/or recurrence in patients with intermediate clinical risk of recurrence and low/intermediate genetic risk of recurrence. No studies evaluated effects on HRQL.

Overall survival

Overall survival was reported in one RCT [32] and four cohort studies [33,34,35, 37]. In the RCT, including 6,711 patients, similar overall survival rates were found at 9 years: 93.9% and 93.8% in the intervention and comparison groups, respectively [32]. Loss to follow-up differed between the comparison groups and was large in relation to the number of events. Three cohort studies reported non-significant results [33, 34, 37], whereas the remaining study reported significantly better outcome for patients who had been administered chemotherapy [35]. In summary, withholding adjuvant chemotherapy to breast cancer patients with intermediate clinical risk of recurrence and low/intermediate risk according to a gene expression assay, compared with providing chemotherapy, probably results in little or no difference in medium-term survival (moderate certainty of evidence, GRADE ⊕⊕⊕◯).

Recurrence

Three RCTs and seven cohort studies reported data on recurrence. The RCT results regarding distant recurrence could be pooled in a meta-analysis; 4.29% and 3.88% of the patients had such an event when chemotherapy was not offered and offered, respectively, based on the results of a gene expression assay. The absolute risk difference was 0.41 percentage points (95% CI: − 0.54 to 1.36). The RR for a distant recurrence was 1.12 (95% CI: 0.90 to 1.39; I2 = 0) (Fig. 3).

Fig. 3
figure 3

Meta-analysis of randomised controlled trials (RCTs) comparing withholding versus providing adjuvant chemotherapy regarding distant recurrence. Single asterisk indicates number of events obtained from the corresponding author. CI, confidence interval; M-H, Mantel-Haenszel

Five out of seven cohort studies reported the number of distant recurrences in the comparison groups [36, 39,40,41,42]. In four of these, numerically more patients who had been provided chemotherapy had such an event [36, 39, 41, 42]. The remaining two studies reported either a non-significant hazard ratio with a wide CI [33] or overlapping CIs between the 10-year risk of recurrence of the comparison groups [38]. In three out of the seven cohort studies, patients in the control group were younger and had more advanced cancer [33, 36, 41]. The remaining four studies did not present characteristics of the compared groups [38,39,40, 42].

In summary, withholding adjuvant chemotherapy in breast cancer patients with intermediate clinical risk of recurrence and low/intermediate risk based on a gene expression assay, compared with providing chemotherapy, can probably not exclude a small absolute increased risk of recurrence (moderate certainty of evidence, GRADE ⊕⊕⊕◯).

Ongoing studies

Out of 155 ongoing/completed unpublished studies identified in Clinical Trials, one study completed in 2009 fulfilled the PICO criteria comparing use of a gene expression assay versus no gene expression assay and using a retrospective cohort design (NCT00904566). Two studies, estimated to be completed in 2026 and 2031, respectively, may contribute information regarding the PICO comparing withholding versus providing chemotherapy using a prospective cohort design (NCT03904173, NCT03503799).

Discussion

Our review shows that evidence is largely lacking regarding patient effects of use of, versus no use of, a gene expression assay to inform chemotherapy decisions. Such evidence is useful for prioritisation at the overall health care level. However, there is probably little or no difference in medium-term overall survival when chemotherapy is withheld based on a gene expression assay. Nevertheless, it cannot be ruled out that withholding chemotherapy based on such a test implies an increased risk of recurrence, although the absolute risk is low and the absolute risk difference is small. Given this evidence base, some important knowledge gaps still exist with respect to the use of gene expression assays in breast cancer i.e. personalised medicine. These gaps need to be addressed to inform assessments of the benefit-risk balance.

For diagnostic tools sensitivity and specificity may be the primary issue. For molecular profiling in breast cancer, this has been the main research question [22]. However, as precision cancer medicine is emerging, and the upcoming diagnostic tests imply non-negligible costs, one may argue that scientific evaluations regarding patient effects should be designed and performed to provide a scientific basis for prioritisation.

For technologies to inform chemotherapy decisions, it may be particularly important to evaluate potential effects on HRQL. On the one hand, withholding chemotherapy may increase HRQL because of avoided adverse reactions. On the other, increased fear of recurrence may decrease HRQL [43]. Indeed, to introduce a technology which in itself is costly when available evidence is restricted to negative effects may be problematic. It may be argued that the introduction of gene expression assays will reduce the provision of chemotherapy, thereby reducing the costs to justify a potential worse patient outcome. However, chemotherapy decision making studies in the relevant patient group have reported both increased [44,45,46,47] and decreased [48,49,50,51] administration of chemotherapy when gene expression assay results are provided, and none of these studies had a randomised design. Also, from an ethical perspective, withholding an established treatment may be more problematic than introducing a new one. Therefore, this may call for a more solid evidence base. Conversely, new cancer drugs are sometimes approved based on limited evidence regarding patient-relevant effects [52], and not all meet the threshold for a clinically meaningful effect [53].

Diagnostic tests are used in therapy decision making at the patient level. Most breast cancer patients want to have an active or shared role in decision making regarding chemotherapy [54], as also illustrated in the largest RCT included in this review where the recruitment of patients had to be increased by 73% as 12% of the women chose not to adhere to the assigned treatment [32]. Given the results in the present review, it may be surprising that the guidelines update in 2017 [55], but not in 2019 [17], emphasised that node-positive patients should be informed of the potential benefits from chemotherapy. Indeed, an internationally used “objective” test result may have a large impact on chemotherapy decision making, which is illustrated by the fact that several studies have been performed in patients with intermediate clinical risk of recurrence, in which the chemotherapy decision was based solely on the results of the gene expression assay [26, 45, 56]. Conversely, our results suggest that many oncologists and patients take clinical parameters into account, also when the gene expression assay shows a low/intermediate risk of recurrence. In fact, several cohort studies in this review report that patients given chemotherapy, despite a low/intermediate risk of recurrence according to the gene expression assay, were younger and had more advanced disease [33, 36, 41]. As chemotherapy per se is not likely to increase the risk of recurrence, this finding may also explain that a greater number of distant recurrences occurred in those receiving chemotherapy in cohort studies [36, 41, 42].

In patients with intermediate clinical risk of recurrence and low/intermediate risk of recurrence according to a gene expression assay, the difference between withholding and providing chemotherapy was not statistically significant. However, the confidence interval was quite wide, including an up to 39% increased relative risk of a distant recurrence. To facilitate the process of informing the patient and to contribute to informed decision making, the absolute risk estimate provided in this review may be useful. The mean absolute risk increase of 0.41 percentage points regarding distant recurrence would yield in a number needed to treat (NNT) of 244. Furthermore, the upper confidence limit, of particular interest when investigating non-inferiority, was a 1.36 percentage point increase, yielding a minimum NNT of 74. Consequently, at the minimum, 74 breast cancer patients would have to endure adverse reactions from chemotherapy to avoid one distant recurrence.

As patients live many years after a breast cancer diagnosis, as illustrated by the fact that 94% was still alive after 9 years in the main RCT [32], it would take a long time to achieve mature data on long-term survival when a gene expression assay is used to guide treatment decisions. Indeed, the risk of distant recurrence and death from oestrogen dependent breast cancer persists over at least 20 years, also in low-risk patients [57]. Analyses of register data may contribute valuable information in the meantime, in particular as our evidence synthesis shows that an increased risk, although small in absolute numbers, of distant recurrence cannot be excluded if chemotherapy is withheld based on genetic testing. However, as current drug effectiveness and safety studies often have major methodological problems [58, 59], scientific rigour in the design and reporting will be crucial. For example, efforts have to be made to balance the comparison groups with respect to the severity of disease. Indeed, where data on characteristics of the comparison groups were available in the cohort studies in this review, patients administered chemotherapy had more advanced cancer. It is noteworthy that the one study with minor study limitations evaluating overall survival in matched comparisons reported better outcomes for those treated with chemotherapy [35].

Multivariable analysis may provide information on the association between various factors and patient outcome. Unfortunately, none of the studies in this review which performed such analyses included provision of chemotherapy in the analysis [34,35,36,37,38, 40, 41]. Although pharmacoepidemiological studies should ideally be specifically designed to evaluate drug effects [58, 59], inclusion of the provision of chemotherapy would be of interest. Importantly, causality cannot be claimed in such analyses; a cross-sectional design would be applied although seemingly mimicking a cohort design [58].

Strengths and limitations

The main strength of this systematic review and meta-analysis is that it gives an overview of the compiled current evidence on patient effects using gene expression assays in the subgroup of breast cancer patients where the clinical risk of recurrence does not suffice for clear-cut decisions. In addition, the findings are discussed in a wider context, which is of relevance for decision making at both the health care and the patient levels and for future research within personalised medicine. Indeed, precision cancer medicine is a rapidly growing field.

Limitations include that few studies fulfilled our PICO criteria, in particular regarding the comparison of patient effects of use versus no use of a gene expression assay. Furthermore, the CI for the RR in the meta-analysis, comparing withholding versus providing chemotherapy, was fairly wide, ranging from 10% decreased to 39% increased risk of distant recurrence. Translated to absolute numbers, the risk variation was small, from 0.5% decreased risk to 1.4% increased risk. Nevertheless, the as-treated analysis in the largest RCT showed superiority for the primary composite outcome (invasive-disease recurrence, second primary cancer or death) for the randomisation group allocated to chemotherapy, according to the predetermined statistical non-inferiority definitions [32], supporting the conclusion that an increase in recurrence cannot be excluded in those not allocated to chemotherapy.

Conclusion

In summary, this systematic review and meta-analysis illustrates that the evidence base for decision making at the overall health care level regarding the use of a gene expression assay to guide chemotherapy decisions in breast cancer with intermediate risk of recurrence is still limited. For decision making at the patient level, on the other hand, evidence is more solid; withholding chemotherapy based on the results of such a genetic tumour test probably yields similar chances of medium-term survival, but an increased risk of recurrence, though small in absolute numbers, cannot be excluded. As breast cancer research may be considered fairly advanced within the field of personalised medicine, our results may encourage an increased focus in precision cancer medicine to contribute evidence essential for horizontal prioritisation i.e. a scientific basis for assessments of the overall benefit-risk balance.