Prognostic ability of EndoPredict compared to research-based versions of the PAM50 risk of recurrence (ROR) scores in node-positive, estrogen receptor-positive, and HER2-negative breast cancer. A GEICAM/9906 sub-study
- First Online:
- Cite this article as:
- Martin, M., Brase, J.C., Ruiz, A. et al. Breast Cancer Res Treat (2016) 156: 81. doi:10.1007/s10549-016-3725-z
- 1.3k Downloads
There are several prognostic multigene-based tests for managing breast cancer (BC), but limited data comparing them in the same cohort. We compared the prognostic performance of the EndoPredict (EP) test (standardized for pathology laboratory) with the research-based PAM50 non-standardized qRT-PCR assay in node-positive estrogen receptor-positive (ER+) and HER2-negative (HER2−) BC patients receiving adjuvant chemotherapy followed by endocrine therapy (ET) in the GEICAM/9906 trial. EP and PAM50 risk of recurrence (ROR) scores [based on subtype (ROR-S) and on subtype and proliferation (ROR-P)] were compared in 536 ER+/HER2− patients. Scores combined with clinical information were evaluated: ROR-T (ROR-S, tumor size), ROR-PT (ROR-P, tumor size), and EPclin (EP, tumor size, nodal status). Patients were assigned to risk-categories according to prespecified cutoffs. Distant metastasis-free survival (MFS) was analyzed by Kaplan–Meier. ROR-S, ROR-P, and EP scores identified a low-risk group with a relative better outcome (10-year MFS: ROR-S 87 %; ROR-P 89 %; EP 93 %). There was no significant difference between tests. Predictors including clinical information showed superior prognostic performance compared to molecular scores alone (10-year MFS, low-risk group: ROR-T 88 %; ROR-PT 92 %; EPclin 100 %). The EPclin-based risk stratification achieved a significantly improved prediction of MFS compared to ROR-T, but not ROR-PT. All signatures added prognostic information to common clinical parameters. EPclin provided independent prognostic information beyond ROR-T and ROR-PT. ROR and EP can reliably predict risk of distant metastasis in node-positive ER+/HER2− BC patients treated with chemotherapy and ET. Addition of clinical parameters into risk scores improves their prognostic ability.
5-Fluorouracil, epirubicin, and cyclophosphamide
FEC followed by paclitaxel
Formalin-fixed paraffin embedded
Risk of recurrence
ROR score based on subtype information
ROR based on subtype and proliferation
ROR based on subtype and tumor size
ROR based on subtype, proliferation, and tumor size
Gene expression tests are new tools to clinically determine the risk of relapse in early-stage breast cancer (BC) [1, 2]. The 21-gene recurrence score (Oncotype DX) and Mammaprint (NKI-70) [3–10] have been shown to impact treatment decisions [11–13]. Novel prognostic tests, such as EndoPredict and PAM50, are also able to predict early, as well as late metastases [14, 15].
EndoPredict is a standardized test for the molecular pathology laboratory and was the first multigene test used in a decentralized setting [16, 17]. It was established and validated in two independent clinical validation studies (ABCSG6 and ABCSG8) involving patients with ER+/HER2− BC treated with adjuvant endocrine therapy (ET) only . EndoPredict provides prognostic information beyond all common clinicopathological parameters  and clinical guidelines . The molecular information (EP score) is further combined with tumor size and nodal status resulting in the EPclin score.
The PAM50 assay is an optimized gene set used to identify intrinsic subtypes and predict the Risk Of Recurrence (ROR) at 10 years [20, 21]. The ROR score was developed in a microarray-based cohort of node-negative, untreated BC patients [6, 20]. Four versions of ROR exist in the research setting: ROR based on subtype information (ROR-S), ROR-S with proliferation (ROR-P), ROR-S with tumor size (ROR-T), and ROR-P with tumor size (ROR-PT) [20, 21]. The minimum ROR score of all Luminal B scores was assigned as the low-risk threshold for each model and the maximum ROR score of all Luminal A scores as the high-risk threshold .
The inclusion of established clinicopathological risk factors in ROR and EP scores, such as tumor size (ROR-T, ROR-PT, and EPclin) and nodal status (EPclin), increases their predictive performance [18–21]. EndoPredict and research-based PAM50 were evaluated independently in the GEICAM/9906 trial [22–25]. We compared the prognostic performance of the EP test with the research-based, non-standardized PAM50 assay in node-positive, ER+/HER2− BC patients treated with adjuvant chemotherapy followed by ET.
Patients and methods
Patients and tumor samples
Patients in this study participated in the GEICAM/9906 trial, a randomized phase III trial that compared adjuvant chemotherapy regimen of 5-fluorouracil, epirubicin, and cyclophosphamide (FEC) with FEC followed by weekly paclitaxel (FEC-P), and then followed by 5-year hormonal therapy (tamoxifen, aromatase inhibitors or both) in 1246 women with lymph node-positive disease . This trial was performed in accordance with the Declaration of Helsinki, approved by the ethics committees at all participating institutions (see Supplementary Table 1S) and the Spanish Health Authority, and registered at www.clinicaltrials.gov (NCT00129922). Patients provided their written informed consent for therapy randomization and molecular analyses. Patients whose tumors were ER+/HER2− according to a central review by qRT-PCR and consented to genomic analysis were eligible.
Formalin-fixed paraffin-embedded (FFPE) tumor blocks used to compare PAM50 and EP scores were collected at the time of surgery.
EndoPredict gene expression analysis
RNA extraction and gene expression analysis for identifying the ER+/HER2− subgroup and performing the EndoPredict have been recently described. Briefly, total RNA was extracted from one 5-µm whole FFPE tissue section using a silica bead-based, fully automated isolation method (Tissue Preparation System, VERSANT Tissue Preparation Reagents, Siemens Healthcare Diagnostics) .
To identify patients with ER+/HER2− tumors, ESR1 and ERBB2 gene expression levels were analyzed and predefined cut-off levels were applied as recently described. The EP score is based on eight cancer-related genes (BIRC5, UBE2C, DHCR7, RBBP8, IL6ST, AZGP1, MGP, STC2) and three reference genes (CALM2, OAZ1, RPL37A), and measured by qRT-PCR .
PAM50-ROR gene expression analysis
RNA was extracted from two 1-mm FFPE cores as previously described . To determine the research-based versions of the ROR scores and groups, normalized gene expression data obtained from the qRT-PCR platform were gene-median-centered, and the microarray-based PAM50 intrinsic subtype predictor was applied as previously described . Of note, the microarray-based training dataset, from which survival coefficients were derived, is based on patients with node-negative disease that did not receive adjuvant systemic therapy . In addition, ROR thresholds (low and high) consist of subtype distributions along the ROR scores in the training dataset .
The ROR and EP scores were calculated blinded to clinical data and sent to the GEICAM study group in Madrid for independent statistical analysis. Only the GEICAM group had access to the combined clinical outcome and gene expression data.
The primary endpoint was distant metastasis-free survival (MFS) estimated using the Kaplan–Meier method. Pearson correlations compared gene signatures. Two-sided log-rank tests were used to compare subgroups. P values <5 % were considered statistically significant. Each gene signature was added to a model containing common clinical parameters and one other evaluated signature. C-indices were then calculated for the clinical variables and model combinations to estimate the performance of each variable for predicting distant metastasis. Differences were evaluated using the log-likelihood ratio statistic test (for proportional Cox model hazard rates) and the comparison of c-index with resampling (both one-sided tests).
The PAM50 ROR-S, ROR-P, and ROR-T and ROR-PT scores classified patients as low-, intermediate-, and high-risk, using the following pre-defined cut-off values, respectively: ROR-S (<24; 24–53; >53), ROR-P (<12; 12–53; >53), ROR-T (<29; 29–65; >65), PAM50 ROR-PT (<18; 18–65; >65). Both, EP and EPclin categorized patients into low- (EP score <5; EPclin score <3.3) and high-risk groups (EP score ≥5; EPclin score ≥3.3) . The following clinical parameters were used for the analysis: positive nodal status (1–3; 4–10; >10); tumor size [cm (≤1; >1–≤2; >2–≤5; >5)]; Grade (1; 2; 3); age; and treatment arm (FEC; FEC-P).
A total of 566 (71 %) of 800 available tumor samples were eligible for evaluation. Tumor samples lacking PAM50 or EndoPredict data were excluded (5 %). Characteristics of the patient cohort included in this study are summarized in Supplementary Table 2S.
Comparing EP versus PAM50 gene signatures
Prognostic performance of predictors
Additional prognostic information—GEICAM/9906
ROR-S adds Information to EndoPredict
ROR-P adds information to EndoPredict
EndoPredict adds information to ROR-S
EndoPredict adds information to ROR-P
P = 0.1455
P = 0.1292
P = 0.0557
P = 0.0194
(P = 0.08844)a
(P = 0.09829)a
(P = 0.05750)a
(P = 0.03162)a
P = 0.4786
P = 0.5669
P < 0.001
P < 0.001
P = 0.406
P = 0.3181
P < 0.001
P < 0.001
We compared the prognostic performance of research-based and non-standardized versions of PAM50-ROR scores and EndoPredict in ER+/HER2−, node-positive chemotherapy-treated BC patients from the GEICAM/9906 trial.
ROR-S and ROR-P were significantly correlated with EP score, and gene signatures showed agreement in risk classification, indicating that PAM50-ROR and EP scores identify tumors with similar properties. Despite the significant correlation, the discordance rate of 20–21 % can be explained by the tests’ inherent characteristics. Our c-index analysis indicated that only EP added significant information to ROR-P. None of the other molecular signatures added information to each other, suggesting that additional predictors would not improve prognostic performance. These findings are concordant with our previous combined analysis of hundreds of signatures and clinical-pathological data for prognostic prediction in ER-positive breast cancer where we observed that not much more prognostic power was obtained by including hundreds of signatures into a single model beyond the power contained within a well-developed individual signature when combined with clinical variables .
The PAM50-based ROR-T and ROR-PT scores include tumor size, whereas the EPclin score considers nodal status and tumor size, as part of the risk prediction algorithm. Similar to the research-based version, a ROR-PT score weighted for tumor size and proliferation was used to validate the standardized version of PAM50 assay in the ATAC and ABCSG8 trials. In our analysis, all hybrid scores contributed to identifying low-risk groups for distant metastasis, although number of patients and events differed across score categories. The EPclin low-risk group was smaller than the ROR-T and ROR-PT ones and showed no distant-metastatic events. EPclin had been established in a node-positive/node-negative cohort and the predefined cut-off level consequently classified more patients as high-risk in the node-positive GEICAM/9906 trial. In contrast, the research-based versions of ROR-T and ROR-PT scores were derived in a systemically untreated node-negative BC cohort, and thresholds were based on subtype distribution and not actual survival outcomes; therefore, the number of low-risk cases with distant-metastatic events was higher, as reflected by an MFS of 87–92 % in low-risk groups.
Kaplan–Meier analysis of discordant cases, c-index analysis, and log-likelihood tests showed that the EPclin-based risk classification provided independent prognostic information to the ROR-T and ROR-PT scores. The improved performance of the EPclin score over pure molecular scores may be partially explained by the inclusion of nodal status, one of the strongest single prognostic factors, in the EPclin score, but which is not included in any of the other models tested. EndoPredict validation studies demonstrated that molecular EP score, tumor size, and nodal status were the only independent prognostic parameters . Hybrid scores’ superior performance compared to their molecular counterparts supports the recommendation of the Evaluation of Genomic Application in Practice and Prevention working group to integrate clinicopathological factors into gene expression tests  rather than relying on pure RNA-based molecular scores.
To the best of our knowledge, our study reports the first direct comparison of EndoPredict and a research-based version of the PAM50 assay. Earlier comparisons of multigene signatures suggested similar prognostic performances [30, 31]. Recently, the transATAC study, the first large phase III study comparing different standardized gene expression-based biomarkers in the same patient cohort , compared the standardized and clinically validated version of PAM50 assay, developed under the nCounter system (Nanostring Technologies), with the 21-gene recurrence score. PAM50-ROR provided more prognostic information than the recurrence score . Although our study did not evaluate the standardized and clinically validated PAM50-ROR score, the GEICAM/9906 trial is an additional valuable source for biomarker comparisons. In the context of this trial, we could identify high-risk patients who need additional treatment to the standard anthracycline-based chemotherapy (±taxane) and could be eligible for further treatment with novel drugs, such as CDK4/6 or mTOR inhibitors.
Our results should be interpreted in the context of its limitations. First, ROR scores were generated using research-based and non-standardized versions from qRT-PCR platform. Although research-based PAM50 classification has been evaluated in several clinical trials using qRT-PCR [22, 33, 34], different methods may influence prognostic ability. Of note, large validation studies (ATAC and ABCSG8) for the PAM50 assay were performed using the standardized version with pre-specified cutoffs based on actual survival outcomes (<10, 10–20, and >20 % risk of distant relapse at 10 years) and not subtype distribution . Second, the PAM50 vs. EP comparison was not conducted according to their intended use. Whereas our patients were treated with chemotherapy, these predictors were clinically validated using patients cohorts receiving endocrine therapy alone. Therefore, next steps should compare PAM50 and EP in clinical trials with ER+/HER2− BC patients treated with ET alone. EndoPredict and the standardized PAM50 were recently evaluated in the ABCSG8 trial, which would allow a direct comparison of both clinical predictors .
Despite the differences in establishment and the limited overlap in genes, all molecular predictors evaluated showed similar prognostic performance. The addition of clinical parameters, such as tumor size and nodal status, into risk-score determination improves the prognostic ability of these assays.
MM, CP, PSB, KEW, RK, JCB, CMP, and AP participated in the conception and design of the study. MM, AR, LC, MRB, CAR, IMA, MAS, ARL, PBS, and RC contributed to the collection and assembly of data. MM, JCB, RK, RC, and MC participated in the analysis of the data. MM, JCB, RK, CP, PSB, KEW, CMP, AP, EC, and RC participated in the interpretation of the data. MM, JCB, CP, RK, CMP, AP, EC, and RC helped in the drafting of the manuscript. All authors contributed to the writing of the manuscript, read, and approved the final manuscript.
Compliance with ethical standards
Conflict of interest
CP and RK are employees and shareholders of Sividon Diagnostics GmbH as well as inventors of EndoPredict patent applications. JCB and KEW were employees of Sividon Diagnostics GmbH at the time point of this study and both are inventors on the EndoPredict patent application. KEW is a shareholder of Sividon Diagnostics GmbH. CMP is a stockholder in Bioclassifier LLC, and an inventor on the PAM50 patent application. AP has an uncompensated advisory role for Nanostring Technologies. MM, EC, and RC are inventors on the PAM50 patent application. All remaining authors declare that they have no competing interest.
All patients provided written informed consent to participate in the GEICAM/9906 trial and to perform molecular analysis on their breast tumors.
Research involving Human Participants
All procedures performed in studies involving human participants were in accordance with the applicable legislation and the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.