Abstract
Current predictors for estrogen receptor-positive (ER-positive) breast cancer patients receiving tamoxifen are often invalid in inter-laboratory validation. We aim to develop a robust predictor based on the relative ordering of expression measurement (ROE) in gene pairs. Using a large integrated dataset of 420 normal controls and 1,129 ER-positive breast tumor samples, we identified the gene pairs with stable ROEs in normal control and significantly reversed ROEs in ER-positive tumor. Using these gene pairs, we characterized each sample of a cohort of 292 ER-positive patients who received tamoxifen monotherapy for 5 years and then identified relapse risk-associated gene pairs. We extracted a gene pair subset that resulted in the largest positive and negative predictive values for predicting 10-year relapse-free survival (RFS) using a genetic algorithm. A predictor was developed based on the gene pair subset and was validated in 2 large multi-laboratory cohorts (N = 250 and 248, respectively) of ER-positive patients who received 5-year tamoxifen alone. In the first validation cohort, the patients predicted to be tamoxifen sensitive had a 10-year RFS of 91 % (95 % confidence interval [CI] 85–97 %) with an absolute risk reduction of 34 % (95 % CI 17–51 %). The patients predicted to be tamoxifen insensitive had a significantly higher relapse risk than the patients predicted to be tamoxifen sensitive (hazard ratio = 4.99, 95 % CI 2.45–10.17, P = 9.13 × 10−7). Similar performance was achieved for the second validation cohort. The predictor performed well in both node-negative and node-positive subsets and added significant predictive power to the clinical parameters. In contrast, 2 previously proposed predictors did not achieve significantly better performances than the baselines of the validation cohorts. In summary, the proposed predictor can accurately and robustly predict tamoxifen sensitivity of ER-positive breast cancer patients and identified patients with a high probability of 10-year RFS following tamoxifen monotherapy.
Similar content being viewed by others
Introduction
Approximately 70 % of breast cancer patients are estrogen receptor-positive (ER-positive), for whom adjuvant tamoxifen monotherapy is the most common treatment option. Large-scale randomized trials have shown that administering tamoxifen for approximately 5 years can significantly reduce the relapse risk and mortality for ER-positive patients throughout the first 10 years of a > 15-year follow-up [1, 2]. Unfortunately, the usefulness of tamoxifen is limited by endocrine resistance, which causes cancer relapse and patient death. Therefore, it is vitally important to develop predictors that can accurately predict outcomes for patients receiving tamoxifen. Although several predictors based on gene expression have been built for this purpose [3–7], their predictive powers are often markedly decreased in independent cohorts from different laboratories [8–11] and were usually invalid in node-positive patients [7, 12]. For example, the 21-gene recurrence score (RS) proposed by Paik et al. [3] was not significantly associated with recurrence risk when adjusted for clinicopathologic parameters in a validation cohort [13], and it might provide little information beyond these parameters [14–16].
Most of the previously proposed predictors make decisions based on the cut-offs of transformed absolute expression measurements which are sensitive to the systematic inter-laboratory biases of microarray experiments, including batch effects, that are introduced by experimental conditions, reagent dosage, and personnel differences [17]. Although the data normalization and batch effect correction algorithms are supposed to eliminate such biases, they distort the real expression signals by smoothing down biological differences and scaling all arrays from different experiments to the same distribution under the unreliable assumption that the absolute amount of total mRNA in each sample is similar across different cell types or conditions [18–21]. In contrast, the relative ordering of expression measurement (ROE) in gene pairs within a sample is invariant to monotonic data transformation (normalization) and rather resistant to batch effects [22], which are promising features for building laboratorially robust predictors, e.g., the predictors of top scoring pair (TSP) and k-TSP [4, 22, 23]. However, as a few TSPs are typically composed of mutually correlated gene pairs, they could miss many patients without the top features. Given the heterogeneity of ER-positive breast cancer [24–26] and the multiple mechanisms involved in tamoxifen resistance [11, 27–31], it would be necessary to develop a predictor with combined gene pairs that are each associated with a subset of resistant patients.
As genes are widely dysregulated in ER-positive tumors [32], we assumed that there would be many gene pairs with reversed ROEs in a significant portion of ER-positive tumor samples compared with normal controls. We could use these gene pairs as tumor-associated features to characterize tumor samples at the individual level and then further identify the patient subgroups with different tamoxifen responses. Based on the above-mentioned assumptions, using an integrated dataset consisting of 420 normal controls and 1,129 ER-positive tumor samples without treatment and outcome information, we identified the ER-positive tumor-associated gene pairs with stable ROEs in normal controls and significantly reversed ROEs in ER-positive tumor samples. Using these gene pairs, we characterized each sample from a multi-laboratory cohort of 292 ER-positive patients receiving tamoxifen and identified the gene pairs (features) whose reversal was significantly associated with the relapse risk of these patients. We extracted a feature subset to develop a predictor and validated the predictor in 2 independent multi-laboratory cohorts of 250 and 248 tamoxifen-treated patients and in a cohort of 297 chemo-endocrine-treated patients. Finally, we compared the proposed predictor with 2 other predictors using inter-laboratory data.
Materials and methods
Data collection
An integrated dataset, including 420 normal controls and 1,129 ER-positive breast cancer samples documented in the GEO [33], ArrayExpress [34], and TCGA [26] repositories (Table S1), was used to identify ER-positive tumor-associated gene pairs.
The characteristics of the patient and sample cohorts used for predictor development and validation are summarized in Table 1 and Table S2, respectively. Notably, these sample cohorts were profiled at various laboratories. The training cohort consisted of 292 ER-positive patients from 3 centers who received tamoxifen monotherapy for 5 years. This cohort was used to assess the association of the ROE reversals of gene pairs with relapse risk and to develop an ROE-based endocrine predictor (RE predictor) to predict the outcomes of tamoxifen-treated ER-positive patients. The first and second validation cohorts consisted of 250 and 248 ER-positive patients, respectively, who received tamoxifen (N = 470) or letrozole (N = 28) alone for 5 years. These 2 cohorts were studied to assess and compare the performance of the RE predictor, sensitivity to endocrine therapy (SET) index [7] and RS [3].
Two untreated cohorts with 209 and 253 ER-positive patients who did not receive any systemic therapy were studied to compare with the tamoxifen-treated patients to assess the benefit from adjuvant tamoxifen for patients who were predicted to be tamoxifen insensitive or sensitive.
A chemo-endocrine cohort was studied to test the predictive power of the RE predictor for endocrine therapy after chemotherapy. This cohort consisted of 297 ER-positive breast cancer patients who received neoadjuvant chemotherapy and subsequent adjuvant endocrine therapy with tamoxifen or aromatase inhibitors (AIs) or both in sequence [35].
Feature selection and predictor construction
Figure 1 describes the processes for developing and validating the RE predictor. In the integrated dataset, all gene pairs were evaluated to extract the ER-positive tumor-associated gene pairs that had stable ROEs in more than 99 % of normal controls and the reversed ROEs in more than 10 % of ER-positive tumor samples, which corresponded to a false discovery rate (FDR) <0.01 %. Gene pairs reversed in >50 % of tumor samples was removed as tamoxifen insensitive occurs in <50 % of ER-positive patients [31, 36, 37]. Then, using the extracted n gene pairs, we characterized each of the samples in the training cohort as a binary vector (x 1, x 2, …, x n ) in which x i (i = 1,2, …, n) was assigned 0 if the ROE of the ith pair of genes was R i1 ≥ R i2 , and 1 otherwise (reversed). The univariate Cox regression model was used to identify the gene pairs (features) whose reversal was significantly associated with the relapse risk of the patients. The final feature subset was obtained from these features through a wrapper method based on a simple classification rule: a patient was predicted to be tamoxifen insensitive for a feature subset if at least 2 gene pairs were reversed in the subset, which was determined by the binominal test, as discribed in detail in the Supplemental Methods. Here, we used a genetic algorithm [38] to search the final feature subset that resulted in the largest product of the positive predictive value (PPV) and negative predictive value (NPV) as defined in the Statistical Analysis section. The genetic algorithm was implemented with a population size of 10,000 and a crossover fraction of 0.9; it was terminated if the optimization objective of the best subset was not improved in 100 generations. The details of the genetic algorithm are shown in the Supplemental Methods. An RE Predictor was developed based on the classification rule with the optimized feature subset.
Statistical analysis
Log-rank tests were used to assess the differences between the Kaplan–Meier estimates of the relapse-free survival (RFS) in the predicted groups. Multivariate analyses with the Cox proportional hazards regression model were performed to calculate the hazard ratios (HRs) and their 95 % confidence intervals (CIs). The additional prognostic value of the RE predictor was assessed by comparing with the full clinical model using the likelihood ratio test.
Since the HRs and P values of the Cox model do not directly test the predictive power of a predictor [39], the performance was assessed by the PPV, defined as the probability of relapse within 10 years for patients predicted to be tamoxifen insensitive; the NPV, defined as the probability of 10-year RFS for patients predicted to be tamoxifen sensitive; and the absolute risk reduction (ARR), defined as the absolute difference in RFS between the 2 predicted groups [35]. These 3 measures were estimated based on the Kaplan–Meier estimates of the survival function of cumulative events [40]. The CIs for NPV and PPV were calculated based on the Greenwood variance estimate and for ARR were estimated by bootstrapping with 10,000 iterations [41]. All statistical computations were performed in R version 2.15.2.
Results
Development of the RE predictor
First, from the integrated dataset, we identified 264,933 ER-positive tumor-associated gene pairs with a FDR < 0.01 %. From these gene pairs, 562 gene pairs associated with relapse risk were identified using the Cox model with a 1 % FDR. Next, 15 gene pairs (Table S3) were finally extracted from the 562 gene pairs using a genetic algorithm [38] based on the classification rule. Finally, the RE predictor was developed based on the ROE of the 15 gene pairs and the classification rule.
Performance of the RE predictor in Independent validation cohorts
The RE predictor was evaluated in the 2 validation cohorts of 250 and 248 patients. In the first cohort, for the patients predicted to be tamoxifen sensitive, the 10-year RFS (NPV) was 91 % (95 % CI 85–97 %) and there was an ARR of 34 % (95 % CI 17–51 %) (Table 2). By comparison, the 10-year point estimate of RFS for the patients predicted to be tamoxifen insensitive was 57 % (95 % CI 47–68 %), corresponding to a PPV of 43 % (32–53 %) and an HR of 4.99 (95 % CI 2.45–10.17, P = 9.13 × 10−7) (Fig. 2a). Similar performance was achieved for the second validation cohort (Table 2; Fig. 2b). Overall, the patients predicted to be tamoxifen sensitive had a significantly higher probability of RFS than the patients predicted to be tamoxifen insensitive.
Kaplan–Meier estimates of relapse-free survival for the patients who received endocrine therapy only in a the first validation cohort, and b the second validation cohort, and presented separately for the subsets with c node-negative and d node-positive patients. The tamoxifen sensitive and insensitive groups were predicted by the RE predictor. Eleven patients were excluded from the analysis of nodal status-defined subsets because of missing data. The hazard ratio (HR) is a measure for the risk of relapse; P values are calculated by the log-rank tests
Considering that many patients (53 %) in the 2 cohorts had partially missing clinicopathologic parameters, we performed multivariate analyses by combining the patients of both cohorts to increase the statistical power. In a multivariate Cox model, the RE prediction was significantly associated with the relapse risk (HR = 5.26, 95 % CI 2.76–10.03, P = 4.52 × 10−7) after adjusting for standard clinicopathologic parameters, including nodal status, age, tumor size and grade (Table 3). Addition of the RE prediction to a multivariate Cox model of the clinicopathologic parameters significantly increased the predictive utility of the model (likelihood ratio of the complete model vs the clinical model, 29.4; P = 5.92 × 10−8). In the complete model, node-positive status was also independently associated with significantly greater risk of relapse (HR = 2.02, 95 % CI 1.22–3.34, P = 0.006; Table 3).
Notably, the RE predictor appears to perform well in both node-negative and node-positive patients. The patients predicted to be tamoxifen insensitive had a significant higher risk of relapse compared with the patients predicted to be tamoxifen sensitive in both node-negative (HR = 3.57, 95 % CI 1.79–7.12, P = 0.0001; Fig. 2c) and node-positive (HR = 4.67, 95 % CI 2.62–8.33, P = 9.36 × 10−9; Fig. 2d) patient subsets. For the patients predicted to be tamoxifen sensitive, the 10-year point estimate of RFS (NPV) was 93 % (95 % CI 88–98 %) in node-negative patients and 81 % (95 % CI 72–91 %) in node-positive patients. Moreover, the PPV was 30 % (95 % CI 21–38 %) and 63 % (95 % CI 51–71 %) for the node-negative and node-positive patients, respectively, corresponding to an ARR of 23 % (95 % CI 13–33 %) and 44 % (95 % CI 30–57 %), respectively.
Note that the tamoxifen-treated patients who were predicted to be tamoxifen sensitive (but not the patients predicted to be tamoxifen insensitive) had a significant improvement of RFS compared with their untreated counterparts (Fig. 3). For the patients predicted to be tamoxifen sensitive, the 10-year point estimate of RFS was 70 % (95 % CI 62–80 %) and 69 % (95 % CI 61–77 %) for the 2 untreated cohorts, while the RFS of the first validation cohort (91 %) was significantly improved compared with the patients from both untreated cohorts (HR = 0.26, 95 % CI 0.12–0.55, P = 0.0002, Fig. 3a; and HR = 0.34, 95 % CI 0.17–0.70, P = 0.002, Fig. 3b). A similar improvement was observed in the second validation cohort (Fig. 3c, d). Conversely, for the patients predicted to be tamoxifen insensitive, there was no significant RFS improvement observed for the 2 tamoxifen-treated cohorts compared with the 2 untreated cohorts (all P > 0.1, Figure S1).
Kaplan–Meier estimates of relapse-free survival for the patients predicted to be tamoxifen sensitive in a, b the first validation cohort and c, d the second validation cohort treated with endocrine therapy (Tam-treated) versus the 2 untreated cohorts (Untreated). The hazard ratio (HR) is a measure for the risk of relapse; P values are calculated by the log-rank tests
Testing of the RE predictor in the chemo-endocrine cohort
The RE predictor was also applied to the chemo-endocrine cohort of 297 patients (Figure S2). The 2 predicted patient groups showed a significantly different relapse risk after neoadjuvant chemotherapy and subsequent adjuvant endocrine therapy (HR = 2.79, 95 % CI 1.40–5.55, P = 0.002). The result suggested that the RE predictor test could still predict the outcomes of endocrine therapy-treated ER-positive patients, despite receiving neoadjuvant chemotherapy. However, the PPV and NPV of this cohort could not be assessed because there was only a 3-year median follow-up [35].
Comparison of the RE predictor with other predictors
We also evaluated the performance and robustness of 2 absolute expression value-based predictors, SET index and RS, which were previously developed to predict the outcomes of patients receiving tamoxifen. Although the SET index performed well in the previous study [7], RFS of the high-SET category of the first validation cohort was not significantly improved, compared with the intermediate or low-SET categories (HR = 0.69, 95 % CI 0.45–1.05, P = 0.08; Figure S3). Additionally, the PPV of the SET index (32 %, 95 % CI 24–39 %) (Table 2) was not significantly larger than the baseline of 29 % (the lower 95 % confidence limit of the PPV was smaller than the baseline). For the RS, the NPV and PPV were 77 % (95 % CI 69–85 %) and 42 % (95 % CI 28–53 %) in the first validation cohort (Table 2), respectively; both of which were not significantly larger than the baselines. Similar NPV and PPV were achieved for the RS in the second validation cohort. In contrast, the RE predictor showed significantly larger NPV and PPV than the baselines in both validation cohorts.
In hierarchical clustering analyses of all genes in the expression profiles using Euclidean distances with ward linkage, the samples were separated according to laboratories (Figure S4), which indicated that there were extensive inter-laboratory biases between the microarrays profiled at different laboratories. As the biases could systematically influence SET and RS scores, thresholds optimized in a training cohort would be unsuitable for validation cohorts from other laboratories (Table S4). For example, when evaluated in technical replicates that were profiled at 2 laboratories, 6, 2, and 1 of the 16 patients were classified into discordant groups for the SET index, RS, and RE predictor, respectively (Table S5).
The above results suggested that scores of the SET index and RS tend to be greatly affected by systematic biases of expression measurements between laboratories, whereas the RE predictor is rather robust against inter-laboratory biases.
Discussion
This study develops the RE predictor and validates its predictive performance in independent multi-laboratory cohorts. The results indicated that the RE predictor could accurately predict the RFS outcome of ER-positive patients receiving tamoxifen and of ER-positive patients receiving neoadjuvant chemotherapy and endocrine therapy. The RE predictor was also observed to be more robust than the absolute expression value-based predictors for samples profiled in different laboratories, as the ROE was barely affected by the inter-laboratory biases. Thus, the RE predictor could be applied to patients from different centers.
In principle, the predictive effect of the RE predictor should be confirmed in a properly designed prospective randomized trial, which would enable testing for the interaction between the RE predictor and the tamoxifen. However, this type of prospective study is unjustifiable because the use of an untreated cohort is unethical after the approval of tamoxifen [1, 2]. Thus, we compared the patients receiving tamoxifen with the untreated patients from different retrospective studies. The results showed that 5-year tamoxifen treatment significantly reduced relapse risk compared with no systemic treatment for the patients predicted to be tamoxifen sensitive, but not for the patients predicted to be tamoxifen insensitive. We recognize that the distributions of age and nodal status were different for the treated and untreated cohorts and that this difference might result in a different relapse rate from the natural disease history (unrelated to tamoxifen). However, the results suggested that the patients predicted to be tamoxifen sensitive would benefit more from tamoxifen than the patients predicted to be tamoxifen insensitive.
In the multivariate Cox analysis, the RE prediction and nodal status were 2 independent prognostic factors. Note that the node-positive patients predicted to be tamoxifen sensitive had a 19 % relapse risk from endocrine therapy alone. This is a common problem of all predictors for ER-positive patients receiving endocrine therapy as the poor prognosis of node-positive patients. Two recent studies have reported that approximately 20 % of the node-positive, low-risk patients identified by predictors would develop a distance relapse [7, 12]. Furthermore, these predictors perform well in node-negative patients but showed either no statistical significance or were barely significant in node-positive patients when comparing predicted low-risk with predicted high-risk groups [7, 12]. For the RE predictor, the node-positive patients predicted to be tamoxifen sensitive and insensitive had significantly different RFS rates, with a large ARR of 43 % for the patients predicted to be tamoxifen sensitive compared with the patients predicted to be tamoxifen insensitive. This result demonstrates the effectiveness of the RE predictor in outcome prediction of node-positive, ER-positive patients treated with tamoxifen. Nevertheless, RFS rate of predicted low-risk group could most likely be further improved if predictor is developed only in node-positive cohorts.
Our analysis showed that the scores of the SET index and RS tended to be greatly affected by the systematic biases of expression measurements between laboratories. Although the RS scores have not been measured by the reverse transcriptase polymerase chain reaction (RT-PCR) assay [3], the scores based on RT-PCR would also be influenced, as systematic biases also affects the RT-PCR measurements [17].
As the RE predictor could accurately predict the outcomes of tamoxifen-treated ER-positive patients and was robust across different laboratories, it could be used to assist in clinical decision-making combined with nodal status. For the node-negative patients predicted to be tamoxifen sensitive, a 93 % 10-year RFS rate justifies the selection of tamoxifen monotherapy. In sharp contrast, the probability of relapse within 10-year was 63 % for the node-positive patients predicted to be tamoxifen insensitive. Although there is no evidence suggesting that the allocation of tamoxifen is useless for these patients (no randomized control), this probability might be sufficiently high for consideration of alternative adjuvant therapies (e.g., AIs and aggressive chemotherapies) or clinical trials of investigational treatments rather than tamoxifen alone. For the node-positive patients predicted to be tamoxifen sensitive and node-negative patients predicted to be tamoxifen insensitive, the probability of RFS was 81 and 70 %, respectively. Thus, sequential chemo-endocrine therapy or prolonging tamoxifen to 10 years [42, 43] or AIs [44–47] could be considered for these patients to further improve the cure likelihood. The RE predictor could be combined with nodal status and prediction test for targeted therapies and chemotherapies [35] to provide predictive tests for personalized adjuvant treatment regimens for breast cancer patients.
Similar content being viewed by others
References
Hackshaw A, Roughton M, Forsyth S, Monson K, Reczko K, Sainsbury R, Baum M (2011) Long-term benefits of 5 years of tamoxifen: 10-year follow-up of a large randomized trial in women at least 50 years of age with early breast cancer. J Clin Oncol 29(13):1657–1663. doi:10.1200/JCO.2010.32.2933
Davies C, Godwin J, Gray R, Clarke M, Cutter D, Darby S, McGale P, Pan HC, Taylor C, Wang YC, Dowsett M, Ingle J, Peto R (2011) Relevance of breast cancer hormone receptors and other factors to the efficacy of adjuvant tamoxifen: patient-level meta-analysis of randomised trials. Lancet 378(9793):771–784. doi:10.1016/S0140-6736(11)60993-8
Paik S, Shak S, Tang G, Kim C, Baker J, Cronin M, Baehner FL, Walker MG, Watson D, Park T, Hiller W, Fisher ER, Wickerham DL, Bryant J, Wolmark N (2004) A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med 351(27):2817–2826. doi:10.1056/NEJMoa041588
Ma XJ, Wang Z, Ryan PD, Isakoff SJ, Barmettler A, Fuller A, Muir B, Mohapatra G, Salunga R, Tuggle JT, Tran Y, Tran D, Tassin A, Amon P, Wang W, Enright E, Stecker K, Estepa-Sabal E, Smith B, Younger J, Balis U, Michaelson J, Bhan A, Habin K, Baer TM, Brugge J, Haber DA, Erlander MG, Sgroi DC (2004) A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen. Cancer Cell 5(6):607–616. doi:10.1016/j.ccr.2004.05.015
Oh DS, Troester MA, Usary J, Hu Z, He X, Fan C, Wu J, Carey LA, Perou CM (2006) Estrogen-regulated genes predict survival in hormone receptor-positive breast cancers. J Clin Oncol 24(11):1656–1664. doi:10.1200/JCO.2005.03.2755
Loi S, Haibe-Kains B, Desmedt C, Wirapati P, Lallemand F, Tutt AM, Gillet C, Ellis P, Ryder K, Reid JF, Daidone MG, Pierotti MA, Berns EM, Jansen MP, Foekens JA, Delorenzi M, Bontempi G, Piccart MJ, Sotiriou C (2008) Predicting prognosis using molecular profiling in estrogen receptor-positive breast cancer treated with tamoxifen. BMC Genom 9:239. doi:10.1186/1471-2164-9-239
Symmans WF, Hatzis C, Sotiriou C, Andre F, Peintinger F, Regitnig P, Daxenbichler G, Desmedt C, Domont J, Marth C, Delaloge S, Bauernhofer T, Valero V, Booser DJ, Hortobagyi GN, Pusztai L (2010) Genomic index of sensitivity to endocrine therapy for breast cancer. J Clin Oncol 28(27):4111–4119. doi:10.1200/JCO.2010.28.4273
Fan C, Oh DS, Wessels L, Weigelt B, Nuyten DS, Nobel AB, van’t Veer LJ, Perou CM (2006) Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 355(6):560–569. doi:10.1056/NEJMoa052933
Iwamoto T, Pusztai L (2010) Predicting prognosis of breast cancer with gene signatures: are we lost in a sea of data? Genom Med 2(11):81. doi:10.1186/gm202
Buyse M, Sargent DJ, Grothey A, Matheson A, de Gramont A (2010) Biomarkers and surrogate end points—the challenge of statistical validation. Nat Rev Clin Oncol 7(6):309–317. doi:10.1038/nrclinonc.2010.43
Weigelt B, Pusztai L, Ashworth A, Reis-Filho JS (2012) Challenges translating breast cancer gene signatures into the clinic. Nat Rev Clin Oncol 9(1):58–64. doi:10.1038/nrclinonc.2011.125
Prat A, Parker JS, Fan C, Cheang MC, Miller LD, Bergh J, Chia SK, Bernard PS, Nielsen TO, Ellis MJ, Carey LA, Perou CM (2012) Concordance among gene expression-based predictors for ER-positive breast cancer treated with adjuvant tamoxifen. Ann Oncol 23(11):2866–2873. doi:10.1093/annonc/mds080
Goldstein LJ, Gray R, Badve S, Childs BH, Yoshizawa C, Rowley S, Shak S, Baehner FL, Ravdin PM, Davidson NE, Sledge GW Jr, Perez EA, Shulman LN, Martino S, Sparano JA (2008) Prognostic utility of the 21-gene assay in hormone receptor-positive operable breast cancer compared with classical clinicopathologic features. J Clin Oncol 26(25):4063–4071. doi:10.1200/JCO.2007.14.4501
Flanagan MB, Dabbs DJ, Brufsky AM, Beriwal S, Bhargava R (2008) Histopathologic variables predict Oncotype DX recurrence score. Mod Pathol 21(10):1255–1261. doi:10.1038/modpathol.2008.54
Geradts J, Bean SM, Bentley RC, Barry WT (2010) The oncotype DX recurrence score is correlated with a composite index including routinely reported pathobiologic features. Cancer Invest 28(9):969–977. doi:10.3109/07357907.2010.512600
Cuzick J, Dowsett M, Pineda S, Wale C, Salter J, Quinn E, Zabaglo L, Mallon E, Green AR, Ellis IO, Howell A, Buzdar AU, Forbes JF (2011) Prognostic value of a combined estrogen receptor, progesterone receptor, Ki-67, and human epidermal growth factor receptor 2 immunohistochemical score and comparison with the Genomic Health recurrence score in early breast cancer. J Clin Oncol 29(32):4273–4278. doi:10.1200/JCO.2010.31.2835
Leek JT, Scharpf RB, Bravo HC, Simcha D, Langmead B, Johnson WE, Geman D, Baggerly K, Irizarry RA (2010) Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet 11(10):733–739. doi:10.1038/nrg2825
Lusa L, McShane LM, Reid JF, De Cecco L, Ambrogi F, Biganzoli E, Gariboldi M, Pierotti MA (2007) Challenges in projecting clustering results across gene expression-profiling datasets. J Natl Cancer Inst 99(22):1715–1723. doi:10.1093/jnci/djm216
Baggerly KA, Coombes KR, Neeley ES (2008) Run batch effects potentially compromise the usefulness of genomic signatures for ovarian cancer. J Clin Oncol 26(7):1186–1187; author reply 1187–1188. doi:10.1200/JCO.2007.15.1951
Wang D, Cheng L, Zhang Y, Wu R, Wang M, Gu Y, Zhao W, Li P, Li B, Wang H, Huang Y, Wang C, Guo Z (2012) Extensive up-regulation of gene expression in cancer: the normalised use of microarray data. Mol BioSyst 8(3):818–827. doi:10.1039/c2mb05466c
Loven J, Orlando DA, Sigova AA, Lin CY, Rahl PB, Burge CB, Levens DL, Lee TI, Young RA (2012) Revisiting global gene expression analysis. Cell 151(3):476–482. doi:10.1016/j.cell.2012.10.012
Eddy JA, Sung J, Geman D, Price ND (2010) Relative expression analysis for molecular cancer diagnosis and prognosis. Technol Cancer Res Treat 9(2):149–159
Xu L, Tan AC, Winslow RL, Geman D (2008) Merging microarray data from separate breast cancer studies provides a robust prognostic test. BMC Bioinform 9:125. doi:10.1186/1471-2105-9-125
Russnes HG, Navin N, Hicks J, Borresen-Dale AL (2011) Insight into the heterogeneity of breast cancer through next-generation sequencing. J Clin Invest 121(10):3810–3818. doi:10.1172/JCI57088
Bertos NR, Park M (2011) Breast cancer—one term, many entities? J Clin Invest 121(10):3789–3796. doi:10.1172/JCI57100
Koboldt DC, Fulton RS, McLellan MD, Schmidt H, Kalicki-Veizer J, McMichael JF, Fulton LL, Dooling DJ, Ding L (2012) Comprehensive molecular portraits of human breast tumours. Nature 490(7418):61–70. doi:10.1038/nature11412
Miller TW, Balko JM, Ghazoui Z, Dunbier A, Anderson H, Dowsett M, Gonzalez-Angulo AM, Mills GB, Miller WR, Wu H, Shyr Y, Arteaga CL (2011) A gene expression signature from human breast cancer cells with acquired hormone independence identifies MYC as a mediator of antiestrogen resistance. Clin Cancer Res 17(7):2024–2034. doi:10.1158/1078-0432.CCR-10-2567
Louie MC, McClellan A, Siewit C, Kawabata L (2010) Estrogen receptor regulates E2F1 expression to mediate tamoxifen resistance. Mol Cancer Res 8(3):343–352. doi:10.1158/1541-7786.MCR-09-0395
Fox EM, Miller TW, Balko JM, Kuba MG, Sanchez V, Smith RA, Liu S, Gonzalez-Angulo AM, Mills GB, Ye F, Shyr Y, Manning HC, Buck E, Arteaga CL (2011) A kinome-wide screen identifies the insulin/IGF-I receptor pathway as a mechanism of escape from hormone dependence in breast cancer. Cancer Res 71(21):6773–6784. doi:10.1158/0008-5472.CAN-11-1295
Turner N, Pearson A, Sharpe R, Lambros M, Geyer F, Lopez-Garcia MA, Natrajan R, Marchio C, Iorns E, Mackay A, Gillett C, Grigoriadis A, Tutt A, Reis-Filho JS, Ashworth A (2010) FGFR1 amplification drives endocrine therapy resistance and is a therapeutic target in breast cancer. Cancer Res 70(5):2085–2094. doi:10.1158/0008-5472.CAN-09-3746
Musgrove EA, Sutherland RL (2009) Biological determinants of endocrine resistance in breast cancer. Nat Rev Cancer 9(9):631–643. doi:10.1038/nrc2713
Zhou X, Shi T, Li B, Zhang Y, Shen X, Li H, Hong G, Liu C, Guo Z (2013) Genes dysregulated to different extent or oppositely in estrogen receptor-positive and estrogen receptor-negative breast cancers. PLoS One 8(7):e70017
Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (2013) NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res 41(Database issue):D991–D995. doi:10.1093/nar/gks1193
Rustici G, Kolesnikov N, Brandizi M, Burdett T, Dylag M, Emam I, Farne A, Hastings E, Ison J, Keays M, Kurbatova N, Malone J, Mani R, Mupo A, Pedro Pereira R, Pilicheva E, Rung J, Sharma A, Tang YA, Ternent T, Tikhonov A, Welter D, Williams E, Brazma A, Parkinson H, Sarkans U (2013) ArrayExpress update—trends in database growth and links to data analysis tools. Nucleic Acids Res 41(Database issue):D987–D990. doi:10.1093/nar/gks1174
Hatzis C, Pusztai L, Valero V, Booser DJ, Esserman L, Lluch A, Vidaurre T, Holmes F, Souchon E, Wang H, Martin M, Cotrina J, Gomez H, Hubbard R, Chacon JI, Ferrer-Lozano J, Dyer R, Buxton M, Gong Y, Wu Y, Ibrahim N, Andreopoulou E, Ueno NT, Hunt K, Yang W, Nazario A, DeMichele A, O’Shaughnessy J, Hortobagyi GN, Symmans WF (2011) A genomic predictor of response and survival following taxane-anthracycline chemotherapy for invasive breast cancer. JAMA 305(18):1873–1881. doi:10.1001/jama.2011.593
Garcia-Becerra R, Santos N, Diaz L, Camacho J (2012) Mechanisms of resistance to endocrine therapy in breast cancer: focus on signaling pathways, miRNAs and genetically based resistance. Int J Mol Sci 14(1):108–145. doi:10.3390/ijms14010108
Osborne CK, Schiff R (2011) Mechanisms of endocrine resistance in breast cancer. Annu Rev Med 62:233–247. doi:10.1146/annurev-med-070909-182917
Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197
Kattan MW (2004) Evaluating a new marker’s predictive contribution. Clin Cancer Res 10(3):822–824
Moskowitz CS, Pepe MS (2004) Quantifying and comparing the accuracy of binary biomarkers when predicting a failure time outcome. Stat Med 23(10):1555–1570. doi:10.1002/sim.1747
Therneau TM, Grambsch PM (2000) Modeling survival data: extending the cox model. Springer, New York
Davies C, Pan H, Godwin J, Gray R, Arriagada R, Raina V, Abraham M, Medeiros Alencar VH, Badran A, Bonfill X, Bradbury J, Clarke M, Collins R, Davis SR, Delmestri A, Forbes JF, Haddad P, Hou MF, Inbar M, Khaled H, Kielanowska J, Kwan WH, Mathew BS, Mittra I, Muller B, Nicolucci A, Peralta O, Pernas F, Petruzelka L, Pienkowski T, Radhika R, Rajan B, Rubach MT, Tort S, Urrutia G, Valentini M, Wang Y, Peto R (2013) Long-term effects of continuing adjuvant tamoxifen to 10 years versus stopping at 5 years after diagnosis of oestrogen receptor-positive breast cancer: ATLAS, a randomised trial. Lancet 381(9869):805–816. doi:10.1016/S0140-6736(12)61963-1
Jin H, Tu D, Zhao N, Shepherd LE, Goss PE (2012) Longer-term outcomes of letrozole versus placebo after 5 years of tamoxifen in the NCIC CTG MA.17 trial: analyses adjusting for treatment crossover. J Clin Oncol 30(7):718–721. doi:10.1200/JCO.2010.34.4010
Colleoni M, Giobbie-Hurder A, Regan MM, Thurlimann B, Mouridsen H, Mauriac L, Forbes JF, Paridaens R, Lang I, Smith I, Chirgwin J, Pienkowski T, Wardley A, Price KN, Gelber RD, Coates AS, Goldhirsch A (2011) Analyses adjusting for selective crossover show improved overall survival with adjuvant letrozole compared with tamoxifen in the BIG 1-98 study. J Clin Oncol 29(9):1117–1124. doi:10.1200/JCO.2010.31.6455
Cuzick J, Sestak I, Baum M, Buzdar A, Howell A, Dowsett M, Forbes JF (2010) Effect of anastrozole and tamoxifen as adjuvant treatment for early-stage breast cancer: 10-year analysis of the ATAC trial. Lancet Oncol 11(12):1135–1141. doi:10.1016/S1470-2045(10)70257-6
Dowsett M, Cuzick J, Ingle J, Coates A, Forbes J, Bliss J, Buyse M, Baum M, Buzdar A, Colleoni M, Coombes C, Snowdon C, Gnant M, Jakesz R, Kaufmann M, Boccardo F, Godwin J, Davies C, Peto R (2010) Meta-analysis of breast cancer outcomes in adjuvant trials of aromatase inhibitors versus tamoxifen. J Clin Oncol 28(3):509–518. doi:10.1200/JCO.2009.23.1274
Dubsky PC, Jakesz R, Mlineritsch B, Postlberger S, Samonigg H, Kwasny W, Tausch C, Stoger H, Haider K, Fitzal F, Singer CF, Stierer M, Sevelda P, Luschin-Ebengreuth G, Taucher S, Rudas M, Bartsch R, Steger GG, Greil R, Filipcic L, Gnant M (2012) Tamoxifen and anastrozole as a sequencing strategy: a randomized controlled trial in postmenopausal patients with endocrine-responsive early breast cancer from the Austrian Breast and Colorectal Cancer Study Group. J Clin Oncol 30(7):722–728. doi:10.1200/JCO.2011.36.8993
Acknowledgments
This work was supported by the National Natural Science Foundation of China [grant numbers 81071646 and 91029717 to ZG, 81201822 to YG]; and the Research Fund for the Doctoral Program of Higher Education of China [20112307110011 to ZG]. The funders had no role in study design, data collection and analysis, preparation of the manuscript, or decision to publish.
Conflict of interest
The authors declare that they have no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zhou, X., Li, B., Zhang, Y. et al. A relative ordering-based predictor for tamoxifen-treated estrogen receptor-positive breast cancer patients: multi-laboratory cohort validation. Breast Cancer Res Treat 142, 505–514 (2013). https://doi.org/10.1007/s10549-013-2767-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10549-013-2767-8





