The molecular diversity of Luminal A breast tumors

Ciriello, Giovanni; Sinha, Rileen; Hoadley, Katherine A.; Jacobsen, Anders S.; Reva, Boris; Perou, Charles M.; Sander, Chris; Schultz, Nikolaus

doi:10.1007/s10549-013-2699-3

The molecular diversity of Luminal A breast tumors

Preclinical study
Open access
Published: 06 October 2013

Volume 141, pages 409–420, (2013)
Cite this article

Download PDF

You have full access to this open access article

Breast Cancer Research and Treatment Aims and scope Submit manuscript

The molecular diversity of Luminal A breast tumors

Download PDF

Giovanni Ciriello¹,
Rileen Sinha¹,
Katherine A. Hoadley²,
Anders S. Jacobsen¹,
Boris Reva¹,
Charles M. Perou²,
Chris Sander¹ &
…
Nikolaus Schultz¹

5961 Accesses
3 Altmetric
Explore all metrics

Abstract

Breast cancer is a collection of diseases with distinct molecular traits, prognosis, and therapeutic options. Luminal A breast cancer is the most heterogeneous, both molecularly and clinically. Using genomic data from over 1,000 Luminal A tumors from multiple studies, we analyzed the copy number and mutational landscape of this tumor subtype. This integrated analysis revealed four major subtypes defined by distinct copy-number and mutation profiles. We identified an atypical Luminal A subtype characterized by high genomic instability, TP53 mutations, and increased Aurora kinase signaling; these genomic alterations lead to a worse clinical prognosis. Aberrations of chromosomes 1, 8, and 16, together with PIK3CA, GATA3, AKT1, and MAP3K1 mutations drive the other subtypes. Finally, an unbiased pathway analysis revealed multiple rare, but mutually exclusive, alterations linked to loss of activity of co-repressor complexes N-Cor and SMRT. These rare alterations were the most prevalent in Luminal A tumors and may predict resistance to endocrine therapy. Our work provides for a further molecular stratification of Luminal A breast tumors, with potential direct clinical implications.

An integrated genomics approach identifies drivers of proliferation in luminal-subtype human breast cancer

Article 24 August 2014

Nonfamilial Breast Cancer Subtypes

The somatic mutation profiles of 2,433 breast cancers refine their genomic and transcriptomic landscapes

Article Open access 10 May 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Evidence from multiple studies converges in defining breast cancer as a collection of distinct diseases with different molecular traits, prognosis, and therapeutic options.

These diseases are mostly characterized by the status of hormone and growth factor receptors. Estrogen receptor (ER), progesterone receptor (PR), and the Her2 tyrosine kinase play a major role in determining the molecular phenotype of the tumor and dictate treatment [1–5]. In the clinic, the most frequently occurring type of breast cancer is Her2−, ER+, and/or PR+, which represents ~150,000 cases each year in the US.

RNA expression-based signatures [6, 7] provided further insights into the diversity of breast tumors. By expression profiling, the large majority of ER+ and/or PR+ tumors are of the “luminal subtypes” [7, 8]. These tumors can be subdivided into Luminal A and Luminal B, with the former being typically low grade and associated with a better prognosis [9]. Luminal A is overall the most frequently occurring breast cancer expression subtype in the population. mRNA-derived subtypes also include Basal-like breast tumors, which are predominantly negative for ER, PR, and Her2, and Her2-enriched tumors, which are positive for Her2 (Fig. 1a).

Recently, major genomic studies further investigated the heterogeneity of breast tumors using multiple genomic technology platforms and approaches [10–16]. The most comprehensive of these studies, from The Cancer Genome Atlas (TCGA), assayed over 800 breast tumors with six different platforms including SNP arrays for DNA copy number alterations (CNA), whole exome sequencing, mRNA expression microarrays, DNA methylation, and protein expression and phosphorylation using reverse phase protein arrays (RPPA) [10]. Collectively, these studies revealed further complexity and diversity between and within the known subtypes.

In particular, a rather heterogeneous spectrum of CNAs and somatic mutations has been observed across luminal tumors [10–12]. Luminal A and Luminal B tumors have been associated with multiple and distinct copy number-driven clusters in both the dataset from TCGA [10] and the one from METABRIC [12], indicating that different copy number changes characterize subsets of these tumors (Fig. 1b). Similarly, despite an overall low mutation rate per tumor, Luminal B and especially Luminal A tumors have the highest number of genes mutated more frequently than expected by chance as a class [10]. We confirmed this trend by integrating three different datasets (TCGA [10], Broad [16], and WashU [11]) and estimated the statistical significance of recurrence in the unified dataset (Fig. 1c; Table 1). Importantly, Luminal heterogeneity extends to tumor prognosis. Luminal A tumors, while associated with the highest median overall survival, are also characterized by the most variability in survival (Fig. 1d). Moreover, it has been shown that the risk of late mortality in this subtype persists at least over 10 years after initial diagnosis, and is higher than in the other subtypes in the long term [17].

Table 1 Luminal A breast cancer cohorts

Full size table

These preliminary observations point to Luminal A breast cancer as the most heterogeneous both molecularly and clinically. The diversity and incidence of this tumor call for in depth genomic studies to explain its molecular heterogeneity and link it to clinical outcome.

To this purpose, we integrated data from six different datasets to explore the genomic complexity of over 1,000 Luminal A tumors (Table 1). We used the TCGA dataset consisting of 209 Luminal A tumors as a discovery dataset (Table S1) and confirmed our findings in the other cohorts. We identified reproducible subgroups within this subtype, each of which showed distinct DNA CNA, somatic mutations, pathway alterations, and clinical outcome.

Materials and methods

Genomic data

Data from the TCGA study is accessible through the TCGA web portal at https://tcga-data.nci.nih.gov/tcga/tcgaHome2.jsp. Data from METABRIC dataset was made available upon request and is now accessible through the European Genome–phenome Archive (EGA) with the accession number EGAS00000000083. Raw and pre-processed aCGH data for the Russnes et al. combined dataset can be accessed through the Gene Expression Omnibus (GEO) repository with accession numbers: GSE8757 [18], GSE20394 [19], and GSE19425 [20]. Mutation data for the Ellis et al. and Banerji et al. dataset are available as supplemental information within the respective publications. The datasets used in this manuscript can be explored using the cBio Cancer Genomics Portal at http://www.cbioportal.org/public-portal/ [21].

Recurrent mutations in breast cancer subtypes

In this work, we account for somatic mutations reported by multiple studies [10, 11, 16]. To estimate statistical significance of recurrent mutations across multiple datasets, we integrated the full set of reported mutations from these studies and used the binomial distribution to model the somatic mutation frequencies of genes. Given N samples, to estimate if a gene had a higher somatic mutation rate (mutations per nucleotide) than expected by chance, we evaluated if a gene with K observed non-silent mutations (summed over all tumor samples: \(K = \sum\nolimits_{i = 1}^{N} {k_{i} }\)) and a global coding sequence of length L (summed over all tumor samples: L = l * N) had more mutations than expected, i.e., the average somatic mutation rate for all genes (G) with observed somatic mutations \(p = \frac{1}{G}\sum\nolimits_{i = 1}^{G} {\frac{{K_{i} }}{{L_{i} }}.}\)

Thus: \(P(X \ge K) = 1 - P(X < K) = \sum\limits_{i = 0}^{K - 1} {\left( {\begin{array}{*{20}c} L \\ i \\ \end{array} } \right)p^{i} (1 - p)^{L - i} .}\)

From the set of p-values we estimated corresponding false discovery rates, or q-values, using the Benjamini–Hochberg procedure [22].

Copy number clustering

DNA copy number data were produced and processed for TCGA at the Broad Institute [10]. Briefly, copy number levels were inferred from Affymetrix SNP 6.0 CEL files by Birdseed and tangent normalization. Segmentation was then performed by Circular Binary Segmentation [23]. Copy number clustering was performed on normalized and segmented copy number data. A unified breakpoint profile (region by sample matrix) was derived by combining all breakpoints across all samples and determining the minimal common regions of change. Unified breakpoint profiles were computed using the R package CNTools [24] from Bioconductor [25] (http://bioc.ism.ac.jp/2.5/bioc/html/CNTools.html). Hierarchical clustering was done using the R function hclust, with manhattan distance and Ward’s agglomeration method [26].

Cluster centroids for each copy number cluster derived from the Luminal A TCGA dataset were computed by averaging cluster member features (unified breakpoints).

For each centroid we determined the most intrinsically variable breakpoints using the median absolute deviation (MAD) measure. Unified breakpoints with MAD > 0.1 were selected to define the cluster centroids. Average MAD values for this subset is threefold higher than the average of the remaining breakpoints, and twofold higher than the overall average (Fig. S1).

Samples from the METABRIC dataset have been assigned to the cluster whose centroid shows the highest correlation. Pearson correlation coefficient is a scale-independent measure and, thus, able to overcome the difficulty of comparing copy number values obtained with different platforms and on different scales. Nonetheless, we identified few samples (20 out of 594) with low correlation values with each centroids (pearson < 0.1). This subset is characterized by flat copy number spectra, such that all probes have identical values close to zero, with the exception of a few isolated probes likely to be either artifacts of the array or germline copy number variation (CNV). These flat samples have been assigned to the Copy Number Quiet subtype.

The quality of the clusters obtained with the centroids has been evaluated by different metrics: the clusters show similar proportions as those observed for the TCGA dataset (Fig. 1b), in-group proportion (IGP) values determined as in [27] are statistically significant (p < 0.001 for all clusters, Fig. 1b), and copy number pattern of alterations are remarkably similar to those observed in the TCGA dataset (Fig. S2).

Survival analysis

Survival analysis on the METABRIC dataset has been performed using the R package survival (http://cran.r-project.org/web/packages/survival/index.html) [26]. Patient follow-up has been limited to 15 years, and deaths related to other causes have been ignored in the analysis. The same analysis was performed for the Russnes et al. [19] dataset consisting of 77 Luminal A samples and combined three different cohorts: Ull cohort [19] (26 Luminal A samples), MicMa cohort [20] (9), and Chin cohort [18] (42).

Cox regression multivariate analysis was performed suing the coxph function from the survival R package. Multivariate analysis was used to assess dependencies between the classification induced by the Copy Number High (CNH) cluster and multiple covariates: tumor size, grade, stage, and age at diagnosis. This information was available for 468 Luminal A samples in the METABRIC dataset.

Subtype enrichment analysis

Given the tremendous heterogeneity displayed by Luminal A breast tumors, we analyzed the overall spectrum of genomic alterations across different subtypes looking for copy number subtype-specific patterns. Our approach relies on the general abstraction of gene alteration per sample, where each alteration belongs to one of the three categories: (a) gene is altered by mutations; (b) gene is primarily altered by CNA, and mRNA expression levels correlate with copy number changes; (c) “Wild-card” events (e.g., gene shows aberrant mRNA expression and/or methylation status independent of mutations and copy number).

These categories rely on two systematic approaches: for mutations we selectively analyzed the list of SMGs identified by the algorithm MuSiC [28], for copy number we analyzed frequently amplified and deleted region of interest (ROI) as identified by GISTIC [29]. We used the set of wide copy number gains and losses as the wild-card events. First, we selected the chromosome arms that were found to be recurrently gained or lost by GISTIC in [10]. Second, for each event we classified a sample as altered if segments accounting for at least 50 % of the whole chromosome arm length had values above (gain) or below (loss) selected thresholds. In this study, we used T = 0.15 as the absolute value for the threshold, where gains are defined by copy number level >T and losses by copy number levels <−T.

To systematically look for subtype-specific genomic events, we developed a method called Subtype-Enriched Alterations (SEA). Subtype enrichment is tested in two steps: (1) the distribution of alterations is compared to the expected distribution given in the number of samples that belong to each subtype by a goodness-of-fit test; (2) a hypergeometric p-value is derived for the subtype with the highest percentage of alterations when compared against all others. Enrichment p-values are then corrected for multiple testing [22]. Alterations in each category are tested separately and treated independently.

Differential mRNA expression analysis

To characterize the CNH subtype, beyond CNA and somatic mutations, we looked for genes differentially expressed between CNH tumors and the rest of Luminal A. We tested each gene by ANOVA, computed nominal p-values and corrected for multiple testing. High level amplification of the 8q region in the CNH group strongly influences this analysis with a high presence of genes located in this region. For this reason we used slightly less strict thresholds to select genes with nominal p-value < 0.05 and FDR-corrected q-value < 0.2.

Using this procedure we extracted two lists, one for genes that are up-regulated in CNH tumors, and one for genes that are down-regulated in the same set when compared to the other Luminal A tumors. Each list has been tested for functional enrichment using the DAVID Functional Annotation Tool [30].

Interestingly, despite a high number of genes from the highly amplified genomic region 8q21–24, only 3 out of 59 genes associated to “mitotic cell cycle” (GO: 0000278) are in this locus. Thus, mRNA up-regulation driven by this amplification does not seem to be related with mitosis regulation, spindle assembly, and chromatids segregation. We repeated the same functional enrichment test [30] after removing genes from 8q21–24, and separately for the up-regulated genes in this region to untangle potential different phenotypes associated to this copy number amplification and to other mRNA aberrations independent of it.

Up-regulated genes in CNH Luminal A that are not in the 8q21–24 region are similarly highly enriched for “mitosis cell cycle” (GO: 0000278), but also for related processes like “cell division” (GO: 0051301), “spindle” (GO: 000581), and “nuclear division” (GO: 0000280). These categories show elevated expression of both Aurora kinase A and B, as well as several genes in their pathways (e.g., PLK1, CDC25B, CCNA2, CDK1, INCENP, BIRC5, CDCA3/5/8, KIF2C). On the other hand, up-regulated genes in 8q21–24 were not found to be enriched for any functional annotation.

Mutual exclusivity analysis

All pairwise tests of mutual exclusivity were done using the switching permutation procedure described in [31]. This permutation strategy has the desirable property of preserving both number of alterations per gene and number of alterations per samples.

Mutual exclusivity modules (MEMo) were identified using the algorithm MEMo [31]. MEMo automatically identifies mutually exclusive alterations targeting frequently altered genes that are likely to belong to the same pathway. Genomic events were defined, as described in the previous section, following the “gene alteration per sample” abstraction and including focal copy number altered regions from GISTIC and recurrently mutated genes identified by MuSiC. Wild-card events included NF1 down-regulation (<1.5 standard deviations from the average) observed in 12 cases. The corresponding oncoprints for these modules are shown in Fig. S3 highlighting sample specific alterations in each module (samples are in the columns, altered genes in the rows). Module oncoprints were generated using the cBio Cancer Genomics Portal [21].

Results

The landscape of Luminal A CNA

Luminal A tumors show great heterogeneity in terms of somatic mutations and CNA [10], indicating that additional substructure may be present within this large group. Dissecting the genetics of this tumor could be fundamental to inform therapeutics and predict clinical outcome.

We first explored the spectrum of copy number changes across Luminal A tumors, to identify novel and subset-specific alterations. We performed hierarchical clustering of Affymetrix 6.0 SNP copy number data from 209 Luminal A tumors from the TCGA dataset. Segments of uniform copy number value for each patient were compared to compute the set of unified breakpoints across the whole dataset, and the so determined set of minimal segments of change were used as features for the clustering procedure. Hierarchical clustering of copy number changes across the whole genome on the TCGA dataset revealed a complex structure of recurrent patterns of alterations. Based on clustering results and recurrent CNA, we were able to identify four major characteristic patterns and a mixed group (Fig. 2a; Table S1).

The first major pattern is characterized by 1q gain and 16q loss (1q/16q pattern, clusters a, b, and c). This pattern has been frequently observed in breast tumors and has been associated with the translocation der(1;16) [32, 33]. The 1q/16q pattern is dominant in cluster a, which features otherwise mostly diploid genomes. By contrast, cluster b is characterized by a broad deletion occurring on 6q, and cluster c has concurrent 11q13–14 focal amplification and 11q loss. High level amplification of the 11q13 and 11q14 loci is frequently observed in breast tumors and minimal regions of overlap target CCND1 and PAK1, respectively. Notably, these amplicons significantly co-occur with loss of the remaining part of the 11q arm across all tumors in the TCGA dataset (p = 6E−8, by one-tail-Fisher’s exact test).

Another group of Luminal A patients is characterized by a surprisingly quiet copy number spectrum (Copy Number Quiet pattern). These tumors (cluster d) have almost completely diploid genomes, with only few cases showing whole arm loss of 16q.

The third group includes clusters e, f, and g, and is strongly characterized by CNA of chromosome 8, with loss of 8p and gain of 8q (Chr8-associated pattern). Within this group, cluster f shows an interesting pattern of CNA, where 8p loss and 8q gain co-occur with 16p gain and 16q loss. These gains and losses affect the whole arms of the chromosomes, and in this group they do not co-occur with other CNAs. Cluster g, on the other hand, displays more copy number changes and is enriched for focal amplifications of 8p11.23–22 (FGFR1, ZNF703, and WSHC1L1), 8p11.21 (IKBKB), and 11q13–14 (Table S2).

The fourth group is characterized by the highest level of genomic instability among Luminal A tumors, including multiple focal CNAs (CNH pattern, cluster h). This group shows recurrent 20q gain, 5q loss, 8p loss, 8q gain, and is enriched for focal amplifications of the MYC oncogene on 8q24.21 (Table S2). Finally, the Mixed group is characterized by frequent whole-arm and whole-chromosome gains and losses, lacking the recognizable patterns seen in the other four groups (cluster i).

We validated our clusters using the 721 Luminal A samples from the METABRIC dataset [12]. We classified these tumors using centroids derived from the TCGA Luminal A dataset to identify clusters with the same alteration patterns (Fig. S2; Table S3). The clusters obtained from the METABRIC dataset occurred in similar proportions as the TCGA clusters and their quality was confirmed by the IGP [27] measure (Fig. 2b). Interestingly, the CN-Quiet and Chr.8-associated clusters show strong correspondences with two of the Luminal-enriched clusters from METABRIC and colleagues (IC4 and IC7, respectively), whereas components of the Mixed and CN-High groups are spread across multiple clusters (Fig. 2c).

The landscape of Luminal A somatic mutations

Distinct copy number patterns frequently come in tandem with equally variable landscapes of somatic mutations. Indeed, despite the lowest mutation rate among breast cancer subtypes, the Luminal A subtype shows the largest number of genes mutated with statistically significant recurrence (Fig. 3a; Table 2).

Table 2 Recurrent somatic mutations of breast cancer subtypes [10, 11, 16]

Full size table

The most frequently mutated genes (>10 %) are PIK3CA, GATA3, MAP3K1, and TP53. Interestingly, all show significant associations with recurrent copy number patterns. PIK3CA and GATA3 mutations are mostly found in tumors with low CNA (CN-Quiet and 1q/16q), and in particular GATA3 mutations are enriched for the 1q/16q subgroup (p_GATA3 = 0.009). Notably, 9 out of 15 hotspot mutations for GATA3 are in this subgroup. MAP3K1 mutations are enriched in the Chr.8-associated subgroup (p_MAP3K1 = 4.22E−04). MAP3K1 mutations strongly co-occur with the 8p−/8q+/16p+/16q− pattern observed in cluster f (p_{MAP3K1
(f)} = 1.9E−5), and thus, are largely mutually exclusive with focal amplification of 8p11.23–21 (Fig. 3b). All MAP3K1-mutated cases harbor at least one inactivating mutation, indicating loss of function, and most of them have at least two mutations, suggesting bi-allelic inactivation (Fig. 3c). Finally, the CNH subgroup shows an overall depletion for all recurrent Luminal A mutations, except for a strong presence of TP53 mutations (p_TP53 = 9.35E−6).

Luminal A tumors show also an interesting presence of hotspot mutations beyond PIK3CA and GATA3. AKT1 E17K activating mutations were observed in 3 out of 4 datasets and predominantly in Luminal A tumors (14 out of 20). Interestingly these mutations are perfectly mutually exclusive with those targeting PIK3CA. Additional hotspot mutations include those targeting KRAS (G12V/D), splicing factor SF3B1 (K666E, K700E), and c-Myc transcriptional repressor CTCF (R283C, H284P/Y/N).

Associations with clinical outcome reveal high-risk Luminal A subtype

The characterization of Luminal A tumors provided so far clearly identifies distinct subgroups within this tumor subtype, whose clinical relevance needs to be addressed now. The large dataset analyzed by Curtis et al. [12] has extensive clinical follow-up, enabling a reliable survival analysis. Kaplan–Meier analysis showed significantly different disease survival within the Luminal A subgroups (log-rank p = 0.015, Fig. S4). In particular, the CNH subgroup had significantly worse outcome (log-rank p = 4.6E−5, Fig. 4a) despite receiving similar treatments (Table S3). We validated the poor prognosis for the CNH tumors in an independent dataset consisting of 77 Luminal A tumors from three different cohorts [19] (log-rank p = 0.003, Fig. 4b; Table S4).

To assess dependencies between the CNH classification and other clinical covariates, we used Cox multivariate regression. We tested for dependencies for tumor grade, stage, size, and age at diagnosis. We found independent statistically significant association with outcome for tumor size (p = 6E−05), age at diagnosis (p = 0.004), and CNH classification (p = 0.01); no association was found between these covariates. The overall log-rank p-value for the combined covariates is p = 2E−08. The CNH classification showed, therefore, independent prognostic value (Table S5).

Finally, we compared scores derived from research-based versions of Oncotype DX [1], Mammaprint [34], and PAM50 Risk of Recurrence (ROR-S) [8] across Luminal A subgroups. We found that the CNH subgroup is consistently associated with a higher risk than the other Luminal A subgroups (Fig. S5), confirming the prediction of a worse prognosis.

As shown above, the CNH tumors are (1) characterized by high genomic instability, (2) depleted for typical Luminal A mutations (e.g., PIK3CA, 29 vs. 51 %, p_PIK3CA = 0.04), (3) highly enriched for TP53 mutations (48 vs. 7 % on average in the other Luminal A samples), and (4) tend to have MYC focal amplification, 20q gain and 5q loss (Fig. 4c). Interestingly, genes that are significantly over-expressed in CNH tumors compared to the rest of Luminal A cases are enriched for regulators of mitosis including Aurora kinases A and B, PLK1, Cyclin-A, Cyclin-E, CDK1, and Cdc25 (Fig. 4d; Table S6). A similar set of genes was previously found to be associated with 5q loss in Basal-like breast cancers [12]. Most of these genes have also been identified as “proliferation marker” genes [35]. The observed clinical outcome for CNH tumors, confirmed in two independent datasets, is therefore strongly associated to high genomic instability and proliferation as revealed by their molecular features.

Integrated pathway analysis reveals mechanisms of resistance to endocrine therapy

The great diversity of genomic lesions observed in Luminal A tumors maps to different cellular processes. To explore the role of these alterations in a pathway context, we used the MEMo algorithm [31], which identifies micro-pathways or modules whose components are frequently altered in a mutually exclusive manner. Statistically significant mutual exclusivity between recurrent alterations strengthens the hypothesis of functional relatedness and, more importantly, may reflect either functional redundancy, highlighting multiple ways to de-regulate the same pathway, or synthetic lethal interactions [31].

Modules extracted by MEMo in a Luminal A-only analysis highlight frequent alteration to the PI(3)K/Akt, MAP-kinase, and Ras/ERK signaling cascades (Figs. 5a, S3; Table S7). Alterations include PTEN inactivation, mutations of PIK3CA and AKT1, inactivation of MAP3K1 and MAP2K4, amplification of receptor tyrosine kinases (ERBB2 and IGF1R), and RAS activation either through activating mutations of KRAS or NF1 depletion by either DNA homozygous deletion or mRNA down-regulation (Fig. S6). Mutually exclusive inactivation of MAP3K1 and MAP2K4 was confirmed in the Ellis et al. [11] dataset, corroborating the hypothesis of reduced JNK signaling in Luminal A tumors and providing further insights into MAP3K1 mutations.

MEMo also identified a set of less frequent alterations targeting the N-Cor and SMRT complexes (Fig. 5b). While not statistically significant due to the small number of altered samples (28 out of 209 in total), alterations in these modules are almost completely mutually exclusive. Alterations include recurrent mutations targeting core components of the nuclear co-repressor complex (NCOR1, TBL1XR1, and GPS2) and the nuclear co-repressor 2 (NCOR2) or SMRT. Most of these mutations are either frame-shift or nonsense and thereby likely inactivating, they correlate with low mRNA expression, and frequently co-occur with hemizygous loss of the target gene. The modules also include homozygous deletions of ANKRD11, consistent with its ability to inhibit p160 steroid receptor co-activator recruitment [36, 37].

Co-repressor and co-activator complexes play a major role in regulating ER-α transcription and the inhibitory activity of Tamoxifen. Tamoxifen-bound ER has an increased affinity to co-repressors and specifically to N-Cor/SMRT. These co-repressors are required for the anti-proliferative effects of Tamoxifen (Fig. 4c). Repressing the NCor and SMRT complexes in human breast cancer cell lines turns Tamoxifen into an agonist of cell proliferation [38, 39], and lower or absent mRNA expression of NCor in patients correlates with shorter relapse [40, 41]. Here, for the first time, we identify distinct molecular mechanisms of inactivation of these complexes in patients. We directly link multiple genomic alterations, occurring in 13 % of the patients, to loss of co-repressors activity (Fig. 4d). These alterations may predict resistance to endocrine therapy.

Discussion

Recently, multiple studies of human breast cancer provided novel insights into the biology of this cancer and its intrinsic subtypes [7], as well as an unprecedented amount of genomic information still not completely explored and understood. This information is fundamental to inform patient treatments with targeted agents [42, 43]. Our work aimed at complementing recent breast cancer genomic studies, with an in-depth analysis of the most diverse breast cancer subtype: Luminal A. We dissected the genomics of Luminal A tumors in multiple datasets and have explained more of its molecular and clinical heterogeneity.

We identified four major subgroups of Luminal A tumors that are characterized by distinct patterns of CNA, somatic mutations, and clinical outcomes (Table 3). These include a subgroup (i.e., CNH) presenting molecular features atypical of Luminal tumors and associated with worse prognosis. Poor clinical outcome was confirmed in multiple datasets and is independent of other markers, such as tumor size, grade, stage, and age of diagnosis. Interestingly, the CNH distinction was also associated with higher scores coming from current clinical assays for breast cancer prognosis/prediction (Oncotype DX, Mammaprint, PAM50 ROR-S), thus providing a molecular explanation for these gene expression risk assays. This subtype shows a high level of CNA, recurrent TP53 mutations, and over-expression of mitotic regulators including Aurora kinases A and B (Table 3). Over-expression of these genes has been associated with high genomic instability and tumorigenesis [44, 45] and, more importantly, Aurora kinases are targets of specific inhibitors currently in clinical trials [46].

Table 3 Genomic features of Luminal A Copy Number Subtypes

Full size table

Integrated analysis of CNA and mutations showed a significant prevalence of GATA3 and PIK3CA hotspot mutations in tumors characterized by 1q gain and 16q loss; thus, the tumors with the fewest copy number changes showed not only associations with mutations within specific genes, but associations with distinct types of mutations within these genes. These results suggest that one type of mutation within GATA3 (i.e., intron 4 CA deletion) is strongly associated Luminal A 1q/16q cancers, while other mutations (exon 5 frameshifts) cause Luminal B cancers; thus, these mutations are likely to be cancer driving events, and early events within the evolution of these tumors.

Mutations within the PI3K pathway became of particular clinical interest with the advent of many specific PIK3CA inhibitors now in clinical trials [42, 47]. Of particular interest will be if the different pathway mutation types (PIK3CA mutation vs. PTEN mutation vs. AKT1 mutation) are all biomarkers of PIK3CA inhibitor sensitivity, and how these might interact with the different inhibitors, many of which have differing affinities for the PI(K) family of kinases. Lastly, multiple inactivating mutations in MAP3K1 were found in tumors with whole-arm events on chromosome 8 and 16. Alterations of these genes point to deregulated AKT and MAPK signaling, and this was confirmed by an unbiased pathways analysis. In particular, MAP3K1 and its direct target MAP2K4 negatively regulate JNK-mediated cell death, possibly compromising response to chemotherapeutic agents [48]; thus, a pressing clinical question is do mutations in MAP3K1 and/or MAP2K4 predict for lower response rates to chemotherapy and endocrine therapy, which can be addressed retrospectively if mutation detection can be performed using FFPE materials from existing clinical trial archives.

Our analysis of deregulation of cellular pathways in Luminal A tumors also revealed inactivation of the ER co-repressors N-Cor/SMRT. We identified multiple rare, but mutually exclusive, alterations targeting components of these complexes, as well as ANKRD11, an inhibitor of p160 co-activator complexes. Nuclear co-repressors regulate ER transcriptional activity and are required for the anti-proliferative effects of Tamoxifen [40, 41]. Alterations of these complexes may, therefore, promote ER-driven proliferation in the presence of Tamoxifen, and predict a lack of response to endocrine therapy. These alterations were observed almost exclusively in Luminal A tumors. Indeed, the same unbiased pathway analysis performed on the whole TCGA breast cancer dataset was unable to highlight them [10].

Our work integrated multiple genomic and genetic data types within the Luminal A breast cancer subtype and spanned multiple data sets. Our results have shed new light on the intrinsic heterogeneity within this subtype and strengthen the importance of genomic studies within tumor subpopulations. These in-depth analyses already show intrinsic heterogeneity in other breast cancer subtypes. Her-2 positive tumors have been shown to be composed of two main groups defined by different ER and EGFR status [10], and recently Her-2 mutations have been shown to be activating and tumorigenic [49] in tumors without DNA amplification of the Her-2 locus. Moreover, these mutations are prominent in relapsed invasive lobular breast cancer [50]. Similarly, basal-like and triple negative breast cancers have been object of extensive investigations due their aggressive nature [51–53]. We can now add Luminal A disease to the list of heterogeneous diseases with distinct subtypes within this previously defined single subtype. The wealth of genomic data available today enables these in-depth analyses of selected tumor subgroups and highlights the need for comprehensive genomic characterization of tumor samples to inform clinical trials and therapeutic choices.

References

Paik S et al (2004) A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med 351(27):2817–2826
Article PubMed CAS Google Scholar
Paik S et al (2006) Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer. J Clin Oncol 24(23):3726–3734
Article PubMed CAS Google Scholar
Romond EH et al (2005) Trastuzumab plus adjuvant chemotherapy for operable HER2-positive breast cancer. N Engl J Med 353(16):1673–1684
Article PubMed CAS Google Scholar
Slamon DJ, Romond EH, Perez EA (2006) Advances in adjuvant therapy for breast cancer. Clin Adv Hematol Oncol 4(3 Suppl 7): suppl 1, 4–9; discussion suppl 10; quiz 2 p following suppl 10
Perez EA et al (2011) Four-year follow-up of trastuzumab plus adjuvant chemotherapy for operable human epidermal growth factor receptor 2-positive breast cancer: joint analysis of data from NCCTG N9831 and NSABP B-31. J Clin Oncol 29(25):3366–3373
Article PubMed CAS Google Scholar
Sorlie T et al (2001) Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA 98(19):10869–10874
Article PubMed CAS Google Scholar
Perou CM et al (2000) Molecular portraits of human breast tumours. Nature 406(6797):747–752
Article PubMed CAS Google Scholar
Parker JS et al (2009) Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol 27(8):1160–1167
Article PubMed Google Scholar
Fan C et al (2006) Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 355(6):560–569
Article PubMed CAS Google Scholar
TCGA (2012) Comprehensive molecular portraits of human breast tumors. Nature 490:61–70
Article Google Scholar
Ellis MJ et al (2012) Whole-genome analysis informs breast cancer response to aromatase inhibition. Nature 486(7403):353–360
PubMed CAS Google Scholar
Curtis C et al (2012) The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486(7403):346–352
PubMed CAS Google Scholar
Stephens PJ et al (2012) The landscape of cancer genes and mutational processes in breast cancer. Nature 486(7403):400–404
PubMed CAS Google Scholar
Nik-Zainal S et al (2012) The life history of 21 breast cancers. Cell 149(5):994–1007
Article PubMed CAS Google Scholar
Nik-Zainal S et al (2012) Mutational processes molding the genomes of 21 breast cancers. Cell 149(5):979–993
Article PubMed CAS Google Scholar
Banerji S et al (2012) Sequence analysis of mutations and translocations across breast cancer subtypes. Nature 486(7403):405–409
Article PubMed CAS Google Scholar
Haque R et al (2012) Impact of breast cancer subtypes and treatment on survival: an analysis spanning two decades. Cancer Epidemiol Biomarkers Prev 21(10):1848–1855
Article PubMed Google Scholar
Chin K et al (2006) Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. Cancer Cell 10(6):529–541
Article PubMed CAS Google Scholar
Russnes HG et al (2010) Genomic architecture characterizes tumor progression paths and fate in breast cancer patients. Sci Transl Med 2(38):38ra47
Article PubMed Google Scholar
Hicks J et al (2006) Novel patterns of genome rearrangement and their association with survival in breast cancer. Genome Res 16(12):1465–1479
Article PubMed CAS Google Scholar
Cerami E et al (2012) The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2(5):401–404
Article PubMed Google Scholar
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate—a practical and powerful approach to multiple testing. J R Stat Soc Ser B 57(1):289–300
Google Scholar
Olshen AB et al (2004) Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5(4):557–572
Article PubMed Google Scholar
Zhang J, CNTools: Convert segment data into a region by sample matrix to allow for other high level computational analyses. R package (Version 1.6.0.)
Gentleman RC et al (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 5(10):R80
Article PubMed Google Scholar
R-Development-Core-Team (2010) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Kapp AV, Tibshirani R (2007) Are clusters found in one dataset present in another dataset? Biostatistics 8(1):9–31
Article PubMed Google Scholar
Dees ND et al (2012) MuSiC: identifying mutational significance in cancer genomes. Genome Res 22:1589–1598
Article PubMed CAS Google Scholar
Beroukhim R et al (2007) Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma. Proc Natl Acad Sci USA 104(50):20007–20012
Article PubMed CAS Google Scholar
Dennis G et al (2003) DAVID: database for annotation, visualization, and integrated discovery. Genome Biol 4(5):P3
Article PubMed Google Scholar
Ciriello G et al (2012) Mutual exclusivity analysis identifies oncogenic network modules. Genome Res 22(2):398–406
Article PubMed CAS Google Scholar
Flagiello D et al (1998) Highly recurrent der(1;16)(q10;p10) and other 16q arm alterations in lobular breast cancer. Genes Chromosomes Cancer 23(4):300–306
Article PubMed CAS Google Scholar
Tsuda H et al (1999) der(16)t(1;16)/der(1;16) in breast cancer detected by fluorescence in situ hybridization is an indicator of better patient prognosis. Genes Chromosomes Cancer 24(1):72–77
Article PubMed CAS Google Scholar
van de Vijver MJ et al (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347(25):1999–2009
Article PubMed Google Scholar
Whitfield ML et al (2006) Common markers of proliferation. Nat Rev Cancer 6(2):99–106
Article PubMed CAS Google Scholar
Zhang A et al (2004) Identification of a novel family of ankyrin repeats containing cofactors for p160 nuclear receptor coactivators. J Biol Chem 279(32):33799–33805
Article PubMed CAS Google Scholar
Zhang A, Li CW, Chen JD (2007) Characterization of transcriptional regulatory domains of ankyrin repeat cofactor-1. Biochem Biophys Res Commun 358(4):1034–1040
Article PubMed CAS Google Scholar
Keeton EK, Brown M (2005) Cell cycle progression stimulated by tamoxifen-bound estrogen receptor-alpha and promoter-specific effects in breast cancer cells deficient in N-CoR and SMRT. Mol Endocrinol 19(6):1543–1554
Article PubMed CAS Google Scholar
Cutrupi S et al (2012) Targeting of the adaptor protein Tab 2 as a novel approach to revert tamoxifen resistance in breast cancer cells. Oncogene 31:4353–4361
Article PubMed CAS Google Scholar
Girault I et al (2003) Expression analysis of estrogen receptor alpha coregulators in breast carcinoma: evidence that NCOR1 expression is predictive of the response to tamoxifen. Clin Cancer Res 9(4):1259–1266
PubMed CAS Google Scholar
Green AR et al (2008) The prognostic significance of steroid receptor co-regulators in breast cancer: co-repressor NCOR2/SMRT is an independent indicator of poor outcome. Breast Cancer Res Treat 110(3):427–437
Article PubMed CAS Google Scholar
Ellis MJ, Perou CM (2013) The genomic landscape of breast cancer as a therapeutic roadmap. Cancer Discov 3(1):27–34
Article PubMed CAS Google Scholar
Zardavas D, Baselga J, Piccart M (2013) Emerging targeted agents in metastatic breast cancer. Nat Rev Clin Oncol 10(4):191–210
Article PubMed CAS Google Scholar
Gautschi O et al (2008) Aurora kinases as anticancer drug targets. Clin Cancer Res 14(6):1639–1648
Article PubMed CAS Google Scholar
Nishida N et al (2007) High copy amplification of the Aurora-A gene is associated with chromosomal instability phenotype in human colorectal cancers. Cancer Biol Ther 6(4):525–533
Article PubMed CAS Google Scholar
Dar AA et al (2010) Aurora kinase inhibitors—rising stars in cancer therapeutics? Mol Cancer Ther 9(2):268–278
Article PubMed CAS Google Scholar
Juric D, Baselga J (2012) Tumor genetic testing for patient selection in phase I clinical trials: the case of PI3K inhibitors. J Clin Oncol 30(8):765–766
Article PubMed CAS Google Scholar
Small GW et al (2007) Mitogen-activated protein kinase phosphatase-1 is a mediator of breast cancer chemoresistance. Cancer Res 67(9):4459–4466
Article PubMed CAS Google Scholar
Bose R et al (2013) Activating HER2 mutations in HER2 gene amplification negative breast cancer. Cancer Discov 3(2):224–237
Article PubMed CAS Google Scholar
Ross JS et al (2013) Relapsed classic E-cadherin (CDH1) mutated invasive lobular breast cancer demonstrates a high frequency of HER2 (ERBB2) gene mutations. Clin Cancer Res 19:2668–2676
Article PubMed CAS Google Scholar
Prat A et al (2013) Molecular characterization of basal-like and non-basal-like triple-negative breast cancer. Oncologist 18(2):123–133
Article PubMed CAS Google Scholar
O’Toole SA et al (2013) Therapeutic targets in triple negative breast cancer. J Clin Pathol 66:530–542
Article PubMed Google Scholar
Pacheco JM et al (2013) Racial differences in outcomes of triple-negative breast cancer. Breast Cancer Res Treat 138(1):281–289
Article PubMed CAS Google Scholar

Download references

Acknowledgments

We wish to acknowledge Tari King, Sarat Chandarlapaty, and Elisa Oricchio for helpful discussions and critical reading of the manuscript. This work was funded by the National Cancer Institute Cancer Genome Atlas Grant U24-CA143840 and U24-CA143848, the National Resource for Network Biology Grant P41 GM103504, and the SU2C-AACR-DT0209 Dream Team Translational Research Grant. This work was also supported by funds from the NCI Breast SPORE program (P50-CA58223-09A1), and the Breast Cancer Research Foundation.

Conflict of interest

C.M. Perou serves as a board member and has an ownership interest (including patents) in University Genomics and Bioclassifier LLC.

Author information

Authors and Affiliations

Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, NY, USA
Giovanni Ciriello, Rileen Sinha, Anders S. Jacobsen, Boris Reva, Chris Sander & Nikolaus Schultz
Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA
Katherine A. Hoadley & Charles M. Perou

Authors

Giovanni Ciriello
View author publications
You can also search for this author in PubMed Google Scholar
Rileen Sinha
View author publications
You can also search for this author in PubMed Google Scholar
Katherine A. Hoadley
View author publications
You can also search for this author in PubMed Google Scholar
Anders S. Jacobsen
View author publications
You can also search for this author in PubMed Google Scholar
Boris Reva
View author publications
You can also search for this author in PubMed Google Scholar
Charles M. Perou
View author publications
You can also search for this author in PubMed Google Scholar
Chris Sander
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaus Schultz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giovanni Ciriello.

Additional information

The corresponding email address reaches GC, CP, CS, and NS

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (EPS 393 kb)

Supplementary material 2 (EPS 1661 kb)

Supplementary material 3 (EPS 7601 kb)

Supplementary material 4 (EPS 482 kb)

Supplementary material 5 (EPS 3065 kb)

Supplementary material 6 (EPS 457 kb)

Supplementary material 7 (DOCX 20 kb)

Supplementary material 8 (XLSX 99 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Ciriello, G., Sinha, R., Hoadley, K.A. et al. The molecular diversity of Luminal A breast tumors. Breast Cancer Res Treat 141, 409–420 (2013). https://doi.org/10.1007/s10549-013-2699-3

Download citation

Received: 29 August 2013
Accepted: 10 September 2013
Published: 06 October 2013
Issue Date: October 2013
DOI: https://doi.org/10.1007/s10549-013-2699-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The molecular diversity of Luminal A breast tumors

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Genomic data

Recurrent mutations in breast cancer subtypes

Copy number clustering

Survival analysis

Subtype enrichment analysis

Differential mRNA expression analysis

Mutual exclusivity analysis

Results

The landscape of Luminal A CNA

The landscape of Luminal A somatic mutations

Associations with clinical outcome reveal high-risk Luminal A subtype

Integrated pathway analysis reveals mechanisms of resistance to endocrine therapy

Discussion

References

Acknowledgments

Conflict of interest

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation