Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer

Silva, Grace O.; He, Xiaping; Parker, Joel S.; Gatza, Michael L.; Carey, Lisa A.; Hou, Jack P.; Moulder, Stacy L.; Marcom, Paul K.; Ma, Jian; Rosen, Jeffrey M.; Perou, Charles M.

doi:10.1007/s10549-015-3476-2

Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer

Preclinical Study
Open access
Published: 25 June 2015

Volume 152, pages 347–356, (2015)
Cite this article

Download PDF

You have full access to this open access article

Breast Cancer Research and Treatment Aims and scope Submit manuscript

Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer

Download PDF

Grace O. Silva^1,2,3,
Xiaping He³,
Joel S. Parker^1,3,
Michael L. Gatza^1,3,
Lisa A. Carey^3,4,
Jack P. Hou^5,6,
Stacy L. Moulder⁷,
Paul K. Marcom⁸,
Jian Ma^5,9,
Jeffrey M. Rosen¹⁰ &
…
Charles M. Perou^1,2,3

3002 Accesses
38 Citations
10 Altmetric
Explore all metrics

Abstract

A large number of DNA copy number alterations (CNAs) exist in human breast cancers, and thus characterizing the most frequent CNAs is key to advancing therapeutics because it is likely that these regions contain breast tumor ‘drivers’ (i.e., cancer causal genes). This study aims to characterize the genomic landscape of breast cancer CNAs and identify potential subtype-specific drivers using a large set of human breast tumors and genetically engineered mouse (GEM) mammary tumors. Using a novel method called SWITCHplus, we identified subtype-specific DNA CNAs occurring at a 15 % or greater frequency, which excluded many well-known breast cancer-related drivers such as amplification of ERBB2, and deletions of TP53 and RB1. A comparison of CNAs between mouse and human breast tumors identified regions with shared subtype-specific CNAs. Additional criteria that included gene expression-to-copy number correlation, a DawnRank network analysis, and RNA interference functional studies highlighted candidate driver genes that fulfilled these multiple criteria. Numerous regions of shared CNAs were observed between human breast tumors and GEM mammary tumor models that shared similar gene expression features. Specifically, we identified chromosome 1q21-23 as a Basal-like subtype-enriched region with multiple potential driver genes including PI4KB, SHC1, and NCSTN. This step-wise computational approach based on a cross-species comparison is applicable to any tumor type for which sufficient human and model system DNA copy number data exist, and in this instance, highlights that a single region of amplification may in fact harbor multiple driver genes.

The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis

Article Open access 04 July 2016

Mouse Models of Breast Cancer Share Amplification and Deletion Events with Human Breast Cancer

Article 26 January 2017

An integrated genomics approach identifies drivers of proliferation in luminal-subtype human breast cancer

Article 24 August 2014

Introduction

Breast cancer is a heterogeneous disease that is characterized by distinct histological forms, genetic alterations, and patient outcomes [1–6]. Consistent with these observations, differential gene expression can distinguish molecular subtypes that separate breast cancer into distinct groups including Basal-like, Claudin-low, HER2-enriched, Luminal A, and Luminal B subtypes [2–4, 7–9]. These so called “intrinsic subtypes” are predictive of relapse-free survival, overall survival, and responsiveness to treatment [7–11]. Previous work highlighted numerous somatic mutations [12] and DNA copy number alterations (CNAs) [13] that are linked to specific intrinsic subtypes, suggesting that these genetic events may be causative of these subtypes. Beyond a few well-known drivers, the identification of genetic drivers present in many of these recurrent regions of DNA copy number change remains to be determined. Specifically, numerous CNAs are located on chromosome 1 and occur at high frequency among various cancer types including breast and liver [12, 14]. In breast cancer, copy number loss frequently occurs at 1p while copy number gains are frequent at 1q [13]. Furthermore, copy number gains at 1q often encompass the majority of the 1q arm, which include hundreds of genes.

To identify additional genetic drivers of breast cancer in common regions of amplification, we have taken a cross-species conservation approach based on the hypothesis that important etiological events in breast tumors will occur both in human breast cancers and mouse mammary tumor models. Through combined DNA copy number analyses of human breast tumors and multiple genetically engineered mouse (GEM) mammary tumor models, we identified 662 CNA regions conserved between these two species. Our ultimate selection strategy also incorporated gene expression data, an RNAi screen, and a network analysis to focus the list on the most likely driver genes within CNAs. Furthermore, using published functional studies, we provide new insights on the potential implications of Basal-like tumor-specific chromosome 1 drivers, some of which are therapeutically targetable.

Methods

Breast cancer tumor datasets

For these comparative studies, two human datasets and one mouse dataset were used that contained both gene expression and DNA copy number data (Table 1). The two human datasets were: (1) tumors collected at the University of North Carolina at Chapel Hill and the Oslo University Hospital, Radiumhospitalet, Norway (“UNC”, n = 159, GSE52173), and (2) The Cancer Genome Atlas (TCGA) Project dataset [12] (“TCGA”, n = 485). The third dataset contained tumors from numerous mouse mammary tumor models including GEM mammary models with inactivation of TP53, BRCA1, BRG1, and over-expression of cMYC, HER2/ERBB2/Neu, PyMT, and WNT1 (“mouse”, n = 73, GSE52173) (Supplemental Table 1). The publically available level 3 segmented copy number data for the TCGA dataset was downloaded through the TCGA data portal and the published PAM50 subtype calls were used [12]. Demographic and clinical characteristics of the UNC tumors are provided in Supplemental Table 2.

Table 1 Copy number array sample information of (a) human and (b) mouse tumors

Full size table

Cross-species assessment of subtype-specific changes in genomic DNA copy number

To identify subtype-specific CNAs from segmentation data generated by the various copy number array platforms (see Supplemental File 1 for details), we produced an add-on script to the SWITCHdna method of DNA copy number change point detection [13]. We created an R suite of functions called SWITCHplus, which can identify segments of the genome with copy number changes specific for a user-determined set of tumors, thus providing a supervised method for analyzing copy number data. SWITCHplus is provided as a source script in R and available for download at: https://genome.unc.edu/SWITCHplus/. Note, that we did not perform multiple hypothesis testing corrections as we chose alternative biologically based filtering criteria (Fig. 1) based upon cross-species conservation.

Computational analysis of candidate driver genes within conserved CNAs

In order to identify putative driver alterations within regions of copy number gains or losses, we began with all the conserved CNAs with a subtype segment frequency of 15 % or greater. To distinguish putative drivers from passengers, three further criteria were used. We first identified genes within a CNA that demonstrate concordance between the DNA and RNA expression. The second criterion filtered for conserved CNAs that contained genes with a breast cell line RNAi-associated phenotype as published in the Solimini et al. 2012 RNAi screen on human mammary epithelial cells [15]. The third criterion was to identify top ranking genes when scored using DawnRank [16]. By combining all these features together, we further decrease the false positive genes by filtering out genes without functional implications (Supplemental Table 3). A more extensive and detailed “Methods” section can be found as Supplemental File 1.

Results

Subtype-specific breast cancer copy number landscapes

In order to identify both known and novel genetic drivers of breast cancer on the DNA copy number level, we developed a multi-step and multi-platform computational strategy (Fig. 1). This strategy is predicated on using a “cross-species” comparative genomics approach where we searched for spontaneous copy number events across two different species (human and mouse). For this study, we created a new murine genomic resource of 73 mammary tumors profiled by both gene expression and DNA copy number microarray data (GSE52173); this new resource complements our human data set that contains 644 human breast tumors that have both gene expression and DNA copy number data (GSE52173 and http://tcga-data.nci.nih.gov/tcga).

We began using gene expression data to identify subtypes, separately for human tumor samples and GEM mammary models. For clarity, we refer to the classification of mouse tumors as “groups” to distinguish them from human classes that are termed “subtypes”. Using the PAM50 [8] algorithm and the Claudin-low predictor [9] we assigned each of the human tumor samples within the dataset to a specific intrinsic breast cancer subtype (Table 1). However, since there is no established expression-based classifier for mouse mammary tumors, we performed a supervised hierarchical cluster analysis of the murine mRNA expression data using the Herschkowitz et al. 2007 intrinsic mouse list of 866 genes. SigClust [17] analysis was used to identify 7 significant mouse groups (Supplemental Fig. 1), which were given a unique group name based on the majority mouse model contributor in that group (i.e., Myc, Neu/PyMT, Wnt1, C3Tag, Mixed, p53null-Basal, and p53null-Luminal). The “Mixed” mouse group lacked a single dominant mouse model contributor, however, this group comprised mouse tumors that all demonstrate the previously described Claudin-low gene expression features [18, 19], and hence forth this mouse group is referred to as “ClaudinLow”.

To identify subtype-specific, and mouse group-specific regions of DNA copy number gains and/or losses we developed a new bioinformatics visualization tool called SWITCHplus. Applying this tool to the mouse dataset identified group-specific DNA copy number changes for each of the seven expression-defined groups (Fig. 2). These results suggest that most mouse groups are characterized by numerous DNA copy number changes, many of which are specific to a given model/group (Supplemental Table 4). However, by comparing the copy number landscape between mouse groups, we also identified CNAs that were present in multiple models (Fig. 2; Supplemental Table 4), which can be considered as common CNAs of murine mammary oncogenesis. Therefore, these data support the notion that common spontaneous events may occur within different GEM mammary models irrespective of the initiating genetic event (i.e., transgene). Consistent with previous work, we identified multiple GEM mammary p53null groups based on gene expression patterns [18, 19]. Interestingly, these p53null groups demonstrated not only differences in mRNA expression patterns, but also exhibited differences in the DNA copy number landscapes (Fig. 2c, d). Additionally, we noticed that the p53null-Luminal, p53null-Basal, and C3Tag groups contained more group-specific CNAs than any of the other mouse groups (Supplemental Table 4); this observation is likely due to the loss of TP53 in these three groups. On average, each mouse group exhibited nearly twice the number of group-specific copy number gains versus losses.

We next analyzed the human DNA copy number landscape in the combined UNC/TCGA breast cancer dataset (Supplemental Fig. 2; Supplemental Table 5). Our results, not surprisingly, were consistent with previous publications [6, 12, 13]. For example, our analyses confirmed previously identified breast cancer copy number gains of 8q that is common and present irrespective of breast cancer subtype, as well as a number of subtype-specific CNAs. For instance, we again identified Basal-like-specific DNA copy number losses at 4q, 5q, and gains of 10p; Luminal A-specific copy number gains at 16p; Luminal B-specific copy number gains at 17q; and a Luminal-associated (encompassing both Luminal A and Luminal B) copy number loss at 16q (Supplemental Fig. 2; Supplemental Table 5) [6, 12, 13, 20, 21]. The HER2-enriched subtype contained few subtype-specific CNAs, noting that the HER2/ERBB2 amplicon was not a HER2-enriched subtype-specific copy number gain event as it also occurred in many Luminal tumors. Additionally, the Basal-like subtype contained the highest number of subtype-specific CNAs (Supplemental Table 5). In contrast to what was observed in the mouse groups, human tumors on average demonstrated more frequent subtype-specific regions of copy number loss compared to copy number gains (Supplemental Table 5).

Comparisons of copy number landscapes of mouse and human breast tumors

The extent to which mouse models of breast cancer recapitulate human phenotypes has been examined at the gene expression level [18–20], as well as on the copy number level, albeit only in a much smaller subset of these data [20]. We examined sub-chromosomal events and compared human subtype-specific copy number landscape plots to mouse group-specific landscape plots and identified shared cross-species CNA events [after re-ordering the mouse chromosomal landscape into human chromosome order (see “Methods” section)]. We first selected for “conserved regions”, which were DNA segments/regions that were altered at high frequency (≥15 %) and in the same direction (i.e., amplified or lost) in both human and mouse copy number landscapes. Applying this selection criterion reduced the search space for potential subtype-specific drivers more than 2-fold, leaving a total of 662 conserved regions when all mouse groups and human subtypes were considered (Supplemental Fig. 3; Supplemental Table 7).

In comparison among subtypes, the Claudin-low subtype had the fewest number of conserved regions (and the fewest CNAs overall) (Supplemental Table 7). Conversely, the Basal-like subtype contained the most conserved CNAs; however, this may be due to the fact that the Basal-like subtype also contained the most subtype-specific CNAs (Supplemental Table 7). Consistent with a previous publication [20], shared Basal-like-specific and murine p53null-Basal-specific regions of DNA copy number loss was observed spanning human 4q31-q35.2 and encompassing INPP4B, and also spanning 14q22.1-23.1 (Supplemental Table 7). By comparing shared sub-chromosomal CNAs between the human Basal-like subtype and all mouse groups, we noted that the C3Tag mouse group contained the most human Basal-like-specific copy number amplified regions, while the p53null-Basal mouse group contained the most human Basal-like-specific copy number loss regions (Supplemental Table 7). Both of these mouse models were previously shown to have the Basal-like tumor gene expression phenotypes [18, 20], therefore, for this study, we largely focused on copy number commonalities between human Basal-like tumors and these two mouse groups.

Identification of Basal-like tumor chromosome 1 amplification driver genes

Across all breast tumors, amplification of human chromosome 1q was the most frequent copy number altered event (not depicted). However, as can be seen in Supplemental Figs. 2 and 3, the “shape” of the chromosome 1 amplification varies by subtype, with the subtype-enriched amplification regions being identified within this largest of human chromosome arms. Among the 662 conserved regions identified across the genome, chromosome 1 harbored 18 % of all conserved CNAs (Supplemental Table 7). Focusing on chromosome 1, we determined that chromosome 1q harbored more than twice the number of conserved segments when compared to the 1p arm (Supplemental Table 7). Of particular note, a number of 1q amplified regions that were identified as human Basal-like-specific were also altered in the mouse C3Tag and/or p53null groups (Fig. 3b; Supplemental Table 7); thus our results indicate that this region of human chromosome 1q21-23 is being repeatedly selected for both mouse and human Basal-like breast cancers.

In order to identify the driver(s) present on chromosome 1, we next applied our filtering criteria outlined in Fig. 1. Of the 120 chromosome 1 conserved CNAs, 79 contained at least one gene that showed DNA–RNA concordance (Supplemental Table 8); 25 CNAs contained at least one RNAi-identified essential gene (Supplemental Table 9), and 20 CNAs contained genes showing DNA–RNA concordance and a RNAi-identified essential gene (Supplemental Table 10). Interestingly, all 20 CNAs were copy number gained segments, even among the 1p CNAs (Supplemental Table 10).

To further study the biology of the conserved chromosome 1 genes, we performed a cohort-based DawnRank [16] analysis using genes from human chromosome 1. DawnRank uses gene–gene interaction networks to measure the impact of genomic alterations on the differential gene expression of downstream genes in the network. Then, DawnRank scores (as previously described [16]) the level of perturbation on the gene interaction network caused by the alteration (either amplification or deletion) of the gene of interest. We selected human chromosome 1 gene blocks with shared synteny with the mouse genome for the DawnRank analysis. There were 7 such gene blocks, totaling 1509 genes (Supplemental Table 11). Using the chromosome 1 syntenic regions, we identified 44 chromosome 1 genes that represented the top 5 % DawnRank scores (Supplemental Table 12) using DNA copy number changes as the input “mutation” features along with the gene expression for each human tumor sample. The 44 DawnRank genes mapped to 9 copy number gained segments, which also harbored genes with DNA–RNA concordance, or an RNAi-identified essential gene (Supplemental Table 10). Within the 9 CNAs, encompassing a total of 182 potential genes, only 3 genes met all four filtering criteria of (1) subtype-specific CNA, (2) DNA–RNA concordance, (3) a RNAi “GO” gene, and (4) a DawnRank hit: these genes were phosphatidylinositol 4-kinase (PI4KB), src homology 2 domain containing (SHC1), and nicastrin (NCSTN) (Fig. 3; Supplemental Table 10).

The three chromosome 1 potential driving genes span 1q21-q23 and are altered with an average segment subtype frequency of 47 % (Supplemental Table 10). Interestingly, PI4KB and SHC1 span 1q21, falling less than the average Basal-like subtype segment length apart (Fig. 3), thus suggesting that on chromosome 1q21-23 multiple target genes lie within a single amplicon. Furthermore, SHC1 is in a subtype-specific high frequency altered segment among Basal-like tumors only (Fig. 3; Supplemental Table 5), while NCSTN and PI4KB CNAs appeared across multiple subtypes, passing the significance threshold in the Basal-like and Luminal A subtypes (Supplemental Table 5). However, NCSTN and PI4KB also passed the significance threshold for the p53null-Luminal, p53null-Basal, and C3Tag mouse groups (Supplemental Table 7), the last two of which are models linked to human Basal-like disease as determined in previous gene expression comparative studies [18, 19].

Notch pathway features in 1q21-23 amplified Basal-like breast cancers

Numerous studies have implicated the Notch signaling pathway in Basal-like breast and/or triple-negative breast cancers [22, 23]. Importantly, numerous studies on the functional role of NCSTN have already been performed [24–26]. To evaluate the effect of 1q21-23/NCSTN amplification, we first examined the DawnRank network space around NCSTN and noted that when NCSTN was amplified NOTCH1–3 were also more highly expressed (Fig. 4). In addition, NCSTN is one of the three components of the gamma-secretase complex (GSC), a protein complex that cleaves and activates Notch receptors. Two other GSC members, namely APH1A and PSEN2, were also both altered within the network (Fig. 4), and were also higher in NCSTN amplified samples versus not amplified (Supplemental Fig. 4a). Also, APH1A and PSEN2 are physically located on human chromosome 1q21.2 and 1q42, and are often co-amplified along with NCSTN (although PSEN2 is not within a Basal-like-specific CNA). Thus, three components of the GSC are often co-amplified together, and are more highly expressed, and the NCSTN/Notch network is perturbed in these NCSTN amplified tumors. Following up on these network findings, NCSTN amplification was also correlated with higher NOTCH1 and NOTCH3 mRNA levels (Supplemental Fig. 4b), with this feature showing an even greater difference when examined just among Basal-like breast cancers (Supplemental Fig. 4c). As expected from the previous work, Basal-like tumors as a whole exhibited significantly lower LFNG expression (i.e., a negative regulator of Notch signaling) along with significantly higher expression of NOTCH1, NCSTN, APH1A, MYC, and HEY2 mRNAs (Supplemental Fig. 5), the latter two of which are thought to be targets of activated Notch pathway.

Conclusion

In breast cancer, there are many copy number gains and losses, a few of which like amplification of ERBB2, are of known clinical and biological significance. Over the years, many of these CNAs have been studied and candidate genes identified [12, 13, 27–30], but there are still many regions for which the genetic drivers remain unknown. The simultaneous analysis of DNA copy number change in both human and mouse tumors, and their corresponding gene expression patterns, provides for a biologically meaningful way to identify important regions of CNAs. The basic hypothesis being that a CNA found to spontaneously occur in two different mammalian species breast cancers is being repeatedly selected and must therefore contain an important tumor causing gene(s).

Although many studies have identified frequent CNAs within groups of human breast tumors [13, 21], most do not functionally narrow down the candidate genes within a specific segment. In addition to the mere presence of a highly frequent CNAs being identified across species, we took a biologically based approach to refine the list of genes within a given segment into a subset of candidate driver genes. These analyses prompted the development of a new bioinformatics tool (SWITCHplus) to identify and highlight subtype-specific DNA copy number events using a visual display in a user-friendly format. Using this tool and a systematic data-mining schema that includes identifying regions that show: (1) shared DNA CNAs cross-species, (2) concordance between mRNA expression and relative DNA copy number value, (3) functional effects in a genome-wide RNAi screen, and (4) functional effects in a network analysis (i.e., DawnRank), we identified a limited number of CNAs that harbored potential breast cancer driver genes. From these analyses, we identified human chromosome 1q21-23 as a region of amplification consistently present in human and mouse Basal-like tumors, and which contains at least three potential driver genes (Fig. 3).

The first of these three genes, PI4KB encodes for a lipid kinase member of the phosphoinositide signaling pathway. The phosphoinositide signaling system regulates cell migration [31–33]and proliferation [31–33], and activation of this signaling pathway is observed in many aggressive tumors [33–35]. Specifically, phosphatidylinositol 4-phosphate is utilized by phosphoinositide kinases, such as PI3KCA, to signal to downstream protein kinase targets including AKT and PDK1 [33, 35, 36]. In the 2012 TCGA publication on breast cancer, it was noted that Basal-like cancers showed high activity of the PIK3CA/AKT pathway, and that these tumors tended to show few PIK3CA mutations, but frequent loss of PTEN and/or INPP4B (negative regulators of the pathway) and amplification of PIK3CA and AKT3 (positive regulators of the pathway) [12]. Here we show that yet another positive regulator of the pathway is amplified in Basal-like cancers.

SHC1 encodes for a member of the Shc family of adapter proteins. SHC1 is composed of multiple protein domains that can bind to multiple transmembrane receptors including phosphorylated insulin-like growth factor 1 receptor, and the platelet-derived growth factor receptor (PDGFR), thus potentially activating multiple pathways involved in cell proliferation and differentiation [37, 38]. Specifically, SHC1 is a key signaling mediator, and can act as a scaffold between an activated receptor and downstream signaling proteins [39]. In addition, growth factor signaling through PDGFR is known to occur in many TNBC [40], and thus SHC1 amplification may be contributing to these key signaling processes.

NCSTN encodes for a component of the GSC, which is a multi-protein complex that cleaves a number of transmembrane proteins to typically activate their functions [41, 42]; the GSC targets include Notch 1–4, ErBB4, CD44, and E-cadherin [24, 41, 42]. Importantly, Hu et al. 2002 demonstrated, in Drosophila, that NCSTN provides structural support and is required for GSC cleavage of Notch receptor [43]. In our data, when Basal-like tumors were examined, those with copy number gains at NCSTN showed (1) perturbation/activation of the Notch pathway via the DawnRank network analysis (Fig. 4), (2) significantly higher expression of NOTCH1 and NOTCH3 (Supplemental Fig. 4c), and (3) high expression of other markers of the Notch pathway (Supplemental Fig. 4d). Further support for Notch pathway importance comes from previous mouse model experiments where genetic inactivation of a negative regulator of Notch signaling (i.e., lunatic fringe) resulted in Basal-like mammary tumors [22]. Interestingly, Notch activity is also higher in Basal-like breast cancer cell lines compared with Luminal breast cancer cell lines [44]. In vitro, by RNAi-mediated silencing of NCSTN in the TNBC cell line MDA-MB-231, Filipović et al. 2011 showed reduced transcription of Notch pathway targets, and a reduction in cell motility and invasion [41]. In total, these results strongly suggest that activation of Notch pathway signaling is occurring within Basal-like/TNBC tumors, and we now provide additional evidence for a mechanistic explanation for this in vivo.

Other investigators using different computational approaches have also identified this region, but identified other genes (i.e., NIT1 and PVRL4) as potential drivers [45]. The observed differences in potential driver genes are mostly likely due to the “filtering criteria”, where we focused on species conservation, and they focused on somatic mutation targets. It is clear that a multitude of targets and drivers are present, and that 1q21-23 is a region that is the target of selection as opposed to any single gene being the target of selection. In conclusion, our work here provides an objective analysis path for identifying potential driver genes using a cross-species computational approach, which can be applied to any tumor type for which sufficient mouse and human tumor data exist.

References

Kravchenko J, Akushevich I, Seewaldt VL et al (2011) Breast cancer as heterogeneous disease: contributing factors and carcinogenesis mechanisms. Breast Cancer Res Treat 128:483–493. doi:10.1007/s10549-011-1347-z
Article CAS PubMed Google Scholar
Prat A, Perou CM (2011) Deconstructing the molecular portraits of breast cancer. Mol Oncol 5:5–23. doi:10.1016/j.molonc.2010.11.003
Article CAS PubMed Google Scholar
Sorlie T, Tibshirani R, Parker J et al (2003) Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci USA 100:8418–8423. doi:10.1073/pnas.0932692100
Article CAS PubMed Central PubMed Google Scholar
Sotiriou C, Neo S-Y, McShane LM et al (2003) Breast cancer classification and prognosis based on gene expression profiles from a population-based study. Proc Natl Acad Sci USA 100:10393–10398. doi:10.1073/pnas.1732912100
Article CAS PubMed Central PubMed Google Scholar
Nordgard SH, Johansen FE, Alnaes GIG et al (2007) Genes harbouring susceptibility SNPs are differentially expressed in the breast cancer subtypes. Breast Cancer Res 9:113. doi:10.1186/bcr1784
Article PubMed Central PubMed Google Scholar
Bergamaschi A, Kim Y (2006) Distinct patterns of DNA copy number alteration are associated with different clinicopathological features and gene-expression subtypes of breast cancer. Genes Chromosom Cancer 1040:1033–1040. doi:10.1002/gcc
Article Google Scholar
Sorlie T, Perou CM, Tibshirani R et al (2001) Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA 98:10869–10874. doi:10.1073/pnas.191367098
Article CAS PubMed Central PubMed Google Scholar
Parker JS, Mullins M, Cheang MCU et al (2009) Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol 27:1160–1167. doi:10.1200/JCO.2008.18.1370
Article PubMed Central PubMed Google Scholar
Prat A, Parker JS, Karginova O et al (2010) Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res 12:R68. doi:10.1186/bcr2635
Article PubMed Central PubMed Google Scholar
Hu Z, Fan C, Oh DS et al (2006) The molecular portraits of breast tumors are conserved across microarray platforms. BMC Genomics 7:96. doi:10.1186/1471-2164-7-96
Article PubMed Central PubMed Google Scholar
Cheang MCU, Chia SK, Voduc D et al (2009) Ki67 index, HER2 status, and prognosis of patients with luminal B breast cancer. J Natl Cancer Inst 101:736–750. doi:10.1093/jnci/djp082
Article CAS PubMed Central PubMed Google Scholar
Cancer T, Atlas G (2012) Comprehensive molecular portraits of human breast tumours. Nature 490:61–70. doi:10.1038/nature11412
Article Google Scholar
Weigman VJ, Chao H-H, Shabalin AA et al (2012) Basal-like Breast cancer DNA copy number losses identify genes involved in genomic instability, response to therapy, and patient survival. Breast Cancer Res Treat 113:865–880. doi:10.1007/s10549-011-1846-y
Article Google Scholar
Nishida N, Nishimura T, Ito T, Komeda T (2003) Chromosomal instability and human hepatocarcinogenesis. Histol Histopathol 18:897–909
CAS PubMed Google Scholar
Solimini NL, Xu Q, Mermel CH et al (2012) Recurrent hemizygous deletions in cancers may optimize proliferative potential. Science 337:104–109. doi:10.1126/science.1219580
Article CAS PubMed Central PubMed Google Scholar
Hou JP, Ma J (2014) DawnRank: discovering personalized driver genes in cancer. Genome Med 6:56. doi:10.1186/s13073-014-0056-8
Article PubMed Central PubMed Google Scholar
Liu Y, Hayes DN, Nobel A, Marron JS (2008) Statistical significance of clustering for high-dimension, low-sample size data. J Am Stat Assoc 103:1281–1293. doi:10.1198/016214508000000454
Article CAS Google Scholar
Herschkowitz JI, Simin K, Weigman VJ et al (2007) Identification of conserved gene expression features between murine mammary carcinoma models and human breast tumors. Genome Biol 8:R76. doi:10.1186/gb-2007-8-5-r76
Article PubMed Central PubMed Google Scholar
Pfefferle AD, Herschkowitz JI, Usary J et al (2013) Transcriptomic classification of genetically engineered mouse models of breast cancer identifies human subtype counterparts. Genome Biol 14:R125. doi:10.1186/gb-2013-14-11-r125
Article PubMed Central PubMed Google Scholar
Herschkowitz JI, Zhao W, Zhang M et al (2012) Comparative oncogenomics identifies breast tumors enriched in functional tumor-initiating cells. Proc Natl Acad Sci USA 109:2778–2783. doi:10.1073/pnas.1018862108
Article CAS PubMed Central PubMed Google Scholar
Curtis C, Shah SP, Chin S-F et al (2012) The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486:346–352. doi:10.1038/nature10983
CAS PubMed Central PubMed Google Scholar
Xu K, Usary J, Kousis PC et al (2012) Lunatic fringe deficiency cooperates with the Met/Caveolin gene amplicon to induce basal-like breast cancer. Cancer Cell 21:626–641. doi:10.1016/j.ccr.2012.03.041
Article CAS PubMed Central PubMed Google Scholar
Stoeck A, Lejnine S, Truong A et al (2014) Discovery of biomarkers predictive of GSI response in triple-negative breast cancer and adenoid cystic carcinoma. Cancer Discov 4:1154–1167. doi:10.1158/2159-8290.CD-13-0830
Article CAS PubMed Central PubMed Google Scholar
Lombardo Y, Filipović A, Molyneux G et al (2012) Nicastrin regulates breast cancer stem cell properties and tumor growth in vitro and in vivo. Proc Natl Acad Sci USA 109:16558–16563. doi:10.1073/pnas.1206268109
Murphy MP, Das P, Nyborg AC et al (2003) Overexpression of nicastrin increases Aβ production. FASEB J 17:1138–1140. doi:10.1096/fj.02-1050fje
CAS PubMed Google Scholar
Sarajlić A, Filipović A, Janjić V et al (2014) The role of genes co-amplified with nicastrin in breast invasive carcinoma. Breast Cancer Res Treat 143:393–401. doi:10.1007/s10549-013-2805-6
Shadeo A, Lam WL (2006) Comprehensive copy number profiles of breast cancer cell model genomes. Breast Cancer Res 8:R9. doi:10.1186/bcr1370
Article PubMed Central PubMed Google Scholar
Taylor BS, Barretina J, Socci ND et al (2008) Functional copy-number alterations in cancer. PLoS ONE 3:e3179. doi:10.1371/journal.pone.0003179
Article PubMed Central PubMed Google Scholar
Kristensen VN, Lingjærde OC, Russnes HG et al (2014) Principles and methods of integrative genomic analyses in cancer. Nat Rev Cancer 14:299–313. doi:10.1038/nrc3721
Article CAS PubMed Google Scholar
Beroukhim R, Mermel CH, Porter D et al (2010) The landscape of somatic copy-number alteration across human cancers. Nature 463:899–905. doi:10.1038/nature08822
Article CAS PubMed Central PubMed Google Scholar
Katso R, Okkenhaug K (2001) Cellular function of phosphoinositide 3-kinases: implications for development, immunity, homeostasis, and cancer. Annu Rev Cell Dev Biol 17:615–675
Article CAS PubMed Google Scholar
Cantley L (2002) The phosphoinositide 3-kinase pathway. Science 296:1655–1658
Article CAS PubMed Google Scholar
Bunney TD, Katan M (2010) Phosphoinositide signalling in cancer: beyond PI3K and PTEN. Nat Rev Cancer 10:342–352. doi:10.1038/nrc2842
Article CAS PubMed Google Scholar
Altomare DA, Testa JR (2005) Perturbations of the AKT signaling pathway in human cancer. Oncogene 24:7455–7464. doi:10.1038/sj.onc.1209085
Article CAS PubMed Google Scholar
Chu KME, Minogue S, Hsuan JJ, Waugh MG (2010) Differential effects of the phosphatidylinositol 4-kinases, PI4KIIα and PI4KIIIβ, on Akt activation and apoptosis. Cell Death Dis 1:e106. doi:10.1038/cddis.2010.84
Article CAS PubMed Central PubMed Google Scholar
Balla A, Balla T (2006) Phosphatidylinositol 4-kinases: old enzymes with emerging functions. Trends Cell Biol 16:351–361. doi:10.1016/j.tcb.2006.05.003
Article CAS PubMed Google Scholar
Wagner K, Hemminki K, Grzybowska E et al (2004) The insulin-like growth factor-1 pathway mediator genes: SHC1 Met300Val shows a protective effect in breast cancer. Carcinogenesis 25:2473–2478. doi:10.1093/carcin/bgh263
Article CAS PubMed Google Scholar
Yu H, Rohan T (2000) Role of the insulin-like growth factor family in cancer development and progression. J Natl Cancer Inst 92:1472–1489
Article CAS PubMed Google Scholar
Zheng Y, Zhang C, Croucher DR et al (2013) Temporal regulation of EGF signalling networks by the scaffold protein Shc1. Nature 499:166–171. doi:10.1038/nature12308
Article CAS PubMed Google Scholar
Duncan JS, Whittle MC, Nakamura K et al (2012) Dynamic reprogramming of the kinome in response to targeted MEK inhibition in triple-negative breast cancer. Cell 149:307–321. doi:10.1016/j.cell.2012.02.053
Article CAS PubMed Central PubMed Google Scholar
Filipović A, Gronau JH, Green AR et al (2011) Biological and clinical implications of nicastrin expression in invasive breast cancer. Breast Cancer Res Treat 125:43–53. doi:10.1007/s10549-010-0823-1
Article PubMed Google Scholar
Kopan R, Ilagan M (2004) γ-Secretase: proteasome of the membrane? Nat Rev Mol Cell Biol 5:499–504
Article CAS PubMed Google Scholar
Hu Y, Ye Y, Fortini ME (2002) Nicastrin is required for γ-secretase cleavage of the Drosophila Notch receptor. Dev Cell 2:69–78
Article CAS PubMed Google Scholar
D’Angelo RC, Ouzounova M, Davis A et al (2015) Notch reporter activity in breast cancer cell lines identifies a subset of cells with stem cell activity. Mol Cancer Ther 14:779–787. doi:10.1158/1535-7163.MCT-14-0228
Article PubMed Google Scholar
Sanchez-Garcia F, Villagrasa P, Matsui J et al (2014) Integration of genomic data enables selective discovery of breast cancer drivers. Cell 159:1461–1475. doi:10.1016/j.cell.2014.10.048
Article CAS PubMed Google Scholar

Download references

Acknowledgments

This study was supported by Funds from the Initiative for Maximizing Student Diversity Grant 5R25GM055336, the UNC Bioinformatics and Computational Biology Diversity Fellowship, the NCI Breast SPORE Program Grant P50-CA58223-09A1, RO1-CA148761, and the Breast Cancer Research Foundation. We would like to thank the Translational Breast Cancer Research Consortium for providing some of the tumor specimens used for gene and copy number profiling, which came from TBCRC001.

Conflict of interests

C.M.P. is an equity stock holder, and Board of Director Member of BioClassifier LLC., C.M.P., and J.S.P. are also listed as inventors on a patent application on the PAM50 assay.

Author information

Authors and Affiliations

Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
Grace O. Silva, Joel S. Parker, Michael L. Gatza & Charles M. Perou
Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC, 27599, USA
Grace O. Silva & Charles M. Perou
Lineberger Comprehensive Cancer Center, University of North Carolina, 450 West Drive, CB7295, Chapel Hill, NC, 27599, USA
Grace O. Silva, Xiaping He, Joel S. Parker, Michael L. Gatza, Lisa A. Carey & Charles M. Perou
Division of Hematology/Oncology, Department of Medicine, University of North Carolina, Chapel Hill, NC, 27599, USA
Lisa A. Carey
Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
Jack P. Hou & Jian Ma
Medical Scholars Program, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
Jack P. Hou
Division of Cancer Medicine, Department of Breast Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, 301438, USA
Stacy L. Moulder
Division of Oncology, Department of Medicine, Duke University, Durham, NC, 27710, USA
Paul K. Marcom
Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
Jian Ma
Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA
Jeffrey M. Rosen

Authors

Grace O. Silva
View author publications
You can also search for this author in PubMed Google Scholar
Xiaping He
View author publications
You can also search for this author in PubMed Google Scholar
Joel S. Parker
View author publications
You can also search for this author in PubMed Google Scholar
Michael L. Gatza
View author publications
You can also search for this author in PubMed Google Scholar
Lisa A. Carey
View author publications
You can also search for this author in PubMed Google Scholar
Jack P. Hou
View author publications
You can also search for this author in PubMed Google Scholar
Stacy L. Moulder
View author publications
You can also search for this author in PubMed Google Scholar
Paul K. Marcom
View author publications
You can also search for this author in PubMed Google Scholar
Jian Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey M. Rosen
View author publications
You can also search for this author in PubMed Google Scholar
Charles M. Perou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Charles M. Perou.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Fig. S1 Supervised cluster of mouse gene expression data using an 866 intrinsic gene list. The cluster analysis identified seven murine tumor groups, which were further used to supervise subsequent DNA copy number analyses. Each group is labeled according to the majority representing GEM mammary model within that group

Fig. S2 Copy number frequency plots from SWITCHplus showing human subtype-specific CNAs. Segments of subtype-specific copy number gains are plotted above the x-axis in red and segments of copy number loss are plotted below the x-axis in green. Regions shaded gray indicate segments that are not subtype-specific or high frequent (greater than or equal to 15 %). The frequency of alterations in each subtype is indicated on the y-axis from 0 to 100 %. a Basal-like, b Her2-enriched, c Claudin-low, d Luminal A, and e Luminal B copy number landscapes

Fig. S3 Copy number frequency plots from SWITCHplus showing conserved CNAs. Segments of copy number gains are plotted above the x-axis and segments of copy number loss are plotted below the x-axis. Regions shaded gray indicate segments that are any combination of either not subtype-specific, not mouse group-specific or not high frequent (greater than or equal to 15 %). The conserved segments are colored according to the mouse group(s) in which they appear. The frequency of alterations in each subtype is indicated on the y-axis from 0 to 100 %. a Basal-like, b Her2-enriched, c Claudin-low, d Luminal A, and e Luminal B copy number landscapes

Fig. S4 Box-and-whisker plots for expression of Notch signaling pathway targets. a APH1A and PSEN2 mRNA expressions across all tumors with NCSTN amplification versus all other tumors without NCSTN amplification. b NOTCH1 and NOTCH3 mRNA expressions across all tumors with NCSTN amplification versus all tumors without NCSTN amplification. c NOTCH1 and NOTCH3 mRNA expressions across all Basal-like tumors with NCSTN amplification versus Basal-like tumors without NCSTN amplification. d APH1A and PSEN2 mRNA expressions across all Basal-like tumors with NCSTN amplification versus Basal-like tumors without NCSTN amplification

Fig. S5 Box-and-whisker plots of the mRNA expression of LFNG, NOTCH1, NCSTN, APH1A, MYC, and HEY2 across breast cancer subtypes

Additional data files

The following additional files and Supplemental Figs. 1, 2, 3, 4, and 5 are available with the online version of this paper. Supplemental data file 1 contains the extended “Methods” section. Supplemental data file 2 contains all supplemental data tables. A table of contents is listed on the first worksheet that describes the information presented in Tables S1, S2, S3, S4, S5, S6, S7, S8, S9, S10, S11, and S12.

Supplementary material 1 (PDF 734 kb)

Supplementary material 2 (PDF 1280 kb)

Supplementary material 3 (PDF 1134 kb)

Supplementary material 4 (PDF 318 kb)

Supplementary material 5 (PDF 380 kb)

Supplementary material 6 (DOCX 56 kb)

Supplementary material 7 (XLS 7347 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Silva, G.O., He, X., Parker, J.S. et al. Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer. Breast Cancer Res Treat 152, 347–356 (2015). https://doi.org/10.1007/s10549-015-3476-2

Download citation

Received: 12 June 2015
Accepted: 15 June 2015
Published: 25 June 2015
Issue Date: July 2015
DOI: https://doi.org/10.1007/s10549-015-3476-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer

Abstract

Similar content being viewed by others

The landscape of chromosomal aberrations in breast cancer mouse models reveals driver-specific routes to tumorigenesis

Mouse Models of Breast Cancer Share Amplification and Deletion Events with Human Breast Cancer

An integrated genomics approach identifies drivers of proliferation in luminal-subtype human breast cancer

Introduction