Rare germline copy number variants (CNVs) and breast cancer risk

Dennis, Joe; Tyrer, Jonathan P.; Walker, Logan C.; Michailidou, Kyriaki; Dorling, Leila; Bolla, Manjeet K.; Wang, Qin; Ahearn, Thomas U.; Andrulis, Irene L.; Anton-Culver, Hoda; Antonenkova, Natalia N.; Arndt, Volker; Aronson, Kristan J.; Freeman, Laura E. Beane; Beckmann, Matthias W.; Behrens, Sabine; Benitez, Javier; Bermisheva, Marina; Bogdanova, Natalia V.; Bojesen, Stig E.; Brenner, Hermann; Castelao, Jose E.; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Clarke, Christine L.; Collée, J. Margriet; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Dossus, Laure; Eliassen, A. Heather; Eriksson, Mikael; Evans, D. Gareth; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gabrielson, Marike; Gago-Dominguez, Manuela; García-Closas, Montserrat; Giles, Graham G.; González-Neira, Anna; Guénel, Pascal; Hahnen, Eric; Haiman, Christopher A.; Hall, Per; Hollestelle, Antoinette; Hoppe, Reiner; Hopper, John L.; Howell, Anthony; Jager, Agnes; Jakubowska, Anna; John, Esther M.; Johnson, Nichola; Jones, Michael E.; Jung, Audrey; Kaaks, Rudolf; Keeman, Renske; Khusnutdinova, Elza; Kitahara, Cari M.; Ko, Yon-Dschun; Kosma, Veli-Matti; Koutros, Stella; Kraft, Peter; Kristensen, Vessela N.; Kubelka-Sabit, Katerina; Kurian, Allison W.; Lacey, James V.; Lambrechts, Diether; Larson, Nicole L.; Linet, Martha; Ogrodniczak, Alicja; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Mavroudis, Dimitrios; Milne, Roger L.; Muranen, Taru A.; Murphy, Rachel A.; Nevanlinna, Heli; Olson, Janet E.; Olsson, Håkan; Park-Simon, Tjoung-Won; Perou, Charles M.; Peterlongo, Paolo; Plaseska-Karanfilska, Dijana; Pylkäs, Katri; Rennert, Gad; Saloustros, Emmanouil; Sandler, Dale P.; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Shibli, Rana; Smeets, Ann; Soucy, Penny; Southey, Melissa C.; Swerdlow, Anthony J.; Tamimi, Rulla M.; Taylor, Jack A.; Teras, Lauren R.; Terry, Mary Beth; Tomlinson, Ian; Troester, Melissa A.; Truong, Thérèse; Vachon, Celine M.; Wendt, Camilla; Winqvist, Robert; Wolk, Alicja; Yang, Xiaohong R.; Zheng, Wei; Ziogas, Argyrios; Simard, Jacques; Dunning, Alison M.; Pharoah, Paul D. P.; Easton, Douglas F.

doi:10.1038/s42003-021-02990-6

Rare germline copy number variants (CNVs) and breast cancer risk

Article
Open access
Published: 18 January 2022

Volume 5, article number 65, (2022)
Cite this article

Download PDF

You have full access to this open access article

Communications Biology

Rare germline copy number variants (CNVs) and breast cancer risk

Download PDF

Joe Dennis ORCID: orcid.org/0000-0003-4591-1214¹,
Jonathan P. Tyrer ORCID: orcid.org/0000-0003-3724-4757²,
Logan C. Walker ORCID: orcid.org/0000-0003-0018-3719³,
Kyriaki Michailidou ORCID: orcid.org/0000-0001-7065-1237^1,4,5,
Leila Dorling¹,
Manjeet K. Bolla¹,
Qin Wang ORCID: orcid.org/0000-0002-9139-0627¹,
Thomas U. Ahearn⁶,
Irene L. Andrulis ORCID: orcid.org/0000-0002-4226-6435^7,8,
Hoda Anton-Culver⁹,
Natalia N. Antonenkova¹⁰,
Volker Arndt ORCID: orcid.org/0000-0001-9320-8684¹¹,
Kristan J. Aronson ORCID: orcid.org/0000-0002-7865-7243¹²,
Laura E. Beane Freeman⁶,
Matthias W. Beckmann¹³,
Sabine Behrens ORCID: orcid.org/0000-0002-9714-104X¹⁴,
Javier Benitez^15,16,
Marina Bermisheva¹⁷,
Natalia V. Bogdanova^10,18,19,
Stig E. Bojesen^20,21,22,
Hermann Brenner^11,23,24,
Jose E. Castelao²⁵,
Jenny Chang-Claude^14,26,
Georgia Chenevix-Trench²⁷,
Christine L. Clarke²⁸,
NBCS Collaborators,
J. Margriet Collée²⁹,
CTS Consortium,
Fergus J. Couch³⁰,
Angela Cox³¹,
Simon S. Cross³²,
Kamila Czene³³,
Peter Devilee^34,35,
Thilo Dörk ORCID: orcid.org/0000-0002-9458-0282¹⁹,
Laure Dossus ORCID: orcid.org/0000-0003-2716-5748³⁶,
A. Heather Eliassen^37,38,
Mikael Eriksson³³,
D. Gareth Evans^39,40,
Peter A. Fasching^13,41,
Jonine Figueroa ORCID: orcid.org/0000-0002-5100-623X^6,42,43,
Olivia Fletcher⁴⁴,
Henrik Flyger⁴⁵,
Lin Fritschi ORCID: orcid.org/0000-0002-7692-3560⁴⁶,
Marike Gabrielson³³,
Manuela Gago-Dominguez^47,48,
Montserrat García-Closas⁶,
Graham G. Giles ORCID: orcid.org/0000-0003-4946-9099^49,50,51,
Anna González-Neira¹⁶,
Pascal Guénel⁵²,
Eric Hahnen^53,54,
Christopher A. Haiman⁵⁵,
Per Hall^33,56,
Antoinette Hollestelle ORCID: orcid.org/0000-0003-1166-1966⁵⁷,
Reiner Hoppe^58,59,
John L. Hopper⁵⁰,
Anthony Howell⁶⁰,
ABCTB Investigators,
kConFab/AOCS Investigators,
Agnes Jager⁵⁷,
Anna Jakubowska ORCID: orcid.org/0000-0002-5650-0501^61,62,
Esther M. John^63,64,
Nichola Johnson ORCID: orcid.org/0000-0002-8230-5662⁴⁴,
Michael E. Jones ORCID: orcid.org/0000-0001-7479-3451⁶⁵,
Audrey Jung¹⁴,
Rudolf Kaaks¹⁴,
Renske Keeman ORCID: orcid.org/0000-0002-5452-9933⁶⁶,
Elza Khusnutdinova^17,67,
Cari M. Kitahara⁶⁸,
Yon-Dschun Ko⁶⁹,
Veli-Matti Kosma^70,71,72,
Stella Koutros⁶,
Peter Kraft^38,73,
Vessela N. Kristensen ORCID: orcid.org/0000-0001-5012-7438^74,75,
Katerina Kubelka-Sabit⁷⁶,
Allison W. Kurian^63,64,
James V. Lacey^77,78,
Diether Lambrechts^79,80,
Nicole L. Larson ORCID: orcid.org/0000-0002-2129-6934⁸¹,
Martha Linet⁶⁸,
Alicja Ogrodniczak⁶¹,
Arto Mannermaa^70,71,72,
Siranoush Manoukian⁸²,
Sara Margolin^56,83,
Dimitrios Mavroudis⁸⁴,
Roger L. Milne ORCID: orcid.org/0000-0001-5764-7268^49,50,51,
Taru A. Muranen⁸⁵,
Rachel A. Murphy^86,87,
Heli Nevanlinna ORCID: orcid.org/0000-0002-0916-2976⁸⁵,
Janet E. Olson ORCID: orcid.org/0000-0003-4944-7789⁸¹,
Håkan Olsson⁸⁸,
Tjoung-Won Park-Simon¹⁹,
Charles M. Perou⁸⁹,
Paolo Peterlongo ORCID: orcid.org/0000-0001-6951-6855⁹⁰,
Dijana Plaseska-Karanfilska ORCID: orcid.org/0000-0001-8877-2416⁹¹,
Katri Pylkäs ORCID: orcid.org/0000-0002-2449-0521^92,93,
Gad Rennert ORCID: orcid.org/0000-0002-8512-068X⁹⁴,
Emmanouil Saloustros⁹⁵,
Dale P. Sandler⁹⁶,
Elinor J. Sawyer⁹⁷,
Marjanka K. Schmidt^66,98,
Rita K. Schmutzler^53,54,99,
Rana Shibli⁹⁴,
Ann Smeets¹⁰⁰,
Penny Soucy¹⁰¹,
Melissa C. Southey^49,51,102,
Anthony J. Swerdlow^65,103,
Rulla M. Tamimi^38,104,
Jack A. Taylor^96,105,
Lauren R. Teras¹⁰⁶,
Mary Beth Terry¹⁰⁷,
Ian Tomlinson ORCID: orcid.org/0000-0003-3037-1470⁴³,
Melissa A. Troester¹⁰⁸,
Thérèse Truong ORCID: orcid.org/0000-0002-2943-6786⁵²,
Celine M. Vachon ORCID: orcid.org/0000-0002-1962-9322¹⁰⁹,
Camilla Wendt⁸³,
Robert Winqvist^92,93,
Alicja Wolk ORCID: orcid.org/0000-0001-7387-6845^110,111,
Xiaohong R. Yang⁶,
Wei Zheng¹¹²,
Argyrios Ziogas ORCID: orcid.org/0000-0003-4529-3727⁹,
Jacques Simard ORCID: orcid.org/0000-0001-6906-3390¹⁰¹,
Alison M. Dunning ORCID: orcid.org/0000-0001-6651-7166²,
Paul D. P. Pharoah ORCID: orcid.org/0000-0001-8494-732X^1,2 &
…
Douglas F. Easton ORCID: orcid.org/0000-0003-2444-3247^1,2

7888 Accesses
8 Citations
5 Altmetric
Explore all metrics

Abstract

Germline copy number variants (CNVs) are pervasive in the human genome but potential disease associations with rare CNVs have not been comprehensively assessed in large datasets. We analysed rare CNVs in genes and non-coding regions for 86,788 breast cancer cases and 76,122 controls of European ancestry with genome-wide array data. Gene burden tests detected the strongest association for deletions in BRCA1 (P = 3.7E−18). Nine other genes were associated with a p-value < 0.01 including known susceptibility genes CHEK2 (P = 0.0008), ATM (P = 0.002) and BRCA2 (P = 0.008). Outside the known genes we detected associations with p-values < 0.001 for either overall or subtype-specific breast cancer at nine deletion regions and four duplication regions. Three of the deletion regions were in established common susceptibility loci. To the best of our knowledge, this is the first genome-wide analysis of rare CNVs in a large breast cancer case-control dataset. We detected associations with exonic deletions in established breast cancer susceptibility genes. We also detected suggestive associations with non-coding CNVs in known and novel loci with large effects sizes. Larger sample sizes will be required to reach robust levels of statistical significance.

CTNND1 is involved in germline predisposition to early-onset gastric cancer by affecting cell-to-cell interactions

Article Open access 25 May 2024

Hidden secrets of the cancer genome: unlocking the impact of non-coding mutations in gene regulatory elements

Article Open access 20 June 2024

Genetic/epigenetic effects in NF1 microdeletion syndrome: beyond the haploinsufficiency, looking at the contribution of not deleted genes

Article Open access 14 June 2024

Introduction

Copy number variants (CNVs) are pervasive in the human genome but are more challenging to detect with current technologies than single nucleotide variants (SNVs). Recent comprehensive sequencing projects^1,2 have characterised CNVs in large sample sets. The gnomAD project identified a median of 3,505 deletions and 723 duplications covering more than 50 base pairs per genome. Most deletions and duplications tend to be rare with longer variants tending to be rarer, suggesting negative selection against these variants. At the population level the 1000 Genomes project has mapped a large proportion of inherited CNVs³ and observed that 65% had a frequency below 2%.

While somatic copy number alterations play a major role in the pathogenesis of breast tumours^4,5, some germline CNVs are known to be associated with inherited risk of breast cancer. Rare loss of function variants in susceptibility genes such as BRCA1 and CHEK2 are associated with a large increase in risk⁶. While the majority of these variants are single nucleotide mutations and short indels, they also include longer deletions and duplications. It has been reported that up to a third of loss of function BRCA1 variants in some populations may be CNVs⁷.

Large-scale genome-wide association studies (GWAS) have established breast cancer associations with common variants at more than 150 loci, mostly in non-coding regions^8,9,10,11. At two of the loci, deletions imputed from the 1000 Genomes reference panel have been identified as likely causal variants. A deletion of the APOBEC3B gene-coding region increases breast cancer risk¹² and analysis of the tumours of the germline deletion carriers showed an increase in APOBEC-mediated somatic mutations¹³. A deletion in a regulatory region was identified as a likely causal variant at the 2q35 locus^14,15.

Detecting CNVs from the intensity measurements of genotyping array probes is prone to producing unreliable calls due to the high level of noise. We recently developed a new CNV calling method, CamCNV¹⁶, which focuses on rare CNVs and identifies outlier samples that may have a CNV, based on the intensity distribution across all samples at each probe. We showed that this approach is able to detect CNVs using as few as three probes¹⁶. Here, we apply this approach to a very large array genotype dataset to search for breast cancer-associated CNVs. The analyses are outlined in Fig. 1.

**Fig. 1: Flow diagram of the calling and analysis pipeline.**

Results

Summary of CNVs detected

After quality control we detected a mean of 2.9 deletions (standard deviation 1.6) and 2.5 duplications (SD 2.0) per sample. Supplementary Data 5 shows the mean length, probe coverage, and segment z-scores of called CNVs. Duplications tended to be longer than deletions: for example, deletions called on OncoArray covered a mean of 45 Kilobases (Kb) (SD 106 Kb) over 9.8 probes (SD 17.2), while duplications covered a mean of 109 Kb (SD 202 Kb) over 18.9 probes (SD 36.5). CNV calls observed in multiple samples were concentrated in a small proportion of probes (Supplementary Data 6), with <11% of probes having frequency >0.01% and <2% of probes having frequency >0.5%.

We identified called CNVs which overlapped for at least 90% of their length with rare deletions and duplications (frequency < 1%) identified by the 1000 Genomes Project (Supplementary Data 5 and Supplementary Fig. 1.). Forty-nine percent of OncoArray deletions and 47% of iCOGs deletions matched a 1000 Genomes Project variant while 29% of OncoArray duplications and 20% of iCOGs duplications matched. In total, we identified CNVs closely matching 3273 of the deletion variants published by the 1000 Genomes Project (~9% of total) and 1255 of their duplication variants (~24% of the total).

CNVs associated with overall risk

Association results were derived for 1301 regions containing deletions and 992 regions containing duplications. QQ plots are shown in Fig. 2a for deletions and 2b for duplications. There was no evidence for inflation in the test statistics for duplications (lambda = 0.98; lambda₁₀₀₀ = 1.00) and minimal evidence for deletions (lambda = 1.11; lambda₁₀₀₀ = 1.00).

**Fig. 2: QQ plots of association results.**

Seven deletion and two duplication regions were associated with breast cancer risk at p < 0.001 (Table 1); of these, deletions within the BRCA1 region achieved p < 10⁻⁶. The results for all regions are shown in Supplementary Data 7 and 8 and include statistics on the number of probes covered by the calls. The results for individual probes covered by the regions analysed are in Supplementary Data 9−12.

Table 1 CNVs associated with overall risk.

Full size table

The BRCA1 locus contains multiple deletions across the gene. The CHEK2 region (OR 1.94, p = 0.0003) covers the whole gene but nearly all the calls correspond to a deletion of exons nine and ten, which was previously observed in 1% of breast cancer cases and 0.4% of controls in Poland¹⁷. We observed the deletion in 0.9% of Polish cases and 0.5% of controls.

The most significant association (OR = 0.69 P = 0.00001) for duplications covers a large region on 17p13.3 with multiple long variants overlapping shorter duplications. The OncoArray results by probe show the strongest associations at a series of probes (17: 814529–850542) in the first intron of NXN, with the lowest P-value at 17: 819141 (OR = 0.45, P = 0.002). The most significant probe position on iCOGs was also in this region (17:836631, OR = 0.58, P = 0.09) (Fig. 3).

**Fig. 3: Associations with duplications at the 17p13.3 locus.**

Associations with risk of breast cancer subtypes

We repeated the analyses restricting cases to those with ER-positive, ER-negative, and triple negative disease. Deletions and duplications with p-values below 0.001 are shown in Tables 2 and 3 and the results for all regions are in Supplementary Data 13 and 14. An association was observed for BRCA1 for all subtypes, with the exception of duplications for ER-positive disease. The odds ratio for BRCA1 deletions was higher for ER-negative disease (OR = 27.03; 95% CI, 15.66−46.67) than ER-positive (OR = 2.81; CI, 1.56 to 5.08; P = 8.46E–28 for the difference), while for CHEK2 the odds ratio was higher for ER-positive disease (OR = 2.32;CI,1.56–3.44) than ER-negative (OR = 1.36; CI,0.66 to 2.82; P = 0.11 for the difference), consistent with the known subtype-specific associations for deleterious variants in these genes¹⁸.

Table 2 Subtype associations for deletions.

Full size table

Table 3 Subtype associations for duplications.

Full size table

In total, we observed five deletion and two duplication regions with p-values < 0.001 that did not reach p < 0.001 in the overall risk analysis. The strongest novel association for ER-positive was for an intronic deletion in ITGBL1 (OR = 3.3, P = 0.00007, P for difference by ER-status = 0.18). For ER-negative disease the strongest novel association was with an intergenic deletion between ABCC4 (MRP4) and CLDN10 (OR = 2.16, P = 0.0002, P for difference by ER-status = 0.02). Neither of these associations was significant for the other subtype. For triple negative disease, the strongest evidence of association was found for an intergenic duplication between TMC3 and MEX3B (OR = 2.39, P = 0.00009) and for two separate deletions upstream of the DDX18 gene: 2:118258797–118389164 (OR = 6.56, P = 0.00001) and 2:117973154–118107795 (OR = 4.54, P = 0.0008). The association at these two deletions was driven by the same samples, with 75% of the carriers of the first deletion observed to have the second deletion and normal copy number at the 62 kb gap between the deletions.

Associations at established common susceptibility Loci

Three of the most significant associations were observed within regions harbouring known breast cancer susceptibility loci for breast cancer. The most significant (OR = 1.42;CI,1.21−1.67; P = 0.00015) was upstream of FGFR2 and consistent with a 28 kb deletion in the 1000 Genomes Project data (chr10:123433204−123461492). Three independent risk signals have been previously identified at this region^19,20. The effect size for the CNV was larger than those previously reported for these common SNPs (largest OR = 1.27;CI,1.22−1.25). The CNV is in linkage disequilibrium with two of the SNPs identified as likely causally associated variants: rs35054928 (D′ = 0.82) and rs2981578 (D′ = 0.88). Conditioning on those SNPs reduced somewhat the strength of the association for the deletion (OR = 1.30; CI 1.10−1.53; P = 0.002, Supplementary Data 15).

The third strongest signal (OR = 4.9, P = 0.00001) in the deletion analysis for overall breast cancer was at 8: 132199447−132252439, 144Kb downstream of ADCY8. The strongest GWAS signal in this region lies in an intron of ADCY8 (lead SNP rs73348588, OR = 1.13, P = 8.2e−7)⁹. A 3 kb deletion in intron 4 of KLF12 was associated with ER-negative disease (OR = 2.4, P = 0.0007, P for difference by ER-status=0.01). This is 389 kb distant from common SNPs, located between KLF12 and KLF5, associated with ER-negative disease (rs9573140, OR = 0.94, P = 3.62e−9)²¹. The KLF12 and ADCY8 deletions are not in strong linkage disequilibrium with the corresponding GWAS signals and conditioning on these SNPs did not alter the strengths of the association for the CNVs (Supplementary Data 15).

Gene burden tests

We performed gene burden tests based on CNVs that overlapped exons. Analyses were restricted to genes in which at least 24 CNVs were identified, leaving 645 genes with deletions (Supplementary Data 16) and 1596 genes with duplications (Supplementary Data 17). QQ plots are shown in Fig. 2c for deletions and 2d for duplications. The lambda for inflation was 1.18 (; lambda₁₀₀₀ = 1.00) for deletions and 1.07 (; lambda₁₀₀₀ = 1.00) for duplications.

For deletions, we found 10 genes with P < 0.01 (Table 4), the most significant being BRCA1 (OR = 7.66, P = 3.72E−18). Four of these 10 genes (ATM, BRCA1, BRCA2, and CHEK2) are known breast cancer susceptibility genes¹⁸. Deletions were also observed in two other known susceptibility genes: PALB2 (23 cases, 9 controls, OR = 2.02, P = 0.09) and RAD51C (21 cases, 9 controls, OR = 2.04, P = 0.08). The most significant novel association was for SUPT3H (OR = 0.27, P = 0.0004).

Table 4 Gene burden results for deletions.

Full size table

For duplications we observed 15 genes with P < 0.01 (Table 5). The most significant association was for VPS53 (OR = 0.5, P = 0.0009). This gene and ABR (OR = 0.61, P = 0.008) both lie within the region on 17p which had the strongest association in the regional analysis. These associations were driven by duplications in different samples, with only one duplication in one sample overlapping both genes. Duplications were associated with an increased risk for only four of the 15 genes; the most statistically significant was RSU1 (OR = 3.4, P = 0.004). There was also some evidence of association with risk for duplications in BRCA1 (OR = 1.75, P = 0.01). However, analysis restricted to duplications that included exon 12 of BRCA1 showed clearer evidence of association (34 carriers, OR = 4.7, P = 0.0001), consistent with one of the more frequent known BRCA1 duplications that results in a frameshift²².

Table 5 Gene burden results for duplications.

Full size table

The gene burden subtype results are shown in Supplementary Data 18 and 19. The strongest associations were observed for BRCA1 deletions for ER Negative (OR = 33, P = 5.5E−35) and Triple Negative (OR = 49, P = 7.1E−34) disease, CHEK2 deletions for ER positive disease (OR = 2.14, P = 0.0001) and ATM deletions for ER positive disease (OR = 4.85, P = 0.0001). No additional genes significant at P < 0.0001 were found.

Direction of effect tests

In the gene burden and individual probe analyses we observed a directional effect, whereby the strongest associations for deletions tended to increase risk and those for duplications tended to be protective. To test whether these associations deviated from what would be expected by chance, we computed ranked summed z-score tests and evaluated the significance of the maximum test statistic by permutation. Results are summarised in Table 6. The statistic for deletion regions was more significant than any of the permuted statistics (P = 0.04) but was reduced to P = 0.12 after removing the known genes BRCA1 and CHEK2. The significance of the gene burden test for deletions also was reduced from P = 0.04 to P = 0.2 when the known genes were removed. The statistic for the duplication regions was lower than any of the 50 permutations (P = 0.04). The gene burden analysis for duplications was not significant.

Table 6 Direction of effect results.

Full size table

Discussion

We used the largest available breast cancer case-control dataset, comprising more than 86,000 cases and 76,000 controls with array genotyping, to test for associations with rare CNVs. Using the intensities from genotyping arrays to detect CNVs is not ideal due to a high level of noise and uncertainty in the calling, particularly for duplications. However, in tests of known CNVs and replication of calls between duplicate samples, the CamCNV method shows reasonable levels of sensitivity and specificity¹⁶. The main focus of this analysis was low frequency CNVs (<1% frequency) since higher frequency CNVs can generally be studied through imputation to a reference panel. In the 0.05–1% frequency range, we could detect ~20% of the CNVs identified by the 1000 Genomes project. For some loci, we only had evidence from one array because the probes do not exist to detect the variants on the other array. Thus, while this array-based approach provides power to evaluate the CNVs that can be assayed, much denser arrays or direct sequencing would be required to provide a complete evaluation of the contribution of CNVs.

In support of the reliability of the method, we detected evidence for both deletions and duplications in BRCA1, which was stronger for ER-negative disease, and for deletions in CHEK2, which were stronger for ER-positive disease. The latter appeared to be driven by a single founder deletion in East European populations. Weaker evidence of association was also observed for deletions in other susceptibility genes (BRCA2, ATM, PALB2, and RAD51C); the ORs were consistent with those seen for deleterious SNVs and indels¹⁸. In total, around 0.5% of cases in our analysis had a deletion in one of the known susceptibility genes with the proportion rising to ~1% for cases diagnosed under 50 years of age. The majority of coding deletions are expected to affect only part of the gene, with one study observing that a quarter covered only a single exon²³. To detect all of these using array data would require at least three probes per exon. The OncoArray has this level of coverage for a few genes, including BRCA1 and BRCA2, but the coverage is lower for most genes and many coding CNVs will have been missed.

A key issue is the appropriate level of statistical significance to apply to these analyses. For the gene burden tests, P < 2.5 × 10⁻⁶, as used in exome-sequencing, seems an appropriate level. It is less clear what is appropriate for non-coding variants. A level of P < 5 × 10⁻⁸ has become standard in GWAS and has been shown to lead to acceptable replication, but this seems over-conservative for CNVs, which are more likely to be deleterious. Consistent with this, for at least two of the ~200 common susceptibility loci, the likely causal variant is a CNV, a higher fraction than expected given the relative frequencies of CNVs and SNPs. Based on frequency analysis of whole-genome sequence data Abel et al.¹ estimated that rare CNVs are >800 times more likely to be deleterious than rare SNVs and >300 times more likely than rare indels. On the other hand, the significance level for non-coding CNVs should logically be more stringent than for the gene burden tests. Taken together, a significance level of ~10⁻⁶ seems appropriate, while associations at P < 0.001 may be worth following up in future studies. In our analyses, only the association at BRCA1 (both in the overall and gene burden tests) passes the higher threshold. We also calculated Bayesian False Discovery Probabilities (BFDPs)²⁴ (Supplementary Data 20 and 21) for our associations using prior probabilities of 0.001 for regions and 0.002 for genes. Outside the known genes none of the BDFPs gave a probability below 10%, with the lowest BFDP of 0.11 for the deletion in the FGFR2 locus. For a CNV observed with a frequency of 0.1% (n = 91 samples in the OncoArray dataset) we had 40% power to detect an association with an odds ratio of 2 but only 1.5% power to detect an association with an odds ratio of 1.5. An OR of 2, comparable to that seen for deleterious variants in ATM and CHEK2, may be plausible for rare coding CNVs or non-coding CNVs that have a significant effect on gene expression. Larger sample sizes will clearly be required to evaluate rare CNVs with more modest associations.

In addition to the BRCA1 and CHEK2 loci, we found associations in three known susceptibility regions identified through GWAS, harbouring FGFR2, ADCY8, and KLF12. In each case, the variants are rarer than the established associated variants, but confer higher risks. The ADCY8 and KLF12 deletions are not in linkage disequilibrium with the associated SNPs. The FGFR2 deletion is in linkage disequilibrium with two of the likely causal common SNPs although there was still evidence of association with the deletion, albeit weaker, after conditional analysis. In-silico and functional analysis clearly demonstrate that FGFR2 is the target of the previously established variants;^19,20 it will be interesting to establish if the same is true for the CNV.

Excluding loci in known susceptibility regions, the strongest evidence of association was for a 12 kb deletion (13:102121830–102133956) in the second intron of ITGBL1 (OR = 3.3, P = 0.00007 in the ER-positive analysis). This deletes a promoter flanking region (Ensembl ID: ENSR00001563823) and CTCF binding site (Ensembl ID: ENSR00001062398) active in mammary epithelial cells. There is experimental evidence that ITGBL1 expression, mediated by the RUNX2 transcription factor, enables breast cancer cells to form bone metastases²⁵.

In the gene burden analysis, the strongest novel association was for deletions within SUPT3H, which were associated with a reduced risk. SUPT3H encodes human SPT3, a component of the STAGA complex that acts as a co-activator of the MYC oncoprotein²⁶. SUPTH is located within the first intron of the RUNX2 transcription factor and the syntenic relationship between the two genes is highly evolutionarily conserved²⁷. RUNX2 has a role in mammary gland development and high RUNX2 expression is found in ER-negative tumours²⁸. The PCDHGB2 association appears to be due to a single variant (5:140739812–140740918) that deletes the first exon but as this gene is part of the protocadherin gamma gene cluster it is also possible that the deletion may be having an effect on one of the five genes that overlap PCDHGB2. It also deletes a promoter active in mammary epithelial cells (ENSR00001342785). The next strongest signals were for MEAK7 (OR = 2.19, P = 0.001), a gene implicated in a mTOR signalling pathway²⁹, and MAD1L1 ((OR = 2.00, P = 0.005), a component of the mitotic spindle-assembly that has been suggested as a possible tumour suppressor³⁰.

After BRCA1, the most significant association for ER Negative disease in the gene burden analysis was for CYP2C18 which overlaps CYP2C19 (ER-negative OR = 2.6, P = 0.002; triple-negative OR = 4.4, P = 0.0002). A previous analysis of CNVs and breast cancer in the Finnish population identified a founder mutation reaching an overall frequency of ~ 3% and reported a possible association at this locus for triple negative (OR 2.8, p = 0.02) and ER-negative breast cancer (OR = 2.2, p = 0.048)³¹.

The results from duplications are harder to interpret as there are often longer duplications overlapping whole genes with shorter variants covering some part of their length. For the gene burden analysis there was little evidence of strong associations. In the regional analysis, the two strongest associations cover multiple genes. The strongest evidence of association (OR = 0.69, P = 1.1E−05) was for a 1.5 Mb region at the start of chromosome 17 (17:13905–1559829). The probe-specific and gene burden results highlighted some stronger signals within this region, for example within NXN and VPS53, but the direction of effect was consistent across the region with 80% of the OncoArray probes having an odds ratio of 0.75 or lower (Fig. 3). This locus has established associations with prostate and colorectal cancer. Interestingly a possible association with ER-positive breast cancer survival was detected for a rare SNP in the first intron of NXN, rs118021774 (HR = 1.83, P = 3.8E−06)³². The detected duplications are not in LD with this SNP. For the 0.4 Mb duplication region on chromosome 21 (OR = 2.23, P = 0.0001) the probe-specific results from OncoArray highlighted that the signal is specific to a shorter intergenic region (21:33421860−33459975) between HUNK and LINC00159.

We observed some evidence of an aggregate directional effect, both for genes and non-genic regions, such that the deletions in aggregate were associated with increased risk. There was also some suggestion that duplications, in aggregate, were associated with a reduced risk. These results suggest that additional associations are present that could be established with a larger dataset. A new GWAS, Confluence (https://dceg.cancer.gov/research/cancer-types/breast-cancer/confluence-project), aims to double the available sample size for breast cancer. This GWAS includes probes specifically designed to assay some of the most significant CNVs observed in this study (those significant at P < 0.001), and the sample size should be sufficient to confirm or refute these associations.

Methods

Subjects

Data were derived from blood samples from study participants in 66 studies participating in the Breast Cancer Association Consortium (BCAC) and genotyped as part of the OncoArray^9,33 and iCOGS⁸ collaborations (Supplementary Data 1). Studies included population-based and hospital-based case-control studies, and case-control studies nested within prospective cohorts; we only included data from studies that provided both cases and controls. Phenotype data were based on version 12 of the BCAC database. Cases were diagnosed with either invasive breast cancer or carcinoma-in situ. Oestrogen receptor (ER) status was determined from medical records or tissue microarray evaluation, where available. Analyses were restricted to participants of European ancestry, as defined by ancestry informative principal components^8,9. Where samples were genotyped on both arrays, we excluded the iCOGS sample as the OncoArray has better genome-wide coverage. After sample quality control (see below), data on 36,980 cases and 34,706 controls with iCOGS genotyping, and 49,808 cases and 41,416 controls with OncoArray genotyping, were available for analysis (Supplementary Data 2).

Arrays

The Illumina iCOGs genotyping array⁸ includes 211,155 probes for SNVs and short insertions/deletions. Most variants were selected because of previous association in case-control studies for breast prostate and ovarian cancers, or for dense mapping of regions harbouring an association. The OncoArray includes 533,631 probes, of which approximately half were selected from the Illumina HumanCore backbone, a set of SNPs designed to tag the most common variants. The remainder were selected on the basis of evidence of previous association with breast, prostate, ovarian, lung, or colorectal cancer risk. Approximately 32,000 variants on the OncoArray were selected to provide dense coverage of associated loci and known genes. The remainder were mostly selected from lists of common variants ranked by p-value, with a small number from lists of candidate variants.

CNV calling

CNVs were called using the CamCNV pipeline as previously described¹⁶. In brief, the log R (LRR) intensity measurements and B allele frequency (BAF) for each sample at each probe were exported from Illumina’s Genome Studio software. A principal component adjustment (PCA) was applied to the LRR, grouped by study, to remove noise and batch effects. After removing noisy probes and those in regions with known common CNVs, the LRRs for each probe were converted to z-scores using the mean and standard deviation from all BCAC samples. Circular binary segmentation was applied to the z-scores ordered by probe position for each sample using the DNACopy package³⁴. This produces a list of segments for each sample by chromosome where the z-score of consecutive probes changes by more than two standard deviations. Segments with a mean probe z-score between −3.7 and −14 were called as deletions and segments with a mean z-score between +2 and +10 as duplications. We restricted the calls to segments covering a minimum of three and a maximum of 200 probes.

As per the CamCNV pipeline, we then excluded deletions with inconsistent B Allele Frequency and CNVs with a shift in LRR at the sample level that was outside the expected range. The additional CNV exclusions are summarised in Supplementary Data 3. To exclude regions with a high level of noise we also excluded CNVs falling within 1 Mb of telomeres and centromeres and a number of immune loci such as the T-cell receptor genes where somatic mutations in the blood are often observed³⁵.

Sample quality control

Standard sample quality control exclusions were performed, as previously described for the SNP genotype analyses^8,9. These include exclusions for excess heterozygosity, ancestry outliers, mismatches with other genotyping, and close relatives. A stricter sample call rate of >99% was used for the CNV analysis, compared to >95% used in the genotype analyses. We also excluded any participants for whom a DNA sample was not collected from blood and any that had been whole genome amplified.

In addition, we used two metrics to exclude noisy samples liable to produce an excess of unreliable CNV calls. First, we calculated a derivative log ratio spread (DLRS) figure for each sample as the standard deviation of the differences between LRR for probes ordered by genomic position, divided by the square root of two. This measures the variance in the LRR from each probe to the next averaged over the whole genome and thus is insensitive to large fluctuations such as might be expected between different chromosomes in the same sample. An ideal sample would have a small DLRS as the only variance would come from a small number of genuine CNVs. We calculated the DLRS using the dLRs function in R package ADM3 (https://CRAN.R-project.org/package=ADM3) before and after the PCA. At both stages, we excluded samples with a DLRS more than 3.5 standard deviations above the mean DLRS for that study.

Second, we counted the number of short segments (between three and 200 probes) output by DNACopy for each sample. We observed that the distribution of segment counts was skewed to the right with an excess of samples with a large number of segments. We calculated a cut-off for the maximum number of segments using the following formula where x is the segment count for each sample (based on the rationale that the distribution of the true number of segments should be approximately Poisson):

$${y}=2* {{{{{\rm{sqrt}}}}}}({x})$$

$${{{{{\rm{cut}}}}}}-{{{{{\rm{off}}}}}}={{{{{\rm{median}}}}}}({y})+3.5$$

The sample exclusions resulting from these QC steps are summarised in Supplementary Data 4.

Association tests

All analyses were carried out separately for deletions and duplications, since different types of CNV at the same locus do not necessarily have the same effect on risk. As we were only assessing rare CNVs, we treated all carriers as heterozygotes and did not attempt to identify rare homozygotes.

To account for overlapping CNVs and imprecision in the breakpoints, we assigned individual CNVs to regions. To identify the regions, we moved sequentially along each chromosome, identifying the start as an Oncoarray probe position where deletions were observed in at least five samples, and then the end position as the probe position before the first probe where deletions were observed in fewer than five samples. Regions within five probes of each other were then merged together. The process was repeated for duplications. Regions were also merged such that the major susceptibility genes (BRCA1, BRCA2, and CHEK2) were included within a single region. We then assigned individual CNVs to regions where at least 90% of the CNV’s length fell within the region. For iCOGS, which generally has less dense probe coverage, we first assigned CNVs to the OncoArray regions where they showed >90% overlap. We then assigned any remaining CNVs to regions defined using the iCOGS probes, using the same procedure. Using this approach, 3306 deletion regions were identified from OncoArray data, 812 of which were also observed using iCOGS data, and 541 regions identified using iCOGS alone. For duplications, there were 2203 OncoArray regions, with 854 also observed using iCOGS data, and 483 iCOGS specific regions.

Associations were evaluated for each region using logistic regression, with breast cancer status as the outcome, and the presence of a CNV in the region (0 or 1) as a covariate to derive a log odds ratio per deletion/duplication. Statistical significance was evaluated using a likelihood ratio test (based on the above model and one excluding CNV as a covariate). The logistic regression analyses were conducted using in-house software (https://ccge.medschl.cam.ac.uk/software/mlogit/). Study and ten ancestry informative principal components, defined separately for each array, were also included as covariates. The analyses were conducted separately for the iCOGS and OncoArray and then combined in a standard fixed effect meta-analysis using the METAL software (after first deriving the standard error of the log-odds ratio from the likelihood test statistic)³⁶. To avoid regions with too few observations, we excluded regions with fewer than 24 deletions or duplications (~0.015% of samples). Associations significant at P < 0.001 were considered noteworthy.

To detect more precisely the location of association signals, we also generated results for each probe. We created a vector of pseudo-genotypes for each probe with samples, such that a deletion covering that probe was coded as 1 and all other samples were coded as 0. We generated a similar set of genotypes for duplications. The results were analysed using logistic regression, as above.

To test for association between CNVs affecting the coding sequence of genes, in aggregate, and breast cancer risk, we identified samples with a deletion or duplication overlapping the exons of each gene. Exon positions were downloaded from the UCSC Genome Browser hg19 knownGene table. We used logistic regression to generate a log odds ratio (OR) for carriers of coding variants covering each gene, adjusted for study, as above. We generated results for each array and then for carriers combined across both arrays. For the combined analyses we treated studies with samples on both arrays as separate studies.

To calculate BFDPs we assumed a log-normally distributed prior effect size as described by Wakefield²⁴. The prior log(OR) was determined by assuming a 95% probability that the OR was less than some bound K, where K = 3 for the regional and gene-based analysis, except for BRCA1 and BRCA2 where K = 20 was assumed. The prior probability of association was assumed to be 0.001 for the regional analysis, 0.99 for BRCA1, BRCA2, ATM, and CHEK2 and 0.002 for other genes. For the gene-based analysis, only positive associations were considered as the prior evidence for all genes was in favour of PTVs being positively associated with risk.

To determine whether there was a tendency for CNVs to be associated with an excess, or deficit, of risk across genes or regions, we computed signed z-scores as the square root of the chi-squared statistic for each gene, multiplied by ±1 depending on whether the effect estimate was positive or negative. These were ranked and normalised summed z-scores, based on the r most significant associations, were derived. The overall test statistic was the maximum summed z-score over all possible values of r:

$$U=\begin{array}{c}{\max }\\ r\le n\end{array}\frac{1}{\sqrt{r}}\mathop{\sum }\limits_{j=1}^{r}{z}_{j}$$

(1)

Where n is the total number of genes/regions being tested. The significance of U was then determined by permutation, randomly permuting case-control labels within study 50 times.

Ethical approval

All participating studies were approved by their appropriate ethics review board and all subjects provided written informed consent.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Full summary statistics for the regions and probes analysed are available in the Supplementary Data. This includes the source data used to produce Figs. 2 and 3. The majority of the OncoArray dataset analysed in this study is available in the dbGap repository, Study ID: phs001265.v1.p1 (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001265.v1.p1). The iCOGS dataset and complete OncoArray dataset cannot be made publicly available due to restraints imposed by the ethics committees of individual studies; requests for data can be made to the corresponding author or the Data Access Coordination Committee (DACC) of BCAC (http://bcac.ccge.medschl.cam.ac.uk/).

Code availability

Code for the CamCNV calling algorithm and a test dataset with OncoArray genotyping is available at the project home page (https://github.com/jgd29/CamCNV).

References

Abel, H.J. et al. Mapping and characterization of structural variation in 17,795 human genomes. Nature 583, 83–89 (2020).
Collins, R. L. et al. A structural variation reference for medical and population genetics. Nature. 581, 444–451 (2020).
Article CAS Google Scholar
Sudmant, P.H. et al. An integrated map of structural variation in 2,504 human genomes. Nature 526, 75−81 (2015).
Nik-Zainal, S. et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 534, 47–54 (2016).
Article CAS Google Scholar
Gerstung, M. et al. The evolutionary history of 2,658 cancers. Nature 578, 122–128 (2020).
Article CAS Google Scholar
Easton, D. F. et al. Gene-panel sequencing and the prediction of breast-cancer risk. N. Eng. J. Med. 372, 2243–2257 (2015).
Article CAS Google Scholar
Ewald, I. P. et al. Genomic rearrangements in BRCA1 and BRCA2: A literature review. Genet. Mol. Biol. 32, 437–446 (2009).
Article CAS Google Scholar
Michailidou, K. et al. Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat. Genet. 45, 353–61 (2013).
Michailidou, K. et al. Association analysis identifies 65 new breast cancer risk loci. Nature 551, 92 (2017).
Article Google Scholar
Milne, R. L. et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat. Genet. 49, 1767 (2017).
Article CAS Google Scholar
Zhang, H. et al. Genome-wide association study identifies 32 novel breast cancer susceptibility loci from overall and subtype-specific analyses. Nat. Genet. 52, 572–581 (2020).
Article CAS Google Scholar
Long, J. et al. A common deletion in the APOBEC3 genes and breast cancer risk. J. Natl Cancer Inst. 105, 573–9 (2013).
Nik-Zainal, S. et al. Association of a germline copy number polymorphism of APOBEC3A and APOBEC3B with burden of putative APOBEC-dependent mutations in breast cancer. Nat. Genet. 46, 487–491 (2014).
Article CAS Google Scholar
Wyszynski, A. et al. An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression. Hum. Mol. Genet. 25, 3863–3876 (2016).
Baxter, J. Functional annotation of the 2q35 breast cancer risk locus implicates a structural variant in influencing activity of a long-range enhancer element. Am. J. Hum. Genet. 108, 1190–1203 (2021).
Article CAS Google Scholar
Dennis, J., Walker, L., Tyrer, J., Michailidou, K. & Easton, D.F. Detecting rare copy number variants from Illumina genotyping arrays with the CamCNV pipeline: Segmentation of z-scores improves detection and reliability. Genet. Epidemiol. 45, 237–248 (2020).
Cybulski, C. et al. A deletion in CHEK2 of 5,395 bp predisposes to breast cancer in Poland. Breast Cancer Res. Treat. 102, 119–122 (2007).
Article CAS Google Scholar
Dorling, L. et al. Breast cancer risk genes—association analysis in more than 113,000 women. N. Engl. J. Med. 384, 428–439 (2021).
Article CAS Google Scholar
Meyer, K. B. et al. Fine-scale mapping of the FGFR2 breast cancer risk locus: Putative functional variants differentially bind FOXA1 and E2F1. Am. J. Hum. Genet. 93, 1046–1060 (2013).
Article CAS Google Scholar
Fachal, L. et al. Fine-mapping of 150 breast cancer risk regions identifies 191 likely target genes. Nat. Genet. 52, 56–73 (2020).
Article CAS Google Scholar
Milne, R. L. et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat. Genet. 49, 1767–1778 (2017).
Article CAS Google Scholar
The BRCA1 Exon 13 Duplication Screening Group. The exon 13 duplication in the BRCA1 gene is a founder mutation present in geographically diverse populations. Am. J. Hum. Genet. 67, 207–212 (2000).
Truty, R. et al. Prevalence and properties of intragenic copy-number variation in Mendelian disease genes. Genet. Med. 21, 114–123 (2018).
Wakefield, J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet. 81, 208–227 (2007).
Article CAS Google Scholar
Li, X. Q. et al. ITGBL1 is a Runx2 transcriptional target and promotes breast cancer bone metastasis by activating the TGFβ signaling pathway. Cancer Res. 75, 3302–3313 (2015).
Article CAS Google Scholar
Liu, X., Vorontchikhina, M., Wang, Y.-L., Faiola, F. & Martinez, E. STAGA recruits mediator to the MYC oncoprotein to stimulate transcription and cell proliferation. Mol. Cell. Biol. 28, 108–121 (2008).
Article Google Scholar
Barutcu, A. R. et al. The bone-specific Runx2-P1 promoter displays conserved three-dimensional chromatin structure with the syntenic Supt3h promoter. Nucleic Acids Res. 42, 10360–10372 (2014).
Article Google Scholar
Ferrari, N., McDonald, L., Morris, J. S., Cameron, E. R. & Blyth, K. RUNX2 in mammary gland development and breast cancer. J. Cell Physiol. 228, 1137–1142 (2013).
Article CAS Google Scholar
Nguyen, J. T. et al. Mammalian EAK-7 activates alternative mTOR signaling to regulate cell proliferation and migration. Sci. Adv. 4, eaao5838 (2018).
Article Google Scholar
Rajaram, M. et al. Two distinct categories of focal deletions in cancer genomes. PLoS One 8, e66264 (2013).
Article CAS Google Scholar
Tervasmaki, A., Winqvist, R., Jukkola-Vuorinen, A. & Pylkas, K. Recurrent CYP2C19 deletion allele is associated with triple-negative breast cancer. BMC Cancer 14, 902 (2014).
Article Google Scholar
Escala-Garcia, M. et al. Genome-wide association study of germline variants and breast cancer-specific mortality. Br. J. Cancer 120, 647–657 (2019).
Article CAS Google Scholar
Amos, C.I. et al. The OncoArray Consortium: A network for understanding the genetic architecture of common cancers. Cancer Epidemiol. Biomarkers Prev. 26, 126–135 (2017).
Venkatraman, E. S. & Olshen, A. B. A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics 23, 657–663 (2007).
Article CAS Google Scholar
Schwienbacher, C. et al. Copy number variation and association over T-cell receptor genes-influence of DNA source. Immunogenetics 62, 561–567 (2010).
Article CAS Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

Joe Dennis is supported by the CanRisk Cancer Research UK programme grant: PPRPGM-Nov20\100002 and by the Confluence project which is funded with intramural funds from the National Cancer Institute Intramural Research Programme, National Institutes of Health. Logan Walker is supported by a Rutherford Discovery Fellowship (Royal Society of New Zealand). We thank all the individuals who took part in these studies and all the researchers, clinicians, technicians, and administrative staff who have enabled this work to be carried out. The COGS study would not have been possible without the contributions of the following Andrew Berchuck (OCAC), Rosalind A. Eeles, Ali Amin Al Olama, Zsofia Kote-Jarai, Sara Benlloch (PRACTICAL), Antonis Antoniou, Lesley McGuffog and Ken Offit (CIMBA), Andrew Lee, and Ed Dicks, Craig Luccarini and the staff of the Centre for Genetic Epidemiology Laboratory, the staff of the CNIO genotyping unit, Jacques Simard and Daniel C. Tessier, Francois Bacot, Daniel Vincent, Sylvie LaBoissière and Frederic Robidoux and the staff of the McGill University and Génome Québec Innovation Centre, Sune F. Nielsen, Borge G. Nordestgaard, and the staff of the Copenhagen DNA laboratory, and Julie M. Cunningham, Sharon A. Windebank, Christopher A. Hilker, Jeffrey Meyer and the staff of Mayo Clinic Genotyping Core Facility. ABCFS thank Maggie Angelakos, Judi Maskiell, Gillian Dite. ABCS thanks the Blood bank Sanquin, The Netherlands. ABCTB Investigators: Christine Clarke, Deborah Marsh, Rodney Scott, Robert Baxter, Desmond Yip, Jane Carpenter, Alison Davis, Nirmala Pathmanathan, Peter Simpson, J. Dinny Graham, Mythily Sachchithananthan. Samples are made available to researchers on a non-exclusive basis. BBCS thanks Eileen Williams, Elaine Ryder-Mills, Kara Sargus. BCEES thanks Allyson Thomson, Christobel Saunders, Terry Slevin, BreastScreen Western Australia, Elizabeth Wylie, Rachel Lloyd. The BCINIS study would not have been possible without the contributions of Dr. K. Landsman, Dr. N. Gronich, Dr. A. Flugelman, Dr. W. Saliba, Dr. F. Lejbkowicz, Dr. E. Liani, Dr. I. Cohen, Dr. S. Kalet, Dr. V. Friedman, Dr. O. Barnet of the NICCC in Haifa, and all the contributing family medicine, surgery, pathology, and oncology teams in all medical institutes in Northern Israel. BIGGS thanks Niall McInerney, Gabrielle Colleran, Andrew Rowan, Angela Jones. The BREOGAN study would not have been possible without the contributions of the following: Angel Carracedo, Victor Muñoz Garzón, Alejandro Novo Domínguez, Maria Elena Martinez, Sara Miranda Ponte, Carmen Redondo Marey, Maite Peña Fernández, Manuel Enguix Castelo, Maria Torres, Manuel Calaza (BREOGAN), José Antúnez, Máximo Fraga and the staff of the Department of Pathology and Biobank of the University Hospital Complex of Santiago-CHUS, Instituto de Investigación Sanitaria de Santiago, IDIS, Xerencia de Xestion Integrada de Santiago-SERGAS; Joaquín González-Carreró and the staff of the Department of Pathology and Biobank of University Hospital Complex of Vigo, Instituto de Investigacion Biomedica Galicia Sur, SERGAS, Vigo, Spain. CBCS thanks study participants, co-investigators, collaborators, and staff of the Canadian Breast Cancer Study, and project coordinators Agnes Lai and Celine Morissette. CCGP thanks Styliani Apostolaki, Anna Margiolaki, Georgios Nintos, Maria Perraki, Georgia Saloustrou, Georgia Sevastaki, Konstantinos Pompodakis. CGPS thanks staff and participants of the Copenhagen General Population Study. For the excellent technical assistance: Dorthe Uldall Andersen, Maria Birna Arnadottir, Anne Bank, Dorthe Kjeldgård Hansen. The Danish Cancer Biobank is acknowledged for providing infrastructure for the collection of blood samples for the cases. CNIO-BCS thanks Guillermo Pita, Charo Alonso, Nuria Álvarez, Pilar Zamora, Primitiva Menendez, the Human Genotyping-CEGEN Unit (CNIO). Investigators from the CPS-II cohort thank the participants and Study Management Group for their invaluable contributions to this research. They also acknowledge the contribution to this study from central cancer registries supported through the Centres for Disease Control and Prevention National Programme of Cancer Registries, as well as cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results programme. The authors would like to thank the California Teachers Study Steering Committee that is responsible for the formation and maintenance of the Study within which this research was conducted. A full list of California Teachers Study team members is available at https://www.calteachersstudy.org/team. ESTHER thanks Hartwig Ziegler, Sonja Wolf, Volker Hermann, Christa Stegmaier, Katja Butterbach. FHRISK thanks NIHR for funding and the Manchester NIHR Biomedical Research Centre (IS-BRC-1215-20007). GC-HBOC thanks Stefanie Engert, Heide Hellebrand, Sandra Kröber and LIFE - Leipzig Research Centre for Civilisation Diseases (Markus Loeffler, Joachim Thiery, Matthias Nüchter, Ronny Baber). The GENICA Network: Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, and University of Tübingen, Germany [Hiltrud Brauch, RH, Wing-Yee Lo], German Cancer Consortium (DKTK) and German Cancer Reseach Centre (DKFZ), Partner Site Tübingen, 72074 Tübingen, Germany [Hiltrud Brauch], gefördert durch die Deutsche Forschungsgemeinschaft (DFG) im Rahmen der Exzellenzstrategie des Bundes und der Länder – EXC 2180 – 390900677 [Hiltrud Brauch], Department of Internal Medicine, Evangelische Kliniken Bonn gGmbH, Johanniter Krankenhaus, Bonn, Germany [YDK, Christian Baisch], Institute of Pathology, University of Bonn, Germany [Hans-Peter Fischer], Molecular Genetics of Breast Cancer, Deutsches Krebsforschungszentrum (DKFZ), Heidelberg, Germany [Ute Hamann], Institute for Prevention and Occupational Medicine of the German Social Accident Insurance, Institute of the Ruhr University Bochum (IPA), Bochum, Germany [Thomas Brüning, Beate Pesch, Sylvia Rabstein, Anne Lotz]; and Institute of Occupational Medicine and Maritime Medicine, University Medical Centre Hamburg-Eppendorf, Germany [Volker Harth]. HEBCS thanks Johanna Kiiski, Carl Blomqvist, Kristiina Aittomäki, Kirsimari Aaltonen, Karl von Smitten, Irja Erkkilä. HMBCS thanks Peter Hillemanns, Hans Christiansen and Johann H. Karstens. HUBCS thanks Shamil Gantsev. ICICLE thanks Kelly Kohut, Michele Caneppele, Maria Troy. KARMA and SASBAC thank the Swedish Medical Research Counsel. KBCP thanks Eija Myöhänen. kConFab/AOCS wish to thank Heather Thorne, Eveline Niedermayr, all the kConFab research nurses and staff, the heads and staff of the Family Cancer Clinics, and the Clinical Follow Up Study (which has received funding from the NHMRC, the National Breast Cancer Foundation, Cancer Australia, and the National Institute of Health (USA)) for their contributions to this resource, and the many families who contribute to kConFab. LMBC thanks Gilian Peuteman, Thomas Van Brussel, EvyVanderheyden and Kathleen Corthouts. MABCS thanks Milena Jakimovska (RCGEB “Georgi D. Efremov”), Snezhana Smichkoska, Emilija Lazarova (University Clinic of Radiotherapy and Oncology), Mitko Karadjozov (Adzibadem-Sistina Hospital), Andrej Arsovski and Liljana Stojanovska (Re-Medika Hospital) for their contributions and commitment to this study. MARIE thanks Petra Seibold, Dieter Flesch-Janys, Judith Heinz, Nadia Obi, Alina Vrieling, Sabine Behrens, Ursula Eilber, Muhabbet Celik, Til Olchers and Stefan Nickels. MBCSG (Milan Breast Cancer Study Group): Paolo Radice, Bernard Peissel, Jacopo Azzollini, Erica Rosina, Daniela Zaffaroni, Bernardo Bonanni, Irene Feroce, Mariarosaria Calvello, Aliana Guerrieri Gonzaga, Monica Marabelli, Davide Bondavalli and the personnel of the Cogentech Cancer Genetic Test Laboratory. The MCCS was made possible by the contribution of many people, including the original investigators, the teams that recruited the participants and continue working on follow-up, and the many thousands of Melbourne residents who continue to participate in the study. We thank the coordinators, the research staff, and especially the MMHS participants for their continued collaboration on research studies in breast cancer. The following are NBCS Collaborators: Anne-Lise Børresen-Dale (Prof. Em.), Kristine K. Sahlberg (PhD), Lars Ottestad (MD), Rolf Kåresen (Prof. Em.) Dr. Ellen Schlichting (MD), Marit Muri Holmen (MD), Toril Sauer (MD), Vilde Haakensen (MD), Olav Engebråten (MD), Bjørn Naume (MD), Alexander Fosså (MD), Cecile E. Kiserud (MD), Kristin V. Reinertsen (MD), Åslaug Helland (MD), Margit Riis (MD), Jürgen Geisler (MD), OSBREAC and Grethe I. Grenaker Alnæs (MSc). NBHS and SBCGS thank study participants and research staff for their contributions and commitment to the studies. For NHS and NHS2 the study protocol was approved by the institutional review boards of the Brigham and Women’s Hospital and Harvard T.H. Chan School of Public Health, and those of participating registries as required. We would like to thank the participants and staff of the NHS and NHS2 for their valuable contributions, as well as the following state cancer registries for their help: A.L., A.Z., A.R., C.A., C.O., C.T., D.E., F.L., G.A., I.D., I.L., I.N., I.A., K.Y., L.A., M.E., M.D., M.A., M.I., N.E., N.H., N.J., N.Y., N.C., N.D., O.H., O.K., O.R., P.A., R.I., S.C., T.N., T.X., V.A., W.A., and W.Y. The authors assume full responsibility for analyses and interpretation of these data. OBCS thanks Arja Jukkola-Vuorinen, Mervi Grip, Saila Kauppila, Meeri Otsukka, Leena Keskitalo and Kari Mononen for their contributions to this study. The OFBCR thanks Teresa Selander, Nayana Weerasooriya, and Steve Gallinger. ORIGO thanks E. Krol-Warmerdam, and J. Blom for patient accrual, administering questionnaires, and managing clinical information. The LUMC survival data were retrieved from the Leiden hospital-based cancer registry system (ONCDOC) with the help of Dr. J. Molenaar. PBCS thanks Louise Brinton, Mark Sherman, Neonila Szeszenia-Dabrowska, Beata Peplonska, Witold Zatonski, Pei Chao, Michael Stagner. The ethical approval for the POSH study is MREC /00/6/69, UKCRN ID: 1137. We thank the staff in the Experimental Cancer Medicine Centre (ECMC) supported Faculty of Medicine Tissue Bank and the Faculty of Medicine DNA Banking resource. The RBCS thanks Jannet Blom, Saskia Pelders, Wendy J.C. Prager —van der Smissen, and the Erasmus MC Family Cancer Clinic. SBCS thanks Sue Higham, Helen Cramp, Dan Connley, Ian Brock, Sabapathy Balasubramanian, and Malcolm W.R. Reed. We thank the SEARCH and EPIC teams. SZBCS thanks Ewa Putresza. UCIBCS thanks Irene Masunaka. UKBGS thanks Breast Cancer Now and the Institute of Cancer Research for support and funding of the Generations Study, and the study participants, study staff, and the doctors, nurses and other health care providers and health information sources who have contributed to the study. We acknowledge NHS funding to the Royal Marsden/ICR NIHR Biomedical Research Centre. BCAC is funded by the European Union’s Horizon 2020 Research and Innovation Programme (grant numbers 634935 and 633784 for BRIDGES and B-CAST respectively), and the PERSPECTIVE I&I project, funded by the Government of Canada through Genome Canada and the Canadian Institutes of Health Research, the Ministère de l’Économie et de l’Innovation du Québec through Genome Québec, the Quebec Breast Cancer Foundation. The EU Horizon 2020 Research and Innovation Programme funding source had no role in study design, data collection, data analysis, data interpretation, or writing of the report. Additional funding for BCAC is provided via the Confluence project which is funded with intramural funds from the National Cancer Institute Intramural Research Programme, National Institutes of Health. Genotyping of the OncoArray was funded by the NIH Grant U19 CA148065, and Cancer Research UK Grant C1287/A16563 and the PERSPECTIVE project supported by the Government of Canada through Genome Canada and the Canadian Institutes of Health Research (grant GPH-129344) and, the Ministère de l’Économie, Science et Innovation du Québec through Genome Québec and the PSRSIIRI-701 grant, and the Quebec Breast Cancer Foundation. Funding for iCOGS came from: the European Community’s Seventh Framework Programme under grant agreement n° 223175 (HEALTH-F2-2009-223175) (COGS), Cancer Research UK (C1287/A10118, C1287/A10710, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007, C5047/A10692, C8197/A16565), the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 CA148065, and 1U19 CA148112—the GAME-ON initiative), the Department of Defence (W81XWH-10-1-0341), the Canadian Institutes of Health Research (CIHR) for the CIHR Team in Familial Risks of Breast Cancer, and Komen Foundation for the Cure, the Breast Cancer Research Foundation, and the Ovarian Cancer Research Fund. The DRIVE Consortium was funded by U19 CA148065. The Australian Breast Cancer Family Study (ABCFS) was supported by grant UM1 CA164920 from the National Cancer Institute (USA). The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centres in the Breast Cancer Family Registry (BCFR), nor does mention of trade names, commercial products, or organisations imply endorsement by the USA Government or the BCFR. The ABCFS was also supported by the National Health and Medical Research Council of Australia, the New South Wales Cancer Council, the Victorian Health Promotion Foundation (Australia), and the Victorian Breast Cancer Research Consortium. J.L.H. is a National Health and Medical Research Council (NHMRC) Senior Principal Research Fellow. M.C.S. is a NHMRC Senior Research Fellow. The ABCS study was supported by the Dutch Cancer Society [grants NKI 2007-3839; 2009 4363]. The Australian Breast Cancer Tissue Bank (ABCTB) was supported by the National Health and Medical Research Council of Australia, The Cancer Institute NSW and the National Breast Cancer Foundation. The AHS study is supported by the intramural research programme of the National Institutes of Health, the National Cancer Institute (grant number Z01-CP010119), and the National Institute of Environmental Health Sciences (grant number Z01-ES049030). The work of the BBCC was partly funded by ELAN-Fond of the University Hospital of Erlangen. The BBCS is funded by Cancer Research UK and Breast Cancer Now and acknowledges NHS funding to the NIHR Biomedical Research Centre, and the National Cancer Research Network (NCRN). The BCEES was funded by the National Health and Medical Research Council, Australia and the Cancer Council Western Australia and acknowledges funding from the National Breast Cancer Foundation (JS). For the BCFR-NY this work was supported by grant UM1 CA164920 from the National Cancer Institute. The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centres in the Breast Cancer Family Registry (BCFR), nor does mention of trade names, commercial products, or organisations imply endorsement by the US Government or the BCFR. The BCINIS study is supported in part by the Breast Cancer Research Foundation (BCRF). For BIGGS, ES is supported by NIHR Comprehensive Biomedical Research Centre, Guy’s & St. Thomas’ NHS Foundation Trust in partnership with King’s College London, United Kingdom. IT is supported by the Oxford Biomedical Research Centre. The BREast Oncology GAlician Network (BREOGAN) is funded by Acción Estratégica de Salud del Instituto de Salud Carlos III (ISCIII) FIS PI12/02125/Cofinanciado FEDER, and ISCIII/PI17/00918/Cofinanciado FEDER; Acción Estratégica de Salud del Instituto de Salud Carlos III FIS Intrasalud (PI13/01136); Programa Grupos Emergentes, Cancer Genetics Unit, Instituto de Investigacion Biomedica Galicia Sur. Xerencia de Xestion Integrada de Vigo-SERGAS, Instituto de Salud Carlos III, Spain; Grant 10CSA012E, Consellería de Industria Programa Sectorial de Investigación Aplicada, PEME I + D e I + D Suma del Plan Gallego de Investigación, Desarrollo e Innovación Tecnológica de la Consellería de Industria de la Xunta de Galicia, Spain; Grant EC11-192. Fomento de la Investigación Clínica Independiente, Ministerio de Sanidad, Servicios Sociales e Igualdad, Spain; and Grant FEDER-Innterconecta. Ministerio de Economia y Competitividad, Xunta de Galicia, Spain. The BSUCH study was supported by the Dietmar−Hopp Foundation, the Helmholtz Society, and the German Cancer Research Centre (DKFZ). CBCS is funded by the Canadian Cancer Society (grant # 313404) and the Canadian Institutes of Health Research. CCGP is supported by funding from the University of Crete. The CECILE study was supported by Fondation de France, Institut National du Cancer (INCa), Ligue Nationale contre le Cancer, Agence Nationale de Sécurité Sanitaire, de l’Alimentation, de l’Environnement et du Travail (ANSES), Agence Nationale de la Recherche (ANR). The CGPS was supported by the Chief Physician Johan Boserup and Lise Boserup Fund, the Danish Medical Research Council, and Herlev and Gentofte Hospital. The CNIO-BCS was supported by the Instituto de Salud Carlos III, the Red Temática de Investigación Cooperativa en Cáncer and grants from the Asociación Española Contra el Cáncer and the Fondo de Investigación Sanitario (PI11/00923 and PI12/00070). The American Cancer Society funds the creation, maintenance, and updating of the CPS-II cohort. The California Teachers Study and the research reported in this publication were supported by the National Cancer Institute of the National Institutes of Health under award number U01-CA199277; P30-CA033572; P30-CA023100; UM1-CA164917; and R01-CA077398. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Cancer Institute or the National Institutes of Health. The collection of cancer incidence data used in the California Teachers Study was supported by the California Department of Public Health pursuant to California Health and Safety Code Section 103885; Centres for Disease Control and Prevention’s National Programme of Cancer Registries, under cooperative agreement 5NU58DP006344; the National Cancer Institute’s Surveillance, Epidemiology and End Results Programme under contract HHSN261201800032I awarded to the University of California, San Francisco, contract HHSN261201800015I awarded to the University of Southern California, and contract HHSN261201800009I awarded to the Public Health Institute. The opinions, findings, and conclusions expressed herein are those of the author(s) and do not necessarily reflect the official views of the State of California, Department of Public Health, the National Cancer Institute, the National Institutes of Health, the Centres for Disease Control and Prevention or their Contractors and Subcontractors, or the Regents of the University of California, or any of its programmes. The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy or views of the International Agency for Research on Cancer/World Health Organization. The national cohorts are supported by: Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, Institut National de la Santé et de la Recherche Médicale (INSERM) (France); German Cancer Aid, German Cancer Research Centre (DKFZ), Federal Ministry of Education and Research (BMBF) (Germany); the Hellenic Health Foundation, the Stavros Niarchos Foundation (Greece); Associazione Italiana per la Ricerca sul Cancro-AIRC-Italy and National Research Council (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands); Health Research Fund (FIS), PI13/00061 to Granada, PI13/01162 to EPIC-Murcia, Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, ISCIII RETIC (RD06/0020) (Spain); Cancer Research UK (14136 to EPIC-Norfolk; C570/A16491 and C8221/A19170 to EPIC-Oxford), Medical Research Council (1000143 to EPIC-Norfolk, MR/M012190/1 to EPIC-Oxford) (United Kingdom). The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. Additional cases were recruited in the context of the VERDI study, which was supported by a grant from the German Cancer Aid (Deutsche Krebshilfe). FHRISK is funded from NIHR grant PGfAR 0707-10031. The GC-HBOC (German Consortium of Hereditary Breast and Ovarian Cancer) is supported by the German Cancer Aid (grant no 110837, coordinator: Rita K. Schmutzler, Cologne). This work was also funded by the European Regional Development Fund and Free State of Saxony, Germany (LIFE - Leipzig Research Centre for Civilisation Diseases, project numbers 713-241202, 713-241202, 14505/2470, 14575/2470). The GENICA was funded by the Federal Ministry of Education and Research (BMBF) Germany grants 01KW9975/5, 01KW9976/8, 01KW9977/0 and 01KW0114, the Robert Bosch Foundation, Stuttgart, Deutsches Krebsforschungszentrum (DKFZ), Heidelberg, the Institute for Prevention and Occupational Medicine of the German Social Accident Insurance, Institute of the Ruhr University Bochum (IPA), Bochum, as well as the Department of Internal Medicine, Evangelische Kliniken Bonn gGmbH, Johanniter Krankenhaus, Bonn, Germany. The GESBC was supported by the Deutsche Krebshilfe e. V. [70492] and the German Cancer Research Centre (DKFZ). The HABCS study was supported by the Claudia von Schilling Foundation for Breast Cancer Research, by the Lower Saxonian Cancer Society, and by the Rudolf Bartling Foundation. The HEBCS was financially supported by the Helsinki University Hospital Research Fund, the Finnish Cancer Society, and the Sigrid Juselius Foundation. The HMBCS was supported by a grant from the Friends of Hannover Medical School and by the Rudolf Bartling Foundation. The HUBCS was supported by a grant from the German Federal Ministry of Research and Education (RUS08/017), B.M. was supported by grant 17-44-020498, 17-29-06014 of the Russian Foundation for Basic Research, D.P. was supported by grant 18-29-09129 of the Russian Foundation for Basic Research, E.K. was supported by the programme for supporting the bioresource collections №007-030164/2, and the study was performed as part of the assignment of the Ministry of Science and Higher Education of the Russian Federation (№АААА-А16-116020350032-1). Financial support for KARBAC was provided through the regional agreement on medical training and clinical research (ALF) between Stockholm County Council and Karolinska Institutet, the Swedish Cancer Society, The Gustav V Jubilee foundation and Bert von Kantzows foundation. The KARMA study was supported by Märit and Hans Rausings Initiative Against Breast Cancer. The KBCP was financially supported by the special Government Funding (VTR) of Kuopio University Hospital grants, Cancer Fund of North Savo, the Finnish Cancer Organisations, and by the strategic funding of the University of Eastern Finland. kConFab is supported by a grant from the National Breast Cancer Foundation, and previously by the National Health and Medical Research Council (NHMRC), the Queensland Cancer Fund, the Cancer Councils of New South Wales, Victoria, Tasmania and South Australia, and the Cancer Foundation of Western Australia. Financial support for the AOCS was provided by the United States Army Medical Research and Materiel Command [DAMD17-01-1-0729], Cancer Council Victoria, Queensland Cancer Fund, Cancer Council New South Wales, Cancer Council South Australia, The Cancer Foundation of Western Australia, Cancer Council Tasmania and the National Health and Medical Research Council of Australia (NHMRC; 400413, 400281, 199600). G.C.T. and P.W. are supported by the NHMRC. RB was a Cancer Institute NSW Clinical Research Fellow. LMBC is supported by the ‘Stichting tegen Kanker’. D.L. is supported by the FWO. The MABCS study is funded by the Research Centre for Genetic Engineering and Biotechnology “Georgi D. Efremov”, MASA. The MARIE study was supported by the Deutsche Krebshilfe e.V. [70-2892-BR I, 106332, 108253, 108419, 110826, 110828], the Hamburg Cancer Society, the German Cancer Research Centre (DKFZ), and the Federal Ministry of Education and Research (BMBF) Germany [01KH0402]. The MASTOS study was supported by “Cyprus Research Promotion Foundation” grants 0104/13 and 0104/17, and the Cyprus Institute of Neurology and Genetics. MBCSG is supported by grants from the Italian Association for Cancer Research (AIRC). The MCBCS was supported by the NIH grants R35CA253187, R01CA192393, R01CA116167, R01CA176785 a NIH Specialised Programme of Research Excellence (SPORE) in Breast Cancer [P50CA116201], and the Breast Cancer Research Foundation. The Melbourne Collaborative Cohort Study (MCCS) cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further augmented by Australian National Health and Medical Research Council grants 209057, 396414, and 1074383 and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry and the Australian Institute of Health and Welfare, including the National Death Index and the Australian Cancer Database. The MEC was supported by NIH grants CA63464, CA54281, CA098758, CA132839, and CA164973. The MISS study is supported by funding from ERC-2011-294576 Advanced grant, Swedish Cancer Society, Swedish Research Council, Local hospital funds, Berta Kamprad Foundation, Gunnar Nilsson. The MMHS study was supported by NIH grants CA97396, CA128931, CA116201, CA140286, and CA177150. The work of MTLGEBCS was supported by the Quebec Breast Cancer Foundation, the Canadian Institutes of Health Research for the “CIHR Team in Familial Risks of Breast Cancer” programme—grant # CRN-87521 and the Ministry of Economic Development, Innovation and Export Trade—grant # PSR-SIIRI-701. The NBCS has received funding from the K.G. Jebsen Centre for Breast Cancer Research; the Research Council of Norway grant 193387/V50 (to A-L Børresen-Dale and V.N. Kristensen) and grant 193387/H10 (to A.-L. Børresen-Dale and V.N. Kristensen), South Eastern Norway Health Authority (grant 39346 to A-L Børresen-Dale) and the Norwegian Cancer Society (to A-L Børresen-Dale and V.N. Kristensen). The NBHS was supported by NIH grant R01CA100374. Biological sample preparation was conducted by the Survey and Biospecimen Shared Resource, which is supported by P30 CA68485. The Northern California Breast Cancer Family Registry (NC-BCFR) and Ontario Familial Breast Cancer Registry (OFBCR) were supported by grant U01CA164920 from the USA National Cancer Institute of the National Institutes of Health. The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centres in the Breast Cancer Family Registry (BCFR), nor does mention of trade names, commercial products, or organisations imply endorsement by the USA Government or the BCFR. The Carolina Breast Cancer Study (NCBCS) was funded by Komen Foundation, the National Cancer Institute (P50 CA058223, U54 CA156733, U01 CA179715), and the North Carolina University Cancer Research Fund. The NHS was supported by NIH grants P01 CA87969, UM1 CA186107, and U19 CA148065. The NHS2 was supported by NIH grants UM1 CA176726 and U19 CA148065. The OBCS was supported by research grants from the Finnish Cancer Foundation, the Academy of Finland (grant number 250083, 122715 and Centre of Excellence grant number 251314), the Finnish Cancer Foundation, the Sigrid Juselius Foundation, the University of Oulu, the University of Oulu Support Foundation and the special Governmental EVO funds for Oulu University Hospital-based research activities. The ORIGO study was supported by the Dutch Cancer Society (RUL 1997-1505) and the Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL CP16). The PBCS was funded by Intramural Research Funds of the National Cancer Institute, Department of Health and Human Services, USA. Genotyping for PLCO was supported by the Intramural Research Programme of the National Institutes of Health, NCI, Division of Cancer Epidemiology and Genetics. The PLCO is supported by the Intramural Research Programme of the Division of Cancer Epidemiology and Genetics and supported by contracts from the Division of Cancer Prevention, National Cancer Institute, National Institutes of Health. The RBCS was funded by the Dutch Cancer Society (DDHK 2004-3124, DDHK 2009-4318). The SASBAC study was supported by funding from the Agency for Science, Technology and Research of Singapore (A*STAR), the US National Institute of Health (NIH), and the Susan G. Komen Breast Cancer Foundation. The SBCS was supported by Sheffield Experimental Cancer Medicine Centre and Breast Cancer Now Tissue Bank. SEARCH is funded by Cancer Research UK [C490/A10124, C490/A16561] and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. The University of Cambridge has received salary support for PDPP from the NHS in the East of England through the Clinical Academic Reserve. The Sister Study (SISTER) is supported by the Intramural Research Programme of the NIH, National Institute of Environmental Health Sciences (Z01-ES044005 and Z01-ES049033). The SMC is funded by the Swedish Cancer Foundation and the Swedish Research Council (VR 2017-00644) grant for the Swedish Infrastructure for Medical Population-based Life-course Environmental Research (SIMPLER). The SZBCS was supported by Grant PBZ_KBN_122/P05/2004 and the programme of the Minister of Science and Higher Education under the name “Regional Initiative of Excellence” in 2019−2022 project number 002/RID/2018/19 amount of financing 12,000,000 PLN. The TNBCC was supported by: a Specialised Programme of Research Excellence (SPORE) in Breast Cancer (CA116201), a grant from the Breast Cancer Research Foundation, a generous gift from the David F. and Margaret T. Grohne Family Foundation. The UCIBCS component of this research was supported by the NIH [CA58860, CA92044] and the Lon V Smith Foundation [LVS39420]. The UKBGS is funded by Breast Cancer Now and the Institute of Cancer Research (ICR), London. ICR acknowledges NHS funding to the NIHR Biomedical Research Centre. The UKOPS study was funded by The Eve Appeal (The Oak Foundation) and supported by the National Institute for Health Research University College London Hospitals Biomedical Research Centre. The US3SS study was supported by Massachusetts (K.M.E., R01CA47305), Wisconsin (P.A.N., R01 CA47147) and New Hampshire (L.T.-E., R01CA69664) centres, and Intramural Research Funds of the National Cancer Institute, Department of Health and Human Services, USA. The USRT Study was funded by Intramural Research Funds of the National Cancer Institute, Department of Health and Human Services, USA.

Author information

Authors and Affiliations

Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
Joe Dennis, Kyriaki Michailidou, Leila Dorling, Manjeet K. Bolla, Qin Wang, Paul D. P. Pharoah & Douglas F. Easton
Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, UK
Jonathan P. Tyrer, Alison M. Dunning, Paul D. P. Pharoah & Douglas F. Easton
Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
Logan C. Walker
Biostatistics Unit, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
Kyriaki Michailidou
Cyprus School of Molecular Medicine, The Cyprus Institute of Neurology & Genetics, Nicosia, Cyprus
Kyriaki Michailidou
Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
Thomas U. Ahearn, Laura E. Beane Freeman, Jonine Figueroa, Montserrat García-Closas, Stella Koutros & Xiaohong R. Yang
Fred A. Litwin Center for Cancer Genetics, Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, ON, Canada
Irene L. Andrulis
Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
Irene L. Andrulis
Department of Medicine, Genetic Epidemiology Research Institute, University of California Irvine, Irvine, CA, USA
Hoda Anton-Culver & Argyrios Ziogas
N.N. Alexandrov Research Institute of Oncology and Medical Radiology, Minsk, Belarus
Natalia N. Antonenkova & Natalia V. Bogdanova
Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
Volker Arndt & Hermann Brenner
Department of Public Health Sciences, and Cancer Research Institute, Queen’s University, Kingston, ON, Canada
Kristan J. Aronson
Department of Gynecology and Obstetrics, Comprehensive Cancer Center Erlangen-EMN, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nuremberg (FAU), Erlangen, Germany
Matthias W. Beckmann & Peter A. Fasching
Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
Sabine Behrens, Jenny Chang-Claude, Audrey Jung & Rudolf Kaaks
Biomedical Network on Rare Diseases (CIBERER), Madrid, Spain
Javier Benitez
Human Cancer Genetics Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Javier Benitez & Anna González-Neira
Institute of Biochemistry and Genetics, Ufa Federal Research Centre of the Russian Academy of Sciences, Ufa, Russia
Marina Bermisheva & Elza Khusnutdinova
Department of Radiation Oncology, Hannover Medical School, Hannover, Germany
Natalia V. Bogdanova
Gynaecology Research Unit, Hannover Medical School, Hannover, Germany
Natalia V. Bogdanova, Thilo Dörk & Tjoung-Won Park-Simon
Copenhagen General Population Study, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
Stig E. Bojesen
Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
Stig E. Bojesen
Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Stig E. Bojesen
Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany
Hermann Brenner
German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
Hermann Brenner
Oncology and Genetics Unit, Instituto de Investigacion Sanitaria Galicia Sur (IISGS), Xerencia de Xestion Integrada de Vigo-SERGAS, Vigo, Spain
Jose E. Castelao
Cancer Epidemiology Group, University Cancer Center Hamburg (UCCH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Jenny Chang-Claude
Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
Georgia Chenevix-Trench
Westmead Institute for Medical Research, University of Sydney, Sydney, NSW, Australia
Christine L. Clarke & Christine Clarke
Department of Clinical Genetics, Erasmus University Medical Center, Rotterdam, The Netherlands
J. Margriet Collée
Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
Fergus J. Couch
Sheffield Institute for Nucleic Acids (SInFoNiA), Department of Oncology and Metabolism, University of Sheffield, Sheffield, UK
Angela Cox
Academic Unit of Pathology, Department of Neuroscience, University of Sheffield, Sheffield, UK
Simon S. Cross
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Kamila Czene, Mikael Eriksson, Marike Gabrielson & Per Hall
Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
Peter Devilee
Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Peter Devilee
Nutrition and Metabolism Section, International Agency for Research on Cancer (IARC-WHO), Lyon, France
Laure Dossus
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
A. Heather Eliassen
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
A. Heather Eliassen, Peter Kraft & Rulla M. Tamimi
Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
D. Gareth Evans
North West Genomics Laboratory Hub, Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
D. Gareth Evans
David Geffen School of Medicine, Department of Medicine Division of Hematology and Oncology, University of California at Los Angeles, Los Angeles, CA, USA
Peter A. Fasching
Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh, Edinburgh, UK
Jonine Figueroa
Cancer Research UK Edinburgh Centre, The University of Edinburgh, Edinburgh, UK
Jonine Figueroa & Ian Tomlinson
The Breast Cancer Now Toby Robins Research Centre, The Institute of Cancer Research, London, UK
Olivia Fletcher & Nichola Johnson
Department of Breast Surgery, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
Henrik Flyger
School of Public Health, Curtin University, Perth, WA, Australia
Lin Fritschi
Fundación Pública Galega de Medicina Xenómica, Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Complejo Hospitalario Universitario de Santiago, SERGAS, Santiago de Compostela, Spain
Manuela Gago-Dominguez
Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
Manuela Gago-Dominguez
Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Victoria, Australia
Graham G. Giles, Roger Milne, Melissa Southey, Roger L. Milne & Melissa C. Southey
Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
Graham G. Giles, John L. Hopper, Roger Milne & Roger L. Milne
Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia
Graham G. Giles, Roger Milne, Melissa Southey, Roger L. Milne & Melissa C. Southey
Center for Research in Epidemiology and Population Health (CESP), Team Exposome and Heredity, INSERM, University Paris-Saclay, Villejuif, France
Pascal Guénel & Thérèse Truong
Center for Familial Breast and Ovarian Cancer, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Eric Hahnen & Rita K. Schmutzler
Center for Integrated Oncology (CIO), Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Eric Hahnen & Rita K. Schmutzler
Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
Christopher A. Haiman
Department of Oncology, Södersjukhuset, Stockholm, Sweden
Per Hall & Sara Margolin
Department of Medical Oncology, Erasmus MC Cancer Institute, Rotterdam, The Netherlands
Antoinette Hollestelle & Agnes Jager
Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany
Reiner Hoppe
University of Tübingen, Tübingen, Germany
Reiner Hoppe
Division of Cancer Sciences, University of Manchester, Manchester, UK
Anthony Howell
Department of Genetics and Pathology, Pomeranian Medical University, Szczecin, Poland
Anna Jakubowska & Alicja Ogrodniczak
Independent Laboratory of Molecular Biology and Genetic Diagnostics, Pomeranian Medical University, Szczecin, Poland
Anna Jakubowska
Department of Epidemiology & Population Health, Stanford University School of Medicine, Stanford, CA, USA
Esther M. John & Allison W. Kurian
Department of Medicine, Division of Oncology, Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
Esther M. John & Allison W. Kurian
Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
Michael E. Jones & Anthony J. Swerdlow
Division of Molecular Pathology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
Renske Keeman & Marjanka K. Schmidt
Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa, Russia
Elza Khusnutdinova
Radiation Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
Cari M. Kitahara & Martha Linet
Department of Internal Medicine, Evangelische Kliniken Bonn gGmbH, Johanniter Krankenhaus, Bonn, Germany
Yon-Dschun Ko
Translational Cancer Research Area, University of Eastern Finland, Kuopio, Finland
Veli-Matti Kosma & Arto Mannermaa
Institute of Clinical Medicine, Pathology and Forensic Medicine, University of Eastern Finland, Kuopio, Finland
Veli-Matti Kosma & Arto Mannermaa
Biobank of Eastern Finland, Kuopio University Hospital, Kuopio, Finland
Veli-Matti Kosma & Arto Mannermaa
Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Peter Kraft
Department of Medical Genetics, Oslo University Hospital and University of Oslo, Oslo, Norway
Vessela N. Kristensen, Grethe I. Grenaker Alnæs & Vessela N. Kristensen
Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
Anne-Lise Børresen-Dale, Olav Engebråten & Vessela N. Kristensen
Department of Histopathology and Cytology, Clinical Hospital Acibadem Sistina, Skopje, Republic of North Macedonia
Katerina Kubelka-Sabit
Department of Computational and Quantitative Medicine, City of Hope, Duarte, CA, USA
James Lacey & James V. Lacey
City of Hope Comprehensive Cancer Center, City of Hope, Duarte, CA, USA
James V. Lacey
VIB Center for Cancer Biology, Leuven, Belgium
Diether Lambrechts
Laboratory for Translational Genetics, Department of Human Genetics, University of Leuven, Leuven, Belgium
Diether Lambrechts
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Nicole L. Larson & Janet E. Olson
Unit of Medical Genetics, Department of Medical Oncology and Hematology, Fondazione IRCCS Istituto Nazionale dei Tumori di Milano, Milan, Italy
Siranoush Manoukian
Department of Clinical Science and Education, Södersjukhuset, Karolinska Institutet, Stockholm, Sweden
Sara Margolin & Camilla Wendt
Department of Medical Oncology, University Hospital of Heraklion, Heraklion, Greece
Dimitrios Mavroudis
Department of Obstetrics and Gynecology, Helsinki University Hospital, University of Helsinki, Helsinki, Finland
Taru A. Muranen & Heli Nevanlinna
School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada
Rachel A. Murphy
Cancer Control Research, BC Cancer, Vancouver, Canada
Rachel A. Murphy
Department of Cancer Epidemiology, Clinical Sciences, Lund University, Lund, Sweden
Håkan Olsson
Department of Genetics, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Charles M. Perou
Genome Diagnostics Program, IFOM - the FIRC Institute of Molecular Oncology, Milan, Italy
Paolo Peterlongo
Research Centre for Genetic Engineering and Biotechnology ‘Georgi D. Efremov’, MASA, Skopje, Republic of North Macedonia
Dijana Plaseska-Karanfilska
Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit, Biocenter Oulu, University of Oulu, Oulu, Finland
Katri Pylkäs & Robert Winqvist
Laboratory of Cancer Genetics and Tumor Biology, Northern Finland Laboratory Centre Oulu, Oulu, Finland
Katri Pylkäs & Robert Winqvist
Clalit National Cancer Control Center, Carmel Medical Center and Technion Faculty of Medicine, Haifa, Israel
Gad Rennert & Rana Shibli
Department of Oncology, University Hospital of Larissa, Larissa, Greece
Emmanouil Saloustros
Epidemiology Branch, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC, USA
Dale P. Sandler & Jack A. Taylor
School of Cancer & Pharmaceutical Sciences, Comprehensive Cancer Centre, Guy’s Campus, King’s College London, London, UK
Elinor J. Sawyer
Division of Psychosocial Research and Epidemiology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek hospital, Amsterdam, The Netherlands
Marjanka K. Schmidt
Center for Molecular Medicine Cologne (CMMC), Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Rita K. Schmutzler
Department of Surgical Oncology, University Hospitals Leuven, Leuven, Belgium
Ann Smeets
Genomics Center, Centre Hospitalier Universitaire de Québec – Université Laval Research Center, Québec City, QC, Canada
Penny Soucy & Jacques Simard
Department of Clinical Pathology, The University of Melbourne, Melbourne, Victoria, Australia
Melissa Southey & Melissa C. Southey
Division of Breast Cancer Research, The Institute of Cancer Research, London, UK
Anthony J. Swerdlow
Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
Rulla M. Tamimi
Epigenetic and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC, USA
Jack A. Taylor
Department of Population Science, American Cancer Society, Atlanta, GA, USA
Lauren R. Teras
Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA
Mary Beth Terry
Department of Epidemiology, Gillings School of Global Public Health and UNC Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Melissa A. Troester
Department of Quantitative Health Sciences, Division of Epidemiology, Mayo Clinic, Rochester, MN, USA
Celine M. Vachon
Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
Alicja Wolk
Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
Alicja Wolk
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN, USA
Wei Zheng
Department of Research, Vestre Viken Hospital, Drammen, Norway
Kristine K. Sahlberg
Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
Kristine K. Sahlberg & Olav Engebråten
Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
Anne-Lise Børresen-Dale
Department of Community Medicine, The Arctic University of Norway, Tromsø, Norway
Inger Torhild Gram
Department of Oncology, Division of Surgery and Cancer and Transplantation Medicine, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
Bjørn Naume
Department of Oncology, Akershus University Hospital, Lørenskog, Norway
Jürgen Geisler
University of California, San Diego, CA, USA
Elena Martinez
The Westmead Institute for Medical Research, The University of Sydney, Sydney, NSW, Australia
Jane Carpenter, Dinny Graham & Mythily Sachchithananthan
University of Technology Sydney, Translational Oncology Group, School of Life Sciences, Faculty of Science, Ultimo, NSW, Australia
Deborah Marsh
School of Biomedical Sciences, University of Newcastle, Newcastle; Hunter Medical Research Institute and NSW Health Pathology North, Newcastle, Australia
Rodney Scott
Kolling Institute of Medical Research, University of Sydney, St Leonards, NSW, Australia
Robert Baxter
Epigenetics & Transcription Laboratory Melanie Swan Memorial Translational Centre, Sci-Tech, University of Canberra, Canberra, Australia
Desmond Yip
Department of Medical Oncology, The Canberra Hospital, Garran, ACT, Australia
Desmond Yip
The Canberra Hospital, Garran, ACT, Australia
Alison Davis
The Australian National University, Canberra, ACT, Australia
Alison Davis
Westmead Breast Cancer Institute, Western Sydney Local Health District, Westmead, NSW, Australia
Nirmala Pathmanathan
University of Sydney, Western Clinical School, Westmead, NSW, Australia
Nirmala Pathmanathan
UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Herston, QLD, Australia
Peter Simpson
Peter MacCallum Cancer Centre, Melbourne, Australia
Ian Campbell, Stephen Fox & Heather Thorne
Westmead Institute for Cancer Research, Sydney, Australia
Anna de Fazio
Department of Medicine, Westmead Hospital, Sydney, Australia
Judy Kirk
Walter and Eliza Hall Institute, Melbourne, Australia
Geoff Lindeman
Queensland Institute of Medical Research, Brisbane, Australia
Amanda Spurdle

Authors

Joe Dennis
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan P. Tyrer
View author publications
You can also search for this author in PubMed Google Scholar
Logan C. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Kyriaki Michailidou
View author publications
You can also search for this author in PubMed Google Scholar
Leila Dorling
View author publications
You can also search for this author in PubMed Google Scholar
Manjeet K. Bolla
View author publications
You can also search for this author in PubMed Google Scholar
Qin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Thomas U. Ahearn
View author publications
You can also search for this author in PubMed Google Scholar
Irene L. Andrulis
View author publications
You can also search for this author in PubMed Google Scholar
Hoda Anton-Culver
View author publications
You can also search for this author in PubMed Google Scholar
Natalia N. Antonenkova
View author publications
You can also search for this author in PubMed Google Scholar
Volker Arndt
View author publications
You can also search for this author in PubMed Google Scholar
Kristan J. Aronson
View author publications
You can also search for this author in PubMed Google Scholar
Laura E. Beane Freeman
View author publications
You can also search for this author in PubMed Google Scholar
Matthias W. Beckmann
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Behrens
View author publications
You can also search for this author in PubMed Google Scholar
Javier Benitez
View author publications
You can also search for this author in PubMed Google Scholar
Marina Bermisheva
View author publications
You can also search for this author in PubMed Google Scholar
Natalia V. Bogdanova
View author publications
You can also search for this author in PubMed Google Scholar
Stig E. Bojesen
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Jose E. Castelao
View author publications
You can also search for this author in PubMed Google Scholar
Jenny Chang-Claude
View author publications
You can also search for this author in PubMed Google Scholar
Georgia Chenevix-Trench
View author publications
You can also search for this author in PubMed Google Scholar
Christine L. Clarke
View author publications
You can also search for this author in PubMed Google Scholar
J. Margriet Collée
View author publications
You can also search for this author in PubMed Google Scholar
Fergus J. Couch
View author publications
You can also search for this author in PubMed Google Scholar
Angela Cox
View author publications
You can also search for this author in PubMed Google Scholar
Simon S. Cross
View author publications
You can also search for this author in PubMed Google Scholar
Kamila Czene
View author publications
You can also search for this author in PubMed Google Scholar
Peter Devilee
View author publications
You can also search for this author in PubMed Google Scholar
Thilo Dörk
View author publications
You can also search for this author in PubMed Google Scholar
Laure Dossus
View author publications
You can also search for this author in PubMed Google Scholar
A. Heather Eliassen
View author publications
You can also search for this author in PubMed Google Scholar
Mikael Eriksson
View author publications
You can also search for this author in PubMed Google Scholar
D. Gareth Evans
View author publications
You can also search for this author in PubMed Google Scholar
Peter A. Fasching
View author publications
You can also search for this author in PubMed Google Scholar
Jonine Figueroa
View author publications
You can also search for this author in PubMed Google Scholar
Olivia Fletcher
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Flyger
View author publications
You can also search for this author in PubMed Google Scholar
Lin Fritschi
View author publications
You can also search for this author in PubMed Google Scholar
Marike Gabrielson
View author publications
You can also search for this author in PubMed Google Scholar
Manuela Gago-Dominguez
View author publications
You can also search for this author in PubMed Google Scholar
Montserrat García-Closas
View author publications
You can also search for this author in PubMed Google Scholar
Graham G. Giles
View author publications
You can also search for this author in PubMed Google Scholar
Anna González-Neira
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Guénel
View author publications
You can also search for this author in PubMed Google Scholar
Eric Hahnen
View author publications
You can also search for this author in PubMed Google Scholar
Christopher A. Haiman
View author publications
You can also search for this author in PubMed Google Scholar
Per Hall
View author publications
You can also search for this author in PubMed Google Scholar
Antoinette Hollestelle
View author publications
You can also search for this author in PubMed Google Scholar
Reiner Hoppe
View author publications
You can also search for this author in PubMed Google Scholar
John L. Hopper
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Howell
View author publications
You can also search for this author in PubMed Google Scholar
Agnes Jager
View author publications
You can also search for this author in PubMed Google Scholar
Anna Jakubowska
View author publications
You can also search for this author in PubMed Google Scholar
Esther M. John
View author publications
You can also search for this author in PubMed Google Scholar
Nichola Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Michael E. Jones
View author publications
You can also search for this author in PubMed Google Scholar
Audrey Jung
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Kaaks
View author publications
You can also search for this author in PubMed Google Scholar
Renske Keeman
View author publications
You can also search for this author in PubMed Google Scholar
Elza Khusnutdinova
View author publications
You can also search for this author in PubMed Google Scholar
Cari M. Kitahara
View author publications
You can also search for this author in PubMed Google Scholar
Yon-Dschun Ko
View author publications
You can also search for this author in PubMed Google Scholar
Veli-Matti Kosma
View author publications
You can also search for this author in PubMed Google Scholar
Stella Koutros
View author publications
You can also search for this author in PubMed Google Scholar
Peter Kraft
View author publications
You can also search for this author in PubMed Google Scholar
Vessela N. Kristensen
View author publications
You can also search for this author in PubMed Google Scholar
Katerina Kubelka-Sabit
View author publications
You can also search for this author in PubMed Google Scholar
Allison W. Kurian
View author publications
You can also search for this author in PubMed Google Scholar
James V. Lacey
View author publications
You can also search for this author in PubMed Google Scholar
Diether Lambrechts
View author publications
You can also search for this author in PubMed Google Scholar
Nicole L. Larson
View author publications
You can also search for this author in PubMed Google Scholar
Martha Linet
View author publications
You can also search for this author in PubMed Google Scholar
Alicja Ogrodniczak
View author publications
You can also search for this author in PubMed Google Scholar
Arto Mannermaa
View author publications
You can also search for this author in PubMed Google Scholar
Siranoush Manoukian
View author publications
You can also search for this author in PubMed Google Scholar
Sara Margolin
View author publications
You can also search for this author in PubMed Google Scholar
Dimitrios Mavroudis
View author publications
You can also search for this author in PubMed Google Scholar
Roger L. Milne
View author publications
You can also search for this author in PubMed Google Scholar
Taru A. Muranen
View author publications
You can also search for this author in PubMed Google Scholar
Rachel A. Murphy
View author publications
You can also search for this author in PubMed Google Scholar
Heli Nevanlinna
View author publications
You can also search for this author in PubMed Google Scholar
Janet E. Olson
View author publications
You can also search for this author in PubMed Google Scholar
Håkan Olsson
View author publications
You can also search for this author in PubMed Google Scholar
Tjoung-Won Park-Simon
View author publications
You can also search for this author in PubMed Google Scholar
Charles M. Perou
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Peterlongo
View author publications
You can also search for this author in PubMed Google Scholar
Dijana Plaseska-Karanfilska
View author publications
You can also search for this author in PubMed Google Scholar
Katri Pylkäs
View author publications
You can also search for this author in PubMed Google Scholar
Gad Rennert
View author publications
You can also search for this author in PubMed Google Scholar
Emmanouil Saloustros
View author publications
You can also search for this author in PubMed Google Scholar
Dale P. Sandler
View author publications
You can also search for this author in PubMed Google Scholar
Elinor J. Sawyer
View author publications
You can also search for this author in PubMed Google Scholar
Marjanka K. Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Rita K. Schmutzler
View author publications
You can also search for this author in PubMed Google Scholar
Rana Shibli
View author publications
You can also search for this author in PubMed Google Scholar
Ann Smeets
View author publications
You can also search for this author in PubMed Google Scholar
Penny Soucy
View author publications
You can also search for this author in PubMed Google Scholar
Melissa C. Southey
View author publications
You can also search for this author in PubMed Google Scholar
Anthony J. Swerdlow
View author publications
You can also search for this author in PubMed Google Scholar
Rulla M. Tamimi
View author publications
You can also search for this author in PubMed Google Scholar
Jack A. Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Lauren R. Teras
View author publications
You can also search for this author in PubMed Google Scholar
Mary Beth Terry
View author publications
You can also search for this author in PubMed Google Scholar
Ian Tomlinson
View author publications
You can also search for this author in PubMed Google Scholar
Melissa A. Troester
View author publications
You can also search for this author in PubMed Google Scholar
Thérèse Truong
View author publications
You can also search for this author in PubMed Google Scholar
Celine M. Vachon
View author publications
You can also search for this author in PubMed Google Scholar
Camilla Wendt
View author publications
You can also search for this author in PubMed Google Scholar
Robert Winqvist
View author publications
You can also search for this author in PubMed Google Scholar
Alicja Wolk
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohong R. Yang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Argyrios Ziogas
View author publications
You can also search for this author in PubMed Google Scholar
Jacques Simard
View author publications
You can also search for this author in PubMed Google Scholar
Alison M. Dunning
View author publications
You can also search for this author in PubMed Google Scholar
Paul D. P. Pharoah
View author publications
You can also search for this author in PubMed Google Scholar
Douglas F. Easton
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

NBCS Collaborators

Vessela N. Kristensen
, Kristine K. Sahlberg
, Anne-Lise Børresen-Dale
, Inger Torhild Gram
, Olav Engebråten
, Bjørn Naume
, Jürgen Geisler
& Grethe I. Grenaker Alnæs

CTS Consortium

James Lacey
& Elena Martinez

ABCTB Investigators

Christine Clarke
, Jane Carpenter
, Deborah Marsh
, Rodney Scott
, Robert Baxter
, Desmond Yip
, Alison Davis
, Nirmala Pathmanathan
, Peter Simpson
, Dinny Graham
& Mythily Sachchithananthan

kConFab/AOCS Investigators

Ian Campbell
, Georgia Chenevix-Trench
, Anna de Fazio
, Stephen Fox
, Judy Kirk
, Geoff Lindeman
, Roger Milne
, Melissa Southey
, Amanda Spurdle
& Heather Thorne

Contributions

Data management: M.K.B., Q.W., and R.Ke. CNV detection: J.D. and L.W. Statistical/bioinformatic analysis: J.D., J.T., K.M., and L.Dorl. Conceived the OncoArray and obtained financial support: J.S., P.K., and D.F.E. Provided DNA samples and/or phenotypic data: T.A., I.A., H.A., N.A., V.A., K.A., L.B., M.W.B., S.Beh, J.B., M.B., N.B., S.Boj, H.B., J.E.C., J.C.-C., G.C., C.L.C., J.M.C., F.C., A.C., S.C., K.C., P.D., T.D., L.Doss, A.E., M.E., D.G.E., P.F., J.F., O.F., H.F., L.F., M.G., M.G.-D., M.G.-G., G.G., A.G., P.G., E.H., C.H., P.H., A.Hol, R.H., J.H., A.How, A.Jag, A.Jak, E.J., N.J., M.J., A.Jung, R.Ka, E.K., C.K., Y.K., V.-M.K., S.K., P.K., V.N.K., K.K.-S., A.K., J.L., D.L., N.L., M.L., A.L., A.M., S.Man, S.Mar, D.M., R.L.M., T.M., R.A.M., H.N., J.O., H.O., T.P., C.P., P.P., P.D.P.P., D.P.-K., K.P., G.R., E.Sal, D.S., E.J.S., M.S., R.K.S., R.S., J.S., A.S., P.S., M.C.S., A.J.S., R.M.T., J.A.T., L.R.T., M.B.T., I.T., M.A.T., T.T., C.V., C.W., R.W., A.W., X.Y., W.Z., A.Z., A.D., D.F.E., ABCTB Investigators, kConFab/AOCS Investigators, NBCS Collaborators, CTS Consortium. Directed the project: D.F.E. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Joe Dennis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Enrique Hernández-Lemus and Tao Huang for their contribution to the peer review of this work. Primary Handling Editor: Eve Rogers.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1-21

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dennis, J., Tyrer, J.P., Walker, L.C. et al. Rare germline copy number variants (CNVs) and breast cancer risk. Commun Biol 5, 65 (2022). https://doi.org/10.1038/s42003-021-02990-6

Download citation

Received: 08 June 2021
Accepted: 01 December 2021
Published: 18 January 2022
DOI: https://doi.org/10.1038/s42003-021-02990-6
Springer Nature Limited

This article is cited by

Germline mutations of breast cancer susceptibility genes through expanded genetic analysis in unselected Colombian patients
- Diana Carolina Sierra-Díaz
- Adrien Morel
- Carlos M. Restrepo
Human Genomics (2024)
NTHL1 is a recessive cancer susceptibility gene
- Anna K. Nurmi
- Liisa M. Pelttari
- Heli Nevanlinna
Scientific Reports (2023)

Rare germline copy number variants (CNVs) and breast cancer risk

Abstract

Similar content being viewed by others

Introduction

Results

Summary of CNVs detected

CNVs associated with overall risk

Associations with risk of breast cancer subtypes

Associations at established common susceptibility Loci

Gene burden tests

Direction of effect tests

Discussion

Methods

Subjects

Arrays

CNV calling

Sample quality control

Association tests

Ethical approval

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

NBCS Collaborators

CTS Consortium

ABCTB Investigators

kConFab/AOCS Investigators

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation