Abstract
Single germline nucleotide pathogenic variants have been identified in 12 breast cancer predisposition genes, but structural deletions in these genes remain poorly characterized. We conducted in-depth whole genome sequencing (WGS) in genomic DNA samples obtained from 1340 invasive breast cancer cases and 675 controls of African ancestry. We identified 25 deletions in the intragenic regions of ten established breast cancer predisposition genes based on a consensus call from six state-of-the-art SV callers. Overall, no significant case–control difference was found in the frequency of these deletions. However, 1.0% of cases and 0.3% of controls carried any of the eight putative protein-truncating rare deletions located in BRCA1, BRCA2, CDH1, TP53, NF1, RAD51D, RAD51C and CHEK2, resulting in an odds ratio (OR) of 3.29 (95% CI 0.74–30.16). We also identified a low-frequency deletion in NF1 associated with breast cancer risk (OR 1.93, 95% CI 1.14–3.42). In addition, we detected 56 deletions, including six putative protein-truncating deletions, in suspected breast predisposition genes. This is the first large study to systematically search for structural deletions in breast cancer predisposition genes. Many of the deletions, particularly those resulting in protein truncations, are likely to be pathogenic. Results from this study, if confirmed in future large-scale studies, could have significant implications for genetic testing for this common cancer.
Similar content being viewed by others
Availability of data and materials
Access to the whole genome sequencing data could be requested by submission of an inquiry to Dr. Wei Zheng.
Code availability
Access to the custom code could be requested by submission of an inquiry to Dr. Wei Zheng.
References
Alkan C, Coe BP, Eichler EE (2011) Genome structural variation discovery and genotyping. Nat Rev Genet 12:363–376. https://doi.org/10.1038/nrg2958
Brinton LA, Awuah B, Nat Clegg-Lamptey J, Wiafe-Addai B, Ansong D, Nyarko KM, Wiafe S, Yarney J, Biritwum R, Brotzman M, Adjei AA, Adjei E, Aitpillah F, Edusei L, Dedey F, Nyante SJ, Oppong J, Osei-Bonsu E, Titiloye N, Vanderpuye V, Brew Abaidoo E, Arhin B, Boakye I, Frempong M, Ohene Oti N, Okyne V, Figueroa JD (2017) Design considerations for identifying breast cancer risk factors in a population-based study in Africa. Int J Cancer 140:2667–2677. https://doi.org/10.1002/ijc.30688
Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, Kallberg M, Cox AJ, Kruglyak S, Saunders CT (2016) Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32:1220–1222. https://doi.org/10.1093/bioinformatics/btv710
Chiang C, Layer RM, Faust GG, Lindberg MR, Rose DB, Garrison EP, Marth GT, Quinlan AR, Hall IM (2015) SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat Methods 12:966–968. https://doi.org/10.1038/nmeth.3505
Consortium EP (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74. https://doi.org/10.1038/nature11247
Couch FJ, Shimelis H, Hu CL, Hart SN, Polley EC, Na J, Hallberg E, Moore R, Thomas A, Lilyquist J, Feng BJ, McFarland R, Pesaran T, Huether R, LaDuca H, Chao EC, Goldgar DE, Dolinsky JS (2017) Associations between cancer predisposition testing panel genes and breast cancer. JAMA Oncol 3:1190–1196. https://doi.org/10.1001/jamaoncol.2017.0424
Cui Y, Deming-Halverson SL, Beeghly-Fadiel A, Lipworth L, Shrubsole MJ, Fair AM, Shu XO, Zheng W (2014) Interactions of hormone replacement therapy, body weight, and bilateral oophorectomy in breast cancer risk. Clin Cancer Res 20:1169–1178. https://doi.org/10.1158/1078-0432.CCR-13-2094
Easton DF, Pharoah PD, Antoniou AC, Tischkowitz M, Tavtigian SV, Nathanson KL, Devilee P, Meindl A, Couch FJ, Southey M, Goldgar DE, Evans DG, Chenevix-Trench G, Rahman N, Robson M, Domchek SM, Foulkes WD (2015) Gene-panel sequencing and the prediction of breast-cancer risk. N Engl J Med 372:2243–2257. https://doi.org/10.1056/NEJMsr1501341
Figueroa JD, Davis Lynn BC, Edusei L, Titiloye N, Adjei E, Clegg-Lamptey JN, Yarney J, Wiafe-Addai B, Awuah B, Duggan MA, Wiafe S, Nyarko K, Aitpillah F, Ansong D, Hewitt SM, Ahearn T, Garcia-Closas M, Brinton LA, Ghana Breast Health Study T (2020) Reproductive factors and risk of breast cancer by tumor subtypes among Ghanaian women: a population-based case-control study. Int J Cancer. https://doi.org/10.1002/ijc.32929
Gonzalez KD, Noltner KA, Buzin CH, Gu D, Wen-Fong CY, Nguyen VQ, Han JH, Lowstuter K, Longmate J, Sommer SS, Weitzel JN (2009) Beyond Li Fraumeni Syndrome: clinical characteristics of families with p53 germline mutations. J Clin Oncol 27:1250–1256. https://doi.org/10.1200/JCO.2008.16.6959
Guo X, Delio M, Haque N, Castellanos R, Hestand MS, Vermeesch JR, Morrow BE, Zheng D (2016) Variant discovery and breakpoint region prediction for studying the human 22q11.2 deletion using BAC clone and whole genome sequencing analysis. Hum Mol Genet 25:3754–3767. https://doi.org/10.1093/hmg/ddw221
Guo XY, Shi JJ, Cai QY, Shu XO, He J, Wen WQ, Allen J, Pharoah P, Dunning A, Hunter DJ, Kraft P, Easton DF, Zheng W, Long JR (2018) Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes. Hum Mol Genet 27:853–859. https://doi.org/10.1093/hmg/ddy005
Guo X, Lin W, Bai M, Li H, Wen W, Zeng C, Chen Z, He J, Chen J, Cai Q, Long J, Jia WH, Shu XO, Zheng W (2019) Discovery of a pathogenic variant rs139379666 (p. P2974L) in ATM for breast cancer risk in Chinese populations. Cancer Epidemiol Biomarkers Prev 28:1308–1315. https://doi.org/10.1158/1055-9965.EPI-18-1294
Guo X, Long J, Chen Z, Shu XO, Xiang YB, Wen W, Zeng C, Gao YT, Cai Q, Zheng W (2020) Discovery of rare coding variants in OGDHL and BRCA2 in relation to breast cancer risk in Chinese women. Int J Cancer 146:2175–2181. https://doi.org/10.1002/ijc.32825
Handsaker RE, Korn JM, Nemesh J, McCarroll SA (2011) Discovery and genotyping of genome structural polymorphism by sequencing on a population scale. Nat Genet 43:269–276. https://doi.org/10.1038/ng.768
Hu C, Hart SN, Gnanaolivu R, Huang H, Lee KY, Na J, Gao C, Lilyquist J, Yadav S, Boddicker NJ, Samara R, Klebba J, Ambrosone CB, Anton-Culver H, Auer P, Bandera EV, Bernstein L, Bertrand KA, Burnside ES, Carter BD, Eliassen H, Gapstur SM, Gaudet M, Haiman C, Hodge JM, Hunter DJ, Jacobs EJ, John EM, Kooperberg C, Kurian AW, Le Marchand L, Lindstroem S, Lindstrom T, Ma H, Neuhausen S, Newcomb PA, O’Brien KM, Olson JE, Ong IM, Pal T, Palmer JR, Patel AV, Reid S, Rosenberg L, Sandler DP, Scott C, Tamimi R, Taylor JA, Trentham-Dietz A, Vachon CM, Weinberg C, Yao S, Ziogas A, Weitzel JN, Goldgar DE, Domchek SM, Nathanson KL, Kraft P, Polley EC, Couch FJ (2021) A population-based study of genes previously implicated in breast cancer. N Engl J Med 384:440–451. https://doi.org/10.1056/NEJMoa2005936
Jeffares DC, Jolly C, Hoti M, Speed D, Shaw L, Rallis C, Balloux F, Dessimoz C, Bahler J, Sedlazeck FJ (2017) Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast. Nat Commun 8:14061. https://doi.org/10.1038/ncomms14061
Karczewski KJ, Francioli LC, Tiao G et al (2020) The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581:434–443
Kolonel LN, Altshuler D, Henderson BE (2004) The multiethnic cohort study: exploring genes, lifestyle and cancer risk. Nat Rev Cancer 4:519–527. https://doi.org/10.1038/nrc1389
Layer RM, Chiang C, Quinlan AR, Hall IM (2014) LUMPY: a probabilistic framework for structural variant discovery. Genome Biol 15:R84. https://doi.org/10.1186/gb-2014-15-6-r84
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. https://doi.org/10.1093/bioinformatics/btp324
Lu HM, Li S, Black MH, Lee S, Hoiness R, Wu S, Mu W, Huether R, Chen J, Sridhar S, Tian Y, McFarland R, Dolinsky J, Tippin Davis B, Mexal S, Dunlop C, Elliott A (2019) Association of breast and ovarian cancers with predisposition genes identified by large-scale sequencing. JAMA Oncol 5:51–57. https://doi.org/10.1001/jamaoncol.2018.2956
Miki Y, Swensen J, Shattuck-Eidens D, Futreal PA, Harshman K, Tavtigian S, Liu Q, Cochran C, Bennett LM, Ding W et al (1994) A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science 266:66–71. https://doi.org/10.1126/science.7545954
Pedersen BS, Quinlan AR (2019) Duphold: scalable, depth-based annotation and curation of high-confidence structural variant calls. Gigascience. https://doi.org/10.1093/gigascience/giz040
Rahman N, Seal S, Thompson D, Kelly P, Renwick A, Elliott A, Reid S, Spanova K, Barfoot R, Chagtai T, Jayatilake H, McGuffog L, Hanks S, Evans DG, Eccles D, Breast Cancer Susceptibility C, Easton DF, Stratton MR (2007) PALB2, which encodes a BRCA2-interacting protein, is a breast cancer susceptibility gene. Nat Genet 39:165–167. https://doi.org/10.1038/ng1959
Rausch T, Zichner T, Schlattl A, Stutz AM, Benes V, Korbel JO (2012) DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics 28:i333–i339. https://doi.org/10.1093/bioinformatics/bts378
Renwick A, Thompson D, Seal S, Kelly P, Chagtai T, Ahmed M, North B, Jayatilake H, Barfoot R, Spanova K, McGuffog L, Evans DG, Eccles D, Breast Cancer Susceptibility C, Easton DF, Stratton MR, Rahman N (2006) ATM mutations that cause ataxia-telangiectasia are breast cancer susceptibility alleles. Nat Genet 38:873–875. https://doi.org/10.1038/ng1837
Roller E, Ivakhno S, Lee S, Royce T, Tanner S (2016) Canvas: versatile and scalable detection of copy number variants. Bioinformatics 32:2375–2377. https://doi.org/10.1093/bioinformatics/btw163
Signorello LB, Hargreaves MK, Steinwandel MD, Zheng W, Cai Q, Schlundt DG, Buchowski MS, Arnold CW, McLaughlin JK, Blot WJ (2005) Southern community cohort study: establishing a cohort to investigate health disparities. J Natl Med Assoc 97:972–979
Stankiewicz P, Lupski JR (2010) Structural variation in the human genome and its role in disease. Annu Rev Med 61:437–455. https://doi.org/10.1146/annurev-med-100708-204735
Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Fritz MH, Konkel MK, Malhotra A, Stutz AM, Shi X, Casale FP, Chen J, Hormozdiari F, Dayama G, Chen K, Malig M, Chaisson MJP, Walter K, Meiers S, Kashin S, Garrison E, Auton A, Lam HYK, Mu XJ, Alkan C, Antaki D, Bae T, Cerveira E, Chines P, Chong Z, Clarke L, Dal E, Ding L, Emery S, Fan X, Gujral M, Kahveci F, Kidd JM, Kong Y, Lameijer EW, McCarthy S, Flicek P, Gibbs RA, Marth G, Mason CE, Menelaou A, Muzny DM, Nelson BJ, Noor A, Parrish NF, Pendleton M, Quitadamo A, Raeder B, Schadt EE, Romanovitch M, Schlattl A, Sebra R, Shabalin AA, Untergasser A, Walker JA, Wang M, Yu F, Zhang C, Zhang J, Zheng-Bradley X, Zhou W, Zichner T, Sebat J, Batzer MA, McCarroll SA, Genomes Project C, Mills RE, Gerstein MB, Bashir A, Stegle O, Devine SE, Lee C, Eichler EE, Korbel JO (2015) An integrated map of structural variation in 2,504 human genomes. Nature 526:75–81. https://doi.org/10.1038/nature15394
Tan MH, Mester JL, Ngeow J, Rybicki LA, Orloff MS, Eng C (2012) Lifetime cancer risks in individuals with germline PTEN mutations. Clin Cancer Res 18:400–407. https://doi.org/10.1158/1078-0432.CCR-11-2283
Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, Jordan T, Shakir K, Roazen D, Thibault J, Banks E, Garimella KV, Altshuler D, Gabriel S, DePristo MA (2013) From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 43:11. https://doi.org/10.1002/0471250953.bi1110s43
Ye K, Schulz MH, Long Q, Apweiler R, Ning Z (2009) Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25:2865–2871. https://doi.org/10.1093/bioinformatics/btp394
Zeng C, Guo X, Wen W, Shi J, Long J, Cai Q, Shu XO, Xiang Y, Zheng W (2020) Evaluation of pathogenetic mutations in breast cancer predisposition genes in population-based studies conducted among Chinese women. Breast Cancer Res Treat 181:465–473. https://doi.org/10.1007/s10549-020-05643-0
Zheng W, Long J, Gao YT, Li C, Zheng Y, Xiang YB, Wen W, Levy S, Deming SL, Haines JL, Gu K, Fair AM, Cai Q, Lu W, Shu XO (2009) Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1. Nat Genet 41:324–328. https://doi.org/10.1038/ng.318
Acknowledgements
The authors thank the study participants and research staff for their contributions and support for this project. The study was supported primarily by NIH grant R01CA202981. The data analyses were conducted using the Advanced Computing Center for Research and Education (ACCRE) at Vanderbilt University. The Nashville Breast Health Study and the Southern Community Cohort Study are supported by NIH grants R01CA100374 and U01CA202929, respectively. The GBHS was funded by Intramural Funds of the National Cancer Institute, USA. Samples from the GBHS were processed at the Cancer Genomics Research Laboratory, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Gaithersburg, MD.
Ghana Breast Health Study team: Ghana Statistical Service, Accra, Ghana: Dr Robertson Adjei and Dr Lucy Afriyie. Korle Bu Teaching Hospital, Accra, Ghana: Dr Anthony Adjei, Dr Florence Dedey, Dr Verna Vanderpuye, Victoria Okyne, Naomi Ohene Oti, Evelyn Tay, Dr Adu‐Aryee, Angela Kenu and Obed Ekpedzor. Komfo Anoyke Teaching Hospital, Kumasi, Ghana: Marion Alcpaloo, Isaac Boakye, Bernard Arhin, Emmanuel Assimah, Samuel Ka‐chungu, Dr Joseph Oppong and Dr Ernest Osei‐Bonsu. Peace and Love Hospital, Kumasi, Ghana: Prof Margaret Frempong, Emma Brew Abaidoo, Bridget Nortey Mensah, Samuel Amanama, Prince Agyapong, Debora Boateng, Ansong Thomas Agyei, Richard Opoku and Kofi Owusu Gyimah. National Cancer Institute, Division of Cancer Epidemiology and Genetics, Bethesda, MD, USA: Louise Brinton; Westat, Inc.: Michelle Brotzman, Shelley Niwa, Usha Singh and Ann Truelove. University of Ghana: Prof Richard Biritwum.
Funding
The study was supported primarily by NIH grant R01CA202981.
Author information
Authors and Affiliations
Consortia
Contributions
Study design: WZ and XG; data analysis: ZC, XG, and JP; data interpretation: JL, QC, X-OS, and WZ; drafting of the manuscript: ZC, XG, and WZ; review of the manuscript: ZC, XG, JL, QC, X-OS, PJR, CAH, MG-C, and WZ; Administrative, technical, or material support: BL, FMK, TUA, SDO, GJ, JF, the GBHS team, PJR, MS, CAH, WJB, MG-C, and WZ.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
Each sub-study was approved by the relevant research ethics committee or institutional review board.
Consent to participate
All participants provided written informed consent.
Consent for publication
All authors have read and approved the manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The members of the Ghana Breast Health Study team are listed in acknowledgements section.
Supplementary Information
Below is the link to the electronic supplementary material.
439_2021_2342_MOESM3_ESM.pdf
Supplementary file3 Supplementary Table 3: List of 40 established and suspected breast cancer predisposition genes included in this study (PDF 20 KB)
439_2021_2342_MOESM4_ESM.pdf
Supplementary file4 Supplementary Table 4: List of 81 SV deletions identified in established and suspected breast cancer predisposition genes (PDF 165 KB)
439_2021_2342_MOESM5_ESM.pdf
Supplementary file5 Supplementary Table 5: Characterization of 25 potential pathogenetic SV deletions identified in established breast cancer predisposition genes (PDF 1554 KB)
439_2021_2342_MOESM6_ESM.pdf
Supplementary file6 Supplementary Table 6: List of SV deletions identified in established predisposition genes with high reciprocal overlap (> 90%) with deletions in gnomAD (PDF 14 KB)
439_2021_2342_MOESM7_ESM.jpg
Supplementary file7 Supplementary Fig 1: SV deletion in the intronic region of the PTEN gene in nine cases. From top to bottom, gene structure; layeredH3K4Me1; DNase clusters; clustered ChIP-seq binding sites; The signals of layer H3K4 methylation from different ENCODE cell lines are shown indifferent colors (JPG 161 KB)
Rights and permissions
About this article
Cite this article
Chen, Z., Guo, X., Long, J. et al. Discovery of structural deletions in breast cancer predisposition genes using whole genome sequencing data from > 2000 women of African-ancestry. Hum Genet 140, 1449–1457 (2021). https://doi.org/10.1007/s00439-021-02342-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-021-02342-8