Skip to main content
Log in

Characterization of intermediate-sized insertions using whole-genome sequencing data and analysis of their functional impact on gene expression

  • Original Investigation
  • Published:
Human Genetics Aims and scope Submit manuscript

Abstract

Intermediate-sized insertions are one of the structural variants contributing to genome diversity. However, due to technical difficulties in identifying them, their importance in disease pathogenicity and gene expression regulation remains unclear. We used whole-genome sequencing data of 174 Japanese samples to characterize intermediate-sized insertions using a highly-accurate insertion calling method (IMSindel software and joint-call recovery) and obtained a catalogue of 4254 insertions. We constructed an imputation panel comprising of insertions and SNVs from all samples, and conducted imputation of intermediate-sized insertions for 82 publicly-available Japanese samples. Positive Predictive Value of imputation, evaluated using Nanopore long-read sequencing data, was 97%. Subsequent eQTL analysis predicted 128 (~ 3.0%) insertions as causative for gene expression level changes. Enrichment analysis of causal insertions for genome regulatory elements showed significant associations with CTCF-binding sites, super-enhancers, and promoters. Among 17 causal insertions found in the same causal set with GWAS hits, there were insertions associated with changes in expression of cancer-related genes such as BRCA1, ZNF222, and ABCB10. Analysis of insertions sequences revealed that 461 insertions were short tandem duplications frequently found in early-replicating regions of genome. Furthermore, comparison of functional importance of intermediate-sized insertions with that of intermediate-sized deletions detected in the same sample set in our previous study showed that insertions were more frequent in genic regions, and proportion of functional candidates was smaller in insertions. Here, we characterize a high-confidence set of intermediate-sized insertions and indicate their importance in gene expression regulation. Our results emphasize the importance of considering intermediate-sized insertions in trait association studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Availability of data and material

The data used in the current study are as follows: Nanopore MinION long-read sequencing data for sample RK067 are available from the Japanese Genotype–phenotype Archive (JGA) database (https://ddbj.nig.ac.jp/jga/viewer/view/studies) under accession number JGAS00000000180. Nanopore MinION long-read sequencing data for sample NA18943 are available from the DNA Data Bank of Japan (DDBJ) Sequence Read Archive (DRA) (https://www.ddbj.nig.ac.jp/dra/index-e.html) under accession number DRA008482. The reference panel used for the imputation in the current study is not publicly available due to the risk of compromise of participants’ privacy but is available from the corresponding author on reasonable request. Other web resources or tools used in this study are as follows: IMSindel software: https://github.com/NCGG-MGC/IMSindel; PLINK: https://www.cog-genomics.org/plink2; IMPUTE2: http://mathgen.stats.ox.ac.uk/impute/impute_v2.html; MatrixeQTL: http://www.bios.unc.edu/research/genomic_software/Matrix_eQTL/; CAVIAR: http://genetics.cs.ucla.edu/caviar/. RepeatMasker software developed by A.F.A. Smit, R. Hubley, and P. Green; see http://www.repeatmasker.org/. All statistical tests were conducted in R (ver3.5.0): https://www.r-project.org/. GTEx portal database: http://gtexportal.org/home/. Gene expression data for 82 Japanese HapMap samples used for the eQTL association analysis were obtained from EMBL-EBI ArrayExpress under accession E-MTAB-264 (Montgomery). Published GWAS association results are available from NHGRI-EBI Catalogue of published GWAS: https://www.ebi.ac.uk/gwas/home. Super-enhancer regions data are available from dbSUPER (http://asntech.org/dbsuper/). CTCF-binding site data are available from CTCFBSDB2.0 (http://insulatordb.uthsc.edu/). Ensembl regulatory regions data were obtained using BioMart for Ensembl Release 93 (https://grch37.ensembl.org/info/index.html). UCSC phastCons genome conservation scores are available from the UCSC genome browser (http://hgdownload.cse.ucsc.edu/goldenpath/hg19/phastCons46way/).

Code availability

Source code is available from https://github.com/Saeideh-Ashouri/Intermediate-Sized-Insertions

References

  • Brandler WM, Antaki D, Gujral M, Kleiber ML, Whitney J, Maile MS, Hong O, Chapman TR, Tan S, Tandon P, Pang T, Tang SC, Vaux KK, Yang Y, Harrington E, Juul S, Turner DJ, Thiruvahindrapuram B, Kaur G, Wang Z, Kingsmore SF, Gleeson JG, Bisson D, Kakaradov B, Telenti A, Venter JC, Corominas R, Toma C, Cormand B, Rueda I, Guijarro S, Messer KS, Nievergelt CM, Arranz MJ, Courchesne E, Pierce K, Muotri AR, Iakoucheva LM, Hervas A, Scherer SW, Corsello C, Sebat J (2018) Paternally inherited cis-regulatory structural variants are associated with autism. Science 360(6386):327–331. https://doi.org/10.1126/science.aan2261

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Casper J, Zweig AS, Villarreal C, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, Lee CM, Lee BT, Karolchik D, Hinrichs AS, Haeussler M, Guruvadoo L, Navarro Gonzalez J, Gibson D, Fiddes IT, Eisenhart C, Diekhans M, Clawson H, Barber GP, Armstrong J, Haussler D, Kuhn RM, Kent WJ (2018) The UCSC genome browser database: 2018 update. Nucleic Acids Res 46(D1):D762–D769. https://doi.org/10.1093/nar/gkx1020

    Article  CAS  PubMed  Google Scholar 

  • Chahal HS, Wu W, Ransohoff KJ, Yang L, Hedlin H, Desai M, Lin Y, Dai H-J, Qureshi AA, Li W-Q, Kraft P, Hinds DA, Tang JY, Han J, Sarin KY (2016) Genome-wide association study identifies 14 novel risk alleles associated with basal cell carcinoma. Nat Commun. https://doi.org/10.1038/ncomms12510

    Article  PubMed  PubMed Central  Google Scholar 

  • Chaisson MJP, Sanders AD, Zhao X, Malhotra A, Porubsky D, Rausch T, Gardner EJ, Rodriguez OL, Guo L, Collins RL, Fan X, Wen J, Handsaker RE, Fairley S, Kronenberg ZN, Kong X, Hormozdiari F, Lee D, Wenger AM, Hastie AR, Antaki D, Anantharaman T, Audano PA, Brand H, Cantsilieris S, Cao H, Cerveira E, Chen C, Chen X, Chin C-S, Chong Z, Chuang NT, Lambert CC, Church DM, Clarke L, Farrell A, Flores J, Galeev T, Gorkin DU, Gujral M, Guryev V, Heaton WH, Korlach J, Kumar S, Kwon JY, Lam ET, Lee JE, Lee J, Lee W-P, Lee SP, Li S, Marks P, Viaud-Martinez K, Meiers S, Munson KM, Navarro FCP, Nelson BJ, Nodzak C, Noor A, Kyriazopoulou-Panagiotopoulou S, Pang AWC, Qiu Y, Rosanio G, Ryan M, Stütz A, Spierings DCJ, Ward A, Welch AE, Xiao M, Xu W, Zhang C, Zhu Q, Zheng-Bradley X, Lowy E, Yakneen S, McCarroll S, Jun G, Ding L, Koh CL, Ren B, Flicek P, Chen K, Gerstein MB, Kwok P-Y, Lansdorp PM, Marth GT, Sebat J, Shi X, Bashir A, Ye K, Devine SE, Talkowski ME, Mills RE, Marschall T, Korbel JO, Eichler EE, Lee C (2019) Multi-platform discovery of haplotype-resolved structural variation in human genomes. Nat Commun. https://doi.org/10.1038/s41467-018-08148-z

    Article  PubMed  PubMed Central  Google Scholar 

  • Chen Y, Ye X, Xia X, Lin X (2019) Circular RNA ABCB10 correlates with advanced clinicopathological features and unfavorable survival, and promotes cell proliferation while reduces cell apoptosis in epithelial ovarian cancer. Cancer Biomark 26(2):151–161. https://doi.org/10.3233/CBM-190064

    Article  CAS  PubMed  Google Scholar 

  • Chiang C, Scott AJ, Davis JR, Tsang EK, Li X, Kim Y, Hadzic T, Damani FN, Ganel L, Montgomery SB, Battle A, Conrad DF, Hall IM (2017) The impact of structural variation on human gene expression. Nat Genet 49(5):692–699. https://doi.org/10.1038/ng.3834

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Day FR, Ruth KS, Thompson DJ, Lunetta KL, Pervjakova N, Chasman DI, Stolk L, Finucane HK, Sulem P, Bulik-Sullivan B, Esko T, Johnson AD, Elks CE, Franceschini N, He C, Altmaier E, Brody JA, Franke LL, Huffman JE, Keller MF, McArdle PF, Nutile T, Porcu E, Robino A, Rose LM, Schick UM, Smith JA, Teumer A, Traglia M, Vuckovic D, Yao J, Zhao W, Albrecht E, Amin N, Corre T, Hottenga J-J, Mangino M, Smith AV, Tanaka T, Abecasis G, Andrulis IL, Anton-Culver H, Antoniou AC, Arndt V, Arnold AM, Barbieri C, Beckmann MW, Beeghly-Fadiel A, Benitez J, Bernstein L, Bielinski SJ, Blomqvist C, Boerwinkle E, Bogdanova NV, Bojesen SE, Bolla MK, Borresen-Dale A-L, Boutin TS, Brauch H, Brenner H, Brüning T, Burwinkel B, Campbell A, Campbell H, Chanock SJ, Chapman JR, Chen Y-DI, Chenevix-Trench G, Couch FJ, Coviello AD, Cox A, Czene K, Darabi H, De Vivo I, Demerath EW, Dennis J, Devilee P, Dörk T, dos-Santos-Silva I, Dunning AM, Eicher JD, Fasching PA, Faul JD, Figueroa J, Flesch-Janys D, Gandin I, Garcia ME, García-Closas M, Giles GG, Girotto GG, Goldberg MS, González-Neira A, Goodarzi MO, Grove ML, Gudbjartsson DF, Guénel P, Guo X, Haiman CA, Hall P, Hamann U, Henderson BE, Hocking LJ, Hofman A, Homuth G, Hooning MJ, Hopper JL, Hu FB, Huang J, Humphreys K, Hunter DJ, Jakubowska A, Jones SE, Kabisch M, Karasik D, Knight JA, Kolcic I, Kooperberg C, Kosma V-M, Kriebel J, Kristensen V, Lambrechts D, Langenberg C, Li J, Li X, Lindström S, Liu Y, Luan J, Lubinski J, Mägi R, Mannermaa A, Manz J, Margolin S, Marten J, Martin NG, Masciullo C, Meindl A, Michailidou K, Mihailov E, Milani L, Milne RL, Müller-Nurasyid M, Nalls M, Neale BM, Nevanlinna H, Neven P, Newman AB, Nordestgaard BG, Olson JE, Padmanabhan S, Peterlongo P, Peters U, Petersmann A, Peto J, Pharoah PDP, Pirastu NN, Pirie A, Pistis G, Polasek O, Porteous D, Psaty BM, Pylkäs K, Radice P, Raffel LJ, Rivadeneira F, Rudan I, Rudolph A, Ruggiero D, Sala CF, Sanna S, Sawyer EJ, Schlessinger D, Schmidt MK, Schmidt F, Schmutzler RK, Schoemaker MJ, Scott RA, Seynaeve CM, Simard J, Sorice R, Southey MC, Stöckl D, Strauch K, Swerdlow A, Taylor KD, Thorsteinsdottir U, Toland AE, Tomlinson I, Truong T, Tryggvadottir L, Turner ST, Vozzi D, Wang Q, Wellons M, Willemsen G, Wilson JF, Winqvist R, Wolffenbuttel BBHR, Wright AF, Yannoukakos D, Zemunik T, Zheng W, Zygmunt M, Bergmann S, Boomsma DI, Buring JE, Ferrucci L, Montgomery GW, Gudnason V, Spector TD, van Duijn CM, Alizadeh BZ, Ciullo M, Crisponi L, Easton DF, Gasparini PP, Gieger C, Harris TB, Hayward C, Kardia SLR, Kraft P, McKnight B, Metspalu A, Morrison AC, Reiner AP, Ridker PM, Rotter JI, Toniolo D, Uitterlinden AG, Ulivi S, Völzke H, Wareham NJ, Weir DR, Yerges-Armstrong LM, Price AL, Stefansson K, Visser JA, Ong KK, Chang-Claude J, Murabito JM, Perry JRB, Murray A (2015) Large-scale genomic analyses link reproductive ageing to hypothalamic signaling, breast cancer susceptibility and BRCA1-mediated DNA repair. Nat Genet 47(11):1294–1303. https://doi.org/10.1038/ng.3412

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • De S, Michor F (2011) DNA replication timing and long-range DNA interactions predict mutational landscapes of cancer genomes. Nat Biotechnol 29(12):1103–1108. https://doi.org/10.1038/nbt.2030

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Delage WJ, Thevenon J, Lemaitre C (2020) Towards a better understanding of the low recall of insertion variants with short-read based variant callers. BMC Genomics 21(1):762. https://doi.org/10.1186/s12864-020-07125-5

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, Epstein CB, Frietze S, Harrow J, Kaul R, Khatun J, Lajoie BR, Landt SG, Lee B-K, Pauli F, Rosenbloom KR, Sabo P, Safi A, Sanyal A, Shoresh N, Simon JM, Song L, Trinklein ND, Altshuler RC, Birney E, Brown JB, Cheng C, Djebali S, Dong X, Dunham I, Ernst J, Furey TS, Gerstein M, Giardine B, Greven M, Hardison RC, Harris RS, Herrero J, Hoffman MM, Iyer S, Kellis M, Khatun J, Kheradpour P, Kundaje A, Lassmann T, Li Q, Lin X, Marinov GK, Merkel A, Mortazavi A, Parker SCJ, Reddy TE, Rozowsky J, Schlesinger F, Thurman RE, Wang J, Ward LD, Whitfield TW, Wilder SP, Wu W, Xi HS, Yip KY, Zhuang J, Bernstein BE, Birney E, Dunham I, Green ED, Gunter C, Snyder M, Pazin MJ, Lowdon RF, Dillon LAL, Adams LB, Kelly CJ, Zhang J, Wexler JR, Green ED, Good PJ, Feingold EA, Bernstein BE, Birney E, Crawford GE, Dekker J, Elnitski L, Farnham PJ, Gerstein M, Giddings MC, Gingeras TR, Green ED, Guigó R, Hardison RC, Hubbard TJ, Kellis M, Kent WJ, Lieb JD, Margulies EH, Myers RM, Snyder M, Stamatoyannopoulos JA, Tenenbaum SA, Weng Z, White KP, Wold B, Khatun J, Yu Y, Wrobel J, Risk BA, Gunawardena HP, Kuiper HC, Maier CW, Xie L, Chen X, Giddings MC, Bernstein BE, Epstein CB, Shoresh N, Ernst J, Kheradpour P, Mikkelsen TS, Gillespie S, Goren A, Ram O, Zhang X, Wang L, Issner R, Coyne MJ, Durham T, Ku M, Truong T, Ward LD, Altshuler RC, Eaton ML, Kellis M, Djebali S, Davis CA, Merkel A, Dobin A, Lassmann T, Mortazavi A, Tanzer A, Lagarde J, Lin W, Schlesinger F, Xue C, Marinov GK, Khatun J, Williams BA, Zaleski C, Rozowsky J, Röder M, Kokocinski F, Abdelhamid RF, Alioto T, Antoshechkin I, Baer MT, Batut P, Bell I, Bell K, Chakrabortty S, Chen X, Chrast J, Curado J, Derrien T, Drenkow J, Dumais E, Dumais J, Duttagupta R, Fastuca M, Fejes-Toth K, Ferreira P, Foissac S, Fullwood MJ, Gao H, Gonzalez D, Gordon A, Gunawardena HP, Howald C, Jha S, Johnson R, Kapranov P, King B, Kingswood C, Li G, Luo OJ, Park E, Preall JB, Presaud K, Ribeca P, Risk BA, Robyr D, Ruan X, Sammeth M, Sandhu KS, Schaeffer L, See L-H, Shahab A, Skancke J, Suzuki AM, Takahashi H, Tilgner H, Trout D, Walters N, Wang H, Wrobel J, Yu Y, Hayashizaki Y, Harrow J, Gerstein M, Hubbard TJ, Reymond A, Antonarakis SE, Hannon GJ, Giddings MC, Ruan Y, Wold B, Carninci P, Guigó R, Gingeras TR, Rosenbloom KR, Sloan CA, Learned K, Malladi VS, Wong MC, Barber GP, Cline MS, Dreszer TR, Heitner SG, Karolchik D, Kent WJ, Kirkup VM, Meyer LR, Long JC, Maddren M, Raney BJ, Furey TS, Song L, Grasfeder LL, Giresi PG, Lee B-K, Battenhouse A, Sheffield NC, Simon JM, Showers KA, Safi A, London D, Bhinge AA, Shestak C, Schaner MR, Ki Kim S, Zhang ZZ, Mieczkowski PA, Mieczkowska JO, Liu Z, McDaniell RM, Ni Y, Rashid NU, Kim MJ, Adar S, Zhang Z, Wang T, Winter D, Keefe D, Birney E, Iyer VR, Lieb JD, Crawford GE, Li G, Sandhu KS, Zheng M, Wang P, Luo OJ, Shahab A, Fullwood MJ, Ruan X, Ruan Y, Myers RM, Pauli F, Williams BA, Gertz J, Marinov GK, Reddy TE, Vielmetter J, Partridge E, Trout D, Varley KE, Gasper C, The ENCODE Project Consortium, Overall coordination (data analysis coordination), Data production leads (data production), Lead analysts (data analysis), Writing group, NHGRI project management (scientific management), Principal investigators (steering committee), Boise State University and University of North Carolina at Chapel Hill Proteomics groups (data production and analysis), Broad Institute Group (data production and analysis), Cold Spring Harbor U of G Center for Genomic Regulation, Barcelona, RIKEN, Sanger Institute, University of Lausanne, Genome Institute of Singapore group (data production and analysis), Data coordination center at UC Santa Cruz (production data coordination), Duke University E University of Texas, Austin, University of North Carolina-Chapel Hill group (data production and analysis), Genome Institute of Singapore group (data production and analysis), HudsonAlpha Institute C UC Irvine, Stanford group (data production and analysis) (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489(7414):57–74. https://doi.org/10.1038/nature11247

    Article  CAS  Google Scholar 

  • Fu Y, Cai L, Lei X, Wang D (2019) Circular RNA ABCB10 promotes hepatocellular carcinoma progression by increasing HMG20A expression by sponging miR-670-3p. Cancer Cell Int 19(1):338. https://doi.org/10.1186/s12935-019-1055-z

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Fujimoto A, Furuta M, Totoki Y, Tsunoda T, Kato M, Shiraishi Y, Tanaka H, Taniguchi H, Kawakami Y, Ueno M, Gotoh K, Ariizumi S, Wardell CP, Hayami S, Nakamura T, Aikata H, Arihiro K, Boroevich KA, Abe T, Nakano K, Maejima K, Sasaki-Oku A, Ohsawa A, Shibuya T, Nakamura H, Hama N, Hosoda F, Arai Y, Ohashi S, Urushidate T, Nagae G, Yamamoto S, Ueda H, Tatsuno K, Ojima H, Hiraoka N, Okusaka T, Kubo M, Marubashi S, Yamada T, Hirano S, Yamamoto M, Ohdan H, Shimada K, Ishikawa O, Yamaue H, Chayama K, Miyano S, Aburatani H, Shibata T, Nakagawa H (2016) Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer. Nat Genet 48(5):500–509. https://doi.org/10.1038/ng.3547

    Article  CAS  PubMed  Google Scholar 

  • Fujimoto A, Fujita M, Hasegawa T, Wong JH, Maejima K, Oku-Sasaki A, Nakano K, Shiraishi Y, Miyano S, Yamamoto G, Akagi K, Imoto S, Nakagawa H (2020) Comprehensive analysis of indels in whole-genome microsatellite regions and microsatellite instability across 21 cancer types. Genome Res 30(3):334–346. https://doi.org/10.1101/gr.255026.119

    Article  CAS  PubMed Central  Google Scholar 

  • Guan P, Sung W-K (2016) Structural variation detection using next-generation sequencing data A comparative technical review. Methods 102:36–49. https://doi.org/10.1016/j.ymeth.2016.01.020

    Article  CAS  PubMed  Google Scholar 

  • Hancks DC, Kazazian HH (2016) Roles for retrotransposon insertions in human disease. Mob DNA 7(1):9. https://doi.org/10.1186/s13100-016-0065-9

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Hendig D, Langmann T, Zarbock R, Schmitz G, Kleesiek K, Götting C (2009) Characterization of the ATP-binding cassette transporter gene expression profile in Y79: a retinoblastoma cell line. Mol Cell Biochem 328(1):85–92. https://doi.org/10.1007/s11010-009-0077-6

    Article  CAS  PubMed  Google Scholar 

  • Hormozdiari F, Kostem E, Kang EY, Pasaniuc B, Eskin E (2014) Identifying causal variants at loci with multiple signals of association. Genetics 198:21

    Article  Google Scholar 

  • Howie BN, Donnelly P, Marchini J (2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5(6):e1000529. https://doi.org/10.1371/journal.pgen.1000529

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Huang Y, Zhang Y, Jia L, Liu C, Xu F (2019) Circular RNA ABCB10 promotes tumor progression and correlates with pejorative prognosis in clear cell renal cell carcinoma. Int J Biol Markers 34(2):176–183. https://doi.org/10.1177/1724600819842279

    Article  CAS  PubMed  Google Scholar 

  • Hubley R, Smit A, Green P (2013) RepeatMasker Open-4.0

  • Kent WJ (2002) BLAT—The BLAST-like alignment tool. Genome Res 12(4):656–664. https://doi.org/10.1101/gr.229202

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Li H (2018) Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34(18):3094–3100. https://doi.org/10.1093/bioinformatics/bty191

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Liang H-F, Zhang X-Z, Liu B-G, Jia G-T, Li W-L (2017) Circular RNA circ-ABCB10 promotes breast cancer proliferation and progression through sponging miR-1271. Am J Cancer Res 7(7):1566–1576

    CAS  PubMed  PubMed Central  Google Scholar 

  • Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, Hasz R, Walters G, Garcia F, Young N, Foster B, Moser M, Karasik E, Gillard B, Ramsey K, Sullivan S, Bridge J, Magazine H, Syron J, Fleming J, Siminoff L, Traino H, Mosavel M, Barker L, Jewell S, Rohrer D, Maxim D, Filkins D, Harbach P, Cortadillo E, Berghuis B, Turner L, Hudson E, Feenstra K, Sobin L, Robb J, Branton P, Korzeniewski G, Shive C, Tabor D, Qi L, Groch K, Nampally S, Buia S, Zimmerman A, Smith A, Burges R, Robinson K, Valentino K, Bradbury D, Cosentino M, Diaz-Mayoral N, Kennedy M, Engel T, Williams P, Erickson K, Ardlie K, Winckler W, Getz G, DeLuca D, MacArthur D, Kellis M, Thomson A, Young T, Gelfand E, Donovan M, Meng Y, Grant G, Mash D, Marcus Y, Basile M, Liu J, Zhu J, Tu Z, Cox NJ, Nicolae DL, Gamazon ER, Im HK, Konkashbaev A, Pritchard J, Stevens M, Flutre T, Wen X, Dermitzakis ET, Lappalainen T, Guigo R, Monlong J, Sammeth M, Koller D, Battle A, Mostafavi S, McCarthy M, Rivas M, Maller J, Rusyn I, Nobel A, Wright F, Shabalin A, Feolo M, Sharopova N, Sturcke A, Paschal J, Anderson JM, Wilder EL, Derr LK, Green ED, Struewing JP, Temple G, Volpi S, Boyer JT, Thomson EJ, Guyer MS, Ng C, Abdallah A, Colantuoni D, Insel TR, Koester SE, Little AR, Bender PK, Lehner T, Yao Y, Compton CC, Vaught JB, Sawyer S, Lockhart NC, Demchok J, Moore HF (2013) The genotype-tissue expression (GTEx) project. Nat Genet 45(6):580–585. https://doi.org/10.1038/ng.2653

    Article  CAS  Google Scholar 

  • MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J, Pendlington ZM, Welter D, Burdett T, Hindorff L, Flicek P, Cunningham F, Parkinson H (2017) The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res 45(D1):D896–D901. https://doi.org/10.1093/nar/gkw1133

    Article  CAS  PubMed  Google Scholar 

  • Mahmoud M, Gobet N, Cruz-Dávalos DI, Mounier N, Sedlazeck FJ (2019) Structural variant calling: the long and the short of it. Genome Biol 20(1):246. https://doi.org/10.1186/s13059-019-1828-7

    Article  PubMed  PubMed Central  Google Scholar 

  • Michailidou K, Hall P, Gonzalez-Neira A, Ghoussaini M, Dennis J, Milne RL, Schmidt MK, Chang-Claude J, Bojesen SE, Bolla MK, Wang Q, Dicks E, Lee A, Turnbull C, Rahman N, Fletcher O, Peto J, Gibson L, Silva I dos S, Nevanlinna H, Muranen TA, Aittomäki K, Blomqvist C, Czene K, Irwanto A, Liu J, Waisfisz Q, Meijers-Heijboer H, Adank M, van der Luijt RB, Hein R, Dahmen N, Beckman L, Meindl A, Schmutzler RK, Müller-Myhsok B, Lichtner P, Hopper JL, Southey MC, Makalic E, Schmidt DF, Uitterlinden AG, Hofman A, Hunter DJ, Chanock SJ, Vincent D, Bacot F, Tessier DC, Canisius S, Wessels LFA, Haiman CA, Shah M, Luben R, Brown J, Luccarini C, Schoof N, Humphreys K, Li J, Nordestgaard BG, Nielsen SF, Flyger H, Couch FJ, Wang X, Vachon C, Stevens KN, Lambrechts D, Moisse M, Paridaens R, Christiaens M-R, Rudolph A, Nickels S, Flesch-Janys D, Johnson N, Aitken Z, Aaltonen K, Heikkinen T, Broeks A, Van’t Veer LJ, van der Schoot CE, Guénel P, Truong T, Laurent-Puig P, Menegaux F, Marme F, Schneeweiss A, Sohn C, Burwinkel B, Zamora MP, Perez JIA, Pita G, Alonso MR, Cox A, Brock IW, Cross SS, Reed MWR, Sawyer EJ, Tomlinson I, Kerin MJ, Miller N, Henderson BE, Schumacher F, Le Marchand L, Andrulis IL, Knight JA, Glendon G, Mulligan AM, Lindblom A, Margolin S, Hooning MJ, Hollestelle A, van den Ouweland AMW, Jager A, Bui QM, Stone J, Dite GS, Apicella C, Tsimiklis H, Giles GG, Severi G, Baglietto L, Fasching PA, Haeberle L, Ekici AB, Beckmann MW, Brenner H, Müller H, Arndt V, Stegmaier C, Swerdlow A, Ashworth A, Orr N, Jones M, Figueroa J, Lissowska J, Brinton L, Goldberg MS, Labrèche F, Dumont M, Winqvist R, Pylkäs K, Jukkola-Vuorinen A, Grip M, Brauch H, Hamann U, Brüning T, Radice P, Peterlongo P, Manoukian S, Bonanni B, Devilee P, Tollenaar RAEM, Seynaeve C, van Asperen CJ, Jakubowska A, Lubinski J, Jaworska K, Durda K, Mannermaa A, Kataja V, Kosma V-M, Hartikainen JM, Bogdanova NV, Antonenkova NN, Dörk T, Kristensen VN, Anton-Culver H, Slager S, Toland AE, Edge S, Fostira F, Kang D, Yoo K-Y, Noh D-Y, Matsuo K, Ito H, Iwata H, Sueta A, Wu AH, Tseng C-C, Van Den Berg D, Stram DO, Shu X-O, Lu W, Gao Y-T, Cai H, Teo SH, Yip CH, Phuah SY, Cornes BK, Hartman M, Miao H, Lim WY, Sng J-H, Muir K, Lophatananon A, Stewart-Brown S, Siriwanarangsan P, Shen C-Y, Hsiung C-N, Wu P-E, Ding S-L, Sangrajrang S, Gaborieau V, Brennan P, McKay J, Blot WJ, Signorello LB, Cai Q, Zheng W, Deming-Halverson S, Shrubsole M, Long J, Simard J, Garcia-Closas M, Pharoah PDP, Chenevix-Trench G, Dunning AM, Benitez J, Easton DF (2013) Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat Genet 45(4):353–3612. https://doi.org/10.1038/ng.2563

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Michailidou K, Beesley J, Lindstrom S, Canisius S, Dennis J, Lush MJ, Maranian MJ, Bolla MK, Wang Q, Shah M, Perkins BJ, Czene K, Eriksson M, Darabi H, Brand JS, Bojesen SE, Nordestgaard BG, Flyger H, Nielsen SF, Rahman N, Turnbull C, Fletcher O, Peto J, Gibson L, Dossantossilva I, Chang-Claude J, Flesch-Janys D, Rudolph A, Eilber U, Behrens S, Nevanlinna H, Muranen TA, Aittomäki K, Blomqvist C, Khan S, Aaltonen K, Ahsan H, Kibriya MG, Whittemore AS, John EM, Malone KE, Gammon MD, Santella RM, Ursin G, Makalic E, Schmidt DF, Casey G, Hunter DJ, Gapstur SM, Gaudet MM, Diver WR, Haiman CA, Schumacher F, Henderson BE, Le Marchand L, Berg CD, Chanock SJ, Figueroa J, Hoover RN, Lambrechts D, Neven P, Wildiers H, van Limbergen E, Schmidt MK, Broeks A, Verhoef S, Cornelissen S, Couch FJ, Olson JE, Hallberg E, Vachon C, Waisfisz Q, Meijers-Heijboer H, Adank MA, van der Luijt RB, Li J, Liu J, Humphreys K, Kang D, Choi J-Y, Park SK, Yoo K-Y, Matsuo K, Ito H, Iwata H, Tajima K, Guénel P, Truong T, Mulot C, Sanchez M, Burwinkel B, Marme F, Surowy H, Sohn C, Wu AH, Tseng C, Van Den Berg D, Stram DO, González-Neira A, Benitez J, Zamora MP, Perez JIA, Shu X-O, Lu W, Gao Y-T, Cai H, Cox A, Cross SS, Reed MWR, Andrulis IL, Knight JA, Glendon G, Mulligan AM, Sawyer EJ, Tomlinson I, Kerin MJ, Miller N, Lindblom A, Margolin S, Teo SH, Yip CH, Taib NAM, Tan G-H, Hooning MJ, Hollestelle A, Martens JWM, Collée JM, Blot W, Signorello LB, Cai Q, Hopper JL, Southey MC, Tsimiklis H, Apicella C, Shen C-Y, Hsiung C-N, Wu P-E, Hou M-F, Kristensen VN, Nord S, Alnaes GIG, Giles GG, Milne RL, McLean C, Canzian F, Trichopoulos D, Peeters P, Lund E, Sund M, Khaw K-T, Gunter MJ, Palli D, Mortensen LM, Dossus L, Huerta J-M, Meindl A, Schmutzler RK, Sutter C, Yang R, Muir K, Lophatananon A, Stewart-Brown S, Siriwanarangsan P, Hartman M, Miao H, Chia KS, Chan CW, Fasching PA, Hein A, Beckmann MW, Haeberle L, Brenner H, Dieffenbach AK, Arndt V, Stegmaier C, Ashworth A, Orr N, Schoemaker MJ, Swerdlow AJ, Brinton L, Garcia-Closas M, Zheng W, Halverson SL, Shrubsole M, Long J, Goldberg MS, Labrèche F, Dumont M, Winqvist R, Pylkäs K, Jukkola-Vuorinen A, Grip M, Brauch H, Hamann U, Brüning T, Radice P, Peterlongo P, Manoukian S, Bernard L, Bogdanova NV, Dörk T, Mannermaa A, Kataja V, Kosma V-M, Hartikainen JM, Devilee P, Tollenaar RAEM, Seynaeve C, Van Asperen CJ, Jakubowska A, Lubinski J, Jaworska K, Huzarski T, Sangrajrang S, Gaborieau V, Brennan P, McKay J, Slager S, Toland AE, Ambrosone CB, Yannoukakos D, Kabisch M, Torres D, Neuhausen SL, Anton-Culver H, Luccarini C, Baynes C, Ahmed S, Healey CS, Tessier DC, Vincent D, Bacot F, Pita G, Alonso MR, Álvarez N, Herrero D, Simard J, Pharoah PPDP, Kraft P, Dunning AM, Chenevix-Trench G, Hall P, Easton DF, kConFab Investigators, AOCS Group, GENICA Network, NBCS (2015) Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer. Nat Genet 47(4):373–380. https://doi.org/10.1038/ng.3242

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Michailidou K, Lindström S, Dennis J, Beesley J, Hui S, Kar S, Lemaçon A, Soucy P, Glubb D, Rostamianfar A, Bolla MK, Wang Q, Tyrer J, Dicks E, Lee A, Wang Z, Allen J, Keeman R, Eilber U, French JD, Chen XQ, Fachal L, McCue K, McCart Reed AE, Ghoussaini M, Carroll J, Jiang X, Finucane H, Adams M, Adank MA, Ahsan H, Aittomäki K, Anton-Culver H, Antonenkova NN, Arndt V, Aronson KJ, Arun B, Auer PL, Bacot F, Barrdahl M, Baynes C, Beckmann MW, Behrens S, Benitez J, Bermisheva M, Bernstein L, Blomqvist C, Bogdanova NV, Bojesen SE, Bonanni B, Børresen-Dale A-L, Brand JS, Brauch H, Brennan P, Brenner H, Brinton L, Broberg P, Brock IW, Broeks A, Brooks-Wilson A, Brucker SY, Brüning T, Burwinkel B, Butterbach K, Cai Q, Cai H, Caldés T, Canzian F, Carracedo A, Carter BD, Castelao JE, Chan TL, Cheng T-YD, Chia KS, Choi J-Y, Christiansen H, Clarke CL, Collée M, Conroy DM, Cordina-Duverger E, Cornelissen S, Cox DG, Cox A, Cross SS, Cunningham JM, Czene K, Daly MB, Devilee P, Doheny KF, Dörk T, Dossantossilva I, Dumont M, Durcan L, Dwek M, Eccles DM, Ekici AB, Eliassen AH, Ellberg C, Elvira M, Engel C, Eriksson M, Fasching PA, Figueroa J, Flesch-Janys D, Fletcher O, Flyger H, Fritschi L, Gaborieau V, Gabrielson M, Gago-Dominguez M, Gao Y-T, Gapstur SM, García-Sáenz JA, Gaudet MM, Georgoulias V, Giles GG, Glendon G, Goldberg MS, Goldgar DE, González-Neira A, Grenaker Alnæs GI, Grip M, Gronwald J, Grundy A, Guénel P, Haeberle L, Hahnen E, Haiman CA, Håkansson N, Hamann U, Hamel N, Hankinson S, Harrington P, Hart SN, Hartikainen JM, Hartman M, Hein A, Heyworth J, Hicks B, Hillemanns P, Ho DN, Hollestelle A, Hooning MJ, Hoover RN, Hopper JL, Hou M-F, Hsiung C-N, Huang G, Humphreys K, Ishiguro J, Ito H, Iwasaki M, Iwata H, Jakubowska A, Janni W, John EM, Johnson N, Jones K, Jones M, Jukkola-Vuorinen A, Kaaks R, Kabisch M, Kaczmarek K, Kang D, Kasuga Y, Kerin MJ, Khan S, Khusnutdinova E, Kiiski JI, Kim S-W, Knight JA, Kosma V-M, Kristensen VN, Krüger U, Kwong A, Lambrechts D, Marchand LL, Lee E, Lee MH, Lee JW, Lee CN, Lejbkowicz F, Li J, Lilyquist J, Lindblom A, Lissowska J, Lo W-Y, Loibl S, Long J, Lophatananon A, Lubinski J, Luccarini C, Lux MP, Ma ESK, MacInnis RJ, Maishman T, Makalic E, Malone KE, Kostovska IM, Mannermaa A, Manoukian S, Manson JE, Margolin S, Mariapun S, Martinez ME, Matsuo K, Mavroudis D, McKay J, McLean C, Meijers-Heijboer H, Meindl A, Menéndez P, Menon U, Meyer J, Miao H, Miller N, Mohd Taib NA, Muir K, Mulligan AM, Mulot C, Neuhausen SL, Nevanlinna H, Neven P, Nielsen SF, Noh D-Y, Nordestgaard BG, Norman A, Olopade OI, Olson JE, Olsson H, Olswold C, Orr N, Pankratz VS, Park SK, Park-Simon T-W, Lloyd R, Perez JIA, Peterlongo P, Peto J, Phillips K-A, Pinchev M, Plaseska-Karanfilska D, Prentice R, Presneau N, Prokofieva D, Pugh E, Pylkäs K, Rack B, Radice P, Rahman N, Rennert G, Rennert HS, Rhenius V, Romero A, Romm J, Ruddy KJ, Rüdiger T, Rudolph A, Ruebner M, Rutgers EJTh, Saloustros E, Sandler DP, Sangrajrang S, Sawyer EJ, Schmidt DF, Schmutzler RK, Schneeweiss A, Schoemaker MJ, Schumacher F, Schürmann P, Scott RJ, Scott C, Seal S, Seynaeve C, Shah M, Sharma P, Shen C-Y, Sheng G, Sherman ME, Shrubsole MJ, Shu X-O, Smeets A, Sohn C, Southey MC, Spinelli JJ, Stegmaier C, Stewart-Brown S, Stone J, Stram DO, Surowy H, Swerdlow A, Tamimi R, Taylor JA, Tengström M, Teo SH, Terry MB, Tessier DC, Thanasitthichai S, Thöne K, Tollenaar RAEM, Tomlinson I, Tong L, Torres D, Truong T, Tseng C, Tsugane S, Ulmer H-U, Ursin G, Untch M, Vachon C, van Asperen CJ, Van Den Berg D, van den Ouweland AMW, van der Kolk L, van der Luijt RB, Vincent D, Vollenweider J, Waisfisz Q, Wang-Gohrke S, Weinberg CR, Wendt C, Whittemore AS, Wildiers H, Willett W, Winqvist R, Wolk A, Wu AH, Xia L, Yamaji T, Yang XR, Yip CH, Yoo K-Y, Yu J-C, Zheng W, Zheng Y, Zhu B, Ziogas A, Ziv E, Lakhani SR, Antoniou AC, Droit A, Andrulis IL, Amos CI, Couch FJ, Pharoah PDP, Chang-Claude J, Hall P, Hunter DJ, Milne RL, García-Closas M, Schmidt MK, Chanock SJ, Dunning AM, Edwards SL, Bader GD, Chenevix-Trench G, Simard J, Kraft P, Easton DF (2017) Association analysis identifies 65 new breast cancer risk loci. Nature 551(7678):92–94. https://doi.org/10.1038/nature24284

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Montgomery SB E-MTAB-264 - HAPMAP3_EXPRESSION. ArrayExpress, E-MTAB-264. https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-264

  • Montgomery SB, Goode DL, Kvikstad E, Albers CA, Zhang ZD, Mu XJ, Ananda G, Howie B, Karczewski KJ, Smith KS, Anaya V, Richardson R, Davis J, MacArthur DG, Sidow A, Duret L, Gerstein M, Makova KD, Marchini J, McVean G, Lunter G, Consortium T 1000 GP (2013) The origin, evolution, and functional impact of short insertion–deletion variants identified in 179 human genomes. Genome Res 23(5):749–761. https://doi.org/10.1101/gr.148718.112

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Pang AW, MacDonald JR, Pinto D, Wei J, Rafiq MA, Conrad DF, Park H, Hurles ME, Lee C, Venter JC, Kirkness EF, Levy S, Feuk L, Scherer SW (2010) Towards a comprehensive structural variation map of an individual human genome. Genome Biol 11:14

    Article  Google Scholar 

  • Psychosis Endophenotypes International Consortium, CNV and Schizophrenia Working Groups of the Psychiatric Genomics Consortium, Marshall CR, Howrigan DP, Merico D, Thiruvahindrapuram B, Wu W, Greer DS, Antaki D, Shetty A, Holmans PA, Pinto D, Gujral M, Brandler WM, Malhotra D, Wang Z, Fajarado KVF, Maile MS, Ripke S, Agartz I, Albus M, Alexander M, Amin F, Atkins J, Bacanu SA, Belliveau RA, Bergen SE, Bertalan M, Bevilacqua E, Bigdeli TB, Black DW, Bruggeman R, Buccola NG, Buckner RL, Bulik-Sullivan B, Byerley W, Cahn W, Cai G, Cairns MJ, Campion D, Cantor RM, Carr VJ, Carrera N, Catts SV, Chambert KD, Cheng W, Cloninger CR, Cohen D, Cormican P, Craddock N, Crespo-Facorro B, Crowley JJ, Curtis D, Davidson M, Davis KL, Degenhardt F, Del Favero J, DeLisi LE, Dikeos D, Dinan T, Djurovic S, Donohoe G, Drapeau E, Duan J, Dudbridge F, Eichhammer P, Eriksson J, Escott-Price V, Essioux L, Fanous AH, Farh K-H, Farrell MS, Frank J, Franke L, Freedman R, Freimer NB, Friedman JI, Forstner AJ, Fromer M, Genovese G, Georgieva L, Gershon ES, Giegling I, Giusti-Rodríguez P, Godard S, Goldstein JI, Gratten J, de Haan L, Hamshere ML, Hansen M, Hansen T, Haroutunian V, Hartmann AM, Henskens FA, Herms S, Hirschhorn JN, Hoffmann P, Hofman A, Huang H, Ikeda M, Joa I, Kähler AK, Kahn RS, Kalaydjieva L, Karjalainen J, Kavanagh D, Keller MC, Kelly BJ, Kennedy JL, Kim Y, Knowles JA, Konte B, Laurent C, Lee P, Lee SH, Legge SE, Lerer B, Levy DL, Liang K-Y, Lieberman J, Lönnqvist J, Loughland CM, Magnusson PKE, Maher BS, Maier W, Mallet J, Mattheisen M, Mattingsdal M, McCarley RW, McDonald C, McIntosh AM, Meier S, Meijer CJ, Melle I, Mesholam-Gately RI, Metspalu A, Michie PT, Milani L, Milanova V, Mokrab Y, Morris DW, Müller-Myhsok B, Murphy KC, Murray RM, Myin-Germeys I, Nenadic I, Nertney DA, Nestadt G, Nicodemus KK, Nisenbaum L, Nordin A, O’Callaghan E, O’Dushlaine C, Oh S-Y, Olincy A, Olsen L, O’Neill FA, Van Os J, Pantelis C, Papadimitriou GN, Parkhomenko E, Pato MT, Paunio T, Perkins DO, Pers TH, Pietiläinen O, Pimm J, Pocklington AJ, Powell J, Price A, Pulver AE, Purcell SM, Quested D, Rasmussen HB, Reichenberg A, Reimers MA, Richards AL, Roffman JL, Roussos P, Ruderfer DM, Salomaa V, Sanders AR, Savitz A, Schall U, Schulze TG, Schwab SG, Scolnick EM, Scott RJ, Seidman LJ, Shi J, Silverman JM, Smoller JW, Söderman E, Spencer CCA, Stahl EA, Strengman E, Strohmaier J, Stroup TS, Suvisaari J, Svrakic DM, Szatkiewicz JP, Thirumalai S, Tooney PA, Veijola J, Visscher PM, Waddington J, Walsh D, Webb BT, Weiser M, Wildenauer DB, Williams NM, Williams S, Witt SH, Wolen AR, Wormley BK, Wray NR, Wu JQ, Zai CC, Adolfsson R, Andreassen OA, Blackwood DHR, Bramon E, Buxbaum JD, Cichon S, Collier DA, Corvin A, Daly MJ, Darvasi A, Domenici E, Esko T, Gejman PV, Gill M, Gurling H, Hultman CM, Iwata N, Jablensky AV, Jönsson EG, Kendler KS, Kirov G, Knight J, Levinson DF, Li QS, McCarroll SA, McQuillin A, Moran JL, Mowry BJ, Nöthen MM, Ophoff RA, Owen MJ, Palotie A, Pato CN, Petryshen TL, Posthuma D, Rietschel M, Riley BP, Rujescu D, Sklar P, St Clair D, Walters JTR, Werge T, Sullivan PF, O’Donovan MC, Scherer SW, Neale BM, Sebat J (2017) Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet 49(1):27–35. https://doi.org/10.1038/ng.3725

    Article  CAS  Google Scholar 

  • Shabalin AA (2012) Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28(10):1353–1358. https://doi.org/10.1093/bioinformatics/bts163

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Shigemizu D, Miya F, Akiyama S, Okuda S, Boroevich KA, Fujimoto A, Nakagawa H, Ozaki K, Niida S, Kanemura Y, Okamoto N, Saitoh S, Kato M, Yamasaki M, Matsunaga T, Mutai H, Kosaki K, Tsunoda T (2018) IMSindel: an accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis. Sci Rep 8(1):5608. https://doi.org/10.1038/s41598-018-23978-z

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Stewart C, Kural D, Strömberg MP, Walker JA, Konkel MK, Stütz AM, Urban AE, Grubert F, Lam HYK, Lee W-P, Busby M, Indap AR, Garrison E, Huff C, Xing J, Snyder MP, Jorde LB, Batzer MA, Korbel JO, Marth GT, Project 1000 Genomes (2011) A Comprehensive Map of Mobile Element Insertion Polymorphisms in Humans. PLOS Genet 7(8):e1002236. https://doi.org/10.1371/journal.pgen.1002236

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Stranger BE, Montgomery SB, Dimas AS, Parts L, Stegle O, Ingle CE, Sekowska M, Smith GD, Evans D, Gutierrez-Arcelus M, Price A, Raj T, Nisbett J, Nica AC, Beazley C, Durbin R, Deloukas P, Dermitzakis ET (2012) Patterns of cis regulatory variation in diverse human populations. PLOS Genet 8(4):e1002639. https://doi.org/10.1371/journal.pgen.1002639

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Tang W, Mun S, Joshi A, Han K, Liang P (2018) Mobile elements contribute to the uniqueness of human genome with 15,000 human-specific insertions and 14 Mbp sequence increase. DNA Res 25(5):521–533. https://doi.org/10.1093/dnares/dsy022

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Taylor MS (2004) Occurrence and consequences of coding sequence insertions and deletions in mammalian genomes. Genome Res 14(4):555–566. https://doi.org/10.1101/gr.1977804

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • The 1000 Genomes Project Consortium (2015) A global reference for human genetic variation. Nature 526(7571):68–74. https://doi.org/10.1038/nature15393

    Article  CAS  Google Scholar 

  • The 1000 Genomes Project Consortium, Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Hsi-Yang Fritz M, Konkel MK, Malhotra A, Stütz AM, Shi X, Paolo Casale F, Chen J, Hormozdiari F, Dayama G, Chen K, Malig M, Chaisson MJP, Walter K, Meiers S, Kashin S, Garrison E, Auton A, Lam HYK, Jasmine MuX, Alkan C, Antaki D, Bae T, Cerveira E, Chines P, Chong Z, Clarke L, Dal E, Ding L, Emery S, Fan X, Gujral M, Kahveci F, Kidd JM, Kong Y, Lameijer E-W, McCarthy S, Flicek P, Gibbs RA, Marth G, Mason CE, Menelaou A, Muzny DM, Nelson BJ, Noor A, Parrish NF, Pendleton M, Quitadamo A, Raeder B, Schadt EE, Romanovitch M, Schlattl A, Sebra R, Shabalin AA, Untergasser A, Walker JA, Wang M, Yu F, Zhang C, Zhang J, Zheng-Bradley X, Zhou W, Zichner T, Sebat J, Batzer MA, McCarroll SA, Mills RE, Gerstein MB, Bashir A, Stegle O, Devine SE, Lee C, Eichler EE, Korbel JO (2015) An integrated map of structural variation in 2,504 human genomes. Nature 526(7571):75–81. https://doi.org/10.1038/nature15394

    Article  CAS  PubMed Central  Google Scholar 

  • The International Schizophrenia Consortium (2008) Rare chromosomal deletions and duplications increase risk of schizophrenia. Nature 455(7210):237–241. https://doi.org/10.1038/nature07239

    Article  CAS  PubMed Central  Google Scholar 

  • The Wellcome Trust Case Control Consortium (2010) Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls. Nature 464(7289):713–720. https://doi.org/10.1038/nature08979

    Article  CAS  PubMed Central  Google Scholar 

  • Wang K, Li M, Hakonarson H (2010) ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38(16):e164–e164. https://doi.org/10.1093/nar/gkq603

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Weischenfeldt J, Symmons O, Spitz F, Korbel JO (2013) Phenotypic impact of genomic structural variation: insights from and for human disease. Nat Rev Genet 14(2):125–138. https://doi.org/10.1038/nrg3373

    Article  CAS  PubMed  Google Scholar 

  • Wong JH, Shigemizu D, Yoshii Y, Akiyama S, Tanaka A, Nakagawa H, Narumiya S, Fujimoto A (2019) Identification of intermediate-sized deletions and inference of their impact on gene expression in a human population. Genome Med 11(1):44. https://doi.org/10.1186/s13073-019-0656-4

    Article  PubMed  PubMed Central  Google Scholar 

  • Zhang F, Gu W, Hurles ME, Lupski JR (2009) Copy number variation in human health, disease, and evolution. Annu Rev Genomics Hum Genet 10:451–481. https://doi.org/10.1146/annurev.genom.9.081307.164217

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The super-computing resource was provided by Human Genome Center, the Institute of Medical Science, the University of Tokyo (http://sc.hgc.jp/shirokane.html).

Funding

This research was supported by Japan Agency for Medical Research and Development (AMED) under Grant Number 19km0405207h0004 (to AF), 20km04052 (to AF), and 20km0405205 (to KT).

Author information

Authors and Affiliations

Authors

Contributions

All the authors contributed to the study conception and design. Akihiro Fujimoto designed the study. Saeideh Ashouri performed the computational analyses and the experiments. Saeideh Ashouri, Jing Hao Wong, Hidewaki Nakagawa, Mihoko Shimada, Katsushi Tokunaga, and Akihiro Fujimoto interpreted the results. Saeideh Ashouri, Jing Hao Wong, and Akihiro Fujimoto wrote the first draft of manuscript, and all the authors commented on previous versions of the manuscript. All the authors read and approved the final manuscript.

Corresponding author

Correspondence to Akihiro Fujimoto.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

This study was approved by institutional review boards (IRBs) at RIKEN (H20-11) and Kyoto University (G1031).

Consent to participate

This study was conducted in accordance with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 2305 KB)

Supplementary file2 (XLSX 1348 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ashouri, S., Wong, J.H., Nakagawa, H. et al. Characterization of intermediate-sized insertions using whole-genome sequencing data and analysis of their functional impact on gene expression. Hum Genet 140, 1201–1216 (2021). https://doi.org/10.1007/s00439-021-02291-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00439-021-02291-2

Navigation