Copy Number Variation

Macé, Aurélien; Kutalik, Zoltán; Valsesia, Armand

doi:10.1007/978-1-4939-7868-7_14

Aurélien Macé^3,4,5,
Zoltán Kutalik^3,5 &
Armand Valsesia⁶

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1793))

5104 Accesses
26 Citations
2 Altmetric

Abstract

Differences between genomes can be due to single nucleotide variants (SNPs), translocations, inversions and copy number variants (CNVs, gain or loss of DNA). The latter can range from sub-microscopic events to complete chromosomal aneuploidies. Small CNVs are often benign but those larger than 250 kb are strongly associated with morbid consequences such as developmental disorders and cancer. Detecting CNVs within and between populations is essential to better understand the plasticity of our genome and to elucidate its possible contribution to disease or phenotypic traits.

While the link between SNPs and disease susceptibility has been well studied, to date there are still very few published CNV genome-wide association studies; probably owing to the fact that CNV analysis remains a slightly more complex task than SNP analysis (both in term of bioinformatics workflow and uncertainty in the CNV calling leading to high false positive rates and unknown false negative rates). This chapter aims at explaining computational methods for the analysis of CNVs, ranging from study design, data processing and quality control, up to genome-wide association study with clinical traits.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Valsesia A, Mace A, Jacquemont S et al (2013) The growing importance of CNVs: new insights for detection and clinical interpretation. Front Genet 4:92
Article PubMed PubMed Central CAS Google Scholar
Conrad DF, Pinto D, Redon R et al (2010) Origins and functional impact of copy number variation in the human genome. Nature 464(7289):704–712
Article PubMed CAS Google Scholar
Feuk L, Carson AR, Scherer SW (2006) Structural variation in the human genome. Nat Rev Genet 7(2):85–97
Article PubMed CAS Google Scholar
Fiegler H, Redon R, Andrews D et al (2006) Accurate and reliable high-throughput detection of copy number variation in the human genome. Genome Res 16(12):1566–1574
Article PubMed PubMed Central CAS Google Scholar
Freeman JL, Perry GH, Feuk L et al (2006) Copy number variation: new insights in genome diversity. Genome Res 16(8):949–961
Article PubMed CAS Google Scholar
Iafrate AJ, Feuk L, Rivera MN et al (2004) Detection of large-scale variation in the human genome. Nat Genet 36(9):949–951
Article PubMed CAS Google Scholar
Kidd JM, Cooper GM, Donahue WF et al (2008) Mapping and sequencing of structural variation from eight human genomes. Nature 453(7191):56–64
Article PubMed PubMed Central CAS Google Scholar
Perry GH, Yang F, Marques-Bonet T et al (2008) Copy number variation and evolution in humans and chimpanzees. Genome Res 18(11):1698–1710
Article PubMed PubMed Central CAS Google Scholar
Redon R, Ishikawa S, Fitch KR et al (2006) Global variation in copy number in the human genome. Nature 444(7118):444–454
Article PubMed PubMed Central CAS Google Scholar
Sharp AJ, Locke DP, McGrath SD et al (2005) Segmental duplications and copy-number variation in the human genome. Am J Hum Genet 77(1):78–88
Article PubMed PubMed Central CAS Google Scholar
Valsesia A, Rimoldi D, Martinet D et al (2011) Network-guided analysis of genes with altered somatic copy number and gene expression reveals pathways commonly perturbed in metastatic melanoma. PLoS One 6(4):e18369
Article PubMed PubMed Central CAS Google Scholar
Dopman EB, Hartl DL (2007) A portrait of copy-number polymorphism in Drosophila melanogaster. Proc Natl Acad Sci U S A 104(18056801):19920–19925
Article PubMed PubMed Central Google Scholar
Fontanesi L, Martelli PL, Beretti F et al (2010) An initial comparative map of copy number variations in the goat (Capra hircus) genome. BMC Genomics 11(21083884):639
Article PubMed PubMed Central CAS Google Scholar
Graubert TA, Cahan P, Edwin D et al (2007) A high-resolution map of segmental DNA copy number variation in the mouse genome. PLoS Genet 3(1):e3
Article PubMed PubMed Central CAS Google Scholar
Guryev V, Saar K, Adamovic T et al (2008) Distribution and functional impact of DNA copy number variation in the rat. Nat Genet 40(5):538–545
Article PubMed CAS Google Scholar
Lee AS, Gutiérrez-Arcelus M, Perry GH et al (2008) Analysis of copy number variation in the rhesus macaque genome identifies candidate loci for evolutionary and human disease studies. Hum Mol Genet 17(8):1127–1136
Article PubMed CAS Google Scholar
Liu GE, Hou Y, Zhu B et al (2010) Analysis of copy number variations among diverse cattle breeds. Genome Res 20(20212021):693–703
Article PubMed PubMed Central CAS Google Scholar
Valsesia A, Stevenson BJ, Waterworth D et al (2012) Identification and validation of copy number variants using SNP genotyping arrays from a large clinical cohort. BMC Genomics 13:241
Article PubMed PubMed Central CAS Google Scholar
Mannik K, Magi R, Mace A et al (2015) Copy number variations and cognitive phenotypes in unselected populations. JAMA 313(20):2044–2054
Article PubMed PubMed Central Google Scholar
Craddock N, Hurles ME, Cardin N et al (2010) Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls. Nature 464(7289):713–720
Article PubMed CAS Google Scholar
Firth HV, Richards SM, Bevan AP et al (2009) DECIPHER: database of chromosomal imbalance and phenotype in humans using ensembl resources. Am J Hum Genet 84(19344873):524–533
Article PubMed PubMed Central CAS Google Scholar
Grozeva D, Kirov G, Ivanov D et al (2010) Rare copy number variants: a point of rarity in genetic risk for bipolar disorder and schizophrenia. Arch Gen Psychiatry 67(20368508):318–327
Article PubMed PubMed Central Google Scholar
Jacquemont S, Reymond A, Zufferey F et al (2011) Mirror extreme BMI phenotypes associated with gene dosage at the chromosome 16p11.2 locus. Nature 478(7367):97–102
Article PubMed PubMed Central CAS Google Scholar
Walters RG, Jacquemont S, Valsesia A et al (2010) A new highly penetrant form of obesity due to deletions on chromosome 16p11.2. Nature 463(7281):671–675
Article PubMed PubMed Central CAS Google Scholar
Zhang F, Gu W, Hurles ME et al (2009) Copy number variation in human health, disease, and evolution. Annu Rev Genomics Hum Genet 10(19715442):451–481
Article PubMed PubMed Central CAS Google Scholar
Gayán J, Galan JJ, González-Pérez A et al (2010) Genetic structure of the Spanish population. BMC Genomics 11:326
Article PubMed PubMed Central CAS Google Scholar
Li J, Yang T, Wang L et al (2009) Whole genome distribution and ethnic differentiation of copy number variation in Caucasian and Asian populations. PLoS One 4(11):e7958
Article PubMed PubMed Central CAS Google Scholar
Matsuzaki H, Wang P-H, Hu J et al (2009) High resolution discovery and confirmation of copy number variants in 90 Yoruba Nigerians. Genome Biol 10(11):R125
Article PubMed PubMed Central CAS Google Scholar
McElroy JP, Nelson MR, Caillier SJ et al (2009) Copy number variation in African Americans. BMC Genet 10:15
Article PubMed PubMed Central CAS Google Scholar
Lin C-H, Li L-H, Ho S-F et al (2008) A large-scale survey of genetic copy number variations among Han Chinese residing in Taiwan. BMC Genet 9:92
Article PubMed PubMed Central CAS Google Scholar
Takahashi N, Tsuyama N, Sasaki K et al (2008) Segmental copy-number variation observed in Japanese by array-CGH. Ann Hum Genet 72(Pt 2):193–204
Article PubMed CAS Google Scholar
Jeon JP, Shim SM, Jung JS et al (2009) A comprehensive profile of DNA copy number variations in a Korean population: identification of copy number invariant regions among Koreans. Exp Mol Med 41(9):618–628
Article PubMed PubMed Central CAS Google Scholar
Kang T-W, Jeon Y-J, Jang E et al (2008) Copy number variations (CNVs) identified in Korean individuals. BMC Genomics 9:492
Article PubMed PubMed Central CAS Google Scholar
Jakobsson M, Scholz SW, Scheet P et al (2008) Genotype, haplotype and copy-number variation in worldwide human populations. Nature 451(7181):998–1003
Article PubMed CAS Google Scholar
Kato M, Kawaguchi T, Ishikawa S et al (2010) Population-genetic nature of copy number variations in the human genome. Hum Mol Genet 19(5):761–773
Article PubMed CAS Google Scholar
Conrad DF, Hurles ME (2007) The population genetics of structural variation. Nat Genet 39(7 Suppl):S30–S36
Article PubMed PubMed Central CAS Google Scholar
Nistér M, Wedell B, Betsholtz C et al (1987) Evidence for progressional changes in the human malignant glioma line U-343 MGa: analysis of karyotype and expression of genes encoding the subunit chains of platelet-derived growth factor. Cancer Res 47(18):4953–4960
PubMed Google Scholar
Leek JT, Scharpf RB, Bravo HC et al (2010) Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet 11(10):733–739
Article PubMed CAS Google Scholar
Allison DB, Cui X, Page GP, Sabripour M (2006) Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet 7(1):55–65
Article PubMed CAS Google Scholar
Benito M, Parker J, Du Q et al (2004) Adjustment of systematic microarray data biases. Bioinformatics 20(1):105–114
Article PubMed CAS Google Scholar
Irizarry RA, Hobbs B, Collin F et al (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4(2):249–264
Article PubMed Google Scholar
Johnson WE, Li C, Rabinovic A (2007) Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8(1):118–127
Article PubMed Google Scholar
Leek JT, Storey JD (2007) Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet 3(9):1724–1735
Article PubMed CAS Google Scholar
Nygaard V, Rodland EA, Hovig E (2016) Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses. Biostatistics 17(1):29–39
PubMed Google Scholar
Oytam Y, Sobhanmanesh F, Duesing K et al (2016) Risk-conscious correction of batch effects: maximising information extraction from high-throughput genomic datasets. BMC Bioinformatics 17(1):332
Article PubMed PubMed Central Google Scholar
Reese SE, Archer KJ, Therneau TM et al (2013) A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis. Bioinformatics 29(22):2877–2883
Article PubMed PubMed Central CAS Google Scholar
Scharpf RB, Ruczinski I, Carvalho B et al (2011) A multilevel model to address batch effects in copy number estimation using SNP arrays. Biostatistics 12(1):33–50
Article PubMed Google Scholar
Chung NC, Storey JD (2015) Statistical significance of variables driving systematic variation in high-dimensional data. Bioinformatics 31(4):545–554
Article PubMed CAS Google Scholar
Manimaran S, Selby HM, Okrah K et al (2016) BatchQC: interactive software for evaluating sample and batch effects in genomic data. Bioinformatics 32(24):3836–3838
Article PubMed PubMed Central CAS Google Scholar
Novembre J, Johnson T, Bryc K et al (2008) Genes mirror geography within Europe. Nature 456(7218):98–101
Article PubMed PubMed Central CAS Google Scholar
Yang J, Lee SH, Goddard ME et al (2011) GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88(1):76–82
Article PubMed PubMed Central CAS Google Scholar
Lachin JM, Matts JP, Wei LJ (1988) Randomization in clinical trials: conclusions and recommendations. Control Clin Trials 9(4):365–374
Article PubMed CAS Google Scholar
Altman DG (1991) Randomisation. BMJ 302(6791):1481–1482
Article PubMed PubMed Central CAS Google Scholar
Altman DG, Bland JM (1999) How to randomise. BMJ 319(7211):703–704
Article PubMed PubMed Central CAS Google Scholar
Box GEP, Hunter JS, Hunter WG (2005) Statistics for experimenters : design, innovation, and discovery, 2nd edn. Wiley-Interscience.; xvii, Hoboken, N.J, p 633
Google Scholar
Fisher RA, Bennett JH, Fisher RA et al (1990) Statistical methods, experimental design, and scientific inference. Oxford University Press, Oxford England; New York
Google Scholar
Maxwell SE, Delaney HD (2004) Designing experiments and analyzing data : a model comparison perspective, 2nd edn. Lawrence Erlbaum Associates, Mahwah, N.J
Google Scholar
Montgomery DC (2008) Design and analysis of experiments, 7th edn. Wiley. xvii, Hoboken, NJ, p 656
Google Scholar
Blainey P, Krzywinski M, Altman N (2014) Points of significance: replication. Nat Methods 11(9):879–880
Article PubMed CAS Google Scholar
Dowjat K, Włodarska I (1981) G-banding patterns in mouse lymphoblastic leukemia L1210. J Natl Cancer Inst 66(1):177–182
PubMed CAS Google Scholar
Pepler WJ, Smith M, van Niekerk WA (1968) An unusual karyotype in a patient with signs suggestive of Down's syndrome. J Med Genet 5(1):68–71
Article PubMed PubMed Central CAS Google Scholar
International HapMap Consortium (2003) The international HapMap project. Nature 426(6968):789–796
Article CAS Google Scholar
Conrad DF, Andrews TD, Carter NP et al (2006) A high-resolution survey of deletion polymorphism in the human genome. Nat Genet 38(1):75–81
Article PubMed CAS Google Scholar
McCarroll SA, Hadnott TN, Perry GH et al (2006) Common deletion polymorphisms in the human genome. Nat Genet 38(1):86–92
Article PubMed CAS Google Scholar
Attiyeh EF, Diskin SJ, Attiyeh MA et al (2009) Genomic copy number determination in cancer cells from single nucleotide polymorphism microarrays based on quantitative genotyping corrected for aneuploidy. Genome Res 19(2):276–283
Article PubMed PubMed Central CAS Google Scholar
LaFramboise T, Weir BA, Zhao X et al (2005) Allele-specific amplification in cancer revealed by SNP array analysis. PLoS Comput Biol 1(6):e65
Article PubMed PubMed Central CAS Google Scholar
Coin LJM, Asher JE, Walters RG et al (2010) cnvHap: an integrative population and haplotype-based multiplatform model of SNPs and CNVs. Nat Methods 7(7):541–546
Article PubMed CAS Google Scholar
Colella S, Yau C, Taylor JM et al (2007) QuantiSNP: an objective Bayes hidden-Markov model to detect and accurately map copy number variation using SNP genotyping data. Nucleic Acids Res 35(6):2013–2025
Article PubMed PubMed Central CAS Google Scholar
Wang K, Li M, Hadley D, Liu R et al (2007) PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 17(11):1665–1674
Article PubMed PubMed Central CAS Google Scholar
Illumina. CNVpartition. http://wwwilluminacom/documents/products/technotes/technote_cnv_algorithmspdf
Google Scholar
Carter NP (2007) Methods and strategies for analyzing copy number variation using DNA microarrays. Nat Genet 39(7 Suppl):S16–S21
Article PubMed PubMed Central CAS Google Scholar
Kallioniemi A, Kallioniemi OP et al (1992) Comparative genomic hybridization for molecular cytogenetic analysis of solid tumors. Science 258(5083):818–821
Article PubMed CAS Google Scholar
Redon R, Rigler D, Carter NP (2009) Comparative genomic hybridization: DNA preparation for microarray fabrication. Methods Mol Biol 529:259–266
Article PubMed PubMed Central CAS Google Scholar
Ylstra B, van den Ijssel P, Carvalho B et al (2006) BAC to the future! Or oligonucleotides: a perspective for micro array comparative genomic hybridization (array CGH). Nucleic Acids Res 34(2):445–450
Article PubMed PubMed Central CAS Google Scholar
Curtis C, Lynch AG, Dunning MJ et al (2009) The pitfalls of platform comparison: DNA copy number array technologies assessed. BMC Genomics 10:588
Article PubMed PubMed Central CAS Google Scholar
Pinto D, Darvishi K, Shi X et al (2011) Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants. Nat Biotechnol 29(6):512–520
Article PubMed PubMed Central CAS Google Scholar
Bignell GR, Santarius T, Pole JCM et al (2007) Architectures of somatic genomic rearrangement in human cancer amplicons at sequence-level resolution. Genome Res 17(9):1296–1303
Article PubMed PubMed Central CAS Google Scholar
Pinkel D, Albertson DG (2005) Array comparative genomic hybridization and its applications in cancer. Nat Genet 37(Suppl):S11–S17
Article PubMed CAS Google Scholar
Oostlander AE, Meijer GA, Ylstra B (2004) Microarray-based comparative genomic hybridization and its applications in human genetics. Clin Genet 66(6):488–495
Article PubMed CAS Google Scholar
Shaffer LG, Bejjani BA (2006) Medical applications of array CGH and the transformation of clinical cytogenetics. Cytogenet Genome Res 115(3–4):303–309
Article PubMed CAS Google Scholar
Edelmann L, Hirschhorn K (2009) Clinical utility of array CGH for the detection of chromosomal imbalances associated with mental retardation and multiple congenital anomalies. Ann N Y Acad Sci 1151:157–166
Article PubMed Google Scholar
Boone PM, Bacino CA, Shaw CA et al (2010) Detection of clinically relevant exonic copy-number changes by array CGH. Hum Mutat 31(12):1326–1342
Article PubMed PubMed Central Google Scholar
Goodwin S, McPherson JD, McCombie WR (2016) Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet 17(6):333–351
Article PubMed CAS Google Scholar
Pirooznia M, Goes FS, Zandi PP (2005) Whole-genome CNV analysis: advances in computational approaches. Front Genet 6:138
Google Scholar
Tuzun E, Sharp AJ, Bailey JA et al (2005) Fine-scale structural variation of the human genome. Nat Genet 37(7):727–732
Article PubMed CAS Google Scholar
Ye K, Schulz MH, Long Q et al (2009) Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25(21):2865–2871
Article PubMed PubMed Central CAS Google Scholar
Simpson JT, Wong K, Jackman SD et al (2009) ABySS: a parallel assembler for short read sequence data. Genome Res 19(6):1117–1123
Article PubMed PubMed Central CAS Google Scholar
Li R, Zhu H, Ruan J et al (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20(2):265–272
Article PubMed PubMed Central CAS Google Scholar
Iqbal Z, Caccamo M, Turner I et al (2012) De novo assembly and genotyping of variants using colored de Bruijn graphs. Nat Genet 44(2):226–232
Article PubMed PubMed Central CAS Google Scholar
Simpson JT, Durbin R (2012) Efficient de novo assembly of large genomes using compressed data structures. Genome Res 22(3):549–556
Article PubMed PubMed Central CAS Google Scholar
Abecasis GR, Altshuler D, 1000 Genomes Project Consortium et al (2010) A map of human genome variation from population-scale sequencing. Nature 467(7319):1061–1073
Article PubMed CAS Google Scholar
Mills RE, Walter K, Stewart C et al (2011) Mapping copy number variation by population-scale genome sequencing. Nature 470(7332):59–65
Article PubMed PubMed Central CAS Google Scholar
Wheeler E, Huang N, Bochukova EG et al (2013) Genome-wide SNP and CNV analysis identifies common and low-frequency variants associated with severe early-onset obesity. Nat Genet 45(5):513–517
Article PubMed PubMed Central CAS Google Scholar
Johansson MM, Van Geystelen A, Larmuseau MH et al (2015) Microarray analysis of copy number variants on the human Y chromosome reveals novel and frequent duplications overrepresented in specific haplogroups. PLoS One 10(8):e0137223
Article PubMed PubMed Central CAS Google Scholar
Barnes C, Plagnol V, Fitzgerald T et al (2008) A robust statistical method for case-control association testing with copy number variation. Nat Genet 40(10):1245–1252
Article PubMed PubMed Central CAS Google Scholar
Subirana I, Diaz-Uriarte R, Lucas G, Gonzalez JR (2011) CNVassoc: association analysis of CNV data using R. BMC Med Genet 4:47
Google Scholar
Glessner JT, Li J, Hakonarson H (2013) ParseCNV integrative copy number variation association software with quality tracking. Nucleic Acids Res 41(5):e64
Article PubMed PubMed Central CAS Google Scholar
Mace A, Tuke MA, Beckmann JS et al (2016) New quality measure for SNP array based CNV detection. Bioinformatics 32(21):3298–3305
Article PubMed CAS Google Scholar
Kutalik Z, Johnson T, Bochud M et al (2011) Methods for testing association between uncertain genotypes and quantitative traits. Biostatistics 12(1):1–17
Article PubMed Google Scholar
Ionita-Laza I, Perry GH, Raby BA et al (2008) On the analysis of copy-number variations in genome-wide association studies: a translation of the family-based association test. Genet Epidemiol 32(3):273–284
Article PubMed Google Scholar
Murphy A, Won S, Rogers A et al (2010) On the genome-wide analysis of copy number variants in family-based designs: methods for combining family-based and population-based information for testing dichotomous or quantitative traits, or completely ascertained samples. Genet Epidemiol 34(6):582–590
Article PubMed PubMed Central Google Scholar
Zanda M, Onengut S, Walker N et al (2012) Validity of the family-based association test for copy number variant data in the case of non-linear intensity-genotype relationship. Genet Epidemiol 36(8):895–898
PubMed PubMed Central Google Scholar
Zanda M, Onengut-Gumuscu S, Walker N et al (2014) A genome-wide assessment of the role of untagged copy number variants in type 1 diabetes. PLoS Genet 10(5):e1004367
Article PubMed PubMed Central CAS Google Scholar
McCarroll SA, Kuruvilla FG, Korn JM et al (2008) Integrated detection and population-genetic analysis of SNPs and copy number variation. Nat Genet 40(10):1166–1174
Article PubMed CAS Google Scholar
Greenman CD, Bignell G, Butler A et al (2010) PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data. Biostatistics 11(1):164–175
Article PubMed Google Scholar
Van Loo P, Nordgard SH, Lingjærde OC et al (2010) Allele-specific copy number analysis of tumors. Proc Natl Acad Sci U S A 107(39):16910–16915
Article PubMed PubMed Central Google Scholar
Locke AE, Kahali B, Berndt SI et al (2015) Genetic studies of body mass index yield new insights for obesity biology. Nature 518(7538):197–206
Article PubMed PubMed Central CAS Google Scholar
Wood AR, Esko T, Yang J et al (2014) Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet 46(11):1173–1186
Article PubMed PubMed Central CAS Google Scholar
Voight BF, Kang HM, Ding J et al (2012) The Metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits. PLoS Genet 8(8):e1002793
Article PubMed PubMed Central CAS Google Scholar
Feng S, Liu D, Zhan X et al (2014) RAREMETAL: fast and powerful meta-analysis for rare variants. Bioinformatics 30(19):2828–2829
Article PubMed PubMed Central CAS Google Scholar
Wu MC, Lee S, Cai T et al (2011) Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 89(1):82–93
Article PubMed PubMed Central CAS Google Scholar
Zhan X, Girirajan S, Zhao N et al (2016) A novel copy number variants kernel association test with application to autism spectrum disorders studies. Bioinformatics 32(23):3603–3610
PubMed PubMed Central CAS Google Scholar
Gao X, Starmer J, Martin ER (2008) A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet Epidemiol 32(4):361–369
Article PubMed Google Scholar
Walters RG, Jacquemont S, Valsesia A et al (2010) A new highly penetrant form of obesity due to deletions on chromosome 16p11.2. Nature 463(20130649):671–675
Article PubMed PubMed Central CAS Google Scholar
Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55(4):997–1004
Article PubMed CAS Google Scholar
Kang HM, Sul JH, Service SK et al (2010) Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42(4):348–354
Article PubMed PubMed Central CAS Google Scholar
Loh PR, Tucker G, Bulik-Sullivan BK et al (2015) Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat Genet 47(3):284–290
Article PubMed PubMed Central CAS Google Scholar
Clevert DA, Mitterecker A, Mayr A et al (2011) Cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate. Nucleic Acids Res 39(12):e79
Article PubMed PubMed Central CAS Google Scholar
Klambauer G, Schwarzbauer K, Mayr A et al (2012) Cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate. Nucleic Acids Res 40(9):e69
Article PubMed PubMed Central CAS Google Scholar
Cardon LR, Palmer LJ (2003) Population stratification and spurious allelic association. Lancet 361(9357):598–604
Article PubMed Google Scholar
Rosenberg NA, Huang L, Jewett EM et al (2010) Genome-wide association studies in diverse populations. Nat Rev Genet 11(5):356–366
Article PubMed PubMed Central CAS Google Scholar
Cheverud JM (2001) A simple correction for multiple comparisons in interval mapping genome scans. Heredity (Edinb) 87(Pt 1):52–58
Article CAS Google Scholar
Nyholt DR (2004) A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Hum Genet 74(4):765–769
Article PubMed PubMed Central CAS Google Scholar
Stuppia L, Antonucci I, Palka G et al (2012) Use of the MLPA assay in the molecular diagnosis of gene copy number alterations in human genetic diseases. Int J Mol Sci 13(3):3245–3276
Article PubMed PubMed Central CAS Google Scholar
Hupe P, Stransky N, Thiery JP et al (2004) Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics 20(18):3413–3422
Article PubMed CAS Google Scholar
Bengtsson H, Irizarry R, Carvalho B et al (2008) Estimation and assessment of raw copy numbers at the single locus level. Bioinformatics 24(6):759–767
Article PubMed CAS Google Scholar
Pique-Regi R, Monso-Varona J, Ortega A et al (2008) Sparse representation and Bayesian detection of genome copy number alterations from microarray data. Bioinformatics 24(3):309–318
Article PubMed PubMed Central CAS Google Scholar
Olshen AB, Venkatraman ES, Lucito R et al (2004) Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5(4):557–572
Article PubMed Google Scholar
Chen K, Wallis JW, McLellan MD, Larson DE, Kalicki JM, Pohl CS et al (2009) BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods 6(9):677–681
Article PubMed PubMed Central CAS Google Scholar
Korbel JO, Abyzov A, Mu XJ, Carriero N, Cayting P, Zhang Z et al (2009) PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Genome Biol 10(2):R23
Article PubMed PubMed Central CAS Google Scholar
Lee WP, Stromberg MP, Ward A et al (2014) MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping. PLoS One 9(3):e90581
Article PubMed PubMed Central CAS Google Scholar
Hormozdiari F, Alkan C, Eichler EE et al (2009) Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes. Genome Res 19(7):1270–1278
Article PubMed PubMed Central CAS Google Scholar
Korbel JO, Urban AE, Affourtit JP et al (2007) Paired-end mapping reveals extensive structural variation in the human genome. Science 318(5849):420–426
Article PubMed PubMed Central CAS Google Scholar
Lee S, Hormozdiari F, Alkan C et al (2009) Detecting small indels from clone-end sequencing with mixtures of distributions. Nat Methods 6(7):473–474
Article PubMed CAS Google Scholar
Campbell PJ, Stephens PJ, Pleasance ED et al (2008) Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat Genet 40(6):722–729
Article PubMed PubMed Central CAS Google Scholar
Chiang DY, Getz G, Jaffe DB et al (2009) High-resolution mapping of copy-number alterations with massively parallel sequencing. Nat Methods 6(1):99–103
Article PubMed CAS Google Scholar
Li X, Chen S, Xie W et al (2014) PSCC: sensitive and reliable population-scale copy number variation detection method based on low coverage sequencing. PLoS One 9(1):e85096
Article PubMed PubMed Central CAS Google Scholar
Wang H, Nettleton D, Ying K (2014) Copy number variation detection using next generation sequencing read counts. BMC Bioinformatics 15:109
Article PubMed PubMed Central Google Scholar
Alkan C, Kidd JM, Marques-Bonet T et al (2009) Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet 41(10):1061–1067
Article PubMed PubMed Central CAS Google Scholar
Yoon S, Xuan Z, Makarov V et al (2009) Sensitive and accurate detection of copy number variants using read depth of coverage. Genome Res 19(9):1586–1592
Article PubMed PubMed Central CAS Google Scholar
Nguyen HT, Merriman TR, Black MA (2014) The CNVrd2 package: measurement of copy number at complex loci using high-throughput sequencing data. Front Genet 5:248
Article PubMed PubMed Central CAS Google Scholar
Lin K, Smit S, Bonnema G et al (2015) Making the difference: integrating structural variation detection tools. Brief Bioinform 16(5):852–864
Article PubMed Google Scholar
Schroder J, Hsu A, Boyle SE et al (2014) Socrates: identification of genomic rearrangements in tumour genomes by re-aligning soft clipped reads. Bioinformatics 30(8):1064–1072
Article PubMed PubMed Central CAS Google Scholar
Trappe K, Emde AK, Ehrlich HC et al (2014) Detecting and correctly classifying SVs in the NGS twilight zone. Bioinformatics 30(24):3484–3490
Article PubMed CAS Google Scholar
Jiang Y, Wang Y, Brudno M (2012) PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants. Bioinformatics 28(20):2576–2583
Article PubMed CAS Google Scholar
Zhang ZD, Du J, Lam H et al (2011) Identification of genomic indels and structural variations using split reads. BMC Genomics 12:375
Article PubMed PubMed Central Google Scholar
Simpson JT, Durbin R (2010) Efficient construction of an assembly string graph using the FM-index. Bioinformatics 26(12):i367–i373
Article PubMed PubMed Central CAS Google Scholar
Massouras A, Hens K, Gubelmann C et al (2010) Primer-initiated sequence synthesis to detect and assemble structural variants. Nat Methods 7(7):485–486
Article PubMed CAS Google Scholar
Medvedev P, Fiume M, Dzamba M et al (2010) Detecting copy number variation with mated short reads. Genome Res 20(11):1613–1622
Article PubMed PubMed Central CAS Google Scholar
Marschall T, Hajirasouliha I, Schonhuth A (2013) MATE-CLEVER: Mendelian-inheritance-aware discovery and genotyping of midsize and long indels. Bioinformatics 29(24):3143–3150
Article PubMed PubMed Central CAS Google Scholar
Zhang J, Wu Y (2011) SVseq: an approach for detecting exact breakpoints of deletions with low-coverage sequence data. Bioinformatics 27(23):3228–3234
Article PubMed CAS Google Scholar
Quinlan AR, Clark RA, Sokolova S et al (2010) Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome. Genome Res 20(5):623–635
Article PubMed PubMed Central CAS Google Scholar
Hajirasouliha I, Hormozdiari F, Alkan C et al (2010) Detection and characterization of novel sequence insertions using paired-end next-generation sequencing. Bioinformatics 26(10):1277–1283
Article PubMed PubMed Central CAS Google Scholar
Jiang Y, Oldridge DA, Diskin SJ et al (2015) CODEX: a normalization and copy number variation detection method for whole exome sequencing. Nucleic Acids Res 43(6):e39
Article PubMed PubMed Central CAS Google Scholar
Bansal V, Dorn C, Grunert M et al (2014) Outlier-based identification of copy number variations using targeted resequencing in a small cohort of patients with tetralogy of Fallot. PLoS One 9(1):e85375
Article PubMed PubMed Central CAS Google Scholar
Magi A, Tattini L, Cifola I et al (2013) EXCAVATOR: detecting copy number variants from whole-exome sequencing data. Genome Biol 14(10):R120
Article PubMed PubMed Central Google Scholar
Coin LJ, Cao D, Ren J et al (2012) An exome sequencing pipeline for identifying and genotyping common CNVs associated with disease with application to psoriasis. Bioinformatics 28(18):i370–i3i4
Article PubMed PubMed Central CAS Google Scholar
Fromer M, Moran JL, Chambert K et al (2012) Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. Am J Hum Genet 91(4):597–607
Article PubMed PubMed Central CAS Google Scholar
Krumm N, Sudmant PH, Ko A et al (2012) Copy number variation detection and genotyping from exome sequence data. Genome Res 22(8):1525–1532
Article PubMed PubMed Central CAS Google Scholar
Plagnol V, Curtis J, Epstein M et al (2012) A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics 28(21):2747–2754
Article PubMed PubMed Central CAS Google Scholar
Korn JM, Kuruvilla FG, McCarroll SA et al (2008) Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nat Genet 40(10):1253–1260
Article PubMed PubMed Central CAS Google Scholar
Purcell S, Neale B, Todd-Brown K et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81(3):559–575
Article PubMed PubMed Central CAS Google Scholar
Palta P, Kaplinski L, Nagirnaja L et al (2015) Haplotype phasing and inheritance of copy number variants in nuclear families. PLoS One 10(4):e0122713
Article PubMed PubMed Central CAS Google Scholar
Chettier R, Ward K, Albertsen HM (2014) Endometriosis is associated with rare copy number variants. PLoS One 9(8):e103968
Article PubMed PubMed Central CAS Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Social and Preventive Medicine, University Hospital of Lausanne, Lausanne, Switzerland
Aurélien Macé & Zoltán Kutalik
Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Aurélien Macé
Swiss Institute of Bioinformatics, Lausanne, Switzerland
Aurélien Macé & Zoltán Kutalik
Nestlé Institute of Health Sciences, Lausanne, Switzerland
Armand Valsesia

Authors

Aurélien Macé
View author publications
You can also search for this author in PubMed Google Scholar
Zoltán Kutalik
View author publications
You can also search for this author in PubMed Google Scholar
Armand Valsesia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Armand Valsesia .

Editor information

Editors and Affiliations

Department of Hygiene and Epidemiology, University of Ioannina Medical School, Ioannina, Greece
Evangelos Evangelou

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Macé, A., Kutalik, Z., Valsesia, A. (2018). Copy Number Variation. In: Evangelou, E. (eds) Genetic Epidemiology. Methods in Molecular Biology, vol 1793. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7868-7_14

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7868-7_14
Published: 07 June 2018
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7867-0
Online ISBN: 978-1-4939-7868-7
eBook Packages: Springer Protocols

Publish with us

Policies and ethics