Abstract
Copy number variation is known to be an important component of structural variation in the human genome. Greater than 1 kb in size, these gains and losses of genetic material are known to confer risk to many human diseases, both Mendelian and complex. Therefore, the technologies used to detect copy number variation have been quickly improving in both throughput and cost. From comparative genomic hybridization to synthetic high-density oligonucleotide arrays to next-generation sequencing methods, algorithms used to estimate copy number are plentiful. Here we describe a practical introduction to the copy number variation technology and available analysis methods, and demonstrate the analysis flow on an example case.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Beckmann, J. S., Shapr, A. J., and Antonarakis, S.E. (2008) CNVs and genetic medicine (excitement and consequences of a rediscovery). Cytogenet Genome Res 123, 7–16.
Redon, R., Ishikawa, S., Fitch, K. R., Feuk, L., Perry, G. H., Andrews, T. D., Fiegler, H., Shapero, M. H., Carson, A. R., Chen, W., Cho, E. K., Dallaire, S., Freeman, J. L., Gonza’lez, J. R., Grataco’s, M. N., Huang, J., Kalaitzopoulos, D., Komura, D., MacDonald, J. R., Marshall, C. R., Mei, R., Montgomery, L., Nishimura, K., Okamura, K., Shen, F., Somerville, M. J., Tchinda, J., Valsesia, A., Woodwark, C., Yang, F., Zhang, J., Zerjal, T., Zhang, J., Armengol, L., Conrad, D. F., Estivill, X., Tyler-Smith, C., Carter, N. P., Aburatani, H., Lee, C., Jones, K. W., Scherer, S. W., and Hurles, M. E. (2006) Global variation in copy number in the human genome. Nature 444, 444–54.
Sharp, A. J., Cheng, Z., and Eichler, E. E. (2006) Structural variation of the human genome. Ann Rev Genomics Hum Genet 7, 407–42.
Iafrate, A. J., Feuk, L., Rivera, M. N., Listewnik, M. L., Donahoe, P. K., Qi, Y., Scherer, S. W., Lee, C. (2004) Detection of large-scale variation in the human genome. Nat Genet 36, 949–51.
Hastings, P. J., Lupski, J. R., Rosenberg, S. M., and Ira, G. (2009) Mechanisms of change in gene copy number. Nat Rev Genet 10, 551–64.
Shaikh, T. H., Gai, X., Perin, J. C., et al. (2009) High-resolution mapping and analysis of copy number variations in the human genome: a data resource for clinical and research applications. Genome Res 19, 1682–90.
Itsara, A., Cooper, G. M., Baker, C., Girirajan, S., Li, J., Absher, D., Krauss, R. M., Myers, R. M., Ridker, P. M., Chasman, D. I., Mefford, H., Ying, P., Nickerson, D. A., and Eichler, E. E. (2009) Population analysis of large copy number variants and hotspots of human genetic disease. Am J Hum Genet 84, 148–61.
Takahashi, N., Satoh, Y., Kodaira, M., and Katayame, H. (2008) Large-scale copy number variants (CNVs) detected in different ethnic human populations. Cytogenet Genome Res 123, 224–33.
Jakobsson, M., Scholz, S. W., Scheet, P., Gibbs, J. R., VanLiere, J. M., Fung, H.-C., Szpiech, Z. A., Degnan, J. H., Wang, K., Guerreiro, R., Bras, J. M., Schymick, J. C., Hernandez, D. G., Traynor, B. J., Simon-Sanchez, J., Matarin, M., Britton, A., van de Leemput, J., Rafferty, I., Bucan, M., Cann, H. M., Hardy, J. A., Rosenberg, N. A., and Singleton, A. B. (2008) Genotype, haplotype, and copy-number variation in worldwide human populations. Nature 451, 998–1003.
Conrad, D. F., Pinto, D., Redon, R., Feuk, L., Gokcumen, O., Zhang, Y., Aerts, J., Andrews, T. D., Barnes, C., Campbell, P., Fitzgerald, T., Hu, M., Ihm, C. H., Kristiansson, K., MacArthur, D. G., MacDonald, J. R., Onyiah, I., Wing, A., Pang, C., Robson, S., Stirrups, K., Valsesia, A., Walter, K., Wei, J., Tyler-Smith, C., Carter, N. P., Lee, C., Scherer, S. W., and Hurles, M. E. (2009) Origins and functional impact of copy number variation in the human genome. Nature 464, 704–12.
Shlien, A., and Malkin, D. (2009) Copy number variations and cancer. Genome Med 1, 62.
Kusenda, M., and Sebat, J. (2008) The role of rare structural variants in the genetics of autism spectrum disorders. Cytogenet Genome Res 123, 36–43.
Lachman, H. M. (2008) Copy variations in schizophrenia and bipolar disorder. Cytogenet Genome Res 123, 27–35.
Bauters, M., Weuts, A., Vandewalle, M., Nevelsteen, J., Marynen, P., Esch, H. V., and Froyen, G. (2008) Detection and validation of copy number variation in X-linked mental retardation. Cytogenet Genome Res 123, 44–53.
Ptacek, T., Li, X., Kelley, J. M., and Edberg, J. C. (2008) Copy number variants in genetic susceptibility and severity of systemic lupus erythematosus. Cytogenet Genome Res 123, 142–47.
Nakajima, T., Kaur, G., Mehra, N., and Kimura, A. (2008) HIV-1/AIDS susceptibility and copy number variation in CCL3L1, a gene encoding a natural ligand for HIV-1 co-receptor CCR5. Cytogenet Genome Res 123, 156–60.
Diskin, S. J., Hou, C., Glessner, J. T., Attiyeh, E. F., Laudenslager, M., Bosse, K., Cole, K., Mosse’, Y. P., Wood, A., Lynch, J. E., Pecor, K., Diamond, M., Winter, C., Wang, K., Kim, C., Geiger, E. A., McGrady, P. W., Blakemore, A. I. F., London, W. B., Shaikh, T. H., Bradfield, J., Grant, S. F. A., Li, H., Devoto, M., Rappaport, E. R., Hakonarson, H., and Maris, J. M. (2009) Copy number variation at 1q21.1 associated with neuroblastoma. Nature 459, 987–92.
Vandepoele, K., Roy, N. V., Staes, K., Speleman, F., and van Roy, F. (2005) A novel gene family NBPF: intricate structure generated by gene duplications during primate evolution. Mol Biol Evol 22, 2265–74.
Kallioniemi, O., Kallioniemi, A., Sudar, D., Rutovitz, D., Gray, J., Waldman, F., and Pinkel, D. (1993) Comparative genomic hybridization: a rapid new method for detecting and mapping DNA amplifications in tumors. Semin Cancer Biol 4, 41–6.
Chueng, V. (2001) Integration of cytogenetic landmarks into the draft sequence of the human genome. Nature 409, 953–8.
Snijders, A., Nowak, N., and Segraves, R. (2001) Assembly of microarrays for genome-wide measurement of DNA copy number. Nat Genet 29, 263–4.
Pinkel, D., Segraves, R., Sudar, D., Clark, S., Poole, I., Kowbel, D., Collins, C., Kuo, W.-L., Chen, C., Zhai, Y., Dairkee, S. H., Ljung, B.-M., Gray, J. W., and Albertson, D. G. (1998) High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nat Genet 20, 207–11.
Carvalho, B., Ouwerkerk, E., Meijer, G. A., and Ylstra, B. (2004) High resolution microarray comparative genomic hybridisation analysis using spotted oligonucleotides. J Clin Pathol 57, 644–6.
Bengtsson, H., Wirapati, P., and Speed, T. P. (2009) A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6. Bioinformatics 25, 2149–56.
Margulies, M., Egholm, M., Altman, W. E., Attiya, S., Bader, J. S., Bemben, L. A., Berka, J., Braverman, M. S., Chen, Y.-J., Chen, Z., Dewell, S. B., Du, L., Fierro, J. M., Gomes, X. V., Goodwin, B. C., He, W., Helgesen, S., Ho, C. H., Irzyk, G. P., Jando, S. C., Alenquer, M. L. I., Jarvie, T. P., Jirage, K. B., Kim, J.-B., Knight, J. R., Lanza, J. R., Leamon, J. H., Lefkowitz, S. M., Lei, M., Li, J., Lohman, K. L., Lu, H., Makhijani, V. B., McDade, K. E., McKenna, M. P., Myers, E. W., Nickerson, E., Nobile, J. R., Plant, R., Puc, B. P., Ronan, M. T., Roth, G. T., Sarkis, G. J., Simons, J. F., Simpson, J. W., Srinivasan, M., Tartaro, K. R., Tomasz, A., Vogt, K. A., Volkmer, G. A., Wang, S. H., and Wang, Y. (2005) Genome sequencing in open microfabricated high density picoliter reactors. Nature 437, 376–80.
Bentley, D. R. (2006) Whole-genome re-sequencing. Curr Opin Genet Dev 16, 545–52.
Valouev, A., Ichikawa, J., Tonthat, T., Stuart, J., Ranade, S., Peckham, H., Zeng, K., Malek, J. A., Costa, G., McKernan, K., Sidow, A., Fire, A., and Johnson, S. M. (2008) A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. Genome Res 18, 1051–63.
Xie, C., and Tammi, M. T. (2009) CNV-seq, a new method to detect copy number variation using high-throughput sequencing. BMC Bioinform 10, 80–9.
Yoon, S., Xuan, Z., and Makarov, V. (2009) Sensitive and accurate detection of copy number variants using read depth of coverage. Genome Res 19, 1586–92.
Bengtsson, H., Ray, A., Spellman, P., and Speed, T. P. (2009) A single-sample method for normalizing and combining full-resolution copy numbers from multiple platforms, labs and analysis methods. Bioinformatics 25, 1223–30.
Pique-Regi, R., Ortega, A., and Asgharzadeh, S. (2009) Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA. Bioinformatics 25, 1223–30.
Irizarry, R. A., Hobbs, B., Collin, F., Beazer-Barclay, Y. D., Antonellis, K. J., Scherf, U., and Speed, T. P. (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249–64.
Irizarry, R. A., Bolstad, B. M., Collin, F., Cope, L. M., Hobbs, B., and Speed, T. P. (2003) Summaries of Affymetrix GeneÂChip probe level data. Nucleic Acids Res 31, e15.
Bengsston, H., Irizarry, R., Carvalho, B., and Speed, T. P. (2008) Estimation and assessment of raw copy numbers at the single locus level. Bioinformatics 24, 759–67.
Rueda, O. M., and Diaz-Uriarte, R. (2010) Finding recurrent copy number alteration regions: a review of methods. Curr Bioinform 5, 1–17.
Pique-Regi, R., Tsau, E., Ortega, A., Seeger, R. C., and Asgharzadeh, S. (2007) Wavelet footprints and sparse Bayesian learning for DNA copy number change analysis. IEEE Proc ICASSP.
Pique-Regi, R., Ortega, A., Triche, T. J., Seeger, R. C., and Asgharzadeh, S. (2008) Sparse representation and Bayesian detection of genome copy number alterations from microarray data. Bioinformatics 24, 309–18.
Wang, K., Li, M., Hadley, D., Liu, R., Glessner, J., Grant, S. F. A., Hakonarson, H., and Bucan, M. (2007) PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 17, 1665–74.
McElroy, J. P., Nelson, M. R., Caillier, S. J., and Oksenberg, J. R. (2009) Copy number variation in African Americans. BMC Genet 10, 15.
Lee, C., Iafrate, A. J., and Brothman, A. R. (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39, S48–54.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Warden, M., Pique-Regi, R., Ortega, A., Asgharzadeh, S. (2011). Bioinformatics for Copy Number Variation Data. In: Mayer, B. (eds) Bioinformatics for Omics Data. Methods in Molecular Biology, vol 719. Humana Press. https://doi.org/10.1007/978-1-61779-027-0_11
Download citation
DOI: https://doi.org/10.1007/978-1-61779-027-0_11
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-026-3
Online ISBN: 978-1-61779-027-0
eBook Packages: Springer Protocols