Abstract
Over the last decade, genetic studies have identified numerous associations between single nucleotide polymorphism (SNP) alleles in the human genome and important human diseases. Unfortunately, extending these initial associative findings to identification of the true causal variants that underlie disease susceptibility is usually not a straightforward task. Causal variant identification typically involves searching through sizable regions of genomic DNA in the vicinity of disease-associated SNPs for sequence variants in functional elements including protein coding, regulatory, and structural sequences. Prioritization of these searches is greatly aided by knowledge of the location of functional sequences in the human genome. This chapter briefly reviews several of the common approaches used to functionally annotate the human genome and discusses how this information can be used in concert with the emerging technology of next generation high-throughput sequencing to identify causal variants of human disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Altshuler D, Daly MJ, Lander ES (2008) Genetic mapping in human disease. Science 322:881–888
Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotechnol 26:1135–1145
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J et al (2001) Initial sequencing and analysis of the human genome. Nature 409:860–921
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG et al (2001) The sequence of the human genome. Science 291:1304–1351
Pennisi E (2007) Genetics. Working the (gene count) numbers: finally, a firm answer? Science 316:1113
Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD, Amanatides PG et al (2000) The genome sequence of Drosophila melanogaster. Science 287:2185–2195
Mural RJ, Adams MD, Myers EW, Smith HO, Miklos GL, Wides R et al (2002) A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome. Science 296:1661–1671
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P et al (2002) Initial sequencing and comparative analysis of the mouse genome. Nature 420:520–562
The Chimpanzee Sequencing and Analysis Consortium (2005) Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437:69–87
Pennacchio LA, Ahituv N, Moses AM, Prabhakar S, Nobrega MA, Shoukry M et al (2006) In vivo enhancer analysis of human conserved non-coding sequences. Nature 444:499–502
Bejerano G, Pheasant M, Makunin I, Stephen S, Kent WJ, Mattick JS et al (2004) Ultraconserved elements in the human genome. Science 304:1321–1325
Kapranov P, Willingham AT, Gingeras TR (2007) Genome-wide transcription and the implications for genomic organization. Nat Rev Genet 8:413–423
Hunkapiller T, Kaiser RJ, Koop BF, Hood L (1991) Large-scale and automated DNA sequence determination. Science 254:59–67
Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA et al (1977) Nucleotide sequence of bacteriophage phi X174 DNA. Nature 265:687–695
Cohen JC, Kiss RS, Pertsemlidis A, Marcel YL, McPherson R, Hobbs HH (2004) Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Science 305:869–872
Wang WY, Barratt BJ, Clayton DG, Todd JA (2005) Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet 6:109–118
Porreca GJ, Zhang K, Li JB, Xie B, Austin D, Vassallo SL et al (2007) Multiplex amplification of large sets of human exons. Nat Methods 4:931–936
Okou DT, Steinberg KM, Middle C, Cutler DJ, Albert TJ, Zwick ME (2007) Microarray-based genomic selection for high-throughput resequencing. Nat Methods 4:907–909
Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X et al (2007) Direct selection of human genomic loci by Âmicroarray hybridization. Nat Methods 4:903–905
Turner EH, Lee C, Ng SB, Nickerson DA, Shendure J (2009) Massively parallel exon capture and library-free resequencing across 16 genomes. Nat Methods 6:315–316
Craig DW, Pearson JV, Szelinger S, Sekar A, Redman M, Corneveaux JJ et al (2008) Identification of genetic variants using bar-coded multiplexed sequencing. Nat Methods 5:887–893
Li H, Ruan J, Durbin R (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res 18:1851–1858
Ramensky V, Bork P, Sunyaev S (2002) Human non-synonymous SNPs: server and survey. Nucleic Acids Res 30:3894–3900
Wang QF, Prabhakar S, Wang Q, Moses AM, Chanan S, Brown M et al (2006) Primate-specific evolution of an LDLR enhancer. Genome Biol 7:R68
Nejentsev S, Walker N, Riches D, Egholm M, Todd JA (2009) Rare variants of IFIH1, a gene implicated in antiviral responses, protect against type 1 diabetes. Science 324:387–389
Ji W, Foo JN, O’Roak BJ, Zhao H, Larson MG, Simon DB et al (2008) Rare independent mutations in renal salt handling genes contribute to blood pressure variation. Nat Genet 40:592–599
Romeo S, Pennacchio LA, Fu Y, Boerwinkle E, Tybjaerg-Hansen A, Hobbs HH et al (2007) Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL. Nat Genet 39:513–516
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Kingsley, C.B. (2011). Identification of Causal Sequence Variants of Disease in the Next Generation Sequencing Era. In: DiStefano, J. (eds) Disease Gene Identification. Methods in Molecular Biology, vol 700. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-61737-954-3_3
Download citation
DOI: https://doi.org/10.1007/978-1-61737-954-3_3
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-61737-953-6
Online ISBN: 978-1-61737-954-3
eBook Packages: Springer Protocols