SNP Discovery and Genotyping for Evolutionary Genetics Using RAD Sequencing

  • Paul D. Etter
  • Susan Bassham
  • Paul A. Hohenlohe
  • Eric A. Johnson
  • William A. Cresko
Part of the Methods in Molecular Biology book series (MIMB, volume 772)

Abstract

Next-generation sequencing technologies are revolutionizing the field of evolutionary biology, opening the possibility for genetic analysis at scales not previously possible. Research in population genetics, quantitative trait mapping, comparative genomics, and phylogeography that was unthinkable even a few years ago is now possible. More importantly, these next-generation sequencing studies can be performed in organisms for which few genomic resources presently exist. To speed this revolution in evolutionary genetics, we have developed Restriction site Associated DNA (RAD) genotyping, a method that uses Illumina next-generation sequencing to simultaneously discover and score tens to hundreds of thousands of single-nucleotide polymorphism (SNP) markers in hundreds of individuals for minimal investment of resources. In this chapter, we describe the core RAD-seq protocol, which can be modified to suit a diversity of evolutionary genetic questions. In addition, we discuss bioinformatic considerations that arise from unique aspects of next-generation sequencing data as compared to traditional marker-based approaches, and we outline some general analytical approaches for RAD-seq and similar data. Despite considerable progress, the development of analytical tools remains in its infancy, and further work is needed to fully quantify sampling variance and biases in these data types.

Key words

Genetic mapping Population genetics Genomics Evolution Genotyping Single-Nucleotide Polymorphisms Next-generation sequencing RAD-seq 

Notes

Acknowledgments

The authors thank the University of Oregon researchers who, over the past 3 years, have helped troubleshoot many preliminary ­versions of this protocol. This work was funded by grants from the National Institutes of Health (1R24GM079486-01A1 and Ruth L. Kirschstein National Research Service Award F32 GM078949) and the National Science Foundation (IOS-0642264 and DEB-0919090).

References

  1. 1.
    Mardis ER (2008) The impact of next-­generation sequencing technology on genetics. Trends Genet 24:133141Google Scholar
  2. 2.
    Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotech 26:1135–1145CrossRefGoogle Scholar
  3. 3.
    Asmann YW, Wallace MB, Thompson EA (2008) Transcriptome profiling using next-generation sequencing. Gastroenterology 135:1466–1468PubMedCrossRefGoogle Scholar
  4. 4.
    Marguerat S, Wilhelm BT, Bahler J (2008) Next-generation sequencing: applications beyond genomes. Biochem Soc Trans 36:1091–1096PubMedCrossRefGoogle Scholar
  5. 5.
    Mortazavi A, Williams BA, McCue K et al (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5:621–628PubMedCrossRefGoogle Scholar
  6. 6.
    Rokas A, Abbot P (2009) Harnessing genomics for evolutionary insights. Trends Ecol Evol 24:192–200PubMedCrossRefGoogle Scholar
  7. 7.
    Mardis ER (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet 9:387–402PubMedCrossRefGoogle Scholar
  8. 8.
    Van Tassell CP, Smith TP, Matukumalli LK et al (2008) SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nat Methods 5:247–252PubMedCrossRefGoogle Scholar
  9. 9.
    Baird NA, Etter PD, Atwood TS et al (2008) Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 3:e3376PubMedCrossRefGoogle Scholar
  10. 10.
    Hohenlohe P, Bassham S, Stiffler N et al (2010) Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet 6:e1000862PubMedCrossRefGoogle Scholar
  11. 11.
    Emerson KJ, Merz CR, Catchen JM et al (2010) Resolving post-glacial phylogeography using high throughput sequencing. Proc Natl Acad Sci USA 107:16196200Google Scholar
  12. 12.
    Gompert Z, Lucas LK, Fordyce JA et al (2010) Secondary contact between Lycaeides idas and L. melissa in the Rocky Mountains: extensive admixture and a patchy hybrid zone. Mol Ecol 19:3171–3192PubMedCrossRefGoogle Scholar
  13. 13.
    Coyne KJ, Burkholder JM, Feldman RA et al (2004) Modified serial analysis of gene expression method for construction of gene expression profiles of microbial eukaryotic species. Appl Environ Microbiol 70:5298–5304PubMedCrossRefGoogle Scholar
  14. 14.
    Quail MA, Kozarewa I, Smith F et al (2008) A large genome center’s improvements to the Illumina sequencing system. Nat Methods 5:1005–1010PubMedCrossRefGoogle Scholar
  15. 15.
    Lynch M (2009) Estimation of allele frequencies from high-coverage genome-sequencing projects. Genetics 18:295–301CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2012

Authors and Affiliations

  • Paul D. Etter
    • 1
  • Susan Bassham
    • 2
  • Paul A. Hohenlohe
    • 2
  • Eric A. Johnson
    • 1
  • William A. Cresko
    • 2
  1. 1.Institute of Molecular BiologyUniversity of OregonEugeneUSA
  2. 2.Center for Ecology and Evolutionary BiologyUniversity of OregonEugeneUSA

Personalised recommendations