Abstract
Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with relatively small effect sizes that explain only a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1–5%) and rare variants that are not included on commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on the risk of developing human diseases or on disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, detecting a significant rare-variant association requires next-generation sequencing (NGS) studies in a large population (>4,000 samples). Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (e.g., most sequencing kits provide only 96 distinct indices). Additionally, sequencing library preparation for 4,000 samples remains expensive, so conducting NGS studies with the aforementioned methods is not feasible for most research laboratories.
The need for low-cost, large-scale rare-variant detection makes pooled-DNA sequencing an efficient and cost-effective technique for identifying rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions across multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels of up to 4 base pairs (bp) with high sensitivity and specificity (down to a single haploid allele in a pool of up to 500 individuals). This chapter provides step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses.
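As a rough illustration of the arithmetic implied by the figures above (30× coverage per haploid genome, pools of up to 500 individuals), the following sketch computes the total per-base coverage a pool would need and the frequency at which a single variant allele would appear in it. The function name `pool_requirements` is hypothetical, and the assumptions that samples are diploid and pooled in equimolar amounts are ours for illustration; this is not part of SPLINTER.

```python
def pool_requirements(n_individuals, coverage_per_haploid=30.0, ploidy=2):
    """Back-of-the-envelope numbers for a pooled-DNA sequencing design.

    Assumes (for illustration only) that every individual contributes
    an equal amount of DNA to the pool and that samples are diploid.

    Returns a tuple:
      (haploid genomes in the pool,
       total per-base coverage needed to reach the per-haploid target,
       pool frequency of a variant carried on exactly one haploid genome)
    """
    if n_individuals <= 0:
        raise ValueError("n_individuals must be positive")
    n_haploid = n_individuals * ploidy
    total_coverage = coverage_per_haploid * n_haploid
    singleton_frequency = 1.0 / n_haploid
    return n_haploid, total_coverage, singleton_frequency


if __name__ == "__main__":
    n_hap, total_cov, freq = pool_requirements(500)
    print(f"haploid genomes in pool: {n_hap}")
    print(f"total per-base coverage: {total_cov:.0f}x")
    print(f"singleton allele frequency: {freq:.4f}")
```

Under these assumptions, a pool of 500 diploid individuals contains 1,000 haploid genomes, so a variant present on a single chromosome sits at a pool frequency of 0.1% and is expected to be supported, on average, by about 30 reads when the total per-base coverage is 30,000×.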
Key words
- Next-generation sequencing
- NGS
- Rare variants
- Alzheimer’s disease
- Pooled-DNA sequencing
References
Gatz M, Reynolds CA, Fratiglioni L et al (2006) Role of genes and environments for explaining Alzheimer disease. Arch Gen Psychiatry 63:168–174
Harold D, Abraham R, Hollingworth P et al (2009) Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer’s disease. Nat Genet 41:1088–1093
Naj AC, Jun G, Beecham GW et al (2011) Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer’s disease. Nat Genet 43:436–441
Hollingworth P, Harold D, Sims R et al (2011) Common variants at ABCA7, MS4A6A/MS4A4E, EPHA1, CD33 and CD2AP are associated with Alzheimer’s disease. Nat Genet 43:429–435
Lambert JC, Heath S, Even G et al (2009) Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer’s disease. Nat Genet 41:1094–1099
So HC, Gui AH, Cherny SS et al (2011) Evaluating the heritability explained by known susceptibility variants: a survey of ten complex diseases. Genet Epidemiol 35:310–317
Bodmer W, Bonilla C (2008) Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet 40:695–701
Schork NJ, Murray SS, Frazer KA et al (2009) Common vs. rare allele hypotheses for complex diseases. Curr Opin Genet Dev 19:212–219
Pritchard JK (2001) Are rare variants responsible for susceptibility to complex diseases? Am J Hum Genet 69:124–137
Rivas MA, Beaudoin M, Gardet A et al (2011) Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat Genet 43:1066–1073
Cruchaga C, Chakraverty S, Mayo K et al (2012) Rare variants in APP, PSEN1 and PSEN2 increase risk for AD in late-onset Alzheimer’s disease families. PLoS One 7:e31039
Haller G, Kapoor M, Budde J et al (2014) Rare missense variants in CHRNB3 and CHRNA3 are associated with risk of alcohol and cocaine dependence. Hum Mol Genet 23:810–819
Nejentsev S, Walker N, Riches D et al (2009) Rare variants of IFIH1, a gene implicated in antiviral responses, protect against type 1 diabetes. Science 324:387–389
Momozawa Y, Mni M, Nakamura K et al (2011) Resequencing of positional candidates identifies low frequency IL23R coding variants protecting against inflammatory bowel disease. Nat Genet 43:43–47
Emison ES, Garcia-Barcelo M, Grice EA et al (2010) Differential contributions of rare and common, coding and noncoding Ret mutations to multifactorial Hirschsprung disease liability. Am J Hum Genet 87:60–74
Guerreiro R, Wojtas A, Bras J et al (2013) TREM2 variants in Alzheimer’s disease. N Engl J Med 368:117–127
Jonsson T, Stefansson H, Steinberg S et al (2013) Variant of TREM2 associated with the risk of Alzheimer’s disease. N Engl J Med 368:107–116
Saunders AM, Strittmatter WJ, Schmechel D et al (1993) Association of apolipoprotein E allele epsilon 4 with late-onset familial and sporadic Alzheimer’s disease. Neurology 43:1467–1472
Cruchaga C, Karch CM, Jin SC et al (2014) Rare coding variants in Phospholipase D3 (PLD3) confer risk for Alzheimer’s disease. Nature 505:550–554
Jin SC, Pastor P, Cooper B et al (2012) Pooled-DNA sequencing identifies novel causative variants in PSEN1, GRN and MAPT in a clinical early-onset and familial Alzheimer’s disease Ibero-American cohort. Alzheimers Res Ther 4:34
Benitez BA, Karch CM, Cai Y et al (2013) The PSEN1, p.E318G variant increases the risk of Alzheimer’s disease in APOE-epsilon4 carriers. PLoS Genet 9:e1003685
Vallania FL, Druley TE, Ramos E et al (2010) High-throughput discovery of rare insertions and deletions in large cohorts. Genome Res 20:1711–1718
Vallania F, Ramos E, Cresci S et al (2012) Detection of rare genomic variants from pooled sequencing using SPLINTER. J Vis Exp (64). pii: 3943. doi: 10.3791/3943
Acknowledgements
This work was supported by grants from the National Institutes of Health (P30-NS069329, R01-AG044546, P01AG003991, and R01NS085419), the Alzheimer’s Association (NIRG-11-200110), and the Barnes Jewish Foundation. This research was conducted while C.C. was a recipient of a New Investigator Award in Alzheimer’s disease from the American Federation for Aging Research.
Copyright information
© 2016 Springer Science+Business Media New York
About this protocol
Cite this protocol
Jin, S.C., Benitez, B.A., Deming, Y., Cruchaga, C. (2016). Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer’s Disease. In: Castrillo, J., Oliver, S. (eds) Systems Biology of Alzheimer's Disease. Methods in Molecular Biology, vol 1303. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-2627-5_18
DOI: https://doi.org/10.1007/978-1-4939-2627-5_18
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-2626-8
Online ISBN: 978-1-4939-2627-5
eBook Packages: Springer Protocols