Abstract
When designing a case–control study, researchers must decide whether to gather and sequence or genotype controls in parallel with cases or whether to only sequence or genotype cases and to plan on using external, perhaps publically available control data. When using rare-variant genotyping chips, gathering and using internal controls may be the obvious choice as the cost per subject is relatively low although there is still the cost of recruiting or finding a suitable control set. There may be more reason to use external controls for sequencing due to its relatively high cost. Ultimately, the choice often comes down to a balance between available funds, resources, and the number and uniqueness of the cases available for sequencing or genotyping. In addition, other reasons may prompt researchers to use external controls. Researchers may choose to boost their study’s sample size and potentially the power by adding in external, publically available controls to an existing internal control set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abecasis GR, Auton A, Brooks LD et al (2012) An integrated map of genetic variation from 1,092 human genomes. Nature 491(7422):56–65. doi:10.1038/nature11632
Asimit JL, Day-Williams AG, Morris AP, Zeggini E (2012) ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data. Hum Hered 73(2):84–94. doi:10.1159/000336982
Berkson J (1978) In dispraise of exact test—do marginal totals of 2×2 table contain relevant information respecting table proportions. J Statist Plan Infer 2(1):27–42. doi:10.1016/0378-3758(78)90019-8
Chen H, Meigs JB, Dupuis J (2013) Sequence kernel association test for quantitative traits in family samples. Genetic Epidemiology 37(2):196–204
Cibulskis K, McKenna A, Fennell T, Banks E, DePristo M, Getz G (2011) ContEst: estimating cross-contamination of human samples in next-generation sequencing data. Bioinformatics 27(18):2601–2602. doi:10.1093/bioinformatics/btr446
DePristo MA, Banks E, Poplin R et al (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43(5):491–498. doi:10.1038/ng.806
Do R, Kathiresan S, Abecasis GR (2012) Exome sequencing and complex disease: practical aspects of rare variant association studies. Hum Mol Genet 21(R1):R1–R9. doi:10.1093/hmg/dds387
Farooqi S, O’Rahilly S (2006) Genetics of obesity in humans. Endocr Rev 27(7):710–718. doi:10.1210/er.2006-0040
Futema M, Plagnol V, Whittall RA et al (2012) Use of targeted exome sequencing as a diagnostic tool for Familial Hypercholesterolaemia. J Med Genet 49(10):644–649. doi:10.1136/jmedgenet-2012-101189
Garner C (2011) Confounded by sequencing depth in association studies of rare alleles. Genet Epidemiol 35(4):261–268. doi:10.1002/gepi.20574
Genomes Project Consortium, Abecasis GR, Auton A et al (2012) An integrated map of genetic variation from 1,092 human genomes. Nature 491(7422):56–65. doi:10.1038/nature11632
Goldstein JI, Crenshaw A, Carey J et al (2012) zCall: a rare variant caller for array-based genotyping—genetics and population analysis. Bioinformatics 28(19):2543–2545. doi:10.1093/bioinformatics/bts479
Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 6(2):95–108. doi:10.1038/nrg1521
International HapMap Consortium, Altshuler DM, Gibbs RA et al (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467(7311):52–58. doi:10.1038/nature09298
Jun G, Flickinger M, Hetrick KN et al (2012) Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. Am J Hum Genet 91(5):839–848. doi:10.1016/j.ajhg.2012.09.004
Larntz K (1978) Small-sample comparisons of exact levels for chi-squared goodness-of-fit statistics. J Am Stat Assoc 73(362):253–63. doi:10.2307/2286650
Li L, Li Y, Browning SR et al (2011) Performance of genotype imputation for rare variants identified in exons and flanking regions of genes. PLoS One 6(9):e24945. doi:10.1371/journal.pone.0024945
Mathieson I, McVean G (2012) Differential confounding of rare and common variants in spatially structured populations. Nat Genet 44(3):243–246. doi:10.1038/ng.1074
Pasaniuc B, Rohland N, McLaren PJ et al (2012) Extremely low-coverage sequencing and imputation increases power for genome-wide association studies. Nat Genet 44(6):631–635. doi:10.1038/ng.2283
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38(8):904–909. doi:10.1038/ng1847
Purcell S, Neale B, Todd-Brown K et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81(3):559–575. doi:10.1086/519795
Tennessen JA, Bigham AW, O’Connor TD et al (2012) Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 337(6090):64–69. doi:10.1126/science.1219240
Turner S, Armstrong LL, Bradford Y, et al (2011) Quality control procedures for genome-wide association studies. Curr Protoc Hum Genet Chapter 1:Unit1.19. doi:10.1002/0471142905.hg0119s68
Ziegler A, König IR (2010) A statistical approach to genetic epidemiology: concepts and applications. Wiley-VCH, Germany
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media New York
About this chapter
Cite this chapter
Hendricks, A.E. (2015). Use of Appropriate Controls in Rare-Variant Studies. In: Zeggini, E., Morris, A. (eds) Assessing Rare Variation in Complex Traits. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2824-8_17
Download citation
DOI: https://doi.org/10.1007/978-1-4939-2824-8_17
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-2823-1
Online ISBN: 978-1-4939-2824-8
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)