Abstract
Genomic deletions have long been known to play a causative role in microdeletion syndromes. Recent whole-genome genetic studies have shown that deletions can increase the risk for several psychiatric disorders, suggesting that genomic deletions play an important role in the genetic basis of complex traits. However, the association between genomic deletions and common, complex diseases has not yet been systematically investigated in gene mapping studies. Likelihood-based statistical methods for identifying disease-associated deletions have recently been developed for familial studies of parent-offspring trios. The purpose of this study is to develop statistical approaches for detecting genomic deletions associated with complex disease in case–control studies. Our methods are designed to be used with dense single nucleotide polymorphism (SNP) genotypes to detect deletions in large-scale or whole-genome genetic studies. As more and more SNP genotype data for genome-wide association studies become available, development of sophisticated statistical approaches will be needed that use these data. Our proposed statistical methods are designed to be used in SNP-by-SNP analyses and in cluster analyses based on combined evidence from multiple SNPs. We found that these methods are useful for detecting disease-associated deletions and are robust in the presence of linkage disequilibrium using simulated SNP data sets. Furthermore, we applied the proposed statistical methods to SNP genotype data of chromosome 6p for 868 rheumatoid arthritis patients and 1,197 controls from the North American Rheumatoid Arthritis Consortium. We detected disease-associated deletions within the region of human leukocyte antigen in which genomic deletions were previously discovered in rheumatoid arthritis patients.
Similar content being viewed by others
References
Amos CI, Shete S, Chen J, Yu RK (2003) Positional identification of microdeletions with genetic markers. Hum Hered 56:107–118
Beck S, Trowsdale J (1999) Sequence organization of the class II region of human MHC. Immunol Rev 167:201–210
Conrad DF, Andrews TD, Carter NP, Hurles ME, Pritchard JK (2006) A high-resolution survey of deletion polymorphism in the human genome. Nat Genet 38(1):75–81
Cornelis F, Faure S, Martinez M, Prud’hommer JF, Fritz P, Dib C, Alves H, Barrera P, de Vries N et al (1998) New susceptibility locus for rheumatoid arthritis suggested by a genome-wide linkage study. Proc Natl Acad Sci USA 95:10746–10750
Franke L, de Kovel CG, Aulchenko YS, Trynka G, Zhernakova A, Hunt KA, Blauw HM, van den Berg LH, Ophoff R, Deloukas P et al (2008) Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. Am J Hum Genet 82(6):1316–1333
Freeman JL, Perry GH, Feuk L, Redon R, McCarroll SA, Altshuler DM, Aburatani H, Jones KW, Tyler-Smith C, Hurles ME et al (2006) Copy number variation: new insights in genome diversity. Genome Res 16:949–961
Gail MH, Pfeiffer RM, Wheeler W, Pee D (2008) Probability of detecting disease-associated single nucleotide polymorphisms in case–control genome-wide association studies. Biostatistics 9:201–215
Grimson RC (1993) Disease clusters, exact distributions of maxima, and p-values. Stat Med 12:1773–1794
Grimson RC, Mendelsohn S (2000) A method for detecting current temporal clusters of toxic events through data monitoring by poison control center. J Toxicol Clin Toxicol 38(7):761–765
Hinds DA, Kloek AP, Jen M, Chen X, Frazer KA (2006) Common deletions and SNPs are in linkage disequilibrium in the human genome. Nat Genet 38(1):82–85
Hoggart CJ, Chadeau-Hyam M, Clark TG, Lampariello R, Whittaker JC, De Iorio M, Balding DJ (2007) Sequence-level population simulations over large genomic regions. Genetics 177(3):1725–1731
Jawaheer D, Seldin MF, Amos CI, Chen WV, Shigeta R, Monteiro J, Kern M, Criswell LA, Albani S, Nelson JL et al (2001) A genomewide screen in multiplex rheumatoid arthritis families suggests genetic overlap with other autoimmune diseases. Am J Hum Genet 68:927–936
Jawaheer D, Seldin MF, Amos CI, Chen WV, Shigeta R, Etzel C, Damle A, Xiao X, Chen D, Lum RF et al (2003) Screening the genome for rheumatoid arthritis susceptibility genes: a replication study and combined analysis of 512 multicase families. Arthritis Rheum 48:906–916
Kohler JR, Cutler DJ (2007) Simultaneous discovery and testing of deletions for disease association in SNP genotyping studies. Am J Hum Genet 81(4):684–699
Lindsay EA (2001) Chromosomal microdeletions: dissecting del22q11 syndrome. Nat Rev Genet 2(11):858–868
Liu H, Abecasis GR, Heath SC, Knowles A, Demars S, Chen YJ, Roos JL, Rapoport JL, Gogos JA, Karayiorgou M (2002) Genetic variation in the 22q11 locus and susceptibility to schizophrenia. Proc Natl Acad Sci USA 99(26):16859–16864
MacKay K, Eyre S, Myerscough A, Milicic A, Barton A, Laval S, Barrett J, Lee D, White S, John S et al (2002) Whole-genome linkage analysis of rheumatoid arthritis susceptibility loci in 252 affected sibling pairs in the United Kingdom. Arthritis Rheum 46:632–639
McCarroll SA, Hadnott TN, Perry GH, Sabeti PC, Zody MC, Barrett JC, Dallaire S, Gabriel SB, Lee C, Daly MJ et al (2006) International HapMap Consortium. Common deletion polymorphisms in the human genome. Nat Genet 38(1):86–92
McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, Donnelly P (2004) The fine-scale structure of recombination rate variation in the human genome. Science 23;304(5670):581–584
Peng B, Amos CI, Kimmel M (2007) Forward-time simulations of human populations with complex diseases. PLoS Genet 23;3(3):e47
Plenge RM, Seielstad M, Padyukov L, Lee AT, Remmers EF, Ding B, Liew A, Khalili H, Chandrasekaran A, Davies LR et al (2007) TRAF1-C5 as a risk locus for rheumatoid arthritis—a genomewide study. N Engl J Med 357:1199–1209
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W et al (2006) Global variation in copy number in the human genome. Nature 444:444–454
Rovelet-Lecrux A, Hannequin D, Raux G, Le Meur N, Laquerrière A, Vital A, Dumanchin C, Feuillette S, Brice A, Vercelletto M et al (2006) APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy. Nat Genet 38(1):24–26
Schaffner SF, Foo C, Gabriel S, Reich D, Daly MJ, Altshuler D (2005) Calibrating a coalescent simulation of human genome sequence variation. Genome Res 15(11):1576–1583
Sebat J, Lakshmi B, Malhotra D, Troge J, Lese-Martin C, Walsh T, Yamrom B, Yoon S, Krasnitz A, Kendall J et al (2007) Strong association of de novo copy number mutations with autism. Science 316:445–449
Seldin MF, Amos CI, Ward R, Gregersen PK (1999) The genetics revolution and the assault on rheumatoid arthritis. Arthritis Rheum 42:1071–1079
Shifman S, Levit A, Chen ML, Chen CH, Bronstein M, Weizman A, Yakir B, Navon R, Darvasi A (2006) A complete genetic association scan of the 22q11 deletion region and functional evidence reveal an association between DGCR2 and schizophrenia. Hum Genet 120(2):160–170
Spies T, Sorrentino R, Boss JM, Okada K, Strominger JL (1985) Structural organization of the DR subregion of the human major histocompatibility complex. Proc Natl Acad Sci USA 82:5165–5169
Stefansson H, Rujescu D, Cichon S, Pietiläinen OP, Ingason A, Steinberg S, Fossdal R, Sigurdsson E, Sigmundsson T, Buizer-Voskamp JE et al (2008) Large recurrent microdeletions associated with schizophrenia. Nature 455:232–236
Stranger BE, Forrest MS, Dunning M, Ingle CE, Beazley C, Thorne N, Redon R, Bird CP, de Grassi A, Lee C et al (2007) Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science 315:848–853
Tuzun E, Sharp AJ, Bailey JA, Kaul R, Morrison VA, Pertz LM, Haugen E, Hayden H, Albertson D, Pinkel D, Olson MV, Eichler EE (2005) Fine-scale structural variation of the human genome. Nat Genet 37:727–732
Wall JD, Przeworski M (2000) When did the human population size start increasing? Genetics 155(4):1865–1874
Wallenstein S, Neff N (1987) An approximation for the distribution of the scan statistic. Stat Med 6(2):197–207
Wang TL, Maierhofer C, Speicher MR, Lengauer C, Vogelstein B, Kinzler KW, Velculescu VE (2002) Digital karyotyping. Proc Natl Acad Sci USA 99:16156–16161
Weiss LA, Shen Y, Korn JM, Arking DE, Miller DT, Fossdal R, Saemundsen E, Stefansson H, Ferreira MA, Green T et al (2008) Association between microdeletion and microduplication at 16p11.2 and autism. N Engl J Med 358(7):667–675
Wong KK, de Leeuw RJ, Dosanjh NS, Kimm LR, Cheng Z, Horsman DE, MacAulay C, Ng RT, Brown CJ, Eichler EE et al (2007) A comprehensive analysis of common copy-number variations in the human genome. Am J Hum Genet 80(1):91–104
Wu CC, Grimson RC, Amos CI, Shete S (2008) Statistical methods for anomalous discrete time series based on minimum cell count. Biom J 50(1):86–96
Yu CE, Dawson G, Munson J, D’Souza I, Osterling J, Estes A, Leutenegger AL, Flodman P, Smith M, Raskind WH et al (2002) Presence of large deletions in kindreds with autism. Am J Hum Genet 71(1):100–115
Acknowledgments
This research was supported by the US National Cancer Institute grants 2P01-CA034936 and 1R03-CA128103.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wu, CC., Shete, S., Chen, W.V. et al. Detection of disease-associated deletions in case–control studies using SNP genotypes with application to rheumatoid arthritis. Hum Genet 126, 303–315 (2009). https://doi.org/10.1007/s00439-009-0672-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-009-0672-3