Summary
Whole-genome association studies of complex diseases hold great promise to identify systematically genetic loci that influence one’s risk of developing these diseases. However, the polygenic nature of the complex diseases and genetic interactions among the genes pose significant challenge in both experimental design and data analysis. High-density genotype data make it possible to identify most of the genetic loci that may be involved in the etiology. On the other hand, utilizing large number of statistic tests could lead to false positives if the tests are not adequately adjusted. In this paper, we discuss a two-stage method that sequentially applies a generalized linear model (GLM) and principal components analysis (PCA) to identify genetic loci that jointly determine the likelihood of developing disease. The method was applied to a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included 50 ESCC patients and 50 neighborhood-matched controls. Genotype data were determined by using the Affymetrix 10K SNP chip. We will discuss some of the special considerations that are important to the proper interpretation of whole genome-wide association studies, which include multiple comparisons, epistatic interaction among multiple genetic loci, and generalization of predictive models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Abbreviations
- ESCC:
-
esophageal squamous cell carcinoma
- GLM:
-
generalized liner model;
- HWE:
-
Hardy Weinberg equilibrium
- LD:
-
lonkage disequilibrium
- PCA:
-
principal components analysis;
- PCI:
-
first principal component
- SNP:
-
single nucleotide polymorphism
Summary
Botstein, D., and Risch, N. (2003) Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat. Genet. 33(Suppl), 228–237.
Miller, M. P., and Kumar, S. (2001) Understanding human disease mutations through the use of interspecific genetic variation. Hum. Mol. Genet. 10, 2319–2328.
Lander, E. S. (1996) The new genomics: global views of biology. Science 274, 536–539.
Risch, N., and Merikangas, K. (1996) The future of genetic studies of complex human diseases. Science 273, 1516–1517.
Risch, N. J. (2000) Searching for genetic determinants in the new millennium. Nature 405, 847–856.
Campbell, D. B., Sutcliffe, J. S., Ebert, P. J., Militerni, R., Bravaccio, C., Trillo, S., et al. (2006) From the cover: a genetic variant that disrupts MET transcription is associated with autism. Proc. Natl. Acad. Sci. U S A 103, 16834–16839.
Dewan, A., Liu, M., Hartman, S., Zhang, S., Liu, D. T., Zhao, C., et al. (2006) HTRA1 promoter polymorphism in wet age-related macular degeneration. Science 314, 989–992.
Yang, Z., Camp, N. J., Sun, H., Tong, Z., Gibbs, D., Cameron, J., et al. (2006) A Variant of the HTRA1 gene increases susceptibility to age-related macular degeneration. Science 314, 992–993.
International HapMap Consortium. (2003) The international HapMap project. Nature 426, 789–796.
International HapMap Consortium. (2005) A haplotype map of the human genome. Nature 437, 1299–1320.
Carlson, C. S., Eberle, M. A., Rieder, M. J., Yi, Q., Kruglyak, L., and Nickerson, L. (2004) Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am. J. Hum. Genet. 74, 106–120.
Li, J. Y. (1982) Epidemiology of esophageal cancer in China. Natl. Cancer Inst. Monogr. 62, 113–120.
Hu, N., Wang, C., Hu, Y., Yang, H. H., Giffen, C., Tang, Z. Z., et al. (2005) Genome-wide association study in esophageal cancer using GeneChip mapping 10K array. Cancer Res. 65, 2542–2546.
Matsuzaki, H., Loi, H., Dong, S., Tsai, Y. Y., Fang, J., Law, J., et al. (2004) Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array. Genome Res. 14, 414–425.
Matsuzaki, H., Dong, S., Loi, H., Di, X., Liu, G., Hubbell, E., et al. (2004) Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. Nat. Methods 1, 109–111.
Fan, J. B., Gunderson, K. L., Bibikova, M., Yeakley, J. M., Chen, J., Wickham, G. E., et al. (2006) Illumina universal bead arrays. Methods Enzymol. 410, 57–73.
Oliphant, A., Barker, D. L., Stuelpnagel, J. R., and Chee, M. S. (2002) BeadArray technology: enabling an accurate, cost-effective approach to high-throughput genotyping. Biotechniques Suppl., 56–58, see also pages 60–51.
Elston, R. C., Guo, X., and Williams, L. V. (1996) Two-stage global search designs for linkage analysis using pairs of affected relatives. Genet. Epidemiol. 13, 535–558.
Skol, A. D., Scott, L. J., Abecasis, G. R., and Boehnke, M. (2006) Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nat. Genet. 38, 209–213.
Marchini, J., Donnelly, P., and Cardon, L. R. (2005) Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat. Genet. 37, 413–417.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Yang, H.H., Hu, N., Taylor, P.R., Lee, M.P. (2008). Whole Genome-Wide Association Study Using Affymetrix SNP Chip: A Two-Stage Sequential Selection Method to Identify Genes That Increase the Risk of Developing Complex Diseases. In: Trent, R.J. (eds) Clinical Bioinformatics. Methods in Molecular Medicine™, vol 141. Humana Press. https://doi.org/10.1007/978-1-60327-148-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-60327-148-6_2
Publisher Name: Humana Press
Print ISBN: 978-1-58829-791-4
Online ISBN: 978-1-60327-148-6
eBook Packages: Springer Protocols