More than 6 million single nucleotide polymorphisms (SNPs) in the human genome have been genotyped by the HapMap project. Although only a proportion of these SNPs are functional, all can be considered candidate markers for indirect association studies that aim to detect disease-related genetic variants. Completely screening a gene or a chromosomal region is nevertheless an expensive undertaking for association studies. A key strategy for improving the efficiency of such studies is to select a subset of informative SNPs, called tag SNPs, for analysis. In this chapter, hierarchical clustering algorithms are proposed for efficient tag SNP selection.
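To make the idea of clustering-based tag SNP selection concrete, the following is a minimal sketch and not the chapter's CLUSTAG or WCLUSTAG implementation. It assumes that pairwise linkage disequilibrium is already available as a symmetric matrix of r² values, that SNPs are merged by complete linkage (two clusters join only if every cross-pair reaches the threshold), and that the tag of each cluster is the member whose weakest r² with the other members is highest. The function name `select_tag_snps`, the threshold of 0.8, and the toy matrix are all illustrative assumptions.

```python
from itertools import combinations


def select_tag_snps(r2, threshold=0.8):
    """Group SNPs by complete-linkage clustering on pairwise r^2 and
    pick one tag SNP per cluster (hypothetical helper, illustration only).

    r2 is a symmetric list of lists with 1.0 on the diagonal.
    """
    clusters = [[i] for i in range(len(r2))]

    def weakest_link(a, b):
        # Complete linkage: similarity of two clusters is their lowest pairwise r^2.
        return min(r2[i][j] for i in a for j in b)

    while len(clusters) > 1:
        # Find the pair of clusters with the strongest complete-linkage similarity.
        x, y = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: weakest_link(clusters[p[0]], clusters[p[1]]),
        )
        if weakest_link(clusters[x], clusters[y]) < threshold:
            break  # no remaining pair can be merged without violating the threshold
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]  # safe because x < y

    # Tag = member whose lowest r^2 with any cluster member is highest.
    tags = [
        max(members, key=lambda s: min(r2[s][t] for t in members))
        for members in clusters
    ]
    return clusters, tags


# Toy r^2 matrix for five SNPs (made-up values, not real HapMap data).
example_r2 = [
    [1.00, 0.90, 0.85, 0.10, 0.20],
    [0.90, 1.00, 0.95, 0.10, 0.10],
    [0.85, 0.95, 1.00, 0.20, 0.10],
    [0.10, 0.10, 0.20, 1.00, 0.90],
    [0.20, 0.10, 0.10, 0.90, 1.00],
]
example_clusters, example_tags = select_tag_snps(example_r2, threshold=0.8)
print(example_clusters, example_tags)
```

Under these assumptions every SNP in a cluster has r² of at least the threshold with its tag, so genotyping only the tags (two SNPs instead of five in the toy example) still captures the untyped SNPs indirectly. The brute-force search over cluster pairs is adequate for a sketch but would need a more efficient linkage routine for region-scale data.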
Copyright information
© 2009 Springer Science+Business Media B.V
About this chapter
Cite this chapter
Ao, SI. (2009). CLUSTAG & WCLUSTAG: Hierarchical Clustering Algorithms for Efficient Tag-SNP Selection. In: Ao, SI., Rieger, B., Chen, SS. (eds) Advances in Computational Algorithms and Data Analysis. Lecture Notes in Electrical Engineering, vol 14. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-8919-0_2
DOI: https://doi.org/10.1007/978-1-4020-8919-0_2
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-8918-3
Online ISBN: 978-1-4020-8919-0
eBook Packages: Computer Science, Computer Science (R0)