Abstract
We present a regression-based method of haplotype association analysis for quantitative and dichotomous traits in samples consisting of unrelated individuals. The method takes account of uncertain phase by initially estimating haplotype frequencies and obtaining the posterior probabilities of all possible haplotype combinations in each individual, then using these as weights in a finite mixture of regression models. Using this method, different combinations of marker loci can be modeled, to find a parsimonious set of marker loci that are most predictive and therefore most likely to be closely associated with the a quantitative trait locus. The method has the additional advantage of being able to use individuals with some missing genotype data, by considering all possible genotypes at the missing markers. We have implemented this method using the SNPHAP and Mx programs and illustrated its use on published data on idiopathic generalized epilepsy.
Similar content being viewed by others
REFERENCES
Abecasis, G. R., Noguchi, E., Heinzmann, A., Traherne, J. A., Bhattacharyya, S., Leaves, N. I., Anderson, G. G., Zhang, Y., Lench, N. J., Carey, A., Cardon, L. R., Moffatt, M. F., and Cookson, W. O. (2001). Extent and distribution of linkage disequilibrium in three genomic regions. Am. J. Hum. Genet. 68:191-197.
Akey, J., Jin, L., and Xiong, M. (2001). Haplotypes vs single marker linkage disequilibrium tests: What do we gain? Eur. J. Hum. Genet. 9:291-300.
Bader, J. S. (2001). The relative power of SNPs and haplotypes as genetic markers for association tests. Pharmacogenomics 2:11-24.
Chioza, B., Osei-Lah, A., Nashef, L., Suarez-Merino, B., Wilkie, H., Sham, P., Knight, J., Asherson, P., and Makoff, A. (2002). Haplotype and linkage disequilibrium analysis to characterise a region in the calcium channel gene CACNA1A associated with idiopathic generalised epilepsy. Eur. J. Hum. Genet. 10:857-864.
Cordell, H. J., and Clayton, D. G. (2002). A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: Application to HLA in type 1 diabetes. Am. J. Hum. Genet. 70:124-141.
Daly, M. J., Rioux, J. D., Schaffner, S. F., Hudson, T. J., and Lander, E. S. (2001). High-resolution haplotype structure in the human genome. Nat. Genet. 29:229-232.
Davidson, S. (2000). Research suggests importance of haplotypes over SNPs. Nat. Biotechnol. 18:1134-1135.
Dawson, E., Abecasis, G. R., Bumpstead, S., Chen, Y., Hunt, S., Beare, D. M., Pabial, J., Dibling, T., Tinsley, E., Kirby, S., Carter, D., Papaspyridonos, M., Livingstone, S., Ganske, R., Lohmussaar, E., Zernant, J., Tonisson, N., Remm, M., Magi, R., Puurand, T., Vilo, J., Kurg, A., Rice, K., Deloukas, P., Mott, R., Metspalu, A., Bentley, D. R., Cardon, L. R., and Dunham, I. (2002). A first-generation linkage disequilibrium map of human chromosome 22. Nature 418:544-548.
Fallin, D., Cohen, A., Essioux, L., Chumakov, I., Blumenfeld, M., Cohen, D., and Schork, N. J. (2001). Genetic analysis of case/control data using estimated haplotype frequencies: Application to APOE locus variation and Alzheimer's disease. Genome Res. 1:143-151.
Gabriel, S. B., Schaffner, S. F., Nguyen, H., Moore, J. M., Roy, J., Blumenstiel, B., Higgins, J., DeFelice, M., Lochner, A., Faggart, M., Liu-Cordero, S. N., Rotimi, C., Adeyemo, A., Cooper, R., Ward, R., Lander, E., Daly, M. J., and Altshuler, D. (2002). The structure of haplotype blocks in the human genome. Science 296:2225-2229.
International SNP Map Working Group. (2001). A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 409:928-933.
Johnson, G., Esposito, L., Barratt, B., Smith, A., Heward, J., DiGenova, G., Ueda, H., Cordell, H., Eaves, I., Dudbridge, F., Twells, R., Payne, F., Hughes, W., Nutland, S., Stevens, H., Carr, P., Tuomilehto-Wolf, E., Tuomilehto, J., Gough, S., Clayton, D., and Todd, J. (2001). Haplotype tagging for the identification of common disease genes. Nat. Genet. 29:233-237.
Long, A. D., and Langley, C. H. (1999). The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits. Genome Res. 9:720-731.
Neale, M. C. (1999). Mx: Statistical modeling. Richmond, VA: Department of Psychiatry, Virgina Commonwealth University.
Neale, M. C. (2000). QTL Mapping with sib-pairs: The flexibility of Mx. In Spector, T. D., Snieder, H., and MacGregor, A. J. (eds). Advances in twin and sib-pair analysis. London: Oxford University Press.
Patil, N., Berno, A. J., Hinds, D. A., Barrett, W. A., Doshi, J. M., Hacker, C. R., Kautzer, C. R., Lee, D. H., Marjoribanks, C., McDonough, D. P., Nguyen, B. T. N., Norris, M. C., Sheehan, J. B., Shen, N., Stern, D., Stokowski, R. P., Thomas, D. J., Trulson, M. O., Vyas, K. R., Frazer, K. A., Fodor, S. P. A., and Cox, D. R. (2001). Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science 294:1719-1723.
Phillips, M. S., Lawrence, R., Sachidanandam, R., Morris, A. P., Balding, D. J., Donaldson, M. A., Studebaker, J. F., Ankener, W. M., Alfisi, S. V., Kuo, F. S., Camisa, A. L., Pazorov, V., Scott, K. E., Carey, B. J., Faith, J., Katari, G., Bhatti, H. A., Cyr, J. M., Derohannessian, V., Elosua, C., Forman, A. M., Grecco, N. M., Hock, C. R., Kuebler, J. M., Lathrop, J. A., Mockler, M. A., Nachtman, E. P., Restine, S. L., Varde, S. A., Hozza, M. J., Gelfand, C. A., Broxholme, J., Abecasis, G. R., Boyce-Jacino, M. T., and Cardon, L. R. (2003). Chromosomewide distribution of haplotype blocks and the role of recombination hot spots. Nat. Genet. 33:382-387.
Schaid, D. J., Rowland, C. M., Tines, D. E., Jacobson, R. M., and Poland, G. A. (2002). Score tests for association between traits and haplotypes when linkage phase is ambiguous. Am. J. Hum. Genet. 70:425-434.
Seltman, H., Roeder, K., and Devlin, B. (2003). Evolutionary-based association analysis using haplotype data. Genet. Epidemiol. 25:48-58.
Syvanen, A. C. (2001). Accessing genetic variation: Genotyping single nucleotide polymorphisms. Nat. Rev. Genet. 2:930-942.
Tanck, M. W. T., Klerkx, A. H. E. M., Jukema, J. W., De Knijff, P., Kastelein, J. J. P., and Zwinderman, A. H. (2003). Estimation of multilocus haplotype effects using weighted penalised loglikelihood: Analysis of five sequence variations at the cholesteryl ester transfer protein gene locus. Ann. Hum. Genet. 67:175-184.
Zaykin, D. V., Westfall, P. H., Young, S. S., Karnoub, M. A., Wagner, M. J., and Ehm, M. G. (2002). Testing association of statistically inferred haplotypes with discrete and continuous traits in samples of unrelated Individuals. Hum. Hered. 53:79-91.
Zhao, J. H., Curtis, D., and Sham, P. C. (2000). Model-free analysis and permutation tests for allelic association. Hum. Hered. 50:133-139.
Zhao, J. H., Lissarrague, S., Essioux, L., and Sham, P. C. (2002). Gene-counting for haplotype analysis with missing genotypes. Bioinformatics 18:1694-1695.
Zhao, J. H., and Sham, P. C. (2002). Faster haplotype frequency estimation using unrelated subjects. Hum. Hered. 53:36-41.
Zhao, L. P., Li, S. S., and Khalid, N. (2003). A method for the assessment of disease associations with single-nucleotide polymorphism haplotypes and environmental variables in casecontrol studies. Am. J. Hum. Genet. 72:1231-1250.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Sham, P.C., Rijsdijk, F.V., Knight, J. et al. Haplotype Association Analysis of Discrete and Continuous Traits Using Mixture of Regression Models. Behav Genet 34, 207–214 (2004). https://doi.org/10.1023/B:BEGE.0000013734.39266.a3
Issue Date:
DOI: https://doi.org/10.1023/B:BEGE.0000013734.39266.a3