Abstract
An efficient rule-based algorithm is presented for haplotype inference from general pedigree genotype data, under the assumption of no recombination. This algorithm generalizes previous algorithms to handle cases where some pedigree founders are not genotyped, provided that in each nuclear family at least one parent is genotyped and that each non-genotyped founder appears in exactly one nuclear family. This generalization matters because such cases arise frequently in real data: some founders may have passed away, so their genotype data can no longer be collected. The algorithm runs in O(m³n³) time, where m is the number of single nucleotide polymorphism (SNP) loci under consideration and n is the number of genotyped members in the pedigree. This zero-recombination haplotyping algorithm is then extended to a maximum parsimony haplotyping algorithm that, in one whole genome scan, minimizes the total number of breakpoint sites or, equivalently, the number of maximal zero-recombination chromosomal regions. We show that this whole genome scan haplotyping algorithm can be implemented in O(m³n³) time in a novel incremental fashion, where m now denotes the total number of SNP loci along the chromosome.
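To illustrate the equivalence between minimizing breakpoint sites and covering the chromosome with maximal zero-recombination regions, here is a minimal Python sketch. It is not the rule-based algorithm of the paper: the feasibility test is a brute-force check on a single nuclear family with both parents genotyped, exponential in the number of heterozygous loci, and the genotype encoding (0 = homozygous 0/0, 1 = homozygous 1/1, 2 = heterozygous) and all function names are assumptions made for this example. The left-to-right scan relies on the fact that any sub-interval of a zero-recombination region is itself a zero-recombination region, so extending each region as far as possible yields the fewest regions.

```python
from itertools import product

def phasings(genotype):
    """Yield every (hap1, hap2) pair consistent with a genotype.
    Assumed encoding: 0 = 0/0, 1 = 1/1, 2 = heterozygous."""
    het = [i for i, g in enumerate(genotype) if g == 2]
    for bits in product((0, 1), repeat=len(het)):
        h1, h2 = list(genotype), list(genotype)
        for i, b in zip(het, bits):
            h1[i], h2[i] = b, 1 - b
        yield tuple(h1), tuple(h2)

def combine(h1, h2):
    """Genotype produced by pairing two haplotypes."""
    return tuple(a if a == b else 2 for a, b in zip(h1, h2))

def zero_recomb_feasible(father, mother, children, start, end):
    """Brute-force test (hypothetical helper): can loci [start, end) be
    phased so that every child inherits one unbroken haplotype per parent?"""
    kids = [tuple(c[start:end]) for c in children]
    for f_haps in phasings(father[start:end]):
        for m_haps in phasings(mother[start:end]):
            possible = {combine(a, b) for a in f_haps for b in m_haps}
            if all(k in possible for k in kids):
                return True
    return False

def greedy_regions(num_loci, feasible):
    """Scan left to right, extending each region while it stays feasible.
    Assumes every single locus is feasible on its own (Mendelian-consistent)."""
    regions, start = [], 0
    while start < num_loci:
        end = start + 1
        while end < num_loci and feasible(start, end + 1):
            end += 1
        regions.append((start, end))
        start = end
    return regions

# Toy family: the third child needs a paternal breakpoint before the last locus.
father = (2, 2, 2)
mother = (0, 0, 0)
children = [(2, 2, 0), (0, 0, 2), (2, 2, 2)]
regions = greedy_regions(
    3, lambda s, e: zero_recomb_feasible(father, mother, children, s, e)
)
print(regions)               # half-open locus intervals
print(len(regions) - 1)      # number of breakpoint sites
```

In the toy family the first two children fix the father's haplotypes over all three loci, and the third child is consistent with them only if the scan is split once, so the sketch reports two regions and one breakpoint site.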
Additional information
This research is supported in part by AARI, AICML, ALIDF, iCORE, and NSERC.
Cite this article
Cheng, Y., Sabaa, H., Cai, Z. et al. Efficient haplotype inference algorithms in one whole genome scan for pedigree data with non-genotyped founders. Acta Math. Appl. Sin. Engl. Ser. 25, 477–488 (2009). https://doi.org/10.1007/s10255-008-8821-3
DOI: https://doi.org/10.1007/s10255-008-8821-3