Skip to main content
Log in

Efficient haplotype inference algorithms in one whole genome scan for pedigree data with non-genotyped founders

  • Published:
Acta Mathematicae Applicatae Sinica, English Series Aims and scope Submit manuscript

Abstract

An efficient rule-based algorithm is presented for haplotype inference from general pedigree genotype data, with the assumption of no recombination. This algorithm generalizes previous algorithms to handle the cases where some pedigree founders are not genotyped, provided that for each nuclear family at least one parent is genotyped and each non-genotyped founder appears in exactly one nuclear family. The importance of this generalization lies in that such cases frequently happen in real data, because some founders may have passed away and their genotype data can no longer be collected. The algorithm runs in O(m 3 n 3) time, where m is the number of single nucleotide polymorphism (SNP) loci under consideration and n is the number of genotyped members in the pedigree. This zero-recombination haplotyping algorithm is extended to a maximum parsimoniously haplotyping algorithm in one whole genome scan to minimize the total number of breakpoint sites, or equivalently, the number of maximal zero-recombination chromosomal regions. We show that such a whole genome scan haplotyping algorithm can be implemented in O(m 3 n 3) time in a novel incremental fashion, here m denotes the total number of SNP loci along the chromosome.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Altshuler, D., Daly, M.J., Lander, E.S. Genetic mapping in human disease. Science, 322: 881–888 (2008)

    Article  Google Scholar 

  2. Chan, M.Y., Chan, W., Chin, F., Fung, S., Kao, M. Linear-time haplotype inference on pedigrees without recombinations. In: Proceedings of the 6th Annual Workshop on Algorithms in Bioinformatics (WABI’06), 2006, 56–67

  3. Du, F.X., Woodward, B.W., Denise, S.K. Haplotype construction of sires with progeny genotypes based on an exact likelihood. Journal of Dairy Sciences, 81: 1462–1468 (1998)

    Article  Google Scholar 

  4. Excoffer, L., Slatkin, M. Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Molecular Biology and Evolution, 12: 921–927 (1995)

    Google Scholar 

  5. Gusfield, D. Inference of haplotypes from samples of diploid populations: complexity and algorithms. Journal of Computational Biology, 8: 305–323 (2001)

    Article  Google Scholar 

  6. Hawley, M.E., Kidd, K.K. HAPLO: a program using the EM algorithm to estimate the frequencies of multi-site haplotypes. Journal of Heredity, 86: 409–411 (1995)

    Google Scholar 

  7. Kruglyak, L., Daly, M.J., Reeve-Daly, M.P., Lander, E.S. Parametric and nonparametric linkage analysis: a unified multipoint approach. American Journal of Human Genetics, 58: 1347–1363 (1996)

    Google Scholar 

  8. Lander, E.S., Green, P. Construction of multilocus genetic linkage maps in human. Proceedings of National Academy of Sciences of USA, 84: 2363–2367 (1987)

    Article  Google Scholar 

  9. Li, J., Jiang, T. Efficient rule-based haplotyping algorithms for pedigree data. In: Proceedings of the 7th Annual Conference on Research in Computational Molecular Biology (RECOMB’03), 2003, 197–206

  10. Lin, G., Wang, Z., Wang, L., Lau, Y.L., Yang, W. Identification of linked regions using high-density SNP genotype data for linkage analyses. Bioinformatics, 24: 86–93 (2008)

    Article  Google Scholar 

  11. Lin, S., Cutler, D.J., Zwick, M.E., Chakravarti, A. Haplotype inference in random population samples. American Journal of Human Genetics, 71: 1129–1137 (2002)

    Article  Google Scholar 

  12. Liu, L., Jiang, T. A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on pedigrees without mating loops. Journal of Combinatorial Optimization, 2009. Online first.

  13. Long, J.C., Williams, R.C., Urbanek, M. An E-M algorithm and testing strategy for multiple-locus haplotypes. American Journal of Human Genetics, 56: 799–810 (1995)

    Google Scholar 

  14. Niu, T., Qin, Z.S., Xu, X., Liu, J.S. Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. American Journal of Human Genetics, 70: 157–169 (2002)

    Article  Google Scholar 

  15. Qin, Z., Niu, T., Liu, J. Partitioning-ligation-expectation maximization algorithm for haplotype inference with single nucleotide polymorphisms. American Journal of Human Genetics, 71: 1242–1247 (2002)

    Article  Google Scholar 

  16. Sobel, E., Lange, K., O’Connell, J.R., Weeks, D.E. Haplotype algorithms. In T.P. Speed and M.S. Waterman, editors. Genetic Mapping and DNA Sequencing. IMA Volumes in Mathematics and Its Applications, P. 89–110. Springer, New York, 1995

    Google Scholar 

  17. Stephens, M., Smith, N., Donnelly, P. A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics, 68: 978–989 (2001)

    Article  Google Scholar 

  18. Weeks, D.E., Sobel, E., O’Connell, J.R., Lange, K. Computer programs for multilocus haplotyping of general pedigrees. American Journal of Human Genetics, 56: 1506–1507 (1995)

    Google Scholar 

  19. Xiao, J., Liu, L., Xia, L., Jiang, T. Fast elimination of redundant linear equations and reconstruction of recombination-free Mendelian inheritance on a pedigree. In: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’07), 2007, 655–664

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guohui Lin.

Additional information

This research is supported in part by AARI, AICML, ALIDF, iCORE, and NSERC.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cheng, Y., Sabaa, H., Cai, Z. et al. Efficient haplotype inference algorithms in one whole genome scan for pedigree data with non-genotyped founders. Acta Math. Appl. Sin. Engl. Ser. 25, 477–488 (2009). https://doi.org/10.1007/s10255-008-8821-3

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10255-008-8821-3

Keywords

2000 MR Subject Classification

Navigation