Experimental Analysis of a New Algorithm for Partial Haplotype Completion

  • Paola Bonizzoni
  • Gianluca Della Vedova
  • Riccardo Dondi
  • Lorenzo Mariani
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3515)


This paper deals with the computational problem of inferring complete information on haplotypes from haplotypes with missing data. This problem is one of the main issues in haplotyping, as the current DNA sequencing technology often produces haplotypes with missing bases and thus the complete information on haplotypes has to be inferred through computational methods. In this paper we propose a new algorithmic approach to the problem that assumes both the Coalescent and the Minimum Entropy models and we provide an experimental analysis relating it to the previously investigated approaches. In particular, the reconstruction of a perfect phylogeny from haplotypes with missing data is addressed.


Greedy Heuristic Average Error Rate Completion Matrix Coalescent Model Perfect Phylogeny 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Bonizzoni, P., Della Vedova, G., Dondi, R., Li, J.: The haplotyping problem: a view of computational models and solutions. Journal of Computer and Science Technology 18, 675–688 (2003)zbMATHCrossRefGoogle Scholar
  2. 2.
    Daly, M.J., Rioux, J.D., Schaffner, S.F., et al.: High-resolution haplotype structure in the human genome. Nat. Genet. 29(2), 229–232 (2001)CrossRefGoogle Scholar
  3. 3.
    Eskin, E., Halperin, E., Karp, R.M.: Large scale reconstruction of haplotypes from genotype data. In: Proceedings of the 7th RECOMB, pp. 104–113 (2003)Google Scholar
  4. 4.
    Gabriel, S.B., Schaffner, S.F., et al.: The structure of haplotype blocks in the human genome. Science 296, 2225–2229 (2002)CrossRefGoogle Scholar
  5. 5.
    Gusfield, D.: Haplotyping as perfect phylogeny: Conceptual framework and efficient solutions. In: Proceedings of the 6th RECOMB, pp. 166–175 (2002)Google Scholar
  6. 6.
    Halperin, E., Karp, R.M.: The minimum-entropy set cover problem. In: Díaz, J., Karhumäki, J., Lepistö, A., Sannella, D. (eds.) ICALP 2004. LNCS, vol. 3142, pp. 733–744. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  7. 7.
    Halperin, E., Karp, R.M.: Perfect phylogeny and haplotype assignment. In: Proceedings of the 8th RECOMB (2004)Google Scholar
  8. 8.
    Helsgaun, K.: An effective implementation of the Lin-Kernighan traveling salesman heuristic. European Journal of Operational Research 126, 106–130 (2000)zbMATHCrossRefMathSciNetGoogle Scholar
  9. 9.
    National Institutes of Health. Large-scale genotyping for the haplotype map of the human genome. RFA (Request For Applications) HG-02-005 (2002)Google Scholar
  10. 10.
    Pe’er, I., Pupko, T., Shamir, R., Sharan, R.: Incomplete directed perfect phylogeny. SIAM Journal on Computing 33(3), 590–607 (2004)zbMATHCrossRefMathSciNetGoogle Scholar
  11. 11.
    Steel, M.: The complexity of reconstructing trees from qualitative characters and subtrees. Journal of Classification 9, 91–116 (1992)zbMATHCrossRefMathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Paola Bonizzoni
    • 1
    • 3
  • Gianluca Della Vedova
    • 2
    • 3
  • Riccardo Dondi
    • 1
    • 3
  • Lorenzo Mariani
    • 1
    • 3
  1. 1.Dipartimento di InformaticaSistemistica e Comunicazione 
  2. 2.Dipartimento di Statistica 
  3. 3.Univ. Milano–BicoccaMilanoItaly

Personalised recommendations