Faster Phylogenetic Inference with MXG

  • David G. Mitchell
  • Faraz Hach
  • Raheleh Mohebali
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4790)


We apply the logic-based declarative programming approach of Model Expansion (MX) to a phylogenetic inference task. We axiomatize the task in multi-sorted first-order logic with cardinality constraints. Using the model expansion solver MXG and SAT+cardinality solver MXC, we compare the performance of several MX axiomatizations on a challenging set of test instances. Our methods perform orders of magnitude faster than previously reported declarative solutions. Our best solution involves polynomial-time pre-processing, redundant axioms, and symmetry-breaking axioms. We also discuss our method of test instance generation, and the role of pre-processing in declarative programming.


Phylogeny Declarative Programming Model Expansion 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Adn, G.W., You, J.-H., Lin, G.: Quartet-based phylogeny reconstruction with answer set programming. IEEE/ACM Transactions on Computational Biology and Bioinformatics 4(1), 139–152 (2007)CrossRefGoogle Scholar
  2. 2.
    Bregman, D.R., Mitchell, D.G.: The sat solver mxc, version 0.5, Solver Description for the 2007 SAT Solver Competition (2007)Google Scholar
  3. 3.
    Brooks, D.R., Erdem, E., Minett, J.W., Rings, D.: Character-based cladistics and answer set programming. In: Hermenegildo, M.V., Cabeza, D. (eds.) PADL 2005. LNCS, vol. 3350, pp. 37–51. Springer, Heidelberg (2005)Google Scholar
  4. 4.
    Day, W.H.E., Johnson, D.S., Sankoff, D.: The computational complexity of inferring rooted phylogenies by parsimony. Mathematical Biosciences 81, 33–42 (1986)zbMATHCrossRefMathSciNetGoogle Scholar
  5. 5.
    Edwards-Ingram, L.C., Gent, M.E., Hoyle, D.C, Hayes, A., Stateva, L.I., Oliver, S.G.: Comparative genomic hybridization provides new insights into the molecular taxonomy of the saccharomyces sensu stricto complex. Genome Research 14, 1043–1051 (2004)CrossRefGoogle Scholar
  6. 6.
    Een, N., Sorensson, N.: An extensible SAT-solver. In: Giunchiglia, E., Tacchella, A. (eds.) SAT 2003. LNCS, vol. 2919, pp. 502–518. Springer, Heidelberg (2004)Google Scholar
  7. 7.
    Erdem, E., Lifschitz, V., Nakhleh, L., Ringe, D.: Reconstructing the evolutionary history of indo-european languages using answer set programming. In: Proc. Practical Aspects of Declarative Languages: 5th Int’l. Symposium, pp. 160–176 (January 2003)Google Scholar
  8. 8.
    Felsenstein, J.: Phylip home page phylip (1980),
  9. 9.
    Gebser, M., Kaufmann, B., Neumann, A., Schaub, T.: clasp: A conflict-driven answer set solver. In: Baral, C., Brewska, G., Schlipf, J. (eds.) LPNMR 2007. LNCS (LNAI), vol. 4483, pp. 260–265. Springer, Heidelberg (2007)Google Scholar
  10. 10.
    Gupta, A., Manuch, J., Zhao, X., Stacho, L.: Characterization of the existence of galled-tree networks. J. Bioinformatics and Computational Biology 4(6), 1309–1328 (2006)CrossRefGoogle Scholar
  11. 11.
    Hendy, M.D., Penny, D.: Branch and bound algorithms to determine minimal evolutionary trees. Mathematical Biosciences 59(2), 277–290 (1982)zbMATHCrossRefMathSciNetGoogle Scholar
  12. 12.
    Hoffmann, M., Tripathi, N., Henz, S.R., Lindholm, A.K., Weigel, D., Breden, F., Dreyer, C.: Opsin gene duplication and diversification in the guppy, a model for sexual selection. Proc. of the Royal Society of London Series B 274, 33–42 (2007)CrossRefGoogle Scholar
  13. 13.
    Kavanagh, J., Mitchell, D.G., Ternovska, E., Manuch, J., Zhao, X., Gupta, A.: Constructing Camin-Sokal phylogenies via answer set programming. In: Hermann, M., Voronkov, A. (eds.) LPAR 2006. LNCS (LNAI), vol. 4246, pp. 452–466. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  14. 14.
    Lierler, Y., Maratea, M.: Cmodels-2: SAT-based answer set solver enhanced to non-tight programs. In: Lifschitz, V., Niemelä, I. (eds.) LNMR 2004. LNCS (LNAI), vol. 2923, pp. 346–350. Springer, Heidelberg (2004)Google Scholar
  15. 15.
    Lynce, I., Silva, J.P.M.: Efficient haplotype inference with boolean satisfiability. In: Proc. AAAI 2006 (2006)Google Scholar
  16. 16.
    Mitchell, D., Ternovska, E., Hach, F., Mohebali, R.: A framework for modelling and solving search problems. Technical Report TR 2006-24, School of Computing Science, Simon Fraser University (December 2006)Google Scholar
  17. 17.
    Mitchell, D.G., Levesque, H.J.: Some pitfalls for experimenters with random SAT. Artificial Intelligence 81(1,2) (March 1996) Special Issue – Frontiers in Problem Solving: Phase Transitions and Complexity. Google Scholar
  18. 18.
    Nozaki, H., Ohta, N., Matsuzaki, M., Misumi, O., Kuroiwa, T.: Phylogeny of plastids based on cladistic analysis of gene loss inferred from complete plastid genome sequences. J. Molecular Evolution 57, 377–382 (2003)CrossRefGoogle Scholar
  19. 19.
    Swofford, D.L.: Paup* 4.0, Phylogenetic Analysis Using Parsimony (*and Other Methods) (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • David G. Mitchell
    • 1
  • Faraz Hach
    • 1
  • Raheleh Mohebali
    • 1
  1. 1.Computational Logic Laboratory, Simon Fraser University, Burnaby BCCanada

Personalised recommendations