A Fast Program for Phylogenetic Tree Inference with Maximum Likelihood

  • Alexandros P. Stamatakis
  • Thomas Ludwig
  • Harald Meier
Conference paper


Inference of large phylogenetic trees using elaborate statistical models is computationally extremely intensive. Thus, progress is primarily achieved via algorithmic innovation rather than by brute-force allocation of all available computational resources. We present simple heuristics which yield accurate trees for synthetic (simulated) as well as real data and improve execution time compared to the currently fastest programs. The new heuristics are implemented in a sequential program (RAxML) which is available as open source code. Furthermore, we present a non-deterministic parallel version of our algorithm which in some cases yielded super-linear speedups for computations with 1000 organisms. We compare sequential RAxML performance with the currently fastest and most accurate programs for phylogenetic tree inference based on statistical methods using 50 synthetic alignments and 9 real-world alignments comprising up to 1000 sequences. RAxML outperforms those programs for real-world data in terms of speed and final likelihood values.


Pairing Interaction Spin Susceptibility Doping Density Hubbard Interaction Quantum Monte Carlo Simulation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Felsenstein, J.: Evolutionary Trees from DNA Sequences: A Maximum Likelihood Approach. In: J. Mol. Evol., 17:368–376, 1981.CrossRefGoogle Scholar
  2. 2.
    Guindon, S., and Gascuel, O.: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. In: Syst. Biol., 52(5):696–704, 2003.CrossRefGoogle Scholar
  3. 3.
    Holder, M.T., and Lewis, P.O.: Phylogeny Estimation: Traditional and Bayesian Approaches. In: Nat. Rev. Gen., 4:275–284, 2003.CrossRefGoogle Scholar
  4. 4.
    Huelsenbeck, J.P., and Ronquist, F.: MRBAYES: Bayesian inference of phylogenetic trees. In: Bioinf., 17(8):754–5, 2001.CrossRefGoogle Scholar
  5. 5.
    Huelsenbeck, J.P., et al.: Potential Applications and Pitfalls of Bayesian Inference of Phylogeny. In: Syst. Biol., 51(5):673–688, 2002.CrossRefGoogle Scholar
  6. 6.
    Ludwig, W. et al.: ARB: A Software Environment for Sequence Data. In: Nucl. Acids Res., in press, 2003.Google Scholar
  7. 7.
    Olsen, G., et al.: fastdnaml: A Tool for Construction of Phylogenetic Trees of DNA Sequences using Maximum Likelihood. In: Comput. Appl. Biosci., 10:41–48, 1994.Google Scholar
  8. 8.
    PAML Manual:, visited Nov 2003.Google Scholar
  9. 9.
    PAUP:, visited May 2003.Google Scholar
  10. 10.
    PHYLIP:, visited Nov 2003.Google Scholar
  11. 11.
    RRZE:, visited Oct 2003.Google Scholar
  12. 12.
    Stamatakis, A.P., et al: New Fast and Accurate Heuristics for Inference of Large Phylogenetic Trees. In: Proc. of IPDPS2004, to be published.Google Scholar
  13. 13.
    Stamatakis, A.P., et al: A Fast Program for Maximum Likelihood-based Inference of Large Phylogenetic Trees. In: Proc. of SAC'04, to be published.Google Scholar
  14. 14.
    Stamatakis, A.P., et al.: Accelerating Parallel Maximum Likelihood-based Phylogenetic Tree Computations using Subtree Equality Vectors. In: Proc. of SC2002, 2002.Google Scholar
  15. 15.
    Stewart, C. et al.: Parallel Implementation and Performance of fastdnaml-a Program for Maximum Likelihood Phylogenetic Inference. In: Proc. of SC2001, 2001.Google Scholar
  16. 16.
    Strimmer, K., Haeseler, A.v.: Quartet Puzzling: A Maximum-Likelihood Method for Reconstructing Tree Topologies. In: Mol. Biol. Evol., 13:964–969, 1996.Google Scholar
  17. 17.
    Williams, T.L., Moret, B.M.E.: An Investigation of Phylogenetic Likelihood Methods. In: Proc. of BIBE'03, 2003.Google Scholar
  18. 18.
    Tuffley, C., Steel, M.: Links between Maximum Likelihood and Maximum Parsimony under a Simple Sodel of Site Substitution. In: Bull. Math. Biol., 59(3):581–607, 1997.CrossRefMATHGoogle Scholar
  19. 19.
    Wolf, M.J., et al.: TrExML: A Maximum Likelihood Program for Extensive Tree-space Exploration. In: Bioinf., 16(4):383–394, 2000.CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Alexandros P. Stamatakis
    • 1
  • Thomas Ludwig
    • 2
  • Harald Meier
    • 1
  1. 1.Department of Computer ScienceTechnische Universität MünchenGarching b. MünchenGermany
  2. 2.Department of Computer ScienceRuprecht-Karls-UniversitätHeidelbergGermany

Personalised recommendations