Skip to main content

Large-Scale Analysis of Phylogenetic Search Behavior

  • Conference paper
  • First Online:
Advances in Computational Biology

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 680))

Abstract

Phylogenetic analysis is used in all branches of biology with applications ranging from studies on the origin of human populations to investigations of the transmission patterns of HIV. Most phylogenetic analyses rely on effective heuristics for obtaining accurate trees. However, relatively little work has been done to analyze quantitatively the behavior of phylogenetic heuristics in tree space. A better understanding of local search behavior can facilitate the design of better heuristics, which ultimately lead to more accurate depictions of the true evolutionary relationships. In this paper, we present new and novel insights into local search behavior for maximum parsimony on three biological datasets consisting of 44, 60, and 174 taxa. By analyzing all trees from search, we find that, as the search algorithm climbs the hill to local optima, the trees in the neighborhood surrounding the current solution improve as well. Furthermore, the search is quite robust to a small number of randomly selected neighbors. Thus, our work shows how to gain insights into the behavior of local search algorithm by exploring a large diverse collection of trees.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. A. R. Deans, J. J. Gillespie, and M. J. Yoder (2006) An evaluation of ensign wasp classification (Hymenoptera: Evanildae) based on molecular data and insights from ribosomal rna secondary structure. Syst. Ento., 31:517–528.

    Article  Google Scholar 

  2. J. Gillespie, C. McKenna, M. Yoder, R. Gutell, J. Johnston, J. Kathirithamby, and A. Cognato (2005) Assessing the odd secondary structural properties of nuclear small subunit ribosomal rna sequences (18s) of the twisted-wing parasites (Insecta: Strepsiptera). Insect Mol. Biol., 15:625–643.

    Article  Google Scholar 

  3. P. A. Goloboff, J. S. Farris, and K. C. Nixon (2008) Tnt, a free program for phylogenetic analysis. Cladistics, 24(5):774–786.

    Article  Google Scholar 

  4. W. J. Murphy, E. Eizirik, S. J. O’Brien, O. Madsen, M. Scally, C. J. Douady, E. Teeling, O. A. Ryder, M. J. Stanhope, W. W. de Jong, and M. S. Springer (2001) Resolution of the early placental mammal radiation using bayesian phylogenetics. Science, 294:2348–2351.

    Article  PubMed  CAS  Google Scholar 

  5. K. C. Nixon (1999) The parsimony ratchet, a new method for rapid parsimony analysis. Cladistics, 15:407–414.

    Article  Google Scholar 

  6. H.-J. Park, S.-J. Sul, and T. L. Williams (2009) Large-scale analysis of phylogenetic search behavior. Technical Report TR-CS-2009-12-1, Department of Computer Science and Engineering, Texas A& M University.

    Google Scholar 

  7. D. F. Robinson and L. R. Foulds (1981) Comparison of phylogenetic trees. Math Biosci, 53:131–147.

    Article  Google Scholar 

  8. U. Roshan, B. M. E. Moret, T. L. Williams, and T. Warnow (2004) Rec-I-DCM3: a fast algorithmic techniques for reconstructing large phylogenetic trees. In Proc. IEEE Computer Society Bioinformatics Conference (CSB 2004), pp 98–109. IEEE Press.

    Google Scholar 

  9. N. Saitou and M. Nei (1987) The neighbor-joining method: A new method for reconstructiong phylogenetic trees. Mol. Biol. Evol., 4:406–425.

    PubMed  CAS  Google Scholar 

  10. S.-J. Sul and T. L. Williams (2008) An experimental analysis of robinson-foulds distance matrix algorithms. In European Symposium of Algorithms (ESA’08), volume 5193 of Lecture Notes in Computer Science, pp 793–804. Springer, New York.

    Google Scholar 

  11. D. L. Swofford (2002) PAUP*: Phylogenetic analysis using parsimony (and other methods), Sinauer Associates, Underland, Massachusetts, Version 4.0.

    Google Scholar 

Download references

Acknowledgments

Funding for this project was supported by the National Science Foundation under grants DEB-0629849 and IIS-0713618. The authors wish to thank Bill Murphy and Matt Yoder for providing us with the biological datasets used in this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyun Jung Park .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this paper

Cite this paper

Park, H.J., Sul, SJ., Williams, T.L. (2010). Large-Scale Analysis of Phylogenetic Search Behavior. In: Arabnia, H. (eds) Advances in Computational Biology. Advances in Experimental Medicine and Biology, vol 680. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-5913-3_5

Download citation

Publish with us

Policies and ethics