Phylogenetic Distance Computation Using CUDA

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNBI, volume 7409)

Abstract

Some phylogenetic comparative analyses rely on simulation procedures that use a large number of phylogenetic trees to estimate evolutionary correlations. Because processing hundreds of thousands of trees is computationally expensive, these analyses are of limited applicability unless the procedure is implemented efficiently. In this paper, we present a highly parallel and efficient implementation for calculating phylogenetic distances. By exploiting GPU computing with a massive number of threads, we achieve performance gains of up to 243x over a sequential implementation of the same procedures. We also present new data structures and algorithms for efficiently processing irregular pointer-based data structures such as trees. In particular, we present a GPU-based parallel implementation for the lowest common ancestor (LCA) problem. Moreover, the implementation makes intensive use of bitmaps to efficiently encode paths to the tree nodes, and optimizes memory transactions by working with data structures that favor coalesced memory accesses. Our results open up the possibility of dealing with large datasets in evolutionary and ecological analyses.
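
The abstract does not spell out how paths are encoded as bitmaps, so the following is only a minimal sequential sketch of one plausible reading, not the authors' GPU implementation. It assumes a hypothetical layout in which each node stores a bitmap of the nodes on its root-to-node path, with nodes numbered so that every parent has a smaller index than its children. Under that assumption, the LCA of two nodes is the highest set bit of the bitwise AND of their bitmaps, and the patristic distance follows from precomputed root distances. The names `build_tables`, `lca`, and `patristic_distance` are illustrative only.

```python
def build_tables(parent, branch_length):
    """Precompute per-node ancestor bitmaps and distances from the root.

    Assumption (not from the paper): nodes are numbered so that
    parent[i] < i, and parent[root] == -1.
    """
    n = len(parent)
    ancestors = [0] * n    # bit j set in ancestors[i] => node j lies on the root-to-i path
    root_dist = [0.0] * n  # sum of branch lengths from the root down to node i
    for i in range(n):
        p = parent[i]
        if p < 0:
            ancestors[i] = 1 << i
        else:
            ancestors[i] = ancestors[p] | (1 << i)
            root_dist[i] = root_dist[p] + branch_length[i]
    return ancestors, root_dist


def lca(ancestors, a, b):
    """Lowest common ancestor: highest-numbered node shared by both paths."""
    common = ancestors[a] & ancestors[b]
    return common.bit_length() - 1


def patristic_distance(ancestors, root_dist, a, b):
    """Path length between a and b through their lowest common ancestor."""
    c = lca(ancestors, a, b)
    return root_dist[a] + root_dist[b] - 2.0 * root_dist[c]


# Hypothetical example tree in Newick notation: ((A:1,B:2):3,C:4);
# node 0 = root, 1 = internal node, 2 = A, 3 = B, 4 = C
parent = [-1, 0, 1, 1, 0]
branch_length = [0.0, 3.0, 1.0, 2.0, 4.0]
ancestors, root_dist = build_tables(parent, branch_length)

print(lca(ancestors, 2, 3))                            # LCA of A and B
print(patristic_distance(ancestors, root_dist, 2, 3))  # distance A to B
print(patristic_distance(ancestors, root_dist, 2, 4))  # distance A to C
```

Because each pairwise query reads only two bitmaps and three root distances, queries are independent of one another, which is the property a one-thread-per-pair GPU mapping would rely on. How the paper actually lays out these arrays to obtain coalesced accesses is not stated in the abstract.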

Keywords

  • parallel computing
  • GPU
  • phylogenetics

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Martins, W.S., Rangel, T.F., Lucas, D.C.S., Ferreira, E.B., Caceres, E.N. (2012). Phylogenetic Distance Computation Using CUDA. In: de Souto, M.C., Kann, M.G. (eds) Advances in Bioinformatics and Computational Biology. BSB 2012. Lecture Notes in Computer Science, vol 7409. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31927-3_15

  • DOI: https://doi.org/10.1007/978-3-642-31927-3_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31926-6

  • Online ISBN: 978-3-642-31927-3

  • eBook Packages: Computer Science, Computer Science (R0)