Abstract
Some phylogenetic comparative analyses rely on simulation procedures that use a large number of phylogenetic trees to estimate evolutionary correlations. Because of the computational burden of processing hundreds of thousands of trees, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present a highly parallel and efficient implementation for calculating phylogenetic distances. By using the power of GPU computing and a massive number of threads we are able to achieve performance gains up to 243x when compared to a sequential implementation of the same procedures. New data structures and algorithms are also presented so as to efficiently process irregular pointer-based data structures such as trees. In particular, a GPU-based parallel implementation of the lowest common ancestor (LCA) problem is presented. Moreover, the implementation makes intensive use of bitmaps to efficiently encode paths to the tree nodes, and optimize memory transactions by working with data structures that favors coalesced memory accesses. Our results open up the possibility of dealing with large datasets in evolutionary and ecological analyses.
Keywords
- parallel computing
- GPU
- phylogenetics
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Felsenstein, J.: Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39(4), 783–791 (1985)
Suchard, M.A., Rambaut, A.: Many-core algorithms for statistical phylogenetics. Bioinformatics 25, 1370–1376 (2009)
Ayres, D.L., Darling, A., Zwickl, D.J., Beerli, P., Holder, M.T., Lewis, P.O., Huelsenbeck, J.P., Ronquist, F., Swofford, D.L., Cummings, M.P., Rambaut, A., Suchard, M.A.: BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics. Syst. Biol. 61(1), 170–173 (2012)
Stamatakis, A.: Parallel and Distributed Computation of Large Phylogenetic Trees. In: Zomaya, A.Y. (ed.) Parallel Computing for Bioinformatics and Computational Biology: Models, Enabling Technologies, and Case Studies. John Wiley & Sons, Inc., Hoboken (2005)
Petzold, E., Merkle, D., Middendorf, M., von Haeseler, A., Schmidt, H.A.: Phylogenetic Parameter Estimation on COWs. In: Zomaya, A.Y. (ed.) Parallel Computing for Bioinformatics and Computational Biology: Models, Enabling Technologies, and Case Studies, John Wiley & Sons, Inc., Hoboken (2005)
Williams, T.L., Bader, D.A., Moret, B.M.E., Yan, M.: High-Performance Phylogeny Reconstruction Under Maximum Parsimony. In: Zomaya, A.Y. (ed.) Parallel Computing for Bioinformatics and Computational Biology: Models, Enabling Technologies, and Case Studies. John Wiley & Sons, Inc., Hoboken (2005)
Martins, E.P.: COMPARE, version 4.6b. Computer programs for the statistical analysis of comparative data. Department of Biology, Indiana University, Bloomington, IN
Webb, C.O., Ackerly, D.D., Kembel, S.W.: Phylocom: software for the analysis of phylogenetic community structure and trait evolution. Bioinformatics 24, 2098–2100 (2008)
Fourment, M., Gibbs, M.: PATRISTIC: a program for calculating patristic distances and graphically comparing the components of genetic change. BMC Evol. Biol. 6, 1 (2006)
Olsen, G.: “Newick’s 8:45” Tree Format Standard (1990), http://evolution.genetics.washington.edu/phylip/newick_doc.html
Aho, A., Hopcroft, J., Ullman, J.: On finding lowest common ancestors in trees. In: Proc. 5th ACM Symp. Theory of Computing (STOC), pp. 253–265 (1973)
Bender, M.A., Farach-Colton, M.: The LCA Problem Revisited. In: Gonnet, G.H., Viola, A. (eds.) LATIN 2000. LNCS, vol. 1776, pp. 88–94. Springer, Heidelberg (2000)
Schieber, B., Vishkin, U.: On finding lowest common ancestors: simplification and parallelization. SIAM Journal on Computing 17(6), 1253–1262 (1988)
Dartzmann, T., von Helversen, O., Mayer, F.: Evolution of nectarivory in phyllostomid bats (Phyllostomidae Gray, 1825, Chiroptera: Mammalia). Evolutionary Biology 10, 165 (2010)
Bininda-Emonds, O.R.P., Cardillo, M., Jones, K.E., MacPhee, R.D.E., Beck, R.M.D., Grenyer, R., Price, S.A., Vos, R.A., Gittleman, J.L., Purvis, A.: The delayed rise of present-day mammals. Nature 446, 507–512 (2007)
McGuire, J.A., Witt, C.C., Altshuler, D.L., Remsen Jr., J.V.: Phylogenetic systematics and biogeography of hummingbirds: Bayesian and maximum likelihood analyses of partitioned data and selection of an appropriate partitioning strategy. Systematic Biology 56, 837–856
Pyron, R.A., Wiens, J.J.: A large-scale phylogeny of Amphibia including over 2800 species, and a revised classification of extant frogs, salamanders and caecilians. Molecular Phylogenetics and Evolution 61, 543–583 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martins, W.S., Rangel, T.F., Lucas, D.C.S., Ferreira, E.B., Caceres, E.N. (2012). Phylogenetic Distance Computation Using CUDA. In: de Souto, M.C., Kann, M.G. (eds) Advances in Bioinformatics and Computational Biology. BSB 2012. Lecture Notes in Computer Science(), vol 7409. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31927-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-31927-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31926-6
Online ISBN: 978-3-642-31927-3
eBook Packages: Computer ScienceComputer Science (R0)
