Abstract
Grid computing denotes the combination of computers or clusters of computers across networks, such as the Internet, to form a distributed supercomputer. This infrastructure allows scientists to run complex and time-consuming computations in parallel, on demand. Phylogenetic inference for large data sets of DNA or protein sequences is known to be computationally intensive and could benefit greatly from this parallel supercomputing approach. Bayesian algorithms allow the estimation of important parameters on the mode and timing of species divergence, but at the price of running long, repeated series of Monte Carlo simulations. As part of the BioinfoGrid project, we ported parallel MrBayes to the EGEE (Enabling Grids for E-sciencE) grid infrastructure. As a case study, we investigate both a challenging data set of arthropod phylogeny and the most appropriate model of amino acid replacement for that data set. Our aim is to resolve the position of basal hexapod lineages with respect to Insecta and Crustacea. In this effort, a new matrix of protein change was derived from the data set itself, and its performance was compared with that of other currently used models.
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
van der Wath, R.C. et al. (2008). Bayesian Phylogeny on Grid. In: Elloumi, M., Küng, J., Linial, M., Murphy, R.F., Schneider, K., Toma, C. (eds) Bioinformatics Research and Development. BIRD 2008. Communications in Computer and Information Science, vol 13. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70600-7_30
DOI: https://doi.org/10.1007/978-3-540-70600-7_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70598-7
Online ISBN: 978-3-540-70600-7
eBook Packages: Computer Science; Computer Science (R0)