Skip to main content

Identifiability Issues in Phylogeny-Based Detection of Horizontal Gene Transfer

  • Conference paper
Comparative Genomics (RCG 2006)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4205))

Included in the following conference series:

Abstract

Prokaryotic organisms share genetic material across species boundaries by means of a process known as horizontal gene transfer (HGT). Detecting this process bears great significance on understanding prokaryotic genome diversification and unraveling their complexities. Phylogeny-based detection of HGT is one of the most commonly used approaches for this task, and is based on the fundamental fact that HGT may cause gene trees to disagree with one another, as well as with the species phylogeny. Hence, methods that adopt this approach compare gene and species trees, and infer a set of HGT events to reconcile the differences among these trees.

In this paper, we address some of the identifiability issues that face phylogeny-based detection of HGT. In particular, we show the effect of inaccuracies in the reconstructed (species and gene) trees on inferring the correct number of HGT events. Further, we show that a large number of maximally parsimonious HGT scenarios may exist. These results indicate that accurate detection of HGT requires accurate reconstruction of individual trees, and necessitates the search for more than a single scenario to explain gene tree disagreements. Finally, we show that disagreements among trees may be a result of not only HGT, but also lineage sorting, and make initial progress on incorporating HGT into the coalescent model, so as to stochastically distinguish between the two and make an accurate reconciliation. This contribution is very significant, particularly when analyzing closely related organisms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Addario-Berry, L., Hallett, M.T., Lagergren, J.: Towards identifying lateral gene transfer events. In: Proc. 8th Pacific Symp. on Biocomputing (PSB 2003), pp. 279–290 (2003)

    Google Scholar 

  2. Bordewich, M., Semple, C.: On the computational complexity of the rooted subtree prune and regraft distance. Annals of Combinatorics, 1–15 (in press, 2005)

    Google Scholar 

  3. Daubin, V., Moran, N.A., Ochman, H.: Phylogenetics and the cohesion of bacterial genomes. Science 301, 829–832 (2003)

    Article  Google Scholar 

  4. Doolittle, W.F., Boucher, Y., Nesbo, C.L., Douady, C.J., Andersson, J.O., Roger, A.J.: How big is the iceberg of which organellar genes in nuclear genomes are but the tip? Phil. Trans. R. Soc. Lond. B. Biol. Sci. 358, 39–57 (2003)

    Article  Google Scholar 

  5. Paulsen, I.T., et al.: Role of mobile DNA in the evolution of Vacomycin-resistant Enterococcus faecalis. Science 299(5615), 2071–2074 (2003)

    Article  Google Scholar 

  6. Ewens, W.J.: Mathematical Population Genetics. Springer, Berlin (1979)

    MATH  Google Scholar 

  7. Hallett, M.T., Lagergren, J.: Efficient algorithms for lateral gene transfer problems. In: Proc. 5th Ann. Int’l Conf. Comput. Mol. Biol. (RECOMB 2001), pp. 149–156. ACM Press, New York (2001)

    Google Scholar 

  8. Hudson, R.R.: Testing the constant-rate neutral allele model with protein sequence data. Evolution 37, 203–217 (1983)

    Article  Google Scholar 

  9. Hudson, R.R.: Properties of the neutral allele model with intergenic recombination. Theor. Popul. Biol. 23, 183–201 (1983)

    Article  MATH  Google Scholar 

  10. Kimura, M.: The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. Genetics 61, 893–903 (1969)

    Google Scholar 

  11. Kingman, J.F.C.: The coalescent. Stochast. Proc. Appl. 13, 235–248 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  12. Kunin, V., Goldovsky, L., Darzentas, N., Ouzounis, C.A.: The net of life: reconstructing the microbial phylogenetic network. Genome Research 15, 954–959 (2005)

    Article  Google Scholar 

  13. Lerat, E., Daubin, V., Moran, N.A.: From gene trees to organismal phylogeny in prokaryotes: The case of the γ-proteobacteria. PLoS Biology 1(1), 1–9 (2003)

    Article  Google Scholar 

  14. Maddison, W.P.: Gene trees in species trees. Systematic Biology 46(3), 523–536 (1997)

    Article  Google Scholar 

  15. Makarenkov, V.: T-REX: Reconstructing and visualizing phylogenetic trees and reticulation networks. Bioinformatics 17(7), 664–668 (2001)

    Article  Google Scholar 

  16. Moret, B.M.E., Nakhleh, L., Warnow, T., Linder, C.R., Tholse, A., Padolina, A., Sun, J., Timme, R.: Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Transactions on Computational Biology and Bioinformatics 1(1), 13–23 (2004)

    Article  Google Scholar 

  17. Nakhleh, L., Ruths, D., Wang, L.S.: RIATA-HGT: A fast and accurate heuristic for reconstructing horizontal gene transfer. In: Wang, L. (ed.) COCOON 2005. LNCS, vol. 3595, pp. 84–93. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  18. Nakhleh, L., Warnow, T., Linder, C.R.: Reconstructing reticulate evolution in species–theory and practice. In: Proc. 8th Ann. Int’l Conf. Comput. Mol. Biol. (RECOMB 2004), pp. 337–346 (2004)

    Google Scholar 

  19. Ochman, H., Lawrence, J.G., Groisman, E.A.: Lateral gene transfer and the nature of bacterial innovation. Nature 405(6784), 299–304 (2000)

    Article  Google Scholar 

  20. Rambaut, A., Grassly, N.C.: Seq-gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. Comp. Appl. Biosci. 13, 235–238 (1997)

    Google Scholar 

  21. Rosenberg, N.: The probability of topological concordance of gene trees and species tree. Theoretical Population Biology 61, 225–247 (2002)

    Article  MATH  Google Scholar 

  22. Rosenberg, N.A.: Gene genealogies. In: Fox, C.W., Wolf, J.B. (eds.) Evolutionary Genetics: Concepts and Case Studies, ch. 15. Oxford University Press, Oxford (2005)

    Google Scholar 

  23. Ruths, D., Nakhleh, L.: Techniques for assessing phylogenetic branch support: A performance study. In: Proceedings of the Fourth Asia-Pacific Bioinformatics Conference (APBC 2006), pp. 187–196 (2006)

    Google Scholar 

  24. Saitou, N., Nei, M.: The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987)

    Google Scholar 

  25. Sanderson, M.: r8s software package, Available from: http://loco.ucdavis.edu/r8s/r8s.html

  26. Swofford, D.L.: PAUP*: Phylogenetic analysis using parsimony (and other methods). Sinauer Associates, Underland (1996); Version 4.0

    Google Scholar 

  27. Tajima, F.: Evolutionary relationship of DNA sequences in finite populations. Genetics 105, 437–460 (1983)

    Google Scholar 

  28. Takahata, N.: Gene genealogy in three related populations: Consistency probability between gene and population trees. Genetics 122, 957–966 (1989)

    Google Scholar 

  29. Welch, R.A., Burland, V., Plunkett, G., Redford, P., Roesch, P., Rasko, D., Buckles, E.L., Liou, S.R., Boutin, A., Hackett, J., et al.: Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 99, 17020–17024 (2002)

    Article  Google Scholar 

  30. Zwickl, D., Hillis, D.: Increased taxon sampling greatly reduces phylogenetic error. Systematic Biology 51(4), 588–598 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Than, C., Ruths, D., Innan, H., Nakhleh, L. (2006). Identifiability Issues in Phylogeny-Based Detection of Horizontal Gene Transfer. In: Bourque, G., El-Mabrouk, N. (eds) Comparative Genomics. RCG 2006. Lecture Notes in Computer Science(), vol 4205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11864127_17

Download citation

  • DOI: https://doi.org/10.1007/11864127_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44529-6

  • Online ISBN: 978-3-540-44530-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics