Journal of Molecular Evolution

, Volume 60, Issue 4, pp 426–433 | Cite as

Accuracy and Power of the Likelihood Ratio Test for Comparing Evolutionary Rates Among Genes

Article

Abstract

Sequences for multiple protein-coding genes are now commonly available from several, often closely related species. These data sets offer intriguing opportunities to test hypotheses regarding whether different types of genes evolve under different selective pressures. Although maximum likelihood (ML) models of codon substitution that are suitable for such analyses have been developed, little is known about the statistical properties of these tests. We use a previously developed fixed-sites model and computer simulations to examine the accuracy and power of the likelihood ratio test (LRT) in comparing the nonsynonymous-to-synonymous substitution rate ratio (ω = dN/dS) between two genes. Our results show that the LRT applied to fixed-sites models may be inaccurate in some cases when setting significance thresholds using a χ2 approximation. Instead, we use a parametric bootstrap to describe the distribution of the LRT statistic for fixed-sites models and examine the power of the test as a function of sampling variables and properties of the genes under study. We find that the power of the test is high (>80%) even when sampling few taxa (e.g., six species) if sequences are sufficiently diverged and the test is largely unaffected by the tree topology used to simulate data. Our simulations show fixed-sites models are suitable for comparing substitution parameters among genes evolving under even strong evolutionary constraint (ω ≈ 0.05), although relative rate differences of 25% or less may be difficult to detect.

Keywords

Likelihood ratio test Nonsynonymous/synonymous rate ratio 

Notes

Acknowledgments

We thank Z. Yang, W. Swanson, and J. Felsenstein for helpful discussions. This work was supported by an NSF IGERT predoctoral fellowship and NSF dissertation improvement grant (DEB-0105176) to J.E.A. and by a grant from the NIH (GM54185) to P.C.P.

References

  1. Anisimova, M, Bielawski, JP, Yang, Z 2001The accuracy and power of likelihood ratio tests to detect positive selection at amino acid sitesMol Biol Evol1815851592PubMedGoogle Scholar
  2. Anisimova, M, Bielawski, JP, Yang, Z 2002Accuracy and power of Bayes prediction of amino acid sites under positive selectionMol Biol Evol19950958PubMedGoogle Scholar
  3. Barrier, M, Bustamante, CD, Yu, J, Purugganan, MD 2003Selection on rapidly evolving proteins in the Arabidopsis genomeGenetics163723733PubMedGoogle Scholar
  4. Endo, T, Ikeo, K, Gojobori, T 1996Large-scale selection for genes on which positive selection may operateMol Biol Evol13685690PubMedGoogle Scholar
  5. Goldman, N 1993Statistical tests of models of DNA substitutionJ Mol Evol36182198CrossRefPubMedGoogle Scholar
  6. Goldman, N, Yang, Z 1994A codon-based model of nucleotide substitution for protein-coding DNA sequencesMol Biol Evol11725736PubMedGoogle Scholar
  7. Huelsenbeck, JP, Crandall, KA 1997Phylogeny estimation and hypothesis testing using maximum likelihoodAnnu Rev Ecol Syst28437466CrossRefGoogle Scholar
  8. Huelsenbeck, JP, Rannala, B 1997Phylogenetic methods come of age: testing hypotheses in an evolutionary contextScience276227231CrossRefPubMedGoogle Scholar
  9. Li, W 1997Molecular evolutionSinauer AssociatesSunderland, MAGoogle Scholar
  10. Lu, Y, Rausher, MD 2003Evolutionary rate variation in anthocyanin pathway genesMol Biol Evol2018441853CrossRefPubMedGoogle Scholar
  11. Muse, SV, Gaut, BS 1994A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genomeMol Biol Evol11715724PubMedGoogle Scholar
  12. Nielsen, R, Yang, Z 1998Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope geneGenetics148929936PubMedGoogle Scholar
  13. Riley, R, Jin, W, Gibson, G 2003Contrasting selection pressures on components of Ras-mediated signal transduction in Droso- philaMol Ecol1213151323CrossRefPubMedGoogle Scholar
  14. Stuart, A, Ord, K, Arnold, S 1999Kendall’s advanced theory of statistics, 6th ed, Vol 2a.ArnoldLondonGoogle Scholar
  15. Whelan, S, Goldman, N 1999Distributions of statistics used for the comparison of models of sequence evolution in phylogeneticsMol Biol Evol1612921299Google Scholar
  16. Yang, Z 1997PAML: a program for package for phylogenetic analysis by maximum likelihoodCABIOS15555556Google Scholar
  17. Yang, Z 1998Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolutionMol Biol Evol15568573PubMedGoogle Scholar
  18. Yang, Z 2002Inference of selection from multiple species alignmentsCurr Opin Genet Dev12688694CrossRefPubMedGoogle Scholar
  19. Yang, Z, Bielaeski, JP 2000Statistical methods for detecting molecular adaptationTREE15496501PubMedGoogle Scholar
  20. Yang, Z, Nielsen, R 2000Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary modelsMol Biol Evol173243PubMedGoogle Scholar
  21. Yang, Z, Nielsen, R 2002Codon-substitution models for detecting molecular adaptation at individual sites along specific lineagesMol Biol Evol19908917PubMedGoogle Scholar
  22. Yang, Z, Swanson, WJ 2002Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classesMol Biol Evol194957PubMedGoogle Scholar
  23. Yang, Z, Nielsen, R, Goldman, N, Petersen, A 2000Codon-substitution models for heterogeneous selection pressure at amino acid sitesGenetics155431499PubMedGoogle Scholar
  24. Zhang, J 1999Performance of likelihood ratio tests of evolutionary hypotheses under inadequate substitution modelsMol Biol Evol16868875PubMedGoogle Scholar

Copyright information

© Springer Science+Business Media, Inc. 2005

Authors and Affiliations

  1. 1.Center for Ecology and Evolutionary BiologyUniversity of OregonEugeneUSA
  2. 2.Department of Genome SciencesUniversity of WashingtonSeattleUSA

Personalised recommendations