Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods
 Ziheng Yang
 … show all 1 hide
Rent the article at a discount
Rent now* Final gross prices may vary according to local VAT.
Get AccessAbstract
Two approximate methods are proposed for maximum likelihood phylogenetic estimation, which allow variable rates of substitution across nucleotide sites. Three data sets with quite different characteristics were analyzed to examine empirically the performance of these methods. The first, called the “discrete gamma model,” uses several categories of rates to approximate the gamma distribution, with equal probability for each category. The mean of each category is used to represent all the rates falling in the category. The performance of this method is found to be quite good, and four such categories appear to be sufficient to produce both an optimum, or nearoptimum fit by the model to the data, and also an acceptable approximation to the continuous distribution. The second method, called “fixedrates model,” classifies sites into several classes according to their rates predicted assuming the star tree. Sites in different classes are then assumed to be evolving at these fixed rates when other tree topologies are evaluated. Analyses of the data sets suggest that this method can produce reasonable results, but it seems to share some properties of a leastsquares pairwise comparison; for example, interior branch lengths in nonbest trees are often found to be zero. The computational requirements of the two methods are comparable to that of Felsenstein's (1981, J Mol Evol 17:368–376) model, which assumes a single rate for all the sites.
 Best, DJ, Roberts, DE (1975) The percentage points of the ζ2 distribution. Appl Statist 24: pp. 385388
 Bhattacharjee, GP (1970) The incomplete gamma integral. Appl Statist 19: pp. 285287
 Brown, WM, Prager, EM, Wang, A, Wilson, AC (1982) Mitochondrial DNA sequences of primates, tempo and mode of evolution. J Mol Evol 18: pp. 225239
 CavalliSforza, LL, Edwards, AWF (1967) Phylogenetic analysis: models and estimation procedures. Evolution 32: pp. 550570
 Felsenstein, J (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol 17: pp. 368376
 Fitch, WM (1986) The estimate of total nucleotide substitutions from pairwise differences is biased. Philos Trans R Soc Lond Biol 312: pp. 317324
 Fitch, WM, Margolish, E (1967) A method for estimating the number of invariant amino acid coding positions in a gene, using cytochrome c as a model case. Biochem Genet 1: pp. 6571
 Fitch, WM, Markowitz, E (1970) An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution. Biochem Genet 4: pp. 579593
 Goldman, N (1993) Statistical tests of models of DNA substitution. J Mol Evol 36: pp. 182198
 Hasegawa, M (1991) Molecular phylogeny and man's place in Hominoidea. J Anthrop Soc Nippon 99: pp. 4961
 Hasegawa, M, Horai, J (1991) Time of the deepest root for polymorphism in human mitochondrial DNA. J Mol Evol 32: pp. 3742
 Hasegawa, M, Kishino, H (1989) Confidence limits on the maximum likelihood estimation of the hominoid tree from mitochondrial DNA sequences. Evolution 43: pp. 672677
 Hasegawa, M, Kishino, H, Yano, T (1985) Dating the humanape splitting by a molecular clock of mitochondrial DNA. J Mol Evol 22: pp. 160174
 Hasegawa, M, Rienzo, AD, Kocher, TD, Wilson, AC (1993) Toward a more accurate time scale for the human mitochondrial DNA tree. J Mol Evol 37: pp. 347354
 Holmquist, R, Goodman, M, Conry, T, Czelusniak, I (1983) The spatial distribution of fixed mutations within genes coding for proteins. J Mol Evol 19: pp. 137448
 Jin, L, Nei, M (1990) Limitations of the evolutionary parsimony method of phylogeny analysis. Mol Biol Evol 7: pp. 82102
 Kocher, TD, Wilson, AC Sequence evolution of mitochondrial DNA in humans and chimpanzees: Control region and a proteincoding region. In: Osawa, S, Honjo, T eds. (1991) Evolution of life: fossils, molecules, and culture. SpringerVerlag, Tokyo, pp. 391413
 Li, WH, Gouy, M, Sharp, PM, O'hUigin, C, Yang, YW (1990) Molecular phylogeny of rodentia, lagomorpha, primates, artiodactyla, and carnivora and molecular clocks. Proc Natl Acad Sci USA 87: pp. 67036707
 Navidi, WC, Churchill, GA, Haeseler, A (1991) Methods for inferring phylogenies from nucleotide acid sequence data by using maximum likelihood and linear invariants. Mol Biol Evol 8: pp. 128143
 Nei, M, Gojobori, T (1986) Simple methods for estimating the number of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol 3: pp. 418426
 Palumbi, SR (1989) Rates of molecular evolution and the function of nucleotide positions free to vary. J Mol Evol 29: pp. 180187
 Tamura, K, Nei, M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol 10: pp. 512526
 Thorne, JL, Kishino, H, Felsenstein, J (1992) Inching toward reliability: an improved likelihood model of sequence evolution. J Mol Evol 34: pp. 316
 Uzzell, T, Corbin, KW (1971) Fitting discrete probability distributions to evolutionary events. Science 172: pp. 10891096
 Wakeley, J (1993) Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA. J Mol Evol 37: pp. 613623
 Yang, Z (1993) Maximum likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Mol Biol Evol 10: pp. 13961401
 Yang Z (in press) Estimating the pattern of nucleotide substitution. J Mol Evol
 Yang Z, Wang T (in press) Mixed model analysis of DNA sequence evolution. Biometrics
 Yang, Z, Goldman, N, Friday, AE (1994) Comparison of models for nucleotide substitution used in maximum likelihood phylogenetic estimation. Mol Biol Evol 11: pp. 316324
 Title
 Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods
 Journal

Journal of Molecular Evolution
Volume 39, Issue 3 , pp 306314
 Cover Date
 19940901
 DOI
 10.1007/BF00160154
 Print ISSN
 00222844
 Online ISSN
 14321432
 Publisher
 SpringerVerlag
 Additional Links
 Topics
 Keywords

 Phylogeny
 Maximum likelihood
 Rate variation over sites
 The gamma distribution
 Approximate methods
 Industry Sectors
 Authors

 Ziheng Yang ^{(1)}
 Author Affiliations

 1. Department of Zoology, The Natural History Museum, SW7 51313, London, United Kingdom