Abstract
A general model for estimating the number of amino acid substitutions per site (d) from the fraction of identical residues between two sequences (q) is proposed. The well-known Poisson-correction formula q = e −d corresponds to a site-independent and amino-acid-independent substitution rate. Equation q = (1 − e −2d)/2d, derived for the case of substitution rates that are site-independent, but vary among amino acids, approximates closely the empirical method, suggested by Dayhoff et al. (1978). Equation q = 1/(1 + d) describes the case of substitution rates that are amino acid-independent but vary among sites. Lastly, equation q = [ln(1 + 2d)]/2d accounts for the general case where substitution rates can differ for both amino acids and sites.
Similar content being viewed by others
References
Bulmer M (1991) Use of the method of generalized least squares in reconstructing phylogenies from sequence data. Mol Biol Evol 8:868–883
Dayhoff MO, Eck RV, Park CM (1972) A model of evolutionary change in proteins. In; Dayhoff MO (ed) Atlas of protein sequence and structure, vol 5. National Biomedical Research Foundation, Washington, DC, pp 89–99
Dayhoff MO, Schwartz RM, Orcutt BC (1978) A model of evolutionary change in proteins. In: Dayhoff MO (ed) Atlas of protein sequence and structure, vol 5, suppl 3. National Biomedical Research Foundation, Washington, DC, pp 345–352
Holmquist R, Goodman M, Conroy T, Czelusniak J (1983) The spatial distribution of fixed mutations within genes coding for proteins. J Mol Evol 19:437–448.
Ota T, Nei M (1994) Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. J Mol Evol 38:642–643
Takacs L (1966) Stochastic process. Methuen & Co, London: John Wiley & Sons, New York; pp 28–46.
Uzzell T, Corbin KW (1971) Fitting discrete probability distribution to evolutionary events. Science 172:1089–1096
Zuckerkandl E, Pauling L (1965) Evolutionary divergence and convergence in proteins. In: Bryson V, Vogel HJ (eds) Evolving genes and proteins Academic Press, New York, pp 97–166.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Grishin, N.V. Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. J Mol Evol 41, 675–679 (1995). https://doi.org/10.1007/BF00175826
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00175826