Abstract
Given the structure of the genetic code, synonymous codons differ in their capacity to minimize the effects of errors due to mutation or mistranslation. I suggest that this may lead, in protein-coding genes, to a preference for codons that minimize the impact of errors at the protein level. I develop a theoretical measure of error minimization for each codon, based on amino acid similarity. This measure is used to calculate the degree of error minimization for 82 genes of Drosophila melanogaster and 432 rodent genes and to study its relationship with CG content, the degree of codon usage bias, and the rate of nucleotide substitution. I show that (i) Drosophila and rodent genes tend to prefer codons that minimize errors; (ii) this cannot be merely the effect of mutation bias; (iii) the degree of error minimization is correlated with the degree of codon usage bias; (iv) the amino acids that contribute more to codon usage bias are the ones for which synonymous codons differ more in the capacity to minimize errors; and (v) the degree of error minimization is correlated with the rate of nonsynonymous substitution. These results suggest that natural selection for error minimization at the protein level plays a role in the evolution of coding sequences in Drosophila and rodents.
Similar content being viewed by others
References
H Akashi (1994) ArticleTitleSynonymous codon usage in Drosophila melanogaster: Natural selection and translational accuracy Genetics 136 927–935 Occurrence Handle1:CAS:528:DyaK2MXpsFaq Occurrence Handle8005445
M Archetti (2004) ArticleTitleCodon usage bias and mutation constraints reduce the level of error minimization of the genetic code J Mol Evol 59 258 Occurrence Handle10.1007/s00239-004-2620-0 Occurrence Handle1:CAS:528:DC%2BD2cXntlCku7w%3D Occurrence Handle15486699
SA Benner MA Cohen GH Gonnet (1994) ArticleTitleAmino acid substitution during functionally constrained divergent evolution of protein sequences Protein Eng 7 IssueID11 1323–1332 Occurrence Handle1:CAS:528:DyaK2MXit1Cku7k%3D Occurrence Handle7700864
G Bernardi B Olofsson J Filipski M Zerial J Salinas G Cuny M Meunier-Rotival F Rodier (1985) ArticleTitleThe mosaic genome of warm-blooded vertebrates Science 228 953–958 Occurrence Handle1:CAS:528:DyaL2MXksVyquro%3D Occurrence Handle4001930
JP Bielawski KA Dunn Z Yang (2000) ArticleTitleRates of nucleotide substitution and mammalian nuclear gene evolution: approximate and maximum-likelihood methods lead to different conclusions Genetics 156 1299–1308 Occurrence Handle1:CAS:528:DC%2BD3cXosFSntro%3D Occurrence Handle11063703
MO Dayhoff RM Schwartz BC Orcutt (1978) A model of evolutionary change in proteins MO Dayhoff (Eds) Atlas of protein sequence and structure, Vol 5, Supp1 3 National Biomedical Research Foundation Washington DC 352
M Di Giulio (2000a) ArticleTitleGenetic code origin and the strength of natural selection J Theor Biol 205 659–661 Occurrence Handle10.1006/jtbi.2000.2115 Occurrence Handle1:CAS:528:DC%2BD3cXlsVSrsrw%3D
M Di Giulio (2000b) ArticleTitleThe origin of the genetic code Trends Biochem Sci 25 IssueID2 44 Occurrence Handle10.1016/S0968-0004(99)01522-4 Occurrence Handle1:CAS:528:DC%2BD3cXhsFejsbY%3D
M Di Giulio (2001) ArticleTitleThe origin of the genetic code cannot be studied using measurements based on the PAM matrix because this matrix reflects the code itself, making any such analyses tautologous J Theor Biol 208 141–144 Occurrence Handle10.1006/jtbi.2000.2206 Occurrence Handle1:CAS:528:DC%2BD3MXjt1yitA%3D%3D Occurrence Handle11162059
L Duret (2002) ArticleTitleEvolution of synonymous codon usage in metazoans Curr Opin Gen Dev 12 640–649 Occurrence Handle10.1016/S0959-437X(02)00353-2 Occurrence Handle1:CAS:528:DC%2BD38XosFOhsrs%3D
CJ Epstein (1966) ArticleTitleRole of the amino-acid “code” and of selection for conformation in the evolution of proteins Nature 210 25–28 Occurrence Handle1:CAS:528:DyaF28XpsVOksg%3D%3D Occurrence Handle5956344
SJ Freeland LD Hurst (1998) ArticleTitleThe genetic code is one in a million J Mol Evol 47 238–248 Occurrence Handle1:CAS:528:DyaK1cXmt1ansrs%3D Occurrence Handle9732450
SJ Freeland RD Knight LF Landweber (2000) ArticleTitleMeasuring adaptation within the genetic code Trends Biochem Sci 25 IssueID2 44–45 Occurrence Handle10.1016/S0968-0004(99)01531-5 Occurrence Handle1:CAS:528:DC%2BD3cXhsFejtr4%3D
DG George WC Barker LT Hunt (1990) ArticleTitleMutation data matrix and its uses Methods Enzymol 183 333–351 Occurrence Handle1:CAS:528:DyaK3MXivFGg Occurrence Handle2314281
R Grantham C Gautier M Gouy R Mercier A Pavè (1980) ArticleTitleCodon catalog usage and the genome hypothesis Nucleic Acid Res 8 r49–r62 Occurrence Handle1:CAS:528:DyaL3cXhtVKiurc%3D Occurrence Handle6986610
D Haig LD Hurst (1991) ArticleTitleA quantitative measure of error minimization in the genetic code J Mol Evol 33 IssueID5 412–417 Occurrence Handle1:CAS:528:DyaK3MXmsleqtrs%3D Occurrence Handle1960738
J Hey RM Kliman (2002) ArticleTitleInteractions between natural selection, recombination and gene density in the genes of Drosopila Genetics 160 595–608 Occurrence Handle1:CAS:528:DC%2BD38XitlWrs7w%3D Occurrence Handle11861564
T Ikemura (1981) ArticleTitleCorrelation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system J Mol Biol 151 389–409 Occurrence Handle10.1016/0022-2836(81)90003-6 Occurrence Handle1:CAS:528:DyaL38Xks1ygsA%3D%3D Occurrence Handle6175758
T Ikemura (1982) ArticleTitleCorrelation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes Differences in synonymous codon choice patterns of yeast and Escherichia coli with reference to the abundance of isoaccepting transfer RNAs J Mol Biol 158 573–597 Occurrence Handle10.1016/0022-2836(82)90250-9 Occurrence Handle1:CAS:528:DyaL38Xlt1Whur8%3D Occurrence Handle6750137
T Ikemura (1985) ArticleTitleCodon usage and tRNA content in unicellular and multicellular organisms Mol Biol Evol 2 13–34 Occurrence Handle1:CAS:528:DyaL2MXhvFyksbk%3D Occurrence Handle3916708
T Ikemura (1992) Correlation between codon usage and tRNA content in microorganisms DL Hatfield BJ Lee RM Pirtle (Eds) Transfer RNA in protein synthesis CRC Press Boca Raton 87–111
M Kimura (1983) The Neutral Theory of Natural Selection Cambridge University Press Cambridge
RD Knight SJ Freeland LF Landweber (1999) ArticleTitleSelection, history and chemistry: The three faces of the genetic code Trends Biochem Sci 24 IssueID6 241–247 Occurrence Handle10.1016/S0968-0004(99)01392-4 Occurrence Handle1:CAS:528:DyaK1MXks1ansLg%3D Occurrence Handle10366854
M Kreitman (1996) ArticleTitleThe neutral theory is dead Long live the neutral theory. BioEssays 18 678–682 Occurrence Handle1:CAS:528:DyaK28XlslyntLk%3D
G Marais D Mouchiroud L Duret (2001) ArticleTitleDoes recombination improve selection on codon usage? Lessons from nematode and fly complete genomes Proc Natl Acad Sci USA 98 5688–5692 Occurrence Handle10.1073/pnas.091427698 Occurrence Handle1:CAS:528:DC%2BD3MXjs1WnsLo%3D Occurrence Handle11320215
AD McLachlan (1971) ArticleTitleTests for comparing related amino-acid sequences cytochrome c and cytochrome c 551 J Mol Biol 61 409–424 Occurrence Handle10.1016/0022-2836(71)90390-1 Occurrence Handle1:CAS:528:DyaE38XhsF2k Occurrence Handle5167087
DT McPherson (1988) ArticleTitleCodon preference reflects mistranslational constraints: A proposal Nucleic Acid Res 16 4111–4120 Occurrence Handle1:CAS:528:DyaL1cXktleqsbg%3D Occurrence Handle3131744
GAT McVean J Vieira (2001) ArticleTitleInferring parameters of mutation, selection and demography from patterns of synonymous site evolution in Drosophila. Genetics 157 IssueID1 245–257 Occurrence Handle1:STN:280:DC%2BD3M7kslOqsw%3D%3D Occurrence Handle11139506
G Modiano G Battistuzzi AG Motulsky (1981) ArticleTitleNonrandom patterns of codon usage and of nucleotide substitutions in human alpha- and beta-globin genes: an evolutionary strategy reducing the rate of mutations with drastic effects? Proc Natl Acad Sci USA 78 1110–1114 Occurrence Handle1:CAS:528:DyaL3MXhsFCis7s%3D Occurrence Handle6940129
EN Moriyama JR Powell (1997) ArticleTitleCodon usage bias and tRNA abundance in Drosophila. J Mol Evol 45 514–523 Occurrence Handle1:CAS:528:DyaK2sXmvFSktbY%3D Occurrence Handle9342399
Nakamura Y, Gojobori T, Ikemura T (2000) Codon usage tabulated from the the international DNA sequence databases: Status for the year 2000. Nucleic Acids Res 28:292
M Nei T Gojobori (1986) ArticleTitleSimple methods for estimating the number of synonymous and non-synonymous nucleotide substitution Mol Biol Evol 3 418–426 Occurrence Handle1:CAS:528:DyaL28Xmt1aisbs%3D Occurrence Handle3444411
J Overington D Donnelly MS, Johnson et al. (1992) ArticleTitleEnvironment-specific amino-acid substitution tables—Tertiary templates and prediction of protein folds Protein Sci 1 IssueID2 216–226 Occurrence Handle1:CAS:528:DyaK38XksVeqtb4%3D Occurrence Handle1304904
LE Post GD Strycharz M Nomura H Lewis PP Dennis (1979) ArticleTitleNucleotide sequence of the ribosomal protein gene cluster adjacent to the gene for RNA polymerase subunit beta in Escherichia coli Proc Natl Acad Sci USA 76 1697–1701 Occurrence Handle1:CAS:528:DyaE1MXksFWhurg%3D Occurrence Handle377281
JR Powell EN Moriyama (1997) ArticleTitleEvolution of codon usage bias in Drosophila Proc Natl Acad Sci USA 94 7784–7790 Occurrence Handle10.1073/pnas.94.15.7784 Occurrence Handle1:CAS:528:DyaK2sXksl2nsbo%3D Occurrence Handle9223264
RP Riek MD Handschumacher SS Sung M Tan MJ Glynias MD Schluchter J Novotny RM Graham (1995) ArticleTitleEvolutionary conservation of both the hydrophilic and hydrophobic nature of transmembrane residues J Theor Biol 172 IssueID3 245–258 Occurrence Handle10.1006/jtbi.1995.0021 Occurrence Handle1:CAS:528:DyaK2MXkslGmsrk%3D Occurrence Handle7715195
JL Risler MO Delorme H Delacroix A Henaut (1988) ArticleTitleAmino acid substitutions in structurally related proteins A pattern recognition approach. Determination of a new and efficient scoring matrix. J Mol Biol 204 IssueID4 1019–1029 Occurrence Handle1:CAS:528:DyaL1MXptV2jtg%3D%3D
NG Smith LD Hurst (1997) ArticleTitleThe effect of tandem substitutions on the correlation between synonymous and non-synonymous rates in rodents Genetics 153 1395–1402
MD Topal JR Fresco (1976) ArticleTitleComplementary base pairing and the origin of substitution mutations Nature 263 285–289 Occurrence Handle1:CAS:528:DyaE2sXltFOisw%3D%3D Occurrence Handle958482
CR Woese (1965) ArticleTitleOn the evolution of the genetic code Proc Natl Acad Sci USA 54 1546–1552 Occurrence Handle1:CAS:528:DyaF28Xltlagsg%3D%3D Occurrence Handle5218910
F Wright (1990) ArticleTitleThe ‘effective number of codons’ used in a gene Gene 87 23–29 Occurrence Handle10.1016/0378-1119(90)90491-9 Occurrence Handle1:CAS:528:DyaK3cXktVWmsbo%3D Occurrence Handle2110097
Z Yang R Nielsen (2000) ArticleTitleEstimating synonymous and non-synonymous rate variation in nuclear genes of mammals J Mol Evol 46 409–418
Z Yang R Nielsen (1998) ArticleTitleSynonymous and non-synonymous substitution rates under realistic evolutionary models Mol Biol Evol 17 IssueID1 32–43
Acknowledgments
Nick Smith provided the substitution rates used by Smith and Hurst (1997).
Author information
Authors and Affiliations
Corresponding author
Additional information
Reviewing Editor: Dr. Massimo Di Giulio
Appendix
Appendix
Rights and permissions
About this article
Cite this article
Archetti, M. Selection on Codon Usage for Error Minimization at the Protein Level. J Mol Evol 59, 400–415 (2004). https://doi.org/10.1007/s00239-004-2634-7
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s00239-004-2634-7