Skip to main content
Log in

A Survey of Codon and Amino Acid Frequency Bias in Microbial Genomes Focusing on Translational Efficiency

  • Published:
Journal of Molecular Evolution Aims and scope Submit manuscript

Abstract

Unequal use of synonymous codons has been found in several prokaryotic and eukaryotic genomes. This bias has been associated with translational efficiency. The prevalence of this bias across lineages is currently unknown. Here, a new method (GCB) to measure codon usage bias is presented. It uses an iterative approach for the determination of codon scores and allows the computation of an index of codon bias suitable for interspecies comparison. A server to calculate GCB-values of individual genes as well as a list of compiled results are available at www.g21.bio.uni-goettingen.de . The method was applied to complete bacterial genomes. The relation of codon usage bias with amino acid composition and the choice of stop codons were determined and discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4

Similar content being viewed by others

References

  1. FM Adamski KK McCaughan F Jørgensen CG Kurland WP Tate (1994) ArticleTitleThe concentration of polypeptide chain release factors 1 and 2 at different growth rates of Escherichia coli. J Mol Biol 238 302–308 Occurrence Handle10.1006/jmbi.1994.1293 Occurrence Handle1:CAS:528:DyaK2cXktVCrt7Y%3D Occurrence Handle8176726

    Article  CAS  PubMed  Google Scholar 

  2. H Akashi (2001) ArticleTitleGene expression and molecular evolution. Curr Op Gen Dev 11 660–666 Occurrence Handle10.1016/S0959-437X(00)00250-1 Occurrence Handle1:CAS:528:DC%2BD3MXnslajtL0%3D

    Article  CAS  Google Scholar 

  3. B Alberts D Bray J Lewis M Raff K Roberts JD Watson (1994) Molecular biology of the cell. Garland Publishing New York

    Google Scholar 

  4. SG Andersson CG Kurland (1990) ArticleTitleCodon preferences of free-living microorganisms. Microbiol Rev 54 198–210 Occurrence Handle1:CAS:528:DyaK3cXksFeiurk%3D Occurrence Handle2194095

    CAS  PubMed  Google Scholar 

  5. JL Bennetzen BD Hall (1982) ArticleTitleCodon selection in yeast. J Biol Chem 257 3026–3031 Occurrence Handle1:CAS:528:DyaL38XitFWkt7w%3D Occurrence Handle7037777

    CAS  PubMed  Google Scholar 

  6. A Coghlan KH Wolfe (2000) ArticleTitleRelationship of codon bias to mRNA concentration and protein length in Saccaromyces cerevisiae. Yeast 16 1131–1145 Occurrence Handle1:CAS:528:DC%2BD3cXmvFSitbY%3D Occurrence Handle10953085

    CAS  PubMed  Google Scholar 

  7. JM Comeron M Aguadé (1998) ArticleTitleAn evaluation of measures of synonymous codon usage bias. J Mol Evol 47 268–274

    Google Scholar 

  8. JF Curran M Yarus (1989) ArticleTitleRates of aa-tRNA selection at 29 sense codons in vivo. J Mol Biol 209 65–77 Occurrence Handle1:CAS:528:DyaK3cXjvFOn Occurrence Handle2478714

    CAS  PubMed  Google Scholar 

  9. JF Curran (1995) ArticleTitleDecoding with the A:I wobble pair is inefficient. Nucleic Acids Res 23 683–688 Occurrence Handle1:CAS:528:DyaK2MXksFKgtrg%3D Occurrence Handle7534909

    CAS  PubMed  Google Scholar 

  10. DE Draper (1996) Translation initiation. FC Neidhardt (Eds) Escherichia coli and Salmonella, second edition. ASM Press Washington D.C. 902–908

    Google Scholar 

  11. H Engelberg-Kulka R Schoulaker-Schwarz (1996) Supression of termination codons. Escherichia coli and Salmonella. FC Neidhardt (Eds) Escherichia coli and Salmonella, second edition. ASM Press Washington D.C. 909–921

    Google Scholar 

  12. MA Freire-Picos MI González-Siso E Rodríguez-Belmonte AM Rodríguez-Torres E Ramil ME Cerdán (1994) ArticleTitleCodon usage in Kluyveromyces lactis and in yeast cytochrome c-encoding genes. Gene 139 43–49 Occurrence Handle10.1016/0378-1119(94)90521-5 Occurrence Handle1:CAS:528:DyaK2cXis1aks7c%3D Occurrence Handle8112587

    Article  CAS  PubMed  Google Scholar 

  13. DV Freistroffer M Kwiatkowski RH Buckingham M Ehrenberg (2000) ArticleTitleThe accuracy of codon recognition by polypeptide release factors. Proc Natl Acad Sci USA 97 2046–2051 Occurrence Handle1:CAS:528:DC%2BD3cXhslKhsLY%3D Occurrence Handle10681447

    CAS  PubMed  Google Scholar 

  14. M Gouy C Gautier (1982) ArticleTitleCodon usage in bacteria: Correlation with gene expressivity. Nucleic Acids Res 10 7055–7074 Occurrence Handle1:CAS:528:DyaL3sXivVOquw%3D%3D Occurrence Handle6760125

    CAS  PubMed  Google Scholar 

  15. R Grantham C Gautier M Gouy M Jacobzone R Mercier (1981) ArticleTitleCodon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res 9 r43–r74 Occurrence Handle7208352

    PubMed  Google Scholar 

  16. H Grosjean W Fiers (1982) ArticleTitlePreferential codon usage in prokaryotic genes: The optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. Gene 18 199–209 Occurrence Handle10.1016/0378-1119(82)90157-3 Occurrence Handle1:CAS:528:DyaL38XmtV2ntr0%3D Occurrence Handle6751939

    Article  CAS  PubMed  Google Scholar 

  17. G Gutiérrez L Marquez A Marín (1996) ArticleTitlePreference for guanosine at first codon position in highly expressed Escherichia coli genes. A relationship with translational efficiency. Nucleic Acids Res 24 2525–2527 Occurrence Handle10.1093/nar/24.13.2525 Occurrence Handle8692691

    Article  PubMed  Google Scholar 

  18. FC Holstege EG Jennings JJ Wyrick TI Lee CJ Hengartner MR Green et al. (1998) ArticleTitleDissecting the regulatory circuitry of a eukaryotic genome. Cell 95 717–728 Occurrence Handle1:CAS:528:DyaK1cXnvFynurk%3D Occurrence Handle9845373

    CAS  PubMed  Google Scholar 

  19. T Ikemura (1981) ArticleTitleCorrelation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes. J Mol Biol 146 1–21 Occurrence Handle1:CAS:528:DyaL3MXitVGhsbY%3D Occurrence Handle6167728

    CAS  PubMed  Google Scholar 

  20. S Kanaya Y Yamada Y Kudo T Ikemura (1999) ArticleTitleStudies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: Gene expression level and species-specific diversity of codon usage based on multivariate analysis. Gene 238 143–155 Occurrence Handle10.1016/S0378-1119(99)00225-5 Occurrence Handle1:CAS:528:DyaK1MXntFKnu74%3D Occurrence Handle10570992

    Article  CAS  PubMed  Google Scholar 

  21. S Karlin SF Altschul (1990) ArticleTitleMethods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci USA 87 2264–2268 Occurrence Handle1:CAS:528:DyaK3cXhslGhsbo%3D Occurrence Handle2315319

    CAS  PubMed  Google Scholar 

  22. S Karlin J Mrázek AM Campbell (1998) ArticleTitleCodon usages in different gene classes of the Escherichia coli genome. Mol Microbiology 29 1341–1355 Occurrence Handle10.1046/j.1365-2958.1998.01008.x Occurrence Handle1:CAS:528:DyaK1cXmsVKisLc%3D

    Article  CAS  Google Scholar 

  23. S Karlin J Mrázek (2000) ArticleTitlePredicted highly expressed genes of diverse prokaryotic genomes. J Bact 182 5238–5250 Occurrence Handle1:CAS:528:DC%2BD3cXmsVOqu7k%3D Occurrence Handle10960111

    CAS  PubMed  Google Scholar 

  24. B Lafay JC Atherto PM Sharp (2000) ArticleTitleAbsence of translationally selected synonymous codon usage bias in Heliobacter pylori. Microbiology 146 851–860 Occurrence Handle1:CAS:528:DC%2BD3cXislOjs7c%3D Occurrence Handle10784043

    CAS  PubMed  Google Scholar 

  25. HW Mewes D Frishman C Gruber B Geier D Haase A Kaps et al. (2000) ArticleTitleMIPS: A database for genomes and protein sequences. Nucleic Acids Res 28 37–40 Occurrence Handle10.1093/nar/28.1.37 Occurrence Handle1:CAS:528:DC%2BD3cXhvVGqu70%3D Occurrence Handle10592176

    Article  CAS  PubMed  Google Scholar 

  26. BR Morton (1994) ArticleTitleCodon usage and the rate of divergence of land plant chloroplast genes. Mol Biol Evol 11 231–238 Occurrence Handle1:CAS:528:DyaK2cXivFWis7c%3D Occurrence Handle8170364

    CAS  PubMed  Google Scholar 

  27. I Moszer EPC Rocha A Danchin (1999) ArticleTitleCodon usage and lateral gene transfer in Bacillus subtilis. Curr Opin Micro 2 524–528 Occurrence Handle10.1016/S1369-5274(99)00011-9 Occurrence Handle1:CAS:528:DyaK1MXntlCku7o%3D

    Article  CAS  Google Scholar 

  28. JA Novembre (2002) ArticleTitleAccounting for background nucleotide composition when measuring codon usage bias. Mol Biol Evol 19 1390–1394 Occurrence Handle1:CAS:528:DC%2BD38XmtF2rs7o%3D Occurrence Handle12140252

    CAS  PubMed  Google Scholar 

  29. A Pan C Dutta J Das (1998) ArticleTitleCodon usage in highly expressed genes of Haemophilus influenzae and Mycobacterium tuberculosis: Translational selection versus mutational bias. Gene 215 405–413 Occurrence Handle1:CAS:528:DyaK1cXlvFSgsrg%3D

    CAS  Google Scholar 

  30. MY Pavlov DV Freistroffer V Dincbas J MacDougall RH Buckingham M Ehrenberg (1998) ArticleTitleA direct estimation of the context effect on the efficiency of termination. J Mol Biol 286 579–590 Occurrence Handle10.1006/jmbi.1998.2220

    Article  Google Scholar 

  31. S Pedersen (1984) ArticleTitle Escherichia coli ribosomes translate in vivo with variable rate. EMBO J 3 2895–2898 Occurrence Handle1:CAS:528:DyaL2MXktVyisQ%3D%3D Occurrence Handle6396082

    CAS  PubMed  Google Scholar 

  32. EP Rocha A Danchin A Viari (1999) ArticleTitleTranslation in Bacillus subtilis: Roles and trends of initiation and termination, insights from a genome analysis. Nucleic Acids Res 27 3567–3576 Occurrence Handle10.1093/nar/27.17.3567 Occurrence Handle1:CAS:528:DyaK1MXmtFCnurw%3D Occurrence Handle10446248

    Article  CAS  PubMed  Google Scholar 

  33. PM Sharp TMF Tuohy KR Mosurski (1986) ArticleTitleCodon usage in yeast: Cluster analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Res 14 5125–5143 Occurrence Handle1:CAS:528:DyaL28Xlt1yksbc%3D Occurrence Handle3526280

    CAS  PubMed  Google Scholar 

  34. PM Sharp WH Li (1987) ArticleTitleThe codon adaptation index—A measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res 15 1281–1295 Occurrence Handle1:CAS:528:DyaL2sXhtlejtrc%3D Occurrence Handle3547335

    CAS  PubMed  Google Scholar 

  35. MA Sørensen CG Kurland S Pedersen (1989) ArticleTitleCodon usage determines translation rate in Escherichia coli. J Mol Biol 207 365–377 Occurrence Handle2474074

    PubMed  Google Scholar 

  36. C Spearman (1904) ArticleTitleThe proof and measurement of association between two things. Am J Psych 15 72–101

    Google Scholar 

  37. AO Urrutia LD Hurst (2001) ArticleTitleCodon usage bias covaries with expression breadth and the rate of synonymous evolution in humans, but this is not evidence for selection. Genetics 159 1191–1199 Occurrence Handle1:CAS:528:DC%2BD38XjvVamtA%3D%3D Occurrence Handle11729162

    CAS  PubMed  Google Scholar 

  38. F Wright (1990) ArticleTitleThe “effective number of codons” used in a gene. Gene 87 23–29

    Google Scholar 

Download references

Acknowledgements

This work was supported by a grant of the Bundesministerium für Bildung und Forschung. I would like to thank W. Kramer and the referees for their assistance in preparing the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rainer Merkl.

Appendix: Genomic Data Sets (Accession numbers in brackets)

Appendix: Genomic Data Sets (Accession numbers in brackets)

Archaea

Archaeoglobus fulgidus (NC000917), Halobacterium sp. NRC-1 (NC002607), Methanobacterium thermoautotrophicum (NC000916), Methanococcus jannaschii (NC000909), Methanosarcina mazei (AE008384), Pyrococcus abyssi (NC000868), Pyrococcus horikoshii (NC000961), Sulfolobus solfataricus (NC002754), Sulfolobus tokodaii (NC003106), Thermoplasma acidophilum (NC002578), Thermoplasma volcanium (NC002689).

Bacteria

Aquifex aeolicus (NC000918), Bacillus halodurans (NC002570), Bacillus subtilis (NC000964), Brucella melitensis (Chromosome 2, AE008917), Borrelia burgdorferi (NC001318), Buchnera sp. APS (NC002528), Campylobacter jejuni (NC002163), Chlamydia muridarum (NC002620), Chlamydia trachomatis (NC000117), Chlamydophila pneumoniae J138 (NC002491), Chlamydophila pneumoniae AR39 (NC002179), Deinococcus radiodurans (Chromosome 1, NC001263), Escherichia coli K-12 (NC000913), Escherichia coli O157:H7 EDL933 (NC002655), Haemophilus influenzae Rd (NC000907), Helicobacter pylori 26695 (NC000915), Helicobacter pylori J99 (NC000921), Lactococcus lactis subsp. lactis (NC002662), Mesorhizobium loti (NC002678), Mycobacterium tuberculosis (NC000962), Mycoplasma genitalium (NC000908), Mycobacterium leprae strain TN (NC002677), Mycoplasma pneumoniae (NC000912), Mycoplasma pulmonis (NC002771), Mycobacterium tuberculosis CDC1551 (CD 002755), Neisseria meningitidis Z2491 (NC003116), Pseudomonas aeruginosa (NC002516), Rickettsia prowazekii (NC000963), Staphylococcus aureus subsp. aureus N315 (NC002745), Staphylococcus aureus strain Mu50 (NC002758), Streptococcus pneumoniae R6 (NC003098), Streptococcus pneumoniae TIGR4 (NC003038), Streptococcus pyogenes (NC002737), Synechocystis sp. PCC6803 (NC000911), Thermotoga maritima (NC000853), Treponema pallidum (NC000919), Ureaplasma urealyticum (NC002162), Vibrio cholerae (NC002505, NC002506), Yersinia pestis (NC003143). The following data sets contained no annotation and could not be analyzed: All data sets of Agrobacterium tumefaciens, Pasteurella multocida (NC002663), Xylella fastidiosa (NC002488).

Eukaryota

Saccharomyces cerevisiae (a data set prepared by MIPS, which contains 6449 entries, Mewes et al. 2000).

mRNA Concentration Data

Data of Holstege et al. (1998) were obtained from http://www.wi.mit.edu/young/expression.html .

Rights and permissions

Reprints and permissions

About this article

Cite this article

Merkl, R. A Survey of Codon and Amino Acid Frequency Bias in Microbial Genomes Focusing on Translational Efficiency . J Mol Evol 57, 453–466 (2003). https://doi.org/10.1007/s00239-003-2499-1

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00239-003-2499-1

Keywords

Navigation