Plant Molecular Biology, Volume 61, Issue 4–5, pp 699–710

Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants



Codon usage bias is a ubiquitous phenomenon that may be caused by mutational bias, selection, or both. The patterns of codon usage in plants are not well understood. Datasets of expressed sequence tags (ESTs), available for many plant species, provide the resources for large-scale comparative analysis of codon usage patterns. We developed a computational approach to translate EST or assembled contig sequences, and then used the coding information for comparative analysis of codon usage in 12 plant species, including 6 eudicots, 5 monocots and the green alga Chlamydomonas reinhardtii. While codon nucleotide composition is highly conserved within eudicots and within monocots, there is a significant difference between these two major taxonomic groups of higher plants. The third nucleotide position of codons is AU-rich in the eudicot genomes (35–42% G+C content), but GC-rich in the monocot genomes (59–61% G+C content). To identify optimal codons in these species, we used EST counts to estimate gene transcript levels. Codon usage bias was positively correlated with gene transcript levels. Interestingly, the use of optimal codons appears to be well conserved between eudicots and monocots, and to a lesser degree between the higher plants and C. reinhardtii. Most of the optimal codons end with a C or G base, regardless of the differing nucleotide composition of these genomes. The results suggest that plant codon usage is affected by translational selection, and that this selective pressure appears to be conserved in the plant kingdom.
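The two quantities the abstract relies on, G+C content at the third codon position and codons preferred in highly expressed genes, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names (`gc3`, `relative_usage`, `candidate_optimal_codons`) are hypothetical, the input is assumed to be in-frame coding sequences already split into high- and low-expression sets (for example by EST count), and the "higher relative usage" rule is a simplification that stands in for whatever statistical test the paper applied.

```python
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def codons(cds):
    """Split an in-frame coding sequence into complete codons (DNA alphabet)."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # drop a trailing partial codon
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def gc3(cds):
    """Fraction of codons whose third base is G or C."""
    cs = codons(cds)
    if not cs:
        return 0.0
    return sum(c[2] in "GC" for c in cs) / len(cs)


def relative_usage(seqs):
    """Frequency of each codon within its synonymous family, pooled over seqs."""
    counts = Counter(c for s in seqs for c in codons(s) if c in CODON_TABLE)
    family_totals = Counter()
    for codon, n in counts.items():
        family_totals[CODON_TABLE[codon]] += n
    return {c: n / family_totals[CODON_TABLE[c]] for c, n in counts.items()}


def candidate_optimal_codons(high_expr, low_expr):
    """Codons used relatively more often in the high-expression set.

    Assumption: a plain comparison of relative usage, with no significance
    test. Met, Trp and stop codons are skipped because they offer no
    synonymous choice relevant here.
    """
    hi = relative_usage(high_expr)
    lo = relative_usage(low_expr)
    return sorted(
        c for c in hi
        if CODON_TABLE[c] not in "MW*" and hi[c] > lo.get(c, 0.0)
    )
```

For instance, `gc3("ATGGCCAAATTT")` evaluates to 0.5, since two of the four codons (ATG, GCC) end in G or C. With real data, a significance test per codon would be needed before calling a codon optimal.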


Keywords: Codon usage; Expressed sequence tags; Gene expression; Optimal codons; Translational selection





Copyright information

© Springer Science+Business Media B.V. 2006

Authors and Affiliations

  1. Bioinformatics Center, Division of Biology, Kansas State University, Manhattan, USA
  2. Plant Biology Division, The Samuel Roberts Noble Foundation, Ardmore, USA
