Skip to main content
Log in

Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants

  • Published:
Plant Molecular Biology Aims and scope Submit manuscript

Abstract

Codon usage bias is a ubiquitous phenomenon, which may be caused by mutational bias, selection, or both. The patterns of codon usage in plants are not well understood. Datasets of expressed sequence tags (ESTs) available for many plant species provide the resources for large-scale comparative analysis of codon usage patterns. We developed a computational approach to translate EST or assembled contig sequences, and then used the coding information for comparative analysis of codon usage in 12 plant species, including 6 eudicots, 5 monocots and the green alga Chlamydomonas reinhardtii. While codon nucleotide composition is highly conserved within eudicots or monocots, there is a significant difference between these two major taxonomic groups of higher plants. The third nucleotide position of codons is AU-rich in the eudicot genomes (35–42% of G+C content), but GC-rich in the monocot genomes (59–61% of G+C content). To identify optimal codons in these species, we used EST counts to estimate gene transcript levels. It was demonstrated that codon usage bias is correlated positively with gene transcript levels. Interestingly, the use of optimal codons appears to be well conserved between eudicots and monocots, and to a lesser degree between the higher plants and C. reinhardtii. Most of the optimal codons end with a C or G base, regardless of the different nucleotide composition in these genomes. The results suggest that plant codon usage is affected by translational selection, and the selective pressure appears to be conserved in the plant kingdom.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • H Akashi (1994) ArticleTitleSynonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy Genetics 136 927–935 Occurrence Handle8005445 Occurrence Handle1:CAS:528:DyaK2MXpsFaq

    PubMed  CAS  Google Scholar 

  • Y Batard A Hehn S Nedelkina M Schalk K Pallett H Schaller D Werck-Reichhart (2000) ArticleTitleIncreasing expression of P450 and P450-reductase proteins from monocots in heterologous systems Arch Biochem Biophys 379 161–169 Occurrence Handle10864454 Occurrence Handle1:CAS:528:DC%2BD3cXktFehtrc%3D Occurrence Handle10.1006/abbi.2000.1867

    Article  PubMed  CAS  Google Scholar 

  • G Bernardi (2000) ArticleTitleIsochores and the evolutionary genomics of vertebrates Gene 241 3–17 Occurrence Handle10607893 Occurrence Handle1:CAS:528:DyaK1MXotVGksrw%3D Occurrence Handle10.1016/S0378-1119(99)00485-0

    Article  PubMed  CAS  Google Scholar 

  • M Bulmer (1991) ArticleTitleThe selection-mutation-drift theory of synonymous codon usage Genetics 129 897–907 Occurrence Handle1752426 Occurrence Handle1:CAS:528:DyaK38XhsVKhtL0%3D

    PubMed  CAS  Google Scholar 

  • C Burge S Karlin (1997) ArticleTitlePrediction of complete gene structures in human genomic DNA J Mol Biol 268 78–94 Occurrence Handle9149143 Occurrence Handle1:CAS:528:DyaK2sXjtlSqtL4%3D Occurrence Handle10.1006/jmbi.1997.0951

    Article  PubMed  CAS  Google Scholar 

  • N Carels G Bernardi (2000) ArticleTitleTwo classes of genes in plants Genetics 154 1819–1825 Occurrence Handle10747072 Occurrence Handle1:CAS:528:DC%2BD3cXjtVams7s%3D

    PubMed  CAS  Google Scholar 

  • CI Castillo-Davis DL Hartl (2002) ArticleTitleGenome evolution and developmental constraint in Caenorhabditis elegans Mol Biol Evol 19 728–735 Occurrence Handle11961106 Occurrence Handle1:CAS:528:DC%2BD38XjsFaku7o%3D

    PubMed  CAS  Google Scholar 

  • H Chiapello F Lisacek M Caboche A Henaut (1998) ArticleTitleCodon usage and gene function are related in sequences of Arabidopsis thaliana Gene 209 GC1–GC38 Occurrence Handle9583944 Occurrence Handle1:STN:280:DyaK1c3ksF2nsA%3D%3D Occurrence Handle10.1016/S0378-1119(97)00671-9

    Article  PubMed  CAS  Google Scholar 

  • H Chiapello E Ollivier C Landes-Devauchelle P Nitschke JL Risler (1999) ArticleTitleCodon usage as a tool to predict the cellular location of eukaryotic ribosomal proteins and aminoacyl-tRNA Nucleic Acids Res 27 2848–2851 Occurrence Handle10390524 Occurrence Handle1:CAS:528:DyaK1MXkvV2jur4%3D Occurrence Handle10.1093/nar/27.14.2848

    Article  PubMed  CAS  Google Scholar 

  • A Coghlan KH Wolfe (2000) ArticleTitleRelationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae Yeast 16 1131–1145 Occurrence Handle10953085 Occurrence Handle1:CAS:528:DC%2BD3cXmvFSitbY%3D Occurrence Handle10.1002/1097-0061(20000915)16:12<1131::AID-YEA609>3.0.CO;2-F

    Article  PubMed  CAS  Google Scholar 

  • JM Comeron M Kreitman M Aguade (1999) ArticleTitleNatural selection on synonymous sites is correlated with gene length and recombination in Drosophila Genetics 151 239–249 Occurrence Handle9872963 Occurrence Handle1:CAS:528:DyaK1MXovVCgsA%3D%3D

    PubMed  CAS  Google Scholar 

  • HJ Dong L Nilsson CG Kurland (1996) ArticleTitleCo-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates J Mol Biol 260 649–663 Occurrence Handle8709146 Occurrence Handle1:CAS:528:DyaK28XkvVCrsLs%3D Occurrence Handle10.1006/jmbi.1996.0428

    Article  PubMed  CAS  Google Scholar 

  • L Duret (2000) ArticleTitletRNA gene number and codon usage in the C. elegans genome are coadapted for optimal translation of highly expressed genes Trends Genet 16 287–289 Occurrence Handle10858656 Occurrence Handle1:CAS:528:DC%2BD3cXktlCnur0%3D Occurrence Handle10.1016/S0168-9525(00)02041-2

    Article  PubMed  CAS  Google Scholar 

  • L Duret (2002) ArticleTitleEvolution of synonymous codon usage in metazoans Curr Opin Genet Dev 12 640–649 Occurrence Handle12433576 Occurrence Handle1:CAS:528:DC%2BD38XosFOhsrs%3D Occurrence Handle10.1016/S0959-437X(02)00353-2

    Article  PubMed  CAS  Google Scholar 

  • L Duret D Mouchiroud (1999) ArticleTitleExpression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis Proc Natl Acad Sci USA 96 4482–4487 Occurrence Handle10200288 Occurrence Handle1:CAS:528:DyaK1MXjs1yls7Y%3D Occurrence Handle10.1073/pnas.96.8.4482

    Article  PubMed  CAS  Google Scholar 

  • MB Eisen PT Spellman PO Brown D Botstein (1998) ArticleTitleCluster analysis and display of genome-wide expression patterns Proc Natl Acad Sci USA 95 14863–14868 Occurrence Handle9843981 Occurrence Handle1:CAS:528:DyaK1cXotVGmurk%3D Occurrence Handle10.1073/pnas.95.25.14863

    Article  PubMed  CAS  Google Scholar 

  • RM Ewing AB Kahla O Poirot F Lopez S Audic JM Claverie (1999) ArticleTitleLarge-scale statistical analysis of rice ESTs reveal correlated patterns of gene expression Genome Res 9 950–959 Occurrence Handle10523523 Occurrence Handle1:CAS:528:DyaK1MXmvFWksbs%3D Occurrence Handle10.1101/gr.9.10.950

    Article  PubMed  CAS  Google Scholar 

  • J Elf D Nilsson T Tenson M Ehrenberg (2003) ArticleTitleSelective charging of tRNA isoacceptors explains patterns of codon usage Science 300 1718–1722 Occurrence Handle12805541 Occurrence Handle1:CAS:528:DC%2BD3sXksVKis7o%3D Occurrence Handle10.1126/science.1083811

    Article  PubMed  CAS  Google Scholar 

  • SL Fennoy J Bailey-Serres (1993) ArticleTitleSynonymous codon usage in Zea mays L. nuclear genes is varied by levels of C and G-ending codons Nucleic Acids Res. 21 5294–5300 Occurrence Handle8265340 Occurrence Handle1:CAS:528:DyaK2cXkslWmsw%3D%3D

    PubMed  CAS  Google Scholar 

  • MP Francino H Ochman (2001) ArticleTitleDeamination as the basis of strand asymmetric evolution in transcribed Escherichia coli sequences Mol Biol Evol 18 1147–1150 Occurrence Handle11371605 Occurrence Handle1:CAS:528:DC%2BD3MXktFemu7o%3D

    PubMed  CAS  Google Scholar 

  • S Franklin B Ngo E Efuet SP Mayfield (2002) ArticleTitleDevelopment of a GFP reporter gene for Chlamydomonas reinhardtii chloroplast Plant J 30 733–744 Occurrence Handle12061904 Occurrence Handle1:CAS:528:DC%2BD38Xls1Chtb0%3D Occurrence Handle10.1046/j.1365-313X.2002.01319.x

    Article  PubMed  CAS  Google Scholar 

  • RJ Grocock PM Sharp (2002) ArticleTitleSynonymous codon usage in Pseudomonas aeruginosa PA01 Gene 289 131–139 Occurrence Handle12036591 Occurrence Handle1:CAS:528:DC%2BD38XjvFGhsb8%3D Occurrence Handle10.1016/S0378-1119(02)00503-6

    Article  PubMed  CAS  Google Scholar 

  • T Ikemura (1981) ArticleTitleCorrelation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for synonymous codon choice that is optimal for the E. coli translational system J Mol Biol 151 389–409 Occurrence Handle6175758 Occurrence Handle1:CAS:528:DyaL38Xks1ygsA%3D%3D Occurrence Handle10.1016/0022-2836(81)90003-6

    Article  PubMed  CAS  Google Scholar 

  • T Ikemura (1982) ArticleTitleDifferences in synonymous codon choice patterns of yeast and correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes J Mol Biol 158 573–597 Occurrence Handle6750137 Occurrence Handle1:CAS:528:DyaL38Xlt1Whur8%3D Occurrence Handle10.1016/0022-2836(82)90250-9

    Article  PubMed  CAS  Google Scholar 

  • S Kanaya Y Yamada Y Kudo T Ikemura (1999) ArticleTitleStudies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis Gene 238 143–155 Occurrence Handle10570992 Occurrence Handle1:CAS:528:DyaK1MXntFKnu74%3D Occurrence Handle10.1016/S0378-1119(99)00225-5

    Article  PubMed  CAS  Google Scholar 

  • K Lin Y Kuang JS Joseph PR Kolatkar (2002) ArticleTitleConserved codon composition of ribosomal protein coding genes in Escherichia coli, Mycobacterium tuberculosis and Saccharomyces cerevisiae: lessons from supervised machine learning in functional genomics Nucleic Acids Res 30 2599–2607 Occurrence Handle12034849 Occurrence Handle1:CAS:528:DC%2BD38XkvV2hsLY%3D Occurrence Handle10.1093/nar/30.11.2599

    Article  PubMed  CAS  Google Scholar 

  • DJ Lynn GA Singer DA Hickey (2002) ArticleTitleSynonymous codon usage is subject to selection in thermophilic bacteria Nucleic Acids Res 30 4272–4277 Occurrence Handle12364606 Occurrence Handle1:CAS:528:DC%2BD38XnvVemurY%3D Occurrence Handle10.1093/nar/gkf546

    Article  PubMed  CAS  Google Scholar 

  • G Marais L Duret (2001) ArticleTitleSynonymous codon usage, accuracy of translation, and gene length in Caenorhabditis elegans J Mol Evol 52 275–280 Occurrence Handle11428464 Occurrence Handle1:CAS:528:DC%2BD3MXislOks70%3D

    PubMed  CAS  Google Scholar 

  • BC Meyers DK Lee TH Vu SS Tej SB Edberg M Matvienko LD Tindell (2004) ArticleTitle Arabidopsis MPSS. An online resource for quantitative expression analysis Plant Physiol 135 801–813 Occurrence Handle15173564 Occurrence Handle1:CAS:528:DC%2BD2cXltlKju74%3D Occurrence Handle10.1104/pp.104.039495

    Article  PubMed  CAS  Google Scholar 

  • EN Moriyama JR Powell (1997) ArticleTitleCodon usage bias and tRNA abundance in Drosophila J Mol Evol 45 514–523 Occurrence Handle9342399 Occurrence Handle1:CAS:528:DyaK2sXmvFSktbY%3D Occurrence Handle10.1007/PL00006256

    Article  PubMed  CAS  Google Scholar 

  • EN Moriyama JR Powell (1998) ArticleTitleGene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli Nucleic Acids Res 26 188–3193

    Google Scholar 

  • EE Murray J Lotzer M Eberle (1989) ArticleTitleCodon usage in plant genes Nucleic Acids Res 17 477–498 Occurrence Handle2644621 Occurrence Handle1:CAS:528:DyaL1MXht12ktb4%3D

    PubMed  CAS  Google Scholar 

  • H Naya H Romero N Carels A Zavala H Musto (2001) ArticleTitleTranslational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii FEBS Lett 501 127–130 Occurrence Handle11470270 Occurrence Handle1:CAS:528:DC%2BD3MXltlKqtb4%3D Occurrence Handle10.1016/S0014-5793(01)02644-8

    Article  PubMed  CAS  Google Scholar 

  • R Percudani S Ottonello (1999) ArticleTitleSelection at the wobble position of codons read by the same tRNA in Saccharomyces cerevisiae Mol Biol Evol 16 1752–1762 Occurrence Handle10605116 Occurrence Handle1:CAS:528:DyaK1MXnvF2nsrY%3D

    PubMed  CAS  Google Scholar 

  • FJ Perlak RL Fuchs DA Dean SL McPherson DA Fischhoff (1991) ArticleTitleModification of the coding sequence enhances plant expression of insect control protein genes Proc Natl Acad Sci USA 88 3324–3328 Occurrence Handle2014252 Occurrence Handle1:CAS:528:DyaK3MXktVaguro%3D Occurrence Handle10.1073/pnas.88.8.3324

    Article  PubMed  CAS  Google Scholar 

  • JR Powell EN Moriyama (1997) ArticleTitleEvolution of codon usage bias in Drosophila Proc Natl Acad Sci USA 94 7784–7790 Occurrence Handle9223264 Occurrence Handle1:CAS:528:DyaK2sXksl2nsbo%3D Occurrence Handle10.1073/pnas.94.15.7784

    Article  PubMed  CAS  Google Scholar 

  • J Quackenbush F Liang I Holt G Pertea J Upton (2000) ArticleTitleThe TIGR Gene Indices: reconstruction and representation of expressed gene sequences Nucleic Acids Res 28 141–145 Occurrence Handle10592205 Occurrence Handle1:CAS:528:DC%2BD3cXhvVKjsLw%3D Occurrence Handle10.1093/nar/28.1.141

    Article  PubMed  CAS  Google Scholar 

  • S Rogic AK Mackworth FB Ouellette (2001) ArticleTitleEvaluation of gene-finding programs on mammalian sequences Genome Res 11 817–832 Occurrence Handle11337477 Occurrence Handle1:CAS:528:DC%2BD3MXjs1Wmurc%3D Occurrence Handle10.1101/gr.147901

    Article  PubMed  CAS  Google Scholar 

  • G Rouwendal O Mendes E Wolbert W Boer Particlede (1997) ArticleTitleEnhanced expression in tobacco of the gene encoding green fluorescent protein by modification of its codon usage Plant Mol Biol 33 989–999 Occurrence Handle9154981 Occurrence Handle1:CAS:528:DyaK2sXjtlSmtbo%3D Occurrence Handle10.1023/A:1005740823703

    Article  PubMed  CAS  Google Scholar 

  • AA Salamov VV Solovyev (2000) ArticleTitle Ab initio gene finding in Drosophila genomic DNA Genome Res 10 516–522 Occurrence Handle10779491 Occurrence Handle1:CAS:528:DC%2BD3cXjtVKrs7Y%3D Occurrence Handle10.1101/gr.10.4.516

    Article  PubMed  CAS  Google Scholar 

  • PM Sharp WH Li (1986) ArticleTitleCodon usage in regulatory genes in Escherichia coli does not reflect selection for ‘rare’; codons Nucleic Acids Res 14 7737–7749 Occurrence Handle3534792 Occurrence Handle1:CAS:528:DyaL28XmtF2rurY%3D

    PubMed  CAS  Google Scholar 

  • M Stenico AT Lloyd PM Sharp (1994) ArticleTitleCodon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases Nucleic Acids Res 22 2437–2446 Occurrence Handle8041603 Occurrence Handle1:CAS:528:DyaK2cXlsFSht7s%3D

    PubMed  CAS  Google Scholar 

  • N Sueoka Y Kawanishi (2000) ArticleTitleDNA G+C content of the third codon position and codon usage biases of human genes Gene 261 53–62 Occurrence Handle11164037 Occurrence Handle1:CAS:528:DC%2BD3MXmvF2msQ%3D%3D Occurrence Handle10.1016/S0378-1119(00)00480-7

    Article  PubMed  CAS  Google Scholar 

  • InstitutionalAuthorNameThe Arabidopsis Genome Initiative (2000) ArticleTitleAnalysis of the genome sequence of the flowering plant Arabidopsis thaliana Nature 408 796–815 Occurrence Handle10.1038/35048692

    Article  Google Scholar 

  • AO Urrutia LD Hurst (2001) ArticleTitleCodon usage bias covaries with expression breadth and the rate of synonymous evolution in humans, but this not evidence for selection Genetics 159 1191–1199 Occurrence Handle11729162 Occurrence Handle1:CAS:528:DC%2BD38XjvVamtA%3D%3D

    PubMed  CAS  Google Scholar 

  • EB Vervoort A Ravestein Particlevan NN Peij Particlevan JC Heikoop PJ Haastert Particlevan GF Verheijden MH Linskens (2000) ArticleTitleOptimizing heterologous expression in Dictyostelium: importance of 5′ codon adaptation Nucleic Acids Res 28 2069–2074 Occurrence Handle10773074 Occurrence Handle1:CAS:528:DC%2BD3cXktFSrur0%3D Occurrence Handle10.1093/nar/28.10.2069

    Article  PubMed  CAS  Google Scholar 

  • GK Wong J Wang L Tao J Tan J Zhang DA Passey J Yu (2002) ArticleTitleCompositional gradients in Gramineae genes Genome Res 12 851–856 Occurrence Handle12045139 Occurrence Handle1:CAS:528:DC%2BD38Xks12hsr8%3D Occurrence Handle10.1101/gr.189102

    Article  PubMed  CAS  Google Scholar 

  • SI Wright CBK Yau M Looseley BC Meyers (2004) ArticleTitleEffects of gene expression on molecular evolution in Arabidopsis thaliana and Arabidopsis lyrata Mol Biol Evol 21 1719–1726 Occurrence Handle15201397 Occurrence Handle1:CAS:528:DC%2BD2cXntVCitLk%3D Occurrence Handle10.1093/molbev/msh191

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Liangjiang Wang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, L., Roossinck, M.J. Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants. Plant Mol Biol 61, 699–710 (2006). https://doi.org/10.1007/s11103-006-0041-8

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11103-006-0041-8

Keywords

Navigation