Abstract
Regions flanking the translation initiation site (TIS) are thought to play a crucial role in translation efficiency of mRNAs, but their exact sequence and evolution in eukaryotes are still a matter of debate. We investigated the context sequences in 20 nucleotides around the TIS in multi-cellular eukaryotes, with a focus on two model plants and a comparison to human. We identified consensus sequences aaaaaaa(A/G)(A/C)aAUGGcgaataata and ggcggc(g/c)(A/G)(A/C)(G/C)AUGGCggcggcgg for Arabidopsis thaliana and Oryza sativa, respectively. We observe strongly conserved G at position +4 and A or C at position −2; however, the exact nucleotide frequencies vary between the three organisms even at these conserved positions. The frequency of pyrimidines, which are considered sub optimum at position -3, is higher in both plants than in human. Arabidopsis is GC-depleted (AU-enriched) compared to both rice and human, and the enrichment is slightly stronger upstream than downstream of AUG. While both plants are similar though not identical in their variation of nucleotide frequencies, rice and human are more similar to each other than Arabidopsis and human. All three organisms display clear periodicity in A + G and C + U content when analyzing normalized frequencies. These findings suggest that, besides few highly conserved positions, overall structure of the context sequence plays a larger role in TIS recognition than the actual nucleotide frequencies.
Similar content being viewed by others
Abbreviations
- bp:
-
Base pair(s)
- TF:
-
Transcription factor
- TIS:
-
Translation initiation site
- TS:
-
Translation start
- UTR:
-
Untranslated region
- Organisms:
-
Arabidopsis thaliana (Arabidopsis);
Homo sapiens (human);
Oryza sativa (rice)
References
Kozak, M. (1986). Point mutations define a sequence flanking the ATG initiator codon that modulates translation by eukaryotic ribosomes. Cell, 44, 283–292.
Cavener, D. A. (1987). Comparison of the sequence flanking translational start sites in Drosophila and vertebrates. Nucleic Acid Research, 15, 1353–1361.
Joshi, C. P., Zhou, H., Huang, X., & Chiang, V. L. (1997). Context sequence of translation initiation codon in plants. Plant Molecular Biology, 35, 993–1001.
Kozak, M. (1987). An analysis of 5′-noncoding sequences form 699 vertebrate messenger RNAs. Nucleic Acid Research, 15, 8125–8148.
Joshi, C. P. (1987). An inspection of the domain between putative TATA box and translation start site in 79 plant genes. Nucleic Acid Research, 15, 6643–6653.
Kochetov, A. V., Ischenko, I. V., Vorobiev, D. G., Kel, A. E., Babenko, V. N., Kisselev, L. L., & Kolchanov, N. A. (1998). Eukaryotic mRNAs encoding abundant and scarce proteins are statistically dissimilar in many structural features. FEBS Letters, 440, 351–355.
Pesole, G., Gissi, C., Grillo, G., Licciulli, F., Liuni, S., & Saccone, C. (2000). Analysis of oligonucleotide AUG start codon context in eukaryotic mRNAs. Gene, 261, 85–91.
Kawaguchi, R., & Bailey-Serres, J. (2005). mRNA sequence features that contribute to translational regulation in Arabidopsis. Nucleic Acids Research, 33(3), 955–965.
Joshi, C. P., & Nguyen, H. T. (1995). 5′ Untranslated leader sequences of the eukaryotic mRNAs encoding heat shock induced proteins. Nucleic Acid Journal, 23, 541–549.
Kozak, M. (1991a). A short leader sequence impairs the fidelity of initiation by eukaryotic ribosomes. Gene Expression, 1, 111–115.
Kozak, M. (1991b). An analysis of vertebrate mRNA sequences: intimations of translational control. Journal of Cell Biology, 115, 887–903.
Pisarev, A. V., Kolupaeva, V. G., Pisareva, V. P., Merrick, W. C., Hellen, C. U. T., & Pestova, T. V. (2006). Specific functional interactions of nucleotides at key −3 and +4 positions flanking the initiation codon with components of the mammalian 48s translation initiation complex. Genes and Development, 20, 624–636.
Cavener, D. R., & Ray, S. C. (1991). Eukaryotic start and stop translation site. Nucleic Acid Research, 19, 3185–3192.
Fujimori, S., Washio, T., & Tomita, M. (2005). GC-compositional strand bias around transcription start sites in plants and fungi. BMC Genomics, 6, 26, doi: 10. 1186/1471-2164-6-26.
Cleaver, J. E., Karplus, K., Kashani-Sabet, M., & Limoli, C. (2001). Nucleotide excision repairs a legacy of creativity. Mutation Research, 485, 23–36.
Wong G. Ka-Shu., Wang, J., Tao, L., Tan, J., Zhang, J. G., Passey, D. A., & Yu, J. (2002). Compositional gradients in Gramineae genes. Genome Research, 12, 851–856.
Drummond, D. A., Bloom, J. D., Adami, C., Wilke, C. O., & Arnold, F. H. (2005). Why highly expressed proteins evolve slowly. Proceedings of National Academy of Sciences USA, 102(40), 14338–14343.
Acknowledgments
We sincerely thank the Curator of Genome Research TAIR (Arabidopsis) and TIGR (rice) for sequence information. We also thank the Department of Biotechnology, IIT Guwahati for providing access to computational facilities. C.V. acknowledges funding by the International Human Frontier Science Program.
Author information
Authors and Affiliations
Corresponding author
Additional information
L. Rangan and C. Vogel have contributed equally to this work.
Rights and permissions
About this article
Cite this article
Rangan, L., Vogel, C. & Srivastava, A. Analysis of Context Sequence Surrounding Translation Initiation Site from Complete Genome of Model Plants. Mol Biotechnol 39, 207–213 (2008). https://doi.org/10.1007/s12033-008-9036-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12033-008-9036-9