Journal of Molecular Evolution

, Volume 63, Issue 3, pp 393–400

Specific Selection Pressure at the Third Codon Positions: Contribution to 10- to 11-Base Periodicity in Prokaryotic Genomes

  • Amir B. Cohanim
  • Edward N. Trifonov
  • Yechezkel Kashi
Article

Abstract

Prokaryotic sequences are responsible for more than just protein coding. There are two 10- to 11-base periodical patterns superimposed on the protein coding message within the same sequence. Positional auto- and cross-correlation analysis of the sequences shows that these two patterns are a short-range counter-phase oscillation of AA and TT dinucleotides and a medium-range in-phase oscillation of the same dinucleotides, spanning distances of up to ∼30 and ∼100 bases, respectively. The short-range oscillation is encoded by the amino acid sequences themselves, apparently, due to the presence of amphipathic α-helices in the proteins. The medium-range oscillation, related to DNA folding in the cell, is created largely by a special choice of the bases in the third positions of the codons. Interestingly, the amino acid sequences do contribute to that signal as well. That is, the very amino acid sequences are, to some extent, degenerate to serve the same oscillating pattern that is associated with the degenerate third codon positions.

Keywords

Prokaryotic genomes DNA periodicity Dinucleotides Codon bias Codon usage Third codon positions Supercoiling 

References

  1. Andersson SG, Kurland CG (1990) Codon preferences in free-living microorganisms. Microbiol Rev 54:198–210PubMedGoogle Scholar
  2. Aota S, Ikemura T (1986) Diversity in G + C content at the third position of codons in vertebrate genes and its cause. Nucleic Acids Res 14:6345–6355PubMedGoogle Scholar
  3. Chou PY, Fasman GD (1978) Prediction of the secondary structure of proteins from their amino acid sequences. Adv Enzymol Relat Areas Mol Biol 47:45–148PubMedGoogle Scholar
  4. Cohanim AB, Kashi Y, Trifonov EN (2005) Yeast nucleosome DNA pattern: deconvolution from genome sequences of S. cerevisiae. J Biomol Str Dyn 22:687–694Google Scholar
  5. Crick FHC (1976) Linking numbers and nucleosomes. Proc Natl Acad Sci USA 73:2639–2643PubMedCrossRefGoogle Scholar
  6. D’Onofrio G, Mouchiroud D, Aissani B, Gautier C, Bernardi G (1991) Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins. J Mol Evol 32:504–510PubMedCrossRefGoogle Scholar
  7. Duret L, Mouchiroud D (1999) Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. Proc Natl Acad Sci USA 96:4482–4487PubMedCrossRefGoogle Scholar
  8. Duret L, Mouchiroud D (2000) Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol Biol Evol 17:68–74PubMedGoogle Scholar
  9. Engel DE, DeGrado WF (2004) Amino acid propensities are position-dependent throughout the length of α-helix. J Mol Biol 337:1195–1205PubMedCrossRefGoogle Scholar
  10. Garnier J, Osguthorpe DJ, Robson B (1978) Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J Mol Biol 120:95–120CrossRefGoogle Scholar
  11. Goldman E, Rosenberg AH, Zubay G, Studier FW (1995) Consecutive low-usage leucine codons block translation only when near the 5′ end of a message in Escherichia coli. J Mol Biol 245:467–473PubMedCrossRefGoogle Scholar
  12. Grantham R, Gautier C, Gouy M (1980) Codon frequencies in 119 individual genes confirm consistent choices of degenerate bases according to genome type. Nucleic Acids Res 8:1893–1912PubMedGoogle Scholar
  13. Guisez Y, Robbens J, Remaut E, Fiers W (1993) Folding of the MS2 coat protein in Escherichia coli is modulated by translational pauses resulting from mRNA secondary structure and codon usage: a hypothesis. J Theor Biol 162:243–252PubMedCrossRefGoogle Scholar
  14. Herzel H, Trifonov EN, Weiss O, Grosse I (1998a) Interpreting correlations in biosequences. Physica A 249:449–459CrossRefGoogle Scholar
  15. Herzel H, Weiss O, Trifonov EN (1998b) Sequence periodicity in complete genomes of Archaea suggests positive supercoiling. J Biomol Struct Dyn 16:341–345Google Scholar
  16. Herzel H, Weiss O, Trifonov EN (1999) 10–11 bp periodicities in complete genomes reflect protein structure and DNA folding. Bioinformatics 15(3):187–193PubMedCrossRefGoogle Scholar
  17. Hosid S, Trifonov EN, Bolshoy A (2004) Sequence periodicity of Escherichia coli is concentrated in intergenic regions. BMC Mol Biol 5(14):1–7Google Scholar
  18. Ikemura T (1985) Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol 2:13–34PubMedGoogle Scholar
  19. Kanehisa MI, Tsong TY (1980) Hydrophobicity and protein structure. Biopolymers 19:1617–1628PubMedCrossRefGoogle Scholar
  20. Komar A, Jaenicke R (1995) Kinetics of translation of gamma B crystallin and its circularly permuted variant in an in vitro cell-free system: possible relations to codon distribution and protein folding. FEBS Lett 376:195–198PubMedCrossRefGoogle Scholar
  21. Makhoul CH, Trifonov EN (2002) Distribution of rare triplets along mRNA and their relation to protein folding. J Biomol Struct Dyn 20:413–420PubMedGoogle Scholar
  22. Murray EE, Lotzer J, Eberle M (1989) Codon usage in plant genes. Nucleic Acids Res 17:477–498PubMedGoogle Scholar
  23. Pal L, Chakrabarti P, Basu G (2003) Sequence and structure patterns in proteins from an analysis of the shortest helices: implications for helix nucleation. J Mol Biol 326:273–291PubMedCrossRefGoogle Scholar
  24. Penel S, Morisson RG, Mortishire-Smith RJ, Doig AJ (1999) Periodicity in alpha-helix lengths and C-capping preferences. J Mol Biol 293:1211–1219PubMedCrossRefGoogle Scholar
  25. Rodriguez AC (2002) Studies of a positive supercoiling machine. J Biol Chem 277:29865–29873PubMedCrossRefGoogle Scholar
  26. Schieg P, Herzel H (2004) Periodicities of 10–11 bp as indicators of the supercoiled state of genomic DNA. J Mol Biol 343:891–901PubMedCrossRefGoogle Scholar
  27. Sharp PM, Li WH (1987) The Codon Adaptation Index—a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res 15:1281–1295PubMedGoogle Scholar
  28. Shields DC, Sharp PM, Higgins DG, Wright F (1988) “Silent” sites in Drosophila genes are not neutral: evidence of selection among synonymous codons. Mol Biol Evol 5:704–716PubMedGoogle Scholar
  29. Tolstorukov MY, Virnik KM, Adhya S, Zhurkin VB (2005) A-tract clusters may facilitate DNA packaging in bacterial nucleoid. Nucleic Acids Res 33:3907–3918PubMedCrossRefGoogle Scholar
  30. Trifonov EN (1987) Translation framing code and frame-monitoring mechanism as suggested by the analysis of mRNA and 16S rRNA nucleotide sequences. J Mol Biol 194:643–652PubMedCrossRefGoogle Scholar
  31. Trifonov EN, (1989) The multiple codes of nucleotide sequences. Bull Math Biol 51:417–432PubMedGoogle Scholar
  32. Trifonov EN (1997) Genetic sequences as product of compression by inclusive superposition of many codes. Mol Biol 31:759–767Google Scholar
  33. Vologodsky A (1992) Topology and physics of circular DNA.CRC Press, Boca Raton, FLGoogle Scholar

Copyright information

© Springer Science+Business Media, Inc. 2006

Authors and Affiliations

  • Amir B. Cohanim
    • 1
  • Edward N. Trifonov
    • 2
  • Yechezkel Kashi
    • 1
  1. 1.Department of Biotechnology and Food EngineeringHaifaIsrael
  2. 2.Genome Diversity Center, Institute of EvolutionUniversity of HaifaHaifaIsrael

Personalised recommendations