A relationship between GC content and coding-sequence length
- 386 Downloads
Since base composition of translational stop codons (TAG, TAA, and TGA) is biased toward a low G+C content, a differential density for these termination signals is expected in random DNA sequences of different base compositions. The expected length of reading frames (DNA segments of sense codons flanked by in-phase stop codons) in random sequences is thus a function of GC content. The analysis of DNA sequences from several genome databases stratified according to GC content reveals that the longest coding sequences—exons in vertebrates and genes in prokaryotes—are GC-rich, while the shortest ones are GC-poor. Exon lengthening in GC-rich vertebrate regions does not result, however, in longer vertebrate proteins, perhaps because of the lower number of exons in the genes located in these regions. The effects on coding-sequence lengths constitute a new evolutionary meaning for compositional variations in DNA GC content.
Key wordsBase composition Stop-codon density Coding-sequence length Compositional heterogeneity
Unable to display preview. Download preview PDF.
- Holland SK, Blake CCF (1990) Proteins, exons, and molecular evolution. In: Stone EM, Schwartz RJ (eds) Intervening sequences in evolution and development. Oxford University Press, New York, p 32Google Scholar
- Nomura M, Sor F, Yamagishi M, Lawson M (1987) Heterogeneity of GC content within a single bacterial genome and its implications for evolution. Cold Spring Harb Symp Quant Biol 52:658–663Google Scholar
- Sharp PM, Burgess CJ, Lloyd AT, Mitchell KJ (1992) Selective use of termination codons and variations in codon choice. In: Hatfield DL, Lee BL Pirtle RM (eds) Transfer RNA in protein synthesis. CRC Press, Boca Raton, pp 398–425Google Scholar
- Stoehr PJ, Cameron ON (1991) The EMBL data library. Nucleic Acids Res (Suppl) 19:2227–2230Google Scholar
- Stoltzfus A, Spencer DF, Zuker M, Logsdon JM, Doolittle WF (1995) Introns and the origin of protein-coding genes (response). Science 268:1367–1369Google Scholar