Journal of Molecular Evolution

, Volume 64, Issue 2, pp 248–260 | Cite as

Looking for Organization Patterns of Highly Expressed Genes: Purine-Pyrimidine Composition of Precursor mRNAs

  • A. Paz
  • D. Mester
  • E. Nevo
  • A. KorolEmail author


We analyzed precursor messenger RNAs (pre-mRNAs) of 12 eukaryotic species. In each species, three groups of highly expressed genes, ribosomal proteins, heat shock proteins, and amino-acyl tRNA synthetases, were compared with a control group (randomly selected genes). The purine-pyrimidine (R-Y) composition of pre-mRNAs of the three targeted gene groups proved to differ significantly from the control. The exons of the three groups tested have higher purine contents and R-tract abundance and lower abundance of Y-tracts compared to the control (R-tract—tract of sequential purines with R n ≥ 5; Y-tract—tract of sequential pyrimidines with Y n ≥ 5). In species widely employing “intron definition” in the splicing process, the Y content of introns of the three targeted groups appeared to be higher compared to the control group. Furthermore, in all examined species, the introns of the targeted genes have a lower abundance of R-tracts compared to the control. We hypothesized that the R-Y composition of the targeted gene groups contributes to high rate and efficiency of both splicing and translation, in addition to the mRNA coding role. This is presumably achieved by (1) reducing the possibility of the formation of secondary structures in the mRNA, (2) using the R-tracts and R-biased sequences as exonic splicing enhancers, (3) lowering the amount of targets for pyrimidine tract binding protein in the exons, and (4) reducing the amount of target sequences for binding of serine/arginine-rich (SR) proteins in the introns, thereby allowing SR proteins to bind to proper (exonic) targets.


Purine tracts Pyrimidine tracts Splicing efficiency Highly expressed genes Intron definition Exon definition 



We thank Ed Trifonov, Zakharia Frenkel, Axel Meyer, and anonymous reviewers for their helpful comments and suggestions. This work was supported by the Israeli Ministry of Absorption and by the Ancell-Teicher Research Foundation for Molecular Genetics and Evolution. A.P. was supported by a scholarship in bioinformatics from the Eshkol Foundation of the Israeli Ministry of Science and Technology.


  1. Amir-Ahmady B, Boutz PL, Markovtsov V, Phillips ML, Black DL (2005) Exon repression by polypyrimidine tract binding protein. RNA 11:699–716PubMedCrossRefGoogle Scholar
  2. Andolfatto P (2005) Adaptive evolution of non-coding DNA in Drosophila. Nature 437:1149–1152PubMedCrossRefGoogle Scholar
  3. Arava Y, Wang Y, Storey JD, Liu CL, Brown PO, Herschlag D (2003) Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae. Proc Natl Acad Sci USA 100:3889–3894PubMedCrossRefGoogle Scholar
  4. Bec G, Kerjan P, Waller JP (1994) Reconstitution in vitro of the valyl-tRNA synthetase-elongation factor (EF) 1 beta gamma delta complex. Essential roles of the NH2-terminal extension of valyl-tRNA synthetase and of the EF-1 delta subunit in complex formation. J Biol Chem 269:2086–2092PubMedGoogle Scholar
  5. Bell SJ, Forsdyke DR (1999) Deviations from Chargaff’s second parity rule correlate with direction of transcription. J Theor Biol 197:63–76PubMedCrossRefGoogle Scholar
  6. Berget SM (1995) Exon recognition in vertebrate splicing. J Biol Chem 270:2411–2414PubMedGoogle Scholar
  7. Black DL (2003) Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem 72:291–336PubMedCrossRefGoogle Scholar
  8. Bourgeois CF, Lejeune F, Stevenin J (2004) Broad specificity of SR (serine/arginine) proteins in the regulation of alternative splicing of pre-messenger RNA. Prog Nucleic Acid Res Mol Biol 78:37–88PubMedCrossRefGoogle Scholar
  9. Caplan AJ, Cyr DM, Douglas MG (1993) Eukaryotic homologues of Escherichia coli dnaJ: a diverse protein family that functions with hsp70 stress proteins. Mol Biol Cell 4:555–563PubMedGoogle Scholar
  10. Caputi M, Zahler AM (2001) Determination of the RNA binding specificity of the heterogeneous nuclear ribonucleoprotein (hnRNP) H/H’/F/2H9 family. J Biol Chem 276:43850–43859PubMedCrossRefGoogle Scholar
  11. Cazalla D, Newton K, Caceres JF (2005) A novel SR-related protein is required for the second step of Pre-mRNA splicing. Mol Cell Biol 25:2969–2980PubMedCrossRefGoogle Scholar
  12. Collins L, Penny D (2006) Investigating the intron recognition mechanism in eukaryotes. Mol Biol Evol 23:901–910PubMedCrossRefGoogle Scholar
  13. Coolidge CJ, Seely RJ, Patton JG (1997) Functional analysis of the polypyrimidine tract in pre-mRNA splicing. Nucleic Acids Res 25:888–896PubMedCrossRefGoogle Scholar
  14. Dauksaite V, Akusjarvi G (2002) Human splicing factor ASF/SF2 encodes for a repressor domain required for its inhibitory activity on pre-mRNA splicing. J Biol Chem 277:12579–12586PubMedCrossRefGoogle Scholar
  15. Fewell SW, Travers KJ, Weissman JS, Brodsky JL (2001) The action of molecular chaperones in the early secretory pathway. Annu Rev Genet 35:149–191PubMedCrossRefGoogle Scholar
  16. Forsdyke DR (1999) Heat shock proteins as mediators of aggregation-induced ‘danger’ signals: implications of the slow evolutionary fine-tuning of sequences for the antigenicity of cancer cells. Cell Stress Chaperones 4:205–210PubMedGoogle Scholar
  17. Frydman J (2001) Folding of newly translated proteins in vivo: the role of molecular chaperones. Annu Rev Biochem 70:603–647PubMedCrossRefGoogle Scholar
  18. Fujimori S, Washio T, Tomita M (2005) GC-compositional strand bias around transcription start sites in plants and fungi. BMC Genomics 6:26PubMedCrossRefGoogle Scholar
  19. Hartl FU, Hayer-Hartl M (2002) Molecular chaperones in the cytosol: from nascent chain to folded protein. Science 295:1852–1858PubMedCrossRefGoogle Scholar
  20. Herruer MH, Mager WH, Woudt LP, Nieuwint RT, Wassenaar GM, Groeneveld P, Planta RJ (1987) Transcriptional control of yeast ribosomal protein synthesis during carbon-source upshift. Nucleic Acids Res 15:10133–10144PubMedCrossRefGoogle Scholar
  21. Hong SW, Vierling E (2000) Mutants of Arabidopsis thaliana defective in the acquisition of tolerance to high temperature stress. Proc Natl Acad Sci USA 97:4392–4397PubMedCrossRefGoogle Scholar
  22. Hsiao LL, Dangond F, Yoshida T, Hong R, Jensen RV, Misra J, Dillon W, Lee KF, Clark KE, Haverty P, Weng Z, Mutter GL, Frosch MP, Macdonald ME, Milford EL, Crum CP, Bueno R, Pratt RE, Mahadevappa M, Warrington JA, Stephanopoulos G, Stephanopoulos G, Gullans SR (2001) A compendium of gene expression in normal human tissues. Physiol Genomics 7:97–104PubMedGoogle Scholar
  23. Ibrahim el C, Schaal TD, Hertel KJ, Reed R, Maniatis T (2005) Serine/arginine-rich protein-dependent suppression of exon skipping by exonic splicing enhancers. Proc Natl Acad Sci USA 102:5002–5007PubMedCrossRefGoogle Scholar
  24. Kanopka A, Muhlemann O, Akusjarvi G (1996) Inhibition by SR proteins of splicing of a regulated adenovirus pre-mRNA. Nature 381:535–538PubMedCrossRefGoogle Scholar
  25. Karkas JD, Rudner R, Chargaff E (1968) Separation of B. subtilis DNA into complementary strands. II. Template functions and composition as determined by transcription with RNA polymerase. Proc Natl Acad Sci USA 60:915–920PubMedCrossRefGoogle Scholar
  26. Kozak M, (2005) Regulation of translation via mRNA structure in prokaryotes and eukaryotes. Gene 361:13–37PubMedCrossRefGoogle Scholar
  27. Lao PJ, Forsdyke DR (2000) Thermophilic bacteria strictly obey Szybalski’s transcription direction rule and politely purine-load RNAs with both adenine and guanine. Genome Res 10:228–236PubMedCrossRefGoogle Scholar
  28. Lee SW, Cho BH, Park SG, Kim S (2004) Aminoacyl-tRNA synthetase complexes: beyond translation. J Cell Sci 117:3725–3734PubMedCrossRefGoogle Scholar
  29. Le Hir H, Moore MJ, Maquat LE (2000) Pre-mRNA splicing alters mRNP composition: evidence for stable association of proteins at exon-exon junctions. Genes Dev 14:1098–1108PubMedGoogle Scholar
  30. Liu HX, Zhang M, Krainer AR (1998) Identification of functional exonic splicing enhancer motifs recognized by individual SR proteins. Genes Dev 12:1998–2012PubMedGoogle Scholar
  31. Lobry JR, Chessel D (2003) Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria. J Appl Genet 44:235–261PubMedGoogle Scholar
  32. Marygold SJ, Coelho CM, Leevers SJ (2005) Genetic analysis of RpL38 and RpL5, two minute genes located in the centric heterochromatin of chromosome 2 of Drosophila melanogaster. Genetics 169:683–695PubMedCrossRefGoogle Scholar
  33. McDonald J, Kreitman M (1991) Adaptive protein evolution at the Adh locus in Drosophila. Nature 351:652–654PubMedCrossRefGoogle Scholar
  34. Meroueh M, Chow CS (1999) Thermodynamics of RNA hairpins containing single internal mismatches. Nucleic Acids Res 27:1118–1125PubMedCrossRefGoogle Scholar
  35. Meyuhas O (2000) Synthesis of the translational apparatus is regulated at the translational level. Eur J Biochem 267:6321–6330PubMedCrossRefGoogle Scholar
  36. Negrutskii BS, Shalak VF, Kerjan P, El’skaya AV, Mirande M (1999) Functional interaction of mammalian valyl-tRNA synthetase with elongation factor EF-1alpha in the complex with EF-1H. J Biol Chem 274:4545–4550PubMedCrossRefGoogle Scholar
  37. Oberstrass FC, Auweter SD, Erat M, Hargous Y, Henning A, Wenter P, Reymond L, Amir-Ahmady B, Pitsch S, Black DL, Allain FH (2005) Structure of PTB bound to RNA: specific binding and implications for splicing regulation. Science 309:2054–2057PubMedCrossRefGoogle Scholar
  38. Park SG, Ewalt KL, Kim S (2005) Functional expansion of aminoacyl-tRNA synthetases and their interacting factors: new perspectives on housekeepers. Trends Biochem Sci 30:569–574PubMedCrossRefGoogle Scholar
  39. Paz A, Mester D, Baca I, Nevo E, Korol A (2004) Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. Proc Natl Acad Sci USA 101:2951–2956PubMedCrossRefGoogle Scholar
  40. Paz A, Kirzhner V, Nevo E, Korol A (2005) Coevolution of DNA-interacting proteins and genome “dialect.” Mol Biol Evol 23:56–64PubMedCrossRefGoogle Scholar
  41. Perry RP (2005) The architecture of mammalian ribosomal protein promoters. BMC Evol Biol 5:15PubMedCrossRefGoogle Scholar
  42. Prabhu VV (1993) Symmetry observations in long nucleotide sequences. Nucleic Acids Res 21:2797–2800PubMedCrossRefGoogle Scholar
  43. Robberson BL, Cote GJ, Berget SM (1990) Exon definition may facilitate splice site selection in RNAs with multiple exons. Mol Cell Biol 10:84–94PubMedGoogle Scholar
  44. Romfo CM, Alvarez CJ, van Heeckeren WJ, Webb CJ, Wise JA (2000) Evidence for splice site pairing via intron definition in Schizosaccharomyces pombe. Mol Cell Biol 20:7955–7970PubMedCrossRefGoogle Scholar
  45. Roscigno RF, Weiner M, Garcia-Blanco MA (1993) A mutational analysis of the polypyrimidine tract of introns. Effects of sequence differences in pyrimidine tracts on splicing. J Biol Chem 268:11222–11229PubMedGoogle Scholar
  46. Rudner R, Karkas JD, Chargaff E (1968) Separation of B. subtilis DNA into complementary strands. 3. Direct analysis. Proc Natl Acad Sci USA 60:921–922PubMedCrossRefGoogle Scholar
  47. Ruskin B, Green MR (1985) Role of the 3’ splice site consensus sequence in mammalian pre-mRNA splicing. Nature 317:732–734PubMedCrossRefGoogle Scholar
  48. Sang Lee J, Gyu Park S, Park H, Seol W, Lee S, Kim S (2002) Interaction network of human aminoacyl-tRNA synthetases and subunits of elongation factor 1 complex. Biochem Biophys Res Commun 291:158–164PubMedCrossRefGoogle Scholar
  49. Schaal TD, Maniatis T (1999) Multiple distinct splicing enhancers in the protein-coding sequences of a constitutively spliced pre-mRNA. Mol Cell Biol 19:261–273PubMedGoogle Scholar
  50. Seshaiah P, Andrew DJ (1999) WRS-85D: A tryptophanyl-tRNA synthetase expressed to high levels in the developing Drosophila salivary gland. Mol Biol Cell 10:1595–1608PubMedGoogle Scholar
  51. Smithies O, Engels WR, Devereux JR, Slightom JL, Shen S (1981) Base substitutions, length differences and DNA strand asymmetries in the human G gamma and A gamma fetal globin gene region. Cell 26:345–353PubMedCrossRefGoogle Scholar
  52. Szybalski W, Kubinski H, Sheldrick P (1966) Pyrimidine clusters on the transcribing strand of DNA and their possible role in the initiation of RNA synthesis. Cold Spring Harb Symp Quant Biol 31:123–127PubMedGoogle Scholar
  53. Tacke R, Manley JL (1999) Determinants of SR protein specificity. Curr Opin Cell Biol 11:358–362PubMedCrossRefGoogle Scholar
  54. Tatarinova T, Brover V, Troukhan M, Alexandrov N (2003) Skew in CG content near the transcription start site in Arabidopsis thaliana. Bioinformatics 19 (Suppl 1):i313–i314PubMedCrossRefGoogle Scholar
  55. Tautz J, Maier S, Groh C, Rossler W, Brockmann A (2003) Behavioral performance in adult honey bees is influenced by the temperature experienced during their pupal development. Proc Natl Acad Sci USA 100:7343–7347PubMedCrossRefGoogle Scholar
  56. van der Velden AW, van Nierop K, Voorma HO, Thomas AA (2002) Ribosomal scanning on the highly structured insulin-like growth factor II-leader 1. Int J Biochem Cell Biol 34:286–297PubMedCrossRefGoogle Scholar
  57. van Ruissen F, Jansen BJ, de Jongh GJ, Zeeuwen PL, Schalkwijk J (2002) A partial transcriptome of human epidermis. Genomics 79:671–678PubMedCrossRefGoogle Scholar
  58. Wagner EJ, Garcia-Blanco MA (2001) Polypyrimidine tract binding protein antagonizes exon definition. Mol Cell Biol 21:3281–3288PubMedCrossRefGoogle Scholar
  59. Warner JR (1999) The economics of ribosome biosynthesis in yeast. Trends Biochem Sci 24:437–440PubMedCrossRefGoogle Scholar
  60. Warrington JA, Nair A, Mahadevappa M, Tsyganskaya M (2000) Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes. Physiol Genomics 2:143–147PubMedGoogle Scholar
  61. Webb CJ, Romfo CM, van Heeckeren WJ, Wise JA (2005) Exonic splicing enhancers in fission yeast: functional conservation demonstrates an early evolutionary origin. Genes Dev 19:242–254PubMedCrossRefGoogle Scholar
  62. Yu Y, Zhang C, Zhou G, Wu S, Qu X, Wei H, Xing G, Dong C, Zhai Y, Wan J, Ouyang S, Li L, Zhang S, Zhou K, Zhang Y, Wu C, He F (2001) Gene expression profiling in human fetal liver and identification of tissue- and developmental-stage-specific genes through compiled expression profiles and efficient cloning of full-length cDNAs. Genome Res 11:1392–1403PubMedCrossRefGoogle Scholar
  63. Zuckerkandl E (1986) Polite DNA: functional density and functional compatibility in genomes. J Mol Evol 24:12–27PubMedCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, Inc. 2007

Authors and Affiliations

  1. 1.Institute of EvolutionHaifa UniversityMount CarmelIsrael

Personalised recommendations