Abstract
Palindromic sequences are important DNA motifs involved in the regulation of different cellular processes, but are also a potential source of genetic instability. In order to initiate a systematic study of palindromes at the whole genome level, we developed a computer program that can identify, locate and count palindromes in a given sequence in a strictly defined way. All palindromes, defined as identical inverted repeats without spacer DNA, can be analyzed and sorted according to their size, frequency, GC content or alphabetically. This program was then used to prepare a catalog of all palindromes present in the chromosomal DNA of the yeast Saccharomyces cerevisiae. For each palindrome size, the observed palindrome counts were significantly different from those in the randomly generated equivalents of the yeast genome. However, while the short palindromes (2–12 bp) were under-represented, the palindromes longer than 12 bp were over-represented, AT-rich and preferentially located in the intergenic regions. The 44-bp palindrome found between the genes CDC53 and LYS21 on chromosome IV was the longest palindrome identified and contained only two C-G base pairs. Avoidance of coding regions was also observed for palindromes of 4–12 bp, but was less pronounced. Dinucleotide analysis indicated a strong bias against palindromic dinucleotides that could explain the observed short palindrome avoidance. We discuss some possible mechanisms that may influence the evolutionary dynamics of palindromic sequences in the yeast genome.
Similar content being viewed by others
References
Bissler JJ (1998) DNA inverted repeats and human disease. Front Biosci 3:408–418
Chen Y, Gill GN (1996) A heteromeric nuclear protein complex binds two palindromic sequences in the proximal enhancer of the human erbB-2 gene. J Biol Chem 271:5183–5188
Cuomo AC, Mundy CL, Oettinger MA (1996) DNA sequence and structure requirements for cleavage of V(D)J recombination signal sequences. Mol Cell Biol 16:5683–5690
Fuglsang A (2004) The relationship between palindrome avoidance and intergenic codon usage variations: a Monte Carlo study. Biochem Biophys Res Commun 316:755–762
Gelfand MS, Koonin EV (1997) Avoidance of palindromic words in bacterial and archaeal genomes: a close connection with restriction enzymes. Nucleic Acids Res 25:2430–2439
Gordenin DA, Resnick MA (1998) Yeast ARMs (DNA at-risk motifs) can reveal sources of genome instability. Mutat Res 400:45–58
Gordenin DA, Lobachev KS, Degtyareva NP, Malkova AL, Perkins E, Resnick MA (1993) Inverted DNA repeats: a source of eukaryotic genomic instability. Mol Cell Biol 13:5315–5322
Karlin S, Mrazek J, Campbell AM (1997) Compositional biases of bacterial genomes and evolutionary implications. J Bacteriol 179:1363–1370
Kruglyak S, Durret RT, Schug MD, Aquadro CE (1998) Equilibrium distribution of microsatellite repeat length resulting from a balance between slippage events and point mutations. Proc Natl Acad Sci USA 95:10774–10778
Kurtz S, Schleiermacher C (1999) REPuter: fast computation of maximal repeats in complete genomes. Bioinformatics 15:426–427
Leach DRF (1994) Long DNA palindromes, cruciform structures, genetic instability and secundary structure repair. Bioessays 16:893–898
LeBlanc MD, Aspeslagh G, Buggia NP, Dyer BD (2000) An annotated catalog of inverted repeats of Caenorhabditis elegans chromosomes III and X, with observations concerning odd/even biases and conserved motifs. Genome Res 10:1381–1392
Lobachev KS, Gordenin DA, Resnick MA (2002) The Mre11 complex is required for repair of hairpin-capped double-strand breaks and prevention of chromosome rearrangements. Cell 103:83–193
Mori K, Ogawa N, Kawahara T, Yanagi H, Yura T (1998) Palindrome with spacer of one nucleotide is characteristic of the cis-acting unfolded protein response element in Saccharomyces cerevisiae. J Biol Chem 273:9912–9929
Novina CD, Sharp PA (2004) The RNAi revolution. Nature 430:161–164
Nussinov R (1984) Doublet frequencies in evolutionary distinct groups. Nucleic Acids Res 12:1749–1763
Repping S, Skaletsky H, Lange J, Silber S, Veen F van der, Oates RD, Page DC, Rozen S (2002) Recombination between palindromes P5 and P1 on the human Y chromosome causes massive deletions and spermatogenic failure. Am J Hum Genet 71:906–922
Rice P, Longden I, Bleasby A (2000) EMBOSS—the european molecular biology open software suite. Trends Genet 15:276–278
Rocha EPC, Danchin A, Viari A (2001) Evolutionary role of restriction/modification systems as revealed by comparative genome analysis. Genome Res 11:946–958
Sharp PM (1986) Molecular evolution of bacteriophages: evidence of selection against the recognition sites of host restriction enzymes. Mol Biol Evol 3:75–83
Sharp PM, Cowe E (1991) Synonymous codon usage in Saccharomyces cerevisiae. Yeast 7:657–678
Tashiro J, Kinoshita K, Honjo T (2001) Palindromic but not G-rich sequences are targets of class switch recombination. Int Immunol 13:495–505
Toth G, Gaspari Z, Jurka J (2000) Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res 10:967–981
Acknowledgements
We are grateful to Ana Vukelić for help in statistical analysis. This work was supported by grant 0058014 from the Croatian Ministry of Science, Education and Sports.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by S. Hohmann
Rights and permissions
About this article
Cite this article
Lisnić, B., Svetec, IK., Šarić, H. et al. Palindrome content of the yeast Saccharomyces cerevisiae genome. Curr Genet 47, 289–297 (2005). https://doi.org/10.1007/s00294-005-0573-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00294-005-0573-5