Journal of Molecular Evolution, Volume 40, Issue 2, pp 120–126

Simple repetitive DNA sequences from primates: Compilation and analysis

  • Jerzy Jurka
  • Charit Pethiyagoda


Simple repeats composed of tandemly repeated units 1–6 nucleotides (nt) long have been extracted from a selected set of primate genomic DNA sequences. Of the 501 theoretically possible types of repeats, only 67 were present in the analyzed database in at least two different size ranges over 12 nt. These 67 include all simple repeats known to be polymorphic in the primate genome. A list of moderately expanding and nonexpanding oligonucleotide patterns is also included. Furthermore, we have compiled statistical data with emphasis on the overall variability of the 67 most abundant types of repeats. We demonstrate that the expandability of at least some simple repeats may be affected by the overall base composition and by flanking sequences. In particular, the occurrence of tandemly repeated CAG and GCC triplets in exons correlates positively with the G+C content of those exons. We also note that tetrameric repeats are more abundant in the vicinity of Alu sequences than in total genomic DNA. This paper can be used as a comprehensive guide to identifying the most abundant and potentially polymorphic simple repeats. It is also of broader significance as a step toward understanding how flanking sequences and overall sequence composition contribute to the variability of simple repeats.
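The figure of 501 possible repeat types can be reproduced with a short enumeration. The sketch below is illustrative only and is not the authors' software. It assumes that two repeat units are the same type when one is a cyclic rotation of the other or of its reverse complement, and that a unit which is itself a tandem repetition of a shorter unit (for example ACAC) is not counted separately. Under those assumptions the totals for unit lengths 1–6 sum to 501. The function names, and the reading of "over 12 nt" as a minimum run length of 13 nt, are hypothetical choices made for this example.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def is_primitive(unit):
    """True if unit is not a tandem repetition of a shorter unit."""
    n = len(unit)
    return not any(
        n % d == 0 and unit == unit[:d] * (n // d) for d in range(1, n)
    )


def canonical_unit(unit):
    """Smallest string among all rotations of unit and of its reverse complement."""
    candidates = []
    for s in (unit, reverse_complement(unit)):
        candidates.extend(s[i:] + s[:i] for i in range(len(s)))
    return min(candidates)


def count_repeat_types(max_unit=6):
    """Number of distinct repeat types for each unit length 1..max_unit."""
    counts = {}
    for k in range(1, max_unit + 1):
        units = ("".join(p) for p in product("ACGT", repeat=k))
        counts[k] = len({canonical_unit(u) for u in units if is_primitive(u)})
    return counts


def find_simple_repeats(seq, min_len=13, max_unit=6):
    """Return sorted (start, end, unit) for maximal perfect tandem runs.

    A run is a stretch seq[start:end] in which every base equals the base
    k positions earlier, for a unit length k in 1..max_unit. min_len=13 is
    this sketch's assumed reading of "over 12 nt".
    """
    seq = seq.upper()
    hits = []
    for k in range(1, max_unit + 1):
        run_start = 0  # start of the current stretch with period k
        for i in range(k, len(seq) + 1):
            if i < len(seq) and seq[i] == seq[i - k]:
                continue  # period k still holds, keep extending
            unit = seq[run_start:run_start + k]
            long_enough = i - run_start >= max(min_len, 2 * k)
            if long_enough and is_primitive(unit) and set(unit) <= set("ACGT"):
                hits.append((run_start, i, unit))
            run_start = i - k + 1  # next stretch starts past the mismatch
    return sorted(hits)


if __name__ == "__main__":
    counts = count_repeat_types()
    print(counts, sum(counts.values()))
    demo = "GATTACA" + "CAG" * 6 + "TTGACC" + "A" * 15 + "GC"
    for start, end, unit in find_simple_repeats(demo):
        print(start, end, canonical_unit(unit), end - start)
```

The scanner reports only perfect repeats. A real survey would also need a policy for interrupted repeats and for grouping hits into size ranges, neither of which the abstract specifies.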

Key words

Simple repetitive DNA · Microsatellites · Unstable triplets · Database · Evolution





Copyright information

© Springer-Verlag 1995

Authors and Affiliations

  • Jerzy Jurka (1)
  • Charit Pethiyagoda (1)

  1. Linus Pauling Institute of Science and Medicine, Palo Alto, USA
