Journal of Molecular Evolution, Volume 22, Issue 2, pp 117–133

Rat LINE1: The origin and evolution of a family of long interspersed middle repetitive DNA elements

  • Marcelo Bento Soares
  • Eric Schon
  • Argiris Efstratiadis

Summary

We present approximately 7.0 kb of composite DNA sequence of a long interspersed middle repetitive element (LINE1) present in high copy number in the rat genome. The family of these repeats, which includes transcribing members, is the rat homologue of the mouse MIF-Bam-R and human Kpn I LINEs. Sequence alignments between specimens from these three species define the length of a putative unidentified open reading frame, and document extensive recombination events that, in conjunction with retroposition, have generated this large family of pseudogenes and pseudogene fragments. Comparative mapping of truncated elements indicates that a specific endonucleolytic activity might be involved in illegitimate (nonhomologous) recombination events. Sequence divergence analyses provide insights into the origin and molecular evolution of these elements.

Key words

Repetitive DNA, Retroposon, Pseudogene

Copyright information

© Springer-Verlag 1985

Authors and Affiliations

  • Marcelo Bento Soares (1)
  • Eric Schon (1)
  • Argiris Efstratiadis (1)
  1. Department of Human Genetics and Development, Columbia University, New York, USA
