Skip to main content
Log in

The frequency and distribution of methylatable DNA sequences in leguminous plant protein coding genes

  • Original Articles
  • Published:
Journal of Molecular Evolution Aims and scope Submit manuscript

Summary

Methylation of higher plant DNA occurs at up to 25% of all cytosines, primarily in the sequences CpG2 and CpNpG, both of which are over 80% methylated in wheat and tobacco (Gruenbaum, et al 1981). CpG and CpNpG frequencies and distributions in the known sequences of cloned genes of leguminous plants were analyzed. In this sample CpG occured at only 49% of the frequency expected if the bases were distributed at random. This lower frequency may be attributed to the fixation of mutations generated by a high rate of deamination of 5methylcytosine to thymine (Salser 1977). Consistent with this hypothesis, the product of CpG transitions, TpG and CpA, were significantly above their expected frequency. However CpNpG occured at approximately expeced levels and there was no significant increase in its transistion products CpNpA and TpNpG. Possible explanations for this phenomenon are discussed. An analysis of the distribution of di- and trinucleotides across functionally classified regions of genes showed CpG to be asymmetrically distributed. CpG was on average significantly enriched in the 3′ flanking regions compared to other regions. This may reflect a methylation-mediated regulatory role for this region in some legume genes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Adams RLP, Burdon RH (1982) DNA methylation in eukaryotes. C.R.C. Crit. Rev Biochem 13:349–384

    Google Scholar 

  • Bedbrook JR, Smith SM, Ellis RJ (1980) Molecular cloning and sequencing of cDNA encoding the precursor to the small subunit of chloroplast ribulose-1,5-biphosphate carboxylase. Nature 287:692–697

    Article  Google Scholar 

  • Berry-Lowe SL, McKnight TD, Shah DM, Meagher RB (1982) The nucleotide sequence of one member of a multigene family encoding the small subunit of ribulose-1,5-biphosphate in soybean. J Mol Appl Genetics 1:483–498

    Google Scholar 

  • Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504

    PubMed  Google Scholar 

  • Brisson N, Verma DPS (1982) Soybean leghemoglobin gene family: Normal, pseudo and truncated genes. Proc. Natl Acad Sci USA 79:4055–4059

    PubMed  Google Scholar 

  • Croy RRD, Lycett GW, Gatehouse JA, Yarwood JN, Boulter D (1982) Cloning and analysis of cDNAs encoding plant storage genes. Nature 295:76–79

    Article  Google Scholar 

  • Ergle DR, Katterman RH (1961) DNA of cotton. Plant Physiol 36:811–815

    Google Scholar 

  • Franck A, Guilley H, Jonard G, Richards K, Hirth L (1980) Nucleotide sequence of cauliflower mozaic virus DNA. Cell 21:285–294

    Article  PubMed  Google Scholar 

  • Gardner RC, Howarth AJ, Hahn P, Brown-Luedi M, Shepherd RJ, Messing J (1981) The complete nucleotide sequence of an infectious clone of cauliflower mozaic virus by M13 mp7 shotgun cloning. Nucleic Acids Res 9:2871–2888

    PubMed  Google Scholar 

  • Geraghty D, Peifer MA, Rubenstein I, Messing J (1981) The primary structure of a plant storage protein-zein. Nucleic Acids Res 9:5163–5174

    PubMed  Google Scholar 

  • Gruenbaum Y, Navey-Many T, Cedar H, Razin A (1981) Sequence specificity of methylation in higher plant DNA. Nature 292:860–862

    Article  PubMed  Google Scholar 

  • Hyldig-Nielsen JJ, Jensen EO, Paludan K, Wiborg O, Garrett R, Jorgensen P, Marcker KA (1982) The primary structures of two leghemoglobin genes from soybean. Nucleic Acids Res 10:689–701

    PubMed  Google Scholar 

  • King JL, Jukes TH (1969) Non-Darwinian Evolution. Science 164:788–797

    PubMed  Google Scholar 

  • McClelland M (1981) The effect of site specific methylation on restriction endonuclease digestion. Nucleic Acids Res 9: 5859–5866

    PubMed  Google Scholar 

  • McClelland M (1981a) Purification and characterization of two new modification methylases; M.Cla I from Caryphanon latum L and M.Taq I from Thermus aquaticus YT1, Nucleic Acids Res 9:6795–6804

    PubMed  Google Scholar 

  • McClelland M, Ivarie R (1982) Asymmetrical distribution of CpG in an “average” mammalian gene. Nucleic Acid Res 10:7865–7877

    PubMed  Google Scholar 

  • McClelland M (1983) Effect of site specific methylation on restriction endonuclease cleavage (Update). Nucleic Acids Res 11:r169-r173

    PubMed  Google Scholar 

  • Pedersen K, Devereux J, Wilson DR, Sheldon E, Larkins BA (1982) Cloning and sequence analysis reveals structural variations among related zein genes in maize. Cell 29:1015–1026

    Article  PubMed  Google Scholar 

  • Setlow P (1976) In C.R.C. Handbook of Biochemistry and Molecular Biology 3rd Edition. Fasman GD (ed) CRC Press, Cleveland, Ohio pp 313–318

    Google Scholar 

  • Shah DM, Hightower RC, Meagher RB (1982) Complete nucleotide sequence of a soybean actin gene. Proc Natl Acad Sci USA 79:1022–1026

    Google Scholar 

  • Sulimova GE, Mazin AL, Vanyushin BF, Belozerskii AN (1970) Content of 5-methyl cytosine in higher plant DNA fractions of different composition. Dokl Akad Nauk SSSR 193:1422

    Google Scholar 

  • Sun SM, Slightom JL, Hall TC (1981) Intervening sequences in a plant gene. Comparison of the partial sequences of cDNA and genomic DNA of French bean phaseolin. Nature 287: 37–41

    Article  Google Scholar 

  • Swartz MN, Trautner TA, Kornberg A (1962) Enzymatic synthesis of DNA. 11. Further studies on nearest neighbor base sequences in DNA's. J Biol Chem 237:1961

    PubMed  Google Scholar 

  • Uryson, Belozerskii AN (1959) Content of 5-methyl cytosine in higher plant DNA's. Dokl Akad Nauk SSSR 125:1144

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

McClelland, M. The frequency and distribution of methylatable DNA sequences in leguminous plant protein coding genes. J Mol Evol 19, 346–354 (1983). https://doi.org/10.1007/BF02101638

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02101638

Keywords

Navigation