Summary
Methylation of higher plant DNA occurs at up to 25% of all cytosines, primarily in the sequences CpG2 and CpNpG, both of which are over 80% methylated in wheat and tobacco (Gruenbaum, et al 1981). CpG and CpNpG frequencies and distributions in the known sequences of cloned genes of leguminous plants were analyzed. In this sample CpG occured at only 49% of the frequency expected if the bases were distributed at random. This lower frequency may be attributed to the fixation of mutations generated by a high rate of deamination of 5methylcytosine to thymine (Salser 1977). Consistent with this hypothesis, the product of CpG transitions, TpG and CpA, were significantly above their expected frequency. However CpNpG occured at approximately expeced levels and there was no significant increase in its transistion products CpNpA and TpNpG. Possible explanations for this phenomenon are discussed. An analysis of the distribution of di- and trinucleotides across functionally classified regions of genes showed CpG to be asymmetrically distributed. CpG was on average significantly enriched in the 3′ flanking regions compared to other regions. This may reflect a methylation-mediated regulatory role for this region in some legume genes.
Similar content being viewed by others
References
Adams RLP, Burdon RH (1982) DNA methylation in eukaryotes. C.R.C. Crit. Rev Biochem 13:349–384
Bedbrook JR, Smith SM, Ellis RJ (1980) Molecular cloning and sequencing of cDNA encoding the precursor to the small subunit of chloroplast ribulose-1,5-biphosphate carboxylase. Nature 287:692–697
Berry-Lowe SL, McKnight TD, Shah DM, Meagher RB (1982) The nucleotide sequence of one member of a multigene family encoding the small subunit of ribulose-1,5-biphosphate in soybean. J Mol Appl Genetics 1:483–498
Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504
Brisson N, Verma DPS (1982) Soybean leghemoglobin gene family: Normal, pseudo and truncated genes. Proc. Natl Acad Sci USA 79:4055–4059
Croy RRD, Lycett GW, Gatehouse JA, Yarwood JN, Boulter D (1982) Cloning and analysis of cDNAs encoding plant storage genes. Nature 295:76–79
Ergle DR, Katterman RH (1961) DNA of cotton. Plant Physiol 36:811–815
Franck A, Guilley H, Jonard G, Richards K, Hirth L (1980) Nucleotide sequence of cauliflower mozaic virus DNA. Cell 21:285–294
Gardner RC, Howarth AJ, Hahn P, Brown-Luedi M, Shepherd RJ, Messing J (1981) The complete nucleotide sequence of an infectious clone of cauliflower mozaic virus by M13 mp7 shotgun cloning. Nucleic Acids Res 9:2871–2888
Geraghty D, Peifer MA, Rubenstein I, Messing J (1981) The primary structure of a plant storage protein-zein. Nucleic Acids Res 9:5163–5174
Gruenbaum Y, Navey-Many T, Cedar H, Razin A (1981) Sequence specificity of methylation in higher plant DNA. Nature 292:860–862
Hyldig-Nielsen JJ, Jensen EO, Paludan K, Wiborg O, Garrett R, Jorgensen P, Marcker KA (1982) The primary structures of two leghemoglobin genes from soybean. Nucleic Acids Res 10:689–701
King JL, Jukes TH (1969) Non-Darwinian Evolution. Science 164:788–797
McClelland M (1981) The effect of site specific methylation on restriction endonuclease digestion. Nucleic Acids Res 9: 5859–5866
McClelland M (1981a) Purification and characterization of two new modification methylases; M.Cla I from Caryphanon latum L and M.Taq I from Thermus aquaticus YT1, Nucleic Acids Res 9:6795–6804
McClelland M, Ivarie R (1982) Asymmetrical distribution of CpG in an “average” mammalian gene. Nucleic Acid Res 10:7865–7877
McClelland M (1983) Effect of site specific methylation on restriction endonuclease cleavage (Update). Nucleic Acids Res 11:r169-r173
Pedersen K, Devereux J, Wilson DR, Sheldon E, Larkins BA (1982) Cloning and sequence analysis reveals structural variations among related zein genes in maize. Cell 29:1015–1026
Setlow P (1976) In C.R.C. Handbook of Biochemistry and Molecular Biology 3rd Edition. Fasman GD (ed) CRC Press, Cleveland, Ohio pp 313–318
Shah DM, Hightower RC, Meagher RB (1982) Complete nucleotide sequence of a soybean actin gene. Proc Natl Acad Sci USA 79:1022–1026
Sulimova GE, Mazin AL, Vanyushin BF, Belozerskii AN (1970) Content of 5-methyl cytosine in higher plant DNA fractions of different composition. Dokl Akad Nauk SSSR 193:1422
Sun SM, Slightom JL, Hall TC (1981) Intervening sequences in a plant gene. Comparison of the partial sequences of cDNA and genomic DNA of French bean phaseolin. Nature 287: 37–41
Swartz MN, Trautner TA, Kornberg A (1962) Enzymatic synthesis of DNA. 11. Further studies on nearest neighbor base sequences in DNA's. J Biol Chem 237:1961
Uryson, Belozerskii AN (1959) Content of 5-methyl cytosine in higher plant DNA's. Dokl Akad Nauk SSSR 125:1144
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
McClelland, M. The frequency and distribution of methylatable DNA sequences in leguminous plant protein coding genes. J Mol Evol 19, 346–354 (1983). https://doi.org/10.1007/BF02101638
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02101638