Abstract
Mammalian genomes, unlike the genomes of Drosophila and yeast, are characterized by CpG methylation and concomitant CpG depletion, which is caused by the enhanced mutation rate of 5-methylcytosine. To find out whether local nucleotide sequences around existing methylated CpG dinucleotides have common patterns, we analyzed a large population of CpG-poor regions in human DNA, which are typically methylated. We detected a novel periodic variation in the numbers of purine bases around CpGs in the noncoding parts of these sequences. This periodicity of eight nucleotides gradually diminished over 64 nucleotides on each side of the central CpG. Furthermore, the frequencies of the 5′ and 3′ nearest neighbors of CpGs in CpG-poor regions were biased towards cytosine and guanine, respectively. Such biased sequence contexts may have helped to stabilize CpGs against depletion during mammalian evolution.
Similar content being viewed by others
Literature Cited
Bird, A.P. (1986).Nature 321:209–213.
Gardiner-Garden, M., and Frommer, M. (1987).J. Mol. Biol. 196:261–282.
Matsuo, K., Clay, O., Takahashi, T., Silke, J., and Schaffner, W. (1993).Somat. Cell Mol. Genet. 19:543–555.
Larsen, F., Gundersen, G., Lopez, R., and Prydz, H. (1992).Genomics 13:1095–1107.
Antequera, F., and Bird, A. (1993).Proc. Natl. Acad. Sci. U.S.A. 90:11995–11999.
Brandeis, M., Frank, D., Keshet, I., Siegfried, Z., Mendelsohn, M., Nemes, A., Temper, V., Razin, A., and Cedar, H. (1994).Nature 371:435–438.
Macleod, D., Charlton, J., Mullins, J., and Bird, A.P. (1994).Genes Dev. 8:2282–2292.
Li, E., Bestor, T.H., and Jaenisch, R. (1992).Cell 69:915–926.
Li, E., Beard, C., and Jaenisch, R. (1993).Nature 366:362–365.
Sved, J., and Bird, A. (1990).Proc. Natl. Acad. Sci. U.S.A. 87:4692–4696.
Bestor, T.H., Gundersen, G., Kolstø, A.-B., and Prydz, H. (1992).GATA 9:48–53.
Devereux, J., Haeberli, P., and Smithies, O. (1984).Nucleic Acids Res. 12:387–395.
Kernighan, B.W., and Ritchie, D.M. (1978).The C Programming Language, (Prentice Hall, Englewood Cliffs, New Jersey).
Cooley, J.W., and Tukey, J.W. (1965).Math. Comput. 19:297–301.
Press, W.H., Flannery, B.P., Teukolsky, S.A., and Vetterling, W.T. (1988).Numerical Recipes in C, (Cambridge University Press, New York).
Voss, R.F. (1992).Phys. Rev. Lett. 68:3805–3808.
Kendall, M., and Ord, J.K. (1990).Time Series, 3rd ed., (Edwin Arnold, Sevenoaks, Kent), pp. 24–26.
Ehrlich, M., Zhang, X.-Y., and Inamdar, N.M. (1990).Mutat. Res. 238:277–286.
Shepherd, J.C.W. (1981).Proc. Natl. Acad. Sci. U.S.A. 78:1596–1600.
Voss, R.F. (1993).Phys. Rev. Lett. 71:1777.
Aïssani, B., and Bernardi, G. (1991).Gene 106:173–183.
Aïssani, B., and Bernardi, G. (1991).Gene 106:185–195.
Bucher, P. (1990).J. Mol. Biol. 212:563–578.
Frigerio, G., Burri, M., Bopp, D., Baumgartner, S., and Noll, M. (1986).Cell 47:735–746.
Drew, H.R., and Travers, A.A. (1985).J. Mol. Biol. 186:773–790.
Trifonov, E.N. (1987).J. Mol. Biol. 194:643–652.
Wada, K., Wada, Y., Doi, H., Ishibashi, F., Gojobori, T., and Ikemura, T. (1991).Nucleic Acids Res. 19(Suppl):1981–1986.
Rich, A., Nordheim, A., and Wang, A.H.-J. (1984).Annu. Rev. Biochem. 53:791–846.
Arquès, D.G., and Michel, C.J. (1990).J. Theor. Biol. 143:307–318.
Wolffe, A. (1992).Chromatin: Structure and Function, (Academic Press, London), pp. 20–23.
Arquès, D.G., and Michel, C.J. (1987).Nucleic Acids Res. 15:7581–7592.
MacLeod, M.C. (1993).Nucleic Acids Res. 21:1439–1447.
Bernardi, G., Olofsson, B., Filipski, J., Zerial, M., and Salinas, J., Cuny, G., Meunier-Rotival, M, and Rodier, F. (1985).Science 228:953–958.
Bernardi, G. (1993).Gene 135:57–66.
Nelson, M., Raschke, E., and McClelland, M. (1993).Nucleic Acids Res. 21:3139–3154.
Bird, A.P., and Southern, E.M. (1978).J. Mol. Biol. 118:27–47.
Shen, J.-C., Rideout, W.M., III, and Jones, P.A. (1994).Nucleic Acids Res. 22:972–976.
Brown, T.C., and Jiricny, J. (1987).Cell 50:945–950.
Jost, J.-P. (1993).Proc. Natl. Acad. Sci. U.S.A. 90:4684–4688.
Mitra, R., Pettitt, B.M., Ramé, G.L., and Blake, R.D. (1993).Nucleic Acids Res. 21:6028–6037.
Krüger, T., Wild, C., and Noyer-Weidner, M. (1995).EMBO J. 14:2661–2669.
Meehan, R.R., Lewis, J.D., McKay, S., Kleiner, E.L., and Bird, A.P. (1989).Cell 58:499–507.
Lewis, J.D., Meehan, R.R., Henzel, W.J., Maurer-Fogy, I., Jeppesen, P., Klein, F., and Bird, A., (1992).Cell 69:905–914.
Meehan, R.R., Lewis, J.D., and Bird, A.P. (1992).Nucleic Acids Res. 20:5085–5092.
Nan, X., Meehan, R.R., and Bird, A., (1993).Nucleic Acids Res. 21:4886–4892.
Jost, J.-P., and Hofsteenge, J. (1992).Proc. Natl. Acad. Sci. U.S.A. 89:9499–9503.
Muiznieks, I., and Doerfler, W. (1994).Nucleic Acids Res. 22:2568–2575.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Clay, O., Schaffner, W. & Matsuo, K. Periodicity of eight nucleotides in purine distribution around human genomic CpG dinucleotides. Somat Cell Mol Genet 21, 91–98 (1995). https://doi.org/10.1007/BF02255784
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF02255784