Dipeptide frequencies in proteins and the CpG deficiency in vertebrate DNA

Summary

Analysis of vertebrate protein sequences totalling 4040 residues shows that amino acids with a high proportion of codons ending in C occur with significantly reduced frequency before amino acids whose codons start with G. This effect is not shown by “control” bacterial protein sequences. The consequent implication of shortage of XXC·GXX codon pairs in vertebrate messenger RNA is discussed in relation to the extreme rarity of the base doublet CpG in vertebrate DNA.
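The kind of comparison the summary describes can be illustrated with a minimal Python sketch. It is not the authors' procedure, which this preview does not give. It assumes a simple independence model in which the expected count of a dipeptide is the product of the counts of its two residues in the first and second positions of adjacent pairs, divided by the total number of pairs. The function name `dipeptide_ratios` and the toy input are hypothetical.

```python
from collections import Counter


def dipeptide_ratios(sequences):
    """Return {dipeptide: observed / expected} over adjacent residue pairs.

    Assumption (not from the paper): expected counts come from an
    independence model based on how often each residue occupies the
    first and the second position of a pair.
    """
    first = Counter()   # residue counts in the first position of a pair
    second = Counter()  # residue counts in the second position of a pair
    pairs = Counter()   # observed dipeptide counts
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            first[a] += 1
            second[b] += 1
            pairs[a + b] += 1

    total = sum(pairs.values())
    ratios = {}
    for a in first:
        for b in second:
            expected = first[a] * second[b] / total
            ratios[a + b] = pairs[a + b] / expected
    return ratios


if __name__ == "__main__":
    # Toy one-letter-code input, for illustration only (not real data).
    toy = ["HGHG"]
    for dipeptide, ratio in sorted(dipeptide_ratios(toy).items()):
        print(dipeptide, round(ratio, 3))
```

A ratio well below 1 for a pair whose first residue is often encoded by codons ending in C and whose second residue has codons starting with G would be the sort of deficit the summary reports. Testing its significance would need a proper statistical procedure, which this sketch does not attempt.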

Bullock, E., Elton, R.A. Dipeptide frequencies in proteins and the CpG deficiency in vertebrate DNA. J Mol Evol 1, 315–325 (1972). https://doi.org/10.1007/BF01653960

Key words

  • Vertebrate Protein Sequences
  • Doublet Frequencies
  • CpG Deficiency