Journal of Molecular Evolution

, Volume 27, Issue 4, pp 321–325 | Cite as

The effects of guanine and cytosine variation on dinucleotide frequency and amino acid composition in the human genome

  • Ryo Hanai
  • Akiyoshi Wada


One hundred twelve human DNA sequences were analyzed with respect to dinucleotide frequency and amino acid composition. The variation in guanine and cytosine (G+C) content revealed: (1) at 2–3 and 3-1 doublet positions CG discrimination is attenuated at high G+C, but TA disfavor is enhanced, and (2) several amino acids are subject to G+C change. These findings have been reported in part for collections of sequences from various species. The present study confirms that in a single organism-the human-the G+C effects do exist. Aspects of the argument that connects G+C with protein thermal stability are also discussed.

Key words

G+C content Dinucleotide frequency Amino acid composition Human genome Protein stability 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Aota S-I, Ikemura T (1986) Diversity in G+C content at the third letter position of codons in vertebrate genes and its cause. Nucleic Acids Res 14:6345–6355, 8702 (erratum)PubMedGoogle Scholar
  2. Bernardi G, Bernardi G (1985) Codon usage and genome composition. J Mol Evol 22:363–365PubMedGoogle Scholar
  3. Bernardi G, Bernardi G (1986a) Compositional constraints and genome evolution. J Mol Evol 24:1–11PubMedGoogle Scholar
  4. Bernardi G, Bernardi G (1986b) The human genome and its evolutionary context. Cold Spring Harbor Symp Quant Biol 51:479–487PubMedGoogle Scholar
  5. Bernardi G, Olofsson B, Filipski J, Zerial M, Salinas J, Cury G, Meunier-Rotival M, Rodier F (1985) The mosaic genome of warm-blooded vertebrates. Science 228:953–958PubMedGoogle Scholar
  6. Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504PubMedGoogle Scholar
  7. Endo S, Nagayama K, Wada A (1985) Probing stability and dynamics of proteins by protease digestion I: comparison of protease susceptibility and thermal stability of cytochromesc. J Biomol Struct & Dyn 3:409–421Google Scholar
  8. Goldberg AL, Dice JF (1974) Intracellular protein degradation in mammalian and bacterial cells. Annu Rev Biochem 43:835–869PubMedGoogle Scholar
  9. Goldberg AL, St. John AC (1976) Intracellular protein degradation in mammalian and bacterial cells: part 2. Annu Rev Biochem 45:747–803PubMedGoogle Scholar
  10. Goldman MA, Holmquist GP, Gray MC, Caston LA, Nag A (1984) Replication timing of genes and middle repetitive sequences. Science 224:686–692PubMedGoogle Scholar
  11. Grantham R, Gautier C, Gouy M (1980a) Codon frequencies in 119 individual genes confirm consistent choices of degenerate bases according to genome type. Nucleic Acids Res 8:1893–1912PubMedGoogle Scholar
  12. Grantham R, Gautier C, Gouy M, Mercier R, Pavé A (1980b) Codon catalog usage and the genome hypothesis. Nucleic Acids Res 8:r49–62PubMedGoogle Scholar
  13. Grantham R, Gautier C, Gouy M, Jacobzone M, Mercier R (1981) Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res 9:r43–74PubMedGoogle Scholar
  14. Ikemura T (1985) Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol 2:13–34PubMedGoogle Scholar
  15. Jukes TH (1985) A change in the genetic code inMycoplasma capricolum. J Mol Evol 22:361–362PubMedGoogle Scholar
  16. Jukes TH, Bhushan V (1986) Silent nucleotide substitutions and G+C content of some mitochondrial and bacterial genes J Mol Evol 24:39–44PubMedGoogle Scholar
  17. Levitt M (1976) A simplified representation of protein conformation for rapid simulation of protein folding. J Mol Biol 104:59–107PubMedGoogle Scholar
  18. McLendon G, Radany E (1978) Is protein turnover thermodynamically controlled? J Biol Chem 253:6335–6337PubMedGoogle Scholar
  19. Muto A, Osawa S (1987) The guanine and cytosine content of genomic DNA and bacterial mutation. Proc Natl Acad Sci USA 84:166–169PubMedGoogle Scholar
  20. Nussinov R (1980) Some rules in the ordering of nucleotides in the DNA. Nucleic Acids Res 8:4545–4562PubMedGoogle Scholar
  21. Nussinov R (1981a) Nearest neighbor nucleotide patterns: structural and biological implications. J Biol Chem 256:8458–8462PubMedGoogle Scholar
  22. Nussinov R (1981b) Eukaryotic dinucleotide preference rules and their implications for degenerate codon usage. J Mol Biol 149:125–131PubMedGoogle Scholar
  23. Nussinov R (1981c) The universal dinucleotide asymmetry rules in DNA and the amino acid codon choice. J Mol Evol 17: 237–244PubMedGoogle Scholar
  24. Nussinov R (1984a) Strong doublet preferences in nucleotide sequence and DNA geometry. J Mol Evol 20:111–119PubMedGoogle Scholar
  25. Nussinov R (1984b) Doublet frequencies in evolutionary distinct groups. Nucleic Acids Res 12:1749–1763PubMedGoogle Scholar
  26. Osawa S, Jukes TH, Muto A, Yamao F, Ohama T, Andachi Y (1987) Role of GC/AT-biased mutation pressure in evolution of eubacterial code. Cold Spring Harbor Symp Quant Biol 52:777–789PubMedGoogle Scholar
  27. Russel GJ, McGeoch DJ, Elton RA, Subak-Sharpe JH (1973) Doublet frequency analysis of bacterial DNAs. J Mol Evol 2:277–292PubMedGoogle Scholar
  28. Russel GJ, Walker PMB, Elton RA, Subak-Sharpe JH (1976) Doublet frequency analysis of fractionated vertebrate nuclear DNA. J Mol Biol 108:1–23PubMedGoogle Scholar
  29. Salser W (1977) Globin mRNA sequences: analysis of base pairing and evolutionary implications. Cold Spring Harbor Symp Quant Biol 42:985–1002Google Scholar
  30. Subak-Sharpe H, Bürk RR, Crawford LV, Morrison JM, Hay J, Keir HM (1966) An approach to evolutionary relationships of mammalian DNA viruses through analysis of the pattern of nearest neighbor base sequences. Cold Spring Harbor Symp Quant Biol 31:737–748PubMedGoogle Scholar
  31. Sueoka N (1961) Correlation between base composition of deoxyribonucleic acid and amino acid composition of protein. Proc Natl Acad Sci USA 47:1141–1149Google Scholar
  32. Sueoka N (1962) On the genetic basis of variation and heterogeneity of DNA base composition. Proc Natl Acad Sci USA 48:582–592PubMedGoogle Scholar
  33. Swartz MN, Trautner TA, Kornberg A (1962) Enzymatic synthesis of deoxyribonucleic acid: XI. Further studies on the nearest neighbor base sequences in deoxyribonucleic acids. J Biol Chem 237:1961–1967PubMedGoogle Scholar
  34. Yamao F, Muto A, Kawauchi Y, Iwami M, Iwagami S, Azumi Y, Osawa S (1985) UGA is read as tryptophan inMycoplasma capricolum. Proc Natl Acad Sci USA 82:2306–2309PubMedGoogle Scholar

Copyright information

© Springer-Verlag New York Inc 1988

Authors and Affiliations

  • Ryo Hanai
    • 1
  • Akiyoshi Wada
    • 1
  1. 1.Department of Physics, Faculty of ScienceThe University of TokyoTokyoJapan

Personalised recommendations