Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent species

Summary

Chaos game representation (CGR) is a novel holistic approach that provides a visual image of a DNA sequence quite different from the traditional linear arrangement of nucleotides. Although it is known that CGR patterns depict base composition and sequentiality, the biological significance of the specific features of each pattern is not understood. To examine these features systematically, we analyzed the coding sequences of 7 human globin genes and 29 relatively conserved alcohol dehydrogenase (Adh) genes from phylogenetically divergent species. The CGRs of human globin cDNAs were similar to one another and to the CGR of the entire human globin gene complex. Interestingly, human globin CGRs were also strikingly similar to human Adh CGRs. Adh CGRs were similar for genes of the same or closely related species but differed for relatively conserved Adh genes from distantly related species. Dinucleotide frequencies may account for the self-similar pattern that is characteristic of vertebrate CGRs and for the genome-specific features of CGR patterns. Mutational frequencies of dinucleotides may vary among genome types; the special features of CG dinucleotides in vertebrates are one such example. The CGR patterns examined thus far suggest that the evolution of a gene and its coding sequence should not be examined in isolation. Consideration should be given to genome-specific differential mutation rates for different dinucleotides or specific oligonucleotides.
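To make the relationship between a CGR image and dinucleotide frequencies concrete, the following is a minimal sketch of how CGR coordinates and dinucleotide frequencies might be computed. It is not the authors' program. The corner assignment (A bottom-left, C top-left, G top-right, T bottom-right), the starting point at the center of the unit square, and the function names `cgr_points` and `dinucleotide_frequencies` are assumptions made for illustration; the summary does not specify them.

```python
from collections import Counter

# Assumed corner assignment on the unit square (a common convention;
# the summary itself does not state which corners were used).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Return the list of CGR points for a DNA sequence.

    Starting from the center of the square, each base moves the current
    point halfway toward that base's corner. Characters other than
    A, C, G, T are skipped.
    """
    x, y = 0.5, 0.5
    points = []
    for base in seq.upper():
        if base not in CORNERS:
            continue
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def dinucleotide_frequencies(seq):
    """Return relative frequencies of overlapping dinucleotides.

    Only adjacent pairs in which both characters are A, C, G, or T
    are counted.
    """
    s = seq.upper()
    pairs = Counter(
        a + b for a, b in zip(s, s[1:]) if a in CORNERS and b in CORNERS
    )
    total = sum(pairs.values())
    return {pair: n / total for pair, n in pairs.items()} if total else {}
```

Under this assumed layout, every point plotted for a G that follows a C falls in the upper-left sub-quadrant of the upper-right (G) quadrant. A sequence depleted in CG dinucleotides would therefore leave that sub-quadrant sparse, and the same gap would recur at smaller scales, which is one way dinucleotide frequencies could give rise to a self-similar pattern.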

Additional information

Offprint requests to: S. M. Singh

About this article

Cite this article

Hill, K.A., Schisler, N.J. & Singh, S.M. Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent species. J Mol Evol 35, 261–269 (1992). https://doi.org/10.1007/BF00178602

  • DOI: https://doi.org/10.1007/BF00178602

Key words

  • Chaos game representation
  • Sequence composition
  • Sequence structure
  • Oligonucleotide frequencies
  • Human globin gene complex
  • Alcohol dehydrogenase genes
  • CG mutation