Skip to main content
Log in

Pattern recognition of sequence similarities in globular proteins by Fourier analysis: A novel approach to molecular evolution

  • Published:
Journal of Molecular Evolution Aims and scope Submit manuscript

Summary

A new algorithm is introduced for analyzing gene-duplication-independent (orthologous) and gene-duplication-dependent amino acid sequence similarities between proteins of different species. It is based on the calculation of an autocorrelation function D(x) as a Fourier series analogous to that used in crystal analysis by x-ray diffraction.

The primary structure of the protein is decomposed into “homopolypeptide-defective sequences” containing identical or similar amino acid residues and vacancies corresponding to the missing amino acid residues. The Fourier transforms F(h) simulating the diffraction patterns of defective linear gratings corresponding to the defective homopolypeptide sequences are calculated. The squared F(h) values are then used as coefficients of Fourier series corresponding to the autocorrelation functions D(x). A peak of D(x) corresponds to a vector of length x, which is the distance between two identical amino acid residues.

It is pointed out that optical diffraction methods, instead of computer methods, would also be useful.

It is shown through a number of examples that this method allows satisfactory pattern recognition of homologies and internal duplications of an initial segment of the polypeptide chain. In the latter case the value of the above method may be seen from the fact that it detects repeated duplications in proteins such as spinach ferredoxin and myoglobin, for which other methods had either failed or given inconclusive results.

The above approach appears most promising for studies of molecular evolution and structure-sequence correlations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Acher R (1974) Recent discoveries in the evolution of proteins. Angew Chem Int Ed 13:186–197

    Google Scholar 

  • Ahrens JH, Dieter V, Grube A (1970) Pseudo random numbers. A new proposal for the choice of multiplicators. Computing 6:121–138

    Google Scholar 

  • Barker WC, Ketcham LK, Dayhoff MO (1978) In: Dayhoff MO (ed) Atlas of protein sequence and structure. National Biomedical Research Foundation, vol 5, suppl 3, pp 359–362

  • Blundell TL, Sewell T, Turnell B (1981) Symmetry in the structure and organization of proteins. In: Dodson G, Glusker JP, Sayre D (eds) Structural studies on molecules of biological interest. Clarendon Press, Oxford, 1981, pp 390–403

    Google Scholar 

  • Bragg L (1948) The crystalline state, vol A: general survey. G Bell and Sons, Lodon

    Google Scholar 

  • Chothia C (1975) Structural invariants in protein folding. Nature 254:304–308

    PubMed  Google Scholar 

  • Dayhoff MO (ed) (1972) Atlas of protein sequence and structure. National Biomedical Research Foundation, Silver Spring, Maryland

    Google Scholar 

  • Dickerson RE, Timkovich A (1975) Cytochromes c. Enzymes 11:397–547

    Google Scholar 

  • Doolittle RF (1981) Similar amino acid sequences: chance or common ancestry? Science 214:149–159

    PubMed  Google Scholar 

  • Fasman GD (ed) (1976) Handbook of biochemistry and molecular biology, vol III: proteins. CES Press, Cleveland

    Google Scholar 

  • Fietzek PP, Kuehn K (1976) The primary structure of collagen. Int Rev Connect Tissue Res 7:1–60

    PubMed  Google Scholar 

  • Grantham R (1974) Amino acid difference formula to help explain protein evolution. Science 185:862–864

    PubMed  Google Scholar 

  • Grutter MG, Weaver LH, Mathews BW (1983) Goose lysozyme structure; an evolutionary link between hen and bacteriophage lysozymes? Nature 297:343–345

    Google Scholar 

  • Ivanov OC, Ivanov CP (1980) Some evidence for the universality of structural periodicity in proteins. J Mol Evol 16: 47–68

    PubMed  Google Scholar 

  • Jukes TH (1966) Molecules and evolution. Columbia University Press, New York

    Google Scholar 

  • Kakudo M (1983) Protein crystallography at Osaka. In: Srinivasan R, Sarma RH (eds) Conformation in biology. Adenine Press, New York, pp 461–466

    Google Scholar 

  • Liquori AM (1965) Minimum energy conformations of biological polymers. In: Wolstenholme GEW, Connor MO, Ciba Foundation symposium on principles of biomolecular organization. JA Churchill, London, p 40–62

    Google Scholar 

  • Liquori AM (1968) Stereochemical code of amino acid residues in polypeptides and proteins. In: Enggstrom A, Standeberger B (eds) Symmetry and function of biological systems at the macromolecular level, proceedings of the 11th Nobel symposium, Almquist & Wiksell, Stockholm, and Wiley-Interscience, New York London Sydney, pp 101–121

    Google Scholar 

  • Liquori AM (1981) Order versus regularity in globular proteins. In: Structural order in polymers, IUPAC international symposium, Florence, 1980. Pergamon Press, Oxford, pp 181–187

    Google Scholar 

  • Liquori AM, Sadun C (1978) Invariance in the packing of amino acid residues of cytochrome c and myoglobin during biological evolution. Gazz Chim Ital 108:513–517

    Google Scholar 

  • Liquori AM, Sadun C (1981) Close packing of amino acid residues in globular proteins: specific volume and site binding of water molecules. Int J Biol Macromol 3:56–59

    Google Scholar 

  • Liquori AM, Ottani S, Ripamonti A, Sadun C (1983a) Genes as quasi-periodic linear lattices: a novel approach to the analysis of nucleotide sequences. Rend Accad Naz Lincei [VIII] 74(6):389–395

    Google Scholar 

  • Liquori AM, Ripamonti A, Sadun C, Ottani S, Braga D (1983b) Fourier analysis of the primary structure of globular proteins. Rend Accad Naz Lincei [VIII] 75(1–2):71–78

    Google Scholar 

  • Liquori AM, Sadun C, Sorrentino F, Ottani S (1985) Quasi periodic patterns in the coding sequences of nucleic acids. Contribution to the discussion on primary structure, conformation and evolution of nucleic acids and Proteins. Accademia Nazionale dei Licei, Roma, in press

    Google Scholar 

  • McLachlan AD (1972) Repeating sequences and gene duplication in proteins. J Mol Biol 64:417–437

    PubMed  Google Scholar 

  • McLachlan AD (1976) Evidence for gene duplication in collagen. J Mol Biol 107:159–174

    PubMed  Google Scholar 

  • McLachlan AD (1977) Analysis of periodic patterns in amino acid sequences: collagen. Biopolymers 16:1271–1297

    PubMed  Google Scholar 

  • Rossman MG, Argos P (1976) Exploring structural homology of proteins. J Mol Biol 105:75–95

    PubMed  Google Scholar 

  • Sarma R (1983) Lysozyme structure: Why still work on it? In: Srinivasan R, Sarma RH (eds) Comformation in biology. Adenine Press, New York, pp 69–78

    Google Scholar 

  • Scoloudi H (1978) A preliminary comparison of metmyoglobin molecules from seal and sperm whale. J Mol Biol 126:661–671

    PubMed  Google Scholar 

  • Steinert PM, Rice RH, Roop DR, Trus BL, Steven AC (1983) Complete amino acid sequence of a mouse epidermal keratin subunit and implications for the structure of intermediate filaments. Nature 302:794–800

    PubMed  Google Scholar 

  • Taylor CA, Lipson H (1964) Optical transforms. G Bell and Sons, London

    Google Scholar 

  • Urry DW, Venkatachalam CM, Long MM, Prasad KU (1983) Dynamic spiral and a librational entropy mechanism of elasticity. In: Srinivasan R, Sarma RH (eds) Conformation in biology. Adenine Press, New York, p 11

    Google Scholar 

  • Ycas M (1972) De novo origin of periodic proteins. J Mol Evol 2:17–27

    PubMed  Google Scholar 

  • Yockey HP (1977) A prescription which predicts functionally equivalent residues at given sites in protein sequence. J Theor Biol 67:337–343

    PubMed  Google Scholar 

  • Zuckerkandl E, Pauling L (1965) Evolutionary divergence and convergence in proteins. In: Bryson V, Vogel H (eds) Evolving genes and proteins. Academic Press, New York, p 97

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liquori, A.M., Ripamonti, A., Sadun, C. et al. Pattern recognition of sequence similarities in globular proteins by Fourier analysis: A novel approach to molecular evolution. J Mol Evol 23, 80–87 (1986). https://doi.org/10.1007/BF02101001

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02101001

Key words

Navigation