Abstract
A protein sequence database (PFDB) containing about 11,000 entries is available for Macintosh computers. The PFDB can be easily updated by importing sequences from the PIR collection over the Internet. The most important feature of the database is its organization into families of closely related sequences, each family being characterized by its average dipeptide composition [Petrilli (1993), Comput. Appl. Biosci. 9, 205–209]. This allows a rapid and sensitive protein similarity search: the precalculated dipeptide composition of each family is compared with that of the query sequence by means of a linear correlation coefficient. An example application is reported in which a new protein was classified using the sequence of a fragment just 19 residues long.
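The comparison described in the abstract can be illustrated with a minimal sketch. It is not the PFDB implementation: the function names, the use of overlapping dipeptides over the 20 standard residues, the unweighted family average, and the toy sequences are all assumptions made for illustration, and the published method may differ in these details.

```python
from itertools import product
from math import sqrt

# Assumption: only the 20 standard amino acids are counted.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs


def dipeptide_composition(seq):
    """Return the relative frequency of each of the 400 overlapping dipeptides."""
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for a, b in zip(seq, seq[1:]):
        pair = a + b
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def family_composition(seqs):
    """Average the dipeptide compositions of a family (unweighted mean, an assumption)."""
    comps = [dipeptide_composition(s) for s in seqs]
    return [sum(col) / len(comps) for col in zip(*comps)]


def pearson(x, y):
    """Linear (Pearson) correlation coefficient between two equal-length vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # correlation is undefined for a constant vector
    return sxy / sqrt(sxx * syy)


def rank_families(query, families):
    """Rank precalculated family compositions by correlation with the query."""
    q = dipeptide_composition(query)
    scores = {name: pearson(q, comp) for name, comp in families.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data, invented for illustration only.
families = {
    "fam_a": family_composition(["ACDACDACD", "ACDACDAC"]),
    "fam_b": family_composition(["WYWYWYWY", "WYWYWY"]),
}
ranking = rank_families("ACDACD", families)
```

Because each family composition is computed once and stored, a search costs one 400-element correlation per family rather than one alignment per database sequence, which is what makes the approach fast.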
REFERENCES
Blaisdell, B. E. (1989). J. Mol. Evol. 29, 526–537.
Callebaut, C., Krust, B., Jacotot, E., and Hovanessian, A. G. (1993). Science 262, 2045–2050.
Doolittle, R. F. (1986). Of Urfs and Orfs: A Primer on How to Analyze Derived Amino Acid Sequences, University Science Books, Mill Valley, California.
Fuchs, R., and Cameron, G. N. (1991). Prog. Biophys. Mol. Biol. 56, 215–245.
George, D. G., Hunt, L. T., and Barker, W. C. (1988). Macromolecular Sequencing and Synthesis. Selected Methods and Applications, Liss, New York, pp. 127–149.
Hobohm, U., and Sander, C. (1995). J. Mol. Biol. 251, 390–399.
Kennedy, M. B. (1995). Trends Biol. Sci. 20, 349–350.
Lipman, D. J., and Pearson, W. R. (1985). Science 227, 1435–1441.
Pabo, C. O. (1987). Nature 327, 467.
Pearson, W. R., and Lipman, D. J. (1988). Proc. Natl. Acad. Sci. USA 85, 2444–2448.
Petrilli, P. (1993). Comput. Appl. Biosci. 9, 205–209.
Pongor, S. (1988). Nature 332, 24.
Rawlings, C. J. (1988). Nature 334, 477.
Snedecor, G. W., and Cochran, W. G. (1967). Statistical Methods, Iowa State University Press, Ames, Iowa.
Starratt, A. N., and Brown, B. E. (1975). Life Sci. 17, 1253–1256.
Strelets, V. B., and Lim, H. A. (1995). Comput. Appl. Biosci. 11, 557–561.
Umezawa, H., Aoyagy, T., Ogawa, K., Naganawa, N., and Takeuchi, T. (1984). J. Antibiot. 37, 422–425.
Van Heel, M. (1991). J. Mol. Biol. 220, 877–887.
Walker, J. R., and Willett, P. (1986). Comput. Appl. Biosci. 2, 89–93.
Wirth, N. (1976). In Algorithms + Data Structures = Programs, Prentice-Hall, Englewood Cliffs, New Jersey.
Cite this article
Petrilli, P., Tonukari, N.J. PFDB: A Protein Families DataBase for Macintosh Computers. The Effectiveness of Its Organization in Searching for Protein Similarity. J Protein Chem 16, 713–720 (1997). https://doi.org/10.1023/A:1026310621698
DOI: https://doi.org/10.1023/A:1026310621698