PFDB: A Protein Families DataBase for Macintosh Computers. The Effectiveness of Its Organization in Searching for Protein Similarity

Petrilli, P.; Tonukari, Nyerhovwo J.

doi:10.1023/A:1026310621698

Published: 01 October 1997

Volume 16, pages 713–720, (1997)
Journal of Protein Chemistry

P. Petrilli & Nyerhovwo J. Tonukari

Abstract

A protein sequence database (PFDB) containing about 11,000 entries is available for Macintosh computers. The PFDB can be easily updated by importing sequences from the PIR collection over the Internet. The most important feature of the database is its organization into families of closely related sequences, each family being characterized by its average dipeptide composition [Petrilli (1993), Comput. Appl. Biosci. 9, 205–209]. This allows a rapid and sensitive protein similarity search: the precalculated dipeptide composition of each family is compared with that of the query sequence by means of a linear correlation coefficient. An example application is reported in which a new protein was classified using the sequence of a fragment just 19 residues long.
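The search strategy summarized in the abstract can be illustrated with a minimal Python sketch. It is not the PFDB implementation, whose details are not given on this page. The sketch assumes that dipeptides are counted as overlapping pairs over the 20 standard residues, that a family profile is the unweighted mean of its members' compositions, and that the linear correlation coefficient is the Pearson coefficient. All function names (`dipeptide_composition`, `family_composition`, `pearson`, `rank_families`) are hypothetical.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumption: 20 standard residues only
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs


def dipeptide_composition(sequence):
    """Relative frequency of each of the 400 overlapping dipeptides."""
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs containing non-standard symbols
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def family_composition(sequences):
    """Assumed family profile: unweighted mean of member compositions."""
    vectors = [dipeptide_composition(s) for s in sequences]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def pearson(x, y):
    """Pearson linear correlation coefficient of two equal-length vectors."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # correlation is undefined for a constant vector
    return sxy / sqrt(sxx * syy)


def rank_families(query, family_profiles):
    """Score the query against precalculated profiles, best match first."""
    q = dipeptide_composition(query)
    scores = [(name, pearson(q, profile))
              for name, profile in family_profiles.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy, made-up sequences; not real PIR entries.
    profiles = {
        "family_A": family_composition(["ACACACACAC", "ACACACAC"]),
        "family_B": family_composition(["GGGGWWGGGG"]),
    }
    for name, r in rank_families("ACACAC", profiles):
        print(name, round(r, 3))
```

Because the family profiles are computed once and stored, a query costs one 400-element correlation per family rather than one alignment per database entry, which is the source of the speed the abstract describes.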


REFERENCES

Blaisdell, B. E. (1989). J. Mol. Evol. 29, 526–537.
Callebaut, C., Krust, B., Jacotot, E., and Hovanessian, A. G. (1993). Science 262, 2045–2050.
Doolittle, R. F. (1986). Of Urfs and Orfs: A Primer on How to Analyze Derived Amino Acid Sequences, University Science Books, Mill Valley, California.
Fuchs, R., and Cameron, G. N. (1991). Prog. Biophys. Mol. Biol. 56, 215–245.
George, D. G., Hunt, L. T., and Barker, W. C. (1988). Macromolecular Sequencing and Synthesis. Selected Methods and Applications, Liss, New York, pp. 127–149.
Hobohm, U., and Sander, V. (1995). J. Mol. Biol. 251, 390–399.
Kennedy, M. B. (1995). Trends Biol. Sci. 20, 349–350.
Lipman, D. J., and Pearson, W. R. (1985). Science 227, 1435–1441.
Pabo, C. O. (1987). Nature 327, 467.
Pearson, W. R., and Lipman, D. J. (1988). Proc. Natl. Acad. Sci. USA 85, 2444–2448.
Petrilli, P. (1993). Comput. Appl. Biosci. 9, 205–209.
Pongor, S. (1988). Nature 332, 24.
Rawlings, C. J. (1988). Nature 334, 477.
Snedecor, G. W., and Cochran, W. G. (1967). Statistical Methods, Iowa State University Press, Ames, Iowa.
Starratt, A. N., and Brown, B. E. (1975). Life Sci. 17, 1253–1256.
Strelets, V. B., and Lim, H. A. (1995). Comput. Appl. Biosci. 11, 557–561.
Umezawa, H., Aoyagy, T., Ogawa, K., Naganawa, N., and Takeuchi, T. (1984). J. Antibiot. 37, 422–425.
Van Heel, M. (1991). J. Mol. Biol. 220, 887–887.
Walker, J. R., and Willett, P. (1986). Comput. Appl. Biosci. 2, 89–93.
Wirth, N. (1976). In Algorithms + Data Structures = Programs, Prentice-Hall, Englewood Cliffs, New Jersey.

Author information

P. Petrilli
Present address: Dipartimento di Scienze Agronomiche e Genetica Vegetale, Università degli Studi di Napoli “Federico II”, 80055, Portici, Italy

Authors and Affiliations

Dipartimento di Scienza dell'Alimentazione, Università degli Studi di Napoli “Federico II”, Portici, Italy
P. Petrilli
International Institute of Tropical Agriculture, Ibadan, Nigeria
Nyerhovwo J. Tonukari

Corresponding author

Correspondence to P. Petrilli.

About this article

Cite this article

Petrilli, P., Tonukari, N.J. PFDB: A Protein Families DataBase for Macintosh Computers. The Effectiveness of Its Organization in Searching for Protein Similarity. J Protein Chem 16, 713–720 (1997). https://doi.org/10.1023/A:1026310621698

Issue Date: October 1997
