Abstract
Understanding and managing genomic data has become a major bottleneck of biomedical research that calls for novel informatics approaches. The task is immensely complex: to “understand” the role of a protein for example implies inserting it into a host of interconnected and evolving frameworks of biological knowledge, including 3-D structures, molecular interactions, biochemical pathyways, genomic locations, spatial and temporal roles within the cell, the organism, the population and the species. Similarity based predictions play an important role in this process: similar biological functions or roles are mostly inferred from similar structure or similar molecular interactions, etc. This is usually carried out by comparing a protein sequence with a database of known sequences, using such programs as BLAST (Altschul et al., 1990; Altschul et al., 1997).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. and Lipman, D. J., 1990, Basic Local Alignment Search Tool.J. Mol. Biol. 215: 403-10.
Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W. and Lipman,D. J., 1997, Gapped Blast and Psi-Blast: A New Generation of Protein Database Search Programs.Nucleic Acids Res. 25: 3389-402.
Atwood, T. K., 2000, The Role of Pattern Databases in Sequence Analysis.Briefings in Bioinformatics 1: 45-59.
Gribskov, M., McLachlan, A. D. and Eisenberg, D., 1987, Profile Analysis: Detection of Distantly Related Proteins.Proc. Natl. Acad. Sci. 84: 4355-4358.
Hegyi, H. and Pongor, S., 1993, Predicting Potential Domain Homologies from Fasta Search Results.Comput. Appl. Biosci. 9: 371-2.
Krause, A., Stoye, J. and Vingron, M., 2000, The Systers Protein Sequence Cluster Set.Nucleic Acids Res. 28: 270-2.
Murvai, J., Vlahovicek, K., Barta, E., Cataletto, B. and Pongor, S., 2000a, The Sbase Protein Domain Library, Release 7.0: A Collection of Annotated Protein Sequence Segments.Nucleic Acids Res. 28: 260-2.
Murvai, J., Vlahovicek, K., Barta, E., Parthasarathy, S., Hegyi, H., Pfeiffer, F. and Pongor, S.,1999, The Domain-Server: Direct Prediction of Protein Domain-Homologies from Blast Search.Bioinformatics 15: 343-4.
Murvai, J., Vlahovicek, K. and Pongor, S., 2000b, A Simple Probabilistic Scoring Method for Protein Domain Identification.Bioinformatics, in press.
Murvai, J., Vlahovicek, K., Szepesvári, C. and Pongor, S., 2000c, Prediction of Protein Functional Domains from Sequences Using Artificial Neural Networks.Genom Research,in press.
Ripley, B. D. and Hjort, N. L., 1995,Pattern Recognition and Neural Networks.Cambridge University Press. Cambridge.
Smith, E. E. and Medin, D. L., 1981,Cathegories and Concepts.Harvard University Press,Cambridge.
Sonnhammer, E. L. L., Eddy, S. R. and Durbin, R., 1997, Pfam: A Comprehensive Database of Protein Domain Families Based on Seed Alignments.Proteins 28: 405-420.
Stanfíll, C. and Waltz, D., 1986, Toward Memory-Based Reasoning. Communications of theACM 29: 1213-1228.
Tatusov, R. L., Galperin, M. Y., Natale, D. A. and Koonin, E. V., 2000, The Cog Database: A Tool for Genome-Scale Analysis of Protein Functions and Evolution.Nucleic Acids Res. 28: 33-6.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media New York
About this chapter
Cite this chapter
Murvai, J., Vlahoviček, K., Pongor, S. (2001). Towards a Memory-Based Interpretation of Proteome Data. In: Pifat-Mrzljak, G. (eds) Supramolecular Structure and Function 7. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1363-6_13
Download citation
DOI: https://doi.org/10.1007/978-1-4615-1363-6_13
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5517-5
Online ISBN: 978-1-4615-1363-6
eBook Packages: Springer Book Archive