Abstract
This paper describes a method for the machine discovery of protein functional models from protein databases using Inductive Logic Programming (ILP). The method uses domain knowledge within ILP to generate hypotheses that predict the functions of a protein from its amino acid sequence. It is based on a top-down search for relative least general generalizations and uses domain knowledge that defines the conceptual hierarchy of protein functions and the search biases. The method effectively discovers protein functional models that explain the relationship between the functions of proteins and their amino acid sequences as described in protein databases. It succeeded in discovering protein functional models for forty membrane proteins, and these models coincide with models conjectured in the molecular biology literature.
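As background for the generalization step named in the abstract, the following is a minimal sketch of Plotkin's least general generalization (lgg) of two terms, computed by anti-unification. It is not the authors' method: it omits the background knowledge that makes the generalization "relative", the conceptual hierarchy, and the search biases. The term encoding, the function name `lgg`, and the predicate `has_segment` with its arguments are hypothetical and chosen only for illustration.

```python
# Sketch of Plotkin-style least general generalization (anti-unification).
# Assumed encoding (not from the paper): a constant is a lowercase string,
# a compound term is a tuple (functor, arg1, arg2, ...), and generated
# variables are strings "X0", "X1", ...

def lgg(t1, t2, table=None):
    """Return the least general generalization of two terms."""
    if table is None:
        table = {}  # maps a pair of differing subterms to its variable
    if t1 == t2:
        return t1  # identical terms generalize to themselves
    if (isinstance(t1, tuple) and isinstance(t2, tuple)
            and t1[0] == t2[0] and len(t1) == len(t2)):
        # Same functor and arity: generalize argument by argument.
        args = tuple(lgg(a, b, table) for a, b in zip(t1[1:], t2[1:]))
        return (t1[0],) + args
    # Differing subterms: reuse one variable per distinct pair so that
    # repeated differences stay linked in the result.
    if (t1, t2) not in table:
        table[(t1, t2)] = f"X{len(table)}"
    return table[(t1, t2)]


# Hypothetical facts about two proteins, for illustration only.
fact_a = ("has_segment", "protein_a", ("helix", "7"))
fact_b = ("has_segment", "protein_b", ("helix", "7"))
print(lgg(fact_a, fact_b))  # ('has_segment', 'X0', ('helix', '7'))
```

Mapping each distinct pair of differing subterms to a single variable is what keeps the result least general: generalizing `("pair", "a", "a")` with `("pair", "b", "b")` gives `("pair", "X0", "X0")` rather than two unrelated variables.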
© 2000 Springer-Verlag Berlin Heidelberg
Cite this paper
Ishikawa, T., Numao, M., Terano, T. (2000). Using Domain Knowledge in ILP to Discover Protein Functional Models. In: Mizoguchi, R., Slaney, J. (eds) PRICAI 2000 Topics in Artificial Intelligence. PRICAI 2000. Lecture Notes in Computer Science, vol 1886. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44533-1_12
DOI: https://doi.org/10.1007/3-540-44533-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67925-7
Online ISBN: 978-3-540-44533-3
eBook Packages: Springer Book Archive