DS 2001: Discovery Science pp 100-112 | Cite as

Towards Discovery of Deep and Wide First-Order Structures: A Case Study in the Domain of Mutagenicity

  • Tamás Horváth
  • Stefan Wrobel
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2226)

Abstract

In recent years, it has been shown that methods from Inductive Logic Programming (ILP) are powerful enough to discover new fist-order knowledge from data, while employing a clausal representation language that is relatively easy for humans to understand. Despite these successes, it is generally acknowledged that there are issues that present fundamental challenges for the current generation of systems. Among these, two problems are particularly prominent: learning deep clauses, i.e., clauses where a long chain of literals is needed to reach certain variables, and learning wide clauses, i.e., clauses with a large number of literals. In this paper we present a case study to show that by building on positive results on acyclic conjunctive query evaluation in relational database theory, it is possible to construct ILP learning algorithms that are capable of discovering clauses of significantly greater depth and width. We give a detailed description of the class of clauses we consider, describe a greedy algorithm to workwith these clauses, and show, on the popular ILP challenge problem of mutagenicity, how indeed our method can go beyond the depth and width barriers of current ILP systems.

Keywords

Query Evaluation Inductive Logic Program Conjunctive Query Membership Problem Inductive Logic Program System 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, Reading, Mass., 1995.Google Scholar
  2. 2.
    H. Arimura. Learning acyclic first-order Horn sentences from entailment. In M. Li and A. Maruoka, editors, Proceedings of the 8th International Workshop on AlgorithmicLearning Theory, volume 1316of LNAI, pages 432–445, Springer, Berlin, 1997.Google Scholar
  3. 3.
    C. Beeri, R. Fagin, D. Maier, and M. Yannakakis. On the desirability of acyclic database schemes. Journal of the ACM, 30(3):479–513, 1983.MATHCrossRefMathSciNetGoogle Scholar
  4. 4.
    P. A. Bernstein and N. Goodman. The power of natural semijoins. SIAM Journal on Computing, 10(4):751–771, 1981.MATHCrossRefMathSciNetGoogle Scholar
  5. 5.
    H. Blockeel and L. D. Raedt. Lookahead and discretization in ILP. In N. Lavraĉ and S. Džeroski, editors, Proceedings of the 7th International Workshop on Inductive Logic Programming, volume 1297 of LNAI, pages 77–84, Springer, Berlin, 1997.Google Scholar
  6. 6.
    A. K. Chandra and P. M. Merlin. Optimal implementations of conjunctive queries in relational databases. In Proceedings of the 9th ACM Symposium on Theory of Computing, pages 77–90. ACM Press, 1977.Google Scholar
  7. 7.
    C. Chekuri and A. Rajaraman. Conjunctive query containment revisited. Theoretical Computer Science, 239(2):211–229, 2000.MATHCrossRefMathSciNetGoogle Scholar
  8. 8.
    T. H. Cormen, C. E. Leiserson, and R. L. Rivest. Introduction to Algorithms. MIT Press, Cambridge, Mass., 1990.Google Scholar
  9. 9.
    R. Fagin. Degrees of acyclicity for hypergraphs and relational database schemes. Journal of the ACM, 30(3):514–550, 1983.MATHCrossRefMathSciNetGoogle Scholar
  10. 10.
    A. Giordana and L. Saitta. Phase transitions in relational learning. Machine Learning, 41(2):217–251, 2000.MATHCrossRefGoogle Scholar
  11. 11.
    G. Gottlob. Subsumption and implication. Information Processing Letters, 24(2):109–111, 1987.MATHCrossRefMathSciNetGoogle Scholar
  12. 12.
    G. Gottlob and A. Leitsch. On the efficiency of subsumtion algorithms. Journal of the ACM, 32(2):280–295, 1985.MATHCrossRefMathSciNetGoogle Scholar
  13. 13.
    G. Gottlob, N. Leone, and F. Scarcello. The complexity of acyclic conjunctive queries. In Proceedings of the 39th Annual Symposium on Foundations of Computer Science, pages 706–715. IEEE Computer Society Press, 1998.Google Scholar
  14. 14.
    M. Graham. On the universal relation. Technical report, Univ. of Toronto, Toronto, Canada, 1979.Google Scholar
  15. 15.
    K. Hirata. On the hardness of learning acyclic conjunctive queries. In Proceedings of the 11th International Conference on Algorithmic Learning Theory, volume 1968 of LNAI, pages 238–251. Springer, Berlin, 2000.CrossRefGoogle Scholar
  16. 16.
    T. Horváth and G. Turán. Learning logic programs with structured background knowledge. Artificial Intelligence, 128(1–2):31–97, 2001.MATHCrossRefMathSciNetGoogle Scholar
  17. 17.
    J.-U. Kietz. Some lower bounds for the computational complexity of inductive logic programming. In P. Brazdil, editor, Proceedings of the European Conference on Machine Learning, volume 667 of LNAI, pages 115–123. Springer, Berlin, 1993.Google Scholar
  18. 18.
    J.-U. Kietz and M. Lübbe. An efficient subsumption algorithm for inductive logic programming. In W. Cohen and H. Hirsh, editors, Proc. Eleventh International Conference on Machine Learning (ML-94), pages 130–138, 1994.Google Scholar
  19. 19.
    Kolaitis and Vardi. Conjunctive-query containment and constraint satisfaction. JCSS: Journal of Computer and System Sciences, 61(2):302–332, 2000.MATHCrossRefMathSciNetGoogle Scholar
  20. 20.
    N. Lavraĉ and S. Džeroski. Inductive Logic Programming: Techniques and Applications. Ellis Horwood, 1994.Google Scholar
  21. 21.
    T. M. Mitchell. Machine Learning. McGraw-Hill, 1997.Google Scholar
  22. 22.
    S. Muggleton. Inverse entailment and Progol. New Generation Computing, 13(3–4):245–286, 1995.CrossRefGoogle Scholar
  23. 23.
    S. Muggleton and L. De Raedt. Inductive logic programming: Theory and methods. The Journal of Logic Programming, 19/20:629–680, 1994.CrossRefGoogle Scholar
  24. 24.
    C. Nédellec, C. Rouveirol, H. Adé, F. Bergadano, and B. Tausend. Declarative bias in ILP. In L. De Raedt, editor, Advances in Inductive Logic Programming, pages 82–103. IOS Press, 1996.Google Scholar
  25. 25.
    S.-H. Nienhuys-Cheng and R. Wolf. Foundations of Inductive Logic Programming, volume 1228 of LNAI. Springer, Berlin, 1997.Google Scholar
  26. 26.
    T. Scheffer, R. Herbrich, and F. Wysotzki. Efficient ø-subsumption based on graph algorithms. In S. Muggleton, editor, Proceedings of the 6th International Workshopon Inductive Logic Programming, volume 1314 of LNAI, pages 212–228, Springer, Berlin, 1997.Google Scholar
  27. 27.
    M. Sebag and C. Rouveirol. Resource-bounded relational reasoning: Induction and deduction through stochastic matching. Machine Learning, 38(1/2):41–62, 2000.MATHCrossRefGoogle Scholar
  28. 28.
    G. Silverstein and M. Pazzani. Relational cliches: Constraining constructive induction during relational learning. In Birnbaum and Collins, editors, Proceedings of the 8th International Workshop on Machine Learning, pages 203–207, Morgan Kaufmann, San Mateo, CA, 1991.Google Scholar
  29. 29.
    A. Srinivasan, S. Muggleton, M. J. E. Sternberg, and R. D. King. Theories for mutagenicity: A study in first-order and feature-based induction. Artificial Intelligence, 85(1/2), 1996.Google Scholar
  30. 30.
    J. D. Ullman. Database and Knowledge-Base Systems, Volumes I and II. Computer Science Press, 1989.Google Scholar
  31. 31.
    L. G. Valiant. A theory of the learnable. Communications of the ACM, 27(11):1134–1142, 1985.CrossRefGoogle Scholar
  32. 32.
    S. Wrobel. Inductive logic programming. In G. Brewka, editor, Advances in Knowledge Representation and Reasoning, pages 153–189. CSLI-Publishers, Stanford, CA, USA, 1996. Studies in Logic, Language and Information.Google Scholar
  33. 33.
    M. Yannakakis. Algorithms for acyclic database schemes. In Proceedings of the 7th Conference on Very Large Databases, Morgan Kaufman pubs. (Los Altos CA), Zaniolo and Delobel(eds), 1981.Google Scholar
  34. 34.
    C. T. Yu and Z. M. Ozsoyoglu. On determining tree query membership of a distributed query. INFOR, 22(3), 1984.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Tamás Horváth
    • 1
  • Stefan Wrobel
    • 2
  1. 1.Institute for Autonomous intelligent SystemsFraunhofer GesellschaftSankt Augustin
  2. 2.IWStto-von-Guericke-Universität MagdeburgMagdeburg

Personalised recommendations