ILP, the Blind, and the Elephant: Euclidean Embedding of Co-proven Queries

  • Hannes Schulz
  • Kristian Kersting
  • Andreas Karwath
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5989)

Abstract

Relational data is complex. This complexity makes one of the basic steps of inductive logic programming (ILP) difficult: understanding the data and the results. Users who cannot easily understand them draw incomplete conclusions. The situation is much like the parable of the blind men and the elephant, which appears in many cultures. In this tale, the blind men work independently and with quite different pieces of information, thereby drawing very different conclusions about the nature of the beast. In contrast, visual representations make it easy to shift from one perspective to another while exploring and analyzing data. This paper describes a method for embedding interpretations and queries into a single, common Euclidean space based on their co-proven statistics. We demonstrate our method on real-world datasets, showing that ILP results can indeed be captured at a glance.

Keywords

Environmental Estrogen; Weighted Neighbour; Instance Base Learning; Inductive Logic Program; Common Query
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Hannes Schulz (1)
  • Kristian Kersting (2)
  • Andreas Karwath (1)

  1. Institut für Informatik, Albert-Ludwigs Universität, Freiburg, Germany
  2. Dept. of Knowledge Discovery, Fraunhofer IAIS, Schloss Birlinghoven, St Augustin, Germany
