A Multi-relational Hierarchical Clustering Method for Datalog Knowledge Bases

  • Nicola Fanizzi
  • Claudia d’Amato
  • Floriana Esposito
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4994)


A clustering method is presented which can be applied to relational knowledge bases (e.g. Datalog deductive databases). It can be used to discover interesting groups of resources through their (semantic) annotations expressed in the standard logic programming languages. The method exploits an effective and language-independent semi-distance measure for individuals., that is based on the resource semantics w.r.t. a number of dimensions corresponding to a committee of features represented by a group of concept descriptions (discriminating features). The algorithm is a fusion of the classic Bisecting k-Means with approaches based on medoids that are typically applied to relational representations. We discuss its complexity and potential applications to several tasks.


Dissimilarity Measure Inductive Logic Inductive Logic Programming Concept Description Conceptual Cluster 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    d’Amato, C., Fanizzi, N., Esposito, F.: Induction of optimal semantic semi-distances for clausal knowledge bases. In: Proceedings of the 17th International Conference on Inductive Logic Programming, ILP 2007. LNCS (LNAI), Springer, Heidelberg (2007)Google Scholar
  2. 2.
    Emde, W., Wettschereck, D.: Relational instance-based learning. In: Saitta, L. (ed.) Proceedings of the 13th International Conference on Machine Learning, ICML 1996, pp. 122–130. Morgan Kaufmann, San Francisco (1996)Google Scholar
  3. 3.
    Fanizzi, N., Iannone, L., Palmisano, I., Semeraro, G.: Concept formation in expressive description logics. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 99–113. Springer, Heidelberg (2004)Google Scholar
  4. 4.
    Hutchinson, A.: Metrics on terms and clauses. In: van Someren, M., Widmer, G. (eds.) ECML 1997. LNCS, vol. 1224, pp. 138–145. Springer, Heidelberg (1997)Google Scholar
  5. 5.
    Jain, A., Murty, M., Flynn, P.: Data clustering: A review. ACM Computing Surveys 31(3), 264–323 (1999)CrossRefGoogle Scholar
  6. 6.
    Kaufman, L., Rousseeuw, P.: Finding Groups in Data: an Introduction to Cluster Analysis. John Wiley & Sons, Chichester (1990)Google Scholar
  7. 7.
    Nienhuys-Cheng, S., de Wolf, R.: Foundations of Inductive Logic Programming. LNCS (LNAI), vol. 1228. Springer, Heidelberg (1997)Google Scholar
  8. 8.
    Ramon, J., Bruynooghe, M.: A framework for defining distances between first-order logic objects. Technical Report CW 263, Department of Computer Science, Katholieke Universiteit Leuven (1998)Google Scholar
  9. 9.
    Sebag, M.: Distance induction in first order logic. In: Džeroski, S., Lavrač, N. (eds.) ILP 1997. LNCS, vol. 1297, pp. 264–272. Springer, Heidelberg (1997)Google Scholar
  10. 10.
    Stepp, R.E., Michalski, R.S.: Conceptual clustering of structured objects: A goal-oriented approach. Artificial Intelligence 28(1), 43–69 (1986)CrossRefGoogle Scholar
  11. 11.
    Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach. Springer, Heidelberg (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Nicola Fanizzi
    • 1
  • Claudia d’Amato
    • 1
  • Floriana Esposito
    • 1
  1. 1.Dipartimento di InformaticaUniversità degli Studi di BariBariItaly

Personalised recommendations