Refining Aggregate Conditions in Relational Learning

Vens, Celine; Ramon, Jan; Blockeel, Hendrik

doi:10.1007/11871637_37

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4213)

Included in the following conference series:

European Conference on Principles of Data Mining and Knowledge Discovery

Abstract

In relational learning, predictions for an individual are based not only on its own properties but also on the properties of a set of related individuals. Many systems use aggregates to summarize this set. Features thus introduced compare the result of an aggregate function to a threshold. We consider the case where the set to be aggregated is generated by a complex query and present a framework for refining such complex aggregate conditions along three dimensions: the aggregate function, the query used to generate the set, and the threshold value. The proposed aggregate refinement operator allows a more efficient search through the hypothesis space and thus can be beneficial for many relational learners that use aggregates. As an example application, we have implemented the refinement operator in a relational decision tree induction system. Experimental results show a significant efficiency gain in comparison with the use of a less advanced refinement operator.
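
As a rough illustration of the three refinement dimensions named in the abstract, the Python sketch below represents an aggregate condition as an aggregate function, a query that selects the set to aggregate, and a threshold. All names here (`AggregateCondition`, `AGGREGATES`, the `accounts` data) are hypothetical and not taken from the paper. The sketch shows only the shape of the conditions being refined, not the authors' refinement operator.

```python
from dataclasses import dataclass, replace
from typing import Callable, Dict, List

Record = Dict[str, float]

# Hypothetical aggregate functions; an empty set yields a value that
# can never satisfy a ">= threshold" test with a positive threshold.
AGGREGATES = {
    "count": lambda values: len(values),
    "max": lambda values: max(values) if values else float("-inf"),
}


@dataclass(frozen=True)
class AggregateCondition:
    """Condition of the form: aggregate({r.attribute | query(r)}) >= threshold."""
    function: str                      # dimension 1: the aggregate function
    query: Callable[[Record], bool]    # dimension 2: the query generating the set
    attribute: str
    threshold: float                   # dimension 3: the threshold value

    def holds(self, related: List[Record]) -> bool:
        values = [r[self.attribute] for r in related if self.query(r)]
        return AGGREGATES[self.function](values) >= self.threshold


# Illustrative data: records related to one individual (assumed, not from the paper).
accounts = [
    {"balance": 120.0, "age": 3.0},
    {"balance": 40.0, "age": 1.0},
    {"balance": 300.0, "age": 7.0},
]

base = AggregateCondition("count", lambda r: True, "balance", 2)

# One refinement along each dimension, each derived from the same base condition.
stricter_threshold = replace(base, threshold=4)
narrower_query = replace(base, query=lambda r: r["balance"] > 100)
other_function = replace(base, function="max", threshold=250)

for name, cond in [
    ("base", base),
    ("stricter_threshold", stricter_threshold),
    ("narrower_query", narrower_query),
    ("other_function", other_function),
]:
    print(name, cond.holds(accounts))
```

In this toy setting, narrowing the query can only shrink the aggregated set, so a `count` can only stay the same or drop. Monotonicity properties of this kind are what would let a learner skip candidate refinements, though the exact pruning rules used in the paper are not given in this abstract.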

Author information

Authors and Affiliations

Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001, Leuven, Belgium
Celine Vens, Jan Ramon & Hendrik Blockeel

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

About this paper

Cite this paper

Vens, C., Ramon, J., Blockeel, H. (2006). Refining Aggregate Conditions in Relational Learning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Knowledge Discovery in Databases: PKDD 2006. Lecture Notes in Computer Science, vol 4213. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871637_37

DOI: https://doi.org/10.1007/11871637_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45374-1
Online ISBN: 978-3-540-46048-0
eBook Packages: Computer Science, Computer Science (R0)
