On Subgroup Discovery in Numerical Domains

Grosskreutz, Henrik; Rüping, Stefan

doi:10.1007/978-3-642-04180-8_15

Henrik Grosskreutz²² &
Stefan Rüping²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5781))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

2556 Accesses
7 Citations

Abstract

Subgroup discovery is a Knowledge Discovery task that aims at finding subgroups of a population with high generality and distributional unusualness. While several subgroup discovery algorithms have been presented in the past, they focus on databases with nominal attributes or make use of discretization to get rid of the numerical attributes. In this paper, we illustrate why the replacement of numerical attributes by nominal attributes can result in suboptimal results. Thereafter, we present a new subgroup discovery algorithm that prunes large parts of the search space by exploiting bounds between related numerical subgroup descriptions. The same algorithm can also be applied to ordinal attributes. In an experimental section, we show that the use of our new pruning scheme results in a huge performance gain when more that just a few split-points are considered for the numerical attributes.

This is an extended abstract of an article published in the Data Mining and Knowledge Discovery journal [1].

Download to read the full chapter text

Chapter PDF

DISDi: Discontinuous Intervals in Subgroup Discovery

Optimal Subgroup Discovery in Purely Numerical Data

Fast exhaustive subgroup discovery with numerical target concepts

Article 13 November 2015

References

Grosskreutz, H., Rüping, S.: On Subgroup Discovery in Numerical Domains. Data Mining and Knowledge Discovery (2009) doi: 10.1007/s10618-009-0136-3
Google Scholar

Download references

Author information

Authors and Affiliations

Fraunhofer IAIS, Schloss Birlinghoven, Sankt Augustin, Germany
Henrik Grosskreutz & Stefan Rüping

Authors

Henrik Grosskreutz
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Rüping
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

NICTA, Locked Bag 8001, Canberra, 2601, Australia and Helsinki Institute of IT,, Finland
Wray Buntine
Dept. of Knowledge Technologies, Jožef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Marko Grobelnik & Dunja Mladenić &
University College London, The Centre for Computational Statistics and Machine Learning Department of Computer Science, Gower St., WC1E 6BT, London, UK
John Shawe-Taylor

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Grosskreutz, H., Rüping, S. (2009). On Subgroup Discovery in Numerical Domains. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2009. Lecture Notes in Computer Science(), vol 5781. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04180-8_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-04180-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04179-2
Online ISBN: 978-3-642-04180-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

On Subgroup Discovery in Numerical Domains

Abstract

Chapter PDF

Similar content being viewed by others

DISDi: Discontinuous Intervals in Subgroup Discovery

Optimal Subgroup Discovery in Purely Numerical Data

Fast exhaustive subgroup discovery with numerical target concepts

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

On Subgroup Discovery in Numerical Domains

Abstract

Chapter PDF

Similar content being viewed by others

DISDi: Discontinuous Intervals in Subgroup Discovery

Optimal Subgroup Discovery in Purely Numerical Data

Fast exhaustive subgroup discovery with numerical target concepts

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation