Abstract
Decision tree learners inspect the marginal class distributions of numerical attributes to infer a predicate that can be used as a decision node in the tree. Because such discretization techniques examine only the marginal distributions, they may fail completely to predict the class correctly, even in cases for which a decision tree with a 100% classification rate exists. In this paper, an objective function-based clustering algorithm is modified to yield a discretization of numerical variables that overcomes these problems. The underlying clustering algorithm is the fuzzy c-means algorithm, which is modified to (a) take the class information into account and (b) organize all cluster prototypes in a regular grid, such that the grid rather than the individual clusters is optimized.
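The failure mode described in the abstract can be illustrated with a small sketch. It is not taken from the paper: the checkerboard (XOR-like) data set, the use of information gain as the split criterion, and all function names are assumptions chosen for illustration. On this data every threshold on a single attribute leaves both classes equally frequent on each side, so a criterion that looks only at marginal class distributions sees no useful split, although a two-level tree separates the classes perfectly.

```python
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    result = 0.0
    for c in set(labels):
        p = labels.count(c) / n
        result -= p * log2(p)
    return result

def info_gain(values, labels, threshold):
    """Information gain of the binary split `value <= threshold`."""
    left = [y for v, y in zip(values, labels) if v <= threshold]
    right = [y for v, y in zip(values, labels) if v > threshold]
    n = len(labels)
    return (entropy(labels)
            - len(left) / n * entropy(left)
            - len(right) / n * entropy(right))

# Hypothetical checkerboard data on a regular 10 x 10 grid in the unit square:
# the class is 1 exactly when the point lies in the upper-left or lower-right quadrant.
points = [((i + 0.5) / 10, (j + 0.5) / 10) for i in range(10) for j in range(10)]
labels = [int((x > 0.5) != (y > 0.5)) for x, y in points]

xs = [p[0] for p in points]
ys = [p[1] for p in points]

# Candidate thresholds: every grid line between 0.1 and 0.9.
thresholds = [k / 10 for k in range(1, 10)]
best_gain_x = max(info_gain(xs, labels, t) for t in thresholds)
best_gain_y = max(info_gain(ys, labels, t) for t in thresholds)

def two_level_tree(x, y):
    """Hand-built tree: split on x at 0.5, then on y at 0.5 in each branch."""
    if x <= 0.5:
        return 1 if y > 0.5 else 0
    return 0 if y > 0.5 else 1

accuracy = sum(two_level_tree(x, y) == c
               for (x, y), c in zip(points, labels)) / len(points)

print(f"best gain on x: {best_gain_x:.3f}, best gain on y: {best_gain_y:.3f}")
print(f"accuracy of the two-level tree: {accuracy:.2f}")
```

In this sketch the best single-attribute gain is zero for both attributes, while the hand-built tree classifies every point correctly. The two attributes are informative only jointly, which is the situation a grid of prototypes optimized as a whole is meant to capture.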
Copyright information
© 2006 Springer Berlin · Heidelberg
About this paper
Cite this paper
Höppner, F. (2006). Objective Function-based Discretization. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., Gaul, W. (eds) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31314-1_53
DOI: https://doi.org/10.1007/3-540-31314-1_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31313-7
Online ISBN: 978-3-540-31314-4
eBook Packages: Mathematics and Statistics (R0)