Abstract
Algorithms that extract information from data are required to provide correct information. However, data mining algorithms have an additional requirement. The information they extract must not only be correct, but also useful. The usefulness metric was developed to meet these needs. Although it has been shown to work on classification algorithms, the usefulness metric’s success lies in its ability to be applied to other data mining algorithms. This paper will show two different methods of applying the usefulness metric to a clustering algorithm in order to obtain more useful clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BERRY, M.J.A. and LINOFF, G. (1997): Data Mining Techniques For Marketing, Sales, and Customer Support. John Wiley and Sons, New York, NY.
FAYYAD U., PIATETSKY-SHAPIRO, G. and SMYTH, P. (1996): From Data Mining to Knowledge Discovery: An Overview. In: U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, R. Uthurusamy (Eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park, CA, 1–36.
HAN, J. and KAMBER, M. (2001): Data Mining Concepts and Techniques. Morgan Kaufmann, San Diego, CA.
SILBERSCHATZ, A. and TUZHILIN, A. (1996): What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Transactions on Knowledge and Data Engineering. December. IEEE Computer Society, New York, NY.
ST. CLAIR, C., LIU, C. and PISSINOU, N. (1998) Attribute Weighting: A Method of Applying Domain Knowledge in the Decision Tree Process. In: G. Gardarin, J. French, N. Pissinou, K. Makki, L. Bouganim (Eds.): Proceeding of the Sev enth International Conference on Information and Knowledge Management. ACM Press, New York, NY, 259–266.
ST. CLAIR, C. (2000): A Usefulness Metric and Its Application to Decision Tree Based Classification. Ph.D Dissertation. DePaul University, Chicago, IL.
UTHURUSAMY, R. (1996): Current Challenges and Future Directions. In: U. Fayyad, G. Piatetsky-Shapiro, P. 6Smyth, R. Uthurusamy (Eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park, CA, 561–572.
WEISS, S.M. and INDURKHYA, N. (1998): Predictive Data Mining: A Practical Guide. Morgan Kaufmann, San Francisco, CA.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Clair, C.S. (2003). Finding the Most Useful Clusters: Clustering and the Usefulness Metric. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_55
Download citation
DOI: https://doi.org/10.1007/978-3-642-18991-3_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40354-8
Online ISBN: 978-3-642-18991-3
eBook Packages: Springer Book Archive