Finding the Most Useful Clusters: Clustering and the Usefulness Metric

Clair, Caroline St.

doi:10.1007/978-3-642-18991-3_55

Caroline St. Clair⁷

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

822 Accesses

Abstract

Algorithms that extract information from data are required to provide correct information. However, data mining algorithms have an additional requirement. The information they extract must not only be correct, but also useful. The usefulness metric was developed to meet these needs. Although it has been shown to work on classification algorithms, the usefulness metric’s success lies in its ability to be applied to other data mining algorithms. This paper will show two different methods of applying the usefulness metric to a clustering algorithm in order to obtain more useful clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

BERRY, M.J.A. and LINOFF, G. (1997): Data Mining Techniques For Marketing, Sales, and Customer Support. John Wiley and Sons, New York, NY.
Google Scholar
FAYYAD U., PIATETSKY-SHAPIRO, G. and SMYTH, P. (1996): From Data Mining to Knowledge Discovery: An Overview. In: U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, R. Uthurusamy (Eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park, CA, 1–36.
Google Scholar
HAN, J. and KAMBER, M. (2001): Data Mining Concepts and Techniques. Morgan Kaufmann, San Diego, CA.
Google Scholar
SILBERSCHATZ, A. and TUZHILIN, A. (1996): What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Transactions on Knowledge and Data Engineering. December. IEEE Computer Society, New York, NY.
Google Scholar
ST. CLAIR, C., LIU, C. and PISSINOU, N. (1998) Attribute Weighting: A Method of Applying Domain Knowledge in the Decision Tree Process. In: G. Gardarin, J. French, N. Pissinou, K. Makki, L. Bouganim (Eds.): Proceeding of the Sev enth International Conference on Information and Knowledge Management. ACM Press, New York, NY, 259–266.
Google Scholar
ST. CLAIR, C. (2000): A Usefulness Metric and Its Application to Decision Tree Based Classification. Ph.D Dissertation. DePaul University, Chicago, IL.
Google Scholar
UTHURUSAMY, R. (1996): Current Challenges and Future Directions. In: U. Fayyad, G. Piatetsky-Shapiro, P. 6Smyth, R. Uthurusamy (Eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park, CA, 561–572.
Google Scholar
WEISS, S.M. and INDURKHYA, N. (1998): Predictive Data Mining: A Practical Guide. Morgan Kaufmann, San Francisco, CA.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

North Central College, 30 N. Brainard St., Naperville, IL, 60540, USA
Caroline St. Clair

Authors

Caroline St. Clair
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Information Systems, University of Mannheim, Schloss, 68131, Mannheim, Germany
Martin Schader
Institute of Decision Theory, University of Karlsruhe, Kaiserstr. 12, 76128, Karlsruhe, Germany
Wolfgang Gaul
Department of Statistics, University of Rome, Piazzale Aldo Moro, 00185, Rome, Italy
Maurizio Vichi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Clair, C.S. (2003). Finding the Most Useful Clusters: Clustering and the Usefulness Metric. In: Schader, M., Gaul, W., Vichi, M. (eds) Between Data Science and Applied Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18991-3_55

Download citation

DOI: https://doi.org/10.1007/978-3-642-18991-3_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40354-8
Online ISBN: 978-3-642-18991-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics