Efficient construction of comprehensible hierarchical clusterings

Talavera, Luis; Béjar, Javier

doi:10.1007/BFb0094809

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1510)

Included in the conference series: European Symposium on Principles of Data Mining and Knowledge Discovery

Abstract

Clustering is an important data mining task that helps find useful patterns to summarize the data. In the KDD context, data mining is often used for description rather than prediction. However, in both the statistics and machine learning fields, it is difficult to find clustering systems that ease the user's task of interpreting the results. In this paper we present Isaac, a hierarchical clustering system that combines traditional clustering ideas with a feature selection mechanism and heuristics in order to provide comprehensible results. At the same time, a preprocessing step allows it to deal efficiently with large datasets. Results suggest that these aims are achieved and encourage further research.

Author information

Authors and Affiliations

Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, Campus Nord, Mòdul C6, Jordi Girona 1-3, 08034, Barcelona, Catalonia, Spain
Luis Talavera & Javier Béjar

Editor information

Jan M. Żytkow, Mohamed Quafafou

Cite this paper

Talavera, L., Béjar, J. (1998). Efficient construction of comprehensible hierarchical clusterings. In: Żytkow, J.M., Quafafou, M. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1998. Lecture Notes in Computer Science, vol 1510. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0094809

DOI: https://doi.org/10.1007/BFb0094809
Published: 19 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65068-3
Online ISBN: 978-3-540-49687-8
eBook Packages: Springer Book Archive
