Abstract
It is widely reported in the literature that incremental clustering systems suffer from instance ordering effects and that under some orderings, extremely poor clusterings may be obtained. In this paper we present a new strategy aimed to mitigate these effects, the Not-Yet strategy which has a general and open formulation and it is not coupled to any particular system. Results suggest that the strategy improves the clustering quality and also that performance is limited by its limited foresight. We also show that, when combined with other strategies, the Not-Yet strategy may help the system to get high quality clusterings.
Chapter PDF
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
J. R. Anderson and M. Matessa. Explorations of an incremental, bayesian algorithm for categorization. Machine Learning, (9):275–308, 1992.
J. Béjar. Adquisición automática de conocimiento en dominios poco estructurados. PhD thesis, Facultat d'Informàtica de Barcelona, UPC, 1995.
D. H. Fisher. Knowledge acquisition via incremental conceptual clustering. Machine Learning, (2):139–172, 1987.
D. H. Fisher. Optimization and simplification of hierarchical clusterings. Journal of Artificial Intelligence Research, (4):147–180, 1995.
D. H. Fisher and P. Langley. Conceptual clustering and its relation to numerical taxonomy. In W. A. Gale, editor, Artificial Intelligence and Statistics. Addison-Wesley, Reading,MA, 1986.
D. H. Fisher, L. Xu, and N. Zard. Ordering effects in clustering. In Proceedings of the Ninth International Conference on Machine Learning, pages 163–168, 1992.
J. H. Gennari, P. Langley, and D. Fisher. Models of incremental concept formation. Artificial Intelligence, (40):11–61, 1989.
P. Langley. Order effects in incremental learning. In P. Reimann and H. Spada, editors, Learning in humans and machines: Towards an Interdisciplinary Learning Science. Pergamon, 1995.
M. Lebowitz. Deferred commitment in unimem: waiting to learn. In Proceedings of the Fifth International Conference on Machine Learning, pages 80–86, 1988.
J. Roure. Study of methods and heuristics to improve the fuzzy classifications of LINNEO+. Master's thesis, Facultat d'Informática de Barcelona, UPC, 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Talaveral, L., Roure, J. (1998). A buffering strategy to avoid ordering effects in clustering. In: Nédellec, C., Rouveirol, C. (eds) Machine Learning: ECML-98. ECML 1998. Lecture Notes in Computer Science, vol 1398. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026702
Download citation
DOI: https://doi.org/10.1007/BFb0026702
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64417-0
Online ISBN: 978-3-540-69781-7
eBook Packages: Springer Book Archive