Abstract
Collecting unlabelled data is often effortless while labelling them can be difficult. Either the amount of data is too large or samples cannot be assigned a specific class label with certainty. In semi-supervised clustering the aim is to set the cluster centres close to their label-matching samples and unlabelled samples. Kernel based clustering methods are known to improve the cluster results by clustering in feature space. In this paper we propose a semi-supervised kernel based clustering algorithm that minimizes convergently an error function with sample-to-cluster weights. These sample-to-cluster weights are set dependent on the class label, i.e. matching, not-matching or unlabelled. The algorithm is able to use many kernel based clustering methods although we suggest Kernel Fuzzy C-Means, Relational Neural Gas and Kernel K-Means. We evaluate empirically the performance of this algorithm on two real-life dataset, namely Steel Plates Faults and MiniBooNE.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kohonen, T.: Self-organizing maps. Springer, Heidelberg (1997)
Wu, K.-L., Yang, M.-S.: A fuzzy-soft learning vector quantization. Neurocomputing 55(3-4), 681–697 (2003)
Basu, S., Bilenko, M., Mooney, R.J.: A Probabilistic Framework for Semi-Supervised Clustering. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data-Mining, pp. 59–68 (2004)
Kulis, B., Basu, S., Dhillon, I., Mooney, R.: Semi-supervised Graph Clustering: A Kernel Approach. In: Proceedings of the 25th International Conference on Machine Learning, vol. 74(1), pp. 1–22 (2008)
Yan, B., Domeniconi, C.: Exploration of Different Constraints and Query Methods with Kernel-based Semi-Supervised Clustering. In: IEEE International Conference on Systems, Man and Cybernetics, SMC 2006, pp. 829–834 (2006)
Hu, E., Chen, S., Zhang, D., Yin, X.: Semisupervised Kernel Matrix Learning by Kernel Propagation. IEEE Transactions on Neural Networks 21(11) (2010)
Weston, J.: Large-Scale Semi-Supervised Learning. In: Proceedings of NATO Advanced Study Institute on Mining Massive Data Sets for Security, vol. 19, pp. 62–75 (2008)
Zhang, D.Q., Chen, S.C.: Fuzzy clustering using kernel methods. In: International Conference of Control and Automatation, ICCA 2002, pp. 123–128 (2002)
Hammer, B., Hasenfuss, A.: Relational Neural Gas. In: Hertzberg, J., Beetz, M., Englert, R. (eds.) KI 2007. LNCS (LNAI), vol. 4667, pp. 190–204. Springer, Heidelberg (2007)
Frank, A., Asuncion, A.: UCI Machine Learning Repository, University of California, School of Information and Computer Sciences, Irvine (2010), http://archive.ics.uci.edu/ml
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Faußer, S., Schwenker, F. (2012). Semi-Supervised Kernel Clustering with Sample-to-Cluster Weights. In: Schwenker, F., Trentin, E. (eds) Partially Supervised Learning. PSL 2011. Lecture Notes in Computer Science(), vol 7081. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28258-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-28258-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28257-7
Online ISBN: 978-3-642-28258-4
eBook Packages: Computer ScienceComputer Science (R0)