Abstract
Clustering is an important unsupervised learning tool to find natural grouping of instances from a given unlabeled data based on some similarity/dissimilarity measures. However, clustering the data points at the boundary of multiple overlapping clusters is really a challenge. The challenge also lies in many real life applications where the boundaries among clusters are not rigid and clear. There are several hard and soft clustering algorithms which attempt to address these challenges. Hard clustering algorithms though perform well in the presence of well separated clusters, it shows poor performance in presence of such difficulties mentioned above. Researchers have recommended several soft clustering algorithms for handling such dilemma. In this article, we have proposed different schemes toward improvement of Fuzzy clustering employing supervised classification. The superiority of our prescribed schemes are demonstrated by comparing its performance with that of Fuzzy c-means clustering. The comparison has also been made among different schemes we have proposed in this article.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hansen, P., Jaumard, B.: Cluster analysis and mathematical programming. Math. Program. 79(1–3), 191–215 (1997)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall Inc., Upper Saddle River (1988)
Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: An efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 881–892 (2002)
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)
Maji, P., Paul, S.: Robust rough-fuzzy c-means algorithm: design and applications in coding and non-coding RNA expression data clustering. Fundam. Inf. 124, 153–174 (2013)
Maji, P., Pal, S.K.: Rough set based generalized fuzzy \(c\) -means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. Part B Cybern. 37(6), 1529–1540 (2007)
Maji, P., Roy, S.: Rough-fuzzy clustering and multiresolution image analysis for text-graphics segmentation. Appl. Soft Comput. 30, 705–721 (2015)
Pedrycz, W.: Collaborative fuzzy clustering. Pattern Recogn. Lett. 23, 1675–1686 (2002)
Shen, Y., Pedrycz, W.: Collaborative fuzzy clustering algorithm: some refinements. Int. J. Approximate Reasoning 86, 41–61 (2017)
Mukhopadhyay, A., Maulik, U.: Towards improving fuzzy clustering using support vector machine: application to gene expression data. Pattern Recogn. 42(11), 2744–2763 (2009)
Bandyopadhyay, S., Mukhopadhyay, A., Maulik, U.: An improved algorithm for clustering gene expression data. Bioinformatics 23(21), 2859–2865 (2007)
Groll, L., Jakel, J.: A new convergence proof of fuzzy c-means. IEEE Trans. Fuzzy Syst. 13(5), 717–720 (2005)
Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)
Altman, N.S.: An introduction to kernel and nearest-neighbor nonparametric regression. Taylor and Francis 46(3), 175–185 (1992)
Samworth, R.J.: Optimal weighted nearest neighbour classifiers. Ann. Stat. 40(5), 2733–2763 (2012)
Yeung, K.Y., Ruzzo, W.: Details of the adjusted rand index and clustering algorithms supplement to the paper (an emperical study on principal component analysis for clustering gene expression data). Bioinformatics 17(3), 763–774 (2001)
Maulik, U., Mukhopadhyay, A., Bandyopadhyay, S.: Efficient clustering with multiclass point identification. J. 3D Images 17, 763–774 (2001)
Saha, I., Mukhopadhyay, A.: Improved crisp and fuzzy clustering techniques for categorical data. IAENG Int. J. Comput. Sci. 35(1), 438–450 (2008)
Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U.: When is “nearest neighbor” meaningful? In: Beeri, C., Buneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 217–235. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-49257-7_15
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mallick, A.K., Mukhopadhyay, A. (2019). Different Schemes for Improving Fuzzy Clustering Through Supervised Learning. In: Mandal, J., Mukhopadhyay, S., Dutta, P., Dasgupta, K. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2018. Communications in Computer and Information Science, vol 1030. Springer, Singapore. https://doi.org/10.1007/978-981-13-8578-0_13
Download citation
DOI: https://doi.org/10.1007/978-981-13-8578-0_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8577-3
Online ISBN: 978-981-13-8578-0
eBook Packages: Computer ScienceComputer Science (R0)