Abstract
This paper describes a new method for determining the optimal number of well-separable clusters in a data set. Many clustering algorithms need this parameter in order to identify the naturally existing clusters correctly. The presented method builds on agglomerative hierarchical clustering and applies a modified RS cluster validity index. In the first phase of the method, clusters are created by hierarchical clustering. Then the k-means algorithm is run for the optimal number of clusters. The method has been applied to multidimensional data, and the results obtained confirm its very good performance.
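The two-phase idea can be illustrated with a minimal sketch. It is not the author's implementation: the abstract does not define the modified RS index, so the code below substitutes the standard RS (R-squared) ratio, (SS_total - SS_within) / SS_total. The centroid linkage, the knee rule in `choose_k`, the seeding of k-means from the hierarchical centroids, and all function names are assumptions made only for illustration.

```python
from itertools import combinations


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    """Component-wise mean of a non-empty list of points."""
    return tuple(sum(col) / len(points) for col in zip(*points))


def within_ss(clusters):
    """Sum of squared distances of every point to its own cluster centroid."""
    total = 0.0
    for c in clusters:
        m = centroid(c)
        total += sum(sq_dist(p, m) for p in c)
    return total


def rs_index(clusters):
    """Standard RS index (assumption: the paper's modified form is not given)."""
    all_points = [p for c in clusters for p in c]
    ss_total = within_ss([all_points])
    return (ss_total - within_ss(clusters)) / ss_total if ss_total else 0.0


def agglomerate(points):
    """Phase 1: repeatedly merge the two clusters with the closest centroids.

    Returns a dict mapping each cluster count k to the partition with k clusters.
    """
    clusters = [[p] for p in points]
    history = {len(clusters): [list(c) for c in clusters]}
    while len(clusters) > 1:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: sq_dist(centroid(clusters[ij[0]]),
                                   centroid(clusters[ij[1]])),
        )
        clusters[i] = clusters[i] + clusters[j]  # i < j, so deleting j is safe
        del clusters[j]
        history[len(clusters)] = [list(c) for c in clusters]
    return history


def choose_k(history, k_max):
    """Assumed rule: pick the knee of the RS curve (largest second difference).

    Requires 2 <= k_max and k_max + 1 <= number of points.
    """
    rs = {k: rs_index(history[k]) for k in range(1, k_max + 2)}
    return max(range(2, k_max + 1),
               key=lambda k: (rs[k] - rs[k - 1]) - (rs[k + 1] - rs[k]))


def kmeans(points, centers, iters=20):
    """Phase 2: plain k-means (Lloyd iterations) from the given start centers."""
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)),
                          key=lambda i: sq_dist(p, centers[i]))
            groups[nearest].append(p)
        # Keep the old center if a group ends up empty.
        centers = [centroid(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups


def cluster(points, k_max):
    """Estimate k from the hierarchy, then refine the partition with k-means."""
    history = agglomerate(points)
    k = choose_k(history, k_max)
    seeds = [centroid(c) for c in history[k]]  # assumed initialisation
    centers, groups = kmeans(points, seeds)
    return k, centers, groups
```

Calling `cluster(points, k_max)` with a list of coordinate tuples returns the estimated number of clusters together with the k-means centers and groups. The pairwise search makes the hierarchical phase cubic in the number of points, so this form is suitable only for small data sets.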
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Starczewski, A. (2013). A Clustering Method Based on the Modified RS Validity Index. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science, vol 7895. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38610-7_23
DOI: https://doi.org/10.1007/978-3-642-38610-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38609-1
Online ISBN: 978-3-642-38610-7
eBook Packages: Computer Science (R0)