Abstract
Two approaches to solving the problem of clustering with gaps for a specified number of clusters are considered. The first approach is based on restoring the values of unknown attributes and solving the problem of clustering of calculated complete data. The second approach is based on solving a finite set of tasks of clustering of corresponding to incomplete data complete sample descriptions and the construction of collective decision. For both approaches, the clustering quality criteria have been proposed as functions of incomplete descriptions. Results of practical experiments are considered.
Chapter PDF
Similar content being viewed by others
References
Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data. Wiley, New York (1987)
Zloba, E.: Statistical methods of reproducing of missing data. J. Computer Modelling & New Technologies 6(1), 51–61 (2002)
Zhang, S.: Parimputation: From imputation and null-imputation to partially imputation. IEEE Intelligent Informatics Bulletin 9(1), 32–38 (2008)
Honghai, F., Guoshun, C., Cheng, Y., Bingru, Y., Yumei, C.: A SVM Regression Based Approach to Filling in Missing Values. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds.) KES 2005. LNCS (LNAI), vol. 3683, pp. 581–587. Springer, Heidelberg (2005)
Sarkar, M., Leong, T.-Y.: Fuzzy k-means Clustering with Missing Values. In. AMIA Symp., pp. 588–592 (2001)
Honda, K., Ichihashi, H.: Linear Fuzzy Clustering Techniques With Missing Values and Their Application to Local Principal Component Analysis. IEEE Transactions on Fuzzy Systems 12(2), 183–193 (2004)
Wagstaff, K.: Clustering with missing values: No imputation required. In: Meeting of the International Federation of Classification Societies “Classification, Clustering, and Data Mining”, pp. 649–658. Springer (2004)
Ryazanov, V.: Some Imputation Algorithms for Restoration of Missing Data. In: San Martin, C., Kim, S.-W. (eds.) CIARP 2011. LNCS, vol. 7042, pp. 372–379. Springer, Heidelberg (2011)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley Interscience (2001)
Ryazanov, V.V.: The committee synthesis of pattern recognition and classification algorithms, Zh. Vychisl. Mat. i Mat. Fiziki 21(6), 1533–1543 (1981) (in Russian) (Printed in Great Britain, 1982. Pergamon Press. Ltd.)
Biryukov, A.S., Ryazanov, V.V., Shmakov, A.S.: Solving Clusterization Problems Using Groups of Algorithms. Zh. Vychisl. Mat. i Mat. Fiziki 48(1), 176–192 (2008) (Printed in Great Britain, 2008. Pergamon Press. Ltd.)
Mangasarian, O.L., Wolberg, W.H.: Cancer diagnosis via linear programming. SIAM News 23(5), 1–18 (1990)
Arseev, A.S., Kotochigov, K.L., Ryazanov, V.V.: Universal criteria for clustering and stability problems. In: 13th All-Russian Conference “Mathematical Methods for Pattern Recognition”, pp. 63–64. S.-Peterburg (2007) (in Russian)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ryazanov, V.V. (2012). Clustering of Incomplete Data and Evaluation of Clustering Quality. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2012. Lecture Notes in Computer Science, vol 7441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33275-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-33275-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33274-6
Online ISBN: 978-3-642-33275-3
eBook Packages: Computer ScienceComputer Science (R0)