Abstract
Cluster analysis is an important high-level data mining procedure that can be used to identify meaningful groups of objects within large data sets. Various dimension reduction methods are used to reduce the complexity of data before further processing. The lower-dimensional projections of original data sets can be seen as simplified models of the original data. In this paper, several clustering algorithms are applied to low-dimensional projections of complex data sets and compared with each other. The properties and quality of the clustering obtained by each method are evaluated, and the suitability of each method for processing reduced data sets is assessed.
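The workflow outlined in the abstract (reduce dimensionality, cluster the projection, evaluate the result) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not name the methods used, so PCA by power iteration stands in for the dimension reduction step, a deterministic k-means for the clustering step, and the Dunn index for the quality measure. The toy data set and all function names are hypothetical.

```python
import math

def dist(a, b):
    """Euclidean distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pca_project(points, out_dim, iters=200):
    """Project points onto their top out_dim principal components.
    Components are found by power iteration with deflation (illustrative choice)."""
    n, d = len(points), len(points[0])
    mean = [sum(p[j] for p in points) / n for j in range(d)]
    centered = [[p[j] - mean[j] for j in range(d)] for p in points]
    # Population covariance matrix (d x d).
    cov = [[sum(r[a] * r[b] for r in centered) / n for b in range(d)]
           for a in range(d)]
    components = []
    for _comp in range(out_dim):
        start = [float(j + 1) for j in range(d)]  # fixed start vector
        s = math.sqrt(sum(x * x for x in start))
        v = [x / s for x in start]
        for _step in range(iters):
            w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue; deflate to expose the next one.
        lam = sum(v[a] * cov[a][b] * v[b] for a in range(d) for b in range(d))
        cov = [[cov[a][b] - lam * v[a] * v[b] for b in range(d)]
               for a in range(d)]
        components.append(v)
    return [[sum(c[j] * r[j] for j in range(d)) for c in components]
            for r in centered]

def kmeans(points, k, iters=100):
    """Plain k-means; farthest-first initialisation keeps the sketch deterministic."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        far = max(points, key=lambda p: min(dist(p, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(points)
    for _step in range(iters):
        labels = [min(range(k), key=lambda i: dist(p, centroids[i]))
                  for p in points]
        updated = []
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                updated.append([sum(col) / len(members) for col in zip(*members)])
            else:
                updated.append(centroids[i])  # keep the centroid of an empty cluster
        if updated == centroids:
            break
        centroids = updated
    return labels

def dunn_index(points, labels):
    """Smallest between-cluster distance divided by largest cluster diameter.
    Assumes at least two clusters and at least one cluster with distinct points."""
    pairs = [(dist(p, q), a == b)
             for p, a in zip(points, labels) for q, b in zip(points, labels)]
    inter = min(dval for dval, same in pairs if not same)
    intra = max(dval for dval, same in pairs if same)
    return inter / intra

# Hypothetical toy data: two well-separated groups of four points in 4-D.
offsets = [(0, 0, 0, 0), (1, 0, 0, 0), (0, 1, 0, 0), (1, 1, 0, 0)]
data = [list(o) for o in offsets] + [[x + 10 for x in o] for o in offsets]

projected = pca_project(data, 2)         # 4-D -> 2-D projection
labels = kmeans(projected, 2)            # cluster the reduced data
dunn_original = dunn_index(data, labels)
dunn_projected = dunn_index(projected, labels)
print(labels, dunn_original, dunn_projected)
```

Scoring the same labels in both the original and the projected space gives a rough sense of how much cluster separation survives the projection; an actual study would presumably use several algorithms and several quality measures rather than this single pairing.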
Acknowledgements
This work was supported by the IT4Innovations Centre of Excellence project (CZ.1.05/1.1.00/02.0070), funded by the European Regional Development Fund and the national budget of the Czech Republic via the Research and Development for Innovations Operational Programme, and by Project SP2015/146 of the Student Grant System, VŠB – Technical University of Ostrava.
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Krömer, P., Platoš, J. (2016). Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study. In: Stýskala, V., Kolosov, D., Snášel, V., Karakeyev, T., Abraham, A. (eds) Intelligent Systems for Computer Modelling. Advances in Intelligent Systems and Computing, vol 423. Springer, Cham. https://doi.org/10.1007/978-3-319-27644-1_12
DOI: https://doi.org/10.1007/978-3-319-27644-1_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27642-7
Online ISBN: 978-3-319-27644-1
eBook Packages: Engineering, Engineering (R0)