Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study

Krömer, Pavel; Platoš, Jan

doi:10.1007/978-3-319-27644-1_12

Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study

Pavel Krömer⁷ &
Jan Platoš⁷

Conference paper
First Online: 14 February 2016

501 Accesses
2 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 423))

Abstract

Cluster analysis is an important high-level data mining procedure that can be used to identify meaningful groups of objects within large data sets. Various dimension reduction methods are used to reduce the complexity of data before further processing. The lower-dimensional projections of original data sets can be seen as simplified models of the original data. In this paper, several clustering algorithms are used to process low-dimensional projections of complex data sets and compared with each other. The properties and quality of clustering obtained by each method is evaluated and their suitability to process reduced data sets is assessed.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
http://www.cc.gatech.edu/projects/large_models/.

References

Abdi, H.: Metric multidimensional scaling. In: Salkind, N. (ed.) Encyclopedia of Measurement and Statistics, pp. 598–605. Sage, Thousand Oaks (2007)
Google Scholar
Bandyopadhyay, S., Saha, S.: Unsupervised Classification: Similarity Measures, Classical and Metaheuristic Approaches, and Applications. SpringerLink, Bücher. Springer, Berlin (2012), https://books.google.cz/books?id=Vb21R9_rMNoC
Borg, I., Groenen, P., Mair, P.: Mds algorithms. In: Applied Multidimensional Scaling, pp. 81–86. SpringerBriefs in Statistics, Springer, Berlin (2013), http://dx.doi.org/10.1007/978-3-642-31848-1_8
Google Scholar
Burges, C.J.C.: Dimension reduction: a guided tour. Found. Trends Mach. Learn. 2(4) (2010), http://dx.doi.org/10.1561/2200000002
Google Scholar
Cheng, Y.: Mean shift, mode seeking, and clustering. Pattern Anal. Mach. Intell. IEEE Trans. 17(8), 790–799 (1995)
Article Google Scholar
Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. Pattern Anal. Mach. Intell., IEEE Trans. 24(5), 603–619 (2002)
Article Google Scholar
Dunn, J.C.: Well separated clusters and optimal fuzzy-partitions. J. Cybern. 4, 95–104 (1974)
Article MathSciNet Google Scholar
Everitt, B., Landau, S., Leese, M., Stahl, D.: Cluster Analysis. Wiley Series in Probability and Statistics, Wiley, Hoboken (2011), https://books.google.cz/books?id=w3bE1kqd-48C
Google Scholar
Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007), http://www.sciencemag.org/content/315/5814/972.abstract
Google Scholar
Fukunaga, K., Hostetler, L.: The estimation of the gradient of a density function, with applications in pattern recognition. Inf. Theor. IEEE Trans. 21(1), 32–40 (1975)
Article MathSciNet MATH Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer series in statistics, Springer, Berlin (2001), https://books.google.cz/books?id=VRzITwgNV2UC
Kriegel, H.P., Krüger, P., Sander, J., Zimek, A.: Density-based clustering. Wiley Interdisc. Rev. Data Min. Knowl. Disc. 1(3), 231–240 (2011). doi:10.1002/widm.30
Article Google Scholar
Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Trans. Comput. 18, 401–409 (1969)
Article Google Scholar
Torgerson, W.S.: Multidimensional scaling: I. theory and method. Psychometrika 17, 401–419 (1952)
Article MathSciNet MATH Google Scholar
Wang, J.: Geometric Structure of High-Dimensional Data and Dimensionality Reduction. Springer, Berlin (2012), https://books.google.cz/books?id=0RmZRb2fLpgC
Google Scholar

Download references

Acknowledgements

This work was supported by the IT4Innovations Centre of Excellence project (CZ.1.05/1.1.00/02.0070), funded by the European Regional Development Fund and the national budget of the Czech Republic via the Research and Development for Innovations Operational Programme and by Project SP2015/146 of the Student Grant System, VŠB—Technical University of Ostrava.

Author information

Authors and Affiliations

IT4Innovations & Department of Computer Science, VŠB Technical University of Ostrava, Ostrava, Czech Republic
Pavel Krömer & Jan Platoš

Authors

Pavel Krömer
View author publications
You can also search for this author in PubMed Google Scholar
Jan Platoš
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pavel Krömer .

Editor information

Editors and Affiliations

Faculty of Electrical Eng. and Comp., VŠB-Technical University of Ostrava, Ostrava, Czech Republic
Vítězslav Stýskala
Department of Electrical Engineering, VŠB – Technical University of Ostrava, Ostrava, Czech Republic
Dmitrii Kolosov
Faculty of Electrical Engineering and, Technical University of Ostrava, Ostrava-Poruba, Czech Republic
Václav Snášel
after Jusup Balasagyn, Kyrgyz National University named, Bishkek, Kyrgyzstan
Taalaybek Karakeyev
Sci N/w for Innova and Research Exc, Machine Intelligence Research Labs, Washington, USA
Ajith Abraham

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Krömer, P., Platoš, J. (2016). Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study. In: Stýskala, V., Kolosov, D., Snášel, V., Karakeyev, T., Abraham, A. (eds) Intelligent Systems for Computer Modelling . Advances in Intelligent Systems and Computing, vol 423. Springer, Cham. https://doi.org/10.1007/978-3-319-27644-1_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-27644-1_12
Published: 14 February 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27642-7
Online ISBN: 978-3-319-27644-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics