Skip to main content

Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 423))

Abstract

Cluster analysis is an important high-level data mining procedure that can be used to identify meaningful groups of objects within large data sets. Various dimension reduction methods are used to reduce the complexity of data before further processing. The lower-dimensional projections of original data sets can be seen as simplified models of the original data. In this paper, several clustering algorithms are used to process low-dimensional projections of complex data sets and compared with each other. The properties and quality of clustering obtained by each method is evaluated and their suitability to process reduced data sets is assessed.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.cc.gatech.edu/projects/large_models/.

References

  1. Abdi, H.: Metric multidimensional scaling. In: Salkind, N. (ed.) Encyclopedia of Measurement and Statistics, pp. 598–605. Sage, Thousand Oaks (2007)

    Google Scholar 

  2. Bandyopadhyay, S., Saha, S.: Unsupervised Classification: Similarity Measures, Classical and Metaheuristic Approaches, and Applications. SpringerLink, Bücher. Springer, Berlin (2012), https://books.google.cz/books?id=Vb21R9_rMNoC

  3. Borg, I., Groenen, P., Mair, P.: Mds algorithms. In: Applied Multidimensional Scaling, pp. 81–86. SpringerBriefs in Statistics, Springer, Berlin (2013), http://dx.doi.org/10.1007/978-3-642-31848-1_8

    Google Scholar 

  4. Burges, C.J.C.: Dimension reduction: a guided tour. Found. Trends Mach. Learn. 2(4) (2010), http://dx.doi.org/10.1561/2200000002

    Google Scholar 

  5. Cheng, Y.: Mean shift, mode seeking, and clustering. Pattern Anal. Mach. Intell. IEEE Trans. 17(8), 790–799 (1995)

    Article  Google Scholar 

  6. Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. Pattern Anal. Mach. Intell., IEEE Trans. 24(5), 603–619 (2002)

    Article  Google Scholar 

  7. Dunn, J.C.: Well separated clusters and optimal fuzzy-partitions. J. Cybern. 4, 95–104 (1974)

    Article  MathSciNet  Google Scholar 

  8. Everitt, B., Landau, S., Leese, M., Stahl, D.: Cluster Analysis. Wiley Series in Probability and Statistics, Wiley, Hoboken (2011), https://books.google.cz/books?id=w3bE1kqd-48C

    Google Scholar 

  9. Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007), http://www.sciencemag.org/content/315/5814/972.abstract

    Google Scholar 

  10. Fukunaga, K., Hostetler, L.: The estimation of the gradient of a density function, with applications in pattern recognition. Inf. Theor. IEEE Trans. 21(1), 32–40 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  11. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer series in statistics, Springer, Berlin (2001), https://books.google.cz/books?id=VRzITwgNV2UC

  12. Kriegel, H.P., Krüger, P., Sander, J., Zimek, A.: Density-based clustering. Wiley Interdisc. Rev. Data Min. Knowl. Disc. 1(3), 231–240 (2011). doi:10.1002/widm.30

    Article  Google Scholar 

  13. Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Trans. Comput. 18, 401–409 (1969)

    Article  Google Scholar 

  14. Torgerson, W.S.: Multidimensional scaling: I. theory and method. Psychometrika 17, 401–419 (1952)

    Article  MathSciNet  MATH  Google Scholar 

  15. Wang, J.: Geometric Structure of High-Dimensional Data and Dimensionality Reduction. Springer, Berlin (2012), https://books.google.cz/books?id=0RmZRb2fLpgC

    Google Scholar 

Download references

Acknowledgements

This work was supported by the IT4Innovations Centre of Excellence project (CZ.1.05/1.1.00/02.0070), funded by the European Regional Development Fund and the national budget of the Czech Republic via the Research and Development for Innovations Operational Programme and by Project SP2015/146 of the Student Grant System, VŠB—Technical University of Ostrava.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pavel Krömer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Krömer, P., Platoš, J. (2016). Cluster Analysis of Data with Reduced Dimensionality: An Empirical Study. In: Stýskala, V., Kolosov, D., Snášel, V., Karakeyev, T., Abraham, A. (eds) Intelligent Systems for Computer Modelling . Advances in Intelligent Systems and Computing, vol 423. Springer, Cham. https://doi.org/10.1007/978-3-319-27644-1_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27644-1_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27642-7

  • Online ISBN: 978-3-319-27644-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics