Evaluation of Two-Step Spectral Clustering Algorithm for Large Untypical Data Sets

Dudek, Andrzej

doi:10.1007/978-3-030-75190-6_1

Andrzej Dudek ORCID: orcid.org/0000-0002-4943-8703²⁰

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

Included in the following conference series:

Conference of the Section on Classification and Data Analysis of the Polish Statistical Association

1067 Accesses

Abstract

Researchers analyzing large (>100,000 objects) data sets with the methods of cluster analysis often face the problem of computational complexity of algorithms that sometimes makes it impossible to analyze in an acceptable time. Common solution of this problem is to use less computationally complex algorithms (like k-means), which in turn can in many cases give much worse results than for example algorithms using eigenvalues decomposition. In the article, the new algorithm from spectral clustering family is proposed and compared with other approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dimitriadou E, Weingessel A, Hornik K (2001) Voting-merging: an ensemble method for clustering. In: Dorffner G, Bischop H, Hornik K (eds) Artificial neural networks—ICANN 2001. Lecture notes in computer science, vol 2130. Springer, Heidelberg, pp 217–224
Google Scholar
Dudek A (2013) Classification of large data sets. Comparison of performance of chosen algorithms. Acta Universitatis Lodziensis. Folia Oeconomica 285:71–78
Google Scholar
Hubert LJ, Arabie P (1985) Comparing partitions. J Classif 2:193–218
Article Google Scholar
Kong T, Tian Y, Shen H (2011) A fast incremental spectral clustering for large data sets, pp 1–5. https://doi.org/10.1109/PDCAT.2011.4
Ng A, Jordan M, Weiss Y (2002) On spectral clustering: analysis and an algorithm. In: Dietterich T, Becker S, Ghahramani Z (eds) Advances in neural information processing systems 14. MIT Press, pp 849–856
Google Scholar
Shinnou H, Sasaki M (2008) Spectral clustering for a large data set by reducing the similarity matrix size. In: Proceedings of the sixth international conference on language resources and evaluation (LREC), pp 201–2014
Google Scholar
von Luxburg U (2006) A tutorial on spectral clustering. Max planck institute for biological cybernetics, Technical Report TR-149
Google Scholar

Download references

Author information

Authors and Affiliations

Wrocław University of Economics and Business, Wrocław, Poland
Andrzej Dudek

Authors

Andrzej Dudek
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrzej Dudek .

Editor information

Editors and Affiliations

Department of Financial Investments and Risk Management, Wroclaw University of Economics and Business, Wroclaw, Poland
Krzysztof Jajuga
Department of Statistics, University of Gdańsk, Sopot, Poland
Krzysztof Najman
Department of Econometrics and Computer Science, Wroclaw University of Economics and Business, Jelenia Góra, Poland
Marek Walesiak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dudek, A. (2021). Evaluation of Two-Step Spectral Clustering Algorithm for Large Untypical Data Sets. In: Jajuga, K., Najman, K., Walesiak, M. (eds) Data Analysis and Classification. SKAD 2020. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-75190-6_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-75190-6_1
Published: 28 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75189-0
Online ISBN: 978-3-030-75190-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics