Abstract
Clustering can be defined as the process of partitioning a set of patterns into disjoint and homogeneous meaningful groups, called clusters. The growing need for parallel clustering algorithms is attributed to the huge size of databases that is common nowadays. This paper presents a parallel version of a recently proposed algorithm that has the ability to scale very well in parallel environments mainly regarding space requirements but also gaining a time speedup.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aldenderfer, M.S., Blashfield, R.K.: Cluster Analysis. Quantitative Applications in the Social Sciences. SAGE Publications, London (1984)
Alevizos, P.: An Algorithm for Orthogonal Range Search in d \(\geqslant\) 3 Dimensions. In: Proceedings of the 14th European Workshop on Computational Geometry, Barcelona (1998)
Alevizos, P., Boutsinas, B., Tasoulis, D., Vrahatis, M.N.: Improving the Orthogonal Range Search k-windows Clustering Algorithm. In: Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence, Washington D.C., pp. 239–245 (2002)
Bentley, J.L., Maurer, H.A.: Efficient Worst-Case Data Structures for Range Searching. Acta Informatica 13, 1551–1568 (1980)
Chazelle, B.: Filtering Search: A New Approach to Query-Answering. SIAM J. Comput. 15(3), 703–724 (1986)
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: Advances in Knowledge Discovery and Data Mining. MIT Press, Cambridge (1996)
Geist, A., Beguelin, J., Dongarra, W., Jiang, R., Manchek, V.: PVM: Parallel Virtual Machine. In: A User’s Guide and Tutorial for Networked Parallel Computing. MIT Press, Cambridge (1994)
Chazelle, B., Guibas, L.J.: Fractional Cascading: II. Applications. Algorithmica 1, 163–191 (1986)
Judd, D., McKinley, P., Jain, A.: Large-Scale Parallel Data Clustering. In: Proceedings of the Int. Conference on Pattern Recognition (1996)
Judd, D., McKinley, P., Jain, A.: Performance Evaluation on Large-Scale Parallel Clustering in NOW Environments. In: Proceedings of the Eight SIAM Conference on Parallel Processing for Scientific Computing, Minneapolis (March 1997)
MPI The Message Passing Interface standard, http://www-unix.mcs.anl.gov/mpi/
Olson, C.F.: Parallel Algorithms for Hierarchical Clustering. Parallel Computing 21, 1313–1325 (1995)
Preparata, F., Shamos, M.: Computational Geometry. Springer, Heidelberg (1985)
Potts, J.T.: Seeking Parallelism in Discovery Programs, Master Thesis, University of Texas at Arlington (1996)
Ramasubramanian, V., Paliwal, K.: Fast k-dimensional Tree Algorithms for Nearest Neighbor Search with Application to Vector Quantization Encoding. IEEE Transactions on Signal Processing 40(3), 518–531 (1992)
Stoffel, K., Belkoniene, A.: Parallel k-means Clustering for Large Data Sets. In: Amestoy, P.R., Berger, P., Daydé, M., Duff, I.S., Frayssé, V., Giraud, L., Ruiz, D. (eds.) Euro-Par 1999. LNCS, vol. 1685, pp. 1451–1454. Springer, Heidelberg (1999)
Vrahatis, M.N., Boutsinas, B., Alevizos, P., Pavlides, G.: The New k-windows Algorithm for Improving the k-means Clustering Algorithm. Journal of Complexity 18, 375–391 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Alevizos, P.D., Tasoulis, D.K., Vrahatis, M.N. (2004). Parallelizing the Unsupervised k-Windows Clustering Algorithm. In: Wyrzykowski, R., Dongarra, J., Paprzycki, M., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2003. Lecture Notes in Computer Science, vol 3019. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24669-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-24669-5_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21946-0
Online ISBN: 978-3-540-24669-5
eBook Packages: Springer Book Archive