Data Science, Classification, and Related Methods pp 117-124 | Cite as
Partitional Cluster Analysis with Genetic Algorithms: Searching for the Number of Clusters
Conference paper
Summary
In this article we deal with the problem of searching for the number of clusters in partitional clustering in R 2. We set up the problem as an optimization problem by giving a real function on the different partitions that is optimized when the number of clusters and the classes are the most natural. We use the Genetic Algorithm for optimizing this function. The algorithm has been applied to the well-known Ruspini data and to synthetic cally generated datasets, with different cluster numbers and underlying distributions. The results are encouraging.
Keywords
Genetic Algorithm Search Space Convex Hull Travel Salesman Problem Cluster Problem
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Preview
Unable to display preview. Download preview PDF.
References
- Babu, G.P. and Nlurty, M.N. (1994): Clustering with evolution strategies, Pattern Recognition, 27, 2, 321–329.CrossRefGoogle Scholar
- Bezdek, J.C. et al. (1994): Genetic Algorithm Guided Clustering, In: Proc. of The First IEEE Conference on Evolutionary Computation, 34–40.Google Scholar
- Bozdogan. H. (1994): Choosing the number of clusters; subset selection of variables, and outlier detection in the standard mixture-model cluster analysis, In: Diday E., Lechevallier Y., Schader NI., Bertrand P. and Burtschy B. (eds.), New Approaches in Classification and Data Analysis, Springer-Verlag, 169–177.Google Scholar
- Bhuyan, J.N. et al. (1991): Genetic Algorithms for clustering with an ordered representation, In: Belew and Booker (eds.), proceedings of the Fourch international Conference on Gnetic Algorthms, 408–415, Morgan Kaufmann.Google Scholar
- Celeux, G. and Soromenho, G. (1993): An entropy criterion for assesing the number of clusters in a mixture model, Technical Report 1874 INRIA, France.Google Scholar
- Cucchiara, R (1993): Analysis and comparison of different genetic models for the clustering problem in image analysis, In: Albrecht R.F., Reeves C.R. and Steele N.C. (eds.), Artificial Neural Networks and Genetic Algorithms, Springer-Verlag, 423–427.Google Scholar
- Everitt, B.S (1974): Cluster Analysis, John Wiley & Sons, Inc.Google Scholar
- Goldberg, D.E. (1989): Genetic Algorithms in Search, Optimization, Machine Learning, Addison-Wesley.Google Scholar
- Gordon, A.D. (1995): Test for asessing clusters, Statistics in Transition, 2, 207–217.Google Scholar
- Hardy, A. (1994): An examination of procedures for determining the number of clusters in a data set, In: Diday E., Lechevallier Y., Schader NI., Bertrand P. and Burtschy B. (eds.), New Approaches in Classification and Data Analysis, Springer-Verlag, 178–185.Google Scholar
- Hartigan, J.A. (1975): Clustering Algorithms, John Wiley & Sons, New York. Jain, A.Ií. and Dubes, R.C. ( 1988 ): Algorithms for Clustering Data, Prentice Hall.Google Scholar
- Jones, D.R. and Beltramo, M.A. (1993): Solving partitioning problems with Genetic Algorithms, In Albrecht R.F., Reeves C.R and Steele N.C. (eds.), Artificial Neural Networks and Genetic Algorithms, Springer-Verlag, 423–427.Google Scholar
- Krovi, R. (1991): Genetic Algorithms for clustering: A preliminary investigation, In: Proceedings of the Twenty-Fifth International Conference on System Sciences, 4, 540–544.Google Scholar
- Larraüaga, P. et al. (1996): Learning Bayesian Networks Structures by Searching for theGoogle Scholar
- Best Ordering with Genetic Algorithms, IEEE Transactions on Systems Alan and Cybernetics, 26, 4. In press.Google Scholar
- Lozano, J.A. et al. (1995): Genetic Algorithms: Bridging the Convergence Gap, submitted to Evolutionary Computation.Google Scholar
- Luchian, S. et al. (1994): Evolutionary automated classification, In: Proc. of The FirstGoogle Scholar
- IEEE Conference on Evolutionary Computation, 585–589.Google Scholar
- Rasson, J.P. and hubushishi, T. (1994): The gap test: an optimal method for determining the number of natural classes in cluster analysis, In Diday E., Lechevallier Y., Schader M., Bertrand P. and Burtschy B. (eds.), New Approaches in Classification and Data Analysis, Springer-Verlag, 186–193.Google Scholar
- Whitley, D. and Kauth. J. (1988): Genitor: A different Genetic Algorithm, In: Proceedings of the Rocky Jfoun.tain. Conference on Artificial Intelligence, 2, 189–214.Google Scholar
Copyright information
© Springer Japan 1998