Abstract
The problem of clustering a set of n points into k groups under various objective functions is studied. It is shown that under some objective functions clustering problems are NP-hard even when the points to be grouped are restricted to lie in the two dimensional euclidean space. Our results can be extended to show that their corresponding approximation problems are also NP-hard. It is shown that some restricted graph partition problems are also NP-hard.
Preview
Unable to display preview. Download preview PDF.
References
Augustson, J. G. and J. Minker, "An Analysis of Some Graph Theoretical Cluster Techniques," J.ACM, 17,571–588, (October 1970).
Bock, H. H., "Automatische Klassifikation," Vandenhoek und Ruprecht, Gottingen, 1974.
Bodin, L. D., "A Graph Theoretic Approach to the Grouping of Ordering Data," Networks, 2, 307–310, (1972).
Brucker, P. "On the Complexity of Clustering Problems," in R. Henn, B. Korte and W. Oletti (eds), Optimiening and Operations Research, Lecture Notes in Economics and Mathematical Systems, Springer, Berlin (1977).
Duda, R. and P. Hart, "Pattern Classification and Scene Analysis," John Wiley and Sons, New York, 1973.
Fisher, L. and J. Van Ness, "Admissible Clustering Procedures," Biometrica, 58:91–104, 1971.
Fisher, W. D., "On Grouping for Maximum Homogeneity," JASA, 53:789–798, 1958.
Gonzalez, T., "Algorithms on Sets and Related Problems," Technical Report 75–15, The University of Oklahoma, 1975.
Gonzalez, T., Manuscript in preparation.
Garey, M. R. and D. S. Johnson, "The Complexity of Near-Optimal Graph Coloring," JACM, 23, 1, 43–69, (Jan 1976).
Garey, M. R. and D. S. Johnson, "Computers and Intractability: A Guide to the Theory of NP-Completeness," W. H. Freeman and Company, San Francisco, 1980.
Garey, M. R. and D. S. Johnson, Unpublished results referenced in [GJ2].
Horowitz, E. and S. Sahni, "Fundamentals of Computer Algorithms," Computer Science Press, Inc., 1978.
Johnson, D. B. and J. M. Lafuente, "Controlled Single Pass Classification Algorithm with applications to Multilevel Clustering," Scientific Report #ISR-18, Information Science and Retreival, Cornell University, Oct 1970.
Rohlf, F. J. "Single Link Clustering Algorithms," RC 8569 (#37332) Research Report, IBM, T. J. Watson Research Center, Nov. 1980.
Karp, R. M., "Reducibility Among Combinatorial Problems," In Complexity of Computer Computations, R. E. Miller and J. W. Thatcher, Eds, Plenum Press, N. Y. 1972, pp. 85–104.
Meisel, W. S., "Computer-Oriented Approaches to Pattern Recognition," Academic Press, New York, 1972.
Sahni, S. and T. Gonzalez, "P-Complete Approximation Problems," JACM, 23, 555–565, 1976.
Salton, G. "The Smart Retreival System, Experiments in Automatic Document Processing," Prentice-Hall, New Jersey (1971).
Salton, G., "Dynamic Information and Library Processing," Prentice-Hall, New Jersey (1975).
Shamos, M.I., "Geometry and Statistics: Problems at the Interface," in J. F. Traub (ed), Algorithms and Complexity: New Directions and Recent Results, Academic Press, New York, 251–280, 1976.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1982 Springer-Verlag
About this paper
Cite this paper
Gonzalez, T.F. (1982). On the computational complexity of clustering and related problems. In: Drenick, R.F., Kozin, F. (eds) System Modeling and Optimization. Lecture Notes in Control and Information Sciences, vol 38. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0006133
Download citation
DOI: https://doi.org/10.1007/BFb0006133
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-11691-2
Online ISBN: 978-3-540-39459-4
eBook Packages: Springer Book Archive