On the computational complexity of clustering and related problems

  • Conference paper
System Modeling and Optimization

Part of the book series: Lecture Notes in Control and Information Sciences ((LNCIS,volume 38))

Abstract

This paper studies the problem of clustering a set of n points into k groups under various objective functions. Under some of these objective functions, clustering is shown to be NP-hard even when the points are restricted to lie in two-dimensional Euclidean space. The results can be extended to show that the corresponding approximation problems are also NP-hard. Some restricted graph partition problems are shown to be NP-hard as well.
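To make the problem statement concrete, the following is a minimal sketch of exhaustive clustering for tiny inputs. The abstract does not name its objective functions, so the objective used here (minimizing the largest distance between two points in the same group) is an assumption chosen for illustration, and the function names `max_diameter` and `brute_force_clustering` are hypothetical. The sketch tries all k^n assignments, which shows why an efficient exact method would be valuable and why hardness results of this kind matter.

```python
from itertools import product
from math import dist


def max_diameter(points, labels):
    """Largest distance between two points that share a cluster label."""
    worst = 0.0
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if labels[i] == labels[j]:
                worst = max(worst, dist(points[i], points[j]))
    return worst


def brute_force_clustering(points, k):
    """Try every assignment of points to k labels and keep the cheapest.

    Illustrative objective (an assumption, not taken from the paper):
    minimize the maximum intra-cluster distance.
    Runs in O(k^n * n^2) time, so it is only usable for very small n.
    """
    best_labels, best_cost = None, float("inf")
    for labels in product(range(k), repeat=len(points)):
        cost = max_diameter(points, labels)
        if cost < best_cost:
            best_labels, best_cost = labels, cost
    return best_labels, best_cost


# Two well-separated pairs of points in the plane.
points = [(0, 0), (0, 1), (10, 0), (10, 1)]
labels, cost = brute_force_clustering(points, k=2)
print(labels, cost)
```

On this input the search groups the two left points together and the two right points together. The running time grows exponentially with n, so the sketch is a way to read the problem definition, not a practical algorithm.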




Editor information

R. F. Drenick, F. Kozin


Copyright information

© 1982 Springer-Verlag

About this paper

Cite this paper

Gonzalez, T.F. (1982). On the computational complexity of clustering and related problems. In: Drenick, R.F., Kozin, F. (eds) System Modeling and Optimization. Lecture Notes in Control and Information Sciences, vol 38. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0006133


  • DOI: https://doi.org/10.1007/BFb0006133


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-11691-2

  • Online ISBN: 978-3-540-39459-4

  • eBook Packages: Springer Book Archive
