Skip to main content

Cluster Graph Modification Problems

  • Conference paper
  • First Online:
Graph-Theoretic Concepts in Computer Science (WG 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2573))

Included in the following conference series:

Abstract

In a clustering problem one has to partition a set of elements into homogeneous and well-separated subsets. From a graph theoretic point of view, a cluster graph is a vertex-disjoint union of cliques. The clustering problem is the task of making fewest changes to the edge set of an input graph so that it becomes a cluster graph. We study the complexity of three variants of the problem. In the Cluster Completion variant edges can only be added. In Cluster Deletion, edges can only be deleted. In Cluster Editing, both edge additions and edge deletions are allowed. We also study these variants when the desired solution must contain a prespecified number of clusters.

We show that Cluster Editing is NP-complete, Cluster Deletion is NPhard to approximate to within some constant factor, and Cluster Completion is polynomial. When the desired solution must contain exactly p clusters, we show that Cluster Editing is NP-complete for every p≥ 2; Cluster Deletion is polynomial for p = 2 but NP-complete for p> 2; and Cluster Completion is polynomial for any p. We also give a constant factor approximation algorithm for Cluster Editing when p = 2.

Supported in part by the Israel Science Foundation (grant number 565/99).

Supported by an Eshkol fellowship from the Ministry of Science, Israel.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. A. Alizadeh, M. B. Eisen, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature, 403(6769):503–511, 2000.

    Article  Google Scholar 

  2. A. Ben-Dor, R. Shamir, and Z. Yakhini. Clustering gene expression patterns. Journal of Computational Biology, 6(3/4):281–297, 1999.

    Article  Google Scholar 

  3. C. Berge. Graphs and Hypergraphs. North-Holland, Amsterdam, 1973.

    Google Scholar 

  4. M. R. Garey and D. S. Johnson. Computers and Intractability: A Gui de to theTheory of NP-Completeness. W. H. Freeman and Co., San Francisco, 1979.

    Google Scholar 

  5. M. X. Goemans and D. P. Williamson. Improved approximation algorithms for maximum cut and satis.ability problems using semide.nite programming. Journal of the ACM, 42(6):1115–1145, 1995.

    Article  MATH  MathSciNet  Google Scholar 

  6. T. R. Golub, D. K. Slonim, et al. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science, 286:531–537, October 1999.

    Google Scholar 

  7. C. Hagen and A.B. Kahng. New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 11(9):1074–1085, 1992.

    Article  Google Scholar 

  8. P. Hansen and B. Jaumard. Cluster analysis and mathematical programming. Mathematical Programming, 79:191–215, 1997.

    MathSciNet  Google Scholar 

  9. J.A. Hartigan. Clustering Algorithms. John Wiley and Sons, 1975.

    Google Scholar 

  10. D. S. Hochbaum, editor. Approximation Alogrithms for NP-Hard Problems. PWS Publishing, Boston, 1997.

    Google Scholar 

  11. L. Lovasz. Covering and coloring of hypergraphs. In Proc. 4th Southeastern Conf. on Combinatorics, Graph Theory, and Computing. Utilitas Mathematica Publishing, 1973.

    Google Scholar 

  12. A. Natanzon. Complexity and approximation of some graph modi.cation problems. Master’s thesis, Department of Computer Science, Tel Aviv University, 1999.

    Google Scholar 

  13. A. Natanzon, R. Shamir, and R. Sharan. Complexity classi.cation of some edge modification problems. Discrete Applied Mathematics, 113:109–128, 2001.

    Article  MATH  MathSciNet  Google Scholar 

  14. C. Papadimitriou and M. Yannakakis. Optimization, approximation, and complexity classes. J. of Computer and System Science, 43:425–440, 1991.

    Article  MATH  MathSciNet  Google Scholar 

  15. R. Sharan and R. Shamir. CLICK: A clustering algorithm with applications to gene expression analysis. In Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology (ISMB), pages 307–316, 2000.

    Google Scholar 

  16. Z. Wu and R. Leahy. An optimal graph theoretic approach to data clustering: theory and its application to image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11):1101–1113, 1993.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shamir, R., Sharan, R., Tsur, D. (2002). Cluster Graph Modification Problems. In: Goos, G., Hartmanis, J., van Leeuwen, J., Kučera, L. (eds) Graph-Theoretic Concepts in Computer Science. WG 2002. Lecture Notes in Computer Science, vol 2573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36379-3_33

Download citation

  • DOI: https://doi.org/10.1007/3-540-36379-3_33

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00331-1

  • Online ISBN: 978-3-540-36379-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics