Clustering with Soft and Group Constraints

Law, Martin H. C.; Topchy, Alexander; Jain, Anil K.

doi:10.1007/978-3-540-27868-9_72

Martin H. C. Law²¹,
Alexander Topchy²¹ &
Anil K. Jain²¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3138))

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

1769 Accesses
24 Citations

Abstract

Several clustering algorithms equipped with pairwise hard constraints between data points are known to improve the accuracy of clustering solutions. We develop a new clustering algorithm that extends mixture clustering in the presence of (i) soft constraints, and (ii) group-level constraints. Soft constraints can reflect the uncertainty associated with a priori knowledge about pairs of points that should or should not belong to the same cluster, while group-level constraints can capture larger building blocks of the target partition when afforded by the side information. Assuming that the data points are generated by a mixture of Gaussians, we derive the EM algorithm to estimate the parameters of different clusters. Empirical study demonstrates that the use of soft constraints results in superior data partitions normally unattainable without constraints. Further, the solutions are more robust when the hard constraints may be incorrect.

This work was supported by the U.S. ONR grant no. N000140410183.

Download to read the full chapter text

Chapter PDF

The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering

Article 25 January 2024

A New Clustering Separation Measure Based on Negentropy

Article 04 October 2014

Weighted likelihood mixture modeling and model-based clustering

Article 10 June 2019

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, Englewood Cliffs (1988)
MATH Google Scholar
Yu, S.X., Shi, J.: Segmentation given partial grouping constraints. IEEE Transactions on Pattern Analysis and Machine Intelligence 26, 173–183 (2004)
Article Google Scholar
Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning via equivalence constraints, with applications to the enhancement of image and video retrieval. In: Proc. IEEE Confernce on Computer Vision and Pattern Recognition (2003)
Google Scholar
Wagstaff, K., Cardie, C., Rogers, S., Schroedl, S.: Constrained k-means clustering with background knowledge. In: Proc. International Conference on Machine Learning, pp. 577–584 (2001)
Google Scholar
Wagstaff, K., Cardie, C.: Clustering with instance-level constraints. In: Proc. International Conference on Machine Learning, pp. 1103–1110 (2000)
Google Scholar
Wagstaff, K.: Intelligent Clustering with Instance-Level Constraints. PhD thesis, Department of Computer Science, Cornell University (2002)
Google Scholar
Klein, D., Kamvar, S.D., Manning, C.D.: From instance-level constraints to spacelevel constraints: Making the most of prior knowledge in data clustering. In: Proc. International Conference on Machine Learning, pp. 307–314 (2002)
Google Scholar
Kamvar, S., Klein, D., Manning, C.D.: Spectral learning. In: Proc. of the Eighteenth International Joint Conference on Artificial Intelligence, MIT Press, Cambridge (2003)
Google Scholar
Shental, N., Bar-Hillel, A., Hertz, T., Weinshall, D.: Computing gaussian mixture models with EM using equivalence constraints. In: Advances in Neural Information Processing Systems 16, MIT Press, Cambridge (2004)
Google Scholar
Yu, S.X., Shi, J.: Grouping with bias. In: Advances in Neural Information Processing Systems 13, MIT Press, Cambridge (2001)
Google Scholar
Xing, E.P., Ng, A.Y., Jordan, M.I., Russell, S.: Distance metric learning, with application to clustering with side-information. In: Advances in Neural Information Processing Systems 15, Cambridge, MA, MIT Press, Cambridge (2003)
Google Scholar
Bansal, N., Blum, A., Chawla, S.: Correlation clustering. In: Proc. of the 43d Annual IEEE Symp. on Foundations of Computer Science (2002)
Google Scholar
Charikar, M., Guruswami, V., Wirth, A.: Clustering with qualitative information. In: Proc. of the 44th Annual IEEE Symposium on Foundations of Computer Science (2003)
Google Scholar
Demaine, E.D., Immorlica, N.: Correlation clustering with partial information. In: Proc. of the 6th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, Princeton, New Jersey (2003)
Google Scholar
McLachlan, G., Peel, D.: Finite Mixture Models. John Wiley & Sons, New York (2000)
Book MATH Google Scholar
Figueiredo, M.A.T., Jain, A.K.: Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 381–396 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, 48824, USA
Martin H. C. Law, Alexander Topchy & Anil K. Jain

Authors

Martin H. C. Law
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Topchy
View author publications
You can also search for this author in PubMed Google Scholar
Anil K. Jain
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto Superior Técnico, Instituto de Telecomunicações, Lisbon, Portugal
Ana Fred
RSISE, the Australian National University, ACT 0200, Canberra, Australia
Terry M. Caelli
Information and Communication Theory Group, Delft University of Technology, P.O. Box 5031, 2600GA, Delft, The Netherlands
Robert P. W. Duin
FEUP - Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
Aurélio C. Campilho
Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Information and Communication Theory Group, Delft, The Netherlands
Dick de Ridder

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Law, M.H.C., Topchy, A., Jain, A.K. (2004). Clustering with Soft and Group Constraints. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2004. Lecture Notes in Computer Science, vol 3138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27868-9_72

Download citation

DOI: https://doi.org/10.1007/978-3-540-27868-9_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22570-6
Online ISBN: 978-3-540-27868-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Clustering with Soft and Group Constraints

Abstract

Chapter PDF

Similar content being viewed by others

The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering

A New Clustering Separation Measure Based on Negentropy

Weighted likelihood mixture modeling and model-based clustering

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Clustering with Soft and Group Constraints

Abstract

Chapter PDF

Similar content being viewed by others

The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering

A New Clustering Separation Measure Based on Negentropy

Weighted likelihood mixture modeling and model-based clustering

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation