A Bag of Constrained Visual Words Model for Image Representation

Mukherjee, Anindita; Sil, Jaya; Chowdhury, Ananda S.

doi:10.1007/978-981-32-9291-8_32

Conference paper
First Online: 20 September 2019

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1024)

Abstract

We propose a bag of constrained visual words model for image representation. Under this model, each image is treated as an aggregation of patches, and each patch is described by SURF features. Two sets of constraints, namely must-link and cannot-link, are developed for each patch in a completely unsupervised manner. The constraints are formulated using the distance information among different patches as well as statistical analysis of the entire patch data. All the patches from the image set under consideration are then quantized using Linear-time-Constrained Vector Quantization Error (LCVQE), a fast yet accurate constrained k-means algorithm. The resulting clusters, which we term constrained visual words, are then used to label the patches in the images. In this way, we model an image as a bag (histogram) of constrained visual words and then show its utility for image retrieval. Clustering as well as initial retrieval results on the COIL-100 dataset indicate the merit of our approach.
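
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and everything specific in it is an assumption: random 64-dimensional vectors stand in for SURF patch descriptors, the hypothetical `derive_constraints` helper picks must-link and cannot-link pairs by simple distance percentiles (the abstract does not state the actual rule), and the hypothetical `bag_of_words` helper assigns patches to a given codebook by plain nearest-centroid lookup rather than by LCVQE.

```python
import numpy as np


def derive_constraints(descriptors, low_q=5.0, high_q=95.0):
    """Hypothetical unsupervised rule: very close patch pairs become
    must-link, very distant pairs become cannot-link (percentile cut-offs)."""
    diffs = descriptors[:, None, :] - descriptors[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    rows, cols = np.triu_indices(len(descriptors), k=1)  # each pair once, i < j
    pair_dists = dists[rows, cols]
    low, high = np.percentile(pair_dists, [low_q, high_q])
    must_link = [(int(i), int(j))
                 for i, j, d in zip(rows, cols, pair_dists) if d <= low]
    cannot_link = [(int(i), int(j))
                   for i, j, d in zip(rows, cols, pair_dists) if d >= high]
    return must_link, cannot_link


def bag_of_words(descriptors, codebook):
    """Label each patch with its nearest visual word and return the
    normalized histogram of word counts for the image."""
    diffs = descriptors[:, None, :] - codebook[None, :, :]
    labels = (diffs ** 2).sum(axis=-1).argmin(axis=1)
    hist = np.bincount(labels, minlength=len(codebook)).astype(float)
    return hist / hist.sum()


rng = np.random.default_rng(0)
patches = rng.normal(size=(40, 64))   # stand-in for 40 SURF descriptors
codebook = rng.normal(size=(8, 64))   # stand-in for 8 learned visual words

must_link, cannot_link = derive_constraints(patches)
histogram = bag_of_words(patches, codebook)
```

In the model the abstract describes, the codebook would come from a constrained clustering step that uses the two constraint sets; here the codebook is simply given, so the sketch shows only how constraints and histograms are represented.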

Author information

Authors and Affiliations

Dream Institute of Technology, Kolkata, India
Anindita Mukherjee
IIEST Sibpur, Howrah, India
Jaya Sil
Jadavpur University, Kolkata, 700032, India
Ananda S. Chowdhury

Corresponding author

Correspondence to Ananda S. Chowdhury.

Editor information

Editors and Affiliations

Techno India University, Kolkata, India
Bidyut B. Chaudhuri
Division of Advanced Information Technology and Computer Science, Tokyo University of Agriculture and Technology, Koganei-shi, Tokyo, Japan
Masaki Nakagawa
Department of Computer Science, Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, Madhya Pradesh, India
Pritee Khanna
Department of Mathematics, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Sanjeev Kumar

Cite this paper

Mukherjee, A., Sil, J., Chowdhury, A.S. (2020). A Bag of Constrained Visual Words Model for Image Representation. In: Chaudhuri, B., Nakagawa, M., Khanna, P., Kumar, S. (eds) Proceedings of 3rd International Conference on Computer Vision and Image Processing. Advances in Intelligent Systems and Computing, vol 1024. Springer, Singapore. https://doi.org/10.1007/978-981-32-9291-8_32

DOI: https://doi.org/10.1007/978-981-32-9291-8_32
Published: 20 September 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-32-9290-1
Online ISBN: 978-981-32-9291-8
eBook Packages: Engineering, Engineering (R0)
