Supervised Visual Vocabulary with Category Information

Liu, Yunqiang; Caselles, Vicent

doi:10.1007/978-3-642-23687-7_2

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6915)

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

Abstract

The bag-of-words model has been widely employed in image classification and object detection tasks. The performance of bag-of-words methods depends fundamentally on the visual vocabulary used to quantize image features into visual words. Traditional vocabulary construction methods (e.g. k-means) are unable to capture the semantic relationships between image features. To increase the discriminative power of the visual vocabulary, this paper proposes a technique for constructing a supervised visual vocabulary by jointly considering image features and their class labels. The method uses a novel cost function in which a simple and effective dissimilarity measure handles the category information. We also adopt a prototype-based approach, which seeks a prototype for each cluster instead of using the cluster mean as the k-means algorithm does. Like k-means, the proposed method works by efficiently minimizing a clustering cost function. Experiments on different datasets show that the proposed vocabulary construction method is effective for image classification.
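
The abstract does not state the cost function, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a k-prototypes-style cost: squared Euclidean distance between feature vectors plus a weight `gamma` times a simple label-mismatch term. Each prototype is assumed to be a pair of a feature centre and a class label. The function names `supervised_vocabulary` and `quantize`, the parameter `gamma`, and the mean/majority-label update rule are all hypothetical choices made for illustration.

```python
import numpy as np

def supervised_vocabulary(X, y, k, gamma=1.0, n_iter=20, seed=0):
    """Cluster features X (n, d) with integer labels y (n,) into k prototypes.

    Assumed cost (NOT taken from the paper): squared Euclidean distance
    between features + gamma * (1 if labels differ else 0).
    """
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=k, replace=False)
    centers = X[idx].astype(float)   # feature part of each prototype
    proto_labels = y[idx].copy()     # label part of each prototype
    for _ in range(n_iter):
        # Assignment step: pick the prototype with the lowest combined cost.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        mismatch = (y[:, None] != proto_labels[None, :]).astype(float)
        assign = np.argmin(d2 + gamma * mismatch, axis=1)
        # Update step: mean of member features, majority label of members.
        for j in range(k):
            members = assign == j
            if not members.any():
                continue  # leave an empty prototype unchanged
            centers[j] = X[members].mean(axis=0)
            vals, counts = np.unique(y[members], return_counts=True)
            proto_labels[j] = vals[np.argmax(counts)]
    return centers, proto_labels

def quantize(X, centers):
    """Bag-of-words histogram: count features per nearest visual word.

    Labels are unknown at test time, so only feature distance is used
    (an assumption of this sketch).
    """
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.bincount(np.argmin(d2, axis=1), minlength=len(centers))
```

With `gamma = 0` the assignment step reduces to that of plain k-means; larger values push features from different categories into different visual words even when they are close in feature space.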

Author information

Authors and Affiliations

Barcelona Media - Innovation Center, Barcelona, Spain
Yunqiang Liu
Universitat Pompeu Fabra, Barcelona, Spain
Vicent Caselles

Editor information

Editors and Affiliations

DGA, 7-9 rue des Mathurins, 92 221, Bagneux, France
Jacques Blanc-Talon
VITO-TAP, Boeretang 200, 2400, Mol, Belgium
Richard Kleihorst
Ghent University, St.-Pietersnieuwstraat 41, B9000, Ghent, Belgium
Wilfried Philips
CSIRO ICT Centre, Epping, PO Box 76, 1710, Sydney, NSW, Australia
Dan Popescu
Physics, University of Antwerp, Universiteitsplein 1; Building N, 2610, Wilrijk, Belgium
Paul Scheunders

About this paper

Cite this paper

Liu, Y., Caselles, V. (2011). Supervised Visual Vocabulary with Category Information. In: Blanc-Talon, J., Kleihorst, R., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2011. Lecture Notes in Computer Science, vol 6915. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23687-7_2

DOI: https://doi.org/10.1007/978-3-642-23687-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23686-0
Online ISBN: 978-3-642-23687-7
eBook Packages: Computer Science, Computer Science (R0)
