Abstract
The bag-of-words model has been widely employed in image classification and object detection tasks. The performance of bag-of-words methods depends fundamentally on the visual vocabulary that is applied to quantize the image features into visual words. Traditional vocabulary construction methods (e.g. k-means) are unable to capture the semantic relationship between image features. In order to increase the discriminative power of the visual vocabulary, this paper proposes a technique to construct a supervised visual vocabulary by jointly considering image features and their class labels. The method uses a novel cost function in which a simple and effective dissimilarity measure is adopted to deal with category information. And, we adopt a prototype-based approach which tries to find prototypes for clusters instead of using the means in k-means algorithm. The proposed method works as the k-means algorithm by efficiently minimizing a clustering cost function. The experiments on different datasets show that the proposed vocabulary construction method is effective for image classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proc. ICCV, vol. 2, pp. 1470–1477 (2003)
Lowe, G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: Proc. ICCV (2005)
Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Proc. ICCV, pp. 1800–1807 (2005)
Lazebnik, S., Raginsky, M.: Supervised Learning of Quantizer Codebooks by Information Loss Minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(7), 1294–1309 (2009)
Moosmann, F., Nowak, E., Jurie, F.: Randomized clustering forests for image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(9), 1632–1646 (2008)
Perronnin, F.: Universal and Adapted Vocabularies for Generic Visual Categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(7), 1243–1256 (2008)
Yang, L., Jin, R., Sukthankar, R., Jurie, F.: Unifying discriminative visual codebook generation with classifier training for object category recognition. In: Proc. CVPR (2008)
Zhang, C., Liu, J., Ouyang, Y., Tian, Q., Lu, H., Ma, S.: Category sensitive codebook construction for object category recognition. In: Proc. ICIP (2009)
Lian, X., Li, Z., Wang, C., Lv, B., Zhang, L.: Probabilistic models for supervised dictionary learning. In: Proc. CVPR (2010)
Jian, A., Dubes, R.: Algorithms for clustering data. Prentice Hall, Englewood Cliffs (1988)
Huang, Z.: Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values. Data Mining and Knowledge Discovery, 283–304 (1998)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)
Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(4), 712–727 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Y., Caselles, V. (2011). Supervised Visual Vocabulary with Category Information. In: Blanc-Talon, J., Kleihorst, R., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2011. Lecture Notes in Computer Science, vol 6915. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23687-7_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-23687-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23686-0
Online ISBN: 978-3-642-23687-7
eBook Packages: Computer ScienceComputer Science (R0)