Encoding Spatial Arrangement of Visual Words

Penatti, Otávio A. B.; Valle, Eduardo; da S. Torres, Ricardo

doi:10.1007/978-3-642-25085-9_28

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7042)

Included in the following conference series:

Iberoamerican Congress on Pattern Recognition

Abstract

This paper presents a new approach to encoding spatial-relationship information of visual words in the well-known visual dictionary model. The most popular current approach to describing images with visual words is the bag-of-words representation, which does not encode any spatial information. We propose a graceful way to capture spatial-relationship information of visual words that encodes the spatial arrangement of every visual word in an image. Our experiments show the importance of the spatial information of visual words for image classification and the gain in classification accuracy obtained with the new method. The proposed approach creates opportunities for further improvements in image description under the visual dictionary model.
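
The limitation of bags-of-words noted in the abstract can be illustrated with a minimal sketch. It is not the method proposed in the paper; the keypoint tuples, the three-word vocabulary, and the function name `bag_of_words` are all assumptions made for illustration. Two toy images carry the same visual words at different positions and end up with identical histograms.

```python
from collections import Counter


def bag_of_words(word_ids, vocabulary_size):
    """Count visual-word occurrences; keypoint positions are never consulted."""
    counts = Counter(word_ids)
    return [counts.get(w, 0) for w in range(vocabulary_size)]


# Hypothetical toy data: each keypoint is (x, y, visual_word_id).
image_a = [(10, 10, 0), (200, 10, 1), (10, 200, 2), (200, 200, 1)]
# Same visual words as image_a, but assigned to different positions.
image_b = [(200, 200, 0), (10, 200, 1), (200, 10, 2), (10, 10, 1)]

hist_a = bag_of_words([w for _, _, w in image_a], vocabulary_size=3)
hist_b = bag_of_words([w for _, _, w in image_b], vocabulary_size=3)

# The layouts differ, yet the two descriptors are indistinguishable.
print(hist_a == hist_b)
```

Any descriptor that aims to capture spatial arrangement has to use the `(x, y)` coordinates that this histogram discards.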

Author information

Authors and Affiliations

Recod Lab, Institute of Computing, University of Campinas (Unicamp), Campinas, Brazil
Otávio A. B. Penatti, Eduardo Valle & Ricardo da S. Torres

Editor information

Editors and Affiliations

Universidad de La Frontera, Avda. Francisco Salazar, 01145, Temuco, Chile
César San Martin
Myongji University, San 38-2, Namdong, 449-728, Cheoingu, Yongin, Republic of Korea
Sang-Woon Kim

Cite this paper

Penatti, O.A.B., Valle, E., da S. Torres, R. (2011). Encoding Spatial Arrangement of Visual Words. In: San Martin, C., Kim, SW. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2011. Lecture Notes in Computer Science, vol 7042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25085-9_28

DOI: https://doi.org/10.1007/978-3-642-25085-9_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25084-2
Online ISBN: 978-3-642-25085-9
eBook Packages: Computer Science, Computer Science (R0)
