Encoding Spatial Arrangements of Visual Words for Rotation-Invariant Image Classification

Anwar, Hafeez; Zambanini, Sebastian; Kampel, Martin

doi:10.1007/978-3-319-11752-2_36

Hafeez Anwar¹⁶,
Sebastian Zambanini¹⁶ &
Martin Kampel¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8753))

Included in the following conference series:

German Conference on Pattern Recognition

3661 Accesses
4 Citations

Abstract

Incorporating the spatial information of visual words enhances the performance of the well-known bag-of-visual words (BoVWs) model for problems like object category recognition. However, object images can undergo various in-plane rotations due to which the spatial information must be added to the BoVWs model in rotation-invariant manner. We present a novel approach to integrate the spatial information to BoVWs model in a rotation-invariant way by encoding the triangular relationship among the positions of identical visual words in the \(2D\) image space. Our proposed BoVWs model is based on densely sampled local features for which the dominant orientations are calculated. Thus we achieve rotation-invariance both globally and locally. We validate our proposed method for rotation-invariance on datasets of ancient coins and butterflies and achieve better performance than the conventional BoVWs model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Improvement the Bag of Words Image Representation Using Spatial Information

Discriminative Image Representation for Classification

Bag-of-Words Image Representation: Key Ideas and Further Insight

References

Anwar, H., Zambanini, S., Kampel, M.: Supporting ancient coin classification by image-based reverse side symbol recognition. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds.) CAIP 2013, Part II. LNCS, vol. 8048, pp. 17–25. Springer, Heidelberg (2013)
Chapter Google Scholar
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV, pp. 1–22 (2004)
Google Scholar
Deselaers, T., Ferrari, V.: Global and efficient self-similarity for object classification and detection. In: CVPR, pp. 1633–1640 (2010)
Google Scholar
Kavelar, A., Zambanini, S., Kampel, M., Vondrovec, K., Siegl, K.: The ILAC-project: supporting ancient coin classification by means of image analysis. In: XXIV International CIPA Symposium (2013)
Google Scholar
Khan, R., Barat, C., Muselet, D., Ducottet, C.: Spatial orientation of visual word pairs to improve bag-of-visual-words model. In: BMVC, pp. 1–11 (2012)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)
Google Scholar
Li, F.F., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: CVPR, pp. 524–531 (2005)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60, 91–110 (2004)
Article Google Scholar
Penatti, O.A.B., Silva, F.B., Valle, E., Gouet-Brunet, V., da Silva Torres, R.: Visual word spatial arrangement for image retrieval and classification. Pattern Recogn. 47(2), 705–720 (2014)
Article Google Scholar
Perdoch, M., Chum, O., Matas, J.: Efficient representation of local geometry for large scale object retrieval. In: CVPR, pp. 9–16 (2009)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: Improving particular object retrieval in large scale image databases. In: CVPR (2008)
Google Scholar
Veksler, O.: Star shape prior for graph-cut image segmentation. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 454–467. Springer, Heidelberg (2008)
Chapter Google Scholar
Wang, J., Markert, K., Everingham, M.: Learning models for object recognition from natural language descriptions. In: BMVC, pp. 2.1–2.11 (2009)
Google Scholar
Zambanini, S., Kampel, M.: Robust automatic segmentation of ancient coins. In: VISAPP, pp. 273–276 (2009)
Google Scholar
Zhang, E., Mayo, M.: Enhanced spatial pyramid matching using log-polar-based image subdivision and representation. In: DICTA, pp. 208–213 (2010)
Google Scholar
Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. Int. J. Comput. Vision 73(2), 213–238 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision Lab, Vienna University of Technology, Vienna, Austria
Hafeez Anwar, Sebastian Zambanini & Martin Kampel

Authors

Hafeez Anwar
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Zambanini
View author publications
You can also search for this author in PubMed Google Scholar
Martin Kampel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hafeez Anwar .

Editor information

Editors and Affiliations

Department of Mathematics and Computer Science, University of Münster, Münster, Germany
Xiaoyi Jiang
Computer Science Department 5, University of Erlangen-Nürnberg, Erlangen, Germany
Joachim Hornegger
Department of Computer Science, University of Kiel, Kiel, Germany
Reinhard Koch

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Anwar, H., Zambanini, S., Kampel, M. (2014). Encoding Spatial Arrangements of Visual Words for Rotation-Invariant Image Classification. In: Jiang, X., Hornegger, J., Koch, R. (eds) Pattern Recognition. GCPR 2014. Lecture Notes in Computer Science(), vol 8753. Springer, Cham. https://doi.org/10.1007/978-3-319-11752-2_36

Download citation

DOI: https://doi.org/10.1007/978-3-319-11752-2_36
Published: 15 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11751-5
Online ISBN: 978-3-319-11752-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Encoding Spatial Arrangements of Visual Words for Rotation-Invariant Image Classification

Abstract

Access this chapter

Similar content being viewed by others

Improvement the Bag of Words Image Representation Using Spatial Information

Discriminative Image Representation for Classification

Bag-of-Words Image Representation: Key Ideas and Further Insight

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Encoding Spatial Arrangements of Visual Words for Rotation-Invariant Image Classification

Abstract

Access this chapter

Similar content being viewed by others

Improvement the Bag of Words Image Representation Using Spatial Information

Discriminative Image Representation for Classification

Bag-of-Words Image Representation: Key Ideas and Further Insight

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation