Skip to main content

Learning Visual Object Categories with Global Descriptors and Local Features

  • Conference paper
Progress in Artificial Intelligence (EPIA 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5816))

Included in the following conference series:

Abstract

Different types of visual object categories can be found in real-world applications. Some categories are very heterogeneous in terms of local features (broad categories) while others are consistently characterized by some highly distinctive local features (narrow categories). The work described in this paper was motivated by the need to develop representations and categorization mechanisms that can be applied to domains involving different types of categories. A second concern of the paper is that these representations and mechanisms have potential for scaling up to large numbers of categories. The approach is based on combinining global shape descriptors with local features. A new shape representation is proposed. Two additional representations are used, one also capturing the object’s shape and another based on sets of highly distinctive local features. Basic classifiers following the nearest-neighbor rule were implemented for each representation. A meta-level classifier, based on a voting strategy, was also implemented. The relevance of each representation and classifier to both broad and narrow categories is evaluated on two datasets with a combined total of 114 categories.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Al-Ani, A., Deriche, M.: A new technique for combining multiple classifiers using the dempster-shafer theory of evidence. Journal of Artificial Intelligence Research 17, 333–361 (2002)

    MathSciNet  MATH  Google Scholar 

  2. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 509–522 (2002)

    Article  Google Scholar 

  3. Dietterich, T.G.: Ensemble methods in machine learning, pp. 1–15. Springer, Heidelberg (2000)

    Google Scholar 

  4. Goshtasby, A.: Description and discrimination of planar shapes using shape matrices. IEEE Trans. Pattern Anal. Mach. Intell. 7, 738–743 (1985)

    Article  Google Scholar 

  5. Huang, Y.S., Suen, C.Y.: A method of combining multiple experts for the recognition of unconstrained handwritten numerals. IEEE Trans. Pattern Anal. Mach. Intell. 17(1), 90–94 (1995)

    Article  Google Scholar 

  6. Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(3), 226–239 (1998)

    Article  Google Scholar 

  7. Lisin, D., Mattar, M., Blaschko, M., Learned-Miller, E., Benfield, M.: Combining local and global image features for object class recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops, 2005. CVPR Workshops, p. 47 (2005)

    Google Scholar 

  8. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)

    Article  Google Scholar 

  9. Murphy, K.P., Torralba, A.B., Eaton, D., Freeman, W.T.: Object detection and localization using local and global features. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds.) Toward Category-Level Object Recognition. LNCS, vol. 4170, pp. 382–400. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  10. Neumann, J., Samet, H., Soffer, A.: Integration of local and global shape analysis for logo classification. Pattern Recognition Letters 23(12), 1449–1457 (2002)

    Article  MATH  Google Scholar 

  11. Pereira, R., Seabra Lopes, L., Silva, A.: Semantic image search and subset selection for classifier training in object recognition. In: Seabra Lopes, L., et al. (eds.) EPIA 2009. LNCS(LNAI), vol. 5816, pp. 338–349. Springer, Heidelberg (2009)

    Google Scholar 

  12. Ribeiro, L.S.: Object recognition for semantic robot vision. Master’s thesis, Universidade de Aveiro (2008)

    Google Scholar 

  13. Roy, D.K.: Learning Words from Sights and Sounds: A Computational Model. PhD thesis, MIT (2000)

    Google Scholar 

  14. Sarfraz, M., Ridha, A.: Content-based image retrieval using multiple shape descriptors. In: IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2007, pp. 730–737 (2007)

    Google Scholar 

  15. Seabra Lopes, L., Chauhan, A.: How many words can my robot learn? an approach and experiments with one-class learning. Interaction Studies 8(1), 53–81 (2007)

    Article  Google Scholar 

  16. Seabra Lopes, L., Chauhan, A.: Open-ended category learning for language acquisition. Connection Science 8(4) (2008)

    Google Scholar 

  17. Steels, L., Kaplan, F.: Aibo’s first words: the social learning of language and meaning. Evolution of Communication 4(1), 3–32 (2002)

    Article  Google Scholar 

  18. van Erp, M., Vuurpijl, L., Schomaker, L.: An overview and comparison of voting methods for pattern recognition. In: 8th International Workshop on Frontiers in Handwriting Recognition (IWFHR-8), pp. 195–200 (2002)

    Google Scholar 

  19. Wardhani, A., Thomson, T.: Content based image retrieval using category-based indexing. In: IEEE International Conference on Multimedia and Expo, ICME 2004, vol. 2, pp. 783–786 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pereira, R., Seabra Lopes, L. (2009). Learning Visual Object Categories with Global Descriptors and Local Features. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M. (eds) Progress in Artificial Intelligence. EPIA 2009. Lecture Notes in Computer Science(), vol 5816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04686-5_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04686-5_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04685-8

  • Online ISBN: 978-3-642-04686-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics