Sparse Flexible Models of Local Features

Carneiro, Gustavo; Lowe, David

doi:10.1007/11744078_3

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3953)

Abstract

In recent years there has been growing interest in recognition models that use local image features, for applications ranging from long-range motion matching to object class recognition. Currently, many state-of-the-art approaches rely on models with very restrictive priors on the number of local features and their spatial relations. Adopting such priors is necessary to simplify both the learning and inference tasks. In addition, most state-of-the-art learning approaches are semi-supervised batch processes, which considerably reduces their suitability in dynamic environments, where unannotated new images are continuously presented to the learning system. In this work we propose: 1) a new model representation that has a less restrictive prior on the geometry and number of local features, where the geometry of each local feature is influenced by its k closest neighbors and models may contain hundreds of features; and 2) a novel unsupervised on-line learning algorithm that is capable of estimating the model parameters efficiently and accurately. We implement a visual class recognition system using the proposed model and learning method, and demonstrate that it produces competitive classification and localization results compared to state-of-the-art methods. Moreover, we show that the learning algorithm can model not only classes with consistent texture (e.g., faces), but also classes defined by shape only (e.g., leaves), classes with a common shape but great variability in internal texture (e.g., cups), and classes of flexible objects (e.g., snakes).
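
As a rough illustration of the idea that each feature's geometry depends on its k closest neighbors, the following sketch builds a k-nearest-neighbor structure over 2D feature positions and computes the relative offsets to those neighbors. It is not the authors' model or learning algorithm: the function names, the Euclidean distance, the brute-force search, and the toy coordinates are all assumptions made for this example.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def k_nearest_neighbors(points: List[Point], k: int) -> List[List[int]]:
    """For each point, return the indices of its k closest other points.

    Brute-force Euclidean search (hypothetical helper, O(n^2 log n)).
    """
    neighbors = []
    for i, p in enumerate(points):
        # Distance from feature i to every other feature j.
        dists = [
            (math.hypot(p[0] - q[0], p[1] - q[1]), j)
            for j, q in enumerate(points)
            if j != i
        ]
        dists.sort()
        neighbors.append([j for _, j in dists[:k]])
    return neighbors


def relative_offsets(
    points: List[Point], neighbors: List[List[int]]
) -> List[List[Point]]:
    """Offsets from each feature to its neighbors.

    These are the kind of local geometric relations that a
    k-neighbor spatial prior could constrain (an assumption here).
    """
    return [
        [(points[j][0] - points[i][0], points[j][1] - points[i][1]) for j in nbrs]
        for i, nbrs in enumerate(neighbors)
    ]


# Toy feature positions (made up for illustration).
features = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (5.0, 5.0)]
knn = k_nearest_neighbors(features, k=2)
offsets = relative_offsets(features, knn)
```

Because each feature is tied only to a fixed number of neighbors, the number of pairwise relations grows linearly with the number of features, which is one plausible reason a model of this kind could scale to hundreds of features. For larger feature sets, an approximate nearest-neighbor structure could replace the brute-force search.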

Author information

Authors and Affiliations

Integrated Data Systems Department, Siemens Corporate Research, Princeton, NJ, USA
Gustavo Carneiro
Department of Computer Science, University of British Columbia, Vancouver, BC, Canada
David Lowe

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria
Axel Pinz

About this paper

Cite this paper

Carneiro, G., Lowe, D. (2006). Sparse Flexible Models of Local Features. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3953. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744078_3

DOI: https://doi.org/10.1007/11744078_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33836-9
Online ISBN: 978-3-540-33837-6
eBook Packages: Computer Science, Computer Science (R0)
