Efficient Pose Estimation Using View-Based Object Representations

Peters, Gabriele

doi:10.1007/3-540-36592-3_2

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2626)

Included in the following conference series:

International Conference on Computer Vision Systems

Abstract

We present an efficient method for estimating the pose of a three-dimensional object. Its implementation is embedded in a computer vision system that is motivated by and based on cognitive principles concerning the visual perception of three-dimensional objects. Viewpoint-invariant object recognition has long been a subject of controversy. An important point of discussion is the nature of internal object representations. Behavioral studies with primates, which are summarized in this article, support the model of view-based object representations. We designed our computer vision system according to these findings and demonstrate that very precise estimates of the poses of real-world objects are possible even if only a small number of sample views of an object is available. The system can be used for a variety of applications.

Author information

Authors and Affiliations

Informatik VII, Universität Dortmund, Otto-Hahn-Str. 16, D-44227, Dortmund, Germany
Gabriele Peters

Editor information

Editors and Affiliations

INRIA Rhône-Alpes, 655 Ave de l’Europe, 38330, Montbonnot, France
James L. Crowley
Montefiore Institute, University of Liège, 4000, Liège Sart-Tilman, Belgium
Justus H. Piater
Automation and Control Institute, Vienna University of Technology, Gusshausstraße 27/376, 1040, Vienna, Austria
Markus Vincze
Institute of Digital Image Processing, Joanneum Research, Wastiangasse 6, 8010, Graz, Austria
Lucas Paletta

About this paper

Cite this paper

Peters, G. (2003). Efficient Pose Estimation Using View-Based Object Representations. In: Crowley, J.L., Piater, J.H., Vincze, M., Paletta, L. (eds) Computer Vision Systems. ICVS 2003. Lecture Notes in Computer Science, vol 2626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36592-3_2

DOI: https://doi.org/10.1007/3-540-36592-3_2
Published: 14 March 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00921-4
Online ISBN: 978-3-540-36592-1
eBook Packages: Springer Book Archive
