Abstract
One problem in appearance-based pose estimation is the need for many training examples, i.e., images of the object in a large number of known poses. Some invariance can be obtained by considering translations, rotations and scale changes in the image plane, but the remaining degrees of freedom are often handled simply by sampling the pose space densely enough. This work presents a method for accurate interpolation between training views using local linear models. Local soft orientation histograms are used as the view representation. The derivative of this representation with respect to the image plane transformations is computed, and Gauss-Newton optimization is used to estimate all pose parameters simultaneously, resulting in an accurate estimate.
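As a rough illustration of the last step, the sketch below runs Gauss-Newton on a single in-plane rotation angle, using a soft orientation histogram and its analytic derivative. It is a toy under stated assumptions, not the method of the paper: the squared raised-cosine kernel, the bin count `N_BINS`, and the function names `soft_histogram` and `estimate_rotation` are all hypothetical, and the local linear models and the remaining pose parameters are left out.

```python
import math

N_BINS = 8  # hypothetical number of orientation bins


def soft_histogram(angles, weights, theta=0.0, n_bins=N_BINS):
    """Return (hist, dhist): the soft histogram of the angles rotated by
    theta, and its derivative with respect to theta."""
    hist = [0.0] * n_bins
    dhist = [0.0] * n_bins
    for a, w in zip(angles, weights):
        for k in range(n_bins):
            d = a + theta - 2.0 * math.pi * k / n_bins
            b = 0.5 * (1.0 + math.cos(d))     # smooth, 2*pi-periodic bump
            hist[k] += w * b * b              # assumed kernel: b squared
            dhist[k] += -w * b * math.sin(d)  # d(b^2)/d(theta)
    return hist, dhist


def estimate_rotation(angles, weights, observed, theta0=0.0,
                      n_iter=20, tol=1e-12):
    """Gauss-Newton on one parameter: find theta so that the rotated
    histogram matches the observed one in the least-squares sense."""
    theta = theta0
    for _ in range(n_iter):
        hist, dhist = soft_histogram(angles, weights, theta, len(observed))
        residual = [h - o for h, o in zip(hist, observed)]
        jtj = sum(j * j for j in dhist)       # J^T J (a scalar here)
        if jtj == 0.0:
            break                             # no gradient information
        step = -sum(j * r for j, r in zip(dhist, residual)) / jtj
        theta += step
        if abs(step) < tol:
            break
    return theta


if __name__ == "__main__":
    # Synthetic "training view": a few edge orientations with magnitudes.
    angles = [0.3, 1.1, 2.5, 4.0]
    weights = [1.0, 0.5, 2.0, 1.0]
    # Synthetic "query view": the same edges rotated by 0.2 rad in the plane.
    observed, _ = soft_histogram(angles, weights, theta=0.2)
    print(estimate_rotation(angles, weights, observed))
```

With several pose parameters the same iteration would solve a small linear system with the full Jacobian at each step instead of dividing two scalars. The smooth kernel is what makes the histogram differentiable in the first place; a hard-binned histogram would give no usable derivative.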
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Jonsson, E., Felsberg, M. (2007). Accurate Interpolation in Appearance-Based Pose Estimation. In: Ersbøll, B.K., Pedersen, K.S. (eds) Image Analysis. SCIA 2007. Lecture Notes in Computer Science, vol 4522. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73040-8_1
DOI: https://doi.org/10.1007/978-3-540-73040-8_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73039-2
Online ISBN: 978-3-540-73040-8
eBook Packages: Computer Science (R0)