Local Homography Estimation Using Keypoint Descriptors

  • Alberto Del Bimbo
  • Fernando Franco
  • Federico Pernici
Chapter
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 158)

Abstract

This chapter presents a novel learning-based approach to estimate local homography of points belong to a given surface and shows that it is more accurate than specific affine region detection methods. While others works attempt this by using iterative algorithms developed for template matching, our method introduces a direct estimation of the transformation. It performs the following steps. First, a training set of features captures geometry and appearance information about keypoints taken from multiple views of the surface. Then incoming keypoints are matched against the training set in order to retrieve a cluster of features representing their identity. Finally the retrieved clusters are used to estimate the local homography of the regions around keypoints. Thanks to the high accuracy, outliers and bad estimates are filtered out by multiscale Summed Square Difference (SSD) test.

Keywords

Homography estimation SIFT keypoints Nearest neighbor Robust matching Scale and affine invariant features 

Notes

Acknowledgments

This work is partially supported by the EU IST VidiVideo Project (Contract FP6-045547) and IM3I Project (Contract FP7-222267).

References

  1. 1.
    Beis JS, Lowe DG (1999) Indexing without invariants in 3d object recognition. IEEE Trans Pattern Anal Mach Intell 21(10):1000–1015Google Scholar
  2. 2.
    Benhimane S, Malis E (2007) Homography-based 2d visual tracking and servoing. Int J Rob Res 26(7):661–676Google Scholar
  3. 3.
    Ferrari V, Tuytelaars T, Gool L (2006) Simultaneous object recognition and segmentation from single or multiple model views. Int J Comput Vis 67(2):159–188Google Scholar
  4. 4.
    Goedeme T, Tuytelaars T, Gool LJV (2004) Fast wide baseline matching for visual navigation. In: CVPR (1). Washington, USA, pp 24–29Google Scholar
  5. 5.
    Hinterstoisser S, Benhimane S, Lepetit V, Navab N (2008) Simultaneous recognition and homography extraction of local patches with a simple linear classifier. In: British Machine Vision ConferenceGoogle Scholar
  6. 6.
    Hinterstoisser S, Benhimane S, Navab N, Fua P, Lepetit V (2008) Online learning of patch perspective rectification for efficient object detection. In: Conference on Computer Vision and Pattern RecognitionGoogle Scholar
  7. 7.
    Hinterstoisser S, Kutter O, Navab N, Fua P, Lepetit V (2009) Real-time learning of accurate patch rectification. In:Conference on Computer Vision and Pattern RecognitionGoogle Scholar
  8. 8.
    Jurie F, Dhome M (2002) Hyperplane approximation for template matching. IEEE Trans Pattern Anal Mach Intell 24(7):996--1000Google Scholar
  9. 9.
    Klein G, Murray D (2007) Parallel tracking and mapping for small AR workspaces.In: Proceedings of Sixth IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR'07). Nara, JapanGoogle Scholar
  10. 10.
    Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide baseline stereo from maximally stable extremal regions. In: Image and Vision ComputingGoogle Scholar
  11. 11.
    Mikolajczyk K, Tuytelaars T, Schmid C,Zisserman A, Matas J, Schaffalitzky F, Kadir T, Gool LV (2005) A comparison of affine region detectors. Int J Comput Vis 65(1/2):43–72Google Scholar
  12. 12.
    Özuysal M, Fua P, Lepetit V (2007) Fast keypoint recognition in ten lines of code. In: Conference on Computer Vision and Pattern RecognitionGoogle Scholar
  13. 13.
    Pagani A, Stricker D (2009) Learning local patch orientation with a cascade of sparse regressors.In: Proceedings of British Machine Vision Conference (BMVC 2009). London, UKGoogle Scholar
  14. 14.
    Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In:Conference on Computer Vision and Pattern RecognitionGoogle Scholar
  15. 15.
    Rothganger F, Lazebnik S, Schmid C, Ponce J (2006) 3d object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. Int J Comput Vis 66(3):231–259Google Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  • Alberto Del Bimbo
    • 1
  • Fernando Franco
    • 1
  • Federico Pernici
    • 1
  1. 1.Media Integration and Communication Center (MICC)University of FlorenceFlorenceItaly

Personalised recommendations