Binary Histogrammed Intensity Patches for Efficient and Robust Matching

Taylor, Simon; Drummond, Tom

doi:10.1007/s11263-011-0430-6

Binary Histogrammed Intensity Patches for Efficient and Robust Matching

Published: 17 March 2011

Volume 94, pages 241–265, (2011)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

Simon Taylor¹ &
Tom Drummond²

658 Accesses
15 Citations
Explore all metrics

Abstract

This paper describes a method for feature-based matching which offers very fast runtime performance due to the simple quantised patches used for matching and a tree-based lookup scheme which prevents the need for exhaustively comparing each query patch against the entire feature database. The method enables seven independently moving targets in a test sequence to be localised in an average total processing time of 6.03 ms per frame.

A training phase is employed to identify the most repeatable features from a particular range of viewpoints and to learn a model for the patches corresponding to each feature. Feature models consist of independent histograms of quantised intensity for each pixel in the patch, which we refer to as Histogrammed Intensity Patches (HIPs). The histogram values are thresholded and the feature model is stored in a compact binary representation which requires under 60 bytes of memory per feature and permits the rapid computation of a matching score using bitwise operations.

The method achieves better matching robustness than the state-of-the-art fast localisation schemes introduced by Wagner et al. (IEEE International Symposium on Mixed and Augmented Reality, 2008). Additionally both the runtime memory usage and computation time are reduced by a factor of more than four.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ballard, D. H. (1987). Generalizing the hough transform to detect arbitrary shapes. Readings in Computer Vision: Issues, Problems, Principles, and Paradigms, 1, 714–725.
Google Scholar
Bay, H., Ess, A., Tuytelaars, T., & Gool, L. V. (2008) Surf: speeded up robust features. Computer Vision and Image Understanding, 110(3), 346–359.
Article Google Scholar
Brown, M., Szeliski, R., & Winder, S. (2005). Multi-image matching using multi-scale oriented patches. In IEEE computer, society conference on computer vision and pattern recognition (pp. 510–517).
Google Scholar
Chum, O., & Matas, J. (2005). Matching with PROSAC—progressive sample consensus. In IEEE computer, society conference on computer vision and pattern recognition (pp. 220–226).
Google Scholar
Croes, G. A. (1958). A method for solving traveling-salesman problems. Operations Research, 6(6), 791–812.
Article MathSciNet Google Scholar
Fischler, M. A., & Bolles, R. C. (1981). Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6), 381–395.
Article MathSciNet Google Scholar
Harris, C., & Stephens, M. (1988). A combined corner and edge detector. In 4th ALVEY vision conference (pp. 147–151).
Google Scholar
Heikkilä, M., Pietikäinen, M., & Schmid, C. (2009). Description of interest regions with local binary patterns. Pattern Recognition, 42(3), 425–436.
Article MATH Google Scholar
Hinterstoisser, S., Benhimane, S., Lepetit, V., Fua, P., & Navab, N. (2008). Simultaneous recognition and homography extraction of local patches with a simple linear classifier. In British machine vision conference.
Google Scholar
Lepetit, V., & Fua, P. (2006). Keypoint recognition using randomized trees. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(9), 1465–1479.
Article Google Scholar
Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2, 91–110.
Article Google Scholar
Matas, J., Chum, O., Urbana, M., & Pajdlaa, T. (2004). Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing, 22(10), 761–767.
Article Google Scholar
McIlroy, P., Rosten, E., Taylor, S., & Drummond, T. (2010). Deterministic sample consensus with multiple match hypotheses. In British machine vision conference
Google Scholar
Mikolajczyk, K., & Schmid, C. (2001). Indexing based on scale invariant interest points. In IEEE international conference on computer vision (pp. 525–531).
Google Scholar
Mikolajczyk, K., & Schmid, C. (2002). An affine invariant interest point detector. In European conference on computer vision (pp. 128–142).
Google Scholar
Mikolajczyk, K., & Schmid, C. (2005). A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10), 1615–1630.
Article Google Scholar
Moravec, H. (1981). Rover visual obstacle avoidance. In International joint conference on artificial intelligence (pp. 785–790).
Google Scholar
Morell, J., & Yu, G. (2009). ASIFT: a new framework for fully affine invariant image comparison. SIAM Journal on Imaging Sciences, 2(2), 438–469.
Article MathSciNet Google Scholar
Ozuysal, M., Fua, P., & Lepetit, V. (2007). Fast keypoint recognition in ten lines of code. In IEEE computer society conference on computer vision and pattern recognition.
Google Scholar
Rosten, E., & Drummond, T. (2006). Machine learning for high speed corner detection. In European conference on computer vision (pp. 430–443).
Google Scholar
Schmid, C., & Mohr, R. (1997). Local greyvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, 530–535.
Article Google Scholar
Taylor, S., & Drummond, T. (2009). Multiple target localisation at over 100 FPS. In British machine vision conference.
Google Scholar
Taylor, S., Rosten, E., & Drummond, T. (2009). Robust feature matching in 2.3 μs. In IEEE CVPR workshop on feature detectors and descriptors: the state of the art and beyond.
Google Scholar
Wagner, D., Reitmayr, G., Mulloni, A., Drummond, T., & Schmalstieg, D. (2008). Pose tracking from natural features on mobile phones. In IEEE international symposium on mixed and augmented reality.
Google Scholar
Winder, S. A., & Brown, M. (2007). Learning local image descriptors. In IEEE computer society conference on computer vision and pattern recognition.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Engineering, University of Cambridge, Trumpington Street, Cambridge, CB2 1PZ, UK
Simon Taylor
Department of Electrical and Computer Systems Engineering, Monash University, PO Box 35, Clayton, Victoria, 3800, Australia
Tom Drummond

Authors

Simon Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Tom Drummond
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Simon Taylor.

Additional information

This work was supported by The Boeing Company.

Electronic Supplementary Material

Below are the links to the electronic supplementary material.

(7.764 kb)

(32.060 kb)

(129 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Taylor, S., Drummond, T. Binary Histogrammed Intensity Patches for Efficient and Robust Matching. Int J Comput Vis 94, 241–265 (2011). https://doi.org/10.1007/s11263-011-0430-6

Download citation

Received: 21 September 2009
Accepted: 25 February 2011
Published: 17 March 2011
Issue Date: September 2011
DOI: https://doi.org/10.1007/s11263-011-0430-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Binary Histogrammed Intensity Patches for Efficient and Robust Matching

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

LSD-SLAM: Large-Scale Direct Monocular SLAM

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

(129 kb)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Binary Histogrammed Intensity Patches for Efficient and Robust Matching

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

LSD-SLAM: Large-Scale Direct Monocular SLAM

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

(129 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation