Hyperplane arrangements for the fast matching and classification of visual landmarks

Stommel, Martin; Herzog, Otthein; Xu, Weiliang

doi:10.1007/s10044-014-0417-3

Hyperplane arrangements for the fast matching and classification of visual landmarks

Theoretical Advances
Published: 23 September 2014

Volume 19, pages 621–629, (2016)
Cite this article

Pattern Analysis and Applications Aims and scope Submit manuscript

Martin Stommel¹,
Otthein Herzog² &
Weiliang Xu³

244 Accesses
Explore all metrics

Abstract

Many robotics and mechatronics systems rely on a fast analysis of visual landmarks. Recently, binary feature representations of the popular SIFT and SURF landmarks have been proposed that offer large speed improvements and low memory consumption at high accuracy. In this paper, we compare a binarisation based on median-centred hyperplanes to the dominating approach of random hyperplanes. We describe the algorithms in a joint taxonomy and show that the kernel for median-centred hyperplanes satiesfies Mercer’s condition. Speed and accuracy are benchmarked in a registration and classification task. Both methods achieve the same dramatic speedup in kernel evaluation. But we show that median-centred hyperplanes are faster in binarisation, find better matches and generalise better over pose and individual variation in the classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning to Rank 3D Features

Robust hand gesture recognition using multiple shape-oriented visual cues

Article Open access 19 July 2021

Generalizing the Prediction Sum of Squares Statistic and Formula, Application to Linear Fractional Image Warp and Surface Fitting

Article 20 September 2016

Notes

http://www.csie.ntu.edu.tw/~cjlin/libsvm.

References

André B, Vercauteren T, Buchner AM, Wallace MB, Ayache N (2011) Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos. In: Medical Image Computing and Computer Assisted Intervention (MICCAI), LNCS. Springer, Heidelberg
Anvaripour M, Ebrahimnezhad H (2013) Accurate object detection using local shape descriptors. Pattern Anal Appl
Bay H, Ess A, Tuytelaars T, van Gool L (2006) SURF: speeded up robust features. Computer Vision Image Underst (CVIU) 110(3):346–359
Article Google Scholar
Chandrasekhar V, Takacs G, Chen DM, Tsai SS, Singh JP, Girod B (2009) Transform coding of image feature descriptors. In: visual Communication and Image Processing (VCIP)
Chen H, Sun D, Yang J (2009) Global localization of multirobot formations using ceiling vision SLAM strategy. Mechatronics 19(5):617–628
Article Google Scholar
Choras M, Kozik R (2013) Contactless palmprint and knuckle biometrics for mobile devices. Pattern Anal Appl 15:73–85
Article MathSciNet Google Scholar
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
MATH Google Scholar
Dani A, Fischer N, Kan Z, Dixon W (2012) Globally exponentially stable observer for vision-based range estimation. Mechatronics 22(4):381–389
Article Google Scholar
Diephuis M, Voloshynovskiy S, Koval O, Beekhof F (2011) Statistical Analysis of Binarized SIFT Descriptors. In: International Symposium on Image and Signal Processing and Analysis (ISPA)
Dong Y, Gao S, Tao K, Liu J, Wang H (2013) Performance evaluation of early and late fusion methods for generic semantics indexing. Pattern Anal Appl
Donoho DL, Tanner J (2010) Counting the faces of randomly-projected hypercubes and orthants, with applications. Discrete Comput Geom 43:522–541
Article MathSciNet MATH Google Scholar
Edelkamp S, Stommel M (2012) The bitvector machine: a fast and robust machine learning algorithm for non-linear problems. In: Flach PA, Bie TD, Cristianini N (eds) European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), pp 175–190. Springer .
Gong Y, Lazebnik S (2011) Iterative quantization: a procrustean approach to learning binary codes. In: Computer Vision and Pattern Recognition (CVPR)
Hsu CW, Lin CJ (2002) A comparison of methods for multi-class support vector machines. IEEE Trans Neural Networks 13:415–425
Article Google Scholar
Jain V, Learned-Miller E (2010) Fddb: A benchmark for face detection in unconstrained settings. Tech Rep UM-CS-2010-009, University of Massachusetts, Amherst
Jarrett K, Kavukcuoglu K, Ranzato M, LeCun Y (2009) What is the best multi-stage architecture for object recognition? In: International Conference on Computer Vision (ICCV)
Ke Y, Sukthankar R (2004) PCA-SIFT: A More Distinctive Representation for Local Image Descriptors. Computer Vision and Pattern Recognition (CVPR) 2:506–513
Google Scholar
Kuo YH, Lin HT, Cheng WH, Yang YH, Hsu WH (2011) Unsupervised auxiliary visual words discovery for large-scale image object retrieval. In: IEEE Comp Vision Pattern Recognit (CVPR)
Lee IH, Chai TS (2013) Accurate registration using adaptive block processing for multi-spectral images. Circuits Syst Video Technol, IEEE Trans on PP(99), 1–1
Lowe DG (1999) Object recognition from local scale-invariant features. In: International Converence on Computer Vision (ICCV), pp 1150–1157
Lyu S (2005) Mercer kernels for object recognition with local features. Comp Vision Pattern Recognit (CVPR) 2:223–229
Google Scholar
Makar M, Chang CL, Chen D, Tsai SS, Girod B (2009) Compression of image patches for local feature extraction. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) pp. 821–824. DOI http://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4959710
Mikolajczyk K, Leibe B, Schiele B (2005) Local features for object class recognition. In: International Conference on Computer Vision (ICCV’05)
Mikolajczyk K, Leibe B, Schiele B (2006) Multiple object class detection with a generative model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’06)
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans. Pattern Anal Mach Intell 27(10):1615–1630
Article Google Scholar
Minh HQ, Niyogi P, Yao Y (2006) Mercer’s theorem, feature maps, and smoothing. In: The 19th Annual Conference on Learning Theory (COLT), pp 154–168
Mühling M, Ewerth R, Freisleben B (2011) On the spatial extents of SIFT descriptors for visual concept detection. In: International Conference on Computer Vision Systems (ICVS), pp 71–80. Springer, Berlin, Heidelberg
Opelt A, Fussenegger M, Pinz A, Auer P (2004) Weak hypotheses and boosting for generic object detection and recognition. In: European Conference on Computer Vision (ECCV), pp 71–84
Pavani SK, Delgado-Gomez D, Frangi AF (2012) Gaussian weak classifiers based on co-occurring haar-like features for face detection. Pattern Analy Appl
Phillips PJ, Rauss PJ, Der SZ (1996) FERET (Face Recognition Technology) recognition algorithm development and test results. Tech Rep 995, Army Research Lab
Rifkin R, Klautau A (2004) In defence of one-vs-all classification. J Mach Learn Res 5:101–141
MathSciNet MATH Google Scholar
Roy S, Sun Q (2007) Robust hash for detecting and localizing image tampering. In: Proc. IEEE Int. Conf. on Image Processing, pp 117–120. San Antonio, TX
Savicky P, Robnik-Sikonja M (2008) Learning random numbers: a MATLAB anomaly. Appl Artif Intell 22(3):254–265
Article Google Scholar
Sleumer NH (2000) Hyperplane arrangements. construction, visualization and application. Ph.D. thesis, Technische Wissenschaften ETH Zürich, Nr. 13502
Stanley RP (2007) An introduction to hyperplane arrangements, IAS/Park City Math. Ser., vol. 13, pp 389–496. Amer Math Soc
Stommel M, Herzog O (2009) Binarising SIFT-descriptors to reduce the curse of dimensionality in histogram-based object recognition. In: Slezak D, Pal SK, Kang BH, Gu J, Kurada H, Kim TH (eds) Signal Processing, Image Processing and Pattern Recognition, pp 320–327. Springer
Stommel M, Langer M, Herzog O, Kuhnert KD (2011) A fast, robust and low bit-rate representation for SIFT and SURF features. In: Proc. IEEE International Symposium on Safety, Security, and Rescue Robotics, pp 278n++-283
Strecha C, Bronstein AM, Bronstein MM, Fua P (2010) LDAHash: Improved matching with smaller descriptors. In: EPFL-REPORT-152487
Su Y, Jurie F (2011) Visual word disambiguation by semantic contexts. In: International Conference on Computer Vision (ICCV)
Vapnik VN, Chervonenkis AY (1974) Theory of pattern recognition [in Russian]. Nauka, USSR
MATH Google Scholar
Viola P, Jones MJ (2004) Robust real-time face detection. Intern J Comp Vision 57(2):137–154
Article Google Scholar
Wiedemeyer T, Stommel M, Herzog O (2011) Wide range face pose estimation by modelling the 3D arrangement of robustly detectable sub-parts. In: Intl. Conf. on Computer Analysis of Images and Patterns (CAIP), pp 237–244. Springer
Winder SAJ, Brown M (2007) Learning Local Image Descriptors. In: IEEE Computer Vision and Pattern Recognition (CVPR)
Yang L, Jin R, Sukthankar R, Jurie F (2008) Unifying discriminative visual codebook generation with classifier training for object category recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Yeo C, Ahammad P, Ramchandran K (2008) Rate-efficient visual correspondences using random projections. In: IEEE International Conference on Image Processing, pp 217–220. San Diego, CA

Download references

Author information

Authors and Affiliations

School of Engineering, Auckland University of Technology, Auckland, New Zealand
Martin Stommel
Visual Information Technologies, Jacobs University Bremen, Bremen, Germany
Otthein Herzog
Mechatronics Group, University of Auckland, Auckland, New Zealand
Weiliang Xu

Authors

Martin Stommel
View author publications
You can also search for this author in PubMed Google Scholar
Otthein Herzog
View author publications
You can also search for this author in PubMed Google Scholar
Weiliang Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martin Stommel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Stommel, M., Herzog, O. & Xu, W. Hyperplane arrangements for the fast matching and classification of visual landmarks. Pattern Anal Applic 19, 621–629 (2016). https://doi.org/10.1007/s10044-014-0417-3

Download citation

Received: 12 November 2013
Accepted: 07 September 2014
Published: 23 September 2014
Issue Date: August 2016
DOI: https://doi.org/10.1007/s10044-014-0417-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hyperplane arrangements for the fast matching and classification of visual landmarks

Abstract

Access this article

Similar content being viewed by others

Learning to Rank 3D Features

Robust hand gesture recognition using multiple shape-oriented visual cues

Generalizing the Prediction Sum of Squares Statistic and Formula, Application to Linear Fractional Image Warp and Surface Fitting

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Hyperplane arrangements for the fast matching and classification of visual landmarks

Abstract

Access this article

Similar content being viewed by others

Learning to Rank 3D Features

Robust hand gesture recognition using multiple shape-oriented visual cues

Generalizing the Prediction Sum of Squares Statistic and Formula, Application to Linear Fractional Image Warp and Surface Fitting

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation