Learning to Rank 3D Features

Tuzel, Oncel; Liu, Ming-Yu; Taguchi, Yuichi; Raghunathan, Arvind

doi:10.1007/978-3-319-10590-1_34

Oncel Tuzel¹⁹,
Ming-Yu Liu¹⁹,
Yuichi Taguchi¹⁹ &
…
Arvind Raghunathan¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8689))

Included in the following conference series:

European Conference on Computer Vision

37k Accesses
16 Citations

Abstract

Representation of three dimensional objects using a set of oriented point pair features has been shown to be effective for object recognition and pose estimation. Combined with an efficient voting scheme on a generalized Hough space, existing approaches achieve good recognition accuracy and fast operation. However, the performance of these approaches degrades when the objects are (self-)similar or exhibit degeneracies, such as large planar surfaces which are very common in both man made and natural shapes, or due to heavy object and background clutter. We propose a max-margin learning framework to identify discriminative features on the surface of three dimensional objects. Our algorithm selects and ranks features according to their importance for the specified task, which leads to improved accuracy and reduced computational cost. In addition, we analyze various grouping and optimization strategies to learn the discriminative pair features. We present extensive synthetic and real experiments demonstrating the improved results.

Download to read the full chapter text

Chapter PDF

3D object recognition using scale-invariant features

Article 25 October 2017

A fast 3D object recognition algorithm using plane-constrained point pair features

Article 11 August 2020

Ultrarobust support vector registration

Article 17 November 2020

Keywords

References

Bariya, P., Nishino, K.: Scale-hierarchical 3D object recognition in cluttered scenes. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 1657–1664 (2010)
Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)
Article Google Scholar
Bookstein, F.L.: Principal warps: Thin-plate splines and the decomposition of deformations. IEEE Trans. Pattern Anal. Mach. Intell. 11(6), 567–585 (1989)
Article MATH Google Scholar
Borgefors, G.: Hierarchical chamfer matching: A parametric edge matching algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 10(6), 849–865 (1988)
Article Google Scholar
Cao, Z., Qin, T., Liu, T.Y., Tsai, M.F., Li, H.: Learning to rank: From pairwise approach to listwise approach. In: Proc. Int’l Conf. Mach. Learning (ICML), pp. 129–136 (2007)
Google Scholar
Choi, C., Taguchi, Y., Tuzel, O., Liu, M.Y., Ramalingam, S.: Voting-based pose estimation for robotic assembly using a 3D sensor. In: Proc. IEEE Int’l Conf. Robotics Automation (ICRA), pp. 1724–1731 (May 2012)
Google Scholar
Drost, B., Ulrich, M., Navab, N., Ilic, S.: Model globally, match locally: Efficient and robust 3D object recognition. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 998–1005 (June 2010)
Google Scholar
Frome, A., Huber, D., Kolluri, R., Bülow, T., Malik, J.: Recognizing objects in range data using regional point descriptors. In: Pajdla, T., Matas, J. (eds.) ECCV 2004. LNCS, vol. 3023, pp. 224–237. Springer, Heidelberg (2004)
Chapter Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey Vision Conference, Manchester, UK, vol. 15, p. 50 (1988)
Google Scholar
Hinterstoisser, S., Holzer, S., Cagniart, C., Ilic, S., Konolige, K., Navab, N., Lepetit, V.: Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV), pp. 858–865 (November 2011)
Google Scholar
Holzer, S., Shotton, J., Kohli, P.: Learning to efficiently detect repeatable interest points in depth data. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 200–213. Springer, Heidelberg (2012)
Chapter Google Scholar
Horn, B.K.P.: Extended gaussian images. Proceedings of the IEEE 72(12), 1671–1686 (1984)
Article Google Scholar
Jain, V., Varma, M.: Learning to re-rank: query-dependent image re-ranking using click data. In: Proc. Int’l Conf. World Wide Web, pp. 277–286 (2011)
Google Scholar
Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge discovery and Data Mining, pp. 133–142 (2002)
Google Scholar
Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3D scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1999)
Article Google Scholar
Kelley Jr., J.E.: The cutting-plane method for solving convex programs. Journal of the Society for Industrial & Applied Mathematics 8(4), 703–712 (1960)
Article MathSciNet Google Scholar
Knopp, J., Prasad, M., Willems, G., Timofte, R., Van Gool, L.: Hough transform and 3D SURF for robust three dimensional classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol. 6316, pp. 589–602. Springer, Heidelberg (2010)
Chapter Google Scholar
Lamdan, Y., Wolfson, H.J.: Geometric hashing: A general and efficient model-based recognition scheme. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV), vol. 88, pp. 238–249 (1988)
Google Scholar
Liu, M.Y., Tuzel, O., Veeraraghavan, A., Chellappa, R.: Fast directional chamfer matching. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 1696–1703 (June 2010)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int’l J. Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 1038–1045 (2009)
Google Scholar
Mian, A.S., Bennamoun, M., Owens, R.: Three-dimensional model-based object recognition and segmentation in cluttered scenes. IEEE Trans. Pattern Anal. Mach. Intell. 28, 1584–1601 (2006)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV), vol. 1, pp. 525–531 (2001)
Google Scholar
Nguyen, H., Porikli, F.: Support vector shape: A classifier based shape representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(4), 970–982 (2013)
Article Google Scholar
Parikh, D., Grauman, K.: Relative attributes. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV), pp. 503–510 (2011)
Google Scholar
Rodolà, E., Albarelli, A., Bergamasco, F., Torsello, A.: A scale independent selection process for 3D object recognition in cluttered scenes. Int’l J. Computer Vision 102(1-3), 129–145 (2013)
Article Google Scholar
Rusu, R.B., Blodow, N., Beetz, M.: Fast point feature histograms (FPFH) for 3D registration. In: Proc. IEEE Int’l Conf. Robotics Automation (ICRA), pp. 3212–3217 (May 2009)
Google Scholar
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 1297–1304 (June 2011)
Google Scholar
Stein, F., Medioni, G.: Structural indexing: Efficient 3-D object recognition. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 125–145 (1992)
Article Google Scholar
Taskar, B., Guestrin, C., Koller, D.: Max-margin markov networks. In: Proc. Neural Information Processing Systems (NIPS), vol. 16 (2003)
Google Scholar
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: Proc. Int’l Conf. Mach. Learning (ICML), p. 104 (2004)
Google Scholar
Tutuncu, R., Toh, K., Todd, M.: Solving semidefinite-quadratic-linear programs using sdpt3. Mathematical Programming Ser. B 95, 198–217 (2003)
Article MathSciNet Google Scholar
Wahl, E., Hillenbrand, U., Hirzinger, G.: Surflet-pair-relation histograms: A statistical 3D-shape representation for rapid classification. In: Proc. Int’l Conf. 3-D Digital Imaging and Modeling (3DIM), pp. 474–481 (October 2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Mitsubishi Electric Research Labs (MERL), Cambridge, MA, USA
Oncel Tuzel, Ming-Yu Liu, Yuichi Taguchi & Arvind Raghunathan

Authors

Oncel Tuzel
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Taguchi
View author publications
You can also search for this author in PubMed Google Scholar
Arvind Raghunathan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, 6 King’s College Road, M5H 3S5, Toronto, ON, Canada
David Fleet
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague, Technicka 2, 166 27, Prague 6, Czech Republic
Tomas Pajdla
Max-Planck-Institut für Informatik, Campus E1 4, 66123, Saarbrücken, Germany
Bernt Schiele
PSI, iMinds, KU Leuven, ESAT, Kasteelpark Arenberg 10, Bus 2441, 3001, Leuven, Belgium
Tinne Tuytelaars

1 Electronic Supplementary Material

Electronic Supplementary Material (MOV 20,586 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tuzel, O., Liu, MY., Taguchi, Y., Raghunathan, A. (2014). Learning to Rank 3D Features. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds) Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, vol 8689. Springer, Cham. https://doi.org/10.1007/978-3-319-10590-1_34

Download citation

DOI: https://doi.org/10.1007/978-3-319-10590-1_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10589-5
Online ISBN: 978-3-319-10590-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Learning to Rank 3D Features

Abstract

Chapter PDF

Similar content being viewed by others

3D object recognition using scale-invariant features

A fast 3D object recognition algorithm using plane-constrained point pair features

Ultrarobust support vector registration

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

1 Electronic Supplementary Material

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Learning to Rank 3D Features

Abstract

Chapter PDF

Similar content being viewed by others

3D object recognition using scale-invariant features

A fast 3D object recognition algorithm using plane-constrained point pair features

Ultrarobust support vector registration

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

1 Electronic Supplementary Material

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation