Object Categorization in Clutter Using Additive Features and Hashing of Part-Graph Descriptors

Marton, Zoltan-Csaba; Balint-Benczedi, Ferenc; Seidel, Florian; Goron, Lucian Cosmin; Beetz, Michael

doi:10.1007/978-3-642-32732-2_2

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7463)

Included in the following conference series:

International Conference on Spatial Cognition

Abstract

Detecting objects in clutter is an important capability for a household robot executing pick and place tasks in realistic settings. While approaches from 2D vision work reasonably well under certain lighting conditions and given unique textures, the development of inexpensive RGBD cameras opens the way for real-time geometric approaches that do not require templates of known objects.

This paper presents a part-graph-based hashing method for classifying objects in clutter, using an additive feature descriptor. The method is incremental: new training data can be added without rebuilding the complete model, and the additive nature of the feature is exploited to increase efficiency. The method builds a graph representation of the scene by considering possible groupings of over-segmented scene parts, and these groupings are then used in classification. Additionally, the results over multiple segmentations can be accumulated to increase detection accuracy.

We evaluated our approach on a large RGBD dataset containing over 15000 Kinect scans of 102 objects grouped in 16 categories, which we arranged into six geometric classes. We also performed tests on complete cluttered scenes and used them to showcase the importance of domain adaptation.

Author information

Authors and Affiliations

Intelligent Autonomous Systems, Technische Universität München, Germany
Zoltan-Csaba Marton, Ferenc Balint-Benczedi, Florian Seidel, Lucian Cosmin Goron & Michael Beetz

Editor information

Editors and Affiliations

Institute of Computer Science, Albert-Ludwigs-University, Georges-Koehler-Allee 79, 79110, Freiburg, Germany
Cyrill Stachniss
Cognitive Neuroinformatics, University of Bremen, Enrique-Schmidt-Str. 5, 28359, Bremen, Germany
Kerstin Schill
Department of Psychology and School of Education and Social Policy, Northwestern University, 2029 Sheridan Road, 60208-2710, Evanston, IL, USA
David Uttal

Cite this paper

Marton, ZC., Balint-Benczedi, F., Seidel, F., Goron, L.C., Beetz, M. (2012). Object Categorization in Clutter Using Additive Features and Hashing of Part-Graph Descriptors. In: Stachniss, C., Schill, K., Uttal, D. (eds) Spatial Cognition VIII. Spatial Cognition 2012. Lecture Notes in Computer Science, vol 7463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32732-2_2

DOI: https://doi.org/10.1007/978-3-642-32732-2_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32731-5
Online ISBN: 978-3-642-32732-2
eBook Packages: Computer Science, Computer Science (R0)
