State-of-the-Art

Visual Perception for Humanoid Robots

Part of the book series: Cognitive Systems Monographs (COSMOS, volume 38)

Abstract

This chapter presents a perspective on the state of the art in technology and methodology for visual perception for humanoid robots. The focus is on relevant work in robot vision for object recognition with 6D pose estimation and for vision-based 6D global self-localization. Special emphasis is placed on visual perception capabilities in human-centered environments.

Notes

  1. “Simultaneous Localization And Mapping” is the widespread usage of the acronym SLAM (http://openslam.org/). The acronym first appeared as “Software Library for Appearance Modeling” in the work of H. Murase et al. [25] (http://www.cs.columbia.edu/CAVE/software/softlib/slam.php).

  2. A quasi-perceptual experience: it resembles a visual experience but occurs in the absence of external stimuli (see [35]).

  3. A percept is the perceptual input object, a mental impression of something perceived by the senses, viewed as the basic component in the formation of concepts (see cognition within artificial intelligence in [65]).

  4. Active vision systems interact with the environment by controlling viewpoint, exposure, and other acquisition parameters instead of passively observing the scene with a fixed configuration. Active sensing usually operates on sequences of images rather than on single images; see [66] and the illustrative sketch below.
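
    The note above describes active vision only in words. The following minimal Python sketch is not taken from the book or the cited works; all function names, the viewpoint-scoring heuristic, and the stub camera/belief-update routines are illustrative assumptions. It only shows the general shape of such a control loop: choose the next acquisition configuration, acquire an image, update the scene estimate, and repeat over a sequence of images.

    ```python
    """Hypothetical sketch of an active-vision control loop (illustrative only)."""

    import numpy as np


    def expected_gain(view, visit_counts):
        # Crude proxy for expected information gain: prefer the least-observed viewpoint.
        return -visit_counts[view]


    def acquire_image(view, rng):
        # Stand-in for steering the camera to `view` and grabbing a frame.
        return rng.random((48, 64))


    def update_belief(belief, image, view):
        # Stand-in for fusing the new observation into the scene estimate.
        belief[view] = float(image.mean())
        return belief


    def active_vision_loop(candidate_views, steps=5, seed=0):
        rng = np.random.default_rng(seed)
        visit_counts = {v: 0 for v in candidate_views}
        belief = {}
        for _ in range(steps):
            # 1. Actively select the next acquisition configuration (viewpoint).
            view = max(candidate_views, key=lambda v: expected_gain(v, visit_counts))
            # 2. Acquire an image with that configuration.
            image = acquire_image(view, rng)
            # 3. Update the belief from the growing sequence of observations.
            belief = update_belief(belief, image, view)
            visit_counts[view] += 1
        return belief


    if __name__ == "__main__":
        print(active_vision_loop(["left", "center", "right"]))
    ```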

References

  1. Kato, I. 1973. Development of WABOT-1. Biomechanism 2, 173–214. Tokyo: The University of Tokyo Press.

  2. Waseda. 2012. Humanoid History -WABOT. 2012, humanoid Robotics Institute, Waseda University.

  3. Sugano, S., and I. Kato. 1987. WABOT-2: Autonomous Robot with Dexterous Finger-arm–Finger-arm Coordination Control in Keyboard Performance. In IEEE International Conference on Robotics and Automation, vol. 4, 90–97.

  4. Konno, A., K. Nagashima, R. Furukawa, K. Nishiwaki, T. Noda, M. Inaba, and H. Inoue. 1997. Development of a Humanoid Robot Saika. In IEEE-RSJ International Conference on Intelligent Robots and Systems, vol. 2, 805–810.

  5. Yamaguchi, J., S. Inoue, D. Nishino, and A. Takanishi. 1998. Development of a Bipedal Humanoid Robot Having Antagonistic Driven Joints and Three DOF Trunk. In IEEE-RSJ International Conference on Intelligent Robots and Systems, vol. 1, 96–101.

  6. Brooks, R., C. Breazeal, M. Marjanović, B. Scassellati, and M. Williamson. 1999. The Cog Project: Building a Humanoid Robot. In Computation for Metaphors, Analogy, and Agents, ed. Nehaniv C., 52–87. Lecture Notes in Computer Science. Berlin, Heidelberg: Springer.

  7. Scassellati, B. 1998. A Binocular, Foveated Active Vision System. Technical report, MIT Artificial Intelligence Lab, Cambridge, MA, USA

  8. Breazeal, C., A. Edsinger, P. Fitzpatrick, and B. Scassellati. 2001. Active Vision for Sociable Robots. IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans 31 (5): 443–453.

  9. Pfeifer, R., and J. Bongard. 2006. How the Body Shapes the Way We Think: A New View of Intelligence. New York: A Bradford Book.

  10. Inaba, M., T. Igarashi, S. Kagami, and H. Inoue. 1996. A 35 DOF Humanoid that Can Coordinate Arms and Legs in Standing up, Reaching and Grasping an Object. In IEEE-RSJ International Conference on Intelligent Robots and Systems, vol. 1, 29–36.

  11. Inaba, M., and H. Inoue. 1989. Robot Vision Server. In International Symposium on Industrial Robots, 195–202.

  12. Jin, Y., and M. Xie. 2000. Vision Guided Homing for Humanoid Service Robot. In International Conference on Pattern Recognition, vol. 4, 511–514.

  13. Saegusa, R., G. Metta, and G. Sandini. 2010. Own Body Perception based on Visuomotor Correlation. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 1044–1051.

  14. Bartolozzi, C., F. Rea, C. Clercq, M. Hofstatter, D. Fasnacht, G. Indiveri, and G. Metta. 2011. Embedded Neuromorphic Vision for Humanoid Robots. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 129–135.

  15. Vernon, D., G. Metta, and G. Sandini. 2007. The iCub Cognitive Architecture: Interactive Development in a Humanoid Robot. In IEEE International Conference on Development and Learning, 122–127.

  16. Metta, G., L. Natale, F. Nori, and G. Sandini. 2011. The iCub Project: An Open Source Platform for Research in Embodied Cognition. In IEEE Workshop on Advanced Robotics and its Social Impacts, 24–26.

  17. Azad, P. 2008. Visual Perception for Manipulation and Imitation in Humanoid Robots. Ph.D. dissertation, University of Karlsruhe. ISBN 978-3-642-04229-4.

  18. Welke, K. 2011. Memory-Based Active Visual Search for Humanoid Robots. Ph.D. dissertation, KIT, Karlsruhe Institute of Technology, Computer Science Faculty, Institute for Anthropomatics.

  19. Thompson, S., and S. Kagami. 2005. Humanoid Robot Localisation using Stereo Vision. In IEEE-RAS International Conference on Humanoid Robots, 19–25.

  20. Lowe, D. 2004. Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60 (2): 91–110.

  21. Konolige, K. Small Vision System.

  22. Kagami, S., K. Nishiwaki, J.J. Kuffner, Y. Kuniyoshi, M. Inaba, and H. Inoue. 2002. Online 3D Vision, Motion Planning and Bipedal Locomotion Control Coupling System of Humanoid Robot: H7. In IEEE-RSJ International Conference on Intelligent Robots and Systems, vol. 3, 2557–2562.

  23. Dellaert, F., D. Fox, W. Burgard, and S. Thrun. 1999. Monte Carlo Localization for Mobile Robots. In IEEE International Conference on Robotics and Automation, vol. 2, 1322–1328.

  24. Stasse, O., A.J. Davison, R. Sellaouti, and K. Yokoi. 2006. Real-time 3D SLAM for Humanoid Robot considering Pattern Generator Information. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 348–355.

  25. Murase, H., and S. Nayar. 1993. Learning and Recognition of 3D Objects from Appearance. In IEEE Workshop on Qualitative Vision, 39–50.

  26. Davison, A., and N. Kita. 2001. 3D Simultaneous Localisation and Map-building using Active Vision for a Robot Moving on Undulating Terrain. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, 384–391.

  27. Davison, A., I. Reid, N. Molton, and O. Stasse. 2007. MonoSLAM: Real-Time Single Camera SLAM. IEEE Transactions on Pattern Analysis and Machine Intelligence 29 (6): 1052–1067.

  28. Okada, K., M. Kojima, Y. Sagawa, T. Ichino, K. Sato, and M. Inaba. 2006. Vision based Behavior Verification System of Humanoid Robot for Daily Environment Tasks. In IEEE-RAS International Conference on Humanoid Robots, 7–12.

  29. Okada, K., M. Kojima, S. Tokutsu, T. Maki, Y. Mori, and M. Inaba. 2007. Multi-cue 3D Object Recognition in Knowledge-based Vision-guided Humanoid Robot System. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 3217–3222.

  30. Okada, K., M. Kojima, S. Tokutsu, Y. Mori, T. Maki, and M. Inaba. 2008. Task Guided Attention Control and Visual Verification in Tea Serving by the Daily Assistive Humanoid HRP2JSK. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 1551–1557.

  31. Brooks, R., and L. Stein. 1994. Building Brains for Bodies. Autonomous Robots 1: 7–25.

  32. Kuffner, J., K. Nishiwaki, S. Kagami, M. Inaba, and H. Inoue. 2003. Motion Planning for Humanoid Robots. In International Symposium on Robotics Research, 365–374.

  33. Prats, M., S. Wieland, T. Asfour, A. del Pobil, and R. Dillmann. 2008. Compliant Interaction in Household Environments by the Armar-III Humanoid Robot. In IEEE-RAS International Conference on Humanoid Robots, 475–480.

  34. Wieland, S., D. Gonzalez-Aguirre, T. Asfour, and R. Dillmann. 2009. Combining Force and Visual Feedback for Physical Interaction Tasks in Humanoid Robots. In IEEE-RAS International Conference on Humanoid Robots, 439–446.

  35. Gonzalez-Aguirre, D., S. Wieland, T. Asfour, and R. Dillmann. 2009. On Environmental Model-Based Visual Perception for Humanoids. In Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Lecture Notes in Computer Science, eds. Bayro-Corrochano, E., and J.-O. Eklundh, vol. 5856, 901–909. Berlin, Heidelberg: Springer.

  36. Harris, C., and M. Stephens. 1988. A Combined Corner and Edge Detector. In Alvey Vision Conference, 147–151. Manchester, UK.

  37. Noskovicova, L., and R. Ravas. 2010. Subpixel Corner Detection for Camera Calibration. In MECHATRONIKA, International Symposium, 78–80.

  38. Shi, J., and C. Tomasi. 1994. Good Features to Track. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 593–600.

  39. Pérez, P., C. Hue, J. Vermaak, and M. Gangnet. 2002. Color-Based Probabilistic Tracking. In European Conference on Computer Vision-Part I, 661–675. Berlin: Springer.

  40. Canny, J. 1986. A Computational Approach to Edge Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 8 (6): 679–698.

  41. Flint, A., C. Mei, I. Reid, and D. Murray. 2010. Growing Semantically Meaningful Models for Visual SLAM. In IEEE Conference on Computer Vision and Pattern Recognition, 467–474.

  42. Wen, F., X. Chai, Y. Li, W. Zou, K. Yuan, and P. Chen. 2011. An Improved Visual SLAM Algorithm based on Mixed Data Association. In World Congress on Intelligent Control and Automation, 1081–1086.

  43. Civera, J., D. Galvez-Lopez, L. Riazuelo, J.D. Tardos, and J.M.M. Montiel. 2011. Towards Semantic SLAM using a Monocular Camera. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 1277–1284.

  44. Ahn, S., M. Choi, J. Choi, and W.K. Chung. 2006. Data Association Using Visual Object Recognition for EKF-SLAM in Home Environment. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 2588–2594.

  45. Bay, H., T. Tuytelaars, and L.V. Gool. 2006. Surf: Speeded up Robust Features. In European Conference on Computer Vision, 404–417.

  46. Jeong, W.Y., and K.M. Lee. 2006. Visual SLAM with Line and Corner Features. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 2570–2575.

  47. Klank, U., D. Pangercic, R. Rusu, and M. Beetz. 2009. Real-time CAD Model Matching for Mobile Manipulation and Grasping. In IEEE-RAS International Conference on Humanoid Robots, 290–296.

  48. Rusu, R.B., G. Bradski, R. Thibaux, and J. Hsu. 2010. Fast 3D Recognition and Pose Using the Viewpoint Feature Histogram. In IEEE-RSJ International Conference on Intelligent Robots and Systems.

  49. Meeussen, W., M. Wise, S. Glaser, S. Chitta, C. McGann, P. Mihelich, E. Marder-Eppstein, M. Muja, V. Eruhimov, T. Foote, J. Hsu, R. Rusu, B. Marthi, G. Bradski, K. Konolige, B. Gerkey, and E. Berger. 2010. Autonomous Door Opening and Plugging in with a Personal Robot. In IEEE International Conference on Robotics and Automation, 729–736.

  50. Sturm, J., K. Konolige, C. Stachniss, and W. Burgard. 2010. 3D Pose Estimation, Tracking and Model Learning of Articulated Objects from Dense Depth Video using Projected Texture Stereo. In Workshop RGB-D: Advanced Reasoning with Depth Cameras at Robotics: Science and Systems.

  51. Muja, M., R.B. Rusu, G. Bradski, and D.G. Lowe. 2011. REIN - A Fast, Robust, Scalable Recognition Infrastructure. In IEEE International Conference on Robotics and Automation, 2939–2946.

  52. Kakiuchi, Y., R. Ueda, K. Okada, and M. Inaba. 2011. Creating Household Environment Map for Environment Manipulation using Color Range Sensors on Environment and Robot. In IEEE International Conference on Robotics and Automation, 305–310.

  53. Chin, R., and C. Dyer. 1986. Model-based Recognition in Robot Vision. ACM Computing Surveys 18: 67–108.

  54. Ullman, S. 2000. High-Level Vision: Object Recognition and Visual Cognition. The MIT Press. ISBN-10: 0-262-71007-2.

  55. Lepetit, V., and P. Fua. 2005. Monocular Model-Based 3D Tracking of Rigid Objects: A Survey. Foundations and Trends in Computer Graphics and Vision 1 (1): 1–89.

  56. Roth, P.M., and M. Winter. 2008. Survey of Appearance-based Methods for Object Recognition. Institute for Computer Graphics and Vision, Graz University of Technology, Austria, Technical Reports.

  57. Kragic, D., A. Miller, and P. Allen. 2001. Real-time Tracking Meets Online Grasp Planning. In IEEE International Conference on Robotics and Automation, vol. 3, 2460–2465.

  58. Azad, P., T. Asfour, and R. Dillmann. 2006. Combining Appearance-based and Model-based Methods for Real-Time Object Recognition and 6D Localization. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 5339–5344.

  59. Azad, P., T. Asfour, and R. Dillmann. 2009. Combining Harris Interest Points and the SIFT Descriptor for Fast Scale-Invariant Object Recognition. In IEEE-RSJ International Conference on Intelligent Robots and Systems, 4275–4280.

  60. Wei-Hsuan C., H. Chih-Hsien, T. Yi-Che, C. Shih-Hung, Y. Fun, and C. Jen-Shiun. 2009. An Efficient Object Recognition System for Humanoid Robot Vision. In Joint Conferences on Pervasive Computing, 209–214.

  61. Azad, P., D. Muench, T. Asfour, and R. Dillmann. 2011. 6-DoF Model-based Tracking of Arbitrarily Shaped 3D Objects. In IEEE International Conference on Robotics and Automation.

  62. Davison, A. 1998. Mobile Robot Navigation Using Active Vision. Ph.D. dissertation, Robotics Research Group, Department of Engineering Science, University of Oxford.

  63. Faugeras, O. 1995. Stratification of 3-D Vision: Projective, Affine, and Metric Representations. Journal of the Optical Society of America A 12: 465–484.

  64. Gonzalez-Aguirre, D., T. Asfour, and R. Dillmann. 2011. Towards Stratified Model-based Environmental Visual Perception for Humanoid Robots. Pattern Recognition Letters 32 (16): 2254–2260. (advances in Theory and Applications of Pattern Recognition, Image Processing and Computer Vision).

  65. Russell, S., and P. Norvig. 1995. Artificial Intelligence: A Modern Approach. Prentice Hall Series in Artificial Intelligence. ISBN 9780136042594.

  66. Daniilidis, K., and J.-O. Eklundh. 2008. 3-D Vision and Recognition. In Springer Handbook of Robotics, ed. B. Siciliano, and O. Khatib, 543–562. Berlin: Springer.

  67. Gonzalez-Aguirre, D., T. Asfour, E. Bayro-Corrochano, and R. Dillmann. 2008. Model-based Visual Self-localization Using Geometry and Graphs. In International Conference on Pattern Recognition, 1–5.

Author information

Corresponding author

Correspondence to David Israel González Aguirre.

Copyright information

© 2019 Springer Nature Switzerland AG

Cite this chapter

González Aguirre, D.I. (2019). State-of-the-Art. In: Visual Perception for Humanoid Robots. Cognitive Systems Monographs, vol 38. Springer, Cham. https://doi.org/10.1007/978-3-319-97841-3_2
