Abstract
This research presents a robust embedded system for following a person, making use of a pipeline of convolutional neural networks. Besides, it features an optical tracking system for supporting the inferences of the neural networks, allowing to determine the position of a person using an RGBD camera. The system is deployed using ROS, and runs in a NVIDIA Jetson TX2, an embedded SoM (System-on-Module), capable of performing computationally demanding tasks onboard, and coping with the complexity required to run a robust tracking and following algorithm. The board is attached to a robotic mobile base, which receives velocity commands to move the system towards the target person.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The tiny version of YOLOv3 is used to the limited memory on the Jetson TX2 board. The full YOLOv3 model requires more memory than the available size.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
References
Pennisi, P., Tonacci, A., Tartarisco, G., Billeci, L., Ruta, L., Gangemi, S., Pioggia, G.: Autism and social robotics: a systematic review, Autism Research (2016)
Cheong, A., et al.: Development of a robotic waiter system,ll IFAC-PapersOnLine, In: 7th IFAC Symposium on Mechatronic Systems MECHATRONICS 2016, vol. 49, no. 21, pp. 681–686 (2016). https://doi.org/10.1016/j.ifacol.2016.10.679, http://www.sciencedirect.com/science/article/pii/S2405896316322911
Gockley, R., Forlizzi, J., Simmons, R.: Natural person-following behavior for social robots. Association for Computing Machinery, New York (2007)
Condés, I., Cañas, J.-M.: Person Following Robot Behavior Using Deep Learning: Proceedings of the 19th International Workshop of Physical Agents (WAF 2018), November 22-23, 2018, Madrid, Spain, pp. 147–161 (2019)
Sidenbladh, H., Kragic, D., Christensen, H.I.: A person following behaviour for a mobile robot. In: 1999 IEEE International Conference on Robotics and Automation, 1999. Proceedings. vol. 1, pp. 670–675. IEEE (1999)
Yoshimi, T., Nishiyama, M., Sonoura, T., Nakamoto, H., Tokura, S., Sato, H., Ozaki, F., Matsuhira, N., Mizoguchi, H.: Development of a person following robot with vision based target detection, In: 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5286–5291. IEEE (2006)
Aguirre, E., García-Silvente, M., Pascual, D.: A multisensor based approach using supervised learning and particle filtering for people detection and tracking. In: Robot : Second Iberian Robotics Conference, vol. 2016, pp. 645–657 Springer, Cham (2015)
Muñoz-Salinas, R., Aguirre, E., García-Silvente, M.: People detection and tracking using stereo vision and color. Image Vision Comput. 25(6), 995–1007 (2007)
Satake, J., Miura, J.: Robust stereo-based person detection and tracking for a person following robot. In: ICRA Workshop on People Detection and Tracking, pp. 1–10 (2009)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005) (2005)
Ilias, B., Shukor, S.A., Yaacob, S., Adom, A., Razali, M.M.: A nurse following robot with high speed Kinect sensor. ARPN J. Eng. Appl. Sci. 9(12), 2454–2459 (2014)
Shimura, K., Ando, Y., Yoshimi, T., Mizukawa, M.: Research on person following system based on RGB-D features by autonomous robot with multi-kinect sensor. In: 2014 IEEE/SICE International Symposium on System Integration (SII) (2014)
Xue, H., Liu, Y., Cai, D., He, X.: Tracking people in RGBD videos using deep learning and motion clues. Neurocomputing 204, 70–76 (2016)
Kouno, D., Shimada, K., Endo, T.: Person identification using top-view image with depth information. In: 2012 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (2012)
You, J., Wu, A., Li, X., Zheng, W.-S.: Top-push video-based person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Yoon, Y., Yoon, H., Kim, J.: Person reidentification in a person-following robot. In: 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN) (2016)
Munaro, M., Ghidoni, S., Dizmen, D.T., Menegatti, E.: A feature-based approach to people re-identification using skeleton keypoints. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 5644–5651. IEEE (2014)
Koide, K., Miura, J.: Identification of a specific person using color, height, and gait features for a person following robot. Robot. Autonomous Syst. 84, 76–87 (2016)
Welsh, J.B.: Real-time pose based human detection and re-identification with a single camera for robot person following, Ph.D. dissertation, University of Maryland, College Park (2017)
Ahmed, E., Jones, M., Marks, T.K.: An improved deep learning architecture for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
McLaughlin, N., Martinez del Rincon, J., Miller, P.: Recurrent convolutional network for video-based person re-identification, In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
“SSD: Single Shot MultiBox Detector.”
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications (2017)
Redmon, J., Farhadi, A.: YOLOv3: An Incremental Improvement (2018)
ModelZoo: TensorFlow Object Detection: Model Zoo. https://modelzoo.co/model/objectdetection
Itzcovich, I.: Faced: CPU Real Time face detection using Deep Learning (2005)
Redmon, J., Farhadi, A.: Yolo9000: Better, faster, stronger (2016)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features, vol. 1, pp. I–511 (2001)
“FaceNet: A unified embedding for face recognition and clustering.”
González-Díaz, I.: Computer Vision: Dense Motion Estimation (2020)
Lucas, B., Kanade, T.: An Iterative Image Registration Technique with an Application to Stereo Vision (IJCAI), vol. 81, April 1981
Shi, J., Tomasi, U.: Good features to track. In: 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (1994)
GoogleAIBlog, “Accelerating Training and Inference with the Tensorflow Object Detection API.” https://ai.googleblog.com/2018/07/accelerated-training-and-inference-with.html
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Condés, I., Cañas, JM., Perdices, E. (2021). Embedded Deep Learning Solution for Person Identification and Following with a Robot. In: Bergasa, L.M., Ocaña, M., Barea, R., López-Guillén, E., Revenga, P. (eds) Advances in Physical Agents II. WAF 2020. Advances in Intelligent Systems and Computing, vol 1285. Springer, Cham. https://doi.org/10.1007/978-3-030-62579-5_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-62579-5_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62578-8
Online ISBN: 978-3-030-62579-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)