Skip to main content

Lifelong Object Localization in Robotic Applications

  • 470 Accesses

Part of the Advances in Intelligent Systems and Computing book series (AISC,volume 1285)

Abstract

One of the most common tasks in assistive robotics is to find some specific object in a home environment. Usually, this task is tackled by adding the objects of interest to a map of the environment as soon as the objects are detected by the vision system of the robot. However, these maps are usually static, and do not take into account the dynamic nature of a home, where anyone could move an object after the robot has seen it. In this paper, we propose a lifelong system to address this problem. The robot takes into account different possible locations for each object, and chooses the more probable one when it is required. We have designed a probability based system that stores possible locations for each object, and updates the probabilities of past locations based on newer detections.

Keywords

  • Image segmentation
  • Object detection
  • Robot vision
  • Deep learning

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-030-62579-5_2
  • Chapter length: 12 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   169.00
Price excludes VAT (USA)
  • ISBN: 978-3-030-62579-5
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   219.99
Price excludes VAT (USA)
Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.
Fig. 7.

References

  1. Tompson, J., Goroshin, R., Jain, A., LeCun, Y., Bregler, C.: Efficient object localization using convolutional networks. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

    Google Scholar 

  2. Girshick, R.: Fast R-CNN. In: The IEEE International Conference on Computer Vision (ICCV) (2015)

    Google Scholar 

  3. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017

    Google Scholar 

  4. Romero-González, C., Martínez-Gómez, J., García-Varea, I.: 3D semantic maps for scene segmentation. In: ROBOT 2017: Third Iberian Robotics Conference, Sevilla, Spain, pp. 603–612 (2018)

    Google Scholar 

  5. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)

    CrossRef  Google Scholar 

  6. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012)

    Google Scholar 

  7. Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014

    Google Scholar 

  8. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(Aug), 2493–2537 (2011)

    Google Scholar 

  9. Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft coco: common objects in context. In: Computer Vision – ECCV 2014, pp. 740–755. Springer, Heidelberg (2014)

    Google Scholar 

  10. Zhou, B., Zhao, H., Puig, X., Fidler, S., Barriuso, A., Torralba, A.: Scene parsing through ade20k dataset. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)

    Google Scholar 

  11. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

    Google Scholar 

  12. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9, June 2015

    Google Scholar 

  13. Romero-González, C., Martínez-Gómez, J., García-Varea, I.: A review of segmentation methods for 3D semantic mapping. In: Robot 2019: Fourth Iberian Robotics Conference, Porto, Portugal, pp. 620–631 (2020)

    Google Scholar 

  14. Rabbani, T., Van Den Heuvel, F., Vosselmann, G.: Segmentation of point clouds using smoothness constraint. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 36(5), 248–253 (2006)

    Google Scholar 

  15. Schoeler, M., Papon, J., Worgotter, F.: Constrained planar cuts - object partitioning for point clouds. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

    Google Scholar 

  16. Zhou, B., Zhao, H., Puig, X., Xiao, T., Fidler, S., Barriuso, A., Torralba, A.: Semantic understanding of scenes through the ADE20K dataset. Int. J. Comput. Vision 127(3), 302–321 (2019)

    CrossRef  Google Scholar 

Download references

Acknowledgments

This work has been partially sponsored by the Spanish Ministry of Science and Innovation under grant numbers TIN2016-77902-C3-1-P and PID2019-106758GB-C33, and by the Regional Council of Education, Culture and Sports of Castilla-La Mancha under grant number SBPLY/17/180501/000493, supported with FEDER funds.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cristina Romero-González .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Verify currency and authenticity via CrossMark

Cite this paper

Romero-González, C., Martínez-Gómez, J., García-Varea, I. (2021). Lifelong Object Localization in Robotic Applications. In: Bergasa, L.M., Ocaña, M., Barea, R., López-Guillén, E., Revenga, P. (eds) Advances in Physical Agents II. WAF 2020. Advances in Intelligent Systems and Computing, vol 1285. Springer, Cham. https://doi.org/10.1007/978-3-030-62579-5_2

Download citation