Skip to main content

Object Detection and 6D Pose Estimation for Precise Robotic Manipulation in Unstructured Environments

  • Conference paper
  • First Online:
Informatics in Control, Automation and Robotics (ICINCO 2017)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 495))

Abstract

In this paper we present an algorithm for the robust 6D pose estimation with an RGB-D camera in harsh and unstructured environments using object detection. While the pose estimation uses clustering and segmentation to find a robust point in multiple frames to track changes in the position of the camera, its functionality is enhanced with Faster-RCNN for classification and detection, providing the operator with information about the object of interest. This work further facilitates the goal of increasing the robot’s autonomy and helping operators to recover 3D reconstructions of the objects to be manipulated with the robot.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Engel, J., Schps, T., Cremers, D.: LSD-SLAM: Large-scale direct monocular SLAM. In: European Conference on Computer Vision, pp. 834–849, Springer, Cham, Berlin (2014)

    Google Scholar 

  2. Engel, J., Koltun, V., Cremers, D.: Direct sparse odometry. IEEE Trans. Pattern Anal. Mach. Intell. (2017)

    Google Scholar 

  3. Hinterstoisser, S., et al.: Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 858–865 (2011)

    Google Scholar 

  4. Prisacariu, V.A., Reid, I.D.: PWP3D: Real-time segmentation and tracking of 3D objects. Int. J. Comput. Vis. 98(3), 335–354 (2012)

    Article  MathSciNet  Google Scholar 

  5. Song, S., Xiao, J.: Sliding shapes for 3d object detection in depth images. In: European Conference on Computer Vision, pp. 634–651, Springer, Cham, Berlin (2014)

    Google Scholar 

  6. Di Castro, M., Vera, J.C., Masi, A. Ferre, M.: Novel pose estimation system for precise robotic manipulation in unstructured environment, pp. 50–55 (2017). https://doi.org/10.5220/0006426700500055

  7. Zeng, A., et al.: Multi-view self-supervised deep learning for 6d pose estimation in the amazon picking challenge. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 1386–1383, IEEE (2017)

    Google Scholar 

  8. Song, S., Xiao, J.: Deep sliding shapes for amodal 3D object detection in RGB-D images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 808–816 (2016)

    Google Scholar 

  9. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)

    Google Scholar 

  10. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)

    Article  Google Scholar 

  11. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  12. Liu, W., et al.: Ssd: Single shot multibox detector. In: European Conference on Computer Vision, pp. 21–37, Springer, Cham (2016)

    Chapter  Google Scholar 

  13. Hong, S., You, T., Kwak, S., Han, B.: Online tracking by learning discriminative saliency map with convolutional neural network. In: International Conference on Machine Learning pp. 597–606 (2015)

    Google Scholar 

  14. He, K., Gkioxari, G., Dollr, P., Girshick, R.: Mask r-cnn (2017). arXiv:1703.06870

  15. Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2d pose estimation using part affinity fields (2016). arXiv:1611.08050

  16. Zhang, H., et al.: Stackgan: text to photo-realistic image synthesis with stacked generative adversarial networks (2016). arXiv:1612.03242

  17. Lin, T.Y., et al.: Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755, Springer, Cham (2014)

    Google Scholar 

  18. Microsoft: Kinect hardware. https://developer.microsoft.com/en-us/windows/kinect/hardware

  19. Intel: Intel RealSense Camera R200 Datasheet. https://software.intel.com/sites/default/files/managed/d7/a9/realsense-camera-r200-product-datasheet.pdf

  20. Intel: Intel RealSense Camera SR300 Datasheet. https://software.intel.com/sites/default/files/managed/0c/ec/realsense-sr300-product-datasheet-rev-1-0.pdf

  21. Orbbec: Orbbec How our 3D camera works. [Online]. https://orbbec3d.com/products/

  22. Orbbec: Orbbec Astra Pro. https://orbbec3d.com/product-astra-pro/

  23. Lunghi, G., Prades, R.M., Di Castro, M.: An advanced, adaptive and Multimodal Graphical User Interface for Human-robot Teleoperation in Radioactive Scenarios. In: ICINCO (2), pp. 224–231 (2016)

    Google Scholar 

  24. Dworak, A., et al.: Middleware trends and market leaders 2011. In: Conference Proceedings, vol. 111010. No. CERN-ATS-2011-196 (2011)

    Google Scholar 

  25. CERN: Controls Middleware. https://wikis.cern.ch/display/MW/CMW+-+Controls+Middleware/

  26. Di Castro, M., et al.: A dual arm robotic platform control for navigation, inspection and telemanipulation. In: Proceedings of ICALEPCS2017, Barcelona, Spain (2017)

    Google Scholar 

  27. Assmann, R.W., et al.: The final collimation system for the LHC. No. LHC-PROJECT-Report-919 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mario di Castro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

di Castro, M., Camarero Vera, J., Ferre, M., Masi, A. (2020). Object Detection and 6D Pose Estimation for Precise Robotic Manipulation in Unstructured Environments. In: Gusikhin, O., Madani, K. (eds) Informatics in Control, Automation and Robotics . ICINCO 2017. Lecture Notes in Electrical Engineering, vol 495. Springer, Cham. https://doi.org/10.1007/978-3-030-11292-9_20

Download citation

Publish with us

Policies and ethics