Skip to main content

Convolutional Gated Recurrent Units for Obstacle Segmentation in Bird-Eye-View

  • Conference paper
  • First Online:
Computer Aided Systems Theory – EUROCAST 2019 (EUROCAST 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12014))

Included in the following conference series:

  • 908 Accesses

Abstract

Obstacle detection is a fundamental problem in autonomous driving. The most common solutions share the idea of modeling the free-space and marking as obstacles all the points that lie outside this model according to a threshold. Manually setting this threshold and adapting the model to the various scenarios is not ideal, whereas a machine learning approach is more suitable for this kind of task. In this work we present an application of Convolutional Neural Networks (CNNs) for the detection of obstacles in front of a vehicle. Our goal is to train a CNN to understand which patterns in this area are connected to the presence of obstacles. Our method does not require any manual annotation, since the training relies on a classification that comes from a LiDAR. During inference, our network requires as input a 3D point cloud generated from stereoscopic images. Moreover, we make use of recurrent units in our network, since they are able to exploit temporal information to provide more accurate results in case of occlusion. We compare different input configurations and show that our final selection is able to correctly predict the position of obstacles and to generalize well in unseen environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Broggi, A., Caraffi, C., Fedriga, R.I., Grisleri, P.: Obstacle detection with stereo vision for off-road vehicle navigation. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Workshops, 2005. CVPR Workshops, pp. 65–65. IEEE (2005)

    Google Scholar 

  2. Broggi, A., Cardarelli, E., Cattani, S., Sabbatelli, M.: Terrain mapping for off-road autonomous ground vehicles using rational b-spline surfaces and stereo vision. In: Intelligent Vehicles Symposium (IV), 2013 IEEE, pp. 648–653. IEEE (2013)

    Google Scholar 

  3. Chen, X., Ma, H., Wan, J., Li, B., Xia, T.: Multi-view 3D object detection network for autonomous driving. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1907–1915 (2017)

    Google Scholar 

  4. Chen, X., Ma, H., Wan, J., Li, B., Xia, T.: Multi-view 3D object detection network for autonomous driving. In: IEEE CVPR, vol. 1, p. 3 (2017)

    Google Scholar 

  5. Chung, J., Gulcehre, C., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014 (2014)

    Google Scholar 

  6. Dequaire, J., Ondrúška, P., Rao, D., Wang, D., Posner, I.: Deep tracking in the wild: end-to-end tracking using recurrent neural networks. Int. J. Robot. Res. 37(4–5), 492–512 (2018)

    Article  Google Scholar 

  7. Hirschmuller, H.: Stereo processing by semiglobal matching and mutual information. IEEE Trans. Pattern Anal. Mach. Intell. 30(2), 328–341 (2007)

    Article  Google Scholar 

  8. Kakegawa, S., Matono, H., Kido, H., Shima, T.: Road surface segmentation based on vertically local disparity histogram for stereo camera. Int. J. Intell. Transp. Syst. Res. 16(2), 90–97 (2018)

    Google Scholar 

  9. Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization (2014). arXiv preprint arXiv:1412.6980

  10. Labayrade, R., Aubert, D., Tarel, J.P.: Real time obstacle detection in stereovision on non flat road geometry through “v-disparit” representation. In: Intelligent Vehicle Symposium, 2002, IEEE. vol. 2, pp. 646–651. IEEE (2002)

    Google Scholar 

  11. Musleh Lancis, B., Escalera Hueso, A.d.l., Armingol Moreno, J.M.: Uv disparity analysis in urban environments (2011)

    Google Scholar 

  12. Oniga, F., Nedevschi, S.: Processing dense stereo data using elevation maps: road surface, traffic isle, and obstacle detection. IEEE Trans. Veh. Technol. 59(3), 1172–1182 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luigi Musto .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Musto, L., Valenti, F., Zinelli, A., Pizzati, F., Cerri, P. (2020). Convolutional Gated Recurrent Units for Obstacle Segmentation in Bird-Eye-View. In: Moreno-Díaz, R., Pichler, F., Quesada-Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2019. EUROCAST 2019. Lecture Notes in Computer Science(), vol 12014. Springer, Cham. https://doi.org/10.1007/978-3-030-45096-0_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-45096-0_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-45095-3

  • Online ISBN: 978-3-030-45096-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics