Abstract
For calculating 3D information with stereo matching, usually correspondence analysis yields a so-called depth hypotheses cost stack, which contains information about similarities of the visible structures at all positions of the analyzed stereo images. Often those cost values comprise a large amount of noise and/or ambiguities, so that regularization is required. The Conditional Random Field (CRF) regularizer from Shekhovtsov et al. [Sh16] is a very good algorithm among various methods. Due to the usual iterative nature of those regularizers, they often do not meet the strict speed and memory requirements posed in many real-world applications. In this paper, we propose to substitute Shekhovtsov’s CRF algorithm with an especially designed U-shaped 3D Convolutional Neural Network (3D-CRF-CNN), which is taught proper regularization by the CRF algorithm as a teacher. Our experiments have shown, that such a 3D-CRF-CNN is not only able to mimic the CRF’s regularizing behavior, but - if properly setup - also comprises remarkable generalization capabilities compared to a state-of-the-art 2D-CNN that is trained on a slightly different, yet equivalent, task. The advantages of such a CNN regularizer are its predictable computational performance and its relatively simple architectural structure, which allows for easy development, speed up, and deployment. We demonstrate the feasibility of the concept of training a 3D-CRF-CNN to take over CRF’s regularizing functionality on the basis of available test data and show that it pays off to invest special effort into tailoring an according CNN architecture.
Chapter PDF
Similar content being viewed by others
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access Dieses Kapitel wird unter der Creative Commons Namensnennung 4.0 International Lizenz (http://creativecommons.org/licenses/by/4.0/deed.de) veröffentlicht, welche die Nutzung, Vervielfältigung, Bearbeitung, Verbreitung und Wiedergabe in jeglichem Medium und Format erlaubt, sofern Sie den/die ursprünglichen Autor(en) und die Quelle ordnungsgemäß nennen, einen Link zur Creative Commons Lizenz beifügen und angeben, ob Änderungen vorgenommen wurden.
Die in diesem Kapitel enthaltenen Bilder und sonstiges Drittmaterial unterliegen ebenfalls der genannten Creative Commons Lizenz, sofern sich aus der Abbildungslegende nichts anderes ergibt. Sofern das betreffende Material nicht unter der genannten Creative Commons Lizenz steht und die betreffende Handlung nicht nach gesetzlichen Vorschriften erlaubt ist, ist für die oben aufgeführten Weiterverwendungen des Materials die Einwilligung des jeweiligen Rechteinhabers einzuholen.
Copyright information
© 2020 Der/die Herausgeber bzw. der/die Autor(en)
About this paper
Cite this paper
Soukup, D., Štolc, S., Thanner, P. (2020). Deep Learning as Substitute for CRF Regularization in 3D Image Processing. In: Jasperneite, J., Lohweg, V. (eds) Kommunikation und Bildverarbeitung in der Automation. Technologien für die intelligente Automation, vol 12. Springer Vieweg, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-59895-5_21
Download citation
DOI: https://doi.org/10.1007/978-3-662-59895-5_21
Published:
Publisher Name: Springer Vieweg, Berlin, Heidelberg
Print ISBN: 978-3-662-59894-8
Online ISBN: 978-3-662-59895-5
eBook Packages: Computer Science and Engineering (German Language)