A new stereo matching energy model based on image local features

Hongjin, Zhang; Hui, Wei; Gang, Ma

doi:10.1007/s11042-023-14706-5

A new stereo matching energy model based on image local features

Published: 10 March 2023

Volume 82, pages 35651–35684, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

279 Accesses
1 Altmetric
Explore all metrics

Abstract

This paper constructs an energy model based on local features used in stereo matching. The local features include the similarity between different image areas, the matching cost function pattern, the connection between neighbor pixels, and the occlusion geometric relationship. Based on these features, we define the weight of each data term and smoothing term in the energy function and then design an algorithm to solve the energy model and get disparity results. The significant improvements of this paper include as following. 1) We modify the structure of the energy function. First, we define the weight of the data term based on the reliability of its corresponding disparity result, which is obtained by cost function features and the occlusion geometric relationship. Then we define the weight of the smoothing term by analyzing the characteristic relation between neighbor super-pixels. We can also reduce the computational complexity by detecting and reducing some low-strength connections. 2) We proposed an algorithm based on pairwise Markov random field (MRF) (Taniai et al., IEEE Trans Pattern Anal Machine Intell 40(11): 2725–2739, 2017) and local greedy iteratively, which can be used to solve the energy model. 3) In post-optimation, we select some areas with severe occlusion and fewer matching clues for post-interpolation fitting to optimize the results. The experiment shows that the proposed method reduced the average percentage of bad pixels (in bad 3) to 6.06 on the Middlebury dataset and 1.42 on the KITTI dataset. Finally, we compare our results with those of MC-Cnn (Zbontar and LeCun 2015), CF-Net (Shen et al., 2021), Guided-Stereo (Poggi et al., 2019), Gwc-Net (Guo et al., 2019) and Patchmatch-Net(PM-Net) (Wang et al., 2021) to verify the improved speed and accuracy of our algorithm, especially at recognizing the depth of changing edges and small objects. This paper’s relevant research can contribute to practical engineering practices such as assisted vision, intelligent driving, and robot grasping control.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 15

Fig. 17

Local Stereo Matching Based on Information Entropy of Image

Article 24 August 2016

A novel non-parametric transform stereo matching method based on mutual relationship

Article 02 January 2019

Stereo matching algorithm with guided filter and modified dynamic programming

Article 05 November 2015

Data Availability

The data that support the findings of this study are available in “MiddleBury” at ”https://vision.middlebury.edu/stereo/data/” and in “KITTI” at “http://www.cvlibs.net/datasets/kitti/eval_scene_flow.php?benchmark=stereo””.

References

Bai C, Ma Q, Hao P, Liu Z, Zhang J (2018) Improving stereo matching algorithm with adaptive cross-scale cost aggregation. Int J Adv Robot Syst 15(1):1729881417751544
Article Google Scholar
Bleyer M, Rhemann C, Rother C (2011) Patchmatch stereo-stereo matching with slanted support windows. In: Bmvc, vol 11, pp 1–11
Chang J-R, Chen Y-S (2018) Pyramid stereo matching network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5410–5418
Chang Y-J, Ho Y-S (2017) Pixel-based adaptive normalized cross correlation for illumination invariant stereo matching. Electronic Imaging 2017(5):124–129
Article Google Scholar
Chang T-A, Lu X, Yang J-F (2017) Robust stereo matching with trinary cross color census and triple image-based refinements. EURASIP Journal on Advances in Signal Processing 2017(1):1–13
Article Google Scholar
Chen Q, Bise R, Gu L, Zheng Y, Sato I, Hwang J-N, Imanishi N, Aiso S (2017) Virtual blood vessels in complex background using stereo x-ray images. In: Proceedings of the IEEE international conference on computer vision workshops, pp 99–106
Chen C, Seff A, Kornhauser A, Xiao J (2015) Deepdriving: Learning affordance for direct perception in autonomous driving. In: Proceedings of the IEEE international conference on computer vision, pp 2722–2730
Cheng F, Zhang H, Yuan D, Sun M (2014) Stereo matching by using the global edge constraint. Neurocomputing 131:217–226
Article Google Scholar
Geiger A, Roser M, Urtasun R (2010) Efficient large-scale stereo matching. In: Asian Conference on computer vision. Springer, pp 25–38
Gu X, Fan Z, Zhu S, Dai Z, Tan F, Tan P (2020) Cascade cost volume for high-resolution multi-view stereo and stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2495–2504
Guo X, Yang K, Yang W, Wang X, Li H (2019) Group-wise correlation stereo network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3273–3282
Hamid MS, Abd Manap N, Hamzah RA, Kadmin AF (2020) Stereo matching algorithm based on deep learning: A survey. Journal of King Saud University-Computer and Information Sciences
Hamzah RA, Kadmin AF, Hamid MS, Ghani SFA, Ibrahim H (2018) Improvement of stereo matching algorithm for 3d surface reconstruction. Signal Process Image Commun 65:165–172
Article Google Scholar
Hirschmüller H, Innocent PR, Garibaldi J (2002) Real-time correlation-based stereo vision with reduced border errors. Int J Comput Vis 47(1):229–246
Article MATH Google Scholar
Hosni A, Bleyer M, Gelautz M, Rhemann C (2009) Local stereo matching using geodesic support weights. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 2093–2096
Huang C-S, Huang Y-H, Chan D-Y, Yang J-F (2020) Shape-reserved stereo matching with segment-based cost aggregation and dual-path refinement. EURASIP Journal on Image and Video Processing 2020(1):1–19
Article Google Scholar
Jellal RA, Lange M, Wassermann B, Schilling A, Zell A (2017) Ls-elas: Line segment based efficient large scale stereo matching. In: 2017 IEEE international conference on robotics and automation (ICRA). IEEE, pp 146–152
Jiao J, Yang Q, He S, Gu S, Zhang L, Lau RW (2017) Joint image denoising and disparity estimation via stereo structure pca and noise-tolerant cost. Int J Comput Vis 124(2):204–222
Article MathSciNet Google Scholar
Jingui Z, Ying W, Liyuan M (2018) A new stereo matching algorithm based on adaptive weight sad algorithm and census algorithm. Bulletin of Surveying and Mapping (11),11
Knyaz VA, Kniaz VV, Remondino F, Zheltov SY, Gruen A (2020) 3d reconstruction of a complex grid structure combining uas images and deep learning. Remote Sens 12(19):3128
Article Google Scholar
Kong L, Sun X, Rahman M, Xu M (2020) A 3d measurement method for specular surfaces based on polarization image sequences and machine learning. CIRP Ann 69(1):497–500
Article Google Scholar
Kong L, Zhu J, Ying S (2021) Local stereo matching using adaptive cross-region-based guided image filtering with orthogonal weights. Math Probl Eng 2021
Lai H-Y, Tsai Y-H, Chiu W-C (2019) Bridging stereo matching and optical flow via spatiotemporal correspondence. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1890–1899
Lee I, Moon B (2017) An improved stereo matching algorithm with robustness to noise based on adaptive support weight. J Inform Process Syst 13 (2):256–267
Google Scholar
Li H, Li Z, Huang J, Meng B, Zhang Z (2021) Accurate hierarchical stereo matching based on 3d plane labeling of superpixel for stereo images from rovers. Int J Adv Robot Syst 18(2):17298814211002113
Article Google Scholar
Li L, Zhang S, Yu X, Zhang L (2016) Pmsc: Patchmatch-based superpixel cut for accurate stereo matching. IEEE Trans Circuits Syst Video Technol 28(3):679–692
Article Google Scholar
Lim J, Lee S (2018) Patchmatch-based robust stereo matching under radiometric changes. IEEE Trans Pattern Anal Machine Intell 41(5):1203–1212
Article Google Scholar
Lin C, Li Y, Xu G, Cao Y (2017) Optimizing zncc calculation in binocular stereo matching. Signal Process Image Commun 52:64–73
Article Google Scholar
Luo W, Schwing AG, Urtasun R (2016) Efficient deep learning for stereo matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5695–5703
Mehltretter M, Heipke C (2019) Cnn-based cost volume analysis as confidence measure for dense matching. In: Proceedings of the IEEE/CVF international conference on computer vision workshops, pp 0–0
Mozerov MG, Van De Weijer J (2019) One-view occlusion detection for stereo matching with a fully connected crf model. IEEE Trans Image Process 28 (6):2936–2947
Odena A, Dumoulin V, Olah C (2016) Deconvolution and checkerboard artifacts. Distill 1(10):3
Article Google Scholar
Pang J, Sun W, Ren JS, Yang C, Yan Q (2017) Cascade residual learning: a two-stage convolutional neural network for stereo matching. In: Proceedings of the IEEE international conference on computer vision workshops, pp 887–895
Poggi M, Pallotti D, Tosi F, Mattoccia S (2019) Guided stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 979–988
Shen Z, Dai Y, Rao Z (2021) Cfnet: Cascade and fused cost volume for robust stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13906–13915
Shi B, Shi S, Wu J, Chen M (2019) A new basic correlation measurement for stereo matching
Sun J, Liu Y, Ding Y, Zhu X, Xi J (2018) Ncc feature matching optimized algorithm based on constraint fusion. In: 2018 IEEE 3rd international conference on image, vision and computing (ICIVC). IEEE, pp 336–341
Taniai T, Matsushita Y, Sato Y, Naemura T (2017) Continuous 3d label stereo matching using local expansion moves. IEEE Trans Pattern Anal Machine Intell 40(11):2725–2739
Article Google Scholar
Wang F, Galliani S, Vogel C, Speciale P, Pollefeys M (2021) Patchmatchnet: Learned multi-view patchmatch stereo. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14194–14203
Wang H, Pathan MS, Dev S (2021) Stereo matching based on visual sensitive information. In: 2021 6th international conference on image, vision and computing (ICIVC). IEEE, pp 312–316
Wang J, Zickler T (2019) Local detection of stereo occlusion boundaries. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3818–3827
Wu W, Zhu H, Yu S, Shi J (2019) Stereo matching with fusing adaptive support weights. IEEE Access 7:61960–61974
Article Google Scholar
Xu H, Zhang J (2020) Aanet: Adaptive aggregation network for efficient stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1959–1968
Xue T, Owens A, Scharstein D, Goesele M, Szeliski R (2019) Multi-frame stereo matching with edges, planes, and superpixels. Image Vis Comput 91:103771
Article Google Scholar
Yang Q (2012) A non-local cost aggregation method for stereo matching. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 1402–1409
Yang Q (2013) Hardware-efficient bilateral filtering for stereo matching. IEEE Trans Pattern Anal Machine Intell 36(5):1026–1032
Article Google Scholar
Ye X, Li J, Wang H, Huang H, Zhang X (2017) Efficient stereo matching leveraging deep local and context information. IEEE Access 5:18745–18755
Article Google Scholar
Zbontar J, LeCun Y (2015) Computing the stereo matching cost with a convolutional neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1592–1599
Zbontar J, LeCun Y et al (2016) Stereo matching by training a convolutional neural network to compare image patches. J Mach Learn Res 17(1):2287–2318
MATH Google Scholar
Zhang X, Gao Q, Pan D, Cao PC, Huang DH (2021) Research on spatial positioning system of fruits to be picked in field based on binocular vision and ssd model. In: Journal of physics: conference series, vol 1748. IOP Publishing, p 042011
Zhang C, Li Z, Cheng Y, Cai R, Chao H, Rui Y (2015) Meshstereo: a global stereo model with mesh alignment regularization for view interpolation. In: Proceedings of the IEEE international conference on computer vision, pp 2057–2065
Zhang H, Li H, Wang Z, Yue Y, Chen S (2020) Geometry and context guided refinement for stereo matching. IET Image Process 14 (12):2652–2659
Article Google Scholar
Zhang B, Zhu D (2019) Key technologies of robot navigation based on machine vision: A review. Automatic Control, Mechatronics and Industrial Engineering, 31–36
Zhi T, Pires BR, Hebert M, Narasimhan SG (2018) Deep material-aware cross-spectral stereo matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1916–1925
Zhu H, Yin J, Yuan D (2017) Svcv: segmentation volume combined with cost volume for stereo matching. IET Comput Vis 11(8):733–743
Article Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Algorithms for Cognitive Models, School of Computer Science, Fudan University, Shanghai, 200433, China
Zhang Hongjin, Wei Hui & Ma Gang

Authors

Zhang Hongjin
View author publications
You can also search for this author in PubMed Google Scholar
Wei Hui
View author publications
You can also search for this author in PubMed Google Scholar
Ma Gang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Hui.

Ethics declarations

We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

(mp4 5.21 MB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Hongjin, Z., Hui, W. & Gang, M. A new stereo matching energy model based on image local features. Multimed Tools Appl 82, 35651–35684 (2023). https://doi.org/10.1007/s11042-023-14706-5

Download citation

Received: 18 November 2021
Revised: 16 June 2022
Accepted: 03 February 2023
Published: 10 March 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s11042-023-14706-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new stereo matching energy model based on image local features

Abstract

Access this article

Similar content being viewed by others

Local Stereo Matching Based on Information Entropy of Image

A novel non-parametric transform stereo matching method based on mutual relationship

Stereo matching algorithm with guided filter and modified dynamic programming

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Additional information

Publisher’s note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A new stereo matching energy model based on image local features

Abstract

Access this article

Similar content being viewed by others

Local Stereo Matching Based on Information Entropy of Image

A novel non-parametric transform stereo matching method based on mutual relationship

Stereo matching algorithm with guided filter and modified dynamic programming

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Additional information

Publisher’s note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation