Abstract
This paper constructs an energy model based on local features used in stereo matching. The local features include the similarity between different image areas, the matching cost function pattern, the connection between neighbor pixels, and the occlusion geometric relationship. Based on these features, we define the weight of each data term and smoothing term in the energy function and then design an algorithm to solve the energy model and get disparity results. The significant improvements of this paper include as following. 1) We modify the structure of the energy function. First, we define the weight of the data term based on the reliability of its corresponding disparity result, which is obtained by cost function features and the occlusion geometric relationship. Then we define the weight of the smoothing term by analyzing the characteristic relation between neighbor super-pixels. We can also reduce the computational complexity by detecting and reducing some low-strength connections. 2) We proposed an algorithm based on pairwise Markov random field (MRF) (Taniai et al., IEEE Trans Pattern Anal Machine Intell 40(11): 2725–2739, 2017) and local greedy iteratively, which can be used to solve the energy model. 3) In post-optimation, we select some areas with severe occlusion and fewer matching clues for post-interpolation fitting to optimize the results. The experiment shows that the proposed method reduced the average percentage of bad pixels (in bad 3) to 6.06 on the Middlebury dataset and 1.42 on the KITTI dataset. Finally, we compare our results with those of MC-Cnn (Zbontar and LeCun 2015), CF-Net (Shen et al., 2021), Guided-Stereo (Poggi et al., 2019), Gwc-Net (Guo et al., 2019) and Patchmatch-Net(PM-Net) (Wang et al., 2021) to verify the improved speed and accuracy of our algorithm, especially at recognizing the depth of changing edges and small objects. This paper’s relevant research can contribute to practical engineering practices such as assisted vision, intelligent driving, and robot grasping control.
Similar content being viewed by others
Data Availability
The data that support the findings of this study are available in “MiddleBury” at ”https://vision.middlebury.edu/stereo/data/” and in “KITTI” at “http://www.cvlibs.net/datasets/kitti/eval_scene_flow.php?benchmark=stereo””.
References
Bai C, Ma Q, Hao P, Liu Z, Zhang J (2018) Improving stereo matching algorithm with adaptive cross-scale cost aggregation. Int J Adv Robot Syst 15(1):1729881417751544
Bleyer M, Rhemann C, Rother C (2011) Patchmatch stereo-stereo matching with slanted support windows. In: Bmvc, vol 11, pp 1–11
Chang J-R, Chen Y-S (2018) Pyramid stereo matching network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5410–5418
Chang Y-J, Ho Y-S (2017) Pixel-based adaptive normalized cross correlation for illumination invariant stereo matching. Electronic Imaging 2017(5):124–129
Chang T-A, Lu X, Yang J-F (2017) Robust stereo matching with trinary cross color census and triple image-based refinements. EURASIP Journal on Advances in Signal Processing 2017(1):1–13
Chen Q, Bise R, Gu L, Zheng Y, Sato I, Hwang J-N, Imanishi N, Aiso S (2017) Virtual blood vessels in complex background using stereo x-ray images. In: Proceedings of the IEEE international conference on computer vision workshops, pp 99–106
Chen C, Seff A, Kornhauser A, Xiao J (2015) Deepdriving: Learning affordance for direct perception in autonomous driving. In: Proceedings of the IEEE international conference on computer vision, pp 2722–2730
Cheng F, Zhang H, Yuan D, Sun M (2014) Stereo matching by using the global edge constraint. Neurocomputing 131:217–226
Geiger A, Roser M, Urtasun R (2010) Efficient large-scale stereo matching. In: Asian Conference on computer vision. Springer, pp 25–38
Gu X, Fan Z, Zhu S, Dai Z, Tan F, Tan P (2020) Cascade cost volume for high-resolution multi-view stereo and stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2495–2504
Guo X, Yang K, Yang W, Wang X, Li H (2019) Group-wise correlation stereo network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3273–3282
Hamid MS, Abd Manap N, Hamzah RA, Kadmin AF (2020) Stereo matching algorithm based on deep learning: A survey. Journal of King Saud University-Computer and Information Sciences
Hamzah RA, Kadmin AF, Hamid MS, Ghani SFA, Ibrahim H (2018) Improvement of stereo matching algorithm for 3d surface reconstruction. Signal Process Image Commun 65:165–172
Hirschmüller H, Innocent PR, Garibaldi J (2002) Real-time correlation-based stereo vision with reduced border errors. Int J Comput Vis 47(1):229–246
Hosni A, Bleyer M, Gelautz M, Rhemann C (2009) Local stereo matching using geodesic support weights. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 2093–2096
Huang C-S, Huang Y-H, Chan D-Y, Yang J-F (2020) Shape-reserved stereo matching with segment-based cost aggregation and dual-path refinement. EURASIP Journal on Image and Video Processing 2020(1):1–19
Jellal RA, Lange M, Wassermann B, Schilling A, Zell A (2017) Ls-elas: Line segment based efficient large scale stereo matching. In: 2017 IEEE international conference on robotics and automation (ICRA). IEEE, pp 146–152
Jiao J, Yang Q, He S, Gu S, Zhang L, Lau RW (2017) Joint image denoising and disparity estimation via stereo structure pca and noise-tolerant cost. Int J Comput Vis 124(2):204–222
Jingui Z, Ying W, Liyuan M (2018) A new stereo matching algorithm based on adaptive weight sad algorithm and census algorithm. Bulletin of Surveying and Mapping (11),11
Knyaz VA, Kniaz VV, Remondino F, Zheltov SY, Gruen A (2020) 3d reconstruction of a complex grid structure combining uas images and deep learning. Remote Sens 12(19):3128
Kong L, Sun X, Rahman M, Xu M (2020) A 3d measurement method for specular surfaces based on polarization image sequences and machine learning. CIRP Ann 69(1):497–500
Kong L, Zhu J, Ying S (2021) Local stereo matching using adaptive cross-region-based guided image filtering with orthogonal weights. Math Probl Eng 2021
Lai H-Y, Tsai Y-H, Chiu W-C (2019) Bridging stereo matching and optical flow via spatiotemporal correspondence. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1890–1899
Lee I, Moon B (2017) An improved stereo matching algorithm with robustness to noise based on adaptive support weight. J Inform Process Syst 13 (2):256–267
Li H, Li Z, Huang J, Meng B, Zhang Z (2021) Accurate hierarchical stereo matching based on 3d plane labeling of superpixel for stereo images from rovers. Int J Adv Robot Syst 18(2):17298814211002113
Li L, Zhang S, Yu X, Zhang L (2016) Pmsc: Patchmatch-based superpixel cut for accurate stereo matching. IEEE Trans Circuits Syst Video Technol 28(3):679–692
Lim J, Lee S (2018) Patchmatch-based robust stereo matching under radiometric changes. IEEE Trans Pattern Anal Machine Intell 41(5):1203–1212
Lin C, Li Y, Xu G, Cao Y (2017) Optimizing zncc calculation in binocular stereo matching. Signal Process Image Commun 52:64–73
Luo W, Schwing AG, Urtasun R (2016) Efficient deep learning for stereo matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5695–5703
Mehltretter M, Heipke C (2019) Cnn-based cost volume analysis as confidence measure for dense matching. In: Proceedings of the IEEE/CVF international conference on computer vision workshops, pp 0–0
Mozerov MG, Van De Weijer J (2019) One-view occlusion detection for stereo matching with a fully connected crf model. IEEE Trans Image Process 28 (6):2936–2947
Odena A, Dumoulin V, Olah C (2016) Deconvolution and checkerboard artifacts. Distill 1(10):3
Pang J, Sun W, Ren JS, Yang C, Yan Q (2017) Cascade residual learning: a two-stage convolutional neural network for stereo matching. In: Proceedings of the IEEE international conference on computer vision workshops, pp 887–895
Poggi M, Pallotti D, Tosi F, Mattoccia S (2019) Guided stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 979–988
Shen Z, Dai Y, Rao Z (2021) Cfnet: Cascade and fused cost volume for robust stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13906–13915
Shi B, Shi S, Wu J, Chen M (2019) A new basic correlation measurement for stereo matching
Sun J, Liu Y, Ding Y, Zhu X, Xi J (2018) Ncc feature matching optimized algorithm based on constraint fusion. In: 2018 IEEE 3rd international conference on image, vision and computing (ICIVC). IEEE, pp 336–341
Taniai T, Matsushita Y, Sato Y, Naemura T (2017) Continuous 3d label stereo matching using local expansion moves. IEEE Trans Pattern Anal Machine Intell 40(11):2725–2739
Wang F, Galliani S, Vogel C, Speciale P, Pollefeys M (2021) Patchmatchnet: Learned multi-view patchmatch stereo. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14194–14203
Wang H, Pathan MS, Dev S (2021) Stereo matching based on visual sensitive information. In: 2021 6th international conference on image, vision and computing (ICIVC). IEEE, pp 312–316
Wang J, Zickler T (2019) Local detection of stereo occlusion boundaries. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3818–3827
Wu W, Zhu H, Yu S, Shi J (2019) Stereo matching with fusing adaptive support weights. IEEE Access 7:61960–61974
Xu H, Zhang J (2020) Aanet: Adaptive aggregation network for efficient stereo matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1959–1968
Xue T, Owens A, Scharstein D, Goesele M, Szeliski R (2019) Multi-frame stereo matching with edges, planes, and superpixels. Image Vis Comput 91:103771
Yang Q (2012) A non-local cost aggregation method for stereo matching. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 1402–1409
Yang Q (2013) Hardware-efficient bilateral filtering for stereo matching. IEEE Trans Pattern Anal Machine Intell 36(5):1026–1032
Ye X, Li J, Wang H, Huang H, Zhang X (2017) Efficient stereo matching leveraging deep local and context information. IEEE Access 5:18745–18755
Zbontar J, LeCun Y (2015) Computing the stereo matching cost with a convolutional neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1592–1599
Zbontar J, LeCun Y et al (2016) Stereo matching by training a convolutional neural network to compare image patches. J Mach Learn Res 17(1):2287–2318
Zhang X, Gao Q, Pan D, Cao PC, Huang DH (2021) Research on spatial positioning system of fruits to be picked in field based on binocular vision and ssd model. In: Journal of physics: conference series, vol 1748. IOP Publishing, p 042011
Zhang C, Li Z, Cheng Y, Cai R, Chao H, Rui Y (2015) Meshstereo: a global stereo model with mesh alignment regularization for view interpolation. In: Proceedings of the IEEE international conference on computer vision, pp 2057–2065
Zhang H, Li H, Wang Z, Yue Y, Chen S (2020) Geometry and context guided refinement for stereo matching. IET Image Process 14 (12):2652–2659
Zhang B, Zhu D (2019) Key technologies of robot navigation based on machine vision: A review. Automatic Control, Mechatronics and Industrial Engineering, 31–36
Zhi T, Pires BR, Hebert M, Narasimhan SG (2018) Deep material-aware cross-spectral stereo matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1916–1925
Zhu H, Yin J, Yuan D (2017) Svcv: segmentation volume combined with cost volume for stereo matching. IET Comput Vis 11(8):733–743
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
(mp4 5.21 MB)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hongjin, Z., Hui, W. & Gang, M. A new stereo matching energy model based on image local features. Multimed Tools Appl 82, 35651–35684 (2023). https://doi.org/10.1007/s11042-023-14706-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14706-5