Abstract
In multiview/3D video, the amount of data to be transmitted to the decoder increases proportionally with the number of cameras. One way to efficiently compress such video is to use Multiview Video Coding (MVC) which simultaneously reduces temporal and spatial redundancy within the same view and among multiple views. But, existing disparity estimation methods used to extract the redundancy among views are typically too computationally complex. In this paper, we propose a method for fast disparity estimation in multiview/3D video. First, for each block of the predicted frame, the view-dependent geometry and the depth information are used to find the corresponding block in the reference views. Then, a fast and greedy search method is proposed to search the surrounding areas to find the most similar block. Simulation results show that our proposed method achieves better bitrate and quality with much lower computational complexity compared to state-of-the-art methods.
Similar content being viewed by others
References
Afonso V, Susin A, Perleberg M, Conceição R, Corrêa G, Agostini L, Zatt B, Porto M (2018) Hardware-friendly unidirectional disparity-search algorithm for 3D-HEVC. IEEE International Symposium on Circuits and Systems (ISCAS), Florence, Italy, pp. 1–5. https://doi.org/10.1109/ISCAS.2018.8351350
Afonso V, Conceição RA, Saldanha MRF, Braatz LA, Perleberg MR, Corrêa GR, Porto MS, Agostini LV, Zatt B, Susin AA (2019) Energy-aware motion and disparity estimation system for 3D-HEVC with run-time adaptive memory hierarchy. IEEE Trans Circuits Syst Video Technol 29(6):1878–1892. https://doi.org/10.1109/TCSVT.2018.2847633
Aksehir Y, Erdayandi K, Ozcan TZ, Hamzaoglu I (2013) A low energy adaptive motion estimation hardware for H.264 multiview video coding. J Real-Time Image Proc 15:3–12. https://doi.org/10.1007/s11554-013-0383-9
Atienza R (2018) Fast disparity estimation using dense networks. 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia, pp. 3207–3212. https://doi.org/10.1109/ICRA.2018.8463172.
Bjøntegaard G (2001) Calculation of average PSNR differences between R-D curves. Proceedings of the ITU–Telecommunications Standardization Sector STUDY GROUP 16 Video Coding Experts Group (VCEG), 13th Meeting, Austin, TX, USA, pp. 2–4
Boonthep N, Chamnongthai K (2019) A method of motion-estimation-based H.264 video coding using optimal search-range. Wirel Pers Commun. https://doi.org/10.1007/s11277-019-06766-4
Chen Y, Zhao X, Zhang L, Kang J-W (2016) Multiview and 3D Video Compression Using Neighboring Block Based Disparity Vectors. IEEE Trans Multimedia 18(4):576–589. https://doi.org/10.1109/TMM.2016.2525010
Duggal S, Wang S, Ma W-C, Hu R, Urtasun R (2019) DeepPruner: learning efficient stereo matching via differentiable PatchMatch. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), Korea (South), pp. 4383–4392. https://doi.org/10.1109/ICCV.2019.00448
Fan D-P, Lin Z, Zhang Z, Zhu M, Cheng M-M (2020) Rethinking RGB-D salient object detection: models, data sets, and large-scale benchmarks. IEEE Trans Neural Networks Learning Syst:1–1. https://doi.org/10.1109/TNNLS.2020.2996406
Gonzales C, Yeo H, Kuo CJ (1999) Requirements for motion-estimation search range in MPEG-2 coded video. IBM J Res Dev 43(4):453–470. https://doi.org/10.1147/rd.434.0453
Han CH, Lee SW, Choi H (2016) Residual DPCM in HEVC transform skip mode for screen content coding. IEEE Trans Smart Process Comput 5:323–326. https://doi.org/10.5573/IEIESPC.2016.5.5.323
Hartley R, Zisserman A (2003) Multiple view geometry in computer vision. Cambridge University Press. https://doi.org/10.1017/CBO9780511811685
Researcher tools: code & datasets http://research.microsoft.com/en-us/um/people/sbkang/3dvideodownload. Accessed 3 May 2020
ITU-T and ISO/IEC JTC, ERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS, Infrastructure of audiovisual services – Coding of moving video (2013) ITU-T Recommendation H.265 and ISO/IEC 23008–2 (HEVC)
Jiang C, Nooshabadi S (2016) A scalable massively parallel motion and disparity estimation scheme for multiview video coding. IEEE Trans Circuits SystVideo Technol 26:346–359. https://doi.org/10.1109/TCSVT.2015.2402853
Jiang C, Nooshabadi S (2016) Decision zone-based parallel fast motion and disparity estimation scheme for multiview coding. Data compression conference (DCC). Snowbird, UT, USA, pp. 609–609. https://doi.org/10.1109/DCC.2016.20
Jin N, Li F, Lai X (2011) Disparity estimation with disparity field correlation and epipolar geometry constraint for multiview video coding. 2nd International Conference on Intelligent Control and Information Processing. Harbin, China, pp 260–263. https://doi.org/10.1109/ICICIP.2011.6008244
Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG 16 WP3 and ISO/IEC JTC 1/SC 29/WG 11 (2011) HEVC Test Model (HM) 5.0 Reference Software. https://hevc.hhi.fraunhofer.de/HM-doc, Accessed 3 May 2020
Kaup A, Fecker U (2006) Analysis of multi-reference block matching for multi-view video coding. 7th Workshop Digital Broadcasting, Erlangen, Germany, pp. 33–39
Lin C-L, Tsai P-H (2018) Low-complexity encoding scheme for multiview video content. Microsyst Technol 24:4057–4066. https://doi.org/10.1007/s00542-017-3620-5
Micallef BW, Debono CJ, Farrugia RA (2010) Exploiting depth information for fast multi-view video coding. Picture coding symposium (PCS), Nagoya, Japan, pp. 1–4. https://doi.org/10.1109/3DTV.2011.5877170
Micallef BW, Debono CJ, Farrugia RA (2014) Reducing 3D video coding complexity through more efficient disparity estimation. IEEE Trans Consum Electron 60:74–82. https://doi.org/10.1109/TCE.2014.6780928
Nie G-Y, Cheng M-M, Liu Y, Liang Z, Fan D-P, Liu Y, Wang Y (2019) Multi-level context ultra-aggregation for stereo matching. IEEE/CVF conference on computer vision and pattern recognition (CVPR), Long Beach, CA, USA, pp. 3278–3286. https://doi.org/10.1109/CVPR.2019.00340
Ohm J-R (1999) Stereo/multiview video encoding using the mpeg family of standards.Proc. SPIE 3639, Stereoscopic Displays and Virtual Reality Systems VI, San Jose, CA, United States, pp. 242–253. https://doi.org/10.1117/12.349385
Qiao Z, Li X, Zhao D, Liu Y, Gao W (2011) Fast disparity estimation utilizing depth information for multiview video coding. IEEE Int Symposium Circuits Syst. Rio de Janeiro, Brazil, pp. 2805–2808. https://doi.org/10.1109/ISCAS.2011.5938188
Shen L, Liu Z, Yan T, Zhang Z, An P (2010) View-adaptive motion estimation and disparity estimation for low complexity multiview video coding. IEEE Trans Circuits Syst Video Technol 20:925–930. https://doi.org/10.1109/TCSVT.2010.2045910
Song Y, Jia K (2013) Multiview video coding algorithm based on HBP prediction structure. Ninth international conference on intelligent information hiding and Multimedia signal processing. Beijing, China, pp. 173–176. https://doi.org/10.1109/IIH-MSP.2013.52
Nagoya University Multi-view Sequences Download List http://www.fujii.nuee.nagoya-u.ac.jp/multiview-data. Accessed 3 May 2020
Wang Y, Yang J, Mo Y, Xiao C, An W (2018) Disparity estimation for camera arrays using reliability guided disparity propagation. IEEE Access 6:21840–21849. https://doi.org/10.1109/ACCESS.2018.2827085
Zakeri FS, Bätz M, Jaschke T, Keinert J, Chuchvara A (2019) Benchmarking of several disparity estimation algorithms for light field processing. In: Proceeding of SPIE 11172, fourteenth international conference on quality control by artificial vision, p 111721C. https://doi.org/10.1117/12.2521747
Zhang C, Li Z, Cheng Y, Cai R, Chao H, Rui Y (2015) MeshStereo: a global stereo model with mesh alignment regularization for view interpolation. International conference on computer vision (ICCV), Santiago, Chile, pp. 2057–2065. https://doi.org/10.1109/ICCV.2015.238
Zhaoqing P, Zhang Y, Kwong S (2015) Efficient motion and disparity estimation optimization for low complexity multiview video coding. IEEE Trans Broadcast 61:166–176. https://doi.org/10.1109/TBC.2015.2419824
Zhu W, Tian X, Zhou F, Chen Y (2010) Fast disparity estimation using spatio-temporal correlation of disparity field for multiview video coding. IEEE Trans Consum Electron 56:957–964. https://doi.org/10.1109/TCE.2010.5506026
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zandi, G., Roodaki, H. & Shirmohammadi, S. A novel fast search method to find disparity vectors in multiview video coding. Multimed Tools Appl 80, 10821–10837 (2021). https://doi.org/10.1007/s11042-020-10260-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10260-6