A Shape-Adaptive Low-Complexity Technique for 3D Free-Viewpoint Visual Applications

Chen, Kuan-Hung; Chen, Cheng-Hao; Chang, Chih-Hao; Liu, Jun-Yu; Su, Ching-Lung

doi:10.1007/s00034-014-9870-x

A Shape-Adaptive Low-Complexity Technique for 3D Free-Viewpoint Visual Applications

Published: 14 August 2014

Volume 34, pages 579–604, (2015)
Cite this article

Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Kuan-Hung Chen¹,
Cheng-Hao Chen¹,
Chih-Hao Chang¹,
Jun-Yu Liu¹ &
…
Ching-Lung Su²

203 Accesses
2 Citations
Explore all metrics

Abstract

View synthesis (VS) and depth map generation (DMG) are two of the most crucial operations for facilitating the development of free-viewpoint 3D video systems. This demand for high computational complexity has motivated the development of the proposed design. Conventional designs have directly adopted dedicated hardware to increase the speed of these operations independently, which has resulted in algorithmic-level redundancies. Thus, the conventional per-pixel computation results in high redundancy in both VS and DMG operations. Accordingly, this paper proposes a shape-adaptive low-complexity (SALC) technique for removing such algorithmic-level redundancies for both VS and DMG operations. Moreover, adopting shape-adaptive searching blocks can improve the accuracy of stereo matching (SM), which subsequently enhances the resulting VS. To overcome the problem of border misalignment in VS, a novel edge alignment technique is presented for detecting and aligning misaligned borders. In addition, an unreliable region repairing (URR) technique was adopted to enhance the accuracy of DMG operations, thereby optimizing the image quality. The proposed SALC technique was verified using \(C\)/Matlab simulations. Through the presented complexity reduction without any hardware acceleration, this design can achieve a high performance of 48 fps for synthesizing the virtual views of the CIF format (352 \(\times \) 288) based on a portable platform equipped with a 1 GHz ARM Cortex-A9 RISC processor only. The experimental results revealed that the proposed SALC technique saves up to 96 % computational complexity for VS, and the computation time of DMG operations is reduced to approximately 2 % of that required for conventional SM. The average improved image quality was 32.46 dB (i.e., 7.33 % improvement from the original 30.46 dB) when using the presented edge alignment technique. When considering the overall efficiency evaluation, the proposed SALC-based SM method is 172 times more efficient than the conventional fixed block-sized SM based on the combined indices of the BPCT product. Compared with the designs of previous studies, the proposed SALC-based design obtained superior efficiency in both VS and DMG as a result of the computation reuse property.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Real-time stereo to multi-view conversion system based on adaptive meshing

Article 20 February 2015

Stereo Matching and Viewpoint Synthesis FPGA Implementation

Warping and Blending Enhancement for 3D View Synthesis Based on Grid Deformation

References

D. Aliprandi, E. Piccinelli, Image-based three-dimensional free viewpoint video synthesis, in Proceedings of the IEEE 3DTV Conference (2009), pp. 1–4
J. Canny, A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. pami–8(6), 679–698 (1986)
Article Google Scholar
T.S. Chang, J.I. Guo, C.W. Jen, Hardware-efficient DFT designs with cyclic convolution and subexpression sharing. IEEE Trans. Circuits Syst. II 47(9), 886–892 (2000)
Article Google Scholar
T.S. Chang, C.S. Kung, C.W. Jen, A simple processor core design for DCT/IDCT. IEEE Trans. Circuits Syst. Video Technol. 10(3), 439–447 (2000)
Article Google Scholar
N. Chang, T.H. Tsai, B.H. Hsu, Y.C. Chen, T.S. Chang, Algorithm and architecture of disparity estimation with mini-census adaptive support weight. IEEE Trans. Circuits Syst. Video Technol. 20(6), 792–805 (2010)
Article Google Scholar
S. Chen, L. Williams, View interpolation for image synthesis, in Proceedings of the 20th Annual Conference on Computer Graphics Interactive Techniques (1993), pp. 279–288
S.S. Chen, H.M. Hang, New view generation from a video sequence, in Proceedings of the IEEE International Symposium on Circuits and Systems (1998), pp. 81–84
K.H. Chen, J.I. Guo, J.S. Wang, C.W. Yeh, J.W. Chen, An energy-aware IP core design for the variable-length DCT/IDCT targeting at MPEG4 shape-adaptive transforms. IEEE Trans. Circuits Syst. Video Technol. 15(5), 704–715 (2005)
Article Google Scholar
C.M. Cheng, S.J. Lin, S.H. Lai, K. Lee, Efficient multiple virtual view generation based on reduced depth stereo image for advanced autostereoscopic displays, in Proceedings of the IEEE International Conference on Multimedia Expo (2009), pp. 241–244
K.H. Chen, C.H. Chen, C.H. Chang, Y.C. Yang, Choose your own viewpoint: a high-quality/low-complexity free-viewpoint 3D visual system, in Proceedings of the IEEE International Emerging Signal Processing Applications Conference (Las Vegas, 2012), pp. 1–4
A.K.R. Chowdhury, A. Kale, R. Chellappa, Video synthesis of arbitrary views for approximately planar scenes, in Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing (2003), pp. 497–500
D. Donoho, Wedgelet: nearly minimax estimation of edges. Ann. Stat. 27(3), 859–897 (1999)
Article MATH MathSciNet Google Scholar
Y. Feng, D. Li, K. Luo, M. Zhang, Depth-image based view synthesis for three-dimensional television, in Proceedings of the IEEE Conference on Industrial Electronics Applications (2009), pp. 2428–2431
M. Gong, R. Yang, L. Wang, M. Gong, A performance study on different cost aggregation approaches used in real-time stereo matching. Int. J. Comput. Vis. 75(2), 283–296 (2007)
Article Google Scholar
M. Hariyama, H. Sasaki, M. Kameyama, Architecture of a stereo matching VLSI processor based on hierarchically parallel memory access, in Proceedings of the IEEE International Midwest Symposium on Circuits Systems (2004), pp. II-245-247
M. Hariyama, N. Yokoyama, M. Kameyama, Y. Kobayashi, FPGA implementation of a stereo matching processor based on window-parallel-and-pixel-parallel architecture, in Proceedings of the IEEE Midwest Symposium on Circuits Systems (2005), pp. 1219–1222
A. Hore, D. Ziou, Image quality metrics: PSNR vs. SSIM, in Proceedings of the International Conference on Pattern Recognition (2010), pp. 2366–2369
Y.R. Horng, Y.C. Tseng, T.S. Chang, VLSI architecture for real-time HD1080p view synthesis engine. IEEE Trans. Circuits Syst. Video Technol. 21(9), 1329–1340 (2011)
Article Google Scholar
T. Hosokawa, S. Jarusirisawad, H. Saito, Online video synthesis for removing occluding objects using multiple uncalibrated cameras via plane sweep algorithm, in Proceedings of the 3rd ACM/IEEE International Conference on Distributed Smart Cameras (2009), pp. 1–8
N. Inamoto, H. Saito, Free viewpoint video synthesis and presentation from multiple sporting videos, in Proceedings of the IEEE International Conference on Multimedia Expo (2005), pp. 1–4
N. Inamoto, H. Saito, Virtual viewpoint replay for a soccer match by view interpolation from multiple cameras. IEEE Trans. Multimedia 9(6), 1155–1166 (2007)
Article Google Scholar
S. Jarusirisawad, H. Saito, 3DTV view generation using uncalibrated cameras, in Proceedings of the 3DTV Conference (2008), pp. 57–60
H. Jiang, S. Fels, J.J. Little, Optimizing multiple object tracking and best view video synthesis. IEEE Trans. Multimedia 10(6), 997–1012 (2008)
Article Google Scholar
C. Jin, H. Jeong, Intermediate view synthesis for multi-view 3D displays using belief propagation-based stereo matching, in Proceedings of the 3rd International Conference on Convergence Hybrid Information Technology (2008), pp. 919–924
H.C. Karcithanasis, J.A. Vlontws, VLSI architectures for accurate aotion and disparity estimation using full-search block-matching and edge preserving non-linear smoothing, in Proceedings of the IEEE Workshop VLSI, Signal Processing (1993), pp. 75–83
A. Kubota, K. Aizawa, Virtual view generation by linear processing of two differently focused images, in Proceedings of the International Conference Pattern Recognition (2002), pp. 504–507
W.N. Lie, B.E. Wei, Intermediate view synthesis from binocular images for stereoscopic applications, in Proceedings of the IEEE International Symposium on Circuits Systems (2001), pp. 287–290
Z. Liu, P. An, S. Liu, Z. Zhang, Arbitrary view generation based on DIBR, in Proceedings of the International Symposium on Intelligent Signal Processing Communication Systems (2007), pp. 168–171
J. Lu, S. Rogmans, G. Lafruit, F. Catthoor, Stream-centric stereo matching and view synthesis: a high-speed approach on GPUs. IEEE Trans. Circuits Syst. Video Technol. 19(11), 1598–1611 (2009)
Article Google Scholar
P. Merkle, Y. Morvan, A. Smolic, D. Farin, K. Muller, P.H.N. de With, T. Wiegand, The effects of multiview depth video compression on multiview rendering. Signal Process. Image Commun. 24, 73–88 (2009)
Article Google Scholar
Middlebury Stereo Vision Page [online]. http://vision.middlebury.edu/stereo/
Y. Mori, N. Fukushima, T. Fujii, M. Tanimoto, View generation with 3D warping using depth information for FTV, in Proceedings of the 3DTV Conference (2008), pp. 229–232
Y. Mori, N. Fukushima, T. Yendo, T. Fujii, M. Tanimoto, View generation with 3D warping using depth information for FTV. Signal Process. Image Commun. 24, 65–72 (2009)
Article Google Scholar
K. Muller, A. Smolic, K. Dix, P. Kauff, T. Wiegand, Reliability-based generation and view synthesis in layered depth video, in Proceedings of the IEEE Workshop on Multimedia, Signal Processing (2008), pp. 34–39
L. Nalpantidis, A. Amanatiadis, G. Sirakoulis, N. Kyriakoulis, A. Gasteratos: dense disparity estimation using a hierarchical matching technique from uncalibrated stereo vision, in Proceedings of the International Workshop on Imaging Systems and Techniques (2009)
P. Ndjiki-Nya, M. Koppel, D. Doshkov, H. Lakshman, P. Merkle, K. Muller, T. Wiegand, Depth image based rendering with advanced texture synthesis, in Proceedings of the International Conference on Multimedia Expo (2010), pp. 424–429
J.H. Park, H.W. Park, Fast view interpolation of stereo images using image gradient and disparity triangulation, in Proceedings of the International Conference on Image Processing (2003), pp. 381–384
J.I. Park, S. Inoue, Arbitrary view generation from multiple cameras, in Proceedings of the International Conference on Image Processing (1997), pp. 149–152
S. Pelletier, S.P. Spackman, J.R. Cooperstock, High-resolution video synthesis from mixed-resolution video based on the estimate-and-correct method, in Proceedings of the 7th IEEE Workshops Application Computer Vision (2005), pp. 172–177
N. Plath, S. Knorr, L. Goldmann, T. Sikora, Adaptive image warping for hole prevention in 3D view synthesis. IEEE Trans. Image Process. 22(9), 3420–3432 (2013)
Article MathSciNet Google Scholar
H. Saito, S. Baba, T. Kanade, Appearance-based virtual view generation from multicamera videos captured in the 3-D room. IEEE Trans. Multimedia 5(3), 303–316 (2003)
Article Google Scholar
A. Smolic, K. Mueller, P. Merkle, P. Kauff, T. Wiegand, An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution, in Proceedings of the Picture Coding Symposium (2009), pp. 1–4
A. Smolic, K. Muller, K. Dix, P. Merkle, P. Kauff, T. Wiegand, Intermediate view interpolation based on multiview video plus depth for advanced 3D video systems, in Proceedings of the 15th IEEE International Conference on Image Processing (2008), pp. 2448–2451
M. Solh, G. AlRegib, Hierarchical hole filling for depth-based view synthesis in FTV and 3D video. IEEE J. Sel. Top. Signal Process. 6(5), 495–504 (2012)
Article Google Scholar
J. Sun, N.N. Zheng, H.Y. Shum, Stereo matching using belief propagation. IEEE Trans. Pattern Anal. Mach. Intell. 25(7), 787–800 (2003)
Article Google Scholar
F. Tombari, S. Mattoccia, L. Di Stefano, Segmentation-based adaptive support for accurate stereo correspondence, in Proceedings of the Pacific-Rim Symposium Image Video Technology (2007), pp. 427–438
F. Tombari, S. Mattoccia, L.D. Stefano, E. Addimanda, Classification and evaluation of cost aggregation methods for stereo correspondence, in Proceedings of the IEEE Conference on Computer Vision, Pattern Recognition (2008), pp. 1–8
T.H. Tsai, N. Chang, T.S. Chang, Data reuse analysis of local stereo matching, in Proceedings of the International Symposium on Circuits Systems (2008), pp. 812–815
Y.C. Tseng, N.Y.C. Chang, T.S. Chang, Low-memory cost belief propagation architecture for disparity estimation, in Proceedings of the IEEE International Symposium on Circuits Systems (2009), pp. 153–157
Y.C. Tseng, P.H. Hsu, T.S. Chang, Fast stereo matching with predictive search range, in Proceedings of the Picture Coding Symposium (2010), pp. 506–509
P.K. Tsung, W.Y. Chen, L.F. Ding, S.Y. Chien, L.G. Chen, Cache-based integer motion/disparity estimation for quad-HD H.264/AVC and HD multiview video coding, in Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing (2009), pp. 2013–2017
P.K. Tsung, P.C. Lin, L.F. Ding, S.Y. Chien, L.G. Chen, Single iteration view interpolation for multiview video applications, in Proceedings of the 3DTV Conference (2009), pp. 1–4
T. Uemori, T. Yendo, T. Fujii, M. Tanimoto, View generation by ray-space method in circular camera setup for FTV, in Proceedings of the 3DTV Conference (2008), pp. 233–236
Z. Wang, A.C. Bovik, H.R. Sheikh, E.P. Simoncelli, Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 1–14 (2004)
Article Google Scholar
S.A. White, Applications of distributed arithmetic to digital signal processing: a tutorial review, IEEE ASSP MAGAZINE JULY. (1989)
R.M. Willett, R.D. Nowak, Platelet: a multiscale approach for recovering edges and surfaces in photon-limited medical imaging. IEEE Trans. Med. Imag. 22(3), 332–350 (2003)
Article Google Scholar
X. Xiu, J. Liang, Projective rectification-based view interpolation for multiview video coding and free viewpoint generation, in Proceedings of the Picture Coding Symposium (2009), pp. 1–4
K. Yamamoto, M. Kitahara, H. Kimata, T. Yendo, T. Fujii, M. Tanimoto, S. Shimizu, K. Kamikura, Yashima: multiview video coding using view interpolation and color correction. IEEE Trans. Circuits Syst. Video Technol. 17(11), 1436–1449 (2007)
Article Google Scholar
L. Yang, T. Yendo, M.P. Tehrani, T. Fujii, M. Tanimoto, Error suppression in view synthesis using reliability reasoning for FTV in Proceedings of the 3DTV-Conference (2010), pp. 1–4
S. Yea, A. Vetro, RD-optimized view synthesis prediction for multiview video coding, in Proceedings of the IEEE International Conference on Image Processing (2007), pp. 209–212
S. Yea, A. Vetro, View synthesis prediction for rate-overhead reduction in FTV, in Proceedings of the 3DTV Conference (2008), pp. 145–148
G. Zhang, W. Hua, X. Qin, T.T. Wong, H. Bao, Stereoscopic video synthesis from a monocular video. IEEE Trans. Vis. Comput. Graph. 13(4), 686–696 (2007)
Article Google Scholar
K. Zhang, J. Lu, F. Catthoor, Cross-based local stereo matching using orthogonal integral images. IEEE Trans. Circuits Syst. Video Technol. 19(7), 1073–1079 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronic Engineering, Feng-Chia University, 100 Wen-hwa Rd., Sea-twen, Taichung, 40724, Taiwan, ROC
Kuan-Hung Chen, Cheng-Hao Chen, Chih-Hao Chang & Jun-Yu Liu
Department of Electronic Engineering, National Yunlin University of Science and Technology, 123 University Road, Section 3, Douliou, Yunlin, 64002, Taiwan, ROC
Ching-Lung Su

Authors

Kuan-Hung Chen
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Hao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chih-Hao Chang
View author publications
You can also search for this author in PubMed Google Scholar
Jun-Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ching-Lung Su
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kuan-Hung Chen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, KH., Chen, CH., Chang, CH. et al. A Shape-Adaptive Low-Complexity Technique for 3D Free-Viewpoint Visual Applications. Circuits Syst Signal Process 34, 579–604 (2015). https://doi.org/10.1007/s00034-014-9870-x

Download citation

Received: 18 December 2013
Revised: 28 July 2014
Accepted: 29 July 2014
Published: 14 August 2014
Issue Date: February 2015
DOI: https://doi.org/10.1007/s00034-014-9870-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Shape-Adaptive Low-Complexity Technique for 3D Free-Viewpoint Visual Applications

Abstract

Access this article

Similar content being viewed by others

Real-time stereo to multi-view conversion system based on adaptive meshing

Stereo Matching and Viewpoint Synthesis FPGA Implementation

Warping and Blending Enhancement for 3D View Synthesis Based on Grid Deformation

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Shape-Adaptive Low-Complexity Technique for 3D Free-Viewpoint Visual Applications

Abstract

Access this article

Similar content being viewed by others

Real-time stereo to multi-view conversion system based on adaptive meshing

Stereo Matching and Viewpoint Synthesis FPGA Implementation

Warping and Blending Enhancement for 3D View Synthesis Based on Grid Deformation

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation