Abstract
Bi-modal image processing can be defined as a series of steps taken to enhance a target image with a guidance image. This is done by using exploitable information derived from acquiring two images of the same scene with different image modalities. However, while the potential benefit of bi-modal image processing may be significant, there is an inherent risk; if noise or defects in the guidance image are allowed to transfer to the target image, the target image could become corrupted rather than improved. In this chapter, we present a new method to enhance a noisy depth map from its color information via the joint bilateral filter (JBF) based on common distance transform (CDT). This method is composed of two main steps: CDT map generation and CDT-based JBF. In the first step, a CDT map is generated that represents the degree of pixel-modal similarity between a depth pixel and its corresponding color pixel. Then, based on the CDT map, JBF is carried out in order to enhance depth information with the aid of color information. Experimental results show that CDT-based JBF outperforms other conventional methods objectively and subjectively in terms of noise reduction, as well as inherent visual artifacts suppression.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Borgefors G (1988) Hierarchical chamfer matching: a parametric edge matching algorithm. IEEE Trans Pattern Anal Mach Intell 10(6):849–865
Bose K, Ahuja N (2006) Superresolution and noise filtering using moving least squares. IEEE Trans Image Process 15(8):2239–2248
Brooks S, Saunders I, Dodgson NA (2007) Image compression using sparse colour sampling combined with non-linear image processing. In: Proceedings of electronic imaging: human vision and electronic imaging, vol XII
Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Chiabrando F, Chiabrando R, Piatti D, Rinaudo F (2009) Sensors for 3D imaging: metric evaluation and calibration of a CCD/CMOS time-of-flight camera. Sensors 9(12):10080–10096
Cho J, Kim S-Y, Ho YS, Lee KH (2008) Dynamic 3D human actor generation method using a time-of-flight depth camera. IEEE Trans Consum Electron 54(4):1514–1521
Diebel J, Thrun S (2005) An application of Markov random fields to range sensing. In: Proceedings of advances in neural information processing systems, pp 291–298
Dorrington AA, Payne AD, Cree MJ (2010) An evaluation of time-of-flight cameras for close range methodology applications. International archives of photogrammetry, remote sensing and spatial information sciences, vol. XXXVIII, Part 5 Commission V Symposium
Elad M (2002) On the origin of the bilateral filter and ways to improve it. IEEE Trans Image Process 11(10):1141–1150
Fehn C (2003) A 3D-TV system based on video plus depth information. In: Proceedings of asilomar conference on signals, systems and computers, vol 2, pp 1529–1533
Fehn C, Barré R, Pastoor S (2006) Interactive 3-D TV- concepts and key technologies. Proc. IEEE 94(3):524–538
Felzenszwalb PF, Huttenlocher DP (2004) Distance transforms of sampled functions. Cornell computing and information science, TR2004-1963
Feng X, Milanfar P (2002) Multiscale principal components analysis for image local orientation estimation. In: Proceedings of asilomar conference on signals, systems and computers, pp 478–482
Gangwal OP, Djapic B (2010) Real-time implementation of depth map post-processing for 3D-TV in dedicated hardware. In: Proceedings of international conference on consumer electronics
Gonzalez RC, Woods RE (2002) Digital image processing. Prentice Hall, Englewood Cliffs
Grabb R, Tracey C, Puranik A, Davis J (2008) Real-time foreground segmentation via range and color imaging. In: Proceedings of IEEE computer vision and pattern recognition workshops, pp 1–5
Iddan GJ, Yahav G (2001) 3D imaging in the studio and elsewhere. In: Proceedings of SPIE videometrics and optical methods for 3D shape measurements, pp 48–55
Jachalsky J, Schlosser M, Gandolph D (2010) Reliability aware cross multilateral filtering for robust disparity map refinement. In: Proceedings of 3DTV conference
Kauff P, Atzpadin N, Fehn C, Müller M, Schreer O, Smolic A, Tanger R (2007) Depth map creation and image-based rendering for advanced 3D-TV services providing interoperability and scalability. Signal Process. Image Commun. 22(2):217–234
Kim SM, Cha J, Ryu J, Lee KH (2006) Depth video enhancement for haptic interaction using a smooth surface reconstruction. IEICE Trans Inf Syst E89-D(1):37–44
Kim S-Y, Cho J, Koschan A, Abidi M (2010) Spatial and temporal enhancement of depth images captured by a time-of-flight depth sensor. In: Proceedings of IEEE international conference on pattern recognition, pp 2358–2361
Kim S-Y, Cho W, Koschan A, Abidi M (2011) Depth map enhancement using adaptive steering kernel regression based on distance transform. In: Proceedings of international symposium and visual computing, vol 6938. Lecture notes in computer science, pp 291–300
Kim S-Y, Lee SB, Ho YS (2006) Three-dimensional natural video system based on layered representation of depth maps. IEEE Trans Consum Electron 52(3):1035–1042
Kopf J, Cohen MF, Lischinski D, Uyttendaele M (2007) Joint bilateral upsampling. ACM Trans Comput Graph 26(3):1–6
Lee EK, Ho YS (2011) Generation of high-quality depth maps using hybrid camera system for 3-D video. J Vis Commun Image Represent 22:73–84
Petschnigg G, Agrawala M, Hoppe H, Szeliski R, Cohen M, Toyama K (2004) Digital photography with flash and no-flash image pairs. ACM Trans Comput Graph 23(3):664–672
Riemens AK, Gangwal OP, Barenbrug B, Berretty RM (2009) Multi-step joint bilateral depth upsampling. In Proceedings of electronic imaging: visual communications and image processing
Rosenfeld A, Pfaltz J (1968) Distance functions in digital pictures. Pattern Recognit 1:33–61
Scharstein D, Szeliski R (2002) A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int J Comput Vision 47(1–3):7–42
Shao L, Yan R, Li X, Liu Y (2014) From heuristic optimization to dictionary learning: a review and comprehensive comparison of image denoising algorithms. IEEE Trans Cybern. doi:10.1109/TCYB.2013.2278548
Shotton J, Fitzgibbon A, Cook M, Blake A (2011) Real-time human pose recognition in parts from single depth images. In: Proceedings of IEEE computer vision and pattern recognition, pp 1297–1304
Takeda H, Farsiu S, Milanfar P (2007) Kernel regression for image processing and reconstruction. IEEE Trans Image Process 16(2):349–366
Takeda H, Farsiu S, Milanfar P (2008) Deblurring using regularized locally-adaptive kernel regression. IEEE Trans Image Process 17(4):550–563
Tan H, Tian F, Qiu Y, Wang S, Zhang J (2010) Multihypothesis recursive video denoising based on separation of motion state. IET Image Process 4(4):261–268
Tomasi C, Manduchi R (1998) Bilateral filtering for gray and color images. In: Proceedings of IEEE international conference on computer vision, pp 839–846
Wand MP, Jones MC (1995) Kernel smoothing, ser. Monographs on statistics and applied probability. Chapman and Hall, London
Xiao C, Nie Y, Hua W, Zheng W (2010) Fast multi-scale joint bilateral texture upsampling. Visual Comput 26(3):263–275
Yan R, Shao L, Liu Y (2013) Nonlocal hierarchical dictionary learning using wavelets for image denoising. IEEE Trans Image Process 22(12):4689–4698
Yang Q, Tan KH, Ahuja N (2009) Real-time O(1) bilateral filtering. In: Proceedings of IEEE computer vision and pattern recognition, pp 557–564
Yang Q, Yang R, Davis J, Nistér D (2007) Spatial-depth super resolution for range images. In: Proceedings of IEEE computer vision and pattern recognition, pp 1–8
Yoon SU, Ho YS (2007) Multiple color and depth video coding using a hierarchical representation. IEEE Trans. Circ. Syst. Video Technol. 17(11):1450–1460
Yu H, Zhao L, Wang H (2009) Image denoising using trivariate shrinkage filter in the wavelet domain and joint bilateral filter in the spatial domain. IEEE Trans Image Process 18(10):2364–2369
Acknowledgments
This work was supported in part by the U.S. Air Force under Grant FA8650-10-1-5902. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of Air Force Research Laboratory or the U.S. Government. We are thankful to Dr. Sung-Yeol Kim for the implementation and evaluation of various filters.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Koschan, A., Abidi, M. (2014). Depth Map Denoising via CDT-Based Joint Bilateral Filter. In: Shao, L., Han, J., Kohli, P., Zhang, Z. (eds) Computer Vision and Machine Learning with RGB-D Sensors. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-08651-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-08651-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08650-7
Online ISBN: 978-3-319-08651-4
eBook Packages: Computer ScienceComputer Science (R0)