Abstract
The Microsoft Kinect depth sensor provides depth maps of indoor environments with unknown depth gaps because of infrared (IR) reflectance properties of the objects inside the scene and occlusion. In this paper, we propose a grid-based hierarchical learning algorithm for predicting the depth values of the gaps inside the Kinect’s depth map. The scene is divided into hierarchical grids and the depth of each grid is modeled using supervised learning. The learned models can be directly applied to upcoming frames without repeating the learning procedure and occluded regions in the background can be recovered. The proposed algorithm outperforms an inpainting-based approach quantitatively and successfully recovers the objects that are occluded at the background.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Tomasi, C., Kanade, T.: Shape and Motion from Image Streams Under Orthography: A Factorization Method. International Journal of Computer Vision (IJCV) 9(2), 137–154 (1992)
Zhang, R., Tsai, P., Cryer, J.E., Shah, M.: Shape from Shading: A Survey. Pattern Recognition and Machine Intelligence (PAMI) 21(8), 690–706 (1999)
Favaro, P., Soatto, S.: Shape and Radiance Estimation from the Information Divergence of Blurred Images. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 755–768. Springer, Heidelberg (2000)
Scharstein, D., Szelinski, R.: A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms. IJCV 47(1), 4–42 (2002)
Seitz, S., Curless, B., Diebel, J., Scharstein, D., Szelinski, R.: A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms. In: Computer Vision and Patter Recognition (CVPR), pp. 519–526 (2006)
Dal Mutto, C., Zanuttigh, P., Cortelazzo, G.M.: Time-of-Flight Cameras and Microsoft Kinect. Springer (2012)
Chiu, W.C., Blanke, U., Fritz, M.: Improving the Kinect by Cross-Modal Stereo. In: British Machine Vision Conference (BMVC) (2011)
Chiu, W.C., Blanke, U., Fritz, M.: I Spy with My Little Eye: Learning Optimal Filters for Cross-Modal Stereo Under Projected Patterns. In: ICCV Workshops, pp. 1209–1214 (2011)
Qi, F., Wang, P., Shi, G., Li, F.: Structure Guided Fusion for Depth Map Inpainting. Pattern Recognition Letters (2012)
Matyunin, S., Vatolin, D., Berdnikov, Y., Smirnov, M.: Temporal Filtering for Depth Maps Generated by Kinect Depth Camera. In: 3D Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON), pp. 1–4 (2011)
Saxena, A., Sun, M., Ng, A.Y.: Make3d: Learning 3d Scene Structure from a Single Still Image. PAMI 31(5), 824–840 (2009)
Comaniciu, D., Meer, P.: Mean Shift: A Robust Approach Toward Feature Space Analysis. PAMI 24(5), 603–619 (2002)
Hoiem, D., Efrosand, A., Herbert, M.: Geometric Context from a Single Still Image. In: ICCV, pp. 654–661 (2005)
Burrus, N.: Kinect Calibration, http://labs.manctl.com/rgbdemo/index.php/Documentation/KinectCalibrationTheory.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saygili, G., Balim, C., Kalkan, H., Hendriks, E.A. (2013). Hierarchical Grid-Based Learning Approach for Recovering Unknown Depths in Kinect Depth Maps. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2013. Lecture Notes in Computer Science, vol 7950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39094-4_75
Download citation
DOI: https://doi.org/10.1007/978-3-642-39094-4_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39093-7
Online ISBN: 978-3-642-39094-4
eBook Packages: Computer ScienceComputer Science (R0)