Efficient Multi-scale Plane Extraction Based RGBD Video Segmentation

Liu, Hong; Wang, Jun; Wang, Xiangdong; Qian, Yueliang

doi:10.1007/978-3-319-51811-4_50

Hong Liu¹⁸,
Jun Wang¹⁸,
Xiangdong Wang¹⁸ &
…
Yueliang Qian¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10132))

Included in the following conference series:

International Conference on Multimedia Modeling

3234 Accesses

Abstract

To improve the robustness and efficiency of RGBD video segmentation, we propose a novel video segmentation method combining multi-scale plane extraction and hierarchical graph-based video segmentation. Firstly, to reduce depth data noise, we extract plane structures of 3D RGBD point clouds in three levels including voxel, pixel and neighborhood with geometry and color features. To solve uneven distribution of depth data and object occlusion problem, we further propose multi-scale voxel based plane fusion algorithm and use amodal completion strategy to improve plane extraction performance. Then hierarchical graph-based RGBD video segmentation is used to segment the rest of the non-plane pixels. Finally, we fuse above plane extraction and video segmentation results to get final RGBD video scene segmentation results. The qualitative and quantitative results of plane extraction and RGBD scene video segmentation show the effectiveness of proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Paris, S., Durand, F.: A topological approach to hierarchical segmentation using mean shift. In: CVPR (2007)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. TPAMI 22(8), 888–905 (2000)
Article Google Scholar
Sharon, E., Galun, M., Sharon, D., Basri, R., Brandt, A.: Hierarchy and adaptivity in segmenting visual scenes. Nature 442(7104), 810–813 (2006)
Article Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient graph-based image segmentation. IJCV 2(59), 167–181 (2004)
Article Google Scholar
Stuckler, J., Behnke, S.: Efficient dense rigid-body motion segmentation and estimation in RGB-D video. IJCV 113(3), 233–245 (2015)
Article MathSciNet Google Scholar
Song, J.K., Gao, L.L., Pusca M.M., et al.: Joint graph learning and video segmentation via multiple cues and topology calibration. In: ACM MM (2016)
Google Scholar
Xu, C., Corso, J.J.: Evaluation of super-voxel methods for early video processing. In: CVPR (2012)
Google Scholar
Corso, J.J., Sharon, E., et al.: Efficient multilevel brain tumor segmentation with integrated Bayesian model classification. IEEE Trans. Med. Imaging 27(5), 629–640 (2008)
Article Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient graph-based image segmentation. IJCV 59(2), 167–181 (2004)
Google Scholar
Gupta, S., Arbeláez, P., Malik, J.: Perceptual organization and recognition of indoor scenes from RGB-D images. In: CVPR (2013)
Google Scholar
Grundmann, M., Kwatra, V., et al.: Efficient hierarchical graph-based video segmentation. In: CVPR (2010)
Google Scholar
Fowlkes, C., Belongie, S., et al.: Spectral grouping using the Nystrom method. TPAMI 26(2), 214–225 (2004)
Article Google Scholar
Steven, H., Stan B., et al.: Efficient hierarchical graph-based segmentation of RGBD videos. In: CVPR (2014)
Google Scholar
Wang, Z., Liu, H., Qian, Y., Xu, T.: Real-time plane segmentation and obstacle detection of 3D point clouds for indoor scenes. In: Fusiello, A., Murino, V., Cucchiara, R. (eds.) ECCV 2012. LNCS, vol. 7584, pp. 22–31. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33868-7_3
Chapter Google Scholar
Wang, Z., Liu, H., Wang, X.D., Qian, Y.L.: Segment and label indoor scene based on RGB-D for the visually impaired. In: MMM (2014)
Google Scholar
labelme.csail.mit.edu
Holz, D., Holzer, S., Rusu, R.B., Behnke, S.: Real-time plane segmentation using RGB-D cameras. In: Röfer, T., Mayer, N.,Michael, Savage, J., Saranlı, U. (eds.) RoboCup 2011. LNCS (LNAI), vol. 7416, pp. 306–317. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32060-6_26
Chapter Google Scholar
Dube, D., Zell, A.: Real-time plane extraction from depth images with the randomized Hough transform. In: ICCV Workshops (2011)
Google Scholar
Liu, H., Wang, J., Qian, Y. L., Wang, X.D.: iSee: obstacle detection and feedback system for the blind. In: UbiComp (2015)
Google Scholar

Download references

Acknowledgments

This work is supported in part by Beijing Natural Science Foundation (4142051) and National Key Technology R&D Program of China (2014BAK15B02).

Author information

Authors and Affiliations

Key Laboratory of Intelligent Information Processing and Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Hong Liu, Jun Wang, Xiangdong Wang & Yueliang Qian

Authors

Hong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiangdong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yueliang Qian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hong Liu .

Editor information

Editors and Affiliations

CNRS–IRISA, Rennes, France
Laurent Amsaleg
Reykjavík University, Reykjavik, Iceland
Gylfi Þór Guðmundsson
Dublin City University, Dublin, Ireland
Cathal Gurrin
Reykjavik University, Reykjavik, Ireland
Björn Þór Jónsson
National Institute of Informatics, Tokyo, Japan
Shin’ichi Satoh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, H., Wang, J., Wang, X., Qian, Y. (2017). Efficient Multi-scale Plane Extraction Based RGBD Video Segmentation. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science(), vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_50

Download citation

DOI: https://doi.org/10.1007/978-3-319-51811-4_50
Published: 31 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51810-7
Online ISBN: 978-3-319-51811-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics