Interactive Multi-label Segmentation of RGB-D Images

Diebold, Julia; Demmel, Nikolaus; Hazırbaş, Caner; Moeller, Michael; Cremers, Daniel

doi:10.1007/978-3-319-18461-6_24

Julia Diebold¹⁶,
Nikolaus Demmel¹⁶,
Caner Hazırbaş¹⁶,
Michael Moeller¹⁶ &
…
Daniel Cremers¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9087))

Included in the following conference series:

International Conference on Scale Space and Variational Methods in Computer Vision

2326 Accesses
12 Citations

Abstract

We propose a novel interactive multi-label RGB-D image segmentation method by extending spatially varying color distributions [14] to additionally utilize depth information in two different ways. On the one hand, we consider the depth image as an additional data channel. On the other hand, we extend the idea of spatially varying color distributions in a plane to volumetrically varying color distributions in 3D. Furthermore, we improve the data fidelity term by locally adapting the influence of nearby scribbles around each pixel. Our approach is implemented for parallel hardware and evaluated on a novel interactive RGB-D image segmentation benchmark with pixel-accurate ground truth. We show that depth information leads to considerably more precise segmentation results. At the same time significantly less user scribbles are required for obtaining the same segmentation accuracy as without using depth clues.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arbelaez, P., Maire, M., Fowlkes, C.C., Malik, J.: From contours to regions: an empirical evaluation. In: CVPR (2009)
Google Scholar
Batra, D., Kowdle, A., Parikh, D., Luo, J., Chen, T.: iCoseg: interactive co-segmentation with intelligent scribble guidance. In: CVPR (2010)
Google Scholar
Blake, A., Rother, C., Brown, M., Perez, P., Torr, P.: Interactive image segmentation using an adaptive GMMRF model. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3021, pp. 428–441. Springer, Heidelberg (2004)
Google Scholar
Boykov, Y.Y., Jolly, M.-P.: Interactive graph cuts for optimal boundary & region segmentation of objects in ND images. In: ICCV (2001)
Google Scholar
Couprie, C., Farabet, C., Najman, L., LeCun, Y.: Indoor semantic segmentation using depth information. In: ICLR (2013)
Google Scholar
Esser, E., Zhang, X., Chan, T.F.: A general framework for a class of first order primal-dual algorithms for convex optimization in imaging science. SIIMS (2010)
Google Scholar
Hermans, A., Floros, G., Leibe, B.: Dense 3D semantic mapping of indoor scenes from RGB-D images. In: ICRA (2014)
Google Scholar
Hernandez, J. Marcotegui, B.: Morphological segmentation of building facade images. In: ICIP (2009)
Google Scholar
Kohli, P., Ladicky, L., Torr, P.H.S.: Robust higher order potentials for enforcing label consistency. IJCV (2009)
Google Scholar
Levin, A., Lischinski, D., Weiss, Y.: Colorization using optimization. In: TOG (2004)
Google Scholar
Li, Y., Sun, J., Tang, C.-K., Shum, H.-Y.: Lazy snapping. TOG (2004)
Google Scholar
Liu, D., Pulli, K., Shapiro, L.G., Xiong, Y.: Fast interactive image segmentation by discriminative clustering. In: MCMC (2010)
Google Scholar
Lombaert, H., Sun, Y., Grady, L., Xu, C.: A multilevel banded graph cuts method for fast image segmentation. In: ICCV (2005)
Google Scholar
Nieuwenhuis, C., Cremers, D.: Spatially varying color distributions for interactive multilabel segmentation. PAMI (2013)
Google Scholar
Nieuwenhuis, C., Hawe, S., Kleinsteuber, M., Cremers, D.: Co-sparse textural similarity for interactive segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part VI. LNCS, vol. 8694, pp. 285–301. Springer, Heidelberg (2014)
Google Scholar
Pock, T., Chambolle, A.: Diagonal preconditioning for first order primal-dual algorithms in convex optimization. In: ICCV (2011)
Google Scholar
Pock, T., Cremers, D., Bischof, H., Chambolle, A.: An algorithm for minimizing the mumford-shah functional. In: ICCV (2009)
Google Scholar
Richtsfeld, A., Morwald, T., Prankl, J., Zillich, M., Vincze, M.: Segmentation of unknown objects in indoor environments. In: IROS (2012)
Google Scholar
Rosman, G., Bronstein, A.M., Bronstein, M.M., Tai, X.-C., Kimmel, R.: Group-valued regularization for analysis of articulated motion. In: Fusiello, A., Murino, V., Cucchiara, R. (eds.) ECCV 2012 Ws/Demos, Part I. LNCS, vol. 7583, pp. 52–62. Springer, Heidelberg (2012)
Google Scholar
Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. In: TOG (2004)
Google Scholar
Santner, J., Pock, T., Bischof, H.: Interactive multi-label segmentation. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part I. LNCS, vol. 6492, pp. 397–410. Springer, Heidelberg (2011)
Google Scholar
Shao, T., Xu, W., Zhou, K., Wang, J., Li, D., Guo, B.: An interactive approach to semantic modeling of indoor scenes with an rgbd camera. TOG (2012)
Google Scholar
Silberman, N., Fergus, R.: Indoor scene segmentation using a structured light sensor. In: ICCV (2011)
Google Scholar
Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 746–760. Springer, Heidelberg (2012)
Google Scholar
Silverman, B.: Density estimation for statistics and data analysis. Chapman and Hall Ltd (1986)
Google Scholar
Teboul, O., Simon, L., Koutsourakis, P., Paragios, N.: Segmentation of building facades using procedural shape priors. In: CVPR (2010)
Google Scholar
Vicente, S., Kolmogorov, V., Rother, C.: Joint optimization of segmentation and appearance models. In: ICCV (2009)
Google Scholar
Wang, J.: Discriminative gaussian mixtures for interactive image segmentation. In: ICASSP (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Technical University of Munich, München, Germany
Julia Diebold, Nikolaus Demmel, Caner Hazırbaş, Michael Moeller & Daniel Cremers

Authors

Julia Diebold
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaus Demmel
View author publications
You can also search for this author in PubMed Google Scholar
Caner Hazırbaş
View author publications
You can also search for this author in PubMed Google Scholar
Michael Moeller
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Cremers
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julia Diebold .

Editor information

Editors and Affiliations

University of Bordeaux, Talence, France
Jean-François Aujol
ENS Cachan, Cachan, France
Mila Nikolova
University of Bordeaux, Talence, France
Nicolas Papadakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Diebold, J., Demmel, N., Hazırbaş, C., Moeller, M., Cremers, D. (2015). Interactive Multi-label Segmentation of RGB-D Images. In: Aujol, JF., Nikolova, M., Papadakis, N. (eds) Scale Space and Variational Methods in Computer Vision. SSVM 2015. Lecture Notes in Computer Science(), vol 9087. Springer, Cham. https://doi.org/10.1007/978-3-319-18461-6_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-18461-6_24
Published: 28 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18460-9
Online ISBN: 978-3-319-18461-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics