Skip to main content

Spatio-temporally Coherent Interactive Video Object Segmentation via Efficient Filtering

  • Conference paper
Pattern Recognition (DAGM/OAGM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7476))

Abstract

In this paper we propose a fast, interactive object segmentation and matting framework for videos that allows users to extract objects from a video using only a few foreground scribbles. Our approach is based on recent work [12] that obtains high-quality image segmentations by smoothing the likelihood of a color model with a fast edge-preserving filter. The previous approach was originally intended for single static images and does not achieve temporally coherent segmentations for videos. Our main contribution is to extend the approach of [12] to the temporal domain. Our results are spatially and temporally coherent segmentations, in which the borders of the foreground object are aligned with spatio-temporal color edges in the video. The obtained binary segmentation can be further refined in a temporally coherent and equally efficient alpha matting step. Quantitative and qualitative evaluations show that our extension significantly reduces flickering in the video segmentations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bai, X., Wang, J., Simons, D., Sapiro, G.: Video SnapCut: Robust video object cutout using localized classifiers. In: SIGGRAPH 2009, pp. 70:1–70:11 (2009)

    Google Scholar 

  2. Bai, X., Sapiro, G.: Geodesic matting: A framework for fast interactive image and video segmentation and matting. IJCV 82(2), 113–132 (2009)

    Article  Google Scholar 

  3. Boykov, Y., Jolly, M.P.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: ICCV 2001, vol. 1, pp. 105–112 (2001)

    Google Scholar 

  4. Brosch, N., Rhemann, C., Gelautz, M.: Segmentation-based depth propagation in videos. In: ÖAGM/AAPR 2011, pp. 1–8 (2011)

    Google Scholar 

  5. Criminisi, A., Cross, G., Blake, A., Kolmogorov, V.: Bilayer segmentation of live video. In: CVPR 2006, pp. 53–60 (2006)

    Google Scholar 

  6. He, K., Sun, J., Tang, X.: Guided Image Filtering. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 1–14. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  7. Hosni, A., Rhemann, C., Bleyer, M., Gelautz, M.: Temporally Consistent Disparity and Optical Flow via Efficient Spatio-temporal Filtering. In: Ho, Y.-S. (ed.) PSIVT 2011, Part I. LNCS, vol. 7087, pp. 165–177. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Lee, S.Y., Yoon, J.C., Lee, I.K.: Temporally coherent video matting. Graphical Models 72(3), 25–33 (2010)

    Article  Google Scholar 

  9. Li, Y., Sun, J., Shum, H.Y.: Video object cut and paste. In: SIGGRAPH 2005, pp. 595–600 (2005)

    Google Scholar 

  10. Liu, J., Sun, J., Shum, H.Y.: Paint selection. In: SIGGRAPH 2009, pp. 69:1–69:7 (2009)

    Google Scholar 

  11. Price, B., Morse, B., Cohen, S.: LIVEcut: Learning-based interactive video segmentation by evaluation of multiple propagated cues. In: ICCV 2009, pp. 779–786 (2009)

    Google Scholar 

  12. Rhemann, C., Hosni, A., Bleyer, M., Rother, C., Gelautz, M.: Fast cost-volume filtering for visual correspondence and beyond. In: CVPR 2011, pp. 3017–3024 (2011)

    Google Scholar 

  13. Richardt, C., Orr, D., Davies, I., Criminisi, A., Dodgson, N.A.: Real-time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral Grid. In: Daniilidis, K. (ed.) ECCV 2010, Part III. LNCS, vol. 6313, pp. 510–523. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  14. Rother, C., Kolmogorov, V., Blake, A.: ”GrabCut”: Interactive foreground extraction using iterated graph cuts. In: SIGGRAPH 2004. pp. 309–314 (2004)

    Google Scholar 

  15. Wang, J., Bhat, P., Colburn, R., Agrawala, M., Cohen, M.: Interactive video cutout. Trans. Graph. 24(3), 585–594 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Brosch, N., Hosni, A., Rhemann, C., Gelautz, M. (2012). Spatio-temporally Coherent Interactive Video Object Segmentation via Efficient Filtering. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds) Pattern Recognition. DAGM/OAGM 2012. Lecture Notes in Computer Science, vol 7476. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32717-9_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32717-9_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32716-2

  • Online ISBN: 978-3-642-32717-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics