Geodesic Matting: A Framework for Fast Interactive Image and Video Segmentation and Matting
- 1.2k Downloads
An interactive framework for soft segmentation and matting of natural images and videos is presented in this paper. The proposed technique is based on the optimal, linear time, computation of weighted geodesic distances to user-provided scribbles, from which the whole data is automatically segmented. The weights are based on spatial and/or temporal gradients, considering the statistics of the pixels scribbled by the user, without explicit optical flow or any advanced and often computationally expensive feature detectors. These could be naturally added to the proposed framework as well if desired, in the form of weights in the geodesic distances. An automatic localized refinement step follows this fast segmentation in order to further improve the results and accurately compute the corresponding matte function. Additional constraints into the distance definition permit to efficiently handle occlusions such as people or objects crossing each other in a video sequence. The presentation of the framework is complemented with numerous and diverse examples, including extraction of moving foreground from dynamic background in video, natural and 3D medical images, and comparisons with the recent literature.
KeywordsInteractive image and video segmentation Matting Geodesic computations Weighted distance functions Fast algorithms User-provided scribbles
Unable to display preview. Download preview PDF.
- ADOBE SYSTEMS INCORP. (2002). Adobe photoshop user guide. Google Scholar
- ADOBE SYSTEMS INCORP. (2007). Adobe photoshop CS3 new features. http://www.adobe.com/products/photoshop/photoshop.
- Agarwala, A., Hertzmann, A., Salesin, D., & Seitz, S. (2004). Keyframe-based tracking for rotoscoping and animation. In Proceedings of SIGGRAPH’04. Google Scholar
- Bai, X., & Sapiro, G. (2007). A geodesic framework for fast interactive image and video segmentation and matting. In Proc. international conference computer vision, Rio de Janeiro, Brazil, 16–19 October 2007. Google Scholar
- Boykov, Y. Y., & Jolly, M.-P. (2001). Interactive graph cuts for optimal boundary and region segmentation of objects in n-d images. In IEEE ICCV 2001 (Vol. 01, pp. 105). Google Scholar
- Chuang, Y.-Y., Curless, B., Salesin, D. H., & Szeliski, R. (2001). A Bayesian approach to digital matting. In Proceedings of IEEE CVPR 2001 (Vol. 2, pp. 264–271). December 2001. Google Scholar
- Chuang, Y.-Y., Agarwala, A., Curless, B., Salesin, D. H., & Szeliski, R. (2002). Video matting of complex scenes. In SIGGRAPH ’02 (pp. 243–248). Google Scholar
- COREL CORPORATION. (2002). Knockout user guide. Google Scholar
- Criminisi, A., Cross, G., Blake, A., & Kolmogorov, V. (2006). Bilayer segmentation of live video. In Proceedings of IEEE CVPR 2006 (pp. 53–60). Google Scholar
- Grady, L., Schiwietz, T., Aharon, S., & Westermann, R. (2005). Random walks for interactive alpha-matting. In Proceedings of VIIP 2005 (pp. 423–429). Spain, September 2005. ACTA Press. Google Scholar
- Levin, A., Lischinski, D., & Weiss, Y. (2004). Colorization using optimization. SIGGRAPH’04, 23(3), 689–694. Google Scholar
- Levin, A., Lischinski, D., & Weiss, Y. (2006). A closed form solution to natural image matting. In Proceedings of IEEE CVPR 2006 (pp. 61–68). Google Scholar
- Levin, A., Rav-Acha, A., & Lischinski, D. (2007). Spectral matting. In Proceedings of IEEE CVPR 2007, June 2007. Google Scholar
- Rother, C., Kolmogorov, V., & Blake, A. (2004). Grabcut: Interactive foreground extraction using iterated graph cuts. In SIGGRAPH’04. Google Scholar
- Sapiro, G. (2001). Geometric partial differential equations and image processing. Cambridge: Cambridge University Press. Google Scholar
- Sinop, A. K., & Grady, L. (2007). A seeded image segmentation framework unifying graph cuts and random walker which yields a new algorithm. In Proc. international conference computer vision, Rio de Janeiro, Brazil, 16–19 October 2007. Google Scholar
- Sun, J., Jia, J., Tang, C.-K., & Shum, H.-Y. (2004). Poisson matting. In SIGGRAPH’04 (pp. 315–321). Google Scholar
- Sun, J., Kang, S. B., Xu, Z., Tang, X., & Shum, H.-Y. (2007). Flash cut: Foreground extraction with flash and no-flash image pairs. In Proceedings of IEEE CVPR 2007, June 2007. Google Scholar
- Thorup, M. (1997). Undirected single source shortest paths in linear time. In Proceedings IEEE symposium on foundations of computer science (pp. 12–21). Google Scholar
- Wang, J., & Cohen, M. F. (2005). An iterative optimization approach for unified image segmentation and matting. In Proceedings of IEEE ICCV 2005 (pp. 936–943). Google Scholar
- Wang, J., & Cohen, M. (2007b). Optimized color sampling for robust matting. In Proceedings of IEEE CVPR 2007, June 2007. Google Scholar
- Wang, J., Bhat, P., Colburn, R. A., Agrawala, M., & Cohen, M. F. (2005). Interactive video cutout. SIGGRAPH’05, 24(3), 585–594. Google Scholar
- Wang, J., Agrawala, M., & Cohen, M. F. (2007). Soft scissors: An interactive tool for realtime high quality matting. In SIGGRAPH’07. Google Scholar
- Yang, C., Duraiswami, R., Gumerov, N., & Davis, L. (2003). Improved fast Gauss transform and efficient kernel density estimation. In Proceedings IEEE ICCV 2003 (pp. 464–471). Nice, France. Google Scholar
- Yatziv, L., & Sapiro, G. (2005). Image and video data blending using intrinsic distances. Patent pending. Google Scholar
- Virtual colonoscopy screening resource center. (2005) http://www.vcscreen.com.