Practical automatic background substitution for live video

Huang, Haozhi; Fang, Xiaonan; Ye, Yufei; Zhang, Songhai; Rosin, Paul L.

doi:10.1007/s41095-016-0074-0

Practical automatic background substitution for live video

Research Article
Open access
Published: 15 March 2017

Volume 3, pages 273–284, (2017)
Cite this article

Download PDF

You have full access to this open access article

Computational Visual Media Aims and scope Submit manuscript

Practical automatic background substitution for live video

Download PDF

Haozhi Huang¹,
Xiaonan Fang¹,
Yufei Ye¹,
Songhai Zhang¹ &
…
Paul L. Rosin²

1149 Accesses
14 Citations
9 Altmetric
Explore all metrics

Abstract

In this paper we present a novel automatic background substitution approach for live video. The objective of background substitution is to extract the foreground from the input video and then combine it with a new background. In this paper, we use a color line model to improve the Gaussian mixture model in the background cut method to obtain a binary foreground segmentation result that is less sensitive to brightness differences. Based on the high quality binary segmentation results, we can automatically create a reliable trimap for alpha matting to refine the segmentation boundary. To make the composition result more realistic, an automatic foreground color adjustment step is added to make the foreground look consistent with the new background. Compared to previous approaches, our method can produce higher quality binary segmentation results, and to the best of our knowledge, this is the first time such an automatic and integrated background substitution system has been proposed which can run in real time, which makes it practical for everyday applications.

Article PDF

Foreground Segmentation Using Multimode Background Subtraction in Real-Time Perspective

Efficient Background Subtraction under Abrupt Illumination Variations

Foreground Background Segmentation in Front of Changing Footage on a Video Screen

References

Bai, X.; Wang, J.; Simons, D.; Sapiro, G. Video SnapCut: Robust video object cutout using localized classifiers. ACM Transactions on Graphics Vol. 28, No. 3, Article No. 70, 2009.
Google Scholar
Chen, T.; Zhu, J.-Y.; Shamir, A.; Hu, S.-M. Motionaware gradient domain video composition. IEEE Transactions on Image Processing Vol. 22, No. 7, 2532–2544, 2013.
Article Google Scholar
Liu, Z.; Cohen, M. Head-size equalization for better visual perception of video conferencing. In: Proceedings of the IEEE International Conference on Multimedia and Expo, 4, 2005.
Google Scholar
Zhu, Z.; Martin, R. R.; Pepperell, R.; Burleigh, A. 3D modeling and motion parallax for improved videoconferencing. Computational Visual Media Vol. 2, No. 2, 131–142, 2016.
Article Google Scholar
Van Krevelen, D. W. F.; Poelman, R. A survey of augmented reality technologies, applications and limitations. International Journal of Virtual Reality Vol. 9, No. 2, 1–21, 2010.
Google Scholar
Apostoloff, N.; Fitzgibbon, A. Bayesian video matting using learnt image priors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 1, I-407–I-414, 2004.
Google Scholar
Bouwmans, T.; El Baf, F.; Vachon, B. Background modeling using mixture of Gaussians for foreground detection—A survey. Recent Patents on Computer Science Vol. 1, No. 3, 219–237, 2008.
Article Google Scholar
Maddalena, L.; Petrosino, A. A self-organizing approach to background subtraction for visual surveillance applications. IEEE Transactions on Image Processing Vol. 17, No. 7, 1168–1177, 2008.
Article MathSciNet Google Scholar
Tsai, D.-M.; Lai, S.-C. Independent component analysis-based background subtraction for indoor surveillance. IEEE Transactions on Image Processing Vol. 18, No. 1, 158–167, 2009.
Article MathSciNet MATH Google Scholar
Barnich, O.; Van Droogenbroeck, M. ViBe: A universal background subtraction algorithm for video sequences. IEEE Transactions on Image Processing Vol. 20, No. 6, 1709–1724, 2011.
Article MathSciNet MATH Google Scholar
Hofmann, M.; Tiefenbacher, P.; Rigoll, G. Background segmentation with feedback: The pixel-based adaptive segmenter. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 38–43, 2012.
Google Scholar
Sun, J.; Zhang, W.; Tang, X.; Shum, H.-Y. Background cut. In: Computer Vision–ECCV 2006. Leonardis, A.; Bischof, H.; Pinz, A. Eds. Springer Berlin Heidelberg, 628–641, 2006.
Chapter Google Scholar
Criminisi, A.; Cross, G.; Blake, A.; Kolmogorov, V. Bilayer segmentation of live video. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 53–60, 2006.
Google Scholar
Yin, P.; Criminisi, A.; Winn, J.; Essa, I. Bilayer segmentation of webcam videos using tree-based classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 33, No. 1, 30–42, 2011.
Article Google Scholar
Chuang, Y.-Y.; Agarwala, A.; Curless, B.; Salesin, D. H.; Szeliski, R. Video matting of complex scenes. ACM Transactions on Graphics Vol. 21, No. 3, 243–248, 2002.
Article Google Scholar
Gong, M.; Qian, Y.; Cheng, L. Integrated foreground segmentation and boundary matting for live videos. IEEE Transactions on Image Processing Vol. 24, No. 4, 1356–1370, 2015.
Article MathSciNet Google Scholar
Pérez, P.; Gangnet, M.; Blake, A. Poisson image editing. ACM Transactions on Graphics Vol. 22, No. 3, 313–318, 2003.
Article Google Scholar
Jia, J.; Sun, J.; Tang, C.-K.; Shum, H.-Y. Drag-anddrop pasting. ACM Transactions on Graphics Vol. 25, No. 3, 631–637, 2006.
Article Google Scholar
Buchsbaum, G. A spatial processor model for object colour perception. Journal of the Franklin Institute Vol. 310, No. 1, 1–26, 1980.
Article Google Scholar
Finlayson, G. D.; Hordley, S. D.; Hubel, P. M. Color by correlation: A simple, unifying framework for color constancy. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 23, No. 11, 1209–1221, 2001.
Article Google Scholar
Cheng, D.; Prasad, D. K.; Brown, M. S. Illuminant estimation for color constancy: Why spatial-domain methods work and the role of the color distribution. Journal of the Optical Society of America A Vol. 31, No. 5, 1049–1058, 2014.
Article Google Scholar
Cheng, D.; Price, B.; Cohen, S.; Brown, M. S. Beyond white: Ground truth colors for color constancy correction. In: Proceedings of the IEEE International Conference on Computer Vision, 298–306, 2015.
Google Scholar
Omer, I.; Werman, M. Color lines: Image specific color representation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2, II-946–II-953, 2004.
Google Scholar
Levin, A.; Lischinski, D.; Weiss, Y. A closed-form solution to natural image matting. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 30, No. 2, 228–242, 2008.
Article Google Scholar
Land, E. H.; McCann, J. J. Lightness and retinex theory. Journal of the Optical Society of America Vol. 61, No. 1, 1–11, 1971.
Article Google Scholar
Zhang, Y.; Tang, Y.-L.; Cheng, K.-L. Efficient video cutout by paint selection. Journal of Computer Science and Technology Vol. 30, No. 3, 467–477, 2015.
Article Google Scholar
Smith, A. R.; Blinn, J. F. Blue screen matting. In: Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, 259–268, 1996.
Google Scholar
Mumtaz, A.; Zhang, W.; Chan, A. B. Joint motion segmentation and background estimation in dynamic scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 368–375, 2014.
Google Scholar
Zhang, L.; Huang, H.; Fu, H. EXCOL: An extractand- complete layering approach to cartoon animation reusing. IEEE Transactions on Visualization and Computer Graphics Vol. 18, No. 7, 1156–1169, 2012.
Article Google Scholar
Gastal, E. S. L.; Oliveira, M. M. Shared sampling for real-time alpha matting. Computer Graphics Forum Vol. 29, No. 2, 575–584, 2010.
Article Google Scholar
Chen, X.; Zou, D.; Zhou, S.; Zhao, Q.; Tan, P. Image matting with local and nonlocal smooth priors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1902–1907, 2013.
Google Scholar
Wong, B.-Y.; Shih, K.-T.; Liang, C.-K.; Chen, H. H. Single image realism assessment and recoloring by color compatibility. IEEE Transactions on Multimedia Vol. 14, No. 3, 760–769, 2012.
Article Google Scholar
Cohen-Or, D.; Sorkine, O.; Gal, R.; Leyvand, T.; Xu, Y.-Q. Color harmonization. ACM Transactions on Graphics Vol. 25, No. 3, 624–630, 2006.
Article Google Scholar
Kuang, Z.; Lu, P.; Wang, X.; Lu, X. Learning selfadaptive color harmony model for aesthetic quality classification. In: Proceedings of SPIE 9443, the 6th International Conference on Graphic and Image Processing, 94431O, 2015.
Google Scholar
Chen, T.; Cheng, M.-M.; Tan, P.; Shamir, A.; Hu, S.-M. Sketch2Photo: Internet image montage. ACM Transactions on Graphics Vol. 28, No. 5, Article No. 124, 2009.
Google Scholar
Farbman, Z.; Hoffer, G.; Lipman, Y.; Cohen-Or, D.; Lischinski, D. Coordinates for instant image cloning. ACM Transactions on Graphics Vol. 28, No. 3, Article No. 67, 2009.
Google Scholar
Boykov, Y.; Kolmogorov, V. An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. In: Energy Minimization Methods in Computer Vision and Pattern Recognition. Figueiredo, M.; Zerubia, J.; Jain, A. K. Eds. Springer Berlin Heidelberg, 359–374, 2001.
Chapter Google Scholar
Sigari, M. H.; Mozayani, N.; Pourreza, H. R. Fuzzy running average and fuzzy background subtraction: concepts and application. International Journal of Computer Science and Network Security Vol. 8, No. 2, 138–143, 2008.
Google Scholar
Sobral, A. BGSLibrary. 2016. Available at https://github.com/andrewssobral/bgslibrary.
Google Scholar
Rosin, P. L.; Ioannidis, E. Evaluation of global image thresholding for change detection. Pattern Recognition Letters Vol. 24, No. 14, 2345–2356, 2003.
Article MATH Google Scholar
Kerbyson, D. J.; Atherton, T. J. Circle detection using Hough transform filters. In: Proceedings of the 5th International Conference on Image Processing and its Applications, 370–374, 1995.
Chapter Google Scholar
Graphics and Media Lab. Videomatting benchmark. 2016. Available at http://videomatting.com.
Google Scholar

Download references

Acknowledgements

We thank the reviewers for their valuable comments. This work was supported by the National High-Tech R&D Program of China (Project No. 2012AA011903), the National Natural Science Foundation of China (Project No. 61373069), the Research Grant of Beijing Higher Institution Engineering Research Center, and Tsinghua–Tencent Joint Laboratory for Internet Innovation Technology.

Author information

Authors and Affiliations

Department of Computer Science, Tsinghua University, Beijing, 100084, China
Haozhi Huang, Xiaonan Fang, Yufei Ye & Songhai Zhang
School of Computer Science and Informatics, Cardiff University, Cardiff, CF24 3AA, UK
Paul L. Rosin

Authors

Haozhi Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaonan Fang
View author publications
You can also search for this author in PubMed Google Scholar
Yufei Ye
View author publications
You can also search for this author in PubMed Google Scholar
Songhai Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Paul L. Rosin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Songhai Zhang.

Additional information

This article is published with open access at Springerlink.com

Haozhi Huang is currently a Ph.D. student in the Department of Computer Science and Technology, Tsinghua University, China. He received his bachelor degree from Tsinghua University, in 2012. His research interests include image and video editing, and computer graphics.

Xiaonan Fang is currently an undergraduate student in the Department of Computer Science and Technology, Tsinghua University, China. His research interests include computational geometry, image processing, and video processing.

Yufei Ye is currently an undergraduate student in the Department of Computer Science and Technology, Tsinghua University, China. Her research interests lie in video processing, object detection, generative models, unsupervised learning, and representation learning.

Songhai Zhang received his Ph.D. degree from Tsinghua University, China, in 2007. He is currently an associate professor in the Department of Computer Science and Technology, Tsinghua University. His research interests include image and video processing, and geometric computing.

Paul L. Rosin is a professor in the School of Computer Science & Informatics, Cardiff University, UK. Previous posts include lecturer in the Department of Information Systems and Computing, Brunel University London, UK, research scientist at the Institute for Remote Sensing Applications, Joint Research Centre, Ispra, Italy, and lecturer at Curtin University of Technology, Perth, Australia. His research interests include the representation, segmentation, and grouping of curves, knowledge-based vision systems, early image representations, low level image processing, machine vision approaches to remote sensing, methods for evaluation of approximation algorithms, medical and biological image analysis, mesh processing, non-photorealistic rendering, and the analysis of shape in art and architecture.

Electronic supplementary material

Supplementary material, approximately 10.3 MB.

Rights and permissions

Open Access The articles published in this journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.

Reprints and permissions

About this article

Cite this article

Huang, H., Fang, X., Ye, Y. et al. Practical automatic background substitution for live video. Comp. Visual Media 3, 273–284 (2017). https://doi.org/10.1007/s41095-016-0074-0

Download citation

Received: 18 August 2016
Accepted: 20 December 2016
Published: 15 March 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s41095-016-0074-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Practical automatic background substitution for live video

Abstract

Article PDF

Similar content being viewed by others

Foreground Segmentation Using Multimode Background Subtraction in Real-Time Perspective

Efficient Background Subtraction under Abrupt Illumination Variations

Foreground Background Segmentation in Front of Changing Footage on a Video Screen

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material, approximately 10.3 MB.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Practical automatic background substitution for live video

Abstract

Article PDF

Similar content being viewed by others

Foreground Segmentation Using Multimode Background Subtraction in Real-Time Perspective

Efficient Background Subtraction under Abrupt Illumination Variations

Foreground Background Segmentation in Front of Changing Footage on a Video Screen

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material, approximately 10.3 MB.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation