Multi-camera Skype: Enhancing the Quality of Experience of Video Conferencing

  • Conference paper

Abstract

We propose a novel approach to real-time control, selection and transmission of the best view of human faces in Skype video conferencing. Our goal is to improve the Quality-of-Experience (QoE) of current video conferencing services by incorporating a real-time multi-camera control and selection mechanism. Traditional 3D viewpoint selection algorithms rely on complex 3D-model computation and are therefore not suitable for real-time applications. We define a new image-based metric, Viewpoint Saliency (VS), for evaluating the quality of views of a human subject, and a centralized multi-camera control mechanism that tracks the subject and selects the best view.
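
As a rough illustration of what centralized best-view selection could look like, the sketch below picks one camera from a set of per-camera scores. It is not the authors' method: the abstract does not define how Viewpoint Saliency is computed, so `scores` simply stands in for any per-camera view-quality value, and the `switch_margin` hysteresis and the function name `select_best_view` are assumptions added for illustration.

```python
from typing import Dict, Optional


def select_best_view(scores: Dict[str, float],
                     current: Optional[str] = None,
                     switch_margin: float = 0.1) -> str:
    """Return the id of the camera whose view should be transmitted.

    scores: hypothetical per-camera view-quality scores (higher is better).
            The paper's Viewpoint Saliency definition is not reproduced here.
    current: id of the camera currently being transmitted, if any.
    switch_margin: assumed hysteresis threshold, so the output does not
                   flicker between cameras with nearly equal scores.
    """
    if not scores:
        raise ValueError("at least one camera score is required")

    # Candidate with the highest score in this frame.
    best = max(scores, key=scores.get)

    # Keep the current camera unless the candidate is clearly better.
    if current in scores and scores[best] - scores[current] < switch_margin:
        return current
    return best
```

For example, calling `select_best_view({"cam0": 0.50, "cam1": 0.55}, current="cam0")` keeps `cam0`, because the improvement is smaller than the assumed margin. A real controller would also have to steer the cameras, which this sketch leaves out.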

Notes

  1. http://www.youtube.com/watch?v=xHM4PDfFTLE

Corresponding author

Correspondence to Prabhu Natarajan.

Copyright information

© 2013 Springer Science+Business Media, LLC

About this paper

Cite this paper

Wang, Y., Natarajan, P., Kankanhalli, M. (2013). Multi-camera Skype: Enhancing the Quality of Experience of Video Conferencing. In: The Era of Interactive Media. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3501-3_20

  • DOI: https://doi.org/10.1007/978-1-4614-3501-3_20

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-3500-6

  • Online ISBN: 978-1-4614-3501-3

  • eBook Packages: Computer Science (R0)