Abstract
We propose a novel approach to real-time control, selection, and transmission of the best view of human faces in Skype video conferencing. Our goal is to improve the Quality-of-Experience (QoE) of current video conferencing services by incorporating a real-time multi-camera control and selection mechanism. Traditional 3D viewpoint selection algorithms rely on complex 3D-model computation and are therefore not suitable for real-time applications. We define a new image-based metric, Viewpoint Saliency (VS), for evaluating the quality of views of a human subject, and a centralized multi-camera control mechanism to track the subject and select the best view.
© 2013 Springer Science+Business Media, LLC
About this paper
Cite this paper
Wang, Y., Natarajan, P., Kankanhalli, M. (2013). Multi-camera Skype: Enhancing the Quality of Experience of Video Conferencing. In: The Era of Interactive Media. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3501-3_20
DOI: https://doi.org/10.1007/978-1-4614-3501-3_20
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-3500-6
Online ISBN: 978-1-4614-3501-3
eBook Packages: Computer Science, Computer Science (R0)