Estimation of viewers’ ratings of TV programs based on behaviors in home environments

Takahashi, Masaki; Clippingdale, Simon; Naemura, Masahide; Shibata, Masahiro

doi:10.1007/s11042-014-2352-0

Published: 12 November 2014

Volume 74, pages 8669–8684, (2015)

Multimedia Tools and Applications

Abstract

A system is described that can estimate a viewer’s ratings of TV programs on the basis of his/her behaviors in a home environment. A Kinect sensor, a motion-sensing device developed by Microsoft for its Xbox game console, is used to measure various behavioral parameters. The system first detects whether a viewer is present by extracting keypoint trajectories in video sequences captured by the sensor’s video camera. It then identifies whether the viewer is gazing at the TV screen or not by extracting head pose information. The extraction is carried out using two modules: a color-image-based module and a color- and depth-image-based module. The two modules share their parameters and complement each other’s characteristics. The proposed system was evaluated by having 30 participants individually spend about 2 h watching 15 TV programs in a simulated home environment, capturing video images of their behaviors, and having them rate each program on a five-point scale. Comparison of the system’s estimated ratings with the actual viewer ratings demonstrated that the system can robustly estimate a viewer’s ratings of TV programs in a home environment.
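
The two-stage flow described in the abstract (presence detection, then a gaze decision from head pose) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the `Frame` structure, the motion and angle thresholds, and in particular the linear mapping from gaze ratio to a five-point rating, since the abstract does not state how behavioral measurements are converted into a rating.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional, Tuple


@dataclass
class Frame:
    """Hypothetical per-frame measurements (not the paper's data format)."""
    # Displacement of each tracked keypoint since the previous frame, in pixels.
    keypoint_moves: List[Tuple[float, float]]
    # Head pose angles in degrees, or None when no head pose was extracted.
    yaw: Optional[float]
    pitch: Optional[float]


def viewer_present(frame: Frame, min_move: float = 1.0, min_count: int = 3) -> bool:
    """Assume a viewer is present if enough keypoints moved noticeably."""
    moving = sum(1 for dx, dy in frame.keypoint_moves if hypot(dx, dy) >= min_move)
    return moving >= min_count


def gazing_at_screen(frame: Frame, max_yaw: float = 20.0, max_pitch: float = 15.0) -> bool:
    """Assume the viewer looks at the screen if the head is roughly frontal."""
    if frame.yaw is None or frame.pitch is None:
        return False
    return abs(frame.yaw) <= max_yaw and abs(frame.pitch) <= max_pitch


def gaze_ratio(frames: List[Frame]) -> float:
    """Fraction of viewer-present frames in which the viewer gazes at the screen."""
    present = [f for f in frames if viewer_present(f)]
    if not present:
        return 0.0
    return sum(gazing_at_screen(f) for f in present) / len(present)


def ratio_to_rating(ratio: float) -> int:
    """Illustrative linear map from a ratio in [0, 1] to a 1..5 rating (assumption)."""
    return 1 + round(ratio * 4)
```

In this sketch, frames without a detected viewer are excluded before the gaze ratio is computed, so an empty room does not count as inattention. A real system would need thresholds tuned to the sensor placement and a rating model fitted to viewer data.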

Acknowledgments

Part of this work was supported by the Strategic Information and Communications R&D Promotion Programme (SCOPE) of the Ministry of Internal Affairs and Communications of Japan.

Author information

Authors and Affiliations

Japan Broadcasting Corporation (NHK) Science and Technology Research Laboratories, 1-10-11, Kinuta, Setagaya-ku, Tokyo, Japan
Masaki Takahashi, Simon Clippingdale, Masahide Naemura & Masahiro Shibata

Corresponding author

Correspondence to Masaki Takahashi.

About this article

Cite this article

Takahashi, M., Clippingdale, S., Naemura, M. et al. Estimation of viewers’ ratings of TV programs based on behaviors in home environments. Multimed Tools Appl 74, 8669–8684 (2015). https://doi.org/10.1007/s11042-014-2352-0

Received: 15 May 2014
Revised: 20 September 2014
Accepted: 03 November 2014
Published: 12 November 2014
Issue Date: October 2015
DOI: https://doi.org/10.1007/s11042-014-2352-0
