Abstract
Mobile eye-tracking systems have been available for about a decade now and are becoming increasingly popular in different fields of application, including marketing, sociology, usability studies and linguistics. While the user-friendliness and ergonomics of the hardware are developing at a rapid pace, the software for the analysis of mobile eye-tracking data in some points still lacks robustness and functionality. With this paper, we investigate which state-of-the-art computer vision algorithms may be used to automate the post-analysis of mobile eye-tracking data. For the case study in this paper, we focus on mobile eye-tracker recordings made during human-human face-to-face interactions. We compared two recent publicly available frameworks (YOLOv2 and OpenPose) to relate the gaze location generated by the eye-tracker to the head and hands visible in the scene camera data. In this paper we will show that the use of this single pipeline framework provides robust results, which are both more accurate and faster than previous work in the field. Moreover, our approach does not rely on manual interventions during this process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Brône G, Oben B, Goedemé T (2011) Towards a more effective method for analysing mobile eye-tracking data: integrating gaze data with object recognition algorithms. In: Proceedings of the PETMEI. ACM, pp 53–56
Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2d pose estimation using part affinity fields. In: CVPR. (2017)
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. IJCV 60(2):91–110
Bay H, Tuytelaars T, Van Gool L (2006) Surf: speeded up robust features. ECCV 2006:404–417
De Beugher S, Brône G, Goedemé T (2014) Automatic analysis of in-the-wild mobile eye-tracking experiments using object, face and person detection. In: VISAPP, vol 1. IEEE, pp 625–633
Felzenszwalb PF, Girshick RB, McAllester D (2010) Cascade object detection with deformable part models. In: CVPR. IEEE, pp 2241–2248
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: CVPR, vol 1. IEEE, pp I–I
Mittal A, Zisserman A, Torr PH (2011) Hand detection using multiple proposals. In: BMVC, pp 1–11
De Beugher S, Brône G, Goedemé T (2015) Semi-automatic hand detection: a case study on real life mobile eye-tracker data. In: Proceedings VISAPP 2015, vol 2. SciTePress, pp 121–129
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition, 2009. CVPR 2009. IEEE, pp 248–255
Oertel C, Wlodarczak, M., Edlund J, Wagner P, Gustafson J (2012) Gaze patterns in turn-taking. In: Thirteenth annual conference of the international speech communication association
Brône G, Oben B, Jehoul A, Vranjes J, Feyaerts K (2017) Eye gaze and viewpoint in multimodal interaction management. Cogn Linguist 28(3):449–483
Redmon J, Farhadi A Yolo9000: better, faster, stronger
Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2009) The PASCAL visual object classes challenge 2009 (VOC2009) results
Wei SE, Ramakrishna V, Kanade T, Sheikh Y (2016) Convolutional pose machines. In: CVPR
Simon T, Joo H, Matthews I, Sheikh Y (2017) Hand keypoint detection in single images using multiview bootstrapping. In: CVPR
De Beugher S (2016) Computer vision techniques for automatic analysis of mobile eye-tracking data. PhD thesis, KU Leuven, Belgium
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE computer society conference on computer vision and pattern recognition, 2005. CVPR 2005, vol 1. IEEE, pp 886–893
Yang B, Yan J, Lei Z, Li SZ (2014) Aggregate channel features for multi-view face detection. In: IJCB 2014. IEEE, pp 1–8
Buehler P, Everingham M, Huttenlocher DP, Zisserman A (2008) Long term arm and hand tracking for continuous sign language tv broadcasts. In: BMVC, pp 1105–1114
Yang Y, Ramanan D (2011) Articulated pose estimation with flexible mixtures-of-parts. In: CVPR. IEEE, pp 1385–1392
Neuendorf KA (2016) The content analysis guidebook. Sage
Scott WA (1955) Reliability of content analysis: the case of nominal scale coding. Public opinion quarterly, pp 321–325
Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Measur 20(1):37–46
Krippendorff K (2012) Content analysis: an introduction to its methodology. SAGE
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Callemein, T., Van Beeck, K., Brône, G., Goedemé, T. (2019). Automated Analysis of Eye-Tracker-Based Human-Human Interaction Studies. In: Kim, K., Baek, N. (eds) Information Science and Applications 2018. ICISA 2018. Lecture Notes in Electrical Engineering, vol 514. Springer, Singapore. https://doi.org/10.1007/978-981-13-1056-0_50
Download citation
DOI: https://doi.org/10.1007/978-981-13-1056-0_50
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1055-3
Online ISBN: 978-981-13-1056-0
eBook Packages: EngineeringEngineering (R0)