Abstract
In this paper, we propose an automatic multimodal approach for inferring interaction occasions (spontaneous or reactive) in multiparty meetings. A variety of features such as head gesture, attention from others, attention towards others, speech tone, speaking time, and lexical cue are integrated. A support vector machines classifier is used to classify interaction occasions based on these features. Our experimental results verified that the proposed approach was really effective, which successfully inferred the occasions of human interactions with a recognition rate by number of 0.870, an accuracy by time of 0.773, and a class average accuracy of 0.812. We also found that the reactive interactions are easier to recognize than the spontaneous ones, and the lexical cue is very important in detecting spontaneous interactions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yu ZW, Yu ZY, Aoyama H, Ozeki M, Nakamura Y (2010) Capture, recognition, and visualization of human semantic interactions in meetings. The 8th IEEE international conference on pervasive computing and communications (PerCom 2010), pp 107–115
Yu ZW, Yu ZY, Ko Y, Zhou X, Nakamura Y (2009) Inferring human interactions in meetings: a multimodal approach. The 6th international conference on ubiquitous intelligence and computing (UIC 2009), pp 14–24
Yu ZW, Yu ZY, Zhou X, Becker C, Nakamura Y (2011) Tree-based mining for discovering patterns of human interaction in meetings. IEEE Trans Knowl Data Eng
Vapnik VN (1995) The nature of statistical learning theory. Springer, Heidelberg
Yu Z, Nakamura Y (2010) Smart meeting systems: a survey of state-of-the-art and open issues. ACM Comput Surv 42(2):1–20
Germesin S, Wilson T (2009) Agreement detection in multiparty conversation. In: Proceedings of ICMI, pp 7–14
Hillard D, Ostendorf M, Shriberg, E (2003) Detection of agreement vs. disagreement in meetings: training with unlabeled data. In: Proceedings of HLT-NAACL, pp 34–36
Kumano S, Otsuka K, Mikami D, Yamato J (2009) Recognizing communicative facial expressions for discovering interpersonal emotions in group meetings. In: Proceedings of ICMI, pp 99–106
Jaimes A, Nagamine T, Liu J, Omura K, Sebe N (2005) Affective meeting video analysis. In: IEEE international Conference on multimedia and expo, pp 1412–1415
Wrede B, Shriberg E (2003) Spotting hotspots in meetings: human judgments and prosodic cues. In: Proceedings of Eurospeech, pp 2805–2808
Gatica-Perez D, McCowan I, Zhang D, Bengio S (2005) Detecting group interest-level in meetings. In: Proceedings Of IEEE ICASSP, pp 489–492
Yu ZW, Yu ZY, Ko Y, Zhou X, Nakamura Y (2010) Multimodal sensing, recognizing and browsing group social dynamics. Pers Ubiquit Comput 14(8):695–702
Tomobe H, Nagao, K (2006) Discussion ontology: knowledge discovery from human activities in meetings. In: Proceedings of JSAI, pp 33–41
Rienks R, Zhang D, Gatica-Perez D, Post W (2006) Detection and application of influence rankings in small group meetings. In: Proceedings of ICMI, pp 257–264
Kim T, Chang A, Holland L, Pentland A (2008) Meeting mediator: enhancing group collaboration using sociometric feedback. In: Proceedings of CSCW, pp 457–466
Banerjee S, Rudnicky AI (2004) Using simple speech based features to detect the state of a meeting and the roles of the meeting participants. In: Proceedings of the 8th international conference on spoken language processing (Interspeech)
Zancanaro M, Lepri B, Pianesi F (2006) Automatic detection of group functional roles in face to face interactions. In: Proceedings of ICMI, pp 28–34
Garg NP, Favre, S, Salamin H, Tur DH, Vinciarelli A (2008) Role recognition for meeting participants: an approach based on lexical information and social network analysis. In: Proceedings of ACM Multimedia, pp 693–696
Julius Speech Recognition Engine. http://julius.sourceforge.jp/en/
Rabiner L (1989) A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc IEEE 77(2):257–286
Hsu CW, Chang CC, Lin CJ (2005) A practical guide to support vector classification. Technical report
Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines, Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Kasteren T, Noulas A, Englebienne G, Kröse B (2008) Accurate activity recognition in a home setting. In: UbiComp, pp 1–9
Acknowledgments
This work was partially supported by the National Natural Science Foundation of China (No. 60903125), the Program for New Century Excellent Talents in University (No. NCET-09-0079), and the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2010JM8033).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media B.V.
About this paper
Cite this paper
Yu, Z., Zhou, X. (2011). Automatic Inference of Interaction Occasion in Multiparty Meetings: Spontaneous or Reactive. In: Park, J., Jin, H., Liao, X., Zheng, R. (eds) Proceedings of the International Conference on Human-centric Computing 2011 and Embedded and Multimedia Computing 2011. Lecture Notes in Electrical Engineering, vol 102. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2105-0_14
Download citation
DOI: https://doi.org/10.1007/978-94-007-2105-0_14
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2104-3
Online ISBN: 978-94-007-2105-0
eBook Packages: EngineeringEngineering (R0)