A Verbal Interaction Measure Using Acoustic Signal Correlation for Dyadic Cooperation Support

  • Alexander Neumann
  • Thomas Hermann
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 219)


We introduce a method for detecting whether two users are engaged in focused interaction using a windowed correlation measure on their acoustic signals, assuming that a continued exchange of verbal turns contributes to anticorrelation of acoustic activity. We tested our method with manually annotated transitions between focused and unfocused interaction stemming from experiments on AR-based cooperation within a research project on alignment in communication. The results show that a high degree and extended duration of speech activity anticorrelation reliably indicates focused interaction, and might thus be a valuable asset for situation-aware technical systems.
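The windowed correlation idea can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each speaker's acoustic signal has already been reduced to a per-frame speech-activity sequence (for example 1 for speaking, 0 for silent). The function names, window length, hop size, threshold, and minimum run length are all hypothetical choices made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    vx = sum((xi - mx) ** 2 for xi in x)
    vy = sum((yi - my) ** 2 for yi in y)
    if vx == 0 or vy == 0:
        # No variation in the window (e.g. both silent): correlation is undefined,
        # so this sketch reports 0.0 by assumption.
        return 0.0
    return cov / sqrt(vx * vy)


def windowed_correlation(a, b, window, hop):
    """Correlation of two activity sequences over sliding windows."""
    n = min(len(a), len(b))
    return [
        pearson(a[start:start + window], b[start:start + window])
        for start in range(0, n - window + 1, hop)
    ]


def is_focused(corr, threshold=-0.5, min_run=3):
    """True if at least `min_run` consecutive windows fall below `threshold`.

    Both parameters are illustrative assumptions, not values from the paper.
    """
    run = 0
    for c in corr:
        run = run + 1 if c < threshold else 0
        if run >= min_run:
            return True
    return False


# Idealised turn-taking: speaker B talks exactly when speaker A is silent.
speaker_a = [1, 1, 0, 0] * 10
speaker_b = [1 - v for v in speaker_a]
corr = windowed_correlation(speaker_a, speaker_b, window=8, hop=4)
print(is_focused(corr))
```

In this idealised case every window is perfectly anticorrelated, so the sketch flags the stretch as focused interaction. Real recordings would contain overlaps, pauses, and noise, which is why both the strength and the duration of anticorrelation are considered.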


situation awareness, collaboration, speech activity, data mining, multiscale analysis, correlation





Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  1. Bielefeld University, Bielefeld, Germany
