Incompleteness and Fragmentation: Possible Formal Cues to Cognitive Processes Behind Spoken Utterances

Hunyadi, L.; Kiss, H.; Szekrenyes, I.

doi:10.1007/978-3-319-21209-8_14

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 42)

Abstract

What may eventually connect engineers and linguists most is their common interest in language, and more specifically in language technology: engineers build increasingly intelligent robots that should, ideally, communicate with humans through language, while linguists wish to verify their theoretical understanding of language and speech through practical implementations. Robotics is thus a place for the two to meet. However, speech, especially in spontaneous communication, often seems to resist the usual generalizations: the sounds you hear are not the sounds you describe in a laboratory, the words you read in a written text may be hard to identify by speech segmentation, and the sequences of words that make up a sentence are often too fragmented to be considered a “real” sentence from a grammar book. Yet humans communicate, and most often they do so successfully. Typically this is achieved through cognition: people do not merely use words, they use them in context. They use words in a semantic context and, by combining voice and gestures, in a dynamically changing, multimodal situational context. Each individual does not simply pick out words from the flow of a verbal interaction, but also observes and reacts to others, using multimodal cues as points of reference and inference in navigating the communication. It is reasonable to believe that participants in a multimodal communication event follow a set of general, partly innate rules based on a general model of communication. The model presented below interprets numerous forms of dialogue by uncovering their syntax, prosody and overall multimodality within the HuComTech corpus of Hungarian. The research aims at improving the robustness of the spoken form of natural language technology.

Notes

1. http://bach.arts.kuleuven.be/pmertens/prosogram/.
2. The analysis did not specifically consider clauses of such complex syntactic relations as subordination, coordination and embedding. Therefore it is left open whether the cognitive processing of these types is reflected by pause durations corresponding to their complexity.
3. The HuComTech corpus is available from two locations: https://corpus1.mpi.nl/ds/imdi_browser?openpath=MPI1761660%23 (Nijmegen) and https://clarin.nytud.hu/ds/imdi_browser?openpath=MPI13%23 (Budapest).

Acknowledgments

The research presented in this chapter was partly supported by project TÁMOP 4.2.2-C/11/1/KONV-2012-0002. Further support was received from NeDiMAH (Network for Digital Methods in the Arts and Humanities), a cross-European project of the European Science Foundation.

Author information

Authors and Affiliations

University of Debrecen, Egyetem Tér 1, Debrecen, 4012, Hungary
L. Hunyadi, H. Kiss & I. Szekrenyes

Corresponding author

Correspondence to L. Hunyadi.

Editor information

Editors and Affiliations

School of Engineering, KES Centre, University of South Australia, Adelaide, South Australia, Australia
Jeffrey W. Tweedale
FCT Campus, UNINOVA, Caparica, Portugal
Rui Neves-Silva
School of Engineering, University of South Australia, Adelaide, South Australia, Australia
Lakhmi C. Jain
Sellinger School of Business and Management, Loyola University Maryland, Baltimore, Maryland, USA
Gloria Phillips-Wren
Graduate School of IPS, Waseda University, Fukuoka, Japan
Junzo Watada
KES International, Shoreham-by-Sea, United Kingdom
Robert J. Howlett

About this chapter

Cite this chapter

Hunyadi, L., Kiss, H., Szekrenyes, I. (2016). Incompleteness and Fragmentation: Possible Formal Cues to Cognitive Processes Behind Spoken Utterances. In: Tweedale, J., Neves-Silva, R., Jain, L., Phillips-Wren, G., Watada, J., Howlett, R. (eds) Intelligent Decision Technology Support in Practice. Smart Innovation, Systems and Technologies, vol 42. Springer, Cham. https://doi.org/10.1007/978-3-319-21209-8_14

DOI: https://doi.org/10.1007/978-3-319-21209-8_14
Published: 23 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21208-1
Online ISBN: 978-3-319-21209-8
eBook Packages: Engineering, Engineering (R0)
