Abstract
Listeners face multiple challenges in mapping prosody onto intentions: The relevant intentions vary with the general context of an utterance (e.g., the speaker’s goals) and how prosodic contours are realized varies across speakers, accents, and speech conditions. We propose that listeners map acoustic information onto prosodic representations using (rational) probabilistic inference, in the form of generative models, which are updated on the fly based on the match between predictions and the input. We review some ongoing work, motivated by this framework, focusing on the “It looks like an X” construction, which, depending on the pitch contour and context, can be interpreted as “It looks like an X and it is” or “It looks like an X and it isn’t.” We use this construction to investigate the hypothesis that pragmatic processing shows the pattern of adaptation effects that is expected if the mapping of speech onto intentions involves rational inference.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Beckman, M. E., & Ayers, G. M. (1994). Guidelines for ToBI labeling. http://www.ling.ohio-state.edu/research/phonetics/E ToBI. Accessed 23 Sept 2008.
Beckman, M. E., & Hirschberg, J. (1994). The ToBI annotiation conventions. http://www.ling.ohio-state.edu/˜tobi/ame tobi/annotation conventions.html. Accessed 23 Sept 2008.
Bibyk, S., Kurumada, C., & Tanenhaus, M. K. (n.d.). Context constraints on intonation interpretation (in preparation).
Boersma, P., & Weenink, D. (2008). Praat: Doing phonetics by computer (version 5.0.26) [computer program]. http://www.praat.org/. Accessed 16 June 2008.
Brown, M., Dilley, L. C., & Tanenhaus, M. K. (2015). Real-time expectations based on context speech rate can cause words to appear or disappear. In Proceedings of the 34th Annual Conference of the Cognitive Science Society (pp. 1374–1379).
Brown, M., Salverda, A. P., Gunlogson, C., & Tanenhaus, M. K. (2015). Interpreting prosodic cues in discourse context. Language, Cognition and Neuroscience, 30, 149–166.
Chang, F., Dell, G. S., & Bock, K. (2006). Becoming syntactic. Psychological Review, 113(2), 234–272.
Clark, H. H. (1992). Arenas of language use. Chicago: University of Chicago Press.
Clayards, M., Tanenhaus, M. K., Aslin, R. N., & Jacobs, R. A. (2008). Perception of speech reflects optimal use of probabilistic speech cues. Cognition, 108(3), 804–809.
Connine, C. M., & Clifton, C., Jr. (1987). Interactive use of lexical information in speech perception. Journal of Experimental Psychology: Human perception and performance, 13(2), 291.
Cooper, R. M. (1974). Control of eye fixation by meaning of spoken language: New methodology for real-time investigation of speech perception, memory, and language processing. Cognitive Psychology, 6, 84–107.
Degen, J. (2013). Alternatives in pragmatic reasoning. PhD dissertation, University of Rochester.
Dell, G., & Chang, F. (2013). The P-chain: Relating sentence production and its disorders to comprehension and acquisition. Philosophical Transactions of the Royal Society B, 369.
Dennison, H. Y., & Schafer, A. (2010). Online construction of implicature through contrastive prosody. Proceedings of 5th Speech Prosody Conference.
Elman, J. L., & McClelland, J. L. (1988). Cognitive penetration of the mechanisms of perception: Compensation for coarticulation of lexically restored phonemes. Journal of Memory and Language, 27, 143–165.
Farmer, T., Brown, M., & Tanenhaus, M. (2013). Prediction, explanation, and the role of generative models in language processing. Behavioral and Brain Sciences, 36, 211–212.
Fernald, A., & Kuhl, P. (1987). Acoustic determinants of infant preference for motherese speech. Infant Behavior and Development, 10(3), 279–293.
Fine, A. B., & Jaeger, T. F. (2013). Evidence for implicit learning in syntactic comprehension. Cognitive Science, 37(3), 578–591.
Fine, A. B., Jaeger, T. F., Farmer, T., & Qian, T. (2013). Rapid expectation adaptation during syntactic comprehension. PLoS ONE, 8(10), e77661. doi:10.1371/journal.pone.0077661.
Ganong, W. F. (1990). Phonetic categorization in auditory word perception. Journal of Experimental Psychology: Human Perception and Performance, 6, 110–125.
Grice, H. P. (1975). Logic and conversation. Syntax and Semantics, 3, 41–58.
Hanna, J., Tanenhaus, M. K., & Trueswell, J. C. (2003). The effects of common ground and perspective on domains of referential interpretation. Journal of Memory and Language, 49(1), 43–61.
Hansen, M. B., & Markman, E. M. (2005). Appearance questions can be misleading: Adiscourse-based account of the appearance-reality problem. Cognitive Psychology, 50(3), 233–263.
Heller, D., Grodner, D., & Tanenhaus, M. K. (2008). The role of perspective in identifying domains of reference. Cognition, 108(3), 831–836.
Isaacs, A., & Watson, D. (2010). Accent detection is a slippery slope: Direction and rate of f0 change drives listeners comprehension. Language Cognitive Processes, 25(7), 1178–1200.
Ito, K., & Speer, S. R. (2008). Anticipatory effects of intonation: Eye movements during instructed visual search. Journal of Memory and Language, 58, 541–573.
Jackendoff, R. (1972). Semantics in generative grammar. Cambridge: MIT Press.
Jaeger, T. F., & Snider, N. (2013). Alignment as a consequence of expectation adaptation: Syntactic priming is affected by the prime’s prediction error given both prior and recent experience. Cognition, 127(1), 57–83.
Kleinschmidt, D., & Jaeger, T. F. (2011). A Bayesian belief updating model of phonetic recalibration and selective adaptation. In ACL workshop on cognitive modeling and computational linguistics. Portland.
Kleinschmidt, D., & Jaeger, T. F. (2015). Robust speech perception: Recognizing the familiar, generalizing to the similar, and adapting to the novel. Psychological Review, 122(2), 148–203.
Kleinschmidt, D., Fine, A., & Jaeger, T. (2012). A belief-updating model of adaptation and cue combination in syntactic comprehension. Proceedings of the 34th annual conference of the cognitive science society.
Kraljic, T., & Samuel, A. G. (2006). Generalization in perceptual learning for speech. Psychonomic Bulletin and Review, 13(2), 262–268.
Kurumada, C. (2013). Navigating variability in the linguistic signal: Learning to interpret contrastive prosody. PhD dissertation, Stanford University.
Kurumada, C., Brown, M., & Tanenhaus, M. K. (2012). Prosody and pragmatic inference: It looks like speech adaptation. Proceedings of the 34th Annual Conference of the Cognitive Science Society.
Kurumada, C., Brown, M., Bibyk, S., Pontillo, D., & Tanenhaus, M. K. (2013). Incremental processing in the pragmatic interpretation of contrastive prosody. Proceedings of the 35th Annual Meeting of the Cognitive Science Society.
Kurumada, C., Brown, M., Bibyk, S., Pontillo, D., & Tanenhaus, M. K. (2014a). Is it or isn’t it: Listeners make rapid use of prosody to infer speaker meanings. Cognition, 133, 335–342.
Kurumada, C., Brown, M., Bibyk, S., Pontillo, D., & Tanenhaus, M. K. (2014b). Rapid adaptation in online pragmatic interpretation of contrastive prosody. Proceedings of the 36th Annual Meeting of the Cognitive Science Society.
Kurumada, C., Brown, M., & Tanenhaus, M. K. (n.d.). Probabilistic inferences in pragmatic interpretation of English contrastive prosody (submitted).
Ladd, D. R. (2008). Intonational phonology (2nd ed.). Cambridge: Cambridge University Press.
McClelland, J. L., & Elman, J. L. (1986). The trace model of speech perception. Cognitive Psychology, 18(1), 1–86.
McMurray, B., & Jongman, A. (2011). What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations. Psychological Review, 118(2), 218–246.
Miller, J. L., Green, K., & Schermer, T. (1984). A distinction between the effects of sentential speaking rate and semantic congruity on word identification. Perception & Psychophysics, 36, 329–337.
Moulines, E., & Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9(5–6), 453–467.
Norris, D., McQueen, J. M., & Cutler, A. (2003). Perceptual learning in speech. Cognitive Psychology, 47(2), 204–238.
Pierrehumbert, J. & Hirschberg, J. (1990). The meaning of intonational contours in the interpretation of discourse. In P. Cohen, J. Morgan, & M. Pollack (Eds.), Intentions and plans in communication and discourse (pp. 271–311). Cambridge: MIT Press.
Sedivy, J. C., Tanenhaus, M. K., Chambers, C. G., & Carlson, G. N. (1999). Achieving incremental semantic interpretation through contextual representation. Cognition, 71, 109–147.
Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., et al. (1992). ToBI: A standard for labeling English prosody. In International conference on spoken language processing (Vol. 2, pp. 867–870). Banff.
Tanenhaus, M. K., Spivey-Knowlton, M., Eberhard, K., & Sedivy, J. (1995). Integration of visual and linguistic information in spoken language comprehension. Science, 268, 1632–1634.
Ward, G., & Hirschberg, J. (1985). Implicating uncertainty: The pragmatics of fall-rise intonation. Language, 61, 747–776.
Watson, D., Gunlogson, C., & Tanenhaus, M. (2008). Interpreting pitch accents in on-line comprehension: H* vs L + H*. Cognitive Science, 32, 1232–1244.
Weber, A., Braun, B., & Crocker, M. W. (2006). Finding referents in time: Eye-tracking evidence for the role of contrastive accents. Language and Speech, 49, 367–392.
Yildirim, I., Degen, J., Tanenhaus, M. K. & Jaeger, T. F. (2013). Linguistic variability and adaptation in quantifier meanings. In Proceedings of the Thirty-Fifth Annual Conference of the Cognitive Science Society.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Tanenhaus, M., Kurumada, C., Brown, M. (2015). Prosody and Intention Recognition. In: Frazier, L., Gibson, E. (eds) Explicit and Implicit Prosody in Sentence Processing. Studies in Theoretical Psycholinguistics, vol 46. Springer, Cham. https://doi.org/10.1007/978-3-319-12961-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-12961-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12960-0
Online ISBN: 978-3-319-12961-7
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)