Audiovisual speech gating: examining information and information processing

de la Vaux, Steven K.; Massaro, Dominic W.

doi:10.1007/s10339-004-0014-2

Research Report
Published: 23 April 2004

Cognitive Processing, Volume 5, pages 106–112 (2004)

Steven K. de la Vaux¹ &
Dominic W. Massaro¹

Abstract

We investigated the perceptual processing of facial information and vocal information to test the nature of the multisensory integration process. Single-syllable words were presented in background noise at one of eight stimulus durations ranging from 45 to 80% of the total word duration. Performance was evaluated relative to a fuzzy logical model of perception (FLMP) and an additive model (ADD). These two models differ in the nature of the integration process: optimal multiplicative integration in the FLMP and additive in the ADD. The FLMP provided a significantly better description of the word identifications relative to the ADD. Consistent with the outcomes of several other tasks in audiovisual speech perception, the FLMP also accurately describes the continuous uptake of information during word perception.
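
The contrast between the two integration rules can be illustrated with a minimal sketch. It assumes the commonly cited form of the FLMP, in which the auditory and visual support values for each response alternative are multiplied and then normalized by a relative goodness rule, and a simple additive counterpart in which the support values are summed before normalization. The exact parameterization fitted in the article is not stated in this abstract, so the function names, the normalization of the additive model, and the example support values below are illustrative assumptions only.

```python
from typing import List, Sequence


def _normalize(scores: Sequence[float]) -> List[float]:
    """Relative goodness rule: divide each score by the sum of all scores."""
    total = sum(scores)
    if total <= 0:
        raise ValueError("at least one alternative needs positive support")
    return [s / total for s in scores]


def flmp(auditory: Sequence[float], visual: Sequence[float]) -> List[float]:
    """Multiplicative integration (assumed FLMP form).

    auditory[i] and visual[i] are support values in [0, 1] for alternative i.
    """
    return _normalize([a * v for a, v in zip(auditory, visual)])


def add(auditory: Sequence[float], visual: Sequence[float]) -> List[float]:
    """Additive integration (assumed ADD form): sum the supports, then normalize."""
    return _normalize([a + v for a, v in zip(auditory, visual)])


# Hypothetical support values for two response alternatives.
auditory_support = [0.6, 0.4]   # weakly favors alternative 0
visual_support = [0.8, 0.2]     # strongly favors alternative 0

flmp_probs = flmp(auditory_support, visual_support)
add_probs = add(auditory_support, visual_support)

print([round(p, 3) for p in flmp_probs])  # [0.857, 0.143]
print([round(p, 3) for p in add_probs])   # [0.7, 0.3]
```

With these made-up values, the multiplicative rule yields a more extreme prediction than either source alone when both sources agree, whereas the additive rule yields an average-like prediction. This difference is what allows the two models to be distinguished by their fit to identification data.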

Acknowledgements

This work was supported in part by grants from the National Science Foundation (NSF CHALLENGE Grant CDA-9726363 and NSF Grant BCS-9905176), a grant from the Public Health Service (Grant PHS R01 DC00236), cooperative grants from the Intel Corporation and the University of California Digital Media Program (D97-04), and grants from the University of California, Santa Cruz. The authors thank Alexandra Jesse for a critical reading of the manuscript.

Author information

Authors and Affiliations

Department of Psychology, University of California, Santa Cruz, CA, 95064, USA
Steven K. de la Vaux & Dominic W. Massaro

Corresponding author

Correspondence to Dominic W. Massaro.

Additional information

Edited by: Marie-Hélène Giard and Mark Wallace

About this article

Cite this article

de la Vaux, S.K., Massaro, D.W. Audiovisual speech gating: examining information and information processing. Cogn Process 5, 106–112 (2004). https://doi.org/10.1007/s10339-004-0014-2

Received: 03 November 2003
Revised: 07 February 2004
Accepted: 17 February 2004
Published: 23 April 2004
Issue Date: June 2004
DOI: https://doi.org/10.1007/s10339-004-0014-2
