Processing of Short Auditory Stimuli: The Rapid Audio Sequential Presentation Paradigm (RASP)

Suied, Clara; Agus, Trevor R.; Thorpe, Simon J.; Pressnitzer, Daniel

doi:10.1007/978-1-4614-1590-9_49

Clara Suied^6,7,8,
Trevor R. Agus^6,7,
Simon J. Thorpe⁹ &
…
Daniel Pressnitzer^6,7

Part of the book series: Advances in Experimental Medicine and Biology ((volume 787))

4288 Accesses
5 Citations

Abstract

Human listeners seem to be remarkably able to recognise acoustic sound sources based on timbre cues. Here we describe a psychophysical paradigm to estimate the time it takes to recognise a set of complex sounds differing only in timbre cues: both in terms of the minimum duration of the sounds and the inferred neural processing time. Listeners had to respond to the human voice while ignoring a set of distractors. All sounds were recorded from natural sources over the same pitch range and equalised to the same duration and power. In a first experiment, stimuli were gated in time with a raised-cosine window of variable duration and random onset time. A voice/non-voice (yes/no) task was used. Performance, as measured by d′, remained above chance for the shortest sounds tested (2 ms); d′s above 1 were observed for durations longer than or equal to 8 ms. Then, we constructed sequences of short sounds presented in rapid succession. Listeners were asked to report the presence of a single voice token that could occur at a random position within the sequence. This method is analogous to the “rapid sequential visual presentation” paradigm (RSVP), which has been used to evaluate neural processing time for images. For 500-ms sequences made of 32-ms and 16-ms sounds, d′ remained above chance for presentation rates of up to 30 sounds per second. There was no effect of the pitch relation between successive sounds: identical for all sounds in the sequence or random for each sound. This implies that the task was not determined by streaming or forward masking, as both phenomena would predict better performance for the random pitch condition. Overall, the recognition of familiar sound categories such as the voice seems to be surprisingly fast, both in terms of the acoustic duration required and of the underlying neural time constants.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agus TA, Suied C, Thorpe SJ, Pressnitzer D (2012) Fast recognition of musical sounds based on timbre. J Acoust Soc Am 131:4124–4133
Article PubMed Google Scholar
Belin P, Zatorre RJ, Lafaille P, Ahad P, Pike B (2000) Voice-selective areas in human auditory cortex. Nature 403:309–312
Article PubMed CAS Google Scholar
Cusack R, Carlyon RP (2003) Perceptual asymmetries in audition. J Exp Psychol Hum Percept Perform 29:713–725
Article PubMed Google Scholar
Drullman R, Festen JM, Plomp R (1994) Effect of temporal envelope smearing on speech reception. J Acoust Soc Am 95:1053–1064
Article PubMed CAS Google Scholar
Duncan J, Martens S, Ward R (1997) Restricted attentional capacity within but not between sensory modalities. Nature 387:808–810
Article PubMed CAS Google Scholar
Goto M, Hashiguchi H, Nishimura T, Oka R (2003) RWC music database: music genre database and musical instrument sound database. In: 4th international conference on Music Information Retrieval, Baltimore, 2003
Google Scholar
Gray GW (1942) Phonemic microtomy: the minimum duration of perceptible speech sounds. Speech Monogr 9:75–90
Article Google Scholar
Keysers C, Xiao DK, Foldiak P, Perrett DI (2001) The speed of sight. J Cogn Neurosci 13:90–101
Article PubMed CAS Google Scholar
Macmillan NA, Creelman CD (2005) Detection theory: a user’s guide, 2nd edn. Lawrence Erlbaum Associates, Mahwah
Google Scholar
Patterson RD, Allerhand MH, Giguere C (1995) Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. J Acoust Soc Am 98:1890–1894
Article PubMed CAS Google Scholar
Pressnitzer D, Patterson RD, Krumbholz K (2001) The lower limit of melodic pitch. J Acoust Soc Am 109:2074–2084
Article PubMed CAS Google Scholar
Robinson K, Patterson RD (1995) The stimulus duration required to identify vowels, their octave, and their pitch chroma. J Acoust Soc Am 98:1858–1865
Article Google Scholar
Subramaniam S, Biederman I, Madigan S (2000) Accurate identification but no priming and chance recognition memory for pictures in RSVP sequences. Vis Cogn 7:511–535
Article Google Scholar
Woods DL, Alain C (1993) Feature processing during high-rate auditory selective attention. Percept Psychophys 53:391–402
Article PubMed CAS Google Scholar

Download references

Acknowledgements

This work was supported by the Fondation Pierre Gilles de Gennes pour la Recherche.

Author information

Authors and Affiliations

Département d’études cognitives, Equipe Audition, Ecole Normale Supérieure, Paris, France
Clara Suied, Trevor R. Agus & Daniel Pressnitzer
Laboratoire de Psychologie de la Perception (UMR CNRS 8158), Université Paris Descartes, Paris, France
Clara Suied, Trevor R. Agus & Daniel Pressnitzer
Département Action et Cognition en Situation Opérationnelle, Institut de Recherche Biomédicale des Armées, Brétigny-sur-Orge, France
Clara Suied
Centre de Recherche Cerveau et Cognition, Université Toulouse 3, CNRS. CHU Purpan, Pavillon Baudot, Toulouse, France
Simon J. Thorpe

Authors

Clara Suied
View author publications
You can also search for this author in PubMed Google Scholar
Trevor R. Agus
View author publications
You can also search for this author in PubMed Google Scholar
Simon J. Thorpe
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Pressnitzer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Clara Suied .

Editor information

Editors and Affiliations

Department of Experimental Psychology, University of Cambridge, Cambridge, United Kingdom
Brian C. J. Moore
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Roy D. Patterson
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Ian M. Winter
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Robert P. Carlyon
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Hedwig E Gockel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suied, C., Agus, T.R., Thorpe, S.J., Pressnitzer, D. (2013). Processing of Short Auditory Stimuli: The Rapid Audio Sequential Presentation Paradigm (RASP). In: Moore, B., Patterson, R., Winter, I., Carlyon, R., Gockel, H. (eds) Basic Aspects of Hearing. Advances in Experimental Medicine and Biology, vol 787. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1590-9_49

Download citation

DOI: https://doi.org/10.1007/978-1-4614-1590-9_49
Published: 16 April 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1589-3
Online ISBN: 978-1-4614-1590-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics