Temporal Coherence and the Streaming of Complex Sounds

Shamma, Shihab; Elhilali, Mounya; Ma, Ling; Micheyl, Christophe; Oxenham, Andrew J.; Pressnitzer, Daniel; Yin, Pingbo; Xu, Yanbo

doi:10.1007/978-1-4614-1590-9_59

Shihab Shamma⁶,
Mounya Elhilali⁸,
Ling Ma^7,6,
Christophe Micheyl⁹,
Andrew J. Oxenham⁹,
Daniel Pressnitzer^10,11,
Pingbo Yin⁶ &
…
Yanbo Xu⁶

Part of the book series: Advances in Experimental Medicine and Biology ((volume 787))

4490 Accesses
25 Citations

Abstract

Humans and other animals can attend to one of multiple sounds, and follow it selectively over time. The neural underpinnings of this perceptual feat remain mysterious. Some studies have concluded that sounds are heard as separate streams when they activate well-separated populations of central auditory neurons, and that this process is largely pre-attentive. Here, we propose instead that stream formation depends primarily on temporal coherence between responses that encode various features of a sound source. Furthermore, we postulate that only when attention is directed toward a particular feature (e.g., pitch or location) do all other temporally coherent features of that source (e.g., timbre and location) become bound together as a stream that is segregated from the incoherent features of other sources. Experimental neurophysiological evidence in support of this hypothesis will be presented. The focus, however, will be on a computational realization of this idea and a discussion of the insights learned from simulations to disentangle complex sound sources such as speech and music. The model consists of a representational stage of early and cortical auditory processing that creates a multidimensional depiction of various sound attributes such as pitch, location, and spectral resolution. The following stage computes a coherence matrix that summarizes the pair-wise correlations between all channels making up the cortical representation. Finally, the perceived segregated streams are extracted by decomposing the coherence matrix into its uncorrelated components. Questions raised by the model are discussed, especially on the role of attention in streaming and the search for further neural correlates of streaming percepts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bregman A (1990) Auditory scene analysis: the perceptual organization of sound. MIT Press, Cambridge
Google Scholar
Carlyon R, Cusack R, Foxton J, Robertson I (2001) Effects of attention and unilateral neglect on auditory stream segregation. J Exp Psychol Hum Percept Perform 27:115–127
Article PubMed CAS Google Scholar
Chi T, Ru P, Shamma S (2006) Multiresolution spectrotemporal analysis of complex sounds. J Acoust Soc Am 118:887–906
Article Google Scholar
deCharms R, Blake D, Merzenich M (1998) Optimizing sound features for cortical neurons. Science 280:1439–1443
Article PubMed CAS Google Scholar
Duda R, Hart P (1973) Pattern classification and scene analysis. John Wiley and Sons, New York
Google Scholar
Elhilali M, Shamma S (2008) A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation. J Acoust Soc Am 124:3751–3771
Article PubMed Google Scholar
Elhilali M, Ma L, Micheyl C, Oxenham A, Shamma S (2009) Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron 61:317–329
Article PubMed CAS Google Scholar
Fritz J, Shamma S, Elhiliali M (2007) Does attention play a role in dynamic receptive field adaptation to changing acoustic salience in A1? Hear Res 229:186–203
Article PubMed Google Scholar
Hartmann W, Johnson D (1991) Stream segregation and peripheral channeling. Music Percept 9:155–184
Article Google Scholar
Lyon R, Shamma S (1997) Computational strategies for pitch and timbre. In: Hawkins H, McMullen T, Popper A, Fay R (eds) Auditory computations. Springer, New York, pp 221–270
Google Scholar
Ma L (2011) Auditory streaming: behavior, physiology, and modeling. PhD Thesis, Bioengineering Program, University of Maryland, College Park
Google Scholar
Shamma S, Elhilali M, Micheyl C (2011) Temporal coherence and attention in auditory scene analysis. Trends Neurosci 34:114–123
Article PubMed CAS Google Scholar
Sussman E, Horvát J, Winkler I, Orr M (2007) The role of attention in the formation of auditory streams. Percept Psychophys 69:136–152
Article PubMed Google Scholar
Yin P, Ma L, Elhilali M, Fritz J, Shamma S (2006) Primary auditory cortical responses while attending to different streams. In: Kollmeier B, Klump K, Hohmann V, Langemann U, Mauermann M, Uppenkamp S, Verhey J (eds) Hearing – from sensory processing to perception. Springer, New York, pp 257–266
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Institute for Systems Research, University of Maryland, College Park, MD, 20742, USA
Shihab Shamma, Ling Ma, Pingbo Yin & Yanbo Xu
Bioengineering Program, University of Maryland, College Park, MD, 20742, USA
Ling Ma
Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
Mounya Elhilali
Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Christophe Micheyl & Andrew J. Oxenham
Département d’études Cognitives, Equipe Audition, Ecole Normale Superieure, Paris, France
Daniel Pressnitzer
Laboratoire de Psychologie de la Perception (UMR CNRS 8158), Université Paris Descartes, Paris, France
Daniel Pressnitzer

Authors

Shihab Shamma
View author publications
You can also search for this author in PubMed Google Scholar
Mounya Elhilali
View author publications
You can also search for this author in PubMed Google Scholar
Ling Ma
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Micheyl
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Oxenham
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Pressnitzer
View author publications
You can also search for this author in PubMed Google Scholar
Pingbo Yin
View author publications
You can also search for this author in PubMed Google Scholar
Yanbo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shihab Shamma .

Editor information

Editors and Affiliations

Department of Experimental Psychology, University of Cambridge, Cambridge, United Kingdom
Brian C. J. Moore
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Roy D. Patterson
Physiology Department, University of Cambridge, Cambridge, United Kingdom
Ian M. Winter
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Robert P. Carlyon
MRC-Cognition and Brain Sciences Unit, MRC-Cognition and Brain Sciences Unit, Cambridge, United Kingdom
Hedwig E Gockel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shamma, S. et al. (2013). Temporal Coherence and the Streaming of Complex Sounds. In: Moore, B., Patterson, R., Winter, I., Carlyon, R., Gockel, H. (eds) Basic Aspects of Hearing. Advances in Experimental Medicine and Biology, vol 787. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1590-9_59

Download citation

DOI: https://doi.org/10.1007/978-1-4614-1590-9_59
Published: 16 April 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1589-3
Online ISBN: 978-1-4614-1590-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics