Binaural Glimpses at the Cocktail Party?
- 396 Downloads
Humans often have to focus on a single target sound while ignoring competing maskers in everyday situations. In such conditions, speech intelligibility (SI) is improved when a target speaker is spatially separated from a masker (spatial release from making, SRM) compared to situations where both are co-located. Such asymmetric spatial configurations lead to a ‘better-ear effect’ with improved signal-to-noise ratio (SNR) at one ear. However, maskers often surround the listener leading to more symmetric configurations where better-ear effects are absent in a long-term, wideband sense. Nevertheless, better-ear glimpses distributed across time and frequency persist and were suggested to account for SRM (Brungart and Iyer 2012). Here, speech reception was assessed using symmetric masker configurations while varying the spatio-temporal distribution of potential better-ear glimpses. Listeners were presented with a frontal target and eight single-talker maskers in four different symmetrical spatial configurations. Compared to the reference condition with co-located target and maskers, an SRM of up to 6 dB was observed. The SRM persisted when the frequency range of the maskers above or below 1500 Hz was replaced with stationary speech-shaped noise. Comparison to a recent short-time binaural SI model showed that better-ear glimpses can account for half the observed SRM, while binaural interaction utilizing phase differences is required to explain the other half.
Keywordsbetter-ear listening glimpsing release from masking speech intelligibility model speech reception thresholds
This work was supported by the Bernstein Center for Computational Neuroscience, the German Center for Vertigo and Balance Disorders (IFB) and the DFG SFB TRR 31. We thank Lisa Benda and Annika Sander for their support during data acquisition.
- Bronkhorst AW (2000) The cocktail party phenomenon: a review of research on speech intelligibility in multiple-talker conditions. Acustica 86:117–128Google Scholar
- Ewert SD (2013) AFC—a modular framework for running psychoacoustic experiments and computational perception models. Proceedings of the International Conference on Acoustics AIA-DAGA 2013 in Merano, Italy: 1326–1329Google Scholar
- Freyman RL, Helfer KS, Balakrishnan, U (2005) Spatial and spectral factors in release from informational masking in speech recognition. Acta Acustica united with Acustica 91:537–545Google Scholar
- Kidd GJ, Mason CR, Richards VM, Gallun FJ, Durlach NI (2008) Informational masking. In: Yost William A., Popper Arthur N., R. FR (eds) Auditory perception of sound sources, vol 29. Springer US, pp 143–189. doi: 10.1007/978-0-387-71305-2_6
- Platte H-J, vom Hövel H (1980) Zur Deutung der Ergebnisse von Sprachverständlichkeitsmessungen mit Störschall im Freifeld. Acta Acustica United with Acustica 45:139–151Google Scholar
- Plomp R, Mimpen AM (1981) Effect of the orientation of the speaker’s head and the azimuth of a noise source on the speech-reception threshold for sentences. Acustica 48:325–329Google Scholar
- Wagener KC, Brand T, Kollmeier B (1999) Entwicklung und Evaluation eines Satztests für die Deutsche Sprache III: Evaluation des Oldenburger Satztests. Zeitschrift für Audiologie 28:86–95Google Scholar