An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears

Reiss, Lina A.J; Molis, Michelle R

doi:10.1007/s10162-021-00790-7

An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears

Research Article
Published: 20 April 2021

Volume 22, pages 443–461, (2021)
Cite this article

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

791 Accesses
8 Citations
62 Altmetric
9 Mentions
Explore all metrics

A Correction to this article was published on 16 July 2021

This article has been updated

Abstract

Normal-hearing (NH) listeners use frequency cues, such as fundamental frequency (voice pitch), to segregate sounds into discrete auditory streams. However, many hearing-impaired (HI) individuals have abnormally broad binaural pitch fusion which leads to fusion and averaging of the original monaural pitches into the same stream instead of segregating the two streams (Oh and Reiss, 2017) and may similarly lead to fusion and averaging of speech streams across ears. In this study, using dichotic speech stimuli, we examined the relationship between speech fusion and vowel identification. Dichotic vowel perception was measured in NH and HI listeners, with across-ear fundamental frequency differences varied. Synthetic vowels /i/, /u/, /a/, and /ae/ were generated with three fundamental frequencies (F₀) of 106.9, 151.2, and 201.8 Hz and presented dichotically through headphones. For HI listeners, stimuli were shaped according to NAL-NL2 prescriptive targets. Although the dichotic vowels presented were always different across ears, listeners were not informed that there were no single vowel trials and could identify one vowel or two different vowels on each trial. When there was no F₀ difference between the ears, both NH and HI listeners were more likely to fuse the vowels and identify only one vowel. As ΔF₀ increased, NH listeners increased the percentage of two-vowel responses, but HI listeners were more likely to continue to fuse vowels even with large ΔF₀. Binaural tone fusion range was significantly correlated with vowel fusion rates in both NH and HI listeners. Confusion patterns with dichotic vowels differed from those seen with concurrent monaural vowels, suggesting different mechanisms behind the errors. Together, the findings suggest that broad fusion leads to spectral blending across ears, even for different ΔF₀, and may hinder the stream segregation and understanding of speech in the presence of competing talkers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Article 24 May 2016

Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments

Article Open access 09 April 2020

Tinnitus impairs segregation of competing speech in normal-hearing listeners

Article Open access 16 November 2020

Change history

15 June 2021
This article was updated to correct the author affiliations, the labeling in Figure 10, and the Acknowledgments.
16 July 2021
A Correction to this paper has been published: https://doi.org/10.1007/s10162-021-00802-6

References

Arehart KH, King CA, Mclean-Mudgett KS (1997) Role of fundamental frequency differences in the perceptual separation of competing vowel sounds by listeners with normal hearing and listeners with hearing loss. J Speech Lang Hear Res 40:1434–1444
Article CAS Google Scholar
Arehart KH, Katz-Rossi J, Prustman JS (2005) Double-vowel perception in listeners with cochlear hearing loss: differences in fundamental frequency, ear of presentation, and relative amplitude. J Speech Lang Hear Res 48:236–252
Article Google Scholar
Boersma P, Weenink D (2016) Praat: doing phonetics by computer [Computer program]. Version 6:22
Google Scholar
Brungart D (2001) Informational and energetic masking effects in the perception of two simultaneous talkers. J Acoust Soc Am 109:1101–1109
Article CAS Google Scholar
Carney LH, Mcdonough JM (2019) Nonlinear auditory models yield new insights into representations of vowels. Atten Percept Psychophys 81(4):1034–1046
Article Google Scholar
Chintanpelli A, Ahlstrohm JB, Dubno JR (2014) Computational model predictions of cues for concurrent vowel identification. J Assoc Res Otolaryngol 15:823–837
Article Google Scholar
Eggermont JJ, Wang X (2011) Temporal coding in auditory cortex. In: Winer J., Schreiner C. (eds) The Auditory Cortex. Springer, Boston, MA
Festen JM, Plomp R (1990) Effect of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. J Acoust Soc Am 88:1725–1736
Article CAS Google Scholar
Fitzpatrick EM, Leblanc S (2010) Exploring the factors influencing discontinued hearing aid use in patients with unilateral cochlear implants. Trends Amplif 14(4):199–210
Article Google Scholar
Florentine M, PopperAN, Fay RR (2011) Chapter 2, Loudness. Springer, 17–56
Folstein MF, Folstein SE, Mchugh PR (1975) Mini-mental state: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 12:189–198
Article CAS Google Scholar
Greenhouse SW, Geisser S (1959) On methods in the analysis of profile data. Psychometrika 24:95–112
Article Google Scholar
Hermansky H. (1990) Perceptual linear predictive (PLP) analysis of speech. J Acoust Soc. Am 87:1738-1752
Hromadka T, Deweese MR, Zador AM (2008) Sparse representation of sounds in the unanesthetized auditory cortex. PLoS Biol 6(1):e16
Article Google Scholar
Kidd G Jr, Mason CR, Richards VM, Gallun FJ, Durlarch NI (2008) Informational Masking. In: Yost WA, Popper AN, Fay RR (eds) Springer Handbook of Auditory Research: Auditory Perception of Sound Sources. Springer, New York, NY, pp 143–189
Chapter Google Scholar
Kwon BJ, Perry TT (2014) Identification and multiplicity of double vowels in cochlear implant users. J Speech Lang Hear Res 57(5):1983–1996
Article Google Scholar
Mauchly JW (1940) Significance test for sphericity of a normal n-variate distribution. Ann Math Stat 11(2):204–209
Article Google Scholar
Odenthal DW (1963) Perception and neural representation of simultaneous dichotic pure tone stimuli. Acta Physiol Pharmacol: Neerl 12:453–496
CAS Google Scholar
Oh Y, Hartling CL, Srinivasan NK, Eddolls M, Diedesch AC, Gallun FJ, Reiss L A J (2019) Broad binaural fusion impairs segregation of speech based on voice pitch differences in a ‘cocktail party’ environment. BioRxiv, 805309. https://doi.org/10.1101/805309
Oh Y, Reiss LA (2017) Binaural pitch fusion: pitch averaging and dominance in hearing-impaired listeners with broad fusion. J Acoust Soc Am 142(2):780
Article Google Scholar
Peterson GE, Barney HE (1952) Control of methods used in a study of vowels. J Acoust Soc Am 24:175–184
Article Google Scholar
Reiss LA, Eggleston JL, Walker EP, Oh Y (2016) Two ears are not always better than one: mandatory vowel fusion across spectrally mismatched ears in hearing-impaired listeners. J Assoc Res Otolaryngol 17:341–356
Article Google Scholar
Reiss LAJ, Shayman CS, Walker EP, Benneth KO, Fowler JR, Harting CL, Glickman B, Lasarev MR, Oh Y (2017) Binaural pitch fusion: comparison of normal-hearing and hearing-impaired listeners. J Acoust Soc Am 141(3):1909–1920
Article Google Scholar
Rolls ET, Treves A (2011) The neural encoding of information in the brain. Prog Neurobiol 96:448–890
Article Google Scholar
Smith SS, Chintanpelli A, Heinz MG, Summer CJ (2018) Revisiting models of concurrent vowel identification: the critical case of no pitch differences. Acta Acust 104(5):922–925
Article Google Scholar
Souza PE, Boike KT, Witherall K, Tremblay K (2007) Prediction of speech recognition from audibility in older listeners with hearing loss: effects of age, amplification, and background noise. J Am Acad Audiol 18:54–65
Article Google Scholar
Studebaker GA (1985) A “rationalized” arcsine transform. J Speech Lang Hear Res 28(3):455–462
Article CAS Google Scholar
Summerfield Q, Assann PF (1991) Perception of concurrent vowels: effects of harmonic misalignment and pitch-period asynchrony. J Acoust Soc Am 89:1364–1377
Article CAS Google Scholar
Summers V, Leek MR (1998) F0 processing and the separation of competing speech signals by listeners with normal hearing and with hearing loss. J Speech Lang Hear Res 41:1294–1306
Article CAS Google Scholar
Swanner H, Eddolls M, Molis M, Reiss L (2020) Effects of formant overlap on vowel fusion and identification. American Auditory Society 47th Annual Scientific & Technology Conference, poster 183
Thurfklow WR, Bernstein S (1957) Simultaneous two-tone pitch discrimination. J Acoust Soc Am 29:515–519
Article Google Scholar
Van Den Brink G, Sintnicolaas K, Van Stam WS (1976) Dichotic pitch fusion. J Acoust Soc Am 59(6):1471–1476
Article Google Scholar
Walden TC, Walden BE (2005) Unilateral versus bilateral amplification for adults with impaired hearing. J Am Acad Audiol 16(8):574–584
Article Google Scholar
Zwicker UT (1984) Auditory recognition of diotic and dichotic vowel pairs. Speech Commun 3:265–277
Article Google Scholar

Download references

Acknowledgements

This research was supported by grants R01 DC013307 from the National Institutes of Deafness and Communication Disorders, National Institutes of Health, and VA RR&D NCRAR C2361-C. We thank Sara Simmons, Larissa Katrina, and Morgan Eddolls for help with collection of data.

Author information

Authors and Affiliations

Department of Otolaryngology, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR, 97239, USA
Lina A.J Reiss & Michelle R Molis
National Center for Rehabilitative Auditory Research, VA Portland Health Care System, 3710 SW US Veterans Road, Portland, OR, 97239, USA
Michelle R Molis

Authors

Lina A.J Reiss
View author publications
You can also search for this author in PubMed Google Scholar
Michelle R Molis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lina A.J Reiss.

Ethics declarations

Disclaimer

The contents do not represent the views of the US Department of Veterans Affairs or the U.S. Government.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article was updated to correct the labeling in Figure 10.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Reiss, L.A., Molis, M.R. An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears. JARO 22, 443–461 (2021). https://doi.org/10.1007/s10162-021-00790-7

Download citation

Received: 03 May 2020
Accepted: 07 February 2021
Published: 20 April 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s10162-021-00790-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears

Abstract

Access this article

Similar content being viewed by others

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments

Tinnitus impairs segregation of competing speech in normal-hearing listeners

Change history

15 June 2021

16 July 2021

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Disclaimer

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears

Abstract

Access this article

Similar content being viewed by others

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments

Tinnitus impairs segregation of competing speech in normal-hearing listeners

Change history

15 June 2021

16 July 2021

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Disclaimer

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation