Skip to main content
Log in

An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears

  • Research Article
  • Published:
Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

A Correction to this article was published on 16 July 2021

This article has been updated

Abstract

Normal-hearing (NH) listeners use frequency cues, such as fundamental frequency (voice pitch), to segregate sounds into discrete auditory streams. However, many hearing-impaired (HI) individuals have abnormally broad binaural pitch fusion which leads to fusion and averaging of the original monaural pitches into the same stream instead of segregating the two streams (Oh and Reiss, 2017) and may similarly lead to fusion and averaging of speech streams across ears. In this study, using dichotic speech stimuli, we examined the relationship between speech fusion and vowel identification. Dichotic vowel perception was measured in NH and HI listeners, with across-ear fundamental frequency differences varied. Synthetic vowels /i/, /u/, /a/, and /ae/ were generated with three fundamental frequencies (F0) of 106.9, 151.2, and 201.8 Hz and presented dichotically through headphones. For HI listeners, stimuli were shaped according to NAL-NL2 prescriptive targets. Although the dichotic vowels presented were always different across ears, listeners were not informed that there were no single vowel trials and could identify one vowel or two different vowels on each trial. When there was no F0 difference between the ears, both NH and HI listeners were more likely to fuse the vowels and identify only one vowel. As ΔF0 increased, NH listeners increased the percentage of two-vowel responses, but HI listeners were more likely to continue to fuse vowels even with large ΔF0. Binaural tone fusion range was significantly correlated with vowel fusion rates in both NH and HI listeners. Confusion patterns with dichotic vowels differed from those seen with concurrent monaural vowels, suggesting different mechanisms behind the errors. Together, the findings suggest that broad fusion leads to spectral blending across ears, even for different ΔF0, and may hinder the stream segregation and understanding of speech in the presence of competing talkers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Change history

  • 15 June 2021

    This article was updated to correct the author affiliations, the labeling in Figure 10, and the Acknowledgments.

  • 16 July 2021

    A Correction to this paper has been published: https://doi.org/10.1007/s10162-021-00802-6

References

  • Arehart KH, King CA, Mclean-Mudgett KS (1997) Role of fundamental frequency differences in the perceptual separation of competing vowel sounds by listeners with normal hearing and listeners with hearing loss. J Speech Lang Hear Res 40:1434–1444

    Article  CAS  Google Scholar 

  • Arehart KH, Katz-Rossi J, Prustman JS (2005) Double-vowel perception in listeners with cochlear hearing loss: differences in fundamental frequency, ear of presentation, and relative amplitude. J Speech Lang Hear Res 48:236–252

    Article  Google Scholar 

  • Boersma P, Weenink D (2016) Praat: doing phonetics by computer [Computer program]. Version 6:22

    Google Scholar 

  • Brungart D (2001) Informational and energetic masking effects in the perception of two simultaneous talkers. J Acoust Soc Am 109:1101–1109

    Article  CAS  Google Scholar 

  • Carney LH, Mcdonough JM (2019) Nonlinear auditory models yield new insights into representations of vowels. Atten Percept Psychophys 81(4):1034–1046

    Article  Google Scholar 

  • Chintanpelli A, Ahlstrohm JB, Dubno JR (2014) Computational model predictions of cues for concurrent vowel identification. J Assoc Res Otolaryngol 15:823–837

    Article  Google Scholar 

  • Eggermont JJ, Wang X (2011) Temporal coding in auditory cortex. In: Winer J., Schreiner C. (eds) The Auditory Cortex. Springer, Boston, MA

  • Festen JM, Plomp R (1990) Effect of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. J Acoust Soc Am 88:1725–1736

    Article  CAS  Google Scholar 

  • Fitzpatrick EM, Leblanc S (2010) Exploring the factors influencing discontinued hearing aid use in patients with unilateral cochlear implants. Trends Amplif 14(4):199–210

    Article  Google Scholar 

  • Florentine M, PopperAN, Fay RR (2011) Chapter 2, Loudness. Springer, 17–56

  • Folstein MF, Folstein SE, Mchugh PR (1975) Mini-mental state: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 12:189–198

    Article  CAS  Google Scholar 

  • Greenhouse SW, Geisser S (1959) On methods in the analysis of profile data. Psychometrika 24:95–112

    Article  Google Scholar 

  • Hermansky H. (1990) Perceptual linear predictive (PLP) analysis of speech. J Acoust Soc. Am 87:1738-1752

  • Hromadka T, Deweese MR, Zador AM (2008) Sparse representation of sounds in the unanesthetized auditory cortex. PLoS Biol 6(1):e16

    Article  Google Scholar 

  • Kidd G Jr, Mason CR, Richards VM, Gallun FJ, Durlarch NI (2008) Informational Masking. In: Yost WA, Popper AN, Fay RR (eds) Springer Handbook of Auditory Research: Auditory Perception of Sound Sources. Springer, New York, NY, pp 143–189

    Chapter  Google Scholar 

  • Kwon BJ, Perry TT (2014) Identification and multiplicity of double vowels in cochlear implant users. J Speech Lang Hear Res 57(5):1983–1996

    Article  Google Scholar 

  • Mauchly JW (1940) Significance test for sphericity of a normal n-variate distribution. Ann Math Stat 11(2):204–209

    Article  Google Scholar 

  • Odenthal DW (1963) Perception and neural representation of simultaneous dichotic pure tone stimuli. Acta Physiol Pharmacol: Neerl 12:453–496

    CAS  Google Scholar 

  • Oh Y, Hartling CL, Srinivasan NK, Eddolls M, Diedesch AC, Gallun FJ, Reiss L A J (2019) Broad binaural fusion impairs segregation of speech based on voice pitch differences in a ‘cocktail party’ environment. BioRxiv, 805309. https://doi.org/10.1101/805309

  • Oh Y, Reiss LA (2017) Binaural pitch fusion: pitch averaging and dominance in hearing-impaired listeners with broad fusion. J Acoust Soc Am 142(2):780

    Article  Google Scholar 

  • Peterson GE, Barney HE (1952) Control of methods used in a study of vowels. J Acoust Soc Am 24:175–184

    Article  Google Scholar 

  • Reiss LA, Eggleston JL, Walker EP, Oh Y (2016) Two ears are not always better than one: mandatory vowel fusion across spectrally mismatched ears in hearing-impaired listeners. J Assoc Res Otolaryngol 17:341–356

    Article  Google Scholar 

  • Reiss LAJ, Shayman CS, Walker EP, Benneth KO, Fowler JR, Harting CL, Glickman B, Lasarev MR, Oh Y (2017) Binaural pitch fusion: comparison of normal-hearing and hearing-impaired listeners. J Acoust Soc Am 141(3):1909–1920

    Article  Google Scholar 

  • Rolls ET, Treves A (2011) The neural encoding of information in the brain. Prog Neurobiol 96:448–890

    Article  Google Scholar 

  • Smith SS, Chintanpelli A, Heinz MG, Summer CJ (2018) Revisiting models of concurrent vowel identification: the critical case of no pitch differences. Acta Acust 104(5):922–925

    Article  Google Scholar 

  • Souza PE, Boike KT, Witherall K, Tremblay K (2007) Prediction of speech recognition from audibility in older listeners with hearing loss: effects of age, amplification, and background noise. J Am Acad Audiol 18:54–65

    Article  Google Scholar 

  • Studebaker GA (1985) A “rationalized” arcsine transform. J Speech Lang Hear Res 28(3):455–462

    Article  CAS  Google Scholar 

  • Summerfield Q, Assann PF (1991) Perception of concurrent vowels: effects of harmonic misalignment and pitch-period asynchrony. J Acoust Soc Am 89:1364–1377

    Article  CAS  Google Scholar 

  • Summers V, Leek MR (1998) F0 processing and the separation of competing speech signals by listeners with normal hearing and with hearing loss. J Speech Lang Hear Res 41:1294–1306

    Article  CAS  Google Scholar 

  • Swanner H, Eddolls M, Molis M, Reiss L (2020) Effects of formant overlap on vowel fusion and identification. American Auditory Society 47th Annual Scientific & Technology Conference, poster 183

  • Thurfklow WR, Bernstein S (1957) Simultaneous two-tone pitch discrimination. J Acoust Soc Am 29:515–519

    Article  Google Scholar 

  • Van Den Brink G, Sintnicolaas K, Van Stam WS (1976) Dichotic pitch fusion. J Acoust Soc Am 59(6):1471–1476

    Article  Google Scholar 

  • Walden TC, Walden BE (2005) Unilateral versus bilateral amplification for adults with impaired hearing. J Am Acad Audiol 16(8):574–584

    Article  Google Scholar 

  • Zwicker UT (1984) Auditory recognition of diotic and dichotic vowel pairs. Speech Commun 3:265–277

    Article  Google Scholar 

Download references

Acknowledgements

This research was supported by grants R01 DC013307 from the National Institutes of Deafness and Communication Disorders, National Institutes of Health, and VA RR&D NCRAR C2361-C. We thank Sara Simmons, Larissa Katrina, and Morgan Eddolls for help with collection of data.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lina A.J Reiss.

Ethics declarations

Disclaimer

The contents do not represent the views of the US Department of Veterans Affairs or the U.S. Government.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article was updated to correct the labeling in Figure 10.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Reiss, L.A., Molis, M.R. An Alternative Explanation for Difficulties with Speech in Background Talkers: Abnormal Fusion of Vowels Across Fundamental Frequency and Ears. JARO 22, 443–461 (2021). https://doi.org/10.1007/s10162-021-00790-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10162-021-00790-7

Keywords

Navigation