Computational Model Predictions of Cues for Concurrent Vowel Identification

Chintanpalli, Ananthakrishna; Ahlstrom, Jayne B.; Dubno, Judy R.

doi:10.1007/s10162-014-0475-7

Computational Model Predictions of Cues for Concurrent Vowel Identification

Research Article
Published: 08 July 2014

Volume 15, pages 823–837, (2014)
Cite this article

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

Ananthakrishna Chintanpalli¹,
Jayne B. Ahlstrom¹ &
Judy R. Dubno¹

296 Accesses
17 Citations
1 Altmetric
Explore all metrics

Abstract

Although differences in fundamental frequencies (F0s) between vowels are beneficial for their segregation and identification, listeners can still segregate and identify simultaneous vowels that have identical F0s, suggesting that additional cues are contributing, including formant frequency differences. The current perception and computational modeling study was designed to assess the contribution of F0 and formant difference cues for concurrent vowel identification. Younger adults with normal hearing listened to concurrent vowels over a wide range of levels (25–85 dB SPL) for conditions in which F0 was the same or different between vowel pairs. Vowel identification scores were poorer at the lowest and highest levels for each F0 condition, and F0 benefit was reduced at the lowest level as compared to higher levels. To understand the neural correlates underlying level-dependent changes in vowel identification, a computational auditory-nerve model was used to estimate formant and F0 difference cues under the same listening conditions. Template contrast and average localized synchronized rate predicted level-dependent changes in the strength of phase locking to F0s and formants of concurrent vowels, respectively. At lower levels, poorer F0 benefit may be attributed to poorer phase locking to both F0s, which resulted from lower firing rates of auditory-nerve fibers. At higher levels, poorer identification scores may relate to poorer phase locking to the second formant, due to synchrony capture by lower formants. These findings suggest that concurrent vowel identification may be partly influenced by level-dependent changes in phase locking of auditory-nerve fibers to F0s and formants of both vowels.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Midbrain Synchrony to Envelope Structure Supports Behavioral Sensitivity to Single-Formant Vowel-Like Sounds in Noise

Article 20 October 2016

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Article 24 May 2016

Variability in the Estimated Amplitude of Vowel-Evoked Envelope Following Responses Caused by Assumed Neurophysiologic Processing Delays

Article 24 August 2022

REFERENCES

Arehart KH, King CA, McLean-Mudgett KS (1997) Role of fundamental frequency differences in the perceptual separation of competing vowel sounds by listeners with normal hearing and listeners with hearing loss. J Speech Lang Hear Res 40:1434–1444
Article PubMed CAS Google Scholar
Assmann PF, Summerfield Q (1990) Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. J Acoust Soc Am 88:680–697
Article PubMed CAS Google Scholar
Assmann PF, Summerfield Q (1994) The contribution of waveform interactions to the perception of concurrent vowels. J Acoust Soc Am 95:471–484
Article PubMed CAS Google Scholar
Bernstein JG, Oxenham AJ (2005) An autocorrelation model with place dependence to account for the effect of harmonic number on fundamental frequency discrimination. J Acoust Soc Am 117:3816–3831
Article PubMed PubMed Central Google Scholar
Bruce IC, Sachs MB, Young ED (2003) An auditory-periphery model of the effects of acoustic trauma on auditory nerve responses. J Acoust Soc Am 113:369–388
Article PubMed Google Scholar
Cariani P (2004) A temporal model for pitch multiplicity and tonal consonance. In: Lipscomb SD, Ashley R, Gjerdingen RO, Webster P (eds) Proceedings of the 8th International Conference on Music Perception and Cognition. SMPC, Evanston, pp 310–313
Google Scholar
Cariani PA, Delgutte B (1996) Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. J Neurophysiol 76:1698–1716
PubMed CAS Google Scholar
Carney LH (1993) A model for the responses of low-frequency auditory-nerve fibers in cat. J Acoust Soc Am 93:401–417
Article PubMed CAS Google Scholar
Carney LH (1999) Temporal response properties of neurons in the auditory pathway. Curr Opin Neurobiol 9:442–446
Article PubMed CAS Google Scholar
Chintanpalli A, Heinz MG (2013) The use of confusion patterns to evaluate the neural basis for concurrent vowel identification. J Acoust Soc Am 134:2988–3000
Chintanpalli A, Jennings SG, Heinz MG, Strickland EA (2012) Modeling the anti-masking effects of the olivocochlear reflex in auditory nerve responses to tones in sustained noise. J Assoc Res Otolaryngol 13:219–235
Article PubMed PubMed Central Google Scholar
de Cheveigné A (2005) Pitch perception models. In: Plack CJ, Oxenham A, Fay RR, Popper AN (eds) Pitch - Neural coding and perception. Springer, New York, pp 169–223
Google Scholar
Goldstein JL (1973) An optimum processor theory for the central formation of the pitch of complex tones. J Acoust Soc Am 54:1496–1516
Article PubMed CAS Google Scholar
Heinz MG, Zhang X, Bruce IC, Carney LH (2001) Auditory-nerve model for predicting performance limits of normal and impaired listeners. ARLO 2:91–96
Article Google Scholar
Johnson DH (1980) The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones. J Acoust Soc Am 68:1115–1122
Article PubMed CAS Google Scholar
Kawase T, Delgutte B, Liberman MC (1993) Antimasking effects of the olivocochlear reflex. II. Enhancement of auditory-nerve response to masked tones. J Neurophysiol 70:2533–2549
Keilson SE, Richards VM, Wyman BT, Young ED (1997) The representation of concurrent vowels in the cat anesthetized ventral cochlear nucleus: evidence for a periodicity-tagged spectral representation. J Acoust Soc Am 102:1056–1071
Article PubMed CAS Google Scholar
Klatt DH (1980) Software for a cascade/parallel formant synthesizer. J Acoust Soc Am 67:971–995
Article Google Scholar
Krishna BS, Semple MN (2000) Auditory temporal processing: responses to sinusoidally amplitude-modulated tones in the inferior colliculus. J Neurophysiol 84:255–273
PubMed CAS Google Scholar
Larsen E, Cedolin L, Delgutte B (2008) Pitch representations in the auditory nerve: two concurrent complex tones. J Neurophysiol 100:1301–1319
Article PubMed PubMed Central Google Scholar
Liberman MC (1978) Auditory-nerve response from cats raised in a low-noise chamber. J Acoust Soc Am 63:442–455
Article PubMed CAS Google Scholar
Licklider JCR (1951) A duplex theory of pitch perception. Experientia 7:128–133
Article PubMed CAS Google Scholar
Louage DHG, van der Heijden M, Joris PX (2004) Temporal properties of responses to broadband noise in the auditory nerve. J Neurophysiol 91:2051–2065
Article PubMed Google Scholar
Makary CA, Shin J, Kujawa SG, Liberman MC, Merchant SN (2011) Age-related primary cochlear neuronal degeneration in human temporal bones. J Assoc Res Otolaryngol 12:711–717
Article PubMed PubMed Central Google Scholar
Meddis R, Hewitt MJ (1991) Virtual pitch and phase-sensitivity studied using a computer model of the auditory periphery. I. Pitch identification. J Acoust Soc Am 89:2883–2894
Meddis R, Hewitt MJ (1992) Modeling the identification of concurrent vowels with different fundamental frequencies. J Acoust Soc Am 91:233–245
Article PubMed CAS Google Scholar
Meddis R, O’Mard L (1997) A unitary model of pitch perception. J Acoust Soc Am 102:1811–1820
Article PubMed CAS Google Scholar
Micheyl C, Oxenham AJ (2010) Pitch, harmonicity and concurrent sound segregation: psychoacoustical and neurophysiological findings. Hear Res 266:36–51
Article PubMed PubMed Central Google Scholar
Miller RL, Schilling JR, Franck KR, Young ED (1997) Effects of acoustic trauma on the representation of the vowel/ɛ/in cat auditory nerve fibers. J Acoust Soc Am 101:3602–3616
Article PubMed CAS Google Scholar
Nelson PC, Carney LH (2004) A phenomenological model of peripheral and central neural responses to amplitude-modulated tones. J Acoust Soc Am 116:2173–2186
Article PubMed PubMed Central Google Scholar
Palmer AR (1990) The representation of the spectra and fundamental frequencies of steady-state single- and double-vowel sounds in the temporal discharge patterns of guinea pig cochlear-nerve fibers. J Acoust Soc Am 88:1412–1426
Article PubMed CAS Google Scholar
Palmer AR (1992) Segregation of the responses to paired vowels in the auditory nerve of the guinea pig using autocorrelation. In: Schouten MEH (ed) The Auditory Processing of Speech: From Sounds to Words. Mouton-deGruyter, Berlin, pp 115–124
Peterson GE, Barney HL (1952) Control methods used in the study of vowels. J Acoust Soc Am 24:175–184
Article Google Scholar
Pressnitzer D, Patterson RD, Krumbholz K (2001) The lower limit of melodic pitch. J Acoust Soc Am 109:2074–2084
Article PubMed CAS Google Scholar
Sachs MB, Young ED (1979) Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. J Acoust Soc Am 66:470–479
Article PubMed CAS Google Scholar
Scheffers M (1983) Sifting vowels: auditory pitch analysis and sound segregation. Dissertation, Groningen University
Schmiedt RA, Mills JH, Boettcher FA (1996) Age-related loss of activity of auditory-nerve fibers. J Neurophysiol 76:2799–2803
PubMed CAS Google Scholar
Smalt CJ, Heinz MG, Strickland EA (2014) Modeling the time-varying and level-dependent effects of the medial olivocochlear reflex in auditory nerve responses. J Assoc Res Otolaryngol 15:159–173
Article PubMed Google Scholar
Studebaker GA (1985) A ‘rationalized’ arcsine transform. J Speech Hear Res 28:455–462
Article PubMed CAS Google Scholar
Summers V, Leek MR (1998) F0 processing and the separation of competing speech signals by listeners with normal hearing and with hearing loss. J Speech Lang Hear Res 41:1294–1306
Article PubMed CAS Google Scholar
Tan Q, Carney LH (2005) A phenomenological model for the responses of auditory-nerve fibers. II. Nonlinear tuning with a frequency glide. J Acoust Soc Am 114:2007–2020
Article Google Scholar
Vongpaisal T, Pichora-Fuller MK (2007) Effect of age on F0 difference limen and concurrent vowel identification. J Speech Lang Hear Res 50:1139–1156
Article PubMed Google Scholar
Young ED, Sachs MB (1979) Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. J Acoust Soc Am 66:1381–1403
Article PubMed CAS Google Scholar
Zhang X, Heinz MG, Bruce IC, Carney LH (2001) A phenomenological model for the responses of auditory-nerve fibers. I. Nonlinear tuning with compression and suppression. J Acoust Soc Am 109:648–670
Zilany MS, Bruce IC (2006) Modeling auditory-nerve responses for high sound pressure levels in the normal and impaired auditory periphery. J Acoust Soc Am 120:1446–1466
Article PubMed Google Scholar
Zilany MS, Bruce IC (2007) Representation of the vowel/ɛ/in normal and impaired auditory nerve fibers: model predictions of responses in cats. J Acoust Soc Am 122:402–417
Article PubMed Google Scholar
Zilany MS, Bruce IC, Nelson PC, Carney LH (2009) A phenomenological model of the synapse between the inner hair cell and auditory nerve: long-term adaptation with power-law dynamics. J Acoust Soc Am 126:2390–2412
Article PubMed PubMed Central Google Scholar

Download references

ACKNOWLEDGMENTS

This work was supported (in part) by research grants R01 DC000184 and P50 DC000422 from NIH/NIDCD and by the South Carolina Clinical and Translational Research (SCTR) Institute, with an academic home at the Medical University of South Carolina, NIH/NCRR Grant number UL1 RR029882. This investigation was conducted in a facility constructed with support from Research Facilities Improvement Program Grant Number C06 RR14516 from the National Center for Research Resources, National Institutes of Health. We thank Ian C. Bruce and Emily Buss for sharing MATLAB code, Michael G. Heinz for his valuable suggestions, Tyler W. Eisenhart for data collection, and Skyler G. Jennings and William J. Bologna for providing valuable comments on a previous version of the manuscript.

Author information

Authors and Affiliations

Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, SC, 29425-5500, USA
Ananthakrishna Chintanpalli, Jayne B. Ahlstrom & Judy R. Dubno

Authors

Ananthakrishna Chintanpalli
View author publications
You can also search for this author in PubMed Google Scholar
Jayne B. Ahlstrom
View author publications
You can also search for this author in PubMed Google Scholar
Judy R. Dubno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Judy R. Dubno.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chintanpalli, A., Ahlstrom, J.B. & Dubno, J.R. Computational Model Predictions of Cues for Concurrent Vowel Identification. JARO 15, 823–837 (2014). https://doi.org/10.1007/s10162-014-0475-7

Download citation

Received: 20 August 2013
Accepted: 03 June 2014
Published: 08 July 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s10162-014-0475-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Computational Model Predictions of Cues for Concurrent Vowel Identification

Abstract

Access this article

Similar content being viewed by others

Midbrain Synchrony to Envelope Structure Supports Behavioral Sensitivity to Single-Formant Vowel-Like Sounds in Noise

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Variability in the Estimated Amplitude of Vowel-Evoked Envelope Following Responses Caused by Assumed Neurophysiologic Processing Delays

REFERENCES

ACKNOWLEDGMENTS

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Computational Model Predictions of Cues for Concurrent Vowel Identification

Abstract

Access this article

Similar content being viewed by others

Midbrain Synchrony to Envelope Structure Supports Behavioral Sensitivity to Single-Formant Vowel-Like Sounds in Noise

Two Ears Are Not Always Better than One: Mandatory Vowel Fusion Across Spectrally Mismatched Ears in Hearing-Impaired Listeners

Variability in the Estimated Amplitude of Vowel-Evoked Envelope Following Responses Caused by Assumed Neurophysiologic Processing Delays

REFERENCES

ACKNOWLEDGMENTS

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation