A chimpanzee recognizes varied acoustical versions of sine-wave and noise-vocoded speech

Heimbauer, Lisa A.; Beran, Michael J.; Owren, Michael J.

doi:10.1007/s10071-021-01478-4

Original Paper
Published: 08 February 2021

Animal Cognition, Volume 24, pages 843–854 (2021)

Abstract

Previous research demonstrated that a language-trained chimpanzee recognized familiar English words in sine-wave and noise-vocoded forms (Heimbauer et al. Curr Biol 21:1210–1214, 2011). However, those results did not provide information regarding processing strategies or the specific acoustic cues to which the chimpanzee may have attended. The current experiments tested this chimpanzee and adult humans using sine-wave speech built from specific sine-wave components and noise-vocoded speech built from different numbers of noise bands. Similar to humans tested with the same stimuli, the chimpanzee was more successful identifying sine-wave speech when both SW1 and SW2 were present; these components are modeled on formants F1 and F2 in the natural speech signal. Results with noise-vocoded speech revealed that the chimpanzee and humans performed best with stimuli that included four or five noise bands, compared with those that included three or two. Overall, amplitude and frequency modulation over time were important for identification of sine-wave and noise-vocoded speech. The results also provide further evidence that a nonhuman primate is capable of using top-down processes for speech perception when the signal is altered and incomplete.
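To make the sine-wave manipulation concrete, the following is a minimal sketch of how a sine-wave stimulus might be assembled from a chosen subset of components. It is not the authors' stimulus-generation procedure (the reference list points to Praat-based tools for that). The function name, the 10 ms frame length, the track values, and the presence of a third component labeled SW3 are all assumptions made for illustration.

```python
import math


def synthesize_sine_wave_speech(tracks, components, sample_rate=16000):
    """Sum the selected time-varying sinusoids into one waveform.

    tracks: dict mapping a component name (e.g. "SW1") to a list of
            (frequency_hz, amplitude) pairs, one pair per 10 ms frame.
    components: names of the tracks to include in the output.
    """
    frame_len = sample_rate // 100  # samples per 10 ms frame (assumed frame size)
    n_frames = min(len(tracks[name]) for name in components)
    samples = [0.0] * (n_frames * frame_len)
    for name in components:
        phase = 0.0  # accumulate phase so the tone stays continuous across frames
        for i in range(len(samples)):
            freq, amp = tracks[name][i // frame_len]
            phase += 2.0 * math.pi * freq / sample_rate
            samples[i] += amp * math.sin(phase)
    return samples


# Made-up tracks for illustration only (not measured from any real stimulus).
tracks = {
    "SW1": [(500.0, 1.0), (550.0, 0.9), (600.0, 0.8)],
    "SW2": [(1500.0, 0.5), (1450.0, 0.5), (1400.0, 0.4)],
    "SW3": [(2500.0, 0.2), (2500.0, 0.2), (2500.0, 0.2)],
}
full = synthesize_sine_wave_speech(tracks, ["SW1", "SW2", "SW3"])
sw1_sw2_only = synthesize_sine_wave_speech(tracks, ["SW1", "SW2"])
```

Each component carries both frequency modulation (the changing `frequency_hz`) and amplitude modulation (the changing `amplitude`), the two cues the abstract identifies as important. Dropping a name from `components` yields a reduced version of the same stimulus.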

References

Beran MJ (2010) Use of exclusion by a chimpanzee (Pan troglodytes) during speech perception and auditory-visual matching-to-sample. Behav Process 83:287–291
Beran MJ, Heimbauer LA (2015) A longitudinal assessment of vocabulary retention in symbol-competent chimpanzees (Pan troglodytes). PLoS ONE 10:e0118408. https://doi.org/10.1371/journal.pone.0118408
Beran MJ, Washburn DA (2002) Chimpanzee responding during matching to sample: Control by exclusion. J Exp Anal Behav 78:497–508
Beran MJ, Pate JL, Washburn DA, Rumbaugh DM (2004) Sequential responding and planning in chimpanzees (Pan troglodytes) and rhesus macaques (Macaca mulatta). J Exp Psychol Anim Behav Process 30:203–212
Boersma P, Weenink D (2008) Praat: doing phonetics by computer [Computer program]. Version 5.1.11. http://www.praat.org/. Retrieved 1 Sept 2008
Brakke KE, Savage-Rumbaugh ES (1995a) The development of language skills in bonobo and chimpanzee–I. Comprehension. Lang Commun 15:121–148
Brakke KE, Savage-Rumbaugh ES (1995b) The development of language skills in Pan–II. Production. Lang Commun 16:361–380
Davis MH, Johnsrude IS (2007) Hearing speech sounds: Top-down influences on the interface between audition and speech perception. Hear Res 229:132–147
Davis MH, Johnsrude IS, Hervais-Adelman A, Taylor K, McGettigan C (2005) Lexical information drives perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences. J Exp Psychol Gen 134:222–241
Dorman MF, Loizou PC, Spahr AJ, Maloff E (2002) A comparison of the speech understanding provided by acoustic models of fixed-channel and channel-picking signal processors for cochlear implants. J Speech Lang Hear R 45:783–788
Drullman R (2006) The significance of temporal modulation frequencies for speech intelligibility. In: Greenberg S, Ainsworth WA (eds) Listening to speech: An auditory perspective. Erlbaum, NJ, pp 43–67
Greenwood DD (1961) Critical bandwidth and the frequency coordinates of the basilar membrane. J Acoust Soc Am 33:1344–1356
Greenwood DD (1990) A cochlear frequency-position function for several species – 29 years later. J Acoust Soc Am 87:2592–2605
Heimbauer LA, Beran MJ, Owren MJ (2011) A chimpanzee recognizes synthetic speech with significantly reduced acoustic cues to phonetic content. Curr Biol 21:1210–1214
Heimbauer LA, Beran MJ, Owren MJ (2018) A chimpanzee’s (Pan troglodytes) perception of variations in speech: Identification of familiar words when whispered and when spoken by a variety of talkers. Int J Psychol 31:1–16
Hillenbrand JM, Clark MJ, Baer CA (2011) Perception of sinewave vowels. J Acoust Soc Am 129:3991–4000
Kluender KR, Diehl RL, Killeen PR (1987) Japanese quail can learn phonetic categories. Science 237:1195–1197
Kuhl PK (1988) Auditory perception and the evolution of speech. Hum Evol 3:19–43
Kuhl PK, Miller JD (1975) Speech perception by the chinchilla: Voiced-voiceless distinction in alveolar plosive consonants. Science 190:69–72
Kuhl PK, Padden DM (1982) Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques. Percept Psychophys 35:542–550
Kuhl PK, Padden DM (1983) Enhanced discriminability at the phonetic boundaries for the place feature in macaques. J Acoust Soc Am 73:1003–1010
Lenneberg EH (1967) Biological foundations of language. Wiley, NY
Lewis DE, Carrell TD (2007) The effect of amplitude modulation on intelligibility of time-varying sinusoidal speech in children and adults. Percept Psychophys 69:1140–1151
Liberman AM (1982) On finding that speech is special. Am Psychol 37:148–167
Mann VA, Liberman AM (1983) Some differences between phonetic and auditory modes of perception. Cognition 14:211–235
Marcus G, Vijayan S, Rao S, Vishton PM (1999) Rule learning by seven-month-old infants. Science 283:77–80
Newman RS (2006) Perceptual restoration in toddlers. Percept Psychophys 68:625–642
Nygaard LC, Pisoni DB (1998) Talker-specific learning in speech perception. Percept Psychophys 60:355–376
Nygaard LC, Sommers MS, Pisoni DB (1994) Speech perception as a talker-contingent process. Psychol Sci 5:42–46
Owren MJ (2010) GSU Praat Tools: Scripts for modifying and analyzing sounds using Praat acoustics software. Behav Res Methods 40:822–829
Pisoni DB (1995) Some thoughts on "normalization" in speech perception. Research on Spoken Language Processing, Progress Report No. 20, Indiana University, pp 3–29
Remez RE (2005) Perceptual organization of speech. In: Pisoni DB, Remez RE (eds) The handbook of speech perception. Wiley-Blackwell, Oxford, pp 28–50
Remez RE, Rubin PE (1990) On the perception of speech from time-varying acoustic information: Contributions of amplitude variation. Percept Psychophys 48:313–325
Remez RE, Rubin PE, Pisoni DB, Carrell TD (1981) Speech perception without traditional speech cues. Science 212:947–949
Remez RE, Rubin PE, Berns SM, Pardo JS, Lang JM (1994) On the perceptual organization of speech. Psychol Rev 101:129–156
Remez RE, Dubowski KR, Broder RS, Davids ML, Grossman YS, Moskalenko M, Pardo JS, Hasbun SM (2011) Auditory-phonetic projection and lexical structure in the recognition of sine-wave words. J Exp Psychol Human 37:968–977
Remez RE, Thomas EF, Dubowski KR, Koinis SM, Porter NAC, Paddu NU, Moskalenko M, Grossman YS (2013) Modulation sensitivity in the perceptual organization of speech. Atten Percept Psychophys 75:1353–1358
Rosner BS, Talcott JB, Witton C, Hogg JD, Richardson AJ, Hansen PC, Stein JF (2003) The perception of “Sine-Wave Speech” by adults with developmental dyslexia. J Speech Lang Hear Res 46(1):68–79
Rumbaugh DM, Washburn DA (2003) Intelligence of apes and other rational beings. Yale University, CT
Saffran JR, Aslin RN, Newport EL (1996) Statistical learning by 8-month-old infants. Science 274:1926–1928
Sawusch JR (2005) Acoustic analysis and synthesis of speech. In: Pisoni DB, Remez RE (eds) The handbook of speech perception. Blackwell, Oxford, pp 7–27
Shannon RV, Zeng F, Kamath V, Wygonski J, Ekelid M (1995) Speech recognition with primarily temporal cues. Science 270:303–304
Sommers MS, Kirk KI, Pisoni DB (1997) Some considerations in evaluating spoken word recognition by normal-hearing, noise-masked normal-hearing, and cochlear implant listeners. I. The effects of response format. Ear Hear 18:89–99
Souza P, Rosen S (2009) Effects of envelope bandwidth on the intelligibility of sine- and noise-vocoded speech. J Acoust Soc Am 126:792–805
Trout JD (2001) The biological basis of speech: What to infer from talking to the animals. Psychol Rev 108:523–549
Werker JF, Desjardins RN (1995) Listening to speech in the first year of life: Experiential influences on phoneme perception. Curr Dir Psychol Sci 4:76–81
Whalen DH, Liberman AM (1987) Speech perception takes precedence over nonspeech perception. Science 237:169–171

Acknowledgements

We thank the care staff at Georgia State University for their care of Panzee throughout this experiment and the College of Arts and Sciences at Georgia State University for financial support for this project.

Funding

Work was supported by the National Institute of Child Health and Human Development, National Science Foundation, GSU’s RCALL and Brains & Behavior programs, the Center for Behavioral Neuroscience under the Science and Technology Centers Program of the National Science Foundation under agreement IBN-9876754, and ERC grant agreement AdG249516. LAH was funded by an RCALL Fellowship and a Duane M. Rumbaugh Fellowship.

Author information

Authors and Affiliations

State University of New York at Delhi, Delhi, NY, USA
Lisa A. Heimbauer
Georgia State University, Atlanta, GA, USA
Michael J. Beran
Emory University, Atlanta, GA, USA
Michael J. Owren

Corresponding author

Correspondence to Lisa A. Heimbauer.

Ethics declarations

Conflicts of interest

The authors declare that they have no competing interests and no conflict of interest.

Availability of data and material

The data are available in the Open Science Framework.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Michael J. Owren deceased (2014).

About this article

Cite this article

Heimbauer, L.A., Beran, M.J. & Owren, M.J. A chimpanzee recognizes varied acoustical versions of sine-wave and noise-vocoded speech. Anim Cogn 24, 843–854 (2021). https://doi.org/10.1007/s10071-021-01478-4

Received: 06 June 2020
Revised: 01 January 2021
Accepted: 11 January 2021
Published: 08 February 2021
Issue Date: July 2021
DOI: https://doi.org/10.1007/s10071-021-01478-4
