Abstract
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter).
Similar content being viewed by others
References
Allan SE, Suthers RA (1994) Lateralization and motor stereotypy of song production in the brown-headed cowbird. J Neurobiol 25:1154–1166
Amador A, Sanz Perl Y, Mindlin GB, Margoliash D (2013) Elemental gesture dynamics are encoded by song premotor cortical neurons. Nature 495:59–64
Amin N, Doupe A, Theunissen FE (2007) Development of selectivity for natural sounds in the songbird auditory forebrain. J Neurophysiol 97:3517–3531
Armitage DW, Ober HK (2010) A comparison of supervised learning techniques in the classification of bat echolocation calls. Ecol Inform 5:465–473
Ballentine B, Searcy WA, Nowicki S (2008) Reliable aggressive signalling in swamp sparrows. Anim Behav 75:693–703
Bennur S, Tsunada J, Cohen YE, Liu RC (2013) Understanding the neurophysiological basis of auditory abilities for social communication: a perspective on the value of ethological paradigms. Hear Res 305:3–9
Brand LR (1976) Vocal repertoire of chipmunks (genus Eutamias) in california. Anim Behav 24:319–335
Breiman L (2001) Random forests. Mach Learn 45:5–32
Catchpole CK, Slater PBJ (1995) Bird song. Biological themes and variations. Cambridge University Press, Cambridge
Cheng J, Sun Y, Ji L (2010) A call-independent and automatic acoustic system for the individual recognition of animals: a novel model using four passerines. Pattern Recogn 43:3846–3852
Clayton N (1987) Song tutor choice in zebra finches. Anim Behav 35:714–722
Clayton N, Prove E (1989) Song discrimination in female zebra finches and bengalese finches. Anim Behav 38:352–354
Cohen L (1995) Time-frequency analysis. Prentice Hall, Englewood Cliffs
Collias NE (1987) The vocal repertoire of the red junglefowl: a spectrographic classification and the code of communication. Condor 89:510–524
Deaux EC, Clarke JA (2013) Dingo (Canis lupus dingo) acoustic repertoire: form and contexts. Behaviour 150:75–101
Dragonetti M, Caccamo C, Corsi F, Farsi F, Giovacchini P, Pollonara E, Giunchi D (2013) The vocal repertoire of the eurasian stone-curlew (Burhinus oedicnemus). Wilson J Ornithol 125:34–49
Elemans CP, Muller M, Larsen ON, van Leeuwen JL (2009) Amplitude and frequency modulation control of sound production in a mechanical model of the avian syrinx. J Exp Biol 212:1212–1224
Elie JE, Theunissen FE (2015) Meaning in the avian auditory cortex: Neural representation of communication calls. Eur J Neurosci 22:546–567
Elie JE, Mariette MM, Soula HA, Griffith SC, Mathevon N, Vignal C (2010) Vocal communication at the nest between mates in wild zebra finches: a private vocal duet? Anim Behav 80:597–605
Elie JE, Mathevon N, Vignal C (2011a) Same-sex pair-bonds are equivalent to male–female bonds in a life-long socially monogamous songbird. Behav Ecol Sociobiol 65:2197–2208
Elie JE, Soula HA, Mathevon N, Vignal C (2011b) Dynamics of communal vocalizations in a social songbird, the zebra finch (Taeniopygia guttata). J Acoust Soc Am 129:4037–4046
Evans CS, Evans L, Marler P (1993) On the meaning of alarm calls: functional reference in an avian vocal system. Anim Behav 46:23–38
Farabaugh SM (1982) The ecological and social significance of duetting. In: Kroodsma DE, Miller EH (eds) Acoustic communication in birds. Academic Press, New York, pp 85–124
Fee MS (2002) Measurement of the linear and nonlinear mechanical properties of the oscine syrinx: implications for function. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 188:829–839
Fee MS, Shraiman B, Pesaran B, Mitra PP (1998) The role of nonlinear dynamics of the syrinx in the vocalizations of a songbird. Nature 395:67–71
Ficken MS, Ficken RW, Witkin SR (1978) Vocal repertoire of black-capped chickadee. Auk 95:34–48
Fitch WT (1994) Vocal tract length perception and the evolution of language. Brown University, Providence
Fitch WT (1997) Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques. J Acoust Soc Am 102:1213–1222
Fitch WT (2000a) The evolution of speech: a comparative review. Trends Cogn Sci 4:258–267
Fitch WT (2000b) The phonetic potential of nonhuman vocal tracts: comparative cineradiographic observations of vocalizing animals. Phonetica 57:205–218
Fitch WT (2002) Comparative vocal production and the evolution of speech: Reinterpreting the descent of the larynx. In: Wray A (ed) The transition to language. Oxford University Press, Oxford
Fitch WT, Kelley JP (2000) Perception of vocal tract resonances by whooping cranes grus americana. Ethology 106:559–574
Fitch WT, Reby D (2001) The descended larynx is not uniquely human. Proc R Soc B Biol Sci 268:1669–1675
Fuller JL (2014) The vocal repertoire of adult male blue monkeys (Cercopithecus mitis stulmanni): a quantitative analysis of acoustic structure. Am J Primatol 76:203–216
Gill SA, Bierema AMK, Hauber M (2013) On the meaning of alarm calls: a review of functional reference in avian alarm calling. Ethology 119:449–461
Gill LF, Goymann W, Ter Maat A, Gahr M (2015) Patterns of call communication between group-housed zebra ficnhes change during the breeding cycle. eLife 4:e07770
Goller F, Larsen ON (1997) A new mechanism of sound generation in songbirds. Proc Nat Acad Sci USA 94:14787–14791
Goller F, Mallinckrodt MJ, Torti SD (2004) Beak gape dynamics during song in the zebra finch. J Neurobiol 59:289–303
Hahnloser RH, Kozhevnikov AA, Fee MS (2002) An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419:65–70
Hall ML (2004) A review of hypotheses for the functions of avian duetting. Behav Ecol Sociobiol 55:415–430
Hall ML (2009) A review of vocal duetting in birds. In: Naguib M, Zuberbuhler K, Clayton NS, Janik VM (eds) Advances in the study of behavior, vol 40. pp 67–121
Hauser MD, Chomsky N, Fitch WT (2002) The faculty of language: what is it, who has it, and how did it evolve? Science 298:1569–1579
Hsu A, Woolley SM, Fremouw TE, Theunissen FE (2004) Modulation power and phase spectrum of natural sounds enhance neural encoding performed by single auditory neurons. J Neurosci 24:9201–9211
Kim G, Doupe A (2011) Organized representation of spectrotemporal features in songbird auditory forebrain. J Neurosci 31:16977–16990
Kriengwatana B, Escudero P, Kerkhoven AH, ten Cate C (2015) A general auditory bias for handling speaker variability in speech? Evidence in humans and songbirds. Front Psychol 6:1243
Kruuk H (1972) The spotted hyena. A study of predation and social behavior. Univ Chicago Press, Chicago
Ladefoged P (2012) Vowels and consonants. Wiley-Blackwell, New York
Levrero F, Durand L, Vignal C, Blanc A, Mathevon N (2009) Begging calls support offspring individual identity and recognition by zebra finch parents. C R Biologies 332:579–589
Lieberma PH, Klatt DH, Wilson WH (1969) Vocal tract limitations on vowel repertoires of rhesus monkey and other non-human primates. Science 164:1185–1187
Lohr B, Dooling RJ (1998) Detection of changes in timbre and harmonicity in complex sounds by zebra finches (Taeniopygia guttata) and budgerigars (Melopsittacus undulatus). J Comp Psychol 112:36–47
Marler P (2004) Bird calls: their potential for behavioral neurobiology. Ann N Y Acad Sci 1016:31–44
McCowan LSC, Mariette MM, Griffith SC (2015) The size and composition of social groups in the wild zebra finch. Emu 115:191–198
Mielke A, Zuberbühler K (2013) A method for automated individual, species and call type recognition in free-ranging animals. Anim Behav 86:475–482
Morton ES (1975) Ecological sources of selection on avian sounds. Am Nat 109:17–34
Morton ES (1977) Occurrence and significance of motivation structural rules in some bird and mammal sounds. Am Nat 111:855–869
Mouterde SC, Elie JE, Theunissen FE, Mathevon N (2014a) Learning to cope with degraded sounds: female zebra finches can improve their expertise in discriminating between male voices at long distances. J Exp Biol 217:3169–3177
Mouterde SC, Theunissen FE, Elie JE, Vignal C, Mathevon N (2014b) Acoustic communication and sound degradation: how do the individual signatures of male and female zebra finch calls transmit over distance? PLoS ONE 9:e102842
Mulard H, Vignal C, Pelletier L, Blanc A, Mathevon N (2010) From preferential response to parental calls to sex-specific response to conspecific calls in juvenile zebra finches. Anim Behav 80:189–195
Mundry R, Sommer C (2007) Discriminant function analysis with nonindependent data: consequences and an alternative. Anim Behav 74:965–976
Murphy KP (2012) Machine learning: a probalistic perspective. MIT Press, Cambridge
Nagel K, Doupe A (2008) Organizing principles of spectro-temporal encoding in the avian primary auditory area field l. Neuron 58:938–955
Nakagawa S, Hauber ME (2011) Great challenges with few subjects: statistical strategies for neuroscientists. Neurosci Behav Rev 35:462–473
Ohms VR, Snelderwaard PC, ten Cate C and Beckers GJL (2010) Vocal tract articulation in zebra finches. PloS one 5(7):e11923
Olson CR, Wirthlin M, Lovell PV, Mello CV (2014) Proper care, husbandry, and breeding guidelines for the zebra finch. Cold Spring Harb Protoc, Taeniopygia guttata. doi:10.1101/pdb.prot084780
O’Shaughnessy D (1999) Speech communication: Human and machine. Wiley-IEEE Press, New York
Owren MJ, Rendall D (2001) Sound on the rebound: bringing form and function back to the forefront in understanding nonhuman primate vocal signaling. Evol Anthropol 10:58–71
Perez EC, Fernandez MSA, Griffith SC, Vignal C, Soula HA (2015) Impact of visual contact on vocal interaction dynamics of pair-bonded birds. Anim Behav 107:125–137
Picone JW (1993) Signal modeling techniques in speech recognition. Proc IEEE 81:1215–1247
Reby D, McComb K, Cargnelutti B, Darwin C, Fitch WT, Clutton-Brock T (2005) Red deer stags use formants as assessment cues during intrasexual agonistic interactions. Proc R Soc B Biol Sci 272:941–947
Rendall D, Owren MJ, Ryan MJ (2009) What do animal signals mean? Anim Behav 78:233–240
Riede T, Goller F (2010) Peripheral mechanisms for vocal production in birds—differences and similarities to human speech and singing. Brain Lang 115:69–80
Riede T, Zuberbuhler K (2003) The relationship between acoustic structure and semantic information in Diana monkey alarm vocalization. J Acoust Soc Am 114:1132–1142
Riede T, Suthers RA, Fletcher NH, Blevins WE (2006) Songbirds tune their vocal tract to the fundamental frequency of their song. Proc Nat Acad Sci USA 103:5543–5548
Riede T, Schilling N, Goller F (2013) The acoustic effect of vocal tract adjustments in zebra finches. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 199:57–69
Robisson P, Aubin T, Bremond JC (1993) Individuality in the voice of the emperor penguin aptenodytes forsteri: adaptation to a noisy environment. Ethology 94:279–290
Salmi R, Hammerschmidt K, Doran-Sheehy DM (2013) Western gorilla vocal repertoire and contextual use of vocalizations. Ethology 119:831–847
Scharff C, Nottebohm F, Cynx J (1998) Conspecific and heterospecific song discrimination in male zebra finches with lesions in the anterior forebrain pathway. J Neurobiol 36:81–90
Searcy WA, Beecher MD (2009) Song as an aggressive signal in songbirds. Anim Behav 78:1281–1292
Seyfarth RM, Cheney DL (2003) Signalers and receivers in animal communication. Ann Rev Psych 54:145–173
Seyfarth RM, Cheney DL, Marler P (1980) Monkey responses to 3 different alarm calls—evidence of predator classification and semantic communication. Science 210:801–803
Seyfarth RM, Cheney DL, Bergman T, Fischer J, Zuberbühler K, Hammerschmidt K (2010) The central importance of information in studies of animal communication. Anim Behav 80:3–8
Singh NC, Theunissen FE (2003) Modulation spectra of natural sounds and ethological theories of auditory processing. J Acoust Soc Am 114:3394–3411
Sitt JD, Amador A, Goller F, Mindlin GB (2008) Dynamical origin of spectrally rich vocalizations in birdsong. Phys Rev E 78:011905
Smith WJ (1994) Animal duets—forcing a mate to be attentive. J Theo Biol 166:221–223
Stowell D, Plumbley MD (2014) Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning. CoRR abs/1405.6524
Sturdy CB, Phillmore LS, Price JL, Weisman RG (1999) Song-note discriminations in zebra finches (Taeniopygia guttata): categories and pseudocategories. J Comp Psychol 113:204–212
Taylor AM, Reby D (2010) The contribution of source-filter theory to mammal vocal communication research. J Zool 280:221–236
Tchernichovski O, Nottebohm F, Ho CE, Pesaran B, Mitra PP (2000) A procedure for an automated measurement of song similarity. Anim Behav 59:1167–1176
Tchernichovski O, Mitra PP, Lints T, Nottebohm F (2001) Dynamics of the vocal imitation process: how a zebra finch learns its song. Science 291:2564–2569
Ter Maat A, Trost L, Sagunsky H, Seltmann S, Gahr M (2014) Zebra finch mates use their forebrain song system in unlearned call communication. PLoS ONE 9:e109334
Theunissen FE, Elie JE (2014) Neural processing of natural sounds. Nat Rev Neurosci 15:355–366
Thorpe WH (1972) Duetting and antiphonal song in birds. Its extent and significance. Behaviour 18:1–197
Vicario DS, Naqvi NH, Raksin JN (2001) Behavioral discrimination of sexually dimorphic calls by male zebra finches requires an intact vocal motor pathway. J Neurobiol 47:109–120
Vignal C, Mathevon N, Mottin S (2004) Audience drives male songbird response to partner’s voice. Nature 430:448–451
Vignal C, Mathevon N, Mottin S (2008) Mate recognition by female zebra finch: analysis of individuality in male call and first investigations on female decoding process. Behav Process 77:191–198
Wickler W, Seibt U (1980) Vocal duetting and the pair bond. 2. Unisono duetting in the African forest weaver, symplectes-bicolor. J Comp Etholog 52:217–226
Wild JM, Kruetzfeldt NEO (2012) Trigeminal and telencephalic projections to jaw and other upper vocal tract premotor neurons in songbirds: sensorimotor circuitry for beak movements during singing. J Comp Neurol 520:590–605
Williams H (2004) Birdsong and singing behavior. Ann N Y Acad Sci 1016:1–30
Williams H, Cynx J, Nottebohm F (1989) Timbre control in zebra finch (Taeniopygia guttata) song syllables. J Comp Psychol 103:366–380
Wilson EO (2000) Sociobiology, the new synthesis, twenty-fith, anniversary edn. Harvard University Press, Cambridge
Woolley SM, Fremouw TE, Hsu A, Theunissen FE (2005) Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds. Nat Neurosci 8:1371–1379
Woolley SM, Gill PR, Fremouw T, Theunissen FE (2009) Functional groups in the avian auditory system. J Neurosci 29:2780–2793
Young S, Evermann G, Gales M, Hain T, Kershaw D, Liu X, Moore G, Odell J, Ollason D, Povey D, Valtchev V, Woodland P (2006) The htk book (for htk version 3.4.1). Engineering Department, Cambridge University
Zann RA (1996a) The zebra finch. Oxford University Press, Oxford
Zann RA (1996b) The zebra finch: a synthesis of field and laboratory studies. Oxford University Press, Oxford
Zuberbühler K, Lemasson A (2014) Primate communication: meaning from strings of calls. In: Lowenthal F, Lefebvre L (eds) Language and recursion. Springer, New York, pp 115–125
Acknowledgments
We would like to dedicate this study to Peter Marler and Richard Zann and Allison Doupe. By his fundamental discoveries and his thoughtful contributions to the field of animal communication, Peter Marler has been a major source of guidance and inspiration for our own research efforts in the field of bird communication. In his seminal Science paper in 1967, Peter Marler said: “We are beginning to understand how the structure of animal signals relates to the function they serve.” We would hope that Peter would agree that we are humbly following his footsteps. Peter Marler is the scientific great-grandparent of FET and great–great-grandparent of JEE. Richard Zann dedicated his life to the study of wild zebra finches in his native Australia. Allison Doupe developed the zebra finch model in crucial seminal studies that examined the neural mechanism of vocal plasticity. She was the scientific parent of FET and grandparent of JEE. She was an outstanding mentor and a wonderful person. We would not be able to appreciate the complexity and the relevance of our studies without their respective contributions to the field. Richard Zann died in a bushfire inferno that occurred in outskirts of Melbourne in February 2009. Peter Marler died in July 2014 following a long illness. Allison Doupe died in September 2014 after a long battle with cancer. This work was supported by an NIH grant CD010132 to FET, a CRCNS NSF grant IIS1311446 to FET and JEE and a fellowship from the Fyssen Foundation to JEE.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
All applicable international, national and/or institutional guidelines for the care and use of animals were followed. All animal procedures were approved by the Animal Care and Use Committee of the University of California Berkeley and were in accordance with the NIH guidelines regarding the care and use of animals for experimental procedures.
Electronic supplementary material
Below is the link to the electronic supplementary material.
10071_2015_933_MOESM1_ESM.pdf
Supplementary Figure 1. Flow-chart showing the calculation of the modulation power spectrum (MPS) and the use of this acoustical representation in the classification procedure. In this feature space each sound is characterized by its MPS. The MPS is the amplitude square of the 2D Fourier Transform (2D FT) of the log spectrogram. The spectrogram was estimated with the same time-frequency scale as in Figure 2. The modulation power spectrum was sampled every 2.85 Hz between -40Hz and 40 Hz for temporal modulations (x-axis) and every 0.0826 cyc/kHz between 0 and 4 cyc/kHz for spectral modulations (y-axis) for a total of 1,500 parameters. As for the spectrographic representation, principal component analysis (PCA) was used to reduce the number of parameters to 40 before classification. The 40 parameters captured 34% of the variance in the modulation power spectrum across all vocalizations in our data set. The classifiers were trained to estimate the vocalization category (PDF 535 kb)
10071_2015_933_MOESM2_ESM.pdf
Supplementary Figure 2. Flow-chart showing the calculation of the Mel frequency cepstral coefficients (MFCC) and the use of this acoustical representation in the classification procedure. In this feature space each sound is characterized by a time sequence of cepstral coefficients. The cepstrum coefficients were obtained from the discrete cosine transform (DCT) of the log of the amplitude in one time slice of the spectrogram. For MFCC, the Mel spectrogram was obtained using 25 filterbank channels approximately logarithmically spaced (Mel spaced frequency bands) between 500 and 8000 Hz. The time windows were 25 ms long and spaced every 10 ms (15 ms overlap). Twelve cepstral indexes (ci) were extracted from each spectral envelope resulting in a 12 ci for 33 time points resulting in a total of 396 parameters. Similar to the spectrographic representation, principal component analysis (PCA) was used to reduce the number of parameters to 40 before classification. The 40 parameters captured 96% of the variance in the MFCC modulation power spectrum across all vocalizations in our dataset. The classifiers were trained to estimate the vocalization category (PDF 503 kb)
10071_2015_933_MOESM3_ESM.eps
Supplementary Figure 3. Spectrograms of three example calls exhibiting a double voice component. Double voices or two pitches were regularly found in zebra finch vocalizations. Here, are shown examples of a Whine, Nest and Begging call where the double voice can clearly be observed on the spectrogram. The arrows show the fundamental or harmonic corresponding to the two voices (EPS 632 kb)
10071_2015_933_MOESM4_ESM.pdf
Supplementary Figure 4. Unsupervised Clustering of Tet calls: Sexual dimorphism and two types of calls. A “mixture-of-Gaussians” model was used to perform unsupervised clustering of groups of calls as described in the legend of Figure 9 and methods. A. Unsupervised clustering of all Tet calls produced by male and female birds resulted in a distribution well fitted by two Gaussians of approximately equal weight (w1=0.38, w2 = 0.62). Assignment to one of the two clusters resulted in significantly different proportions of male and female calls in each group (z = 6.72, P < 10-4) as illustrated on the bar plot on the right column. B and C. Unsupervised clustering of female (B) and male (C) Tet calls only. These distributions were also well fitted by two Gaussians of approximately equal weight (Female: w1=0.53, w2 = 0.47; Male: w1=0.31, w2 = 0.69). The color code on the scatter plots indicates vocalizers’ identity and show that individuals produce calls in each group although some produce mostly one “type”. Note that the mixture-of-Gaussians algorithm is blind to the vocalizer’s identity. We estimated mean values of each acoustical parameter for Tet calls assigned to each group and show the results for the CV of the fundamental (CV F0), the spectral mean (mean S), the duration (std T) and the intensity (RMS) with bar plots on the right panels. Error bars correspond to one sem. The most distinguishing acoustical feature is the CV of the fundamental (Female: t(325)=-14.71 P < 10-4; Male: t(284)=-11.93 P < 10-4) that for both sexes is much lower in one of the groups (group 1 for both). This group of calls with very low modulation of their fundamental has been described as Stacks (Ter Maat et al., 2014). Note that we re-estimated principal components for male and female calls only and therefore the PC axes correspond to different combination of acoustical features in all three rows (PDF 596 kb)
10071_2015_933_MOESM5_ESM.eps
Supplementary Figure 5. Unsupervised Clustering of Nest calls: a unimodal distribution. A “mixture-of-Gaussians” model was used to perform unsupervised clustering of Nest calls as described in the legend of Figure 9 and methods. Although the BIC values suggest that this distribution is better fitted with two Gaussians than one Gaussian, the weights of these two Gaussians is greatly biased towards one group (w1=0.16, w2 = 0.84) demonstrating that the distribution is clearly unimodal, albeit not perfectly Gaussian (EPS 1767 kb)
10071_2015_933_MOESM6_ESM.eps
Supplementary Figure 6. Power Spectrum and Temporal Envelope for Tet and Distance calls. Tet (light purple) and Distance calls (dark purple) are the two calls that show sexual differences (solid male, dotted female). The left panel (A) shows the non-normalized frequency spectra and the right panel (B) the normalized temporal amplitude envelope (right) for these two calls and for male and female birds. In the power spectrum, one can also appreciate the shifts in the formant frequencies between Tet and Distance calls (EPS 635 kb)
Rights and permissions
About this article
Cite this article
Elie, J.E., Theunissen, F.E. The vocal repertoire of the domesticated zebra finch: a data-driven approach to decipher the information-bearing acoustic features of communication signals. Anim Cogn 19, 285–315 (2016). https://doi.org/10.1007/s10071-015-0933-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10071-015-0933-6