Meaning attribution in the West African green monkey: influence of call type and context

Price, Tabitha; Fischer, Julia

doi:10.1007/s10071-013-0660-9

Meaning attribution in the West African green monkey: influence of call type and context

Original Paper
Open access
Published: 12 July 2013

Volume 17, pages 277–286, (2014)
Cite this article

Download PDF

You have full access to this open access article

Animal Cognition Aims and scope Submit manuscript

Meaning attribution in the West African green monkey: influence of call type and context

Download PDF

Tabitha Price^1,2 &
Julia Fischer^1,2

3182 Accesses
24 Citations
7 Altmetric
1 Mention
Explore all metrics

Abstract

The search for the evolutionary roots of human language has fuelled much research into the cognitive mechanisms underlying communication in nonhuman animals. One core issue has been whether the context-specific calls of nonhuman animals are meaningful, with call meaning inferred from recipients’ responses in the absence of supporting contextual cues. This direct inference may well offer an oversimplified view of how vocalisations are perceived, however, as responses under natural conditions are likely guided by contextual cues as well as by the signal. In this study, we investigate how the anti-predator responses of green monkeys, Chlorocebus sabaeus, are affected by alarm call structure and by context. We first simulated the presence of leopards and snakes to elicit alarm vocalisations and to identify predator-typical response behaviours. In both contexts, the monkeys produced chirp calls that revealed only graded variation in relation to predator type. We then carried out playback experiments to explore whether green monkeys would respond with predator-typical behaviour to leopard and snake chirps, and whether contextual cues, in the form of pre-exposure to a leopard or snake model, would modify these responses. Irrespective of context, subjects were more likely to respond to leopard chirps with a leopard-typical response. Predator priming did not have a significant effect on the type of response, but, together with call type, did affect response duration. This suggests that the immediate attribution of meaning was influenced by acoustic cues, whilst receiver’s prior knowledge was incorporated to guide subsequent behaviour.

Context-dependent alarm responses in wild vervet monkeys

Article Open access 17 March 2023

Vervets revisited: A quantitative analysis of alarm call structure and context specificity

Article Open access 19 August 2015

An intentional vocalization draws others’ attention: A playback experiment with wild chimpanzees

Article 24 December 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

What do the vocalisations of animals mean? This question is central to the debate regarding the similarities and differences between nonhuman animal (hereafter animal) communication and human language, and consequently, language evolution. The finding that vervet monkeys (Chlorocebus pygerythrus) produce predator-specific alarm calls that elicit appropriate response behaviours even in the absence of contextual cues led initially to claims that these calls possessed semantic properties (Seyfarth et al. 1980a). The general consensus that, within animal communication, signallers and receivers do not share a representational state and are not motivated to communicate as a result of attributing mental states to one another (Cheney and Seyfarth 1992a; Rendall et al. 2009) implies, however, that animal vocalisations are not meaningful in the linguistic sense of the word (Cheney and Seyfarth 1992b; Rendall et al. 2009; Scarantino 2010).

Over the last 20 years, signals that are elicited only by stimuli belonging to a common category (i.e. are context specific) and that cause signal receivers to respond with stimulus-appropriate behaviours even in the absence of contextual cues have been termed “functionally referential” (Marler et al. 1992; Macedonia and Evans 1993). This terminology was meant to emphasise that such signals are “not exactly like human words, but rather appear to function in the same way” (Hauser 1997 p. 509). Numerous studies indicate that receiver responses cannot be explained only in terms of unconditioned reactions to the acoustic properties of a call (reviewed in Seyfarth et al. 2010) or by perceptual similarities between the call and the stimulus (Zuberbühler et al. 1999). Instead, across a broad array of taxa, signal receivers respond to calls as if they had learnt to associate them with a specific predator class (Manser et al. 2001; Gill and Sealy 2004; Kirchhof and Hammerschmidt 2006), degree of risk (Furrer and Manser 2009), food (Evans and Evans 2007), social situation (Faragó et al. 2010) and/or individual (Cheney and Seyfarth 1982; Vignal et al. 2008). It is worth noting, however, that this is not a universal property of calls. The alarm calls of American red squirrels, for example, demonstrate low predator specificity (Digweed and Rendall 2009), and the recruitment calls of the banded mongoose convey information about the risk posed by a stimulus rather than stimulus type (Furrer and Manser 2009). In addition, whilst the vocalisations of many species are structurally discrete, this is not a pre-requisite for functional reference; context-specific calls that differ along a graded continuum may also elicit appropriate responses from signal receivers in the absence of supporting contextual cues (Fischer 1998), although this ability may require a degree of learning (Fischer et al. 2000).

The above description of receivers associating calls with referents is in line with insights into learning theory and more specifically Pavlovian conditioning (reviewed in Rescorla 1988), whereby functionally referential alarm calls can be classified as a conditioned stimulus (Seyfarth and Cheney 2003) with an indexical relationship between the call and referent (reviewed in Wheeler and Fischer 2012). But whilst laboratory experiments within the framework of learning theory have shown effects of context specificity on the initial formation, extinction and renewal of conditioned responses in humans and other animals (Bouton et al. 2006; Huff et al. 2011), and identified neurological mechanisms underlying these effects (Hobin et al. 2003), the current definition of functional reference requires the attribution of meaning in the absence of relevant contextual cues. An alternative proposal in keeping with the influence of context on meaning attribution is that context specificity is not a requirement for calls to function referentially, only that the less referentially specific a call is, the more important contextual cues will be for an accurate attribution of meaning (Scarantino in press; Wheeler and Fischer 2012). In this study, we therefore use meaning to refer to what the signal receiver infers from a signal, for example the presence of an external stimulus or the subsequent behaviour of the signaller.

Studies of animal communication have shown that the response behaviours of signal receivers are, in some cases, modified by contextual cues, including the signal receiver’s prior experience (Zuberbühler 2000; Engh et al. 2006; Akçay et al. 2009; Arnold and Zuberbühler 2013), and contextual cues at the time of hearing a call (Wheeler and Hammerschmidt in press; Rendall et al. 1999), which may include the presence or absence of additional signals (e.g. multimodal signals; reviewed in Partan and Marler 1999). But despite this, and the fact that the role of context on call perception presents a possible parallel with pragmatics in human language (Scott-Phillips 2009; Wheeler et al. 2011), we know little about how context specificity and structure (discrete versus graded) of a call affect the degree to which contextual cues are incorporated.

More than 40 years have gone by since Struhsaker (1967) described the vervet monkey’s predator-specific alarm calls, and they remain the classic example of functional reference within the animal kingdom. However, a relatively high number of individuals did not respond appropriately to alarm calls when they were broadcast in the absence of supporting contextual cues (Seyfarth et al. 1980b), and chirps are described as being produced in response to both avian and major terrestrial predators (Struhsaker 1967). Taken together, it seems likely that both context and call structure contribute to the attribution of call meaning by conspecifics.

Like adult female vervets, adult female green monkeys (C. sabaeus) produce chirp calls in response to more than one predator class. The green monkey is a close relative of the vervet, and they were previously classified as conspecifics (Napier 1981).We here follow the taxonomy of Groves (2001), however, which places the green monkey as a closely related congener to the vervet. In the case of green monkeys, females produce chirp calls to both snake and leopard models (hereafter referred to as “snake chirps” and “leopard chirps”), and these calls sound acoustically similar to one another. In this study, we first investigated predator-specific behaviours in the green monkeys and analysed the acoustic structure of snake and leopard alarm chirps. We then performed experiments in which subjects were exposed to a predator model (leopard or snake) before playing back a leopard or snake chirp. If chirp calls given to leopards and snakes are strongly referential, they should elicit predator-typical avoidance behaviours irrespective of supporting or conflicting contextual cues. If, however, context also plays a role in how conspecifics’ attribute meaning to these calls, then priming with a corresponding predator model (e.g. priming with a leopard model prior to playing a leopard chirp) should increase the occurrence of predator-typical responses relative to responses elicited by the calls alone, whilst priming with a conflicting predator model (i.e. priming with a snake model prior to playing a leopard chirp) should have the opposite effect.

Study site and subjects

The study was conducted over two field seasons (January-June 2010 and December 2010–June 2011) within Niokolo Koba National Park in southeast Senegal (13°01′34″N, 13°17′41″W), an area encompassing 913,000 ha of predominantly Sudano-Guinean savannah interspersed with woodland and gallery forest (Frederiksen and Lawesson 1992). Green monkeys are found throughout the park, living in species-typical multi-male multi-female groups (Dunbar 1974). Data were collected in the vicinity of the Simenti Centre de Recherche de Primatologie (CRP Simenti) from four groups of free-ranging green monkeys (“Simenti” 16–21 individuals; “Mare” 12–18 individuals; “Lions” 19–26 individuals; “Niokolo” 27–32 individuals; ranges reflect changes in group size over the duration of the study period). Study subjects were habituated adult males and females that were recognised individually from natural markings on the face and body. Pythons, venomous snakes and leopards were all observed in the vicinity of the field site over the course of the study.

Behavioural response to terrestrial predators

Experimental protocol

Vervet monkeys tend to respond to snakes by looking down and standing bipedally, and to leopards by climbing up into trees (Cheney and Seyfarth 1992b). To test whether green monkeys respond to these terrestrial predators with these same predator-typical behaviours, we simulated the presence of snakes and leopards and video-taped their behavioural response. For details of predator simulations and modes of presentation, see Online Resource 1. Subjects were provisioned with peanuts prior to model presentation to position individuals on the ground and to ensure that subject behaviour (stationary feeding) was consistent in the time period preceding all playbacks. Experiments were discarded if the subject moved out of sight within the first 10 s of the experiment (5 cases), if the subject responded to a different stimulus prior to model presentation (3 cases) or if there were technical problems with the equipment (1 case), resulting in a total of 17 leopard model (adult female n = 8, adult male n = 9) and 19 snake model (adult female n = 9, adult male n = 10) experiments for analysis.

Behavioural analysis

Behavioural responses of subjects were filmed using a Sony Handycam (DCR-HC90E), and videos were imported into Adobe Premiere Pro CS4 with a time resolution of 25 frames/second. Frame-by-frame analysis set at five-frame jumps was used to score the subject’s behaviour as one of four mutually exclusive categories (rest, bipedal, terrestrial displacement or arboreal displacement) at 0.2 s intervals for a period of 10 s, starting with the subject’s first response to the predator model. We had initially planned to include looking direction as a behavioural measure, but poor visibility made it impossible to score this reliably from the videos. Maximum height of the subject within 30 s of viewing the model was recorded as 0 m, >0 m but <2 m or >2 m. Because video encoding is susceptible to observer bias, all videos were reanalysed by a second condition-naive observer. Intra-class correlation coefficient (ICC) was 0.986, indicating a high level of inter-observer reliability.

Statistical analysis

We used a generalised linear mixed model (GLMM) with binomial error structure and logit link function to test whether snake models were more likely than leopard models to elicit bipedal behaviour, with bipedal behaviour scored as absent or present. A second GLMM with Poisson error structure and a log link function was run to test whether leopard models would cause subjects to climb into a tree more often than snake models, with response behaviour scored as one of the three height categories described above. Both GLMMs were run with the type of predator model (snake or leopard) as the fixed effect and subject identity included as a random effect using the function lmer of the lme4 Package (Bates et al. 2011). We used a likelihood ratio test (ANOVA using “Chisq” argument) to compare the full models with a null model (comprising only the intercept and the random effect) in order to calculate the overall effect of the predator model. All models were fitted in R (R Development Core Team 2011).

Results and discussion

There was no significant difference in the bipedal behaviour of subjects following the presentation of snake and leopard models (likelihood ratio test: χ ² = 0.47, df = 1, P = 0.491; Fig. 1a). Like vervet monkeys, green monkeys do sometimes respond to snakes by standing bipedally, but since they also responded to leopard models with bipedal behaviour, this did not constitute a predator-specific response. Whilst vervet monkeys were described as responding with bipedal behaviour to snakes, they responded to playbacks of alarm calls given to both snakes and leopards with bipedal behaviour (Seyfarth et al. 1980b). For vervets and green monkeys, bipedalism may therefore function not only as a mobbing behaviour but also as a form of unspecific vigilance. As we were not able to assess gaze direction, we cannot discount that bipedalism for the purpose of either scanning the ground for snakes, or scanning the horizon for cats, could constitute a predator-specific response. In consequence, from the results described in this section, it is not possible to identify a snake-specific behavioural response with which the referential specificity of snake chirps, with and without contextual cues, could be tested.

Green monkeys, like vervets, were more likely to climb into a tree in response to leopard than snake models (likelihood ratio test: χ ² = 22.49, df = 1, P < 0.001, Fig. 1b). In particular, whilst snake models occasionally prompted subjects to jump into trees at <2 m, leopard models always resulted in subjects climbing higher (>2 m) into a tree. This can be explained as an adaptive response, whereby green monkeys, like vervets, are likely safest from leopards high up in the trees (Cheney and Seyfarth 1992b). Thus, it would seem that climbing >2 m into a tree is a more leopard-specific response than simply climbing into a tree.

Chirp playback stimuli

Playback stimuli

Alarm chirps used as playback stimuli were elicited by the presentation of leopard and snake models. Calls were recorded from adult females and juveniles from all four study groups using a Marantz PMD661 solid-state recorder (44.1 kHz sampling rate; 16-bit sampling depth) connected to a Sennheiser ME66/K6 directional microphone. Vocal recordings were transferred to a PC, and Avisoft-SASLab Pro (R. Specht Berlin, Germany, version 5.1.20) was used to check recording quality, filter recordings to remove background noise below 0.1 kHz and to prepare the playback stimuli. Each playback sequence was constructed from chirps produced during a single calling bout, although not always in their natural order, as it was sometimes necessary to replace low quality chirps with higher quality exemplars produced later in the calling bout. A total of ten pairs of playback sequences were compiled, whereby each pair consisted of a sequence of chirps given to a leopard model and a sequence of chirps given to a snake model. The number of chirps, inter-call durations, and sequence duration were consistent between paired sequences, all call sequences were normalised to the same maximum volume and inter-call durations were additionally controlled to fall within the range of naturally emitted calls. When possible, the same individual produced both call sequences within a pair, and at all times, call sequences within a pair were produced by a caller from the same social group. With one exception, all leopard chirp and all snake chirp playback stimuli were taken from the calling bouts of different individuals, and in this exception, different calls from the same individual were used to construct two playback sequences. Calls of nonpredatory birds were recorded locally and modified to be of a similar length and volume to chirp sequences for use as control stimuli. To avoid pseudo-replication, a different playback sequence was used for each playback experiment. Spectrograms illustrating snake and leopard chirps are shown in Fig. 2.

Acoustic analysis

To assess the acoustic structure of chirp calls used as playback stimuli (N = 124), Avisoft-SASLab Pro was used to add silent margins and reduce the sampling frequency of single call units to 22.05 kHz. Call units were then transformed in their frequency–time domain using a fast Fourier transformation (FFT) size of 1,024 points, Hamming window and 93.75 % overlap. The resulting frequency–time spectra were analysed with LMA (K. Hammerschmidt, version 2012_9), a custom software sound analysis tool (Schrader and Hammerschmidt 1997). Using Avisoft, duration was extracted from the wav file, and Wiener entropy was calculated; LMA was used to calculate robust acoustic parameters describing energy distribution throughout the call unit. A description of parameters used for analyses are given in Table 1.

Table 1 Description of the acoustic parameters used to describe chirp call structure

Full size table

Statistical analysis

To avoid entering correlated parameters into the discriminant function analysis (LDA), a stepwise variable selection with leave-one-out cross-validation (stepclass function of the R-package “klaR”, Weihs et al. 2005) was used to first identify an optimum subset of variables for classification. Acoustic parameters were transformed when necessary to meet test assumptions (Online Resource 1) and then entered into the stepwise classification, with predator type set as the grouping variable. Following this, the selected variables were entered (post z-transformation) into a linear LDA using the lda function of the R-package “mass” (Venables and Ripley 2002), with predator type again set as the grouping variable. A leave-one-out procedure was used to calculate the percentage of calls correctly classified, and a subset of the data (N = 93) was entered into a nested permuted discriminant function analysis (pDFA, Mundry and Sommer 2007) to re-calculate classification scores whilst controlling for caller identity.

Results and discussion

Stepwise variable selection identified duration and peak frequency_1 as the most important variables for differentiating between chirps produced in response to different predator types. Based on differences in these two variables, LDA (with leave-one out validation) correctly identified leopard and snake chirps in 75 % of cases. A similar result was found using a pDFA on a subset of the calls in order to control for caller identity, with 72 % of calls correctly classified. On the basis of the LDA classification, chirp calls were correctly assigned to the predator type eliciting calling more often than would be expected by chance (Binomial test, chirps N = 124, P < 0.05), and each snake playback stimulus (with one exception) had a higher mean discriminant score than the leopard playback stimulus with which it was paired. The relatively high number of calls that were incorrectly classified, however, supports the acoustic impression that structural differences between leopard and snake chirps are graded rather than discrete in nature (Fig. 3), suggesting that, for many calls, receivers would be unable to determine whether the signal was indicative of the presence of either a leopard or a snake.

Duration contributed most to distinguishing between leopard and snake chirps, followed by peak frequency_1, with leopard chirps being longer than snake chirps and demonstrating a higher early peak frequency. Studies in a broad array of species suggest that as callers experience an increase in arousal, their vocalisations become longer and higher in frequency (reviewed in Briefer 2012). In line with these findings, the structural differences identified in this study between snake and leopard chirps could be attributed to callers being more aroused in the presence of a leopard than a snake.

This analysis does not allow for conclusions to be drawn about the probability of chirps being produced in the presence of a snake or leopard, or whether chirps are also produced in nonpredator contexts. Results do suggest, however, that the chirp is similar to the graded alarm calls of Barbary macaques (Fischer et al. 1995) and chacma baboons (Fischer et al. 2001a). Given that these two species differ in how they perceive the graded variation in their calls (Fischer 1998; Fischer et al. 2001b), the graded structure of chirps presents an opportunity to further our more general understanding of how signal receivers respond appropriately to acoustically similar alarm calls produced in situations requiring incompatible escape behaviours.