Introduction

Interspecific aggression is widespread in birds and other taxa (including damselflies, ants, reef fish, salamanders, lizards and mammals) and often takes the form of territoriality (Orians and Willson 1964; Grether et al. 2009; Ord and Stamps 2009; Peiman and Robinson 2010; Ord et al. 2011). Aggressive interactions between heterospecifics influence habitat partitioning, species coexistence, range limits and local extinctions, thus having major ecological outcomes (Robinson and Terborgh 1995; Jankowski et al. 2010; Peiman and Robinson 2010; Losin et al. 2016; Freeman et al. 2019). Furthermore, they may drive phenotypic evolution, in particular the evolution of mechanisms to distinguish between different classes of competitors and non-competitors, and phenotypic traits related to species recognition (Irwin and Price 1999; Grether et al. 2009, 2013, 2017; Tobias et al. 2014; Martin et al. 2015; Drury et al. 2020; Simpson et al. 2021). When it occurs, interspecific aggression is thought to be (i) a learned response to the presence of other species (the ‘learned interspecific aggression’ hypothesis), (ii) a consequence of misdirected intraspecific aggression (the ‘mistaken identity’ hypothesis), (iii) an evolved interference response to interspecific competition (the ‘innate interspecific recognition hypothesis’, or a combination of these mechanisms.

The ‘learned interspecific aggression’ mechanism involves a plastic response to the presence of heterospecific competitors for limited resources (Irwin and Price 1999; Peiman and Robinson 2010). Individuals may learn to recognize heterospecifics in areas where the other species coexists. As a result, the interspecific aggressive response may be strong in sympatry and weak or non-existent in allopatry (Emlen et al. 1975; Catchpole 1978; Reed 1982; Gil 1997; Sedláček et al. 2006). Significant differences in interspecific response between sympatric and allopatric areas were observed in cases where these areas were a few kilometers or even just a few hundred meters apart (Catchpole and Leisler 1986; Prescott 1987; Freeman 2016; Freeman et al. 2016). It was suggested that individuals may learn to recognize heterospecifics by making associations between different forms of signals (vocalizations, coloration, morphology and displays), and then use one of these signals to recognize individuals that vary in others (Gill and Murray 1972; Irwin and Price 1999; Price 2008; Grether 2011). The ability to make associations can be expected in individuals that have interacted with heterospecifics during resource defense, but not in those that have not.

According to the ‘mistaken identity’ hypothesis, interspecific territorial aggression arises when two species that (because of close phylogenetic relatedness or chance) share similar traits that stimulate intraspecific territorial aggression come into contact (Murray 1971, 1981). Due to the similarity of traits releasing intraspecific response (e.g. vocalizations, coloration or displays), heterospecifics may be misidentified as conspecifics (Petrusková et al. 2008; Wolfenden et al. 2015; McEntee 2014; Jones et al. 2016). If these two species do not compete with each other for limited resources, costs of misdirected interspecific aggression may trigger the selection for divergence in these traits, resulting in the disappearance of interspecific response in sympatry (Grether et al. 2009). Otherwise, in sympatry benefits of mutual recognition may select for convergence in competitor recognition traits in these species (e.g. Reif et al. 2015). Interspecific discrimination based on the association of species-specific traits is not to be expected in this case.

The ‘innate species recognition’ may result from genetic assimilation of species recognition templates – an evolutionary response to the cost of learning (Irwin and Price 1999; Price 2008; Grether et al. 2009). This cost may be the time required to learn about heterospecifics in the first ever breeding episode. Innate recognition can accelerate learning, thus reducing the time cost. It is suggested that the ability of birds to innately associate species-specific traits cannot be ruled out either (Grether 2011). If this is the case, then interspecific discrimination based on associations of species-specific traits should occur regardless of prior experience with heterospecifics.

Here, with a two-choice song playback experiment utilizing stuffed male models, I investigated the role of associative learning of signals from two sensory modalities (auditory and visual systems) in shaping the interspecific response of male Eurasian Blackcaps (Sylvia atricapilla) to simulated male Garden Warbler (S. borin) intruders to their territories. Eurasian Blackcap males respond aggressively to playbacks of songs of many sympatric species of Sylvia warblers, but the reaction to the singing of the Garden Warbler is the strongest (Darolová et al. 2020). Blackcaps and Garden Warblers both are widely sympatric in most of Europe, resemble each other in song and morphology and have similar ecological requirements, but Garden Warblers overwinter in Africa further south than Blackcaps (Cramp and Brooks 1992; for bird pictures, song recordings and sonograms see publicly available collections, e.g. Macaulay Library at the Cornell Laboratory of Ornithology or xeno-canto.org). They are closely related sister species that diverged about 6.3–6.8 million years ago (Blondel et al. 1996). Blackcaps arrive at the breeding ground earlier in the spring and are able to drive later arriving Garden Warblers out of their breeding territories (Raines 1945; Garcia 1983).

Previous studies (Matyjasiak 2005) have suggested that the interspecific aggressive response of male Blackcaps to Garden Warbler intruders is based on the ability of these males to make associations between species-specific songs and plumage. Furthermore, male Blackcaps were able to remember and recall these associations after several months without contact with Garden Warblers, whose wintering areas are further south. The question remains open whether male Blackcaps acquire the ability to identify male Garden Warblers by social learning of species-specific traits during interactions with congeners in common habitat (during breeding territory defense), which is consistent with the ‘learned interspecific aggression’ hypothesis (Irwin and Price 1999; Peiman and Robinson 2010). To determine if such associations are in response to the presence of heterospecifics, the first step is to consider the presence of these associations in focal species in the absence of—or before contact with—the heterospecifics (Peiman and Robinson 2010). In the case of migratory species, such as the Blackcap and the Garden Warbler, this can be achieved by comparing the interspecific response of second-year old (hereafter SY) males in their first breeding season and without previous experience of defending the breeding territory and older (experienced, after-second-year, ASY) males in the period prior to the spring arrival of the other species.

I performed a playback experiment with male Blackcaps at a site where both warbler species are abundant and co-occur during breeding season, and at the time just before Garden Warblers returned from spring migration. I compared the agonistic response of male Blackcaps of known age to playbacks of songs of one or the other species and stuffed male models of both species presented simultaneously in their territories. If male Blackcaps learn to associate species-specific traits during their first breeding season (the ‘learned interspecific aggression’ hypothesis), I predicted that while ASY males would be able to associate species-specific song with species-specific plumage types, SY males would be unable to do so and would direct their aggressive response to the conspecifics male model, regardless of the song being played back. If male Blackcaps make this association innately (the ‘innate interspecific recognition hypothesis’), associations of species-specific traits should be observed in both SY and older individuals. If male Blackcaps do not discriminate between agonistic signals of Garden Warblers and their own species (the ‘mistaken identity’ hypothesis), associations should not be observed at all (although this scenario is unlikely, as shown earlier by Matyjasiak 2005).

Methods

Field site and study species

Fieldwork was conducted along the floodplain of the Vistula river valley, between Łomianki and Czosnów near Warsaw, central Poland (approximately 52.383° N, 20.809° E). The study area extends for about 15 km along the flood terrace on the left Vistula riverbank, covering a total of 3.0 km2. The main landscape is a large lowland river valley (elevation about 80 m above sea level), which in the riverine zone is covered by a mosaic of fragmented poplar and willow riparian woods of different ages, willow or ash-leaved maple thickets, and grasslands, with a few larger patches of mature poplar and willow riparian forests. The areas lying on the higher floodplain terrace include agricultural land and human settlements separated from the river by flood embankments. Blackcaps and Garden Warblers are both common at the study area. They occupy riparian woods or shrubs with groups of trees, and their territories are often adjacent. The densities of these species are the lowest in a mosaic of fragmented poplar and willow woods, thickets and meadows (Blackcaps: 26 pairs per km2, Garden Warblers: nine pairs per km2) and reach the highest values in larger patches of mature poplar and willow forests with good undergrowth (Blackcaps: 226 pairs per km2, Garden Warblers: 18 pairs per km2; Keller et al. 2017).

Blackcaps arrive at the study area between the beginning of April and mid May, and Garden Warblers arrive on average after 5th May. Playback experiments were conducted between 15th April and 5th May 2009, during the pre-nesting and early nesting stage in the breeding cycle of Blackcaps, before the arrival of Garden Warblers from wintering grounds. This enabled me to study SY male Blackcaps that had never defended their breeding territories against male Garden Warblers.

Playback experiment

I performed playback trials on 45 territorial male Blackcaps. Within the study site singing male Blackcaps were identified by daily walking through suitable habitats. The location and extent of their territories were then determined by 30 min of observation. To be classified as male’s territory, the male had to remain within the area for the full 30 min of the observation period. The next day, if the territory was still occupied and the weather was favorable (without continuous rainfall or high winds), I performed the playback trial in it. In case the weather was inclement, the trial was performed a day or two later. Previously tested males were identified by the presence of leg rings (see below), and in addition, males occupying non-neighboring territories were chosen for the playback trials, which allowed me to avoid double testing the same individuals. Males were alternately assigned to trials with the song of Blackcaps (conspecific playback trials) and the song of Garden Warblers (heterospecific playback trials), with each male receiving only one song treatment (heterospecific or conspecific).

The playback experimental set-up was similar to that used by Matyjasiak (2005) and consisted of two stuffed male models (taxidermic dummies, mounted on 1.5 m tall sticks placed 4 m apart) and a loudspeaker (Anchor Mini-Vox model PB-25, 15 Watt, mounted on a 0.5 m plastic stake, pointing upwards and concealed under a camouflage net) placed between the male models at an equal distance from them (Fig. 1). These two models, one of a male Eurasian Blackcap and one of a Garden Warbler male, both in full breeding plumages and undamaged, represented a paired stimulus. Male models and the loudspeaker were placed well within the subject male’s territory boundaries. A site was chosen where two low shrubs (willows, maples or elders) or short trees (willows or poplars) grew about 2–4 m apart with an empty space between them. Male models were placed by these shrubs or trees so that the subject had the possibility to perch near models, and a 1 m radius around each model was flagged with strips of yellow ribbon. The position of models with respect to side about the loudspeaker (as viewed from the observation point) was randomized. The area up to 6 m from the loudspeaker, flagged with white ribbon, is hereafter referred to as the test arena.

Fig. 1
figure 1

Playback experimental set-up. Schematic representation of the 6 m test arena, placement of the loudspeaker and male models, and correct and wrong model circles. Solid lines represent the test arena and 1 m radii around male models. Dashed lines show distances. Song playback ended immediately when the subject male appeared at the test arena

Then I played back either Garden Warbler or Blackcap territorial songs with a Sony D-SJ301 CD Walkman connected to the loudspeaker with a 20 m long 3.5 mm mini-jack audio cable, which allowed me to hide behind vegetation about 15 m from the loudspeaker. The volume when playing back songs was set by ear to achieve a naturalistic volume. Such a playback test was designed to mimic the territorial intrusion and advertisement of a male warbler. I terminated the playback immediately when the territory owner appeared at the test arena (within 6 m of the loudspeaker) looking for an intruder. The entire trial lasted 10 min from the onset of the song playback.

I measured the following variables: (1) latency to approach the test arena (accuracy of 1 s); (2) time spent within the 1 m radius around the male model of the species whose song was being played (hereafter the correct model circle) and time spent within the 1 m radius around the other male model (hereafter the wrong model circle); (3) song rate—the number of songs per minute sung between the time the subject appeared at the test arena and the end of the trial (for raw data see Supplementary Online Materials). Song rate was measured only during the trial, not before the test since I often could not discriminate singing focal males from neighbors. Playback trials were performed in the morning, between 0600 and 1200. Each male Blackcap was tested only once.

As experimental stimuli, I used recordings of natural territorial songs of Blackcaps and Garden Warblers, recorded with a Telinga 5-PRO parabolic microphone and a Tascam DA-P1 portable DAT recorder (16 bit/44.1-kHz sampling rate WAV files), early in the morning, since mid-April through May 2001. All recordings were made from a distance of less than 15 m, in forest or scrub habitats of the Kampinos Forest, 5–8 km from the present study area, each from a different individual. The recordings were free from distinct background singing, and were unedited. Each was arbitrarily numbered, and then used in playback trials in numerical order and only once to avoid pseudoreplication (Kroodsma 1989; Kroodsma et al. 2001).

Immediately after completing playback trials, I trapped the males by luring them with a playback of the conspecifics song into an ornithological mist-net (12 m long) set-up in the playback test arena. After capture males were aged, distinguishing SY males from ASY males on the basis of their molt patterns and feather quality (Jenni and Winkler 1994), ringed for further identification and released immediately. The percentage of first-year Blackcaps that may undergo complete post-juvenile molt, which would make them indistinguishable from ASY birds in spring, is very low in different populations of this species (< 2%, Morganti et al. 2013). For this reason, it is likely that this potential error in aging (i.e. SY males incorrectly classified as ASY males by plumage) may not affect the results of the present study. Of the 45 two-choice song playback trials performed, 23 were heterospecific trials (16 on SY males and seven on ASY males) and 22 were conspecific trials (13 on SY males and nine on ASY males).

Statistical analysis

All statistical analyses were carried out in Statistica v.13 (TIBCO Software Inc., Palo Alto, CA) software, and non-parametric tests were applied. First, using the Wilcoxon signed-rank test I compared the time spent by focal males in correct and wrong model circles, separately for heterospecific and conspecific playback trials. I expected that if male Blackcaps associate species-specific songs with species-specific plumage they should spend significantly more time in the correct than in the wrong model circle.

Second, I calculated the differences in time spent by subjects in model circles (the time spent in the correct model circle minus the time spent in the wrong model circle). I used the Mann–Whitney U test to compare these differences for SY and ASY males during heterospecific and conspecific playback trials. If SY male Blackcaps learn to associate species-specific traits during their first breeding season, I expected that during heterospecific playback trials, these males would spend more time in the wrong male circle or that they would spent a similar amount time in both model circles.

Finally, using the Kruskal–Wallis one-way ANOVA on ranks, I tested whether three behavioral variables characterizing the intensity of response to song playbacks—latency to approach the test arena, time spent in the right model circle, and song rate—differed significantly as a function of the age of the subject males and the species whose song was played back.

Results

All 45 subject male Blackcaps entered at least one of the two circles in which heterospecific or conspecific male models were presented simultaneously during playback trials. Typically the males approached the models while wing-flicking, singing and sometimes giving tac-tac calls. I did not observe physical attacks on the models.

Overall, male Blackcaps spent more time around the male model of the species whose song was being played than around the model of the other species, and specifically, SY and ASY males did not differ significantly in this regard. In heterospecific playback trials males spent significantly more time in the correct than in the wrong model circle (Wilcoxon signed-rank test, Z = 3.22, n = 23, p = 0.0012; Fig. 2a). In this group, 13 SY males spent more time in the correct model circle and 3 SY males spent more time in the wrong model circle, while all 7 ASY males spent more time in the correct model circle. Furthermore, SY and ASY males did not differ significantly in their tendency to spend more time in the correct than in the wrong model circle (Mann–Whitney U test, Z = 0.0, n = 16, m = 7, p = 1.0). Similarly, in conspecific playback trials subjects spend significantly more time in the correct than in the wrong model circle (Wilcoxon signed-rank test, Z = 4.11, n = 22, p < 0.0001; Fig. 2b). All 13 SY males and all 9 ASY males spent more time in the correct than in the wrong model circle. Here, SY and ASY males did not differ significantly in their tendency to spend more time in the correct than in the wrong model circle (Mann–Whitney U test, Z = 1,60, n = 13, m = 9, p = 0.1).

Fig. 2
figure 2

Time spent by male Eurasian Blackcaps of different ages near models of Garden Warbler and conspecifics males during heterospecific (Garden Warbler) song playback trials (a) and conspecific song playback trials (b). White box plots refer to time spent by subject males near the male model of the species whose song was being played back during the trial (the correct male model) and gray box plots refer to time spent by males near the model of the other species (the wrong male model). The horizontal midlines within the boxes represent the median value. Boxes depict the first (Q1) to second quartile (Q3) range of the data, and the whiskers extend 1.5 times beyond the interquartile range (IQR). Circles indicate outlier values above Q3 + 1.5 × IQR

The intensity of male Blackcaps’ response to song playback did not vary significantly with male age or whether the song of their own species or the song of Garden Warblers was played back. There were no statistically significant differences in latency to approach the test zone, time spent in the correct model circle or song rate response variables among male age groups and song playback treatments (Kruskal–Wallis one-way analysis of variance, latency to approach the test zone: H3 = 2.38, n = 45, p = 0.50; time spent in the correct model circle: H3 = 4.84, n = 45, p = 0.18; song rate: H3 = 3.52, n = 45, p = 0.32; Figs. 2and3).

Fig. 3
figure 3

Behavioral response intensity of male Eurasian Blackcaps of different ages to song playback as expressed by latency to approach the test arena (a) and song rate (b). White box plots refer to males’ response to heterospecific song playbacks and gray box plots refer to males’ response to conspecific song playbacks. The horizontal midlines within the boxes represent the median value. Boxes depict the first (Q1) to second quartile (Q3) range of the data, and the whiskers extend 1.5 times beyond the interquartile range (IQR). Circles indicate outlier values above Q3 + 1.5 × IQR

Discussion

Using a two-choice song playback experiment, I investigated whether male Eurasian Blackcaps are able to associate species-specific songs and species-specific plumage when identifying conspecific or Garden Warbler intruders in their breeding territories, and whether there are age-dependent differences between male Blackcaps in terms of this skill. I found that male Blackcaps, when looking for a rival in their territories and given a choice between two stuffed models of males (one of their own species and the other of the Garden Warbler), spent more time around the male model of the species whose song had been played back moments earlier. This suggests that these males were able to associate species-specific songs with species-specific plumage types. This ability characterized not only experienced ASY males, but also SY males that had not had the opportunity to defend breeding territories against Garden Warblers. This suggests that SY male Blackcaps acquire the ability to associate Garden Warbler song with plumage before they use this skill when defending territory during their first breeding episode. To my knowledge, this study is the first to test empirically whether such associations develop following territorial interactions with heterospecifics. Finally, I found that the intensity of male Blackcaps’ behavioral responses to song playback did not differ depending on whether a song of their own species or Garden Warbler’s song was being played back.

The present study, like the recent work by Darolová et al. 2020, showed that male Blackcaps respond as strongly to the playback of quite similar songs of the Garden Warbler as they do to the playback of conspecific songs. This finding corresponds well with previous studies showing that interspecific aggression by male Blackcaps results in exclusive territories defended against Garden Warblers (Raines 1945; Garcia 1983). This finding could also be consistent with the ‘mistaken identity’ hypothesis, according to which individuals cannot distinguish between heterospecific and conspecific signals (Murray 1971, 1981). However, as in previous studies (Matyjasiak 2005), here I showed that male Blackcaps are able to associate species-specific visual and auditory traits when recognizing conspecifics and heterospecifics. Male Blackcaps therefore do not mistake Garden Warblers for conspecifics, and the ‘mistaken identity’ hypothesis does not explain the existence of a response to heterospecific song in this species.

That male Blackcaps are able to associate species-specific visual and auditory signals might provide support for the ‘learned interspecific aggression’ hypothesis, according to which male birds learn to identify individuals of another species during regular encounters in the same habitat (Irwin and Price 1999; Price 2008; Grether 2011). According to the literature, males can learn these associations during interspecific competition for resources such as breeding territories, food supplies, nesting or roosting sites (Reed 1982; Catchpole and Leisler 1986; Gil 1997; Freeman 2016). But the major finding of this study is that SY male Blackcaps have the ability to associate Garden Warbler song with plumage even before they use it when defending their first ever breeding territories. However, this observation does not rule out the ‘learned interspecific aggression’ hypothesis as an explanation for the existence of interspecific aggressive response in male Blackcaps—learning in the subject males may have occurred earlier. At the same time, it does not exclude the possibility that the interspecific response in male Blackcaps has an innate component (the ‘innate interspecific recognition hypothesis’), which would accelerate the associative learning (Price 2008). Or that male Blackcaps make this association innately (Grether 2011). In both cases, the fact that SY male Blackcaps arriving on the breeding grounds in spring already have the ability to identify Garden Warblers should be advantageous for this short-lived species, whose individuals can reproduce only for 1 or 2 breeding seasons on average (Cramp and Brooks 1992).

When and where do male Blackcaps acquire the ability to associate Garden Warbler song with plumage? If this ability is not innate, this may take place in the natal summer, during autumn or spring migration, or in a wintering area. Juvenile Blackcaps do not defend territories during their post-fledging movements, and tend to track food supplies and congregate in places where there is an abundance of food (Chernetsov 2002). Meanwhile, in July and early August, the older Blackcap and Garden Warbler males continue to sing. In some birds, interspecific aggression by the dominant species continues into the fledgling period (e.g. Rice 1978). Juveniles can learn species-specific traits by eavesdropping on and watching older males contests, including heterospecific interactions (the ‘eavesdropping hypothesis’). They may also initiate direct interactions with older males on their own (the ‘direct interactions hypothesis’, see e.g. Davies and Welbergen 2009; Templeton et al. 2010). Later on, starting with autumn migration, the learning opportunities seem to be fewer. First, there is only a slight overlap between the winter areas of Blackcaps and Garden Warblers. Blackcaps from central European breeding populations spend winter mostly in the Mediterranean or in north Africa (Cramp and Brooks 1992; Hiemer et al. 2018). Garden Warblers winter further south, in the sub-Saharan Africa (Cramp and Brooks 1992; Urban et al. 1997). However, some Blackcaps from central European populations may travel as far south as the Afrotropic, where north European populations of this species typically winter, and where they can meet Garden Warblers during wintering (Cramp and Brooks 1992; Mokwa 2004). Second, while some migratory bird species at least periodically defend territories during stopovers or on winter grounds (Stutchbury et al. 2005), Blackcaps and Garden Warblers rarely defend foraging territories during migration or wintering. At stopover sites or on wintering grounds both species are known to wander around, often in mixed-species groups, tracking ephemeral food sources (they are frugivorous-insectivorous during migration and wintering; Cramp and Brooks 1992; Urban et al. 1997). They tend to congregate in sites where food is plentiful, e.g. around fruiting trees or bushes, where a dominance structure may develop in which dominants chase subordinates away but do not use singing as an agonistic signal (Schaub and Jenni 2000; King and Hutchinson 2001; Belda et al. 2007; Salewski et al. 2007; Iwajomo et at. 2017). In the Mediterranean wintering area, sedentary individuals often defend their home ranges containing food sources from migrating or wintering conspecifics, but they do not sing while doing it (Pérez‐Tris and Tellería 2002; Tellería and Pérez‐Tris 2007; Morganti et al. 2017). In spring, males of both species may begin singing as early as during the migration, at stopovers (Cramp and Brooks 1992). However, opportunities for interspecific contact are unlikely during this period because Blackcaps migrate earlier than Garden Warblers (Rubolini et al. 2010).

Future research might resolve which of the scenarios described above is valid. The importance of innate mechanisms in the development of the associations of song with plumage can be investigated by performing double-choice song playback trials on male Blackcaps raised in captivity, which had or had not contact with congeners. The importance of social variables in associative learning might be investigated by tracking free living juvenile male Blackcaps during their natal summer months (e.g. Templeton et al. 2010; Snyder-Mackler and White 2011). The laboratory phase of such research could include experimentally arranging interactions between captive-bred juvenile Blackcap males and adult tutors. It would be important to investigate whether the association skills vary across development by testing male Blackcaps at various stages of their early life, from the phase shortly after fledging to about 10 months of age when the males return from Africa. Further research is also needed on whether the mechanism described here for identifying heterospecifics also applies to other bird species and how associations, if present, are formed.

Summing up, the present study shows that SY male Blackcaps acquire the ability to associate Garden Warbler song with plumage before using this ability to defend their breeding territories during the first breeding season. This suggests that these males do not mistake male Garden Warblers for males of their species, but rather learn these associations during the first months of life. However, this study does not exclude the possibility that these associations are innate.