Introduction

Negotiation of territorial boundaries is a costly and complex process (Vehrencamp et al. 2014). The taking and maintenance of a territory can be time-consuming, energetically demanding, and risky (Copenhaver and Ewald 1980; Low 2006). And while the goals of territorial rivals are divergent, they do share common goals of minimizing cost and avoiding injury. Therefore, animals are expected to act strategically in an effort to balance costs and potential benefits under conditions of uncertainty (Searcy and Nowicki 2005).

Species’ territorial strategies involve syndromes of signals and direct actions (Sih et al. 2004). The signaling component is the first line of defense and aims to minimize cost and uncertainty (McGregor 1993; Bradbury and Vehrencamp 2011), whereas the direct or non-signaling component, such as attack or exploration, involves direct physical actions aimed at taking or maintaining territory (Kaiser et al. 2019). Species-wide and in the long-term, territorial intrusions prompt specific combinations of both components. However, individual decisions and tactics are influenced by a combination of factors including resource-holding potential, experience, resource value, and motivational state (Hurd 2006; Arnott and Elwood 2008; Kasumovic et al. 2009; Bergman et al. 2010).

In birds, most territorial strategies combine vocal signaling with some form of direct aggression, yet taxonomic groups differ substantially. Vocal signaling of songbirds, for example, has a complex, hierarchical, and learned structure, which is used sequentially and flexibly in parallel with conflict escalation (Searcy and Beecher 2009; Vehrencamp et al. 2014). Consequently, direct aggression among songbirds is limited and rudimentary in form (e.g., Krebs 1982; Searcy et al. 2006; de Kort et al. 2009; Ali and Anderson 2018). Social interactions among vocally non-learning species are equally complex (Marler 2004). However, innate programming and limited functionality of the vocal apparatus create a constraint that must be counterbalanced. Many non-passerines rely on threat postures and low-frequency growling calls, which are more reminiscent of mammalian behavior than passerine behavior (e.g., Craig 1977; Hand 1986; Hansen 1986; Waas 1991; Pandolfi and D’Astore 1992; Côté 2000). This information suggests that the diversity of territorial strategies among different species appears to have deep evolutionary roots.

In addition to phylogenetic differences, the inherent risk of the interaction, i.e., unpredictability and irreversibility of consequences, likely determines the signaling and non-signaling behaviors used by the species. Predators, such as raptors or owls, have deadly weapons at their disposal for hunting prey. However, because the use of these weapons can be fatal, predators typically avoid fighting with conspecifics (Garcelon 1990). Bird species do not usually have adaptations specifically for fighting, but long, sharp beaks or strong legs are prevalent among families such as Ardeidae, Gaviidae, Laridae, Struthionidae, or Rallidae, which could result in serious injury during a conflict (Pierotti and Annett 1994; Jemieson 1997; Piper et al. 2008). Therefore, as long as the risk is high, rivals should rarely approach each other during territorial disputes, but should instead rely on long-range signaling or ritualization (Enquist and Leimar 1990), as observed in many raptors and owls (e.g., Temeles 1990; Hansen 1986; Penteriani et al. 2007; Severinghaus 2008). Nevertheless, fights are undoubtedly the most definite solution of any conflict and, as long as it is not necessary to avoid them, fighting should make an inherent part of any territorial strategy. Fighting should not be exchanged with any purely signaling strategy (Enquist 1985), as shown even for many small passerines (Searcy et al. 2013).

Vocal systems of non-passerines are evolutionarily limited, but they can still effectively substitute direct aggression. Some birds, such as crakes, modify the temporal distribution of their calls or switch between loud and soft calls when interacting with intruders (Ręk and Osiejuk 2011, 2013; Ręk 2015). However, such signals have motivational character and short duty cycles, and so they accompany direct actions rather than a substitute for them (Ręk and Osiejuk 2011, 2013; Ręk 2015). By contrast, because vocal production is subject to morphological, physiological, and neural mechanistic constraints (Podos and Nowicki 2004), some acoustic parameters can provide information about the sender’s inherent qualities. For example, the fundamental frequency of vocalizations is generally negatively related to body size (Goller and Riede 2013; Riede et al. 2016), yet the reliability of this relationship depends on the mechanism used. Birds use several mechanisms to produce low-frequency sounds, either open or closed-mouthed (Fitch 1999; Suthers 1990). The close-mouthed mechanism evolved independently in several lineages of non-passerines and generates conditions that favor low-frequency sounds so that they can produce lower frequencies than vocalizations into an open vocal tract of similar size (Riede et al. 2016), suggesting that reliable signaling with low-frequency might have only evolved in some species.

We studied territorial behavior and communication of two sympatric non-passerine species: Water Rail (Rallus aquaticus) and Little Crake (Zapornia parva). Both Rallidae species are cryptic, socially-monogamous for at least one breeding season, and occupy similar wetland habitats (Taylor 1998; Jedlikowski et al. 2016). They produce simple repertoires of innately programmed calls, with males and females having the same repertoire in addition to sex-specific courtship calls (Cramp and Simmons 1980). In both species, partners cooperate in territorial defense and breeding, and facultatively in vocal duetting (Bengtson 1967; Cramp and Simmons 1980; Dittberner and Dittberner 1990; Jedlikowski and Brambilla 2017). However, Water Rails are three times heavier, have beaks that are twice as long, and are more aggressive than Little Crakes (Cramp and Simmons 1980). Water Rails have been observed attacking conspecifics during territory establishment in pre-nesting and wintering periods (Bengtson 1967; King 1980; Taylor 1998; Ciach 2007), and chasing and killing other bird species with their beaks (Barry 1995; Steiof 1999; Ciach 2007). Little Crakes defend breeding territories against conspecifics, but they are not offensive and seem to tolerate other marsh nesting species unless directly threatened (Cramp and Simmons 1980; Dittberner and Dittberner 1990).

The overall goal of this study was to test whether the behavioral and morphological differences between these similar species is related to their use of specific territorial defense strategies during breeding. To test this, we carried out a playback experiment and observed the birds’ behavior and signaling responses. Our predictions were that the Water Rail, as the species able to cause serious injury to other birds, would be less likely to act aggressively during interactions with conspecifics than the Little Crake. Additionally, Water Rails would display more vocal signaling than the Little Crake to compensate for the lack of direct aggression.

Methods

Study area and species

The study was carried out in the central part of the Mazurian Lakeland (NE Poland) in 2016 and 2017. We collected data at 32 small mid-field water bodies distributed near Łuknajno Lake between 53°47′–53°53′ N and 21°33′–21°44′ E. The total area of the water bodies varied from 0.18 to 5.75 ha (mean = 2.08), and all of them were overgrown by emergent vegetation (see Jedlikowski et al. 2016 for detailed description).

Water Rails and Little Crakes occupied the study area during the breeding season (April–September); yet, the distribution of territories varied seasonally depending on water level and availability of dense emergent vegetation. During the study period, we identified 47–65 breeding pairs of the Water Rail and 20–24 breeding pairs of the Little Crake. According to telemetry measurements, the areas occupied by breeding pairs varied from 325 to 1600 m2 (mean 1105.8 m2 ± 281.3 SE, n = 4) for the Water Rail and from 248 to 1225 m2 (mean 793.0 m2 ± 149.7 SE, n = 7) for the Little Crake (Jedlikowski and Brambilla 2017). The minimum distances between nests were 52 m and 28 m, respectively (Jedlikowski and Brambilla 2017).

Playback experiment

To examine the territorial behavior of Water Rails and Little Crakes, we carried out the playback experiment between May and July of 2016 and 2017. The experiment used acoustic playback that aimed at imitating territorial intrusion. Before the experiment, we located all nests of the subject pairs and mapped their territories by monitoring each water body at least once a week from the time the birds arrived until the end of the breeding season. To reduce variation in bird response within breeding stages, only incubating breeding pairs were tested.

The playback experiment was carried out on 26 pairs of the Water Rail and 24 pairs of the Little Crake, all subjected to multiple treatments. To minimize the chance of testing the same individuals twice, we performed playbacks at different water bodies each year. We captured and individually color-banded 35 Little Crakes from 23 pairs and four Water Rails from four pairs. We did not find any territory changes within a breeding season and there was a low return rate between seasons (four crakes—11%).

The design of the experiment was similar for both species. Each focal pair was subjected to two treatments: a conspecific treatment (Little Crake or Water Rail) and a heterospecific treatment (control—Little Grebe Tachybaptus ruficollis). The Little Grebe was used as a control because it regularly co-occurred with rallids within the studied area, and our former observations suggested that it would be neutral for the focal species. Little Grebes nest mainly in the patches of the Phragmites australis (Jedlikowski, unpubl. data) and collect food (benthonic invertebrates, amphibians, and small fish) diving at the open water area (Ceccobelli and Battisti 2010). In contrast, both rallids nest mainly in dense patches of Carex spp. and Typha spp. (Jedlikowski et al. 2016), and forage on invertebrates collected from the emergent vegetation or water surface (Taylor 1998). Treatments were carried out 60–90 min apart in a balanced order. For the conspecific treatment, we used male-specific calls produced during territory formation at the beginning of the breeding season (Polak 2005). In both rail species, these calls are uttered by solitary males in long series (Cramp and Simmons 1980; Polak 2005; Supplementary file1). For the control treatment, we used territorial calls of the Little Grebe (Supplementary file1). The conspecific stimuli were selected from a sample of nine Water Rails and six Little Crakes, whereas the heterospecific stimuli were selected from vocalizations of eight Little Grebes. The playback stimuli were thus not fully replicated among experimental pairs, but used 2–3 times each for the Water Rail treatments, four times each for the Little Crake treatments, and 3–4 times each for the Little Grebe treatments.

Playback stimuli were prepared from high-quality recordings. For each stimulus, we recorded a few minutes of the spontaneous calling of one individual, trimmed the recording to 1-min long uninterrupted fragment, and replicated this fragment six times. Call rates within the stimuli were natural: 130 ± 18 calls min−1 for Water Rails, 117 ± 17 calls min−1 for Little Crakes, and 169 ± 25 calls min−1 for Little Grebes (Jedlikowski unpubl. data). The recordings were made in the Łuknajno Lake (1–9 km from studied areas) in April of each year from a distance of 10–15 m from the subjects. Recordings were made with a Marantz PMD 661 MKII recorder and a Sennheiser ME67 unidirectional microphone stored in wav format using a sample rate of 48 kHz and a resolution of 16 bits. Recordings were edited using the Raven Pro 1.5 software (Bioacoustics Research Program 2014). All call samples were high-pass filtered (100 Hz) to remove low-frequency background noise.

Field methods

Treatments had identical timelines and execution. Treatments were carried out between 08:00–11:00 and 16:00–22:00 (local time). Each treatment lasted 15 min and consisted of three phases: 3-min of playback and 2-min of silence, 2-min of playback and 3-min of silence, and 1-min of playback and 4-min of silence. Playbacks were broadcast at 60 dB Sound Pressure Level (SPL at 5 m) for Water Rails, 55 dB SPL for Little Crakes, and 58 dB SPL for Little Grebes. These levels correspond with natural amplitudes measured by the authors with a CHY 650 Sound Level Meter (58–65 dB SPL from six Water Rails, 51–58 dB SPL from four Little Crakes, 55–61 dB SPL from five Little Grebes; at around 5 m). For playbacks, we used the Philips GoGEAR player connected to two loudspeakers (Pignose Legendary 7–100 Portable Amp). The loudspeakers were deployed at least 1 h beforehand on floating platforms within a breeding pair territory and hidden with dense vegetation 10 m from their active nest in a random direction. We used this distance to mirror the area usually defended by both species (Jedlikowski and Brambilla 2017). The vocal reaction of the birds was recorded using a digital recorder (Marantz PMD 661 MKII) coupled with a unidirectional microphone (Sennheiser ME67) set up on the tripod 15–20 m from the nest (48 kHz and 16 bit PCM files). The non-vocal behavior was recorded by two camouflaged digital wildlife observation cameras (Bushnell NatureView Cam HD) located 1.5 m in front of each loudspeaker. The cameras were movement-sensitive and able to record in low-light conditions without disturbing the birds (invisible infrared flash).

Responses to treatments

We started by describing the structure and production of natural vocalizations to assess the level of motivation of the subjects. During the incubation period, birds mainly remained silent and used territorial calls only when disturbed (Cramp and Simmons 1980; Jedlikowski unpubl. data). This suggests that the calls recorded during treatments were prompted by our stimuli. We counted calls (discrete vocalizations), classified them into particular variants (Water Rail) or types (Little Crake), and assigned them as solos or duets (Figs. 1, 2). Then, we visually scanned spectrograms of the experimental recordings using Raven Pro 1.5 software. The process of classification was straightforward because the same categories were present in multiple individuals, and no call could be assigned to more than one category (Figs. 1, 2; see also Results). However, the terms variant and type were used only for precision; specifically to reflect the fact that Water Rail calls are formed according to one general acoustic design, whereas calls of Little Crakes appear more polymorphic (Figs. 1, 2).

Fig. 1
figure 1

Spectrograms and oscillograms of Water Rail growling-squealing calls. Typically, calls are produced in series solo (a) or in duets (b). A series may consist only of disorganized pulses (V1) or contain one to three distinct call variants: soft growling calls (V2), muted squealing calls (V3) and harsh squealing calls (V4). At b I and II indicate calls produced by two birds in a duet. For the original recording, see Supplementary file2 or https://www.animalsoundarchive.org

Fig. 2
figure 2

Spectrograms and oscillograms of Little Crake calls. We identified four general types of calls: a chatting, b doublet, c peaks, and d trill. For the original recording, see Supplementary file3 or https://www.animalsoundarchive.org

Furthermore, we measured the acoustic parameters of calls produced in response to treatments (see Table 1 for a list and Figs. 1, 2 for visualization). We only measured calls that did not have strong background noise (n = 133 for the Water Rail and n = 244 for the Little Crake). The measurements were taken from sonograms using Raven Pro 1.5 software. To maximize precision, we used different settings of the sonogram window for spectral and temporal measures (Table 1), and these were: ‘Window: Hann, 2048 samples; 3 dB bandwidth: 67.4 Hz; frame overlap 50%; DFT Size: 8192 samples’ for spectral measures and ‘Window: Hann, 512 samples; 3 dB bandwidth: 270 Hz; frame overlap 50%; DFT Size: 512 samples’ for temporal measures. We assigned vocal responses to the male–female pairs rather than to individuals because both birds typically responded to the playback by calling sequentially or by producing duets. In most cases, it was impossible to identify the sex of the caller. Nevertheless, we could easily deduce that two birds were calling because calls were produced one after another from two locations and confirm it by inspecting spectrograms. Recordings from our camera traps and earlier observations supported this approach.

Table 1 Description of acoustic parameters used in the sound analysis

Finally, we quantified non-vocal responses by recording when birds approached the speaker and the duration of time the birds spent in the vicinity of the speaker. To do this, we analyzed video recordings from the camera traps (1080p Full HD). Birds that came within 1.5 m were noted as approaching the speaker. The duration of time spent within 1.5 m from the speaker (in seconds) was measured. Recordings were assigned to individuals through visual inspection on the video recordings because both species can be sexed visually.

Statistical analysis

We used generalized linear mixed models to compare the responses of birds to the treatments. This procedure is appropriate for repeated measures data, which enabled us to account for responses of the male and female within a pair and responses of the same pairs to multiple treatments, considering their order. The procedure is also appropriate for non-normally distributed data. Within this procedure, we included the recording ID as a random effect to emulate the randomness in playbacks. This approach is commonly used to account for pseudoreplication in the data (Millar and Anderson 2004). We analyzed the responses of each species separately using different response variables based on the particular behaviors that were observed. To compare the numbers of calls and acoustic parameters between treatments, we fitted the variables using the normal distribution. Because we could not always recognize individuals within a pair, we compared pairs (random effect) in terms of the vocal response, controlling for the within-pair variance. We found that contrary to the significant overall variance (within + between pairs), the within-pair variance was at least 10-times smaller and never significant, suggesting that either the same individuals responded during both treatments or that situations in which the male responded to one treatment and female to another were unlikely and did not contribute significantly to results. In the analyses with acoustic parameters, we additionally used the call variant/type and the interaction treatment-call variant/type as fixed effects. To compare non-vocal responses, we fitted the data using a binomial distribution with logit link function for the approaching behavior and normal distribution to analyze the time spent near the speaker. Because the approaching birds were video recorded and identified, we compared individuals for their non-vocal response, by adding individual identities as the random effect in the models. In addition to treatments, we controlled initially for the time of the experiment (morning-evening) in all analyses. However, this factor was never significant, and it was removed from the final models. In all analyses, Fisher’s LSD method was used to create confidence intervals for differences between treatment means. We used the SPSS v. 25 software (IBM, Armonk, NY, U.S.A.) for statistical analyses. All P values were two-tailed and, if not stated otherwise, means ± SE are given.

Results

Structure and production of vocalizations

Water Rails responded to treatments with a characteristic series of growling-squealing calls (Fig. 1). Each series contained 1–12 calls (3.6 ± 0.14, n = 115) that could be assigned to four variants. The acoustically simplest variants consisted only of the F-component. The F-component makes a soft, amplitude modulated, growling sound, which can be produced as a disorganized series of pulses (Fig. 1, variant 1) or in a trilled like fashion (Fig. 1, variant 2). It is produced when the bird’s beak and nares are closed, and the air is pumped back and forth to the crop (Supplementary file4 and 5). The remaining variants arise when the bird opens the beak and an increasing fraction of the air is exhaled. The effect of this process is specific biphonation with two components having different fundamental frequencies (F-component: 454.6 ± 3.4 Hz; G-component: 2542.1 ± 30.2 Hz). When the beak is opened only partially, the F-component is still strong, and the G-component sounds soft (Fig. 1, variant 3). In contrast, when the beak is fully opened, and no air is pumped back into the crop, G-component is very loud and contains sub-harmonics, which makes the sound harsh and eventually muffles the F-component (Fig. 1, variant 4). The consecutive variants (1–4) were characterized by increasing amplitudes (Fig. 1—oscillograms).

Little Crake responded to treatments with four distinct call types, all consisting only of a single fundamental frequency (Fig. 2). All call types were characterized by a similar structure with multiple harmonics and small to moderate frequency modulation but differed in their temporal organization and amplitudes (Fig. 2—oscillograms). The softest calls resembled listless chatting and were produced in more or less regular series (chatting, Fig. 2a), whereas the loudest calls were typically produced in couples (so-called doublets, Fig. 2b). The remaining two types were easily distinguishable short series of peaks (calls in series: 1.8 ± 0.15, n = 321; Fig. 2c) and long series of trills (calls in series: 19.2 ± 4.53, n = 27; Fig. 2D). Such trills can be produced in series solo or in duets.

Responses to treatments

The two bird species responded differently to conspecific and heterospecific calls. Water Rails produced more individual and duet calls in response to the conspecific treatment (solo calls: F1,200 = 26.52, P < 0.001; duets: F1,150 = 4.45, P = 0.04; see Fig. 3a, b). However, duets did not include variant 1 calls and the proportion of louder call variants (V3 and V4) was higher in duets than in solos (Χ23 = 22.53, P < 0.001). This difference in proportions was significant only for the conspecific treatment (Χ23 = 18.63, P < 0.001), meaning it was not a by-product of the overall decrease of the number of softer call variants during this treatment. Furthermore, the acoustic structure of Water Rail calls differed between treatments (Fig. 4). During the conspecific treatment, the Water Rails called with lower fundamental frequency (F1,128 = 43.72, P < 0.001; Fig. 4a), higher peak frequency (F1,130 = 4.05, P = 0.046; Fig. 4b), higher first quartile frequency (F1,133 = 3.13, P = 0.079; Fig. 4c), and higher third quartile frequency (F1,132 = 4.68, P = 0.032; Fig. 4d), but with similar call duration (∆F·P = 0.594; ∆F + G·P = 0.249; Figs. 4e). At the same time, despite these differences, the fundamental frequency was correlated significantly with the peak and quartile frequencies (F0—Peak: r = − 0.39, P < 0.001; F0—Q25: r = − 0.43, P < 0.001; F0—Q75: r = − 0.41, P < 0.001; acronyms correspond to Table 1). Finally, Water Rails showed few attempts at direct action, with only three out of 52 birds (26 pairs) approaching the speaker (conspecific treatment = 3, control = 0).

Fig. 3
figure 3

Individual (a) and cooperative (b) vocal responses of Water Rails to conspecific and heterospecific (control) treatments. The boxes show mean ± SE. Significant differences are indicated with symbols: *P < 0.05, **P < 0.01, ***P < 0.001; ns not significant

Fig. 4
figure 4

Variability of structural (ad) and temporal parameters (e, f) of Water Rail and Little Crake calls in relation to conspecific and heterospecific (control) treatments. The boxes show mean ± SE. Significant differences are indicated with symbols: *P < 0.05, **P < 0.01, ***P < 0.001; ns not significant

Little Crakes also produced more calls in response to the conspecific treatment (F1,184 = 30.22, P < 0.001; Fig. 5). All call types increased during the conspecific treatment except for chatting calls (F3,184 = 2.88, P = 0.04; Fig. 5a), yet all but three series of trill (out of 23) were solos. Trills were only produced during the conspecific treatment, and in nine out of 12 cases by approaching birds (Χ21 = 6.35, P = 0.01). However, contrary to Water Rails, the acoustic structure of Little Crake calls differed marginally between treatments (Fig. 4). During the conspecific treatment, birds called with a higher third quartile frequency (F1,243 = 6.45, P = 0.012; Fig. 4d), but this parameter was not correlated significantly with the fundamental frequency (r = − 0.044, P = 0.49). Furthermore, and also in contrast to Water Rails, Little Crakes often approached the speaker (13 males and seven females/48 birds from 24 pairs). Approaches occurred more often and lasted longer during the conspecific treatment than during the heterospecific treatment (approach: F1,94 = 7.57, P = 0.007; length: F1,94 = 6.00, P = 0.02; Figs. 5b, c, respectively). Little Crakes were also more likely to approach the speaker alone than as a pair (16 vs. 2 cases; Χ21 = 6.41, P = 0.01).

Fig. 5
figure 5

Vocal (a) and non-vocal (b, c) responses of Little Crakes to conspecific and heterospecific (control) treatments. Because Little Crakes produced only three duets (out of 23 series), solo and duet calls were counted together. The boxes show mean ± SE. Significant differences are indicated with symbols: *P < 0.05, **P < 0.01, ***P < 0.001; ns not significant

Discussion

Water Rails and Little Crakes both responded to conspecific calls but did so using distinct strategies. Water Rails increased the intensity of calling but disproportionately for louder call variants, while also producing more duets. They also modified the acoustic structure of their calls. However, they rarely approached the speaker, which suggests that Water Rails responded to territorial intrusion purely vocally. Little Crakes also increased the intensity of calling, but the increase was neither linked with the amplitude nor with the acoustic structure of calls as it was for the Water Rails. In contrast to Water Rails, Little Crakes frequently approached the speaker, suggesting that Little Crakes use both vocal signaling and direct action in response to the territorial intrusion. Overall, these differences in response demonstrate how two similar species and in the same habitat can develop differing adaptations for dealing with the territorial conflict settlement.

The territorial responses of Water Rails to conspecific playback come down to non-arbitrary changes in vocal behavior. Water Rails produced calls more frequently and louder in response to the conspecific treatment, suggesting a higher cost to defending their territory. However, these simple effects could be by-products of the caller’s agitation, without any signaling implications. Call or song rate has been frequently linked with energetic cost (Oberweger and Goller 2001; Hasselquist and Bensch 2008). Some studies challenge this view indicating that the production of vocalizations can be energetically cheap in relation to the overall daily energy budget (Ward et al. 2004). Similarly, call amplitude has a physical link with energy expenditure, but it could equally be assigned to the Lombard effect (Brumm and Zollinger 2011). Nonetheless, Water Rails also changed the structure of their calls. During both treatments, birds produced biphonal calls, which started with the low-frequency close-mouth F-component, followed by an open-mouth high-frequency G-component. The fundamental frequency of the F-component decreased when birds responded to conspecific playbacks, whereas the G-component sounded higher. The closed-mouth calling allowed birds to achieve very low frequencies, suggesting that such behavior evolved specifically for this purpose. However, such a mechanism comes down to very low amplitudes, particularly in smaller birds (Ręk 2014; Riede et al. 2016). Therefore, most species do not communicate solely with low-frequency calls but accompany them with higher frequency components. In Water Rails, the spectral distribution of energy of the loud component was correlated with the fundamental frequency of the soft component. This suggests that receivers might acquire enough information to decide whether to escalate or retreat from a distance. Overall, these results suggest that some signal designs can be particularly useful in preventing fights, provided their production and transmission is controlled and substantiated with a robust and effective mechanism.

Water Rails duetted more often during the conspecific treatment, as expected for cooperative investment in territorial defense (Hall 2009). Avian duets are displays where two birds coordinate their vocalizations with a degree of temporal precision (Farabaugh 1982). Despite many reports (Taylor 1998), the duetting behavior has never been tested experimentally among the Rallidae. However, the clear alternation of the male and female calls seen in this study (Fig. 1b) implies that the joint calling of Water Rails was duetting. Furthermore, Water Rails used louder call variants (V3, V4) at a higher proportion in duets than in solo calls, suggesting that duets in Water Rails reflect the highest level of anxiety of the signalers. Therefore, irrespective of whether rail and passerine duets represent the same or different behaviors, both have many functional and structural similarities.

In contrast to the vocal response, Water Rails only exceptionally approached the speakers. The low number of Water Rails approaching the speaker during the experiment was reflected later in the low effectiveness of our trapping for that species, as opposed to the effectiveness of the same methods for Little Crakes. Similar low responsiveness was observed for the Ridgway’s Rail (Rallus obsoletus) during call-count surveys (Bui et al. 2015). Reluctance to approach was likely not due to nest protection because incubating birds were observed leaving the nest to participate in duetting. Overall shyness is also not a reasonable explanation because the opposite behavior was observed for solitary males. At the beginning of the breeding season, solitary and presumably non-territorial males were frequently flying towards the loudspeaker, actively searching for the opponent (Authors’ personal observations). This behavior ceased as soon as territories and pair bonds were established. This discrepancy suggests that aggression in the Water Rail is an offensive strategy aimed at obtaining a female and territory, but otherwise avoided in territorial defense. Therefore, as long as the playbacks did not threaten the birds directly, they could apply a less risky approach of signaling and waiting.

Similarly to Water Rails, Little Crakes increased calling in response to the conspecific playback, but they did not modify the acoustic structure of their calls like Water Rails. All Little Crake call types were open-mouthed and lacked the growling component found in the calls of Water Rails and other sympatric rallids, such as the Corncrake or Spotted Crake (Porzana porzana) (Ręk 2014, 2015). At the same time, apart from the purely quantitative effect, Little Crake calls remained surprisingly similar during both treatments. Calls had relatively high fundamental frequency (Table 1), which was not changed between treatments (Fig. 4). These acoustic and mechanistic limitations appear to be interconnected, suggesting that Little Crake calls have not evolved to preclude further aggressive escalation.

In addition to their differences in vocal signaling, Little Crakes were more likely to approach the speaker in response to conspecific calls and remain in close vicinity of the speaker than Water Rails. We observed that a typical approach by Little Crakes is accompanied by vocal signaling and, according to an earlier study, some threatening postures such as stretching of wings (Koenig 1943). The Little Crakes were observed to run around the speaker, jump up on to the speaker, and peck at it. We did not use any stuffed model, but earlier observations suggest that would end with a physical attack towards the model (Koenig 1943). These observations suggest that some form of direct action, in the form of physical aggression, is an inherent part of Little Crake’s territorial strategy.

In conclusion, our study showed that territorial aggression and signaling does not necessarily reflect species behavior in other contexts and its morphology. In our experiment, both Water Rails and Little Crakes were in a privileged position as territory holders, which means that they neither had to call nor to approach. However, despite this initial standardization species responded differently; Water Rails applied a costly signaling strategy, whereas Little Crakes relied more on direct actions. We suggest that even if such differences might appear counterintuitive, they are justified evolutionarily. Aggression is common when the risk of injury is low, whereas costly signaling is used to supplements aggression when the risk is high.