Introduction

In the field of evolutionary biology, island-mainland systems worldwide serve as one of the few definitive biogeographic contexts for understanding the mechanisms behind species variation. Species inhabiting islands often display phenotypic differences in terms of body size and shape, color, vagility, diet, fecundity and life history traits, when compared to their closely related counterparts on the mainland (Whittaker and Fernandez-Palacios 2007; Losos and Ricklefs 2009; Baeckens and van Damme 2020). One significant observation to highlight is that islands are characterized by lower species richness, making them species-poor communities compared to the mainland (Losos and Ricklefs 2009). This provides the context for any explanatory framework researching on how isolation can influence various traits. Based on this, it would be expected that the removal of a local species would lead to a reduced interspecific competition, forming the foundation for the Character Release Hypothesis (Grant 1972). Hence, in avian communities with a decline in richness, decreased interspecific competition, as proposed by Naugler and Ratcliffe (1994), open new ecological opportunities for the expansion and manifestation of traits, such as acoustic behavior on islands (Robert et al. 2019).

The study of avian acoustic behavior has become a valuable source of data for exploring the ecological and evolutionary patterns linked to vocal communication signals. Vocalizations are instrumental for individual recognition within species (Seddon 2005), a factor that can influence mate choice (Catchpole and Slater 1995). Consequently, bird song structure contains a substantial amount of inheritable phenotypic information (McCracken and Sheldon 1997). Among non-vocal learning species in passerines (suboscines), there is scant evidence of learning in the song development of specific species, particularly within Tyranni (Kroodsma 1984; Kroodsma et al. 2013) and Thamnophilidae (Touchton et al. 2014). Empidonax flycatchers' research has established the presence of a rudimentary neural-respiratory circuit that is far more developed than within vocal learning birds (Liu et al. 2013) and unveiled candidate genes showcasing differential expression in the suboscine brain of an Empidonax flycatcher related to bird song production (García et al. 2023). This suggests that the role of learning cannot be entirely dismissed in the acoustic behavior of non-vocal learning suboscines. Nevertheless, it remains evident that a substantial amount of data is encoded within what appear to be innate features of avian acoustics in non-vocal learning suboscines.

Non-vocal learning birds are typically expected to exhibit limited variation in their acoustic signals within species (Lindell 1998; Bard et al. 2002; Seddon and Tobias 2006; Tobias et al. 2010). Interestingly, geographic variation in vocalizations may function as localized communication signals for intraspecific discrimination. Moreover, the geographic structure of these vocalizations could likely be influenced by natural selection (Wilkins et al. 2013; Shipilina et al. 2017). Therefore, investigating the geographic variation in the songs of non-vocal learning birds has the potential to provide insight into understanding how populations' acoustic behavior diverges. Nevertheless, the absence of genetic data in studies of acoustic behavior may often hinder attempts to understand the spatial arrangement of songs in non-vocal learning birds (Sementili-Cardoso et al. 2018). Consequently, the spatial distribution of inherent vocal signals serves as a hypothesis regarding the geographic arrangement of genetic variation in suboscines.

The patterns of isolation within island biota create a context that is crucial to consider when interpreting the intricate dynamics of species' life cycles (Bellemain et al. 2008). Such patterns may indeed play a significant role in the rapid accumulation of functional diversity on islands (Theron et al. 2001). Both oceanic and land-bridge islands display remarkable patterns of endemism, suggesting the intriguing possibility that this endemism could also extend to communication signals. However, the accumulation of endemism in land-bridge islands may exhibit fundamental differences. These islands were once connected to the coastline, hosting taxa assemblages remarkably similar to those of the mainland. The process of accumulating biota on these islands is closely linked to their isolation due to rising sea levels (Han et al. 2022). This likely resulted in significant changes in the islands' relationship with the coastline and their biota (Wilson et al. 2015). Conversely, taxon assemblages on oceanic islands likely arose through long-distance dispersal, limited to highly vagile species, with only one or few colonization events shaping the circumstances for main evolutionary outcomes (Whittaker and Fernandez-Palacios 2007). Although the small area of oceanic landmasses makes these islands remarkable natural laboratories (Whittaker et al. 2017), it is important to note that near-shore islands are also significant biogeographical settings for the study of ecology and evolutionary biology (Wilson et al. 2015; Salerno et al. 2022).

Whether insularity affects behavior has received far less attention in the literature (Gavriilidi et al. 2022). In the context of the Character Release Hypothesis, several studies have documented distinctions in acoustic behavior when comparing avian species in oceanic islands and closely related counterparts on the mainland (Morinay et al. 2013; Robert et al. 2019, 2021, 2022), prompting our interest in this phenomenon. These studies suggest the release of acoustic space for sound-producing populations in oceanic islands, which results in that the remaining species on the islands will expand some of their acoustic traits in a less competitive environment compared to the mainland.

The Black-crowned Antshrike is a suboscine species in the suborder Tyranni that has shown to be a remarkable system for studying behavioral ecology (Tarwater 2012; Tarwater et al. 2013; de Zwaan and Roitberg 2015). Its range spans from Central America to the Northern Andes, encompassing elevations from the sea level to 1500 m (Brumfield and Edwards 2007). This species has a rather simple and stereotyped pattern of acoustic elements; a perching display of a recurrent loud patterned trill that starts slowly and culminates in a memorable terminal trill note, referred to as loudsong (Isler et al. 1997). In addition, T. atrinucha also emits rapid calls consisting of short repetitive sequences, often during low volume exchanges between paired individuals in close proximity, which are termed rattles (Isler et al. 1997). While these vocalizations likely serve various purposes, in the case of T. atrinucha, their specific functions as acoustic phenotypic traits in this species remain largely unknown. It is widely suggested that the loudsong may play a significant role in the species recognition system of thamnophilids (Isler et al. 1997, 1998, 2012).

In this study, we investigated the evolution in the frequency and temporal characteristics of the acoustic behavior and the genetic variation of the Black-crowned Antshrike in the island-mainland system of Gorgona-Colombia. Gorgona is a land-bridge island located 35 km off the Colombian Pacific coastline, which harbors a remarkable pattern of endemism (Fernández and Schaefer 2005; Giraldo et al. 2014). Two allopatric subspecies of the Black-crowned Antshrike, identified by subtle plumage coloration patterns, have been documented. T. atrinucha gorgonae, is presumed to be confined to Gorgona Island's rainforest (Thayer and Bangs 1905). T. atrinucha atrinucha, as suggested by Isler et al. (1997), is believed to inhabit tropical dry forests in Central America, extending from the Caribbean plain to Venezuela, including the inter-Andean valleys of Colombia. The specific form inhabiting the Pacific plain of Colombia and Ecuador (Chocó) remains unclear. Determining the distribution of each geographic species within the entire range is still pending, particularly concerning information on bird songs or genetic data.

Here we used song recordings and mitochondrial DNA from fieldwork conducted in previous years to investigate the geographic, genetic and acoustic patterns of this species. In addition, the potential influence of marginal learning on vocalizations suggests that we may anticipate some degree of relatable patterns of song and genetic divergence. Although our genetic dataset is small, it does offer a valuable independent set of data for further comparison with vocalizations. Mitogenomic data is a valuable tool for exploring the possible influence of vicariant events (Zink and Barrowclough 2008). It has also proven to be helpful in delineating boundaries within avian taxa (Gutiérrez-Zuluaga et al. 2021), including well-established Thamnophilus species (Thom and Aleixo 2015; Bolívar-Leguizamón et al. 2020). Given the challenges in objectively diagnosing certain species within the Thamnophilus genus based solely on plumage characteristics (Isler et al. 1997; Whitney et al. 2004; Bolívar-Leguizamón et al. 2020), the taxon under examination here may be polytypic. Thus, by integrating multiple types of data related to acoustic behavior, morphological traits, environmental factors and genetic data, we aim to examine a population in the 23-km2 insular landmass, which constitutes an ideal setting for studying the evolution of behaviors and communication signals.

Methods

Study area

We collected vocalization recordings and tissue samples to obtain genotypic data for T. atrinucha in Colombia. Gorgona Island (2°9' N and 78°2' W) is 8.5-km long and 2.5-km wide and has a maximum elevation of 338 m and constitutes the primary study area to compare with mainland areas of Central America and Colombia (Fig. 1). The geographic range of T. atrinucha spans a diverse array of environments, encompassing elevations that vary in annual rainfall ranging from 1.5 to 6.7 m per year. It is evident that the species' habitat exhibits a high degree of complexity throughout the entire range.

Fig. 1
figure 1

Recordings of songs were obtained from locations in close proximity to the sites where specimens for genotyping were collected. Specific areas in Colombia (CO) are denoted by numbers: 1-Gorgona Island (GO), 2-Guapi (GU), 3-Nariño (NA), 4-Quindío (QU), 5-Antioquia (CA), 6-Cundinamarca (CU), 7-Choco (CHO), and for 8-Panama (PA), 9-Nicaragua (NI), and 10-Belize (BZ). However, no song recordings were available from BZ, and DNA sequences from NI were also absent. Sequence data for the ND2 gene were sourced from GenBank for BZ and PA, while data for CO were sequenced de novo

Collection and sampling

In 2016 and 2020, we used 30 mm mesh mist nets to collect individuals of T. atrinucha; A 10-µL blood sample was obtained through micro-puncture of the brachial vein, which we preserved in Queen's lysis buffer (Seutin et al. 1991). Additionally, during the preparation of skin specimens, we extracted approximately 3 mm3 of liver, heart, and muscle´s tissue. These tissue samples were then preserved either in denatured ethanol or lysis buffer. Five samples from Colombia were generously loaned to us by the avian collection of Universidad de Nariño and the Alexander von Humboldt Institute, respectively. Given the requirement to obtain all available specimens for our study, incorporating tissue samples blindly was impractical. However, to minimize observer bias, we implemented a blinded approach when recording and analyzing the acoustic behavior of focal animals in the field. The list of specimens used in this study, location and their GenBank accession numbers can be found in Fig. 2 and Supporting Online Information Table S1.

Fig. 2
figure 2

Consensus gen tree obtained from the phylogenetic inference based on partial sequences of the ND2 gene of Thamnophilus atrinucha. The nodal support is shown as the posterior probability from the Bayesian inference and the percent bootstrap value for 5,000 samples from the maximum likelihood analysis. The tip nodes list provides the GenBank accession number and specific areas with the country code: Colombia (CO), 1-Gorgona Island (GO), 2-Guapi (GU), 3-Nariño (NA), 4-Quindío (QU), 5-Antioquia (CA), 6-Cundinamarca (CU), 7-Choco (CHO), and for 8-Panama (PA), 10-Belize (BZ), and an unknown area (Unkn). The visualization of the posterior distribution uncertainty in a cloud diagram (green). Loudsong dialects next to the tree, a sample of vocal variation among subclades, the heat map towards warmer colors suggests greater intensity of energy emitted. Distribution of T. atrinucha and color code for an approximation of the spatial distribution of subclades. Sequences of the ND2 gene obtained from GenBank (BZ and PA) and sequence data obtained de novo (CO). T. atrinucha gorgonae, photo courtesy of NPA

DNA extraction and PCR

We conducted a characterization of mitochondrial DNA NADH dehydrogenase subunit 2 gene sequences as a preliminary approach to investigate the geographic genetic structure in T. atrinucha. To achieve this, we utilized samples collected from a substantial portion of its range in Colombia, and we also obtained additional sequences representing populations in Central America from the GenBank. We performed genomic DNA extractions using the Thermo Scientific® package and then further produced amplicons of the mtDNA ND2 gene. The PCR used previously described primers (Brumfield and Edwards 2007) and its product separated by electrophoresis on a 2% agarose gel with Safe DNA Dye (Hydragreen™). Amplicons of the expected size (ca. 1 Kb) were then sent to Macrogen Inc., for purification and sequencing. Sequencing of the light and heavy strands of mtDNA was conducted using BigDye (Applied BiosystemsTM). We aligned the sequences with Sequencher version 4.7 (Gene Codes Corporation, Ann Arbor, MI, USA), by using conspecific sequences from Panama and Belize obtained from GenBank (Benson et al. 2012).

Genealogical inference

We performed a genealogical analysis using a dataset comprising 30 ND2 partial sequences from Gorgona Island and the mainland. Most of these sequences were collected in Colombia (n = 23), while the remainder were sourced from GenBank, including sequences from Belize (n = 3) and Panama (n = 4). We performed a maximum likelihood (ML) inference analysis using the PhyML 3.0 platform (Guindon et al. 2010). The selection of the nucleotide substitution model was carried out using MEGA-X (Kumar et al. 2018). A total of 5,000 samples were collected from the inference to calculate the percentage support for the tree nodes. The gene tree inference included sequences from T. bridgesi (EF030293) and T. bernardi (EF030285) as external groups (Brumfield and Edwards 2007).

We also performed a Bayesian inference (BI) analysis using BEAST v1.10.4 (Suchard et al. 2018). The same substitution model was implemented in the Bayesian inference (BI) and ML analysis. To prepare the sequence file for BI analysis, we used BEAUTi v1.10.4 (Suchard et al. 2018). We configured a coalescent model with a constant population size, segregating the data into two partitions: one for codon positions 1 and 2 and another for position 3. During the analysis, data were sampled at intervals of 2000 steps from a total chain length of 200 million steps. We calibrated the genealogy based on the established monophyly reported in prior studies for T. atrinucha, T. bridgesi, and T. bernardi (Brumfield and Edwards 2007). We enforced a strict molecular clock and applied a rate of evolution of 0.029 substitutions per million years to the ND2 gene (Lerner et al. 2011).

After completion of the BI analysis, we inspected the resulting posterior estimates in Tracer (Rambaut et al. 2018), and excluded 25% of the initial sampled data as Burn-in. A visualization of the posterior distribution of sampled data after Burn-in was inspected using the Densitree 2 (Bouckaert and Heled 2014). We randomly selected 3 k trees from the stationary posterior distribution using R software (R Development Core Team 2019). These trees were subsequently used to construct a consensus majority tree using Paup v 4.0a (Swofford 2003). To assess the congruence between our Bayesian Inference (BI) and Maximum Likelihood (ML) analyses, we utilized node support as a reference criterion, requiring values of posterior probability ≥ 0.95 for BI and ≥ 60% bootstrap support for ML. Visualization of consensus tree was carried out using MrEnt v2.5 (Zuccon and Zuccon 2014).

Population genetic structure

The patristic (p) pairwise genetic distance was implemented to determine the nucleotide differences between ND2 gene sequences. We identified haplogroups among ND2 sequences using p-distances as implemented in USEARCH algorithm (Edgar 2010) with a nucleotide similarity criterion ≥ 99.99%, following the methodology outlined by Gil-Vargas and Sedano-Cruz (2019) and computed using MEGA-X (Kumar et al. 2018). We assessed the variation within T. atrinucha sequences by calculating nucleotide diversity (π), the number of parsimony-informative characters, and the Tajima D index (Tajima 1989) using Arlequin v.3.5.2.2 (Excoffier and Lischer 2010). We conducted the molecular analysis of variance (AMOVA) to estimate the nucleotide variation fixation index (Φst), using 16,000 permutations (Excoffier and Lischer 2010). Utilizing the matrix of Φst values as a representation of a neutral marker among different geographic areas, we explored whether the genetic clusters observed were influenced by geographical distance (Bohonak and Roderick 2001). The distance among sampling localities was calculated using the Geographic Distance Matrix Generator v1.2.3 (Ersts 2014).

Phenotypic characterization

Collection of vocalizations recording

We sourced audio recordings from both publicly available and unpublished archives, currently housed within the HFR's inventory. These recordings featured songs from sites in close to, or from the very areas where we collected specimens for genotyping (Fig. 1, Table S2). In Colombia, we recorded sounds in WAV format at a sampling rate of 44,000 Hz and 16 bits using different equipment. Most of the recordings were done with an XM-55 directional microphone and TASCAM DR-22WL solid-state recorder but in the Cauca River Canyon, we employed a Sennheiser Me67/K6 microphone with a Marantz PMD661 recorder (Supporting Online Information Table S2).

We also examined the sounds of T. atrinucha from Panama and Nicaragua in Central America (loudsong recordings from Belize were not available), obtained from public repositories such as the Macaulay Library at the Cornell Lab of Ornithology and Xeno-Canto. The data from the Macaulay Library were originally in 16-bit, 44,000 Hz WAV format; therefore, we had to transform the Xeno-Canto songs into WAV format with the same characteristics. The use of compressed files, such as those mp3 in the Xeno-Canto repository, is a topic of debate. However, both Heath et al. (2021) and Araya-Salas et al. (2019) coincide in suggesting that audio file compression does not affect acoustic measurements but in those designed to measure extreme frequencies. At any rate, the use of compressed audio files has provided reliable results and conclusions in acoustic analysis (Mikula et al. 2021). To mitigate potential issues related to data compression, as noted in Araya-Salas et al. (2019), only type A and B recordings (loud and clear) were selected from Xeno-Canto, based on their quality rating system. Then, we visually inspected the recordings spectrograms for subsequent analysis. If the target signal was distinctly distinguishable from the background noise, it was then chosen for further analysis. All recordings were normalized before any acoustic analysis. In addition, each recording represented only one individual. We focused on the acoustic data analysis of the loudsong (Isler et al. 1997), which manifests as a vocalization of multiple notes repeated at regular intervals with a clear end note that differs from the preceding notes.

Song analysis

We analyzed the structure of each loudsong using Avisoft-SASLab Pro Software, v. 5.2.09 (Avisoft Bioacoustics, Berlin, Germany). To quantify the acoustic spectrum of each sample, we employed Avisoft's automatic signal selection mode and set a detection threshold of -33 dB. This threshold was chosen to create standardized sonograms based on each song's peak amplitude, reducing the likelihood of spurious results attributable to measurement artifacts (Brumm et al. 2017). Subsequently, we derived spectrum-based measurements for nine acoustic parameters: Note duration (Dur), number of notes in the loudsong (Notes), silent intervals between notes (Break), strophe duration (ST-D), maximum frequency (max), minimum frequency (min), frequency peak (peak), bandwidth (Bandwd), and entropy (Entropy). The latter makes it possible to calculate the information content of the song, described in terms of purity or tonality of the sounds, where 0 indicates tonal sounds and 1 indicates sounds that accumulate greater entropy. We also estimated the rate at which notes were produced, a composite parameter that results from dividing the notes on ST-D. The note rate and ST-D are two common indicators of avian vocal performance (see a review on Robert et al. 2022). Because the songs of the same individual in a recording usually present low variability or are identical, we calculated the average values of notes per song/individual ST-D (sequence of loudsongs in a recording) for subsequent analyses.

To compare the pattern of acoustic structure between these groups of interest (e.g. regions or clades), we used the Wilcoxon test, which allowed us to examine the differential distribution pattern of the acoustic signal parameters. For the Wilcoxon test, we used scaled data per song per individual. Averaged values were calculated for each locality and arranged in pairs between groups of interest for further analysis. We also performed a discriminant analysis to reduce the dimensionality among the first nine parameters used to describe the structure of loudsong and thus, to maximize the acoustic discrimination between spatial groupings.

We calculated mean values per acoustic variable/locality or area of interest. The mean values were obtained from the acoustic spectrum of the loudsongs from all individuals sampled. Then, we calculated the Euclidean distance between pairs of areas of interest as a measurement of dissimilarity in a multidimensional space. The Euclidean distance is defined here as the root of the squared difference between the values of any pair of entities (Rivera-Gutierrez et al. 2010). Subsequently, we used this measurement of acoustic dissimilarity in a correlation analysis with the geographic and genetic distance matrices using the Cochran–Mantel–Haenszel test. All statistical analyses were performed in R software (R Development Core Team 2019).

Generalized dissimilarity analysis

We implemented a Generalized Dissimilarity Model (Ferrier et al. 2007), to estimate how environmental factors and geographic distance relate to the acoustic variation in loudsong of T. atrinucha. We utilized the GDM package for R (version 1.5) as described by Mokany et al. (2022) to employ the paired Acoustic Euclidean Distance as the response variable. This composite metric is founded on nine vocal structure parameters, excluding note rate. Four environmental variables were considered as predictors: BIO1 (Annual Mean Temperature), BIO7 (Temperature Annual Range, calculated as BIO5 Maximum annual temperature minus BIO6 Minimum annual temperature), BIO12 (Annual Precipitation) and a measure of vegetation cover (Leprieur et al. 2000): the Normalized Vegetation Index (NDVI) (Jiang and Huete 2010). Environmental data were extracted from Worldclim V. 2.1 (Fick and Hijmans 2017), using the Raster package for R software (Hijmans and van Etten 2023).

Body size analysis

Four measurements including exposed Culmen, Wing chord, Tarsus and Tail length, were used as proxies for body size. We conducted principal component analysis (PCA) for males and females to visually assess body size variation between island and mainland populations. In addition, we used the U-Mann Whitney comparison of body size measurements between areas of interest, such a non-parametric test was chosen due to the limited data available from the mainland. Specimens were obtained from the Universidad of Valle, Universidad de Nariño, the Institute of Natural Sciences from the Universidad Nacional of Colombia, and historical records from Gorgona Island (Thayer and Bangs 1905).

Phylogeographic and ecological clustering

We implemented the Bayesian phylogeographic and ecological clustering (BPEC, Manolopoulou et al. 2020) to examine whether differences in genetically distinct clusters are related to transitions of acoustic signals, body size and environmental variables. This model relies on the assumption that dispersal events result in the formation of new clusters, and these clusters are shaped by a multitude of factors, including the geographic distribution of haplotypes, phenotypic and ecological constraints on gene flow. We used the R-package BPEC to evaluate haplotype networks, jointly estimating genetic clusters along with georeferenced data points, nine acoustic variables, four male morphological traits derived from the prior body size analysis, and four environmental variables also used from the previous GDM analysis. Each dataset is represented in BPEC as the average observed at a specific location near the site where mtDNA haplotypes were collected. To run BPEC, missing sites in our sequence alignment were excluded. We also excluded three sequences from Belize to match only localities from where acoustic data of loudsongs were available. We performed pre-runs incrementing the number of dispersal events, and continued until maxMig + 1 yielded no change in the clustering output. Pre-runs included analysis after three, six and 10 million iterations, and relaxing haplotype network parsimony criteria ds from 0–3 (Manolopoulou et al. 2020).

Results

Genetic diversity

From mtDNA sequences for the ND2 gene (Supporting Online Information Table S2), we obtained an alignment of 1,008 base pairs (out of 1,042 bp -the total size of the ND2 gene in Thamnophilus), with an open reading frame without disruptive stop codons, and found no evidence of nuclear DNA copies. In this alignment, we found that the partial sequences of the ND2 gene had < 5% missing data and 96% were invariant loci, whereas 3% were informative loci (n = 28) (excluding outgroups). The mean number of pairwise differences is 6.2 nucleotides and averaged over loci π = 0.007. The p-distance among sequences was on average 1.1% (range 0.0 – 2.2%), which is a shorter distance than that found in other conspecifics (Lacerda et al. 2007). A total of 17 haplotypes were identified in the sequence alignment (Supporting Online Information Table S1).

The most common haplotype has a range distribution in Panama and Colombia, for the areas of Chocó, Nariño, and Gorgona Island (n = 12 sequences). We identified a single haplotype restricted to Panama and 14 haplotypes restricted to Colombia. The sequences from Belize (n = 3) formed a single haplotype group. The neutrality test for the entire set of haplotypes was not rejected (Tajima's D =  − 1.31413; P = 0.086) implying an excess of low frequency polymorphism relative to the expected value.

Genealogical inference and dating

We used the alignment for the ND2 gene for the selection of a TN93 nucleotide substitution model (Tamura and Nei 1993). Maximum Likelihood (ML) inference showed strong support for the monophyly of T. atrinucha (log-likelihood score of -2005.64), as evidenced by a high bootstrap support. The ML analysis yielded results consistent with BI, as illustrated in Fig. 1. The divergence times for the ND2 gene in T. atrinucha can be traced back to the Pleistocene, with a broad confidence interval ranging from 0.03 to 4.88 million years ago. When considering the diversification of subclades with robust posterior probabilities and ML bootstrap support, it becomes evident that there was a significant overlap in node age at the 95% high posterior density interval (data not shown). Consequently, we lacked the statistical power to discern distinct divergence times among subclades.

Both the bootstrap from ML and the posterior probability (PP) estimation from BI supported a genealogy, which also highlights a geographically structured pattern characterized by three significant subclades (Fig. 2). The first subclade comprises haplotypes distributed within the westernmost area of the species' range, which includes areas in Belize, Panama, and the Pacific coast of Colombia, and the population found on Gorgona Island (Fig. 2). The second clade is composed of haplotypes distributed throughout the easternmost part of the inter-Andean region of Colombia (Eastern mountain range), hereafter Eastern region. The third clade comprises haplotypes distributed within Colombia's inter-Andean region, spanning from the lower Rio Cauca in Antioquia to the department of Quindío (Central mountain range of Colombia), hereafter Central region. Further, the spatial distribution of nucleotide variation was high (AMOVA Φst 1, 27 = 0.76601, Exact Test of individual distribution P = 0.00001), when contrasting the Pacific region (including the Gorgona Island´s population), with the broader Andean region of Colombia encapsulated by the Central and Eastern mountain ranges subclades.

Loudsong acoustic signals

The database comprised a total of 266 loudsongs gathered from 43 individuals of T. atrinucha. These recordings were made in the same geographical areas or nearby sites where genetic samples were collected for analysis. The loudsong lasted ≤ 2.5 s (Fig. 2). The Pacific region subclade displayed the longest and shortest songs in the entire dataset —Guapi and Gorgona Island, respectively— separated by just 60 km (Fig. 2). Despite the overall similar appearance of loudsongs across the species' entire range, distant populations likely exhibit differences in multiple acoustic parameters. However, it is difficult once in the field to objectively distinguish such differences in acoustics of the loudsong.

When comparing loudsongs within regions, particularly emphasizing the populations from Gorgona and Panama, distinctions emerged in most acoustic parameters (Fig. 3, Supporting Online Information Table S3). The acoustic signals of T. atrinucha from Gorgona Island showed differences in no fewer than seven parameters compared to other regions (Fig. 3, Supporting Online Information Table S4). In terms of performance, while the mean note duration fell within intermediate values when compared to other regions, birds from Gorgona Island showed the lowest number of notes and song rate, along with the longest breaks between notes. Additionally, birds from Gorgona Island sang at both the highest peak and maximum frequency, concurrently displaying one of the lower frequencies (Fig. 3, Supporting Online Information Table S4). This variance in frequency contributed to the broadest bandwidth in their acoustic signals in Gorgona Island (Fig. 3).

Fig. 3
figure 3

Regional comparison of Loudsong in Thamnophilus atrinucha among major subclades identified in the ND2 gene genealogy, specifically the eastern mountain range (Andes), the central range (Cauca), and the Pacific region. Moreover, the comparison offers detailed analyses of populations from Gorgona Island (T. atrinucha gorgonae) and Panama (T. atrinucha atrinucha), both situated within the Pacific region subclade (see Fig. 1). This comparison implement mean values and standard errors for each acoustic signal, encompassing the duration of the note (Dur), strophe duration (ST- D), number of notes in the strophe (Notes), frequency peak (peak), minimum frequency (min), maximum frequency (max), bandwidth (Bandwd), Entropy, and silence time between notes (Break)

Seven distinct parameters within the acoustic signal of T. atrinucha's loudsong provided a clear differentiation between the Central inter-Andean and Eastern mountain range regions. On average, loudsongs in the Eastern mountain range had lower acoustic frequencies (min, max, and peak), fewer notes, a generally reduced note rate, and a narrower bandwidth when compared to the songs from the Central mountain range region (Fig. 3, Supporting Online Information Table S4).

When comparing the loudsongs from the Pacific and Central inter-Andean regions, various key differences emerged. In the Central region, the maximum frequency of the song was notably lower, shorter in duration, notes were briefer, and, overall, the note rate was higher. Conversely, in the Pacific region, loudsongs tended to be longer and consist of more notes; however, the individual notes within the song were shorter in duration and were emitted at higher frequencies (peak and max.) (Fig. 3, Supporting Online Information Table S4). Despite these differences in the number of notes and strophe duration, both the Eastern mountain range and Pacific regions shared a similar note rate (Fig. 3, Supporting Online Information Table S4).

We used the spatial distribution of three major subclades in the ND2 gene tree to conduct a discriminant analysis of the loudsong. This analysis showed that the acoustic data was sufficient to describe the structure of loudsong in two discriminant functions of phylogeographic regions (df = 2; Wilks = 0.40658; approx. F22506 = 13.071; P < 0.001). These functions lower the dimensionality of nine acoustic signals (Supporting Online Information Fig. S1), and maximize the total variation observed between regions (subclades) established a priori (LD1 = 80.4%, LD2 = 19.6%). This resulted in the correct assignment of each song to its region of origin for 95% of the loudsongs in T. atrinucha. The discriminant pattern of the acoustic signals for the Central region (subclade) is located at a midpoint between the Eastern mountain range and the Pacific regions, which in turn do not overlap (Supporting Online Information Fig. S1).

Loudsong acoustics, genetic, geographic and environmental distances

Despite the approximate 600-km separation between some localities in the Pacific region, populations of Gorgona Island and Panama differed in all nine acoustic variables (Fig. 3, Supporting Online Information Table S3) and showed the closest genetic p-distance (0.09% ± 0.06). Conversely, with a greater genetic p-distance (0.21% ± 0.06) between Gorgona Island and mainland areas of Chocó, Guapi, and Nariño in the Pacific coast of Colombia, acoustic differences were evident, including seven variables (Fig. 3, Supporting Online Information Table S3). Notably, geographic distances among populations in the Pacific region of mainland Colombia and the land-bridge island span a shorter range of 60 to 300 km. It's noteworthy that six distinct acoustic signals effectively differentiated the populations in Panama from those in the mainland regions of the Pacific coast in Colombia. It's also important to emphasize that when comparing these mainland populations to Panama, a higher genetic p-distance of 0.21% ± 0.05 is observed, contrasting with the higher genetic similarity previously noted between Gorgona Island and Panama. Geographic and environmental distances explained roughly 27.3% of the variance on the acoustic Euclidean distance, according to the GDM analysis. The I-Spline coefficients showed that Temperature Annual Range (BIO7) and Geographic distance played the largest role explaining acoustic distance (BIO7: 0.524; Geographic distance: 0.427). In simple terms, the I-Spline coefficient represents the minimum "force" required to determine the best-fitted polynomial curve for each factor as a function of acoustic distance. While the directionality of acoustic response is a rather complex effect, all coefficients including those with a lower I-Spline value (NDVI: 0.165; BIO1: 0.159 and BIO12: 0.135) suggest to us that bioclimatic variables and vegetation structure, all contributed to explain the spatial variation in acoustic characteristics of the loudsong in T. atrinucha.

Body size between Gorgona Island and Mainland

We obtained body measurement data for 56 males and 48 females of T. atrinucha, of which 16 and 11 corresponded to males and females from Gorgona Island. The principal component analysis can account for as much as 37.3% of the variation in the four morphological measurements for males and 41.1% for females (Supporting Online Information Fig. S2). Males and females from the island tended to have larger bodied-size when compared to mainland populations from the Pacific and inter-Andean regions. Females in Gorgona Island had longer wing length, tarsi, and tails when compared to mainland (p < 0.05 for each variable among the available specimens). Male specimens from the island consistently showed larger culmen, wings, tarsi, and tails when compared to multiple mainland locations (p < 0.02 for each variable among the available specimens). When comparing male specimens from Gorgona Island, which is only 60 km away from Guapi, an evident difference in exposed culmen length becomes apparent. In contrast, for females, the variation in culmen length was more subtle, indicating a marginal level of difference (p < 0.053).

Phylogeographic, phenotypic and ecological clustering

The parsimony network obtained using BPEC supports three geographically distinct clusters of T. atrinucha (PP = 1.0) (Fig. 4, Supporting Online Information Fig. S3). The spatial distinctiveness among these genetic clusters varies significantly and was influenced by nine acoustic variables, four morphological measurements for males, three bioclimatic variables, and data pertaining to vegetation structure. The first cluster clearly comprised haplotypes sampled from Gorgona island, Guapi and Nariño (SW-Pacific Colombian coast). The second cluster was composed of haplotypes with distribution in Panama and northern Chocó. The third was an Andean cluster that comprises haplotypes from Central and Eastern mountain ranges of Colombia (Fig. 4, Supporting Online Information Fig. S3). Nonetheless, BPEC strongly suggested that it is difficult to separate the latter from the Panama and the northern Chocó cluster (PP = 0.64). The transitions between clusters were driven by differences in means of the acoustic, body size and environmental variables rather than their variances. The modulation of acoustic variation reveals a trend that is not strictly unidirectional, as shown by variation in spatial transitions among clusters in the nine acoustic signals of the loudsong. According to BPEC the parsimony network lacked any loops to be resolved, which may lead to equivocal connections between haplotypes (Manolopoulou et al. 2020). However, the clusters do not correspond strictly with the spatial distribution of the primary subclades identified in the phylogenetic inference (Fig. 2, 4). This outcome discrepancy was unsurprising, as it aligns with the inherently distinct approach of integrating multiple data types within BPEC.

Fig. 4
figure 4

Distribution of covariates for each phylogeographic cluster pertaining to Thamnophilus atrinucha dataset using BPEC. The colors correspond to the same clusters depicted in the contour map (top left panel). Panels show spatial transitions involving acoustic signals, body size, and environmental variables among clusters. The shaded area correspond to the 5% and 95% credibility intervals for each cluster, while the solid lines denote the median for each point. The presented subset comprises two acoustic signals, Note and Bandwidth (Bandwd), in addition to measurements of exposed Culmen and Wing chord for males, along with Temperature Annual Range. A comprehensive set of 17 variables were implemented in the full BPEC analysis (Supporting Online Material Fig. S3). This set included nine acoustic parameters, four morphological traits, four remote sensing variables associated with environmental factors, and an index of vegetation structure

Discussion

Our study on the variation in loudsong acoustic signals covered the entire range of a non-vocal learning Thamnophilid species. In particular, we found evidence of evolutionary changes in loudsong between island and mainland populations within T. atrinucha. In this species the mtDNA ND2 gene form a monophyletic group and exhibit non-paraphyletic relationships with T. bridgesi and T. bernardi (external groups), contributing to a more comprehensive understanding of the genus Thamnophilus (Brumfield and Edwards 2007). While we acknowledge the limitations associated with relying solely on a single maternally inherited genetic marker for drawing conclusions about the genealogical process (see a study case in Palacios et al. 2023), our genetic data, at the very least, provides a supportive genealogical framework for analyzing the acoustic variation in loudsongs between Gorgona Island and mainland populations.

We documented clear differences in acoustic variables of T. atrinucha using as a line of reference three well-supported mtDNA lineages in the discrimination analysis (Supporting Online Information Fig. S1). On the contrary, Bayesian phylogeographic and ecological clustering does not involve reducing the dimensionality of multiple acoustic signals into discriminant functions. It also does not depend on any preconceived assumptions about spatial groupings, as is the case in a discriminant function analysis. The BPEC analysis reveals spatially structured clusters based on genetic, geographic, acoustic, morphological and environmental data. We highlight the observed shifts in the relative significance of individual variables across space (Fig. 4, Supporting Online Information Fig. S3). This underscores the complex spatial relationship between song structure and the varying contribution of each variable, a connection previously identified through the Generalized Dissimilarity Model.

Our integrative approach of multiple types of data was instrumental for the phylogeographic and ecological clustering of T. atrinucha. These clusters imply an unambiguous pattern of divergence if one considers that differences in two or more characteristics of the acoustic signals as sufficient criteria to distinguish between species of the genus Thamnophilus (Isler et al. 1997, 1998, 2012) and in other avian groups (Cadena and Cuervo 2010; Lara et al. 2012). Here, the distributional range of T. a. gorgonae is being extended beyond Gorgona Island to nearby areas along the SW-Pacific Colombian coast. According to our BPEC analysis, there is support for the unambiguous differentiation of T. a. atrinucha in Panama from T. a. gorgonae. From our results, the ND2 gene tree, acoustic, body size and environmental data strongly imply that a polytypic classification (Clements et al. 2016; IOC version 7.2, 2017) divided only into T. a atrinucha (Salvin and Godman 1892) and T. a. gorgonae (Thayer and Bangs 1905) appears insufficient. These two well differentiated geographic populations are part of the same subclade within the Pacific region and Central America, but additional subclades in the gene tree minimally imply that there are likely unrecognized populations elsewhere. It is worth recognizing that the number of samples for some areas was smaller than we would have intended. Nevertheless, we emphasize that our primary goal was to investigate the hypothesis of population divergence in Gorgona Island; we believe that the size of our sample adequately served its intended purpose.

Loudsong structure within the Pacific region, where T. atrinucha in Gorgona Island belongs to, does not seem restricted in terms of their genetic distance for the loss or gain of signals in their acoustic repertoire (Fig. 4). Our main finding is that two vocal attributes of loudsong in T. atrinucha inhabiting a land-bridge island are consistent with predicted patterns of avian song divergence between oceanic islands and mainland (Robert et al. 2021, 2022). The Gorgona Island population with larger bodied-size individuals produces a loudsong with the largest increase in frequency bandwidth of acoustic signals and the lowest note rate. Here, birds´ loudsong manifestation appears relaxed showing the lowest performance (song rate), which is achieved through the longest breaks between notes.

Because the number of notes is a phylogenetically informative trait in species with unlearned vocalizations (McCracken and Sheldon 1997; Carneiro et al. 2019); we argue that the lowest number of notes (but longer notes) of T. atrinucha in Gorgona Island is a reasonable proxy to describe the species´ simplest stereotyped loudsong. Although the description of complexity for vocal phenotypes is a topic still under discussion and nowhere near a consensus (Pearse et al. 2018; Najar and Benedict 2019); the diversity of notes has been proposed to be an informative variable when the number and variety of the notes are considered (Wolf 1977). The lowest note rate and the short duration of the loudsong of T. atrinucha in Gorgona Island add to the notion that the acoustic structure in the island´s endemic is the least complex Loudsong, in support of the argument of a relaxed vocal performance in the island.

Abiotic and biotic context to Loudsong

Regarding local restrictions, it is reasonable to speculate that environmental heterogeneity, such as variations in rainfall, temperature, or habitat structure, could influence not only the structure of avian communities (Boncoraglio and Saino 2007; Brown et al. 2017), but also the aspects related to the effectiveness of acoustic communication. Our GDM and BPEC analyses both indicate that bioclimatic variables, vegetation structure, and geographic distance may significantly influence the spatial acoustic variation of T. atrinucha. The shaping of acoustic variation among clusters exhibits a non-unidirectional trend, evident in spatial transitions across nine acoustic signals of the loudsong. Specifically, in populations along the southwest Pacific coast of Colombia (including Gorgona Island), the increased frequency bandwidth of acoustic signals, when compared to other clusters, may be linked to transitions toward lower annual temperature ranges (BIO7) and reduced measurements of vegetation cover (NDVI) (Supporting Online Material Fig. S3). How these and other biotic factors relate to loudsong divergence awaits further research. Subsequent studies may examine if the increased frequency bandwidth of acoustic signals is mediated by environmental restrictions or local noise near the shore on the 23-km2 island. An increase in the frequency of sound emission in birds has been suggested as an adaptive strategy for avoiding acoustic interference caused by environmental constraints, including ocean noise (Sebastianelli et al. 2021) or perhaps the omnipresent sound of insects, which are an important generator of noise in tropical rainforests (Luther and Gentry 2013).

However, even when the acoustic signal of T. atrinucha is not distorted by a rainier environment, the acoustic interference from other species in the community may be different and may promote acoustic acclimatization (Luther 2009) or song evolution to reduce interference. A forest depleted of bird sounds characterizes Gorgona Island compared with the loud choruses of birds in the forest by the coast, home to 20 times richer avian fauna than that found on the island (RES-C pers. obs.). In a context such as this, birds on the island would have the opportunity to emit vocalizations using a less demanding portion of the acoustic spectrum (Torres et al. 2020). To us, the loudsong phenomena in Gorgona Island would be consistent with acoustic relaxation of song, in support of the character release hypothesis for avian vocalizations in islands (Robert et al. 2021).

Changes in the area of Gorgona Island and its relationship with the coastal line as a result of recurrent changes in the sea level cannot be discounted considering that only 35 km of maritime gap separates the mainland. The areal reduction would possibly promote the restricted scattering or cycles of local isolation as geographic permeability for dispersal had been suggested to influence the range limits of many birds of the northern Andes (Graham et al. 2010; Valderrama et al. 2014). There are also important characteristics of T. atrinucha that presumably relate to the observed pattern presented here, including high dwelling fidelity and low post-fledging dispersal (Tarwater 2012; Tarwater et al. 2013) and the possible restriction to dispersal by specialization as documented in other species of the genus Thamnophilus (Choueri et al. 2017). All these factors would lead to a historical pattern of differential accumulation of acoustic signals and of neutral genetic variation as documented here in a non-vocal learning suboscine.

It is tempting to hypothesize that local divergence of loudsong in Gorgona Island has been promoted primarily in isolation. The local effect of a reduction in the acoustic interference of the bird community cannot be ruled out (Capelli et al. 2020; Torres et al. 2020). We suggest field experiments to examine whether the acoustic divergence of T. atrinucha may be coupled with a rapid discrimination of communication signals from the mainland and the selection phenomenon for better signal transmission (Barker 2008). Acoustic divergence may also avoid acoustic interference within a lineage that share the same habitat type (Slabbekoorn and Smith 2002; Cicero et al. 2020), perhaps by a subtle adjustment of the signals during the crystallization of the song (Touchton et al. 2014). Even slight variations in innate song can contribute to the specific recognition (Macedo et al. 2019). Innate vocalizations are expected to have slowly accumulated small mutations in response to local adaptation (Laiolo and Rolando 2002; Nicholls and Goldizen 2006; Tobias et al. 2010). Our results collectively suggest that invoking neutral evolution of the loudsong in the light of our genetic, acoustic, body size and environmental data seems insufficient to explain the geographic variation of loudsong among closely related populations of T. atrinucha.