Introduction

In the absence of high-quality biodiversity data, land-use planners and conservationists often rely on biodiversity surrogates (Margules and Pressey 2000; Pinto et al. 2008), which are species, groups of species, or environmental variables used as proxies to represent broader aspects of biodiversity (Margules and Pressey 2000; Pinto et al. 2008). Use of biodiversity surrogates requires a trade-off between the surrogate’s ability to reflect the distribution of unsampled species and the cost of sampling species in the field (Gardner et al. 2008; Lansac-Tôha et al. 2022). This approach has been applied to numerous taxonomic groups in terrestrial (Corcos et al. 2021; Verdinelli et al. 2022) and aquatic ecosystems (Pakulnicka et al. 2015; Spigoloni et al. 2022). If a surrogate is identified, the results could be utilized to prioritize locations or actions for conservation purposes (Nogueira et al. 2023).

Evaluating the effectiveness of surrogates in conservation planning is challenging (Fattorini et al. 2012; Covre et al. 2022) since it necessitates the availability of the same extensive datasets whose absence initially justified the use of surrogates. For example, in the tropics, where endemic species are common and little information is available regarding the life history and ecology of most taxa, the surrogate approach has been recommended, even for relatively species-poor communities and relatively small areas (Wiens et al. 2008). However, studies often propose surrogates without considering who will use them and why they are better than direct measures (Caro 2010).

In cases where surrogates appear to underestimate complementarity, assessing the effectiveness of frequently used surrogates or establishing new ones may be necessary. In the Amazon, data on the distributions of various taxonomic groups are still too incomplete to serve as the foundation for systematic conservation planning (Carvalho et al. 2023). As a result, conservation targets aiming to maximize complementarity between protected areas have been established based on untested assumptions about the environmental niches of most species. These targets typically rely on ecoregions, vegetation types, and soil types as biodiversity surrogates (Peres 2005). However, the use of general classifications, such as broadscale environmental data, is not always useful because biological processes and ecological interactions can operate at different scales due to the non-uniform geographical distribution of environmental conditions (McGill 2010; Lu and Jetz 2023). Moreover, relying solely on broadscale environmental data may obscure important relationships between species and their environment (Qian and Kissling 2010; Monteiro et al. 2023).

This appears to be the case for the Purus-Madeira interfluve located in southwestern Amazonia; it is a large, forested area extending over 800 km where maps of vegetation types and geology show a relatively homogeneous landscape (Olson et al. 2001). However, biological surveys have revealed compositional heterogeneity across the region in many taxonomic groups, including ants (Baccaro et al. 2013), bats (Marciente et al. 2015), trees (Schietti et al. 2016), lizards (Peixoto et al. 2019), fishes (Stegmann et al. 2019), and dung beetles (Salomão et al. 2022). These results suggest that the effectiveness of using broadscale environmental data as biodiversity surrogates in large forested areas, such as the Amazon, could be inconsistent. Therefore, it becomes necessary to identify alternative and effective biodiversity surrogates to support conservation efforts in the region.

One potential alternative is considering the biotic interactions between taxonomic groups (Cosentino et al. 2023; Bazzato et al. 2023). For example, if biotic interactions between birds and palms are significant, then variations in bird species composition - defined by the presence or absence of specific bird species - would correspond with changes in the abundance of palm species. This expectation could be justified because birds interact with palm species in various ways, utilizing them as food resources, nesting, roosting and perching sites (Snow 1981; Brightsmith and Cáceres 2017; van der Hoek et al. 2019; Glória and Tozetti 2021). The extent to which birds exploit different plant species depends on their specific characteristics, such as whether they are canopy or understory birds (Bradfer-Lawrence et al. 2018; Lima et al. 2023). On the other hand, the composition of palm species changes in response to environmental variation (Sousa et al. 2020; Muscarella et al. 2020), with some species showing close associations with specific environmental characteristics. For instance, the presence of Oenocarpus bacaba palm species indicates well-drained soils, while Lepidocaryum tenue is more commonly found in wet conditions (Cohn-Haft et al. 2007).

To assess the potential of using data on palms as a substitute for bird assemblages, we compared the relationship between the distribution of 26 bird species, which can be easily-detected either visually or by ear, and which belong to different taxonomic groups, with the distribution of the five most abundant palm species along the Purus-Madeira interfluve (Lepidocaryum tenue, Attalea speciosa, Mauritiella aculeata, Oenocarpus bataua, Oenocarpus bacaba). To determine whether the composition of bird assemblages is more strongly correlated with changes in palm-species abundance than with environmental factors (such as precipitation, water-table levels, sand and clay content) that commonly affect species distributions in this region, we also compared these relationships to the ecoregions proposed by Ximenes et al. (2021), who considered the region to be relatively heterogeneous. If such a relationship is found, palm species, which are sedentary and potentially easier to sample, could serve as cost-effective and efficient biodiversity surrogates for birds. Moreover, these palm species are abundant in the region and can be easily identified and monitored by non-specialists, such as those in local communities.

Methods

Study area

The study area is located between the Purus and Madeira rivers, in Brazilian Amazonia (Fig. 1). The northern part of the interfluve is comprised of three geomorphological units: alluvial terraces, where streams are dammed by the large rivers during the high-water season; mega-slopes, which are deeply dissected by stream meanders and flood temporarily after heavy rainfall; and mega-plateaus, which are higher, poorly drained areas featuring irregular microtopography and temporary pools (Brasil 1978). Vegetation type varies from dense forests in the north to more open, palm-dominated forests in the south (Brasil 1978). According to the ecoregions map by Olson et al. (2001), the Purus-Madeira interfluve region is dominated by the Purus-Madeira Moist Forests. For a more detailed classification incorporating bioclimatic, spatial, and topographic variations, see Ximenes et al. (2021).

Fig. 1
figure 1

Location and vegetation types of the Purus-Madeira interfluve in Brazilian Amazonia and data collection points along the BR-319 highway (M01-M11)

Sampling design

We collected data at 11 sites (M01 – M11, Fig. 1) along the BR-319 road, located in the Purus-Madeira interfluve. Sites were spaced roughly 60 km apart, forming a 670 km transect. At each site, a 5 km trail extended perpendicularly from the highway. Five 250 m-long transects were located at 1 km intervals along this trail, with the long axis of the plots following topographic contour lines. Bird data were collected at a point in the center of each transect and palm data were collected throughout each transect.

Water-table level, sand, and clay content were collected in situ and followed the sampling protocols applied by the Biodiversity Research Program in Western Amazonia (PPBio-AmOc). Water-table level was based on the mean of seven measurements of distance from the surface to ground water during the months of March, July and November 2011; March, August, October and December 2012; and March 2013. Positive values indicate that the water level was above ground. Soil samples (sand and clay content) were collected to a depth of 0.30 m every 50 m along the 250 m-long transects, giving six samples per transect. Sand and clay content in each plot was obtained following the standard protocol proposed by Donagema et al. (2011). Precipitation data were based on the annual precipitation (BIO12) obtained from the WorldClim database (Fick and Hijmans 2017).

The ecoregions proposed by Ximenes et al. (2021), which clustered the Purus-Madeira interfluve into 14 regions (Supplementary Information – Fig. S1), were based on latitude, altitude, drainage density, and soil characteristics. These variables were identified by the author as the most descriptive of environmental heterogeneity in the region relevant to biodiversity.

Due to the region’s rich bird diversity, we focused on 26 species (Supplementary Information – Table S1) easily-detected visually and by ear that covered a broad range of feeding guilds, body sizes, and use of different forest strata (canopy and understory). These species include nearly all guilds in the regional avifauna. We collected data between June and November 2010, a time of year when birds in the region reproduce and vocalize more frequently. Each bird-sampling point was visited for four consecutive days, and the order in which sites were visited was modified each day. Sampling each point repeatedly increased the probability of detecting species (Dorazio and Royle 2005; Mackenzie and Royle 2005), while sampling points in a different sequence each day removed the bias of detecting species that only call early in the morning. Between 5:30 AM and 9:30 AM, peak hours for bird activity, the same observer recorded all species detected via visual or auditory records at each sampling point for five minutes in an unlimited detection radius (Blake 1992; Vielliard 2000; Dorazio et al. 2006). Audio recordings were taken at all sampling points and deposited in the Sound Library (Acervo Sonoro) of the Bird Collection of the Instituto Nacional de Pesquisas da Amazônia.

We measured the abundance of five palm species that are often considered to be indicators of bird communities: Lepidocaryum tenue, Oenocarpus bataua, Oenocarpus bacaba, Mauritiella aculeata, and Attalea speciosa (Cohn-Haft et al. 2007). We collected palm-composition data in the same 250 m-long transects where the bird-sampling points were located. For clonal palm species, we considered each stem to be an individual.

Data analyses

We applied different statistical techniques to test whether the overall bird-species groups (all species, canopy birds, and understory birds) composition can be more accurately predicted by palm-species abundance (i.e., the five most abundant palm species), than environmental factors (precipitation, water-table level, sand, and clay contents) or ecoregions (Ximenes et al. 2021).

We fitted multivariate models based on the overall presence-absence data of bird-species, assuming that the response variable follows a binomial distribution, and used the “cloglog” link function to relate the predictors (palm-species abundance, environmental factors and ecoregions) to the response variable. To quantify the potential use of palm data as a substitute for bird assemblages, we compared the reduction in covariation among models. This metric is based on measurement of the sizes of the factor loadings and correlations in each model, estimated via the co-occurrence patterns using the Copula model (Popovic et al. 2022); a model-based approach that allows the construction of an ordination of the observations, where the taxa correlation is described using latent factors (i.e., unobserved variables that are inferred from the observed data) and modeled considering the data properties.

Additionally, we used Procrustes analysis(Jackson 1995; Peres-Neto and Jackson 2001) to statistically compare the similarity of overall bird-species ordination results with palm-species and environmental-factor ordinations. If such similarity is found, the superimposed ordination could potentially serve as effective biodiversity surrogates for birds. To characterize the variation in bird-species composition and palm-species abundance, we also used ordinations from the Copula model to visualize the main patterns in the multivariate datasets (Warton 2022; Popovic et al. 2022). For bird species, we fitted the multivariate models as previously described. For palm-species abundance, we used the same method, but assumed that the response variable follows a negative-binomial distribution and used the “log” link function. To summarize the variation in environmental factors, we used Principal Components Analysis (Pearson 1901). To assess the statistical significance of the Procrustes analysis, we used a permutation test (Number of permutation = 1000).

All analyses were undertaken in R version 4.2.2 (R Core Team 2022) using the packages ecoCopula (Popovic et al. 2019), mvabund version 4.2.1 (Wang et al. 2012) and vegan version 2.6-4 (Oksanen et al. 2022).

Results

The multivariate models fitted based on the overall presence-absence data for bird-species showed that palm-species abundance was the best predictor for the composition of all birds (BIC: 1266.24) and for canopy birds only (BIC: 704.98) (Table 1). For understory birds, environmental factors provided the best model (BIC: 684.69). All these models performed better than the ecoregions proposed by Ximenes et al. (2021). These results are qualitatively similar to those comparing the reduction of covariation among the analyses. The sizes of the factor loadings and correlations were in general lower for models with palm-species abundance as the predictor than other models, but only for all and canopy birds (Table 1); for understory birds, environmental factors were the best predictors of covariation in bird-species composition (Table 1).

Table 1 Multivariate models were fitted based on the overall presence-absence data of bird-species and the sizes of the factor loadings and correlations estimated via the co-occurrence patterns using the Copula model. The percentage values within the parentheses indicate the reduction in covariation among the variables in the analysis when the predictor was included in the model. A plus signal in front of the number indicates an increase in the covariation

The Procrustes analysis (Fig. 2; Table 2) showed that distributions of palm-species abundance (Supplementary Information – Fig. S2) could potentially serve as effective biodiversity surrogates for the distribution of bird-species (Supplementary Information – Fig. S3), but with a few limitations. The palm-species reflected the distributions of the dataset including all birds and only understory birds (Supplementary Information – Fig. S3). This analysis did not detect a significant relationship between the variation in the environmental factors (Supplementary Information – Fig. S2) and any bird functional group (Fig. 2; Table 2).

Fig. 2
figure 2

The relationships between the ordinations of bird-species, palm-species co-occurrence and environmental factors expressed by Procrustes analysis. The color represents the samples taken in dense forest (dark green) and open forest (light green) along the Purus-Madeira interfluve. The circles and diamonds represent the 250-m plots for birds and palms, respectively. The lines represent the connection between the target matrix and the rotated matrix in the analysis. The letters A to F represent the relationships between bird-species (i.e., target matrix), palm-species and environmental factors (i.e., rotated matrix): A) All birds and environmental factors; B) All birds and palm-species abundance; C) Canopy birds and environmental factors; D) Canopy birds and palm-species abundance; E) Understory birds and environmental factors; F) Understory birds and palm-species abundance

Table 2 Results of the procrustes analysis, showing the statistical significance (p ≤ 0.05) of the associations. “SQ” represents the sum of squared differences between the two matrices (target and rotated matrices), while “R” represents the correlation-like statistic. The “P-value” indicates the significance of the Procrustes statistic based on a permutation test with 1000 permutations

Discussion

Our results reveal an alternative and generally effective biodiversity surrogate in the Amazon that could support conservation efforts. This indicates that biotic interactions between two taxonomic groups may serve as a promising strategy to identify biodiversity surrogates. The observed variation in bird composition in the region was better predicted by palm-species abundance than environmental predictors or ecoregions. While this study did not test other surrogates that imply the study region to be homogeneous, they would be incapable of predicting variation in bird composition across the study area.

Palms may represent an important resource for vertebrates and invertebrates (Zona and Henderson 1989; Kahn and Granville 1992), especially when other forest resources are scarce (Peres 1994). Palms are also an important structural element in the region, where they can reach densities approximately four times higher than in other areas of Central Amazonia. The variation in palm composition between localities likely mirrors finer-scale environmental differences. Such local, fine-scale environmental variations, as indicated by palm composition, may have a more profound influence on bird populations than the environmental-surrogates-based on broad-scale-environmental data.

The importance of considering biotic interactions in studies for conservation planning should be emphasized, mainly because several studies have shown that biotic interactions (e.g., consumer–resource) have a significant effect on predicting the species distribution, often lowering the relative importance of environmental variables (Wang et al. 2018; Bazzato et al. 2023). Moreover, other studies have demonstrated that including biotic features can improve model performance to identify priority areas for conservation (Jenkins et al. 2020; Nogueira et al. 2023). The identification and comprehension of these relationships are urgent, because many conservations decisions guided by species distribution models (Guisan et al. 2013) underestimate the importance of biotic interactions in shaping species distributions.

Although there is a well-documented strong relationship, using palms as surrogates for birds in systematic conservation planning on a large scale in the Amazon has its limitations. Knowledge about the distribution and abundance of palm species is not available for the entire Amazon. However, alternative methods, such as remote sensing, have improved palm inventories and show promise as a strategy for understanding their distribution and diversity (Wagner et al. 2020). This strategy could highlight discrepancies between the actual number of species in a given area and the number expected based on habitat suitability, resources, and environmental conditions. Nonetheless, the taxonomy of palm species is relatively well understood, and local people are capable of conducting common palm species surveys efficiently.

Land-use planning is not only a central-government activity. Citizen scientists are often involved in data acquisition and subsequent planning (Garretson et al. 2023; Jesus et al. 2023), but they often do not have the experience to sample assemblages potentially consisting of hundreds of species. Involvement of local communities is facilitated if they can initially focus on species which they are already familiar with and use as resources, such as the palm species. The inclusion of these people in monitoring biodiversity could provide an inexpensive and potentially large labour force (Fa and Luiselli 2023). Initiatives that consider collaborative sampling by amateurs could be used in conjunction with traditional methods to evaluate the distribution of endangered species (Adamantopoulou et al. 2023) and conservation planning (Soteropoulos et al. 2021). The data provided by knowledgeable amateurs, could also reveal ecological patterns, enhancing our understanding in how biodiversity relates to environment (Devictor et al. 2010; Campbell et al. 2023; Garretson et al. 2023; Jesus et al. 2023). Furthermore, local people are often interested in sampling biodiversity in a purposeful manner, especially if the sample will contribute to scientific knowledge and decisions about land use (Thompson et al. 2023).

Given that the areas in our study have similar geomorphology, but showed different bird composition among the sites, using geomorphology as a surrogate to determine bird conservation priorities in the region will tend to underestimate the number of unique assemblages meriting protection. Even the ecoregions proposed by Ximenes et al. (2021), who considered the region to be relatively heterogeneous, are not good indicators of bird-species distributions. These surrogates frequently fail to track variation in conditions that are important for bird species, perhaps because such conditions vary at different spatial scales than the surrogates. Geomorphology and ecoregions may be useful for determining conservation priorities at large scales (Sano et al. 2019), but they can fail to capture between-site variation at fine scales, such as that captured by the biotic interaction between bird and palm species that we have studied.

In conclusion, identifying biodiversity surrogates, considering biotic interactions and incorporating local people in monitoring biodiversity programs, could be a useful tool for enhancing conservation and land-use planning across the Amazon basin and potentially in other parts of the world; especially when surrogates established based on assumptions about the environmental niches of most species have proved to be ineffective in predicting the species distributions at fine scales. Future studies should also incorporate other potential biotic interactions (Morales-Castilla et al. 2015) that are recognized to play a role in shaping the community composition and may have important consequences at an ecosystem level. Additionally, the reliability of conclusions regarding the identification of biodiversity surrogates may depend on both the taxonomic reorganization of species assemblages (e.g., into guilds) and the statistical methods employed. Taking these factors into consideration can improve our knowledge about biodiversity surrogates and could avoid inappropriate conservation recommendations.