1 Introduction

Mourella caerulea (Friese 1900) is the only species of the stingless bee genus Mourella Schwarz 1946 (Camargo and Wittmann 1989; Camargo and Pedro 2013). Its distribution is mainly associated to Pampa biome (Rio Grande do Sul State, in South Brazil, Argentine, and Paraguay), with a few records in areas of Campos de Cima da Serra from southern states of Brazil (Camargo and Wittmann 1989; Wittmann and Hoffman 1990; Silveira et al. 2002). This species nests on the ground (~ 50 cm from surface) and is an important pollinator of natural and agricultural ecosystems. It is known to visit flowers of onions, carrots, coriander (Witter and Blochtein 2003), canola (Halinski et al. 2015), and also native plants of the Pampa (Camargo and Wittmann 1989), and thus is an economically and ecologically essential species, necessary for native flora maintenance.

Due to its nesting habit, M. caerulea is a species vulnerable to inadequate soil management, standard practice in an environment where the main economic activities are agriculture and livestock.

The Pampa biome is divided into four physiographic regions: Campanha, with a predominance of grasslands, continental climate, and average altitude ranging from 60 to 120 m; Depressão Central, which is a transition area, close to Atlantic Rain Forest, with altitude usually inferior to 100 m; Serra do Sudeste, with an altitude ranging from 20 to 200 m with shrub vegetation, and height from 400 to 600 m with grasslands. It is also possible to find small forest vegetation; the last one is Encosta do Sudeste, which is near the great lakes, and its average altitude is less than 30 m, with a few hills up to 200 m and vegetation similar to the Serra do Sudeste region (Fortes 1979; Roesch et al. 2009). According to the Brazilian government, between 1970 and 2009, the Pampa area reduced more than 36% of its original size (MMA 2012).

Mourella caerulea was described in 1900, and the description of its nest was made only in 1989, from four colonies of Canguçu, Rio Grande do Sul (Camargo and Wittmann 1989). Up to now, nothing is known about the intraspecific diversity of this species and its relation with the Pampa biome.

In this sense, our objective was to characterize and map the population variability of the Pampa endemic stingless bee M. caerulea in the Rio Grande do Sul State, Brazil. Thus, we expected to provide valuable data into M. caerulea population structure and new information for its conservation. To achieve this objective, we used (1) geometric morphometrics of wings, to evaluate the morphological variation in population (Francoy and Imperatriz-Fonseca 2010); (2) cuticular hydrocarbon (CHC) profiles, which allow us to verify genetic (Falcón et al. 2014) and environmental effects (Howard and Blomquist 2005); and (3) the analysis of mitochondrial DNA, through nucleotide variation of the gene Cytochrome oxidase I (COI), commonly used to verify genetic variability inside Meliponini population (Arias et al. 2003).

2 Material and methods

2.1 Sampling

We collected workers of M. caerulea from 24 natural nests distributed in the following localities: 14 colonies from Alegrete (29° 47′ 5.29″ S 55° 46′ 32.37″ W), 3 from Candiota (31° 35′ 59.99″ S 53° 43′ 59.94″ W), 6 from Turuçu (31° 25′ 18.74″ S 52° 10′ 39.86″ W), and 1 from Viamão (30° 05′ 18.09″ S 51° 01′ 25.64″ W). Additionally, we collected bees visiting flowers in Bagé (31° 19′ 13.96″ S 54° 10′ 24.42″ W), from two places 8 km apart from each other (Fig. 1). As each analysis used a different number of individuals, these amounts will be described in the following sections.

Figure 1.
figure 1

Map of Brazil and in detail the collecting sites of Mourella caerulea in the state of Rio Grande do Sul, Brazil.

2.2 Morphometric analysis

For this analysis, we used approximately 20 individuals per nest, except for the two different localities in Bagé, from which we used around 30 individuals per sampling locality (Table I). We removed the right forewing of each worker, placed it between microscope slides and photographed each wing. A .tps file was created using the software tpsUtil v. 1.40 (Rohlf 2009), and 13 landmarks were manually plotted in the intersections of wing veins (Fig. 2) using the software tpsDig v. 2.12 (Rohlf 2008). The landmark configurations were Procrustes aligned (Klingenberg and McIntyre 1998) using MorphoJ v. 1.03a (Klingenberg 2011), which also generates canonical variate analysis (CVA), Mahalanobis distances between the centroids of each group of samples, and pairwise discriminant analysis. MEGA v. 5.05 (Tamura et al. 2011) was used to build a neighbor-joining dendogram based on the Mahalanobis distances between the centroids of the distributions. To verify the correlation between the morphological and geographical distances of the localities, we performed a Mantel test using TFPGA (Miller 1997).

Table I Number of bees per nest used in geometric morphometric analysis
Figure 2.
figure 2

Right forewing of M. caerulea worker. Black dots represent the position of the 13 landmarks in the cell veins.

2.3 CHC profiles

CHC extraction, analysis, peak identification, normalization of relative peak area and derivatization reaction were made as described earlier (Falcón et al. 2014) with two adaptations: the volume for CHC extraction was 1 ml, and the re-suspended volume was 50 μl. We performed a PCA (R software v. 3.0.0) to check each CHC peak contribution for the differences among localities. The linear discriminant analysis was used to discriminate the localities based on CHC profiles using the software Past v. 3.04 (Hammer et al. 2001). We also used the R package pvclust v. 1.3-2 (Suzuki and Shimodaira 2006) to calculate the potential clustering of samples (localities) based on Euclidean distances and 10,000 bootstrap replications. This package returns the values of approximately unbiased p values for cluster support (AU) for each bootstrap support (BP). Groups with AU ≥ 95% were considered supported by these data. For this approach, we used five individuals per nest. In the case of Bagé, we used five bees per sampling site, since we collected them on the flowers. To estimate the correlation between the average Euclidean distances of CHC data and geographical distances, we used the Mantel test (R software v. 3.2.2, package ade4 v. 1.7-4).

2.4 Molecular analysis

We extracted total DNA from the middle leg of one individual per nest (sampling site in the case of Bagé) according to the methodology proposed by Walsh et al. (1991). We performed a PCR using conditions previously described (Françoso and Arias 2013; Simon et al. 1994). For the PCR, it was used 5 μl of DNA, 2 μl of each primer (10 pmol), 25 μl of 2× Promega PCRMaster Mix [Taq-DNA polymerase (0.05 units/μl), 2× reaction buffer, 400 μM of each dNTP, and 3 mM of MgCl2], and 16 μl of ultra-pure water. The amplified fragment was visualized in 1% agarose gel contrasted with Gel Red™, using UV light. Then, it was purified using Wizard® SV Gel and PCR Clean-Up System Kit (Promega) following the manufacturer’s instruction. The purified samples were sequenced using Sanger’s methodology. The sequences were visualized, edited, and aligned using the software Geneious 5.1.6 (Drummond et al. 2011). For the alignment, we used Muscle algorithm (Edgar 2004) with eight interactions. DnaSP v. 5.10 (Librado and Rozas 2009) was used to determine the number of polymorphic sites (S), haplotypes (h), haplotype diversity (Hd), nucleotide diversity (π), and the average of nucleotide differences (k). The haplotype network was built using NETWORK v. 4.6 (Polzin and Daneshmand 2003), through the algorithm median joining (Bandelt et al. 1999). MEGA v. 5.5 (Tamura et al. 2011) was used to calculate the average number of variable sites between and within populations. The distance matrix was calculated using the model of Kimura two-parameter (K2P) (Nei and Kumar 2000) and used to build a neighbor-joining dendrogram. For the dendrogram construction, we also used the COI sequences of Schwarziana quadripunctata (access number at National Center for Biotechnology Information (NCBI) data bank KC853379) and Scaura latitarsis (access number KC853332). We also sequenced, with the same pair of primers, the COI of the stingless bees Plebeia pugnax, Plebeia saiqui, and Plebeia droryana, for comparative purposes. For this approach, we used four samples per collection site of Bagé. The software Arlequin v. 3.5 (Excoffier and Lischer 2010) was used to estimate the pairwise Fst. We determined the correlation between the genetic and geographical distances through a Mantel test in the software TFPGA (Miller 1997).

3 Results

3.1 Geometric morphometrics

The two first canonical axes of the CVA explained approximately 90% of the variation among the sampled individuals. This result showed that the locations of Candiota, Turuçu and Viamão are morphologically similar. The CV1 discriminated better Alegrete and Bagé from the other localities. CV2 was also important to discriminate Bagé from the others (Online Resource 1). The cross-validation tests showed a high level of samples’ correct classification, with the smallest values being the comparison between Candiota and Turuçu with 78.57 and 84.75% of correct match, respectively. Mahalanobis distances and its dendrogram also support CVA results, grouping Candiota, Turuçu, and Viamão and separating Alegrete and Bagé (Table II, Fig. 3). The Mantel test indicated a significant positive correlation between morphological and geographical distances (r = 0.61, p = 0.05), which may indicate isolation by distance among populations.

Table II Mahalanobis-squared distances (below diagonal) and p values (above diagonal) among sampling localities obtained from the CVA from geometric morphometric data
Figure 3.
figure 3

Dendrogram representing the Mahalanobis squared distances from geometric morphometric data.

3.2 CHC analysis

We identified 62 peaks of CHC, consisting of n-alkanes, unsaturated CHC (alkenes and alkadienes) and branched alkanes of chain length ranging from 21 (C21) to 35 (C35) carbon atoms. The PCA revealed that of the 20 peaks that contributed most to explaining the variation among localities, only 1 was an n-alkane (C32), 6 were branched alkanes, and the majority (13 peaks) were unsaturated CHC (Online resource 2). The discriminant analysis differentiated all analyzed groups, except Bagé from Candiota (Online Resource 3). The cluster analysis showed that samples from Alegrete form a separated group while the other localities clustered together. Although Viamão is inside of this larger group, its samples significantly clustered together (Fig. 4). The Mantel test indicated no statistical significance (r = 0.7763467, p = 0.1213), which corresponds to the absence of correlation between the average Euclidean distances of chemical data and geographical distances. Another interesting fact is that our results demonstrated that unsaturated and branched CHCs are the main responsible peaks for the differentiation of the groups and, based on the canonical analysis and cluster approach, Alegrete and Viamão are different from the other sampling places.

Figure 4.
figure 4

Dendrogram representing the Euclidean distances between samples based on CHC profiles. The first letter in each terminal represents the locality: Alegrete (A), Bagé (B), Candiota (C), Turuçu (T), Viamão (V). Cluster support (au—dark gray values) ≥ 95 are considered statistically significant. Bootstrap (bp—light gray values). Branches’ edges in gray. The localities of Alegrete and Viamão, which had all samples clustered together, as well as the major group that comprises Bagé, Candiota and Turuçu, are indicated by an arrow.

3.3 Molecular data

We obtained a 546-bp region from COI for population comparisons (accession numbers to NCBI data bank KY274850 to KY274882). The nucleotide composition was A = 32.1%, T = 45.3%, G = 10.2%, and C = 12.4%. We found 25 polymorphic sites and six different haplotypes (Table III). We found unique haplotypes for each sampling site (Online Resource 4), which is an indication of low gene flow among populations. Despite a large number of mutational steps between haplotypes from different localities, there was one single change in amino acid sequence, from methionine to leucine on haplotype 4 (in position 126). The neighbor-joining dendrogram showed that M. caerulea sequences grouped and that the samples from each site are discriminated through this method (Fig. 5). Along with these results, Fst values significantly discriminated populations of different localities (Table IV), except for Viamão. The K2P distance between sampling sites ranged from 0.6 to 3.1%, also in agreement with haplotype and Fst results (Online Resource 5). Additionally, Mantel test indicated a positive significant correlation between genetic and geographic distances (r = 0.51, p < 0.05).

Table III Genetic diversity of COI among M. caerulea population. (N) number of samples. (S) number of polymorphic sites. (h) number of haplotypes. (Hd) haplotype diversity. (π) nucleotide diversity. (K) average number of nucleotide differences
Figure 5.
figure 5

Neighbor-joining dendrogram of genetic proximity among sampling locations of M. caerulea and outgroups (S. quadripunctata, S. latitarsis, P. saiqui, P. pugnax, and P. droryana). The numbers represent the bootstrap support after 1000 replications.

Table IV Pairwise Fst (below diagonal) and p values (above diagonal) among M. caerulea population

4 Discussion

The three methodologies used in the present study indicated similar results regarding their effectiveness in the differentiation among samples from different localities. The high rates of correct classification using morphological and biochemical markers, as well as the existence of an elevated number of exclusive haplotypes, are in agreement to previous works involving bee populations (Francisco et al. 2008, 2013; Batalha-Filho et al. 2010; Brito and Arias 2010; Francisco and Arias 2010; Francoy et al. 2011; Brito et al. 2013; Bonatti et al. 2014). Otherwise, it is worth to mention that, as far our knowledge goes, studies involving the discrimination of Meliponini bee populations using CHC profiles are scarce and analyzed samples from few localities (Francisco et al. 2008; Ferreira-Caliman et al. 2012). It makes this the first study that analyzed a higher number of populations. As the results indicate, CHC can be considered a good marker to discriminate different bee populations and also be useful to track the geographic origin of samples. Another interesting fact here is that the majority of the compounds that influenced the discrimination of the groups are unsaturated CHC. This group is typically associated with derived functions, like nest mates, recognition, communication and others (Gibbs 2002; Châline et al. 2005).

The three markers are also in agreement in demonstrating the influence of the physiographic features of the sampling sites in the clustering of groups. The agreement between physiographic features and species differentiation has already been reported by other studies using mtDNA and geometric morphometrics of wings (Francisco et al. 2008; Francoy et al. 2011, 2016; Combey et al. 2013; Bonatti et al. 2014; Lima et al. 2014; Hurtado-Burillo et al. 2016). This influence may be due to some environmental effects on the heritability of the patterns of wing venation, which can vary between different parts of the wings (Monteiro et al. 2002). Eventual advantages of some haplotypes in similar physiographic regions may also be an explanation (Francoy et al. 2011, Bonatti et al. 2014). For the CHC, this influence is clearer, since environmental factors, such as temperature, diet and floral resources, have a direct impact over CHC (Gibbs 1998; Martin et al. 2012; Kather and Martin 2012). This significant influence may also be responsible for the lack of correlation between geographic and chemical distances among the studied populations.

On the other hand, the morphometric and molecular distances presented positive correlations with the geographic distance, indicating an isolation process caused by the geographic distances among the sampling sites (Telles and Diniz-Filho 2005). This fact may be due to specific characteristics of bees from Meliponini tribe. These bees are monandric, and this fact influences the genetic structure of the population (Peters et al. 1999). They also present strong queen philopatry, mainly due to the behavior of the colony during the swarming process (Nogueira-Neto 1997). In this process, the new queen leaves the original colony with some workers (Nogueira-Neto 1954). They nest in a location close enough to keep in touch with the mother colony, getting resources from it for a short period (Nogueira-Neto 1997). The small body size of M. caerulea workers works as a limiting factor in the flight range. For example, small bees such as workers of Tetragonisca angustula, Scaura latitarsis and Nannotrigona testaceicornis, which are approximately the same size of a M. caerulea worker, have their maximum flight distances between 621 and 951 m (Araújo et al. 2004). This short range of flight, together with the strong habitat fragmentation in South Brazil and the reduced capability of dispersion by the queens, is probably the main factor that determines the current population structuration.

M. caerulea is a stingless bee that nests exclusively underground, and its distribution is congruent to Pampa biome. Because of its nesting habits, M. caerulea populations do not suffer the effects of colony transportation since they are not reared for beekeeping as bees of the Melipona genus. Additionally, Pampa biome has been reduced due to urbanization and intense agriculture activity. Together with a low dispersion capability, our results suggest that the sampling sites are distinct and the structuration found here is directly influenced by geographical distance. The presence of exclusive haplotypes for each location and different morphometric and biochemical profiles indicate the need for particular attention for all occurrence area of M. caerulea, once the loss of any of these regions might correspond to a substantial negative impact on the diversity of this species.

As a final observation, all three markers seem to be adequate to evaluate population variability in bees and also to address specific questions regarding the relationships among these populations. Each one of them has its advantages and limitations, and they should be analyzed as such. However, the conjunction of the results may also indicate a broader picture, and it is worthy, in our opinion, to try to look at them all together.