Background

Genetic drift and natural selection are often seen as two counteracting evolutionary forces across populations [1]. In large and stable populations, where the random effects of genetic drift are limited, balancing selection maintains high levels of adaptive variation affecting population fitness and evolutionary potential [2, 3]. By contrast, in small and isolated populations, or within the context of demographic events such as bottlenecks, genetic drift is predominant, masking possible effects of natural selection [4, 5]. Neutral molecular markers allow assessing interactions between demographic history and genetic diversity [6], but do not provide direct information on selective processes [7, 8]. Thus, evidence of selection and adaptive potential is best approached by contrasting neutral and functional molecular markers (e.g., [9, 10]).

In vertebrates, the major histocompatibility complex (MHC) is the prime candidate for pathogen resistance genes and contains some of the most polymorphic functional loci [11, 12]. The exceptional level of MHC polymorphism is believed to be driven by the antagonistic coevolution with pathogens and occurs through pathogen-mediated balancing selection [11, 12]. Beyond its clear significance in modulating pathogen resistance [2, 13, 14], MHC genes have been shown to influence other biological traits such as maternal-fetal interactions, kin recognition, life-time reproductive success and mate choice [15,16,17,18]. MHC is intimately linked with factors likely to affect individual fitness, population viability and evolutionary potential in changing environments. Thus, patterns of MHC diversity have been repeatedly used in a conservation context in populations of particular interest [19,20,21,22].

In Europe, abundance and distribution of large carnivores (bear, wolf and lynx) have been dramatically shaped by humans during the past few centuries. Although persecuted and driven to or close to extirpation in several countries, most of the remnant populations have stabilized or even expanded recently [23]. One example is the Iberian wolf (Canis lupus), which erstwhile ranged over most of the Iberian Peninsula, but after decades of severe human persecution became confined to small and fragmented populations, representing about 1/5 of the former population range [23,24,25]. In the 1970s, the Iberian wolf population was estimated to be reduced to ca. 700 individuals, mostly restricted to the northwestern region [23, 26, 27]. After the 1970s, the population started to grow and expanded southwards and eastwards (Fig. 1) [28]. Currently, the Iberian wolf is considered under a stable or positive demographic trend [23]. It occurs in a core area where the species have always persisted in northwestern Iberia [26, 27, 29], and the adjacent re-colonized area (the expansion front), alongside two small and isolated populations [30,31,32]. Such a complex demographic history have translated into a cryptic population structure in the Iberian wolf at small spatial scales, with moderate level of genetic differentiation, including the differentiation of the re-colonized area [33].

Fig. 1
figure 1

Map showing location of samples corresponding to three demographic groups (persistent in green, expanding in blue and isolated in yellow). Pie charts represent the relative frequency distribution of seven three-locus MHC haplotypes per demographic group. Each color of the pie chart represents a three-locus MHC haplotype, and size is proportional to haplotype frequency within demographic group (Additional file 2: Table S2). Filled gray polygon represents the estimated wolf distribution in the Iberian Peninsula in 2005 [24]

Remarkable MHC class II variability has been shown in wolf populations [34,35,36,37]. However, results vary regarding the predominant role of selection over neutral forces. Some studies provided evidence that wolf MHC diversity is maintained by balancing selection, including studies on populations under demographic decline [34, 36], where parasite resistance was suggested as the possible driving force [36]. In contrast, MHC diversity in the bottlenecked wolf population of Scandinavia was shown to be compatible with neutral evolution [38]. Although MHC variability patterns of small and isolated populations may differ from that in large and outbred populations, the power of balancing selection acting on MHC can be outweighed by demographic events, such as bottlenecks and fragmentation with consequent genetic drift [38, 39]. Hence the signature of selection and/or population demographic effects is expected to vary across populations under different demographic histories, challenging the assessment of the relative role of different microevolutionary forces [40].

In this study, we integrate neutral and adaptive diversity patterns to disentangle how adaptive diversity is driven under three different demographic within the Iberian wolf range. Taking into account the genetic population structure of this population based on neutral markers [33], we used the permanent wolf range (i.e., persistent group) as the baseline for demographic stability and compared it to the population dominated by the re-colonization process after the 1970s bottleneck [27] (i.e., expanding group) and to the remaining small and isolated population in the South of Douro River in Portugal (i.e., isolated group). For the persistent group we expect to find natural selection acting to maintain adaptive variation at a stronger intensity than genetic drift; and therefore MHC diversity is expected to exceed neutral diversity. In this scenario, we can either detect balancing or diversifying selection. We may observe balancing selection when MHC exhibits lower population differentiation than neutral loci, due to enhanced effective migration rate and similar allele frequencies in MHC loci [11]. Alternatively, we can detect diversifying selection when MHC exhibits accelerated divergence as a response to spatial or temporal heterogeneity in parasite abundance and diversity [41]. Contrastingly, for expanding and isolated groups, genetic drift is expected to outweigh selection; and therefore similar diversity patterns between MHC and neutral markers are expected.

Results

MHC diversity

We found seven DRB1, four DQA1, and six DQB1 alleles across the 113 Iberian wolf samples (Additional file 1: Table S1), all previously reported in other wolf populations (Additional file 2: Table S2, Additional file 3: Table S3). Diversity at DRB1 and DQB1 was higher than diversity at DQA1 (Additional file 4: Table S4). Observed heterozygosity was higher than expected at all three loci considering the whole dataset, with significant departure from Hardy-Weinberg Equilibrium (Additional file 4: Table S4). Nucleotide and amino acid distances were higher for DRB1 and DQB1 than for DQA1, with amino acid distances higher than the nucleotide distances (Additional file 4: Table S4).

When clustering the three loci (DRB1/DQA1/DQB1), we observed seven haplotypes (Fig. 1, Table 1). Each DRB1 allele is found in haplotypic association with a different DQ pair, with exception of alleles DRB1*05401 and DRB1*05501, which share the same DQ pair, DQA1*00301/ DQB1*00401.

Table 1 Sequence diversity and neutrality tests of the three demographic groups (persistent, expanding and isolated) and the whole Iberian wolf range for the three-locus (DRB1/DQA1/DQB1) MHC haplotypes

Five haplotypes are shared with other European wolf populations (Additional file 3: Table S3). We cannot confirm the presence of two haplotypes in other European wolf populations due to the absence of haplotype reconstruction in [37], where all alleles that compose these two haplotypes were reported. The three most frequent three-locus haplotypes are shared with Croatia, Italy and Finland (Fig. 2, Additional file 2: Table S2, Additional file 3 Table S3). Iberian haplotypes are distributed throughout the NJ tree (Fig. 2). Seven, six and four three-locus haplotypes were found in the persistent, expanding and isolated groups, respectively (Fig. 1, Table 1). The most frequent haplotype was different for each demographic group (Fig. 1, Additional file 2: Table S2). Sequence diversity as the number of segregating sites, nucleotide diversity, number of mutations and Watterson’s mutation parameter were similar among the three different demographic groups (Table 1).

Fig. 2
figure 2

Neighbor joining tree showing relationships between three-locus haplotypes of Iberian and other European wolf populations using p-distances. Colored circles indicate populations where haplotypes were found. Iberian haplotypes are in bold. Haplotype nomenclature is given in Additional file 1: Table S1

Positive selection

All MHC loci revealed positive and significant departure from neutrality (Additional file 4: Table S4), indicating an excess of high-frequency segregating sites. Persistent and expanding groups revealed positive and significant departure from neutrality for Tajima’s D and Fu & Li D* tests (Table 1). The isolated group did not show significant departure from neutrality for Tajima’s D and Fu & Li D* tests (Table 1). The dN/dS ratio was significantly different from neutral expectations in all demographic groups both at all sites and at the peptide binding region for DRB1 loci, and only at all sites for DQB1 loci (Table 2). For DQA1, the dN/dS ratio was significantly different from neutral expectations at all sites in isolated group and at the peptide binding region in persistent and expanding group (Table 2).

Table 2 Z-tests of selection on all sites, peptide binding region (PBR) inferred from [42], and the non-peptide binding region (non-PBR) of the three demographic groups (persistent, expanding and isolated) of Iberian wolves for the three MHC loci (DRB1, DQA1, and DQB1)

Additionally, we found several MHC codons under positive selection in the three demographic groups using OmegaMap, a few having high posterior probability (PP > 95%, Additional file 5: Table S5). DRB1 and DQB1 loci had higher proportions of selected codons than DQA1 (Additional file 5: Table S5). We matched our alignment with the human orthologue [42] and observed that almost all selected codons with high posterior probability (DRB1: 6 in 7, DQA1: 1 in 2, DQB1: 4 in 4 codons) match the same peptide-binding sites described by these authors (Additional file 5: Table S5).

Genetic differentiation between demographic groups

We found a single model with support from the data to explain allelic richness and expected heterozygosity, with a deviance explained of more than 65% (Additional file 6: Table S6). Allelic richness is explained by a combination of demographic group and locus type, while expected heterozygosity is explained by demographic group, both including the effect of locus (Additional file 6: Table S6). For both the number of alleles and observed heterozygosity we found three models with support, presenting a deviance explained between 20 and 43% (Additional file 6: Table S6). Number of alleles is explained by demographic group or a combination of demographic group and locus type (Additional file 6: Table S6). Observed heterozygosity is explained by the null model or by locus type (Additional file 6: Table S6).

According to the selected models, allelic richness, expected heterozygosity and the number of alleles are significantly higher in the persistent than in the isolated group (Additional file 7: Table S7). Number of alleles is also significantly higher in the persistent than in the expanding group (Additional file 7: Table S7). Allelic richness and observed heterozygosity are significantly higher in MHC than in microsatellites.

Comparing MHC and neutral diversity

For both MHC and microsatellite loci, significant deviations from Hardy-Weinberg Equilibrium were observed for the persistent and expanding groups (Table 3). However this deviation is in opposite directions (Table 3). The persistent group showed significant higher expected than observed heterozygosity in both molecular markers, while the expanding group showed significant higher observed than expected heterozygosity in MHC (Table 3). Observed heterozygosity was significantly higher at MHC than at microsatellite loci for the persistent and expanding groups (p < 0.05, t-test, Additional file 8: Table S8). For the expanding group, the mean allelic richness and expected heterozygosity at MHC were also significantly higher than the ones at microsatellite loci (p < 0.05, t-test, Additional file 8: Table S8). For the isolated group, none of the diversity indices was significantly different between MHC and microsatellite loci (p > 0.05, t-test, Additional file 8: Table S8).

Table 3 Genetic diversity of the three demographic groups (persistent, expanding and isolated) and the whole Iberian wolf range for microsatellite loci and for the three-locus (DRB1/DQA1/DQB1) MHC haplotypes

The first two components of PCA based on MHC loci did not differentiate among the three demographic groups (Fig. 3). Pairwise FST values using MHC data support the lack of genetic differentiation between persistent and expanding groups, while moderately significant differentiation is observed between all groups using Jost’s D differentiation (Table 4). Contrastingly, PCA based on microsatellite data showed clear genetic differentiation between all demographic groups, especially for the isolated group, while a partial overlap is observed between persistent and expanding groups (Fig. 3). All FST and Jost’s D pairwise comparisons between groups were significant for genetic differentiation using microsatellite loci (Table 4). Overall, pairwise genetic differentiation using FST was lower in MHC than in microsatellite loci, whereas for Jost’s D differentiation index, the isolated group exhibited higher values for MHC than for microsatellite loci (Table 4). The global co-inertia RV coefficient observed was high (0.98), indicating strong correlation between MHC and microsatellite matrices, but this coefficient is not different from what is expected by chance (p-value = 0.17).

Fig. 3
figure 3

Co-inertia analysis (CoA) between neutral and adaptive data for all demographic groups. a. Principal Component Analysis (PCA) for MHC loci; b. PCA for microsatellite loci; dots represent different individuals and circles around dots with different colors represent demographic groups. c. CoA plot indicating the relative position of each demographic group on the factorial plane for the first two CoA eigenvalues. Dots and arrows represent the projected co-ordinates of each dataset (MHC and microsatellite loci, respectively), which are joined by a vector, where the length of the vector is proportional to the divergence between the datasets. d. Canonical weights for MHC alleles. e. Canonical weights for microsatellite alleles

Table 4 Pairwise differentiation values for microsatellite and MHC loci (X/X, respectively), considering three demographic groups of Iberian wolves, estimated for FST and for Jost’s differentiation index (D)

Discussion

This is the first report of MHC class II diversity in the Iberian wolf. Overall, Iberian wolves exhibit lower MHC diversity than their European counterparts, both in number of alleles and number of three-locus haplotypes (i.e., 6–7 alleles and 7 haplotypes in Iberia vs 6–13 alleles and 13–14 haplotypes in other European populations) [34,35,36]. The exception is the isolated Scandinavian wolf population, which went through a drastic bottleneck recovering from only three individuals [38]. The reduced MHC diversity in the Iberian wolf has also been observed for other genomic regions [31, 43,44,45]. Long-term isolation and past bottlenecks in the Iberian population [27, 45] have certainly reduced neutral diversity and may also be at the origin of this depleted MHC diversity.

All MHC alleles have been previously identified in other canids [37, 46, 47], a pattern that is better explained by the trans-species polymorphism described in MHC canid phylogenies [37, 48] than by hybridization which is not common in the Iberian wolf [49]. The observed 17 alleles were confined in just seven three-locus haplotypes of which five have previously been reported [34,35,36, 48]. The remaining two haplotypes were not unequivocal confirmed to be present in other wolf populations due to the absence of haplotype reconstruction in [37]. The occurrence of conserved three-locus MHC haplotypes in Iberian wolves confirms the tight linkage of these gene loci, supporting a strong selective pressure maintaining haplotype combinations. The association observed between each DRB1 allele and a specific DQ pair supports the preferential association between alleles at these loci [46]. A similar trend is observed in other European wolf population [34,35,36], though not as extreme as in Iberia, where seven DRB1 alleles yielded only seven haplotypes. However, similarly to other European populations, the seven haplotypes are widely distributed in the NJ tree, supporting maximal MHC diversity in the Iberian wolf.

Within Iberia, we found contrasting MHC diversity patterns according to different demographic trajectories. The persistent group exhibited significant higher genetic diversity than the isolated group, but did not show significant excess of MHC heterozygotes. However, we found evidence of balancing selection acting to maintain MHC adaptive variation in this group, compensating the effects of the severe population decline in the 19th and 20th centuries [50, 51]. This is revealed by a significant departure from neutrality at MHC loci, significant higher observed and expected heterozygosity and lower differentiation at MHC than neutral loci, and signs of positive selection with higher proportions of selected codons lying in peptide-binding regions, as inferred from humans. Similar results have been reported for Finnish wolves where, despite a strong genetic signal of population decline [52], the population still reveals a predominant role of balancing selection maintaining MHC diversity [36]. Although selective advantage has been associated with a heterozygote advantage mechanism, MHC diversity can also be promoted by negative frequency-dependent selection [2]. This model proposes that the emergence of a novel pathogen can promote the frequency of a rare or novel allele/haplotype [2, 3, 53]. The frequency shift of an adaptive allele/haplotype by such process can occur in very few generations [53]. We can therefore not exclude that negative frequency-dependent selection may have contributed to maintain high allelic richness possibly compensating for the low number of heterozygotes observed.

The expanding group exhibited similar genetic diversity than the persistent group but show significant excess of MHC heterozygotes. Reduced genetic diversity is expected for marginal populations [10, 54,55,56], but increased heterozygosity at MHC supports a model of heterozygote advantage for this demographic group. The positive and significant result for neutrality statistics provides further evidence for balancing selection acting in this demographic group. The overdominance model postulates that heterozygotes have higher fitness than homozygotes due to the wider spectrum of MHC receptors able to induce parasite resistance [2]. MHC heterozygotes have indeed been associated with resistance to several infections in wolves [36]. Thus, the high MHC genetic diversity exhibited by the expanding group is expected to maintain its high adaptive potential.

The isolated group showed the lowest neutral and adaptive diversity and no significant excess of MHC heterozygotes, a likely result of the dramatic population decline that culminated in isolation in the early twenty-first century [26, 30]. The observed depletion of MHC diversity potentially increases population susceptibility to disease, and adds to current concern for its survival [12, 57]. Indeed, in a wolf serologic survey the isolated group showed no canine parvovirus antibodies, possibly related to high case-fatality rate [58]. No significant departure from neutrality neither significant difference between MHC and neutral loci were observed, suggesting that MHC loci behave according to neutral expectations in the isolated group. Other examples are known for small and isolated populations where genetic drift outweighs the strength of selection [3, 38, 39]. However, one interesting result comparing the isolated group with the others may suggest selective forces acting in this group. The higher Jost’s D differentiation index observed for MHC relative to neutral loci may be explained by diversifying selection as consequence of spatial variation in pathogen-mediated responses [40] and/or due to stronger divergence due to short-term positive selection on beneficial alleles. Interestingly, the isolated wolf population shares several pathogens with wild and domestic species [59,60,61], but holds also new pathogens [60], which could be associated with the observed differentiation in MHC. Inconclusive results are probably due to the small sample size of this group and should be interpreted with caution [62].

Conclusion

By empirically testing adaptive diversity under three different demographic scenarios within the Iberian wolf range, we were able to show that these different demographic scenarios are associated with different selective pressures, which are possibly driven by different selection mechanisms to maintain adaptive diversity. Future studies are warranted to investigate the link between the Iberian wolf immunogenetic diversity and the pathogen community structure to better understand the mechanisms underlying adaptation in each demographic group.

Methods

Sampling and DNA extraction

A total of 113 wolf tissue samples were collected across the range of wolves in the Iberian Peninsula in the last two decades. Tissue samples were collected from several Portuguese and Spanish entities (please see Ethics section). To avoid close familiar relationships and the presence of hybrids between wolf and dog in our dataset we selected samples according criteria defined by [33]. Samples were classified into one of three demographic groups: i) persistent group (n = 78), represented by samples from the northwest Iberian Peninsula where the wolf has been continuously present in the last decades, ii) expanding group (n = 25), represented by samples originally from the area where wolves were absent during the recent low of the population in the 1970s, representing a well-defined genetic cluster within Iberia (Castilla y León cluster) [33], and iii) isolated group (n = 10), represented by samples from the small and isolated wolf area south of the Douro river in Portugal (Fig. 1). Samples were stored in ethanol and DNA was further extracted using QIAGEN DNeasy Blood & Tissue Kit.

MHC amplification and sequencing

DLA class II genes DRB1, DQA1 and DQB1 were amplified and sequenced using intronic and locus specific primers described in [34]. Polymerase chain reactions (PCR) were carried out in a total volume of 10 μl containing approximately 10 ng of DNA, 0.4 μM of each primer and 5 μl 2× MyTaq HS Mix (Bioline) (see Additional file 9: Table S9 for PCR conditions). PCR products were sequenced in both directions using BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) according to the manufacturer’s protocol and sequencing products were separated on a 3130xl Genetic Analyzer (Applied Biosystems).

MHC alleles and haplotype assignment

DRB1, DQA1 and DQB1 loci sequences of each of the 113 samples were analyzed and aligned with CodonCode Aligner (CodonCode Corporation, www.codoncode.com) and BioEdit [63].

MHC sequences were phased in DnaSP v. 5.10 [64] using PHASE [65] with the “recombination” model (−MR0) and 1000 iterations after 100 burn-ins. The threshold for accepting true alleles was set to 90%. This methodology has been shown to be accurate for MHC haplotype reconstruction [66], retrieving very low false positives [67]. MHC alleles obtained for each locus were compared with available information in public datasets (NCBI: https://www.ncbi.nlm.nih.gov) for the genus Canis. As the three genes studied are chromosomally clustered in tandem, we analyzed each locus individually or as three-locus haplotypes. Three-locus haplotypes (DRB1/DQA1/DQB1) were reconstructed firstly in individuals homozygous for each locus, and then for heterozygous individuals basing inferences on haplotypes observed in homozygous state. Haplotype reconstruction was confirmed with available information for other canid populations.

MHC diversity

MHC diversity was analysed for each locus and for the three-locus haplotypes. For both we estimated number of alleles (NA), mean number of effective alleles (Ne), mean observed (Ho) and expected (He) heterozygosities and inbreeding coefficient (FIS) using GENALEX 6.5 [68]. Test of deviation from Hardy-Weinberg equilibrium was calculated using GENEPOP 4.6 [69]. Allelic richness (AR) was estimated using FSTAT 2.9.3.2 [70], which performs rarefaction compensating sampling disparity, reducing all populations to a minimum sample size of n = 10. Sequence-level diversity values, such as number of haplotypes (h), number of segregating sites (S), nucleotide (π) and haplotype (HD) diversities, and Watterson’s mutation parameter (θw) were estimated either in each locus or each group (with the three-locus haplotypes) using DnaSP 5.10.

Molecular divergence analysis over all sequence pairs, also either in each locus or each group (with the three-locus haplotypes) was conducted using Mega 7 [71] with a 1000 replicate bootstrap procedure. The best nucleotide and amino acid substitution models were chosen according to the Bayesian Information Criterion and used to compute nucleotide and amino acid evolutionary distances by the maximum-likelihood method through 1000 bootstrap replicates. The relationship between three-locus haplotypes of Iberian and other European wolf populations was reconstructed using the Neighbor Joining (NJ) method and p-distances in Mega 7. Three-locus haplotypes of other European wolf populations were retrieved from previous studies [34,35,36, 38].

Testing positive selection

To detect evidence of selection on MHC loci at the population-level, we estimated Tajima’s D [72] and Fu & Li D* [73] statistics using DnaSP 5.10. Under neutrality, D and D* are not different from 0, and significant negative deviations may indicate a recent selective sweep or population expansion while positive deviations indicate balancing selection or population contraction. Neutrality tests were computed for each MHC locus and for the three-locus haplotypes in the whole Iberian wolf range and in each demographic group. Deviations from neutrality were also assessed by a Z-test (non-synonymous substitution ≠ synonymous substitution; dN ≠ dS) implemented in MEGA 7, using Nei-Gojobori method with Jukes-Cantor correction for multiple substitutions and 1000 bootstrap replicates. Z-tests were computed for all sites, antigen-binding sites inferred from the human orthologue [42], and non-antigen binding sites for each demographic group. Additionally, we also used a “coalescent with recombination” Bayesian approximation implemented in OmegaMap [74] to find signatures of selection in each demographic group. OmegaMap allows both selection parameter (ω) and recombination rate (ρ) to vary along the sequence [75]. Codons with excess of non-synonymous polymorphism compared to synonymous polymorphism are assumed under positive selection. We followed [36] to set distribution and parameter values of priors to represent neutrality. We performed two runs for each locus and each demographic group with 250,000 repeats and a burn-in of 20,000. Convergence of runs was check through summary function implemented in R 3.5.0 (R Core Team, 2018). Codons are considered as positively selected with posterior probabilities (PP) above 95%.

Microsatellite diversity

We selected a set of 39 microsatellite loci previously genotyped by [33] for all but three samples (two from the persistent and one from the isolated groups) sequenced for MHC to estimate neutral genetic diversity in each demographic group. This set included only loci with less than 25% missing data and with no significant departure from Hardy–Weinberg equilibrium after Bonferroni correction estimated using GENALEX 6.5. Genetic diversity, including number of alleles (NA), mean number of effective alleles (Ne), mean allelic richness (AR), mean observed (Ho) and expected (He) heterozygosities and inbreeding coefficient (FIS) were estimated using GENALEX 6.5 and FSTAT 2.9.3.2. For allelic richness all populations were reduced to a minimum sample size of n = 5. Test of deviation from Hardy-Weinberg equilibrium was calculated using GENEPOP 4.6.

Testing for genetic differentiation between demographic groups

To test for statistically significant differences of genetic diversity between the demographic groups, we performed linear mixed effects models (e.g.,[76, 77]) using lme4 [78] and lmerTest [79] implemented in R platform (R Core Team, 2018). We used number of alleles, allelic richness, observed and expected heterozygosities as dependent variables, demographic group and locus type (MHC and microsatellites) as explanatory factors, and locus as random variable. Ten competing models were tested, including i) null, ii) demographic group-dependent, iii) locus type-dependent, iv) dependent on both demographic group and locus type, and v) dependent on both demographic group and locus type plus the association between these two factors. For each of these options we run two models accounting or not for locus random effect (for model syntaxes see Additional file 6: Table S6). We performed independent model testing for each diversity measure. We used Akaike Information Criterion corrected for small sample sizes, AICc, to determine the best model. Models with delta AICc (ΔAICc) lower than 2.0 were assumed as equally plausible. For each diversity measure we present estimates, standard errors and probability inferred from t-value, indicating the precision of the estimates.

Comparing neutral and functional diversity

Expectations that functional diversity exceeds neutral diversity were tested by a one-tailed t-test (p-value), following [35]. The relationship between microsatellite and MHC loci was estimated using a co-inertia analysis (CoA) following [10]. CoA is a multivariate method that estimates the covariance structure between two tables with the same observations (e.g., same individuals). For each genetic matrix (microsatellite and MHC loci) we performed a Principal Component Analysis (PCA) followed by factorial PCA using demographic groups as explanatory variable. For this analysis we used the adegenet and ade4 R packages [80, 81]. We then performed a CoA using the two most important PCA components. In a CoA plot, an arrow and a dot define each demographic group, where the length of the vector connecting arrow and dot is proportional to the divergence between the types of markers. The correlation coefficient RV was used to evaluate the strength of the relationship between the two matrices. Values of RV close to 1 indicate strong correlation between matrices. The significance of the RV coefficient was tested using 1000 permutations, where null hypothesis is that the two matrices are related by chance. Canonical weights for microsatellite and MHC alleles were used to disclose the contribution of each allele to the divergence of demographic groups.

MHC and neutral differentiation between groups was estimated through FST [82] and Jost’s D [83] using Arlequin 3.5.2.2 [84] and the DEMEtics R package [85], respectively. FST and Jost’s D differentiation measures were both used because FST may be affected by highly variable markers [39].