Abstract
Captive breeding is often a last resort management option in the conservation of endangered species which can in turn lead to increased risk of inbreeding depression and loss of genetic diversity. Thus, recording breeding events via studbook for the purpose of estimating relatedness, and facilitating mating pair selection to minimize inbreeding, is common practice. However, as founder relatedness is often unknown, loss of genetic variation and inbreeding cannot be entirely avoided. Molecular genotyping is slowly being adopted in captive breeding programs, however achieving sufficient resolution can be challenging in small, low diversity, populations. Here, we evaluate the success of the Vancouver Island marmot (Marmota vancouverensis; VIM; among the worlds most endangered mammals) captive breeding program in preventing inbreeding and maintaining genetic diversity. We explored the use of high-throughput amplicon sequencing of microsatellite regions to assay greater genetic variation in both captive and wild populations than traditional length-based fragment analysis. Contrary to other studies, this method did not considerably increase diversity estimates, suggesting: (1) that the technique does not universally improve resolution, and (2) VIM have exceedingly low diversity. Studbook estimates of pairwise relatedness and inbreeding in the current population were weakly, but positively, correlated to molecular estimates. Thus, current studbooks are moderately effective at predicting genetic similarity when founder relatedness is known. Finally, we found that captive and wild populations did not differ in allelic frequencies, and conservation efforts to maintain diversity have been successful with no significant decrease in diversity over the last three generations.
Introduction
The need for conservation actions are rapidly increasing as biodiversity declines with the onset of the sixth mass extinction, which has already caused severe population declines in 30–50% of vertebrates worldwide (Ceballos et al. 2015, 2017, 2020). Current conservation strategies to minimize population declines and species extinction include in situ and ex situ methods. In situ strategies occur in a species’ native environment and can include food supplementation, translocation of individuals, and predator-free enclosures (Limoges et al. 2013; Kyle et al. 2017). In situ measures often provide short-term population increases with relatively minimal disturbance to the population. However, such measures are often slow to materialize and require relatively stable populations to be successful (Limoges et al. 2013; Kyle et al. 2017). In addition, severe population declines can occur before a species is recognized as endangered (Peters et al. 2015), at which point in situ approaches may be insufficient for conservation. Populations considered ‘critically endangered’, for example, may require more interventionistic approaches to avoid extinction (Peters et al. 2015).
Ex situ conservation generally consists of moving individuals out of their natural environments into artificial habitats, such as zoos or aquaria, with the intent of preventing imminent extinction (Canessa et al. 2015, Brichieri-Colombi et al. 2019). Populations facing extreme threats may be removed in their entirety or captive breeding programs may be implemented, in which case a subset of individuals are brought into a controlled environment and their offspring are released to reinforce the wild population (Canessa et al. 2015). Captive breeding programs have prevented the extinction of California condors (Gymnogyps californianus; Ralls and Ballou 2004), black-footed ferrets (Mustela nigripes; Wisely et al. 2003), and red wolves (Canis lupus rufus; Hedrick and Fredrickson 2008) among others (Barbanti et al. 2019). However, captive breeding programs are not always successful as they are constrained by certain logistical and financial issues. For example, some species fail to thrive and propagate in captive environments (Robin 2003). In species that will breed, limited capacity, limited population size, and accidental breeding of closely related individuals can lead to high levels of inbreeding (Kyle et al. 2017; Barbanti et al. 2019). Inbreeding is a major concern in conservation efforts as it can lead to reduced fitness (Ólafsdóttir and Krisjánsson 2008; Knief et al. 2015). Consequently, one of the foremost long-term challenges in captive breeding programs is preventing inbreeding despite a limited gene pool (Rollinson et al. 2014; Kyle et al. 2017).
Captive-breeding programs have attempted to address the concerns of inbreeding and loss of genetic diversity by using pedigree-based methods, traditionally using studbooks (records of past breeding events) to track ancestry and recommend future breeding pairs (Ivy et al. 2009; Lacy et al. 2012; Kyle et al. 2017). However, one concern with studbooks is the unknown relatedness amongst the initial breeding stock (founder population) or ‘new’ breeders later on. In the absence of prior information, these founders are often assumed to be unrelated, or relatedness may be inferred based on behavioural observations or geographic distance between source populations in the wild (Ivy et al. 2009; Witzenberger and Hochkirch 2011; Kyle et al. 2017). While estimates of relatedness and inbreeding among founder progeny become increasingly accurate within four generations (Huisman et al. 2016), the initial breeding events may result in unacceptable loss of diversity for populations already experiencing limited genetic variation (Frankham et al. 2017). Further, studbooks predict relatedness between pairs of individuals by assuming the law of averages from Mendelian inheritance across generations—half-siblings or grandparent-offspring dyads are expected to share 25% of their genes. In reality this will not be true for every pair of individuals as cross-over and meiotic reduction is random across gametes, meaning some dyads may share considerably more or less than 25% of their genes. This uncertainty can be exacerbated over generations leading to undesirable consequences in small populations. Nonetheless, mating pairs are typically selected based on high genetic dissimilarity according to studbook estimates, which can lead to differences between expected and true (realized) relatedness (Ivy et al. 2009; Witzenberger and Hochkirch 2011; Kyle et al. 2017).
The concerns associated with the use of studbooks have prompted the incorporation of molecular techniques to supplement, or even replace, traditional studbooks (Sekino et al. 2003; Ivy et al. 2016; Modesto et al. 2018). Unique genetic information is often acquired by genotyping individuals at microsatellite loci; markers that vary in the number of repeat units (length-based fragment analysis). This approach is relatively affordable and accessible, especially in non-model species with little molecular information available (Selkoe and Toonen 2006; Darby et al. 2016). However, this approach is prone to underestimating genetic diversity due to length homoplasy—alleles with identical length may not be identical by descent or by sequence (Fig. 1; Darby et al. 2016; Barbian et al. 2018). Consequently, length-based microsatellite genotyping typically detects only a subset of the genetic variation present. For species with very low genetic diversity, length-based fragment analysis may not reveal sufficient polymorphism to assist conservation and management (Darby et al. 2016; Barbian et al. 2018).
High-throughput amplicon sequencing (HTAS) (Darby et al. 2016; Barbian et al. 2018) is a targeted technique that sequences microsatellite loci, offering a bridge between traditional microsatellites and genome-wide SNP (single nucleotide polymorphism) methods. HTAS provides several benefits: (1) it overcomes length homoplasy by revealing SNPs, and other mutations, in addition to length variation; (2) it is cheaper than restriction-site associated SNP methods as microsatellite loci have typically already been identified for the species of interest or can be readily applied from closely related species; (3) future samples can easily be added because primers consistently amplify the same loci in separate runs overtime; and (4) much of the intial preparatory work can be done in a laboratory with limited equipment before being sent for sequencing, making this technique more accessible and further reducing costs. Importantly, HTAS has revealed 61–79% more diversity compared to traditional length-based fragment analysis in some species (Darby et al. 2016; Barbian et al. 2018).
In this study, we focus on the Vancouver Island marmot (Marmota vancouverensis; Swarth 1911; hereafter VIM) which is endemic to Vancouver Island and the only critically-endangered terrestrial mammal species in Canada (COSEWIC 2008, Roach 2017, Vancouver Island Marmot Recovery Team [VIMRT] 2017, COSEWIC 2019). The population declined in the early 1990s, likely as a result of increased predation and habitat modification through timber harvest, resulting in less than 30 wild individuals by 2003 (VIMRT 2017). Low numbers led to the initiation of an intensive captive breeding program in 1997 involving four different breeding facilities (Toronto Zoo, Calgary Zoo, Mountainview Conservation Centre, and Tony Barrett Mt Washington Marmot Recovery Centre). Between 2003 and 2020, 529 captive-bred offspring have been released back into the wild which has helped to recover the current wild population to ~ 200 individuals (Aaltonen et al. 2009, Brashares et al. 2010, Jackson et al. 2016, VIMRT 2017, Werner 2017, COSEWIC 2019, Lloyd et al. 2019).
In addition to behavioural considerations (Casimir et al. 2007), the captive breeding program uses a studbook to record breeding history and select new breeding pairs while attempting to minimize mean kinship (MK) and inbreeding coefficients following the recommendations of Ballou and Lacy (1995). Though our current pedigree contains a maximum of four generations, the inception of the VIM studbook was preceded by several years of behavioural observations in the wild which, along with geographically-dispersed populations, helped to estimate relatedness among most of the founders. The captive population has been self-sufficient since it’s establishment in 2001 until some wild individuals were supplemented in 2019 to boost the population size and the predicted genetic diversity.
Kruckenhauser et al. (2009) genotyped individuals from the wild VIM population during the decline in the mid to late 1990s; predating the release of captive-bred marmots into the wild (VIMRT 2017). Eleven polymorphic microsatellites showed low allelic richness (mean 2.1 alleles/locus) and three genetically-distinct population clusters (Kruckenhauser et al. 2009). The extant population is small, has likely undergone a genetic bottleneck due to severe decline, and is likely to display patterns of genetic drift that further decrease its genetic variation. Thus, we expected length-based fragment analysis to lack sufficient polymorphism to differentiate among individuals making relatedness and inbreeding inestimable. However, HTAS genotyping may uncover hidden variation and provide increased resolution to distinguish individuals. In this study, we used traditional length-based fragment analysis and HTAS genotyping of microsatellite loci in VIM to: (1) determine how much genetic diversity is hidden due to length homoplasy by comparing length-based genotypes to HTAS genotypes; (2) compare molecular estimates of pairwise relatedness and inbreeding with estimates derived from studbook ancestry; (3) compare the genetic diversity in the wild and captive populations; and (4) determine changes in allelic richness over time by comparing previous estimates to current length-based estimates.
Materials and methods
Sample origins and sequencing
Tissue and hair samples from captive and wild marmots were collected by the Calgary Zoo, the Toronto Zoo, and the Marmot Recovery Foundation. A total of 88 individuals sampled between 2005 and 2017 were selected based on their survival to the summer of 2018, and represented both wild and captive populations. Selected individuals included 51 captive-bred and 37 wild-born marmots. Wild marmots originated from ten colonies within the Nanaimo Lakes metapopulation (n = 33) and the Mt. Washington colony in the Strathcona metapopulation (n = 4) (Fig. 2). DNA was extracted from each tissue or hair sample (using a Qiagen DNA Blood and Tissue extraction kit) and amplified at 25 common Sciuridae microsatellite loci using QIAGEN TopTaq® Master Mix (Detailed PCR conditions and primers sequences for all loci are in Supplemental Table S1). Primers were redesigned for loci MA001, MA018 and 3b1 to reduce the length of the amplicon to less than 300 bp for sequencing. Illumina adapter overhang nucleotide sequences (Forward overhang: 5ʹ TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG‐[locus specific sequence], Reverse overhang: 5ʹ GTCTCGTGGGCTCGGxAGATGTGTATAAGAGACAG‐[locus specific sequence]) were added to the 5’ ends of the locus-specific microsatellite primers to facilitate introduction of dual indices and Illumina sequencing adapters (Illumina 2013; Darby et al. 2016). Following amplification, the 25 PCR products were pooled at equal volumes for each individual. Pooled reactions were purified using Agencourt AMPure XP beads (Beckman) using a 1:1 ratio and used as template in a second PCR reaction. Indexing PCR reactions were performed in 25 μL total volume containing 2.5 μL of template, 12.5 μL of 2X KAPA HiFi HotStart Ready Mix (KAPA Biosystems), 2.5 μL of Nextera XT Index 1 primer (N7XX), 2.5 μL of Nextera Index 2 primer (S5XX); (Illumina), and 5 uL of PCR grade H2O. PCRs were cycled at 95 °C for 3 min followed by 8 cycles of 95 °C for 30 s, 55 °C for 30 s and 72 °C for 30 s and completed with a final extension at 72 °C for 5 min. Indexed microsatellite amplicons were pooled across individuals and again purified using Agencourt AMPure XP beads at a 1:1 ratio. The final library was sequenced on an Illumina MiSeq using a V3 2X300 bp sequencing kit and a 15% PhiX spike-in.
Genotyping
In steps similar to Darby et al. (2016), raw sequence data were quality checked, trimmed and merged to produce text files containing both length and sequence information. Briefly, FastQC (Andrews 2010) was used to ensure the average sequence quality was > 20 and that sequences were not over-represented. Paired-end reads were trimmed for low quality bases (-trimns; -trimqualities) and overlapping reads were merged (-collapse) using AdapterRemoval v2 (Schubert et al. 2016). Merged reads were partitioned so that the number of unique reads per individual per primer pair could be determined using USEARCH (Edgar 2010) parameters -fastx_uniques, -fastaout, -sizeout and -tabbedout. These dereplicated reads were filtered by size and length to produce plain text files for each primer pair per individual. The ten read lengths with the highest frequency were used for genotyping. A detailed workflow of the pipeline has been placed on github (https://github.com/jazjanes/VI-marmots). Genotypes were scored manually by visualizing the typical microsatellite stutter profile using unique sequence length and the number of copies of each sequence per individual (Supplemental Figure S1). In determining individual genotypes, the minimum count required for a primary allele was 80 amplicons with a stutter peak within four base-pairs. For a genotype to be heterozygous the secondary allele needed a minimum of 20 amplicons with stutter peaks within four base-pairs, and the shortest allele needed to have the higher amplicon count of the two alleles. In the case of heterozygotes with two different sequences of the same length, the peak representing the second allele had to be greater than 80% of the first allele peak. Sequences with identical lengths were compared in BioEdit (Hall 1999) to identify hidden variation (Supplemental Figure S1). An additional data set was generated where the HTAS genotypes were simplified to create standard length-based genotypes where SNPs were ignored, different sequences of the same length were merged into single alleles, and some heterozygotes became homozygotes.
Data analysis
Allelic richness, expected and observed heterozygosities were calculated using GenAlEx 6.5 (Peakall and Smouse 2006). We used GENEPOP (Raymond and Rousset 1995) to estimate linkage disequilibrium and deviations from Hardy–Weinberg equilibrium (HWE) with a Bonferroni correction to avoid Type 1 error (Rice 1989). Loci that deviated from HWE were not removed from subsequent analysis because we sampled across colonies and some colonies were represented by single individuals. Therefore, the violation of HWE is likely due to non-random sampling, which recent research has shown is not detrimental to subsequent analyses (Trevoy et al. 2018).
Independent Shapiro–Wilk tests revealed that three separate estimates of allelic richness (HTAS genotypes from all loci, length-based genotypes from all loci, and length-based genotypes from six loci in common with Kruckenhauser et al. (2009)) all deviated from a normal distribution. Hence, non-parametric Wilcoxon Signed-Rank tests were used to compare the allelic richness of HTAS vs. length-based genotypes, captive vs. wild individuals, and current length-based vs. previous length-based genotypes from Kruckenhauser et al. (2009). Individual observed heterozygosity (HO) was compared between captive vs. wild populations using a Mann–Whitney U test and between HTAS vs. length-based genotypes using both a Mann–Whitney and a Wilcoxon Signed-Rank test.
Molecular estimates of individual inbreeding coefficients (Ritland 1996) and pairwise relatedness were calculated using HTAS genotypes from all loci in COANCESTRY 2.0 (Wang 2011). A simulation with our data revealed that the TrioML estimator (Wang 2007) had the lowest variance and was thus used to calculate pairwise relatedness (Simulation results are in Supplemental Table S2). As inbreeding coefficients range from -1 to + 1 and TrioML relatedness coefficients range from 0 to 1, values outside of this range are artifacts of extreme homozygosity in the selected microsatellites (Wright 1922). We compared molecular inbreeding coefficients and estimates of pairwise relatedness between wild and captive-born individuals using Mann–Whitney U tests or Student’s t-tests where appropriate. Statistical tests were performed using the RealStats extension in Microsoft Excel (Zaiontz 2013) and all values were reported with mean ± SD.
Studbook estimates of inbreeding and pairwise kinship were calculated in the captive breeding management software PMx 1.6 (Lacy et al. 2012). Pairwise relatedness coefficients were calculated using R(xy) = 2*f(xy)/√{(1 + Fx)(1 + Fy)}, where f(xy) is the studbook kinship between individuals x and y, and Fx and Fy are the studbook inbreeding coefficients of individuals x and y (Crow and Kimura 1970; Galla et al. 2020). Due to non-normal distributions, we used nonparametric Spearman’s rank correlations to assess the relationships between molecular and studbook estimates of both individual inbreeding coefficients and pairwise relatedness. We also evaluated relatedness using a Pearson’s correlation and a Mantel test due to the presence of repeated values leading to rank ties and the dependency of pairwise matrices. All correlations included 47 captive-bred individuals with known parental ancestry going back at least two generations. Correlations were performed in R 4.0 (R Core Team 2020) and we used the ape 5.0 package (Paraidis and Schliep 2018) to complete the Mantel test with 999 permutations.
To investigate differences in allele frequencies among colonies and populations, we used the Bayesian clustering algorithm in STRUCTURE 2.3.4 (Pritchard et al. 2000). We analyzed wild and captive individuals simultaneously across 20 independent runs testing K = 1–5 using both HTAS genotypes from all loci, and length-based genotypes using only the six loci in common with Kruckenhauser et al (2009) in separate analyses. Runs included one million MCMC iterations preceded by 100,000 burn-in with correlated allele frequencies using an admixture model. We used StructureSelector (Li and Liu 2018) and CLUMPAK (Kopelman et al. 2015) to ensure our analyses converged, to determine the optimal number of clusters (considering only the mean Ln P(K) to avoid the recognized biased with ΔK (Janes et al. 2017; Cullingham et al. 2020), and to visualize cluster bar plots. In the event of K = 2, each cluster was reanalyzed separately to investigate the possibility of hidden substructure as recommended by Janes et al. (2017) using an individual q-value threshold of 0.9 for cluster membership.
Results
After bioinformatic processing, we were left with an average of 9,581 ± 4,610 reads per individual, per locus across 25 loci and 88 individuals. Of the 25 loci examined, seven loci were eliminated as reads failed to reach genotyping thresholds in greater than 50% of individuals (2h10, 2h15, Bibl14, GS25, MA002, MS6, MS41), and another seven loci were removed because they were monomorphic (2g4, 3b1, Bibl18, Bibl31 GS12, MA066, MA091) leaving 11 loci for analysis (2g2, 2h4, 2h6, Bibl25, GS14, GS17, MA001, MA018, MS53, MS56, St10). Six of the 11 loci were in common with the loci used by Kruckenhauser et al. (2009; 2g2, GS17, MA018, MS53, MS56, St10). Six individuals were eliminated (one captive and five wild) because they failed to amplify in a minimum of 50% of 11 loci used for analysis, leaving 82 individuals remaining in the analysis; 50 captive-bred and 32 wild-born individuals from ten colonies in the Nanaimo Lakes metapopulation (n = 29) and Mt. Washington in the Strathcona metapopulation (n = 3). Of the 11 successful loci and 82 individuals genotyped, we sequenced an average of 9,854 ± 3,937 reads per individual, per locus, with all loci having a minimum of 3,508 average number of reads per individual. All loci were in linkage equilibrium and although eight of 11 loci deviated from Hardy–Weinberg equilibrium, they were retained in the analysis as specified in the methods (Table 1).
Allelic richness from length-based and HTAS genotypes across 11 polymorphic loci revealed one to four alleles per locus (Table 1) with two novel alleles discovered using HTAS genotyping (8.3% increase in allelic richness). One of these new alleles was at a locus (GS14) that would otherwise have appeared to be monomorphic. These two new alleles decreased the probability of identity (two randomly selected individuals having the same genotype) from less than 1 in 3,000 (length-based) to less than 1 in 6,000 (HTAS). However, using sequence data to overcome length homoplasy did not significantly increase mean allelic richness across all 11 loci (length-based 2.2 ± 0.2, HTAS 2.4 ± 0.2; z = 1.4, n = 11, P(one-tail) = 0.08, Table 1). Interestingly, mean individual heterozygosity derived from HTAS genotypes (0.28 ± 0.14) was greater than length-based genotypes (0.25 ± 0.13) using a paired Wilcoxon test (z = 4.5, n = 82, P < 0.001) but this difference was not supported by an independent Mann–Whitney test (U = 2,924, n1 = n2 = 82, P = 0.08).
Molecular estimates of inbreeding (0.119 ± 0.247) were poorly, but positively, correlated with studbook estimates (0.003 ± 0.005; Spearman’s rho = 0.29, n = 47, P < 0.05, Fig. 3) with three individuals excluded from the analysis (two individuals had less than two generations worth of studbook information available and one had a molecular inbreeding value outside the theoretical range). Molecular estimates of pairwise relatedness (0.15 ± 0.19) were also weakly, but positively, correlated with studbook estimates (0.10 ± 0.14; Spearman’s rho = 0.20, n = 1081, P < 0.0001, Fig. 4). This relationship remained significant with a Pearson correlation (r = 0.34, P < 0.0001) and a Mantel test (z-stat = 25.6, P = 0.001).
Mean allelic richness of the captive population (2.3 ± 0.6) was not statistically different than the wild population across all 11 loci (2.4 ± 0.7; z = 1.0, P = 0.32), and we did not discover any private alleles among wild colonies (Table 2). Likewise, mean individual heterozygosity did not vary between the captive (0.29 ± 0.14) and wild (0.25 ± 0.14) populations (U = 681.5, n1 = 50, n2 = 32, P = 0.13). However, molecular inbreeding coefficients were greater in the wild (0.41 ± 0.14) than the captive (0.15 ± 0.04) population (U = 579.5, n1 = 50, n2 = 32, P < 0.04), although this was partially driven by three outliers (2 wild, 1 captive) with values outside of the theoretical limits of the estimate (− 1 to + 1). With these three outliers removed inbreeding coefficients became normally distributed and the wild population was only marginally greater than the captive population (0.25 ± 0.05, 0.13 ± 0.04 respectively; t(77) = 1.99, P = 0.051). Pairwise relatedness estimates among wild marmots (0.14 ± 0.18) were not different from the captive (0.15 ± 0.19) population (U = 312,807.5, n1 = 1,081, n2 = 595, P = 0.17).
A Bayesian STRUCTURE analysis using the HTAS genotypes from all 11 loci identified two genetic clusters (K = 2) across all runs (Supplemental Figures S2-S3). However, there was no obvious geographic pattern of cluster separation between the wild and captive populations or across wild colonies as both clusters were represented equally in wild and captive populations (Fig. 5a). A second STRUCTURE analysis using only length-based genotypes from the six loci in common with Kruckenhauser et al. (2009) did not change the number of clusters, or the representation of genetic structure across populations or colonies (Fig. 5b, Supplemental Figures S2-S4). Furthermore, subsequent STRUCTURE analyses within each cluster did not reveal any hidden substructure using either the full HTAS dataset or the reduced dataset with only six loci.
Using only the six loci in common with Kruckenhauser et al. (2009), allelic richness from length-based genotypes was 2.5 ± 0.8 which was not statistically different from 2.2 ± 0.4 estimated by Kruckenhauser et al. (2009) (z = 1.4, P = 0.16). Expanding our dataset to ten loci, including the same six polymorphic loci in common between the two studies plus Bibl18 (monomorphic in our study but polymorphic in Kruckenhauser et al. 2009) and three other loci (Bibl31, GS12, MA091) deemed monomorphic in both studies, still did not reveal a significant change in mean allelic richness over time (z = 0.6, P = 0.56).
Discussion
We compared the genetic diversity of VIM using both length-based microsatellite genotyping and next-generation sequencing to assess the ability of the HTAS approach to detect greater genetic variation in a species with low genetic diversity. Additionally, we used HTAS to compare molecular and studbook-derived estimates of relatedness and inbreeding. We determined that the HTAS approach did not detect significantly more genetic diversity compared to traditional fragment length analysis. However, the field behavioural observations used to establish the studbook created a reliable and effective way of minimizing inbreeding. Wild and captive populations were similar in all aspects except that the wild population had marginally higher molecular estimates of inbreeding. Genetic variation in VIM appears to have remained constant since the previous estimates, although some population structure may have degraded.
Length-based vs HTAS genotyping
Despite our expectations, the HTAS approach did not lead to a significant increase in allelic richness compared to traditional length-based genotyping. Only two additional alleles were identified across 11 loci. In contrast, chimpanzees (Pan troglodytes) had a 61% increase in mean allelic richness using HTAS (Barbian et al. 2018) while muskrats (Ondatra zibethicus) saw a 79% increase (Darby et al. 2016). However, the conflicting results may be attributable to the relatively low genetic diversity of VIM. For example, the mean allelic richness using length-based genotyping was 6.4 in chimpanzee (Barbian et al. 2018) and 14.9 in muskrat (Darby et al. 2016), while VIM was just 2.1 historically (Kruckenhauser et al. 2009) and 2.2 in this study. Likewise, mean expected heterozygosity was 0.75 in chimpanzee, 0.82 in muskrat but only 0.33 in our study. VIM is an endemic species with historically small population sizes; it is likely to be heavily impacted by genetic drift which would contribute to extremely limited genetic diversity. The Vancouver lamprey (Entosphenus macrostomus), the only other species endemic to Vancouver Island that we are aware of (although we recognize there are several endemic subspecies), also exhibits low genetic variation (mean allelic richness of 3.1, mean expected heterozygosity of 0.50; Taylor et al. 2012). Both studies by Darby et al. (2016) and Barbian et al. (2018) showed that the HTAS approach increased HO at all loci, while we only saw an increase in two of the eleven loci. However this was enough to significantly increase mean individual HO when using a more sensitive paired Wilcoxon test, though the lack of significance with the Mann–Whitney test suggests this difference was small and not robust. This potential increase is the result of finding heterozygotes with the HTAS approach that would have otherwise appeared to be homozygotes using traditional length-based genotyping (two different sequences of the same length). In all, despite the reduced cost and increased accessibility of the HTAS approach, this technique may have limited applications in the conservation and management of endangered species with reduced diversity as there are also fewer alleles hidden by homoplasy.
Studbook vs HTAS estimates
We found that molecular and studbook estimates of both inbreeding and pairwise relatedness were positively correlated, although both relationships contained considerable unexplained variance. Generally, molecular genotypes tend to overestimate relatedness due to the presence of identical-by-state (but not identical-by-descent) alleles which artificially increase similarity among individuals (Taylor 2015; Taylor et al. 2015). In contrast, studbooks can underestimate relatedness by not accounting for relations prior to the start of record-keeping (Wells et al. 2018; Hogg et al. 2019). Also, studbook estimates are based on the 50% rule of Mendelian inheritance which can under or overestimate true relatedness between kin (Galla et al. 2020).
Given the low allelic richness in VIM, our molecular data are challenged to calculate relatedness and inbreeding with a high degree of confidence, which likely contributes in part to the unexplained variance (Taylor 2015; Taylor et al. 2015; Galla et al. 2020). However, our studbook greatly benefitted from prior behavioural observations to help estimate relatedness among founders. This approach is highly recommended as it can improve the reliability of kinship and inbreeding estimates, and minimize unknown relationships among founders (Russell et al. 1994, Kennedy et al. 2014, Taylor 2015, Frankham et al. 2017, Hogg et al. 2019, Galla et al. 2021). In this sense, our studbook is likely more accurate in estimating relatedness compared to studies that did not have any prior information during inception. However, new wild marmots have recently been recruited into the captive breeding program but are lacking pedigree data to estimate how they might be related to captive individuals. This presents a drawback to relying on the studbook alone, suggesting a hybrid approach of both behavioural and molecular data sets is likely ideal (Galla et al. 2020, Galla et al. 2021).
Wild vs captive VIM
The loss of genetic diversity as a result of captive breeding is a known concern, one that is often followed by the recommendation that molecular markers be incorporated into studbook data (Ivy et al. 2009; Hogg et al. 2019; Ayala-Burbano et al. 2020). The VIM captive population was originally established with individuals from as many colonies as possible and hence we might expect to see limited differences between the wild and captive populations. Indeed, our results confirmed our expectations as we found no difference in allelic richness or individual heterozygosity between populations. Furthermore, our STRUCTURE analyses demonstrated that allele frequencies did not vary between the wild and captive populations. The maintenance of VIM genetic diversity speaks to the success of capturing genetic variation in the wild, incorporating behavioural data early on, and the diligent efforts to maintain diversity through the selection of breeding pairs (VIMRT 2017). Interestingly, our results suggested that the captive population was possibly less inbred than the wild and there may be multiple reasons for this. First, the statistical significance was mostly driven by three data points (these individuals had inbreeding values greater than one, which is outside of theoretical limits) which once removed resulted in only a marginally significant difference. Second, the breeding program intentionally minimized inbreeding in the captive population, while wild marmot colonies are usually geographically separated, small populations with limited gene flow (VIMRT 2017; COSEWIC 2019). Third, the accuracy of inbreeding and relatedness estimates is known to be dramatically reduced when using few low diversity microsatellites (Taylor et al. 2015). Genotyping techniques with greater resolution would provide more confidence around these estimates therefore, we remain conservative about our ability to accurately quantify inbreeding coefficients and what these values imply about the two populations as a whole. Consequently, while the captive population may be less inbred than the wild population, this does not indicate that the wild individuals are severely inbred.
Genetic diversity in the wild over time.
Due to the severe population decline since the previous study by Kruckenhauser et al. (2009, VIMRT 2017, COSEWIC 2019), we anticipated a noticeable decrease in genetic diversity in the wild. Contrary to expectations, our estimate of mean allelic richness in VIM was not significantly different from the previous estimate (Kruckenhauser et al. 2009). In fact, we identified two new length-based alleles in our study (at loci 2g2 and MA018) that were not identified in the previous study. The most likely reason for this discrepancy is incomplete sampling of the population as these new alleles were relatively low in frequency in our study. While the mutation of new alleles between sampling events (approximately three generations; VIMRT 2017) seems unlikely, this remains a possibility.
Overall, we failed to observe a decrease in genetic variation as would be expected through drift. This may be attributed to the pre-existing low diversity as well as successful conservation management. Conservation efforts have been focused on maintaining gene flow between colonies through translocations and releases, and careful management and implementation of the captive-breeding program by the Marmot Recovery Foundation and the Calgary and Toronto Zoos. In fact, the software PMx estimates that our pedigree has retained 96% of the original genetic diversity from the founding captive population. However, translocations among colonies and releases from captive populations are not without consequence; population genetic structure seems to have eroded since the initial assessment before translocations and releases began. Kruckenhauser et al. (2009) identified three genetic clusters (K = 3), including one cluster for Mt. Washington and two clusters split across the Nanaimo Lakes metapopulation. Our study also shows two clusters split across the Nanaimo Lakes metapopulation but fails to separate Mt. Washington as a unique cluster. Although only three samples from Mt. Washington were genotyped in the current study, these individuals did not have any private alleles which is in contrast to Kruckenhauser et al. (2009) who found private alleles at five loci. Additionally, one locus (Bibl18), identified by Kruckenhauser et al. (2009) as containing a fixed but private allele at Mt. Washington, now appears to be monomorphic across all individuals sampled. While our results may be due to small sample size, they are more likely suggesting that the pre-existing genetic structure between Mt. Washington (Strathcona metapopulation) and the Nanaimo Lakes metapopulation has been eroded. This is not surprising considering the Marmot Recovery Team determined the genetic variance in the Mt. Washington colony would be best intermixed across all populations, leading to extensive translocations among wild colonies and releases of marmots to and from the Mt. Washington colony (VIMRT 2017; Lloyd et al. 2019).
Conclusions
We determined that genetic diversity of the Vancouver Island marmot appears to be unaffected by several generations of captive breeding, with wild and captive populations maintaining allelic diversity over time. This is likely attributed to the early implementation of behavioural and molecular data in the establishment of the studbook and diligent efforts to minimize inbreeding among mating pairs ever since. Moving forward, we recommend current and future captive breeding programs incorporate molecular data to help establish or improve studbooks and inform mate selection. Conversely to our expectations, we also determined that while HTAS genotyping did reveal a few hidden alleles, these did not lead to a significant increase in estimates of genetic diversity, indicating this technique may have limited value in species with especially low genetic variation. The HTAS technique should still be explored further for its potential suitability in the conservation of endangered species, although some species, like the Vancouver Island marmot, may benefit more by using a sequencing technique that has the potential to capture a greater number of genome wide markers (e.g. restriction-site associated DNA sequencing or whole genome sequencing).
Data availability
All data generated or analysed during this study are included in this published article (and its supplementary information files).
Code availability
The script to organize sequence data is available at https://github.com/jazjanes/VI-marmots.
References
Aaltonen K, Bryant AA, Hostetler JA, Oli MK (2009) Reintroducing endangered Vancouver Island marmots: survival and cause-specific mortality rates of captive-born versus wild-born individuals. Biol Conserv 142:2181–2190
Andrews S (2010) FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc
Ayala-Burbano PA, Junior PM, Wormell D, Pissinatti A, Marques MC, de Freitas PD (2020) Studbook and molecular analyses for the endangered black-lion-tamarin; an integrative approach for assessing genetic diversity and driving management in captivity. Sci Rep 10:6781
Ballou JD, Lacy RC (1995) Identifying genetically important individuals for management of genetic variation in pedigreed populations. In: Ballou JD, Gilpin M, Foose TJ (eds) Population management for survival and recovery. Columbia University Press, New York, pp 76–111
Barbanti A, Martin C, Blumenthal JM, Boyle J, Broderick AC, Collyer L, Ebanks-Petrie G, Godley BJ, Mustin W, Ordóñez V, Pascual M, Carreras C (2019) How many came home? Evaluating ex situ conservation of green turtles in the Cayman Islands. Mol Ecol 28:1637–1651
Barbian HJ, Connell AJ, Avitto AN, Russell RM, Smith AG, Gundlapally MS, Shazad AL, Li Y, Bibollet-Ruche F, Wroblewski EE, Mjungu D, Lonsdorf EV, Stewart FA, Piel AK, Pusey AE, Sharp PM, Hahn BH (2018) CHIIMP: an automated high-throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees. Ecol Evol 8:7946–7963
Brashares JS, Werner JR, Sinclair ARE (2010) Social ‘meltdown’ in the demise of an island endemic: Allee effects and the Vancouver Island marmot. J Anim Ecol 79:965–973
Brichieri-Colombi TA, Lloyd NA, McPherson JM, Moehrenschlager A (2019) Limited contributions of released animals from zoos to North American conservation translocations. Conserv Biol 33:33–39
Canessa S, Converse SJ, Wess M, Clemann N, Gillespie G, McFadden M, Silla AJ, Parris KM, McCarthy MA (2015) Planning for ex situ conservation in the face of uncertainty. Conserv Biol 30:599–610
Casimir DL, Moehrenschlager A, Barclay RMR (2007) Factors influencing reproduction in captive Vancouver Island marmots: implications for captive breeding and reintroduction programs. J Mammal 88:1412–1419
Ceballos G, Ehrlich PR, Barnosky AD, Garcia A, Pringle RM, Palmer TM (2015) Accelerated modern human-induced species losses: Entering the sixth mass extinction. Sci Adv 1:e1400253
Ceballos G, Ehrlich PR, Dirzo R (2017) Biological annihilation via the ongoing sixth mass extinction signaled by vertebrate population losses and declines. PNAS 114:E6089–E6096
Ceballos G, Ehrlich PR, Raven PH (2020) Vertebrates on the brink as indicators of biological annihilation and the sixth mass extinction. PNAS 117:13596–13602
COSEWIC (2008) COSEWIC assessment and update status report on the Vancouver Island marmot Marmota vancouverensis in Canada. Committee on the Status of Endangered Wildlife in Canada. Ottawa.
COSEWIC (2019) COSEWIC assessment and status report on the Vancouver Island Marmot Marmota vancouverensis in Canada. Committee on the Status of Endangered Wildlife in Canada. Ottawa.
Crow JF, Kimura M (1970) An introduction to population genetics theory. Harper & Row Publishers, New York
Cullingham CI, Miller JM, Peery RM, Dupuis JR, Malenfant RM, Gorrell JC, Janes JK (2020) Confidently identifying the correct K value using the ΔK method: When does K=2? Mol Ecol 29:862–869
Darby BJ, Erickson SF, Hervey SD, Ellis-Felege SN (2016) Digital fragment analysis of short tandem repeats by high-throughput amplicon sequencing. Ecol Evol 6:4502–4512
Edgar RC (2010) Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26:2460–2461
Frankham R, Ballou JD, Ralls K, Eldridge MDB, Dudash MR, Fenster CB, Lacy RC, Sunnucks P (2017) Genetic management of fragmented animal and plant populations. Oxford University Press, Oxford
Galla SJ, Moraga R, Brown L, Cleland S, Hoeppner MP, Maloney RF, Richardson A, Slater L, Santure AW, Steeves TE (2020) A comparison of pedigree, genetic and genomic estimates of relatedness for informing pairing decisions in two critically endangered birds: implications for conservation breeding programmes worldwide. Evol Appl 5:991–1008
Galla SJ, Brown L, Couch-Lewis (Ngāi Tahu: Te Hapū o Ngāti Wheke, Ngāti Waewae) Y, Cubrinovska I, Eason D, Gooley RM, Hamilton JA, Heath JA, Hauser S S, Latch EK, Matocq MD, Richardson A, Wold JR, Hogg CJ, Santure AW, Steeves TE (2021) The relevance of pedigrees in the conservation genomics era. Mol Ecol. https://doi.org/10.1111/mec.16192
Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser 41:95–98
Hedrick PW, Fredrickson RJ (2008) Captive breeding and the reintroduction of Mexican and red wolves. Mol Ecol 17:344–350
Hogg CJ, Wright B, Morris KM, Lee AV, Ivy JA, Grueber CE, Belov K (2019) Founder relationships and conservation management: empirical kinships reveal the effect on breeding programmes when founders are assumed to be unrelated. Anim Conserv 22:348–361
Huisman J, Loeske EB, Kruuk PAE, Clutton-Brock T, Pemberton JM (2016) Inbreeding depression in a wild mammal population. Proc Natl Acad Sci USA 13:3585–3590
Illumina (2013) 16S metagenomic sequencing library preparation. http://support.illumina.com/documents/documentation/chemistry_documentation/16s/16s-metagenomic-library-prep-guide-15044223-b.pdf. Accessed 13 June 2019.
Ivy JA, Miller A, Lacy RC, Dewoody A (2009) Methods and prospects for using molecular data in captive breeding programs: an empirical example using Parma Wallabies (Macropus parma). J Hered 100:441–454
Ivy JA, Putnam A, Navarro A, Gurr J, Ryder O (2016) Applying SNP-derived molecular coancestry estimates to captive breeding programs. J Hered 107:403–412
Jackson CL, Schuster R, Arcese P (2016) Release date influences first-year site fidelity and survival in captive-bred Vancouver Island marmots. Ecosphere 7:e01314
Janes JK, Miller JM, Dupuis JR, Malenfant RM, Gorrell JC, Cullingham CC, Andrew RL (2017) The K=2 conundrum. Mol Ecol 26:3594–3602
Kennedy ES, Grueber CE, Duncan RP, Jamieson IG (2014) Severe inbreeding depression and no evidence of purging in an extremely inbred wild species—the Chatham Island black robin. Evolution 68:987–995
Knief U, Hemmrich-Stanisak G, Wittig M, Franke A, Griffith SC, Kempenaers B, Forstmeier W (2015) Quantifying realized inbreeding in wild and captive animal populations. Heredity 114:397–403
Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I (2015) CLUMPAK: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour 15:1179–1191
Kruckenhauser L, Bryant AA, Griffin SC, Amish AJ, Wilhelm P (2009) Patterns of within and between-colony microsatellite variation in the endangered Vancouver Island marmot (Marmota vancouverensis): implications for conservation. Conserv Genet 10:1759–1772
Kyle R, Beatty GE, Roberts D, Provan J (2017) Using genetic monitoring to inform best practice in a captive breeding programme: inbreeding and potential genetic rescue in the freshwater pearl mussel Margaritifera margaritifera. Conserv Genet 17:1323–1332
Lacy RC, Ballou JD, Pollak JP (2012) PMx: software package for demographic and genetic analysis and management of pedigreed populations. Methods Ecol Evol 3:433–437
Li YL, Liu JX (2018) StructureSelector: a web-based software to select and visualize the optimal number of clusters using multiple methods. Mol Ecol Resour 18:176–177
Limoges B, Boisseau G, Gratton L, Kasisi R (2013) Terminologie relative à la conservation de la biodiversité in situ. Nat Can 137:21–27
Lloyd NA, Hostetter NJ, Jackson CL, Converse SJ, Moehrenschlager A (2019) Optimizing release strategies: a stepping-stone approach to reintroduction. Anim Conserv 22:105–115
Modesto P, Biolatti C, Favaro L, Colussi S, Peletto S, Piga S, Riin MV, Pessani S, Trincas E, Isaja V, Acutis PL (2018) Molecular genetics unveiled unknown family relationships and hybrids in an ex-situ colony of African penguins. J Hered 109:653–662
Ólafsdóttir GÁ, Kristjánsson T (2008) Correlated pedigree and molecular estimates of inbreeding and their ability to detect inbreeding depression in the Icelandic sheepdog, a recently bottlenecked population of domestic dogs. Conserv Genet 9:1639–1641
Paradis E, Schliep K (2018) ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35:526–528
Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes 6:288–295
Peters H, O’Leary BC, Hawkins JP, Roberts CM (2015) Identifying species at extinction risk using global models of anthropogenic impact. Glob Chang Biol 21:618–628
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotyping data. Genetics 155:945–959
R Core Team (2020) R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/
Ralls K, Ballou JD (2004) Genetic status and management of California condors. Condor 106:215–228
Raymond M, Rousset F (1995) GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism. J Hered 86:248–249
Rice WR (1989) The sequential Bonferroni test. Evolution 43:223–225
Ritland K (1996) Estimators for pairwise relatedness and inbreeding coefficients. Genet Res 67:258–275
Roach N (2017) Marmota vancouverensis. The IUCN red list of threatened species 2017: e.T12828A22259184. https://doi.org/10.2305/IUCN.UK.2017-2.RLTS.T12828A22259184.en.
Robin LA (2003) Maintaining behavioural diversity in captivity for conservation: natural behaviour management. Anim Welf 12:85–94
Rollinson N, Keith DM, Houde ALS, Deves PV, Mcbride MC, Hutchings JA (2014) Risk assessment of inbreeding and outbreeding depression in a captive-breeding program. Conserv Biol 28:529–540
Russell WC, Thorne ET, Oakleaf R, Ballou JD (1994) The genetic basis of black-footed ferret reintroduction. Conserv Biol 8:263–266
Schubert M, Lindgreen S, Orlando L (2016) AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res Notes 9:88
Sekino M, Saitoh K, Yamada T, Kumagai A, Hara M, Yamashita Y (2003) Microsatellite-based pedigree tracing in a Japanese flounder Paralichthys olivaceus hatchery strain: implications for hatchery management related to stock enhancement program. Aquaculture 221:255–263
Selkoe KA, Toonen RJ (2006) Microsatellites for ecologists: a practical guide to using and evaluating microsatellite markers. Ecol Lett 9:615–629
Swarth HS (1911) Two new species of marmots from north-western America. Univ Calif Publ Zool 7:201–204
Taylor HR (2015) The use and abuse of genetic marker-based estimates of relatedness and inbreeding. Ecol Evol 15:3140–3150
Taylor EB, Harris LN, Spice EK, Docker MF (2012) Microsatellite DNA analysis of parapatric lamprey (Entosphenus spp.) populations: implications for evolution, taxonomy, and conservation of a Canadian endemic. Can J Zool 90:291–303
Taylor HR, Kardos MD, Ramstad KM, Allendorf FW (2015) Valid estimates of individual inbreeding coefficients from marker-based pedigrees are not feasible in wild populations with low allelic diversity. Conserv Genet 16:901–913
Trevoy SA, Janes JK, Muirhead K, Sperling FA (2018) Repurposing population genetics data to discern genomic architecture: a case study of linkage cohort detection in mountain pine beetle (Dendroctonus ponderosae). Ecol Evol 9:1147–1159
Vancouver Island Marmot Recovery Team (2017) Recovery plan for the Vancouver Island Marmot (Marmota vancouverensis) in British Columbia. Prepared for the B.C. Ministry of Environment, Victoria, BC.
Wang J (2007) Triadic IBD coefficients and applications to estimating pairwise relatedness. Genet Rese 89:135–153
Wang J (2011) COANCESTRY: a program for simulating, estimating and analysing relatedness and inbreeding coefficients. Mol Ecol Resour 1:141–145
Wells DA, Cant MA, Nichols HJ, Hoffman JI (2018) A high-quality pedigree and genetic markers both reveal inbreeding depression for quality but not survival in a cooperative mammal. Mol Ecol 27:2271–2288
Werner JR (2017) Getting back to normal: space use and behaviour of reintroduced and wild Vancouver Island marmots. Can Wildl Biol Manag 6:66–78
Wisely SM, McDonald DB, Buskirk SW (2003) Evaluation of the genetic management of the endangered black-footed ferret (Mustela nigripes). Zoo Biol 22:287–298
Witzenberger KA, Hochkirch A (2011) Ex situ conservation genetics: a review of molecular studies on the genetic consequences of captive breeding programmes for endangered animal species. Biodivers Conserv 20:1843–1861
Wright S (1922) Coefficients of inbreeding and relationship. Am Nat 56:330–338
Zaiontz C (2013) Real Statistics Resource Pack software (Release 5.4) Excel extension. http://www.real-statistics.com/
Acknowledgements
We thank Liz Gillis and two anonymouse reviewers for providing valuable feedback on an early draft of this manuscript, the Marmot Recovery Foundation for sharing samples and studbook data, and Typhenn Brichieri-Colombi for illustrating the map. This research was funded by a grant from the Scholarship, Research and Creative Activity Office at VIU, and a Discovery Grant from the Natural Sciences and Engineering Research Council (NSERC) of Canada to JCG. Sequencing was performed at the Molecular Biology Services Unit in the Department of Biological Sciences at the University of Alberta. KGB completed this research as an undergraduate student and was supported by a REACH scholarship from VIU and an undergraduate student research award from NSERC.
Funding
This study was funded by the Scolarship, Research and Creative Activity office at Vancouver Island University and a Discovery Grant (#RGPIN-2018-06764) from the Natural Sciences and Engineering Research Council of Canada to JCG.
Author information
Authors and Affiliations
Contributions
KGB, GA and JCG conceived the study, while lab work was performed by KGB, GA, ME and CSD. MLM, JC, and AM provided samples, studbook data and intimate knowledge of VIM and the captive-breeding program. KGB, JKJ and JCG analyzed the molecular data and KGB and JCG drafted the manuscript with input from all authors.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare they have no conflict of interest.
Ethical approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
All authors give their consent for publication.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Barrett, K.G., Amaral, G., Elphinstone, M. et al. Genetic management on the brink of extinction: sequencing microsatellites does not improve estimates of inbreeding in wild and captive Vancouver Island marmots (Marmota vancouverensis). Conserv Genet 23, 417–428 (2022). https://doi.org/10.1007/s10592-022-01429-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10592-022-01429-7