1 Introduction

In recent years, as fossil fuel decreases rapidly and the current crisis of global warming due to the use of fossil fuels, people have to find new renewable “green” energy including wind energy, solar energy, hydroenergy and so on. In order to develop the abundant water resource, control flood and relieve drought, more and more key water-control projects have been constructed especially in south China. However, these projects will affect river ecosystems seriously by disrupting thermal regimes, disconnecting river corridors and modifying aquatic and terrestrial habitats [1, 2]. Meanwhile, with the improvement of people’s living standards, the proportion of fish on the dinner table is also getting higher and higher, which further leads to overfishing. Thus, the wild fishes in river and lakes have been under severe threat.

The Zig-zag eel (Mastacembelus armatus) is an important fish for human consumption that is widely distributed in fresh water rivers of south China. M. armatus belongs to the Mastacembelidae family within the Synbranchiformes order. M. armatus has contributed to the Asian aquaculture economy, with wide distribution in India, Vietnam, Nepal, Malaysia, Southern China, and other parts of South East Asia [3]. As an important economic fish, its populations face some challenges including overfishing, water pollution and habitat destruction. Thus, the wild resource of Zig-zag eel has gradually declined in recent years [4]. Due to these challenges, understanding the present situation of germplasm resources is extremely important for conservation management and sustainable utilization of this species.

Genetic diversity reflects the living and adaptive state of the species, which is critically important for the management and sustainable exploitation of species [5]. High genetic diversity of the species usually means strong adaptability to environmental change. Microsatellites, also named simple sequence repeats, are neutral co-dominant genetic markers [6, 7]. Because of the locus specificity, highly polymorphism and high reproducibility, microsatellites have been widely used in assessment of genetic diversity and population structure in many species [5, 8].

Previous studies have mainly focused on the reproductive biology, breeding, immunity and nutrition of M. armatus [9,10,11,12,13,14,15]. However, few genetic studies have been published, and information regarding genetic variation and population structure for this species is scarce. Yang et al. [16] examined genetic diversity in 11 populations of M. armatus using ISSR (inter-simple sequence repeat), and the results indicated a relatively high diversity of M. armatus. Cytochrome c oxidase sub-unit, cytochrome b and D-loop region of mtDNA and ribosomal S7 introns 1 and 2 of nuclear genome have been used to evaluate its phylogeny and population structure [17,18,19]. Although, its complete mitochondrial and nuclear genomes have been published [20, 21], genetic markers were not well mined. Thus, it’s necessary to get more information on the genetic diversity and population structure of this zig-zag eel.

In this study, we successfully discovered multiple novel microsatellites based on RADseq data. These microsatellites were first tested in 35 individuals from seven wild populations and then ten microsatellites were further used to study the genetic diversity and population differentiation of seven populations of M. armatus. The genetic information obtained would provide valuable information for developing conservation and management strategies of Zig-zag eel.

2 Materials and methods

2.1 Compliance with ethical standards

Guidelines for the care and use of animals at Guangzhou University were followed. All experimental protocols were approved by Special Committee on Science Ethics of Guangzhou University, Guangzhou, China. An Institutional Review Board (IRB) approval (#115/2020, dated 25/10/2020) was obtained.

2.2 Samples and DNA extraction

A total of 188 individuals of M. armatus were collected from 7 locations in different river systems in south China (Table S1, Fig. 1), including Beijiang (BJ; n = 30), Xijiang (XJ; n = 30), Dongjiang (DJ; n = 30), Hanjiang (HJ; n = 30), Tanjiang (TJ; n = 21), Nandujiang (NDJ; n = 22) and Yuanjiang (YJ; n = 25). Sampling information including the date acquired, location, and river was shown in Table 1. The XJ, DJ, BJ, and TJ rivers belong to the Pearl River System. Muscle tissues were preserved in 95% ethanol and stored at − 80 °C. Genomic DNA was extracted using the Ezup Column Animal Genomic DNA Purification Kit (Sangon Biotech, Shanghai, China).

Fig. 1
figure 1

Map of the sampling sites where the M. armatus populations were obtained. BJ: Beijiang; XJ: Xijiang; DJ: Dongjiang, HJ: Hanjiang; TJ: Tanjiang; NDJ: Nandujiang; and YJ: Yuanjiang

Table 1 Primer information of 10 microsatellite loci

2.3 Development and characterization of microsatellite markers

According to RADseq data of M. armatus (unpublished data), the potential SSR motifs were identified using MicroSAtellite identification tool (MISA, http://pgrc.ipk-gatersleben.de/misa/) [22]. Twenty-eight primers were randomly selected and synthesized by Sangon Biotech (Shanghai, China) (Table S2). The primers were labelled with a fluorochrome (FAM) and verified on a subset of random 35 individuals from seven wild populations (five in each population). Polymerase chain reaction (PCR) was performed in 25-μl reactions containing 20 ng genomic DNA, 0.5 μM of each primer, 7.5 μl of ddH2O, and 12.5 μl 2 × Taq Master Mix (GDSbio, China). The amplification conditions were as follows: an initial denaturation at 94ºC for 5 min; followed by 30 cycles of 94 °C for 45 s, 58 °C for 25 s, and 72 °C for 30 s; and a final extension at 72 °C for 10 min. The PCR products were first separated by 1.0% agarose gel electrophoresis and then analyzed on ABI3730XL DNA Analyzer (Applied Biosystems). The PCR products were sized using GeneMapper software (version 4.0) with size standards GS500 or GS500Liz. Polymorphic primers were selected for subsequent experiments.

Ten primer sets with high PIC were selected for genotyping all samples in the 7 studied populations (Table 1). One primer in each set was labelled with a fluorophore (FAM, HEX or TAMRA) and used for PCR. PCR products were separated using an ABI3130 DNA Sequencer (Applied Biosystems). PCR was performed by first mixing 20 ng of template DNA, 0.5 μM of forward primer and reverse primer, 5 μl 2 × Taq Master Mix, and 3 μl ddH2O in a total volume of 10 µl. Amplification was performed using the following thermocycling conditions: 94 °C for 5 min; 30 cycles of 94 °C for 30 s, the appropriate annealing temperature (Table 1) for 30 s, and 72 °C for 30 s; with a final extension at 72 °C for 10 min. PCR products were first separated by 1.0% agarose gel electrophoresis and then analyzed on ABI3730XL DNA Analyzer (Applied Biosystems). The PCR products were sized using GeneMapper software (version 4.0) and the size standard was GS500 or GS500Liz.

2.4 Genetic diversity and population structure analysis

Genotyping errors and the possible existence of null alleles were estimated by Microchecker [23]. Genepop (version 4.2) was used to calculate the observed heterozygosity (HO) and expected heterozygosity (HE), the number of alleles at each locus (NA), the effective number of alleles (NE), Nei’s genetic diversity, and the Hardy–Weinberg equilibrium (HWE) of each locus within each population [24]. The polymorphism-information content (PIC) was estimated using PIC-CALC (version 0.6). Population pairwise FST values and analysis of molecular variation (AMOVA) were calculated using Arlequin (version 3.0) [25]. All results for multiple tests were adjusted using Bonferroni's corrections [26]. The gene flow (Nm) between localities was estimated according to the FST using the formula Nm = (1 − FST)/(4 × FST) [25]. Significant correlations between population genetic distances and linear straight geographical distances were assessed using the Mantel test, implemented in TFPGA [27]. The geographic distance was estimated as the shortest water surface distance using Google Maps.

Population differentiation was also examined using the Structure program, with K ranging from 1 to 7 [28]. Ten independent runs for each K value were performed under the admixture-ancestry model with 1,000,000 Markov chain Monte Carlo replications after a burn-in of 100,000 repetitions. The most likely K value was determined using the Structure Harvester program. The number of groups was determined using the delta K method [29]. Recent population-size changes were detected as bottlenecks using the Bottleneck program (version 1.2) [30]. Analysis was performed using a two-phased model of mutation (TPM), a stepwise mutation model (SMM), and an infinite allele model (IAM). Significance was tested using the 2-tailed Wilcoxon signed-rank test and the sign test (1000 iterations, variance for TPM = 50).

3 Results

3.1 Identification of microsatellites and polymorphic analysis

Based on RADseq data, a lot of microsatellites were identified. Then, we randomly selected 28 primer pairs for further polymorphic analysis and the results showed that these microsatellites were all polymorphic. The primer sequences and repeat motifs are listed in Table S2. The average NA per locus was 7.857, ranging from 3 alleles (Mat03, Mat07, Mat15, Mat20, Mat23, Maf07, Maf09, Maf10) to 21 alleles (Mad15). The mean HO and HE values for each locus were 0.513 (0.133–0.867) and 0.604 (0.245–0.944), respectively. Four loci (Mat03, Mat08, Mat20, Mat23) showed significant departure from HWE because of heterozygote deficiency.

The PIC of each locus ranged from 0.236 to 0.928 (mean PIC = 0.596). Seventeen loci were highly informative (PIC > 0.5), 10 loci were reasonably informative (0.5 > PIC > 0.25), and only the Mat20 locus showed a low degree of polymorphism (PIC < 0.25). No evidence of null alleles was detected in the 28 loci.

3.2 Genetic diversity in 7 populations of M. armatus

All 188 eels were successfully genotyped at each of the 10 microsatellite loci. No evidence of genotyping errors was detected across the whole data set. In total, 224 alleles were obtained at the 10 microsatellite loci in several populations of M. armatus. The NA at each locus among the populations ranged from 1 (locus Mad18 in population BJ) to 22 (locus Mad10 in population BJ), with an average number of 8.871 alleles (Table S3). Among all 7 populations, XJ showed the highest NA, with an average number of 11.100 alleles, whereas NDJ had the smallest mean number of alleles (4.600). The HO and HE ranged from 0.550 (NDJ) to 0.964 (YJ) and 0.537 (NDJ) to 0.799 (TJ), respectively. The average HE for all loci was > 0.5 for all populations.

Population genetic analysis of departure from HWE was undertaken to assess the suitability of the 10 selected markers for assessing genetic variability in the 7 M. armatus populations. Loci Mad05 and Mad15 showed a significant departure from HWE in more than 3 populations with heterozygotic deficiency or excess. The numbers of loci that deviated from HWE in the BJ, XJ, TJ, DJ, NDJ, and YJ samples were 3 (Mad02, Mad09, Mad27), 5 (Mad05, Mad10, Mad15, Mad02, Mad27, Mat03), 3 (Mad15, Mad18, Mad29), 3 (Mad05, Mad15, Mad18), 5 (Mad02, Mad05, Mad10, Mad22, Mad29), and 4 (Mad02, Mad05, Mad15, Mat03), respectively. HJ population showed only one locus that deviated from HWE (Mad02). PIC values ranged from 0.033 to 0.937 (mean PIC = 0.664), indicating that most loci have high information content. The highest mean PIC (0.757) and HE values (0.799) were observed in TJ. YJ showed the maximum HO value (0.964).

The results of the bottleneck analysis of M. armatus using 3 tests (IAM, TPM, and SMM) were shown in Table 2. Both Sign and Wilcoxon rank tests revealed that HJ had experienced a recent bottleneck event (P < 0.05). No obvious evidence of a bottleneck was observed in any other samples, including BJ, XJ, YJ, and NDJ.

Table 2 Population bottleneck analysis of the M. armatus populations

3.3 Population structure

Pairwise FST values were used to estimate the degree of genetic differentiation among the 7 M. armatus populations, using the 10 microsatellite markers (Table 3). The average FST value was 0.190, and all pairwise FST values among the populations were highly significant (P < 0.01). The largest differentiation was observed between populations BJ and NDJ (0.405), while the FST value between XJ and YJ was smallest haing a value of 0.009. AMOVA showed that 75% of the genetic variation occurred within individuals, whereas 25% of the variation occurred among populations (Table 4). The pairwise Nm value (Table S4) between populations varied between 0.367 (BJ and NDJ) and 27.528 (YJ and XJ). The Nm values between NDJ population and other populations were all substantially less than 1.0.

Table 3 Nei's genetic distance (above the diagonal) and pairwise FST estimates (below the diagonal) among the M. armatus populations
Table 4 AMOVA results for testing the genetic subdivision between 10 M. armatus populations

The population structure was further studied using the Bayesian program Structure Harvester, and the most likely number of genetic clusters was revealed to be 5 (Fig. 2). Structure results supported the division of M. armatus populations into 5 groups: group 1 (XJ, TJ and YJ), group 2 (BJ), group 3 (DJ), group 4 (HJ) and group 5 (NDJ) (Fig. 3). Interestingly, we observed that each of these 5 groupss was significantly differentiated from each other.

Fig. 2
figure 2

Variation trend of lnP (D) values and ΔK values

Fig. 3
figure 3

Structure bar plots (K = 5) for the 10 microsatellite loci of the 7 populations

4 Discussion

Random amplified polymorphic DNA, ISSR, and amplified fragment length polymorphism are valuable markers for analysing the genetic diversity at the species level, but they also have many limitations, such as a low mutation rate. Previous research has indicated that the most appropriate genetic markers for determining the genetic diversity of fish species may be microsatellite markers because they are co-dominantly inherited and are often highly polymorphic [31]. Various studies have used microsatellite markers to conduct genetic diversity and population structure analysis of teleosts, such as Tanakia somjinensis and Trachidermus fasciatus [32, 33]. However, only one such study has been reported for M. armatus [16]. The current study aimed at developing more polymorphic microsatellites and assessing the genetic diversity and population structure of M. armatus populations in 7 geographic sites using polymorphic microsatellites.

4.1 Developing of polymorphic microsatellites

In this study, 28 polymorphic SSR markers of M. armatus were successfully developed. The average Na per locus (7.857), ranging from 3 to 21 alleles was higher than the previous report using ISSR (Na = 1.321, I = 0.168) [16]. Besides, the mean HO and HE values for each locus were 0.513 (0.133–0.867) and 0.604 (0.245–0.944), which were a little lower than the average values (Na = 9.1, Ho = 0.54) reported by DeWoody based on 13 kinds of freshwater fish [34], suggesting relatively low genetic diversity of M. armatus. According to previous reports, lower genetic diversity was also found in other fishes in the Pearl River like Cranoglanis bouderius (Na = 2.04, Ho = 0.448 and He = 0.462) [35], Paramisgurnus dabryanus (Na = 4.833, Ho = 0.265 and He = 0.504) [36] and Misgurnus anguillicaudatus [37], indicating M. armatus might have relatively stronger adaptability than other fishes in the rivers of south China. The PIC of each locus ranged from 0.236 to 0.928 (mean PIC = 0.596), indicating they were highly informative (PIC > 0.5) [38] and could be further used to evaluate the genetic diversity of population.

4.2 Genetic diversity

The genetic diversity of population has an important influence on the environmental adaptation of species. Endangered, endemic and narrowly distributed species usually have low genetic diversities [39]. Of course, not all endangered species have low genetic diversity [40]. Despite the wild resource of zig-zag eel have declined a lot in recent years, our results showed a relatively high-level genetic diversity. Similarly, based on ISSR and part of mtDNA sequence, previous studies also suggested a high level of genetic diversity of zig-zag eel in south China [16, 41]. Besides, the average numbers of Na (4.600–11.100) and Ne (3.470–6.531) in this study were obviously higher than that reported based on ISSR (1.232–1.402, 1.146–1.277) [16], but similar to that reported based on SSR in BJ (4.000–20.000, 1.544–12.766) [42]. Based on our results, the genetic diversity of YJ and TJ populations was relatively higher among seven populations. High genetic diversity of M. armatus in EnPing and Red River, with similar geographic location to TJ and YJ population, was also revealed [16]. However, the level of genetic diversity of NDJ population in Hainan province was the lowest, which was different from previous findings [16]. Zig-zag eels have been cultured for many years in Hainan and wild eels are captured by farmers and used as breeding parents. Thus, low genetic diversity may be due to overfishing. Besides, the genetic diversity of HJ population was also relatively lower and the bottleneck effect was detected. HJ basin is small, and there are problems of overfishing and damming [43]. Previous study also revealed low genetic diversity of Squaliobarbus curriculus and Macropodus opercularis in HJ [44]. Thus, lower genetic diversity of many fish species in this river may be due to the overfishing and damming.

4.3 Population structure

According to AMOVA analysis, there was 25% of variance among populations. Wright (1978) defined the genetic differentiation coefficient, namely FST < 0.05 indicates low genetic differentiation, 0.05 ≤ FST < 0.15 indicates medium genetic differentiation, and FST ≥ 0.15 indicates high genetic differentiation [45]. There was a high degree of genetic differentiation among most populations of zig-zag eel, namely FST ≥ 0.15. NDJ population significantly differentiated from other populations with low gene flow. From a geographical point of view, NDJ is far away from other water systems and only distributes on Hainan Island. Hainan Island has separated from mainland for a long time (about one million years ago). Thus, long periods of geographic isolation led to higher levels of species differentiation from species distributed in mainland [46]. HJ population also showed high genetic differentiation from other populations, which might be related to overfishing and damming [43]. Also, acting as a dependent river may also cause large differentiation of species in HJ from other populations. Strangely, very low genetic differentiation was found between YJ and XJ population, which suggested the two populations of zig-zag eel existed a close correlation. As we can see, YJ locates very closely to the upper reaches of the XJ, it was reasonable to infer that there might be frequent human activity or some unkown connections between YJ and XJ [47]. Genetic structure analysis also suggested that the M. armatus populations could be split into 5 genetic clusters. XJ, TJ, and YJ population grouped together, while other 3 populations formed a separate group, indicating the genetic relationships among YJ, TJ, and XJ populations were very close.

4.4 Conservation implications

Population genetic diversity is very important for the long-term survival and sustainable utilization of species [48]. After a population experiences the bottleneck effect, the genetic variation of its offspring will be included in the few individuals, which further leads to the loss of some alleles, the reduction of genetic diversity, and the increasing frequency of inbreeding, resulting in germplasm degradation. Thus, for HJ and NDJ populations, it’s necessary to perform in situ conservation, and establish nature reserve. Even, artificial breeding should be performed to transplant a lot of juveniles to HJ and NDJ to increase the size of local population. Besides, increasing minimum flows, installation or improvement of fish ladders could also decrease the effect of dam on fish surviving. For other populations with higher level of genetic diversity, only by controlling the fishing intensity, the population size can be effectively restored. Moreover, these populations with high genetic diversity were also used as good resource for aquaculture and breeding of M. armatus. Besides, solving the problem of artificial breeding and aquaculture is also an important means to solve the fishing pressure of wild zig-zag eel.

5 Conclusion

M. armatus is an economically and ecologically important freshwater fish, which is mainly distributed in South China. However, in recent years, the wild population decreased due to a lot of human activities especially the construction of dams and overfishing. In this study, for the first time, we successfully developed 28 polymorphic microsatellite markers for zig-zag eels. Further, ten of them were used to evaluate the genetic diversity and population structure of 7 populations distributed in south China. The results showed low level of genetic diversity was only found in HJ and NDJ populations and evidence of recent bottleneck was found in the HJ populations. Besides, there was low differentiation among XJ, TJ and YJ populations. These results provided important data of genetic diversity of zig-zag eel, which would further facilitate the germplasm conservation and utilization.