Introduction

Animal genetic resources are the basis of the sustainable development of animal husbandry1,2, and China is one of the foremost countries in this respect, accounting for one-sixth of the world’s animal genetic resources3,4. Two systematic and comprehensive surveys were conducted on animal genetic resources: one from 1970 to 1980, and the other in 2004. According to the Report on Domestic Animal Genetic Resource in China (edited in 2012)5, 777 breeds of animal genetic resources have been formally named in China, including 556 local breeds, 109 cultivated breeds, 104 introduced breeds, and eight other breeds. With respect to breeds of poultry, there are 116 chickens, 34 ducks, 31 geese, three turkeys, three pigeons, and two species of quail.

However, animal genetic resources have shown an overall decline in China since 1970s, due to unknown resources in some areas, low in vivo conservation, loss of animal genetic resources, and large-scale adoption of breeding and intensification processes6. In particular, the large number of imported breeds, and their wide promotion, have greatly threatened Chinese domestic animal genetic resources7. As examples, the Guping chicken, Lintao chicken, Wenshan goose, and Simao goose have all become extinct and, in total, 44 breeds are on the edge of extinction and 15 are endangered8,9. In response to this crisis, the departments responsible for managing Chinese animal genetic resources have allocated yearly funds toward conservation efforts. Since these funds are limited, both conservation strategy and fund allocation are determined by the economic value and population size of a breed10. However, the subjectivity of this system could result in ineffective conservation of precious and endangered genetic resources. Therefore, a better system is needed to determine priority in these conservation efforts and achieve optimal allocation of funds11,12.

One option is through marginal diversity, which was defined by Weitzman in 199213 as a mechanism for measuring genetic diversity. The concept uses genetic and non-genetic factors to calculate a “maximum-likelihood tree”14 and the current diversity of breeds, and estimates the expected variations in diversity over a certain time. This approach defines criteria of diversity and relies on quantitative assessments of different strategies, providing concrete reasoning for breed conservation. At present, marginal diversity has been applied in studies on European pigs15 and cows16; however, no systematic assessment with this approach has been conducted in Chinese domestic duck breeds and the managers also do not know how to allocate funds for breed insurance. Here, we use short tandem repeat profiling to perform a marginal diversity analysis of 21 Chinese domestic duck breeds or genetic resources, which can be used to determine conservation priority.

Materials and Methods

All animal experiments were performed in accordance with the Regulations for the Administration of Experimental Animals issued by the Ministry of Science and Technology (Beijing, China). All experiments were approved by the Animal Care and Use Committee of Yangzhou University.

Breed and genetic distance measurements

The objects of the study were Chinese domestic duck breeds or their genetic resources. Their name, sample size, and origin are shown in Table 1. Blood was collected according to pedigree, to ensure that samples were from unrelated individuals. The samples (0.5 mL) obtained from the vein of the ducks wings were carefully mixed with lysis solution and kept at 4 °C for subsequent DNA extraction. DNA extraction was performed according to the method described by Huang et al.17.

Table 1 Name, sample size, and origin of 21 Chinese domestic duck breeds.

Twelve pairs of microsatellite primers with rich polymorphism were selected as follows: APH01, APL2, AJ272579, AJ272578, AJ272577, AJ415887, AJ515884, AJ515893, AY493256, AY493289, AY493313, and CMO11. The Sequences, combination, and optimal reaction condition have been reported previously18. A total of 1802 ducks were genotyped and the population genetic parameters calculated were described in a published paper18. The standard genetic distance between populations was calculated with Microsatellite-Toolkit19 and Dispan (http://www.softpedia.com/get/Science-CAD/DISPAN.shtml).

PCA and population structure analysis for all breeds

In this study, SPSS13.0 software was used for principal component analysis (PCA) of all the detected alleles20, and Structure 2.0 (http://rosenberglab.bioinformatics.med.umich.edu/distruct.htm) software was used for genetic Structure analysis of 21 populations.

Extinction probability

Extinction probability is an important index for genetic resource diversity. Future changes in the diversity of local breeds or genetic resources can be measured as the extinction probability over time (500 years)21. In Weitzman’s approach, the extinction probability (Zi) of each set is a variable that needs special attention. There are various methods for calculating Zi; however, we adopted the method proposed by Reust-Marti11. This method uses seven variables: the total population size (POS), its change over the past 10 years (CHA), distribution of the breed (DIS), risk of indiscriminate crossing (CRO), organization and conservation measures of breeding (ORG), special traits (SPE), and threat of production transition (PRO) (Table 2). Different weights (wi) were given to different variables to estimate Zi in the future 500 years. The estimate formula and correction formula are as follows:

$$Z{\rm{i}}=\mathop{\sum }\limits_{i=1}^{n}{{\rm{w}}}_{i}{{\rm{x}}}_{i}/\mathop{\sum }\limits_{i=1}^{{\rm{n}}}{\rm{\max }}{{\rm{w}}}_{i}{{\rm{x}}}_{i}$$
(1)

where, wi is the weight of each variable (w1 = 0.35, w2 = 0.15, w3 = 0.14, w4 = 0.10, w5 = 0.10, w6 = 0.06 and w7 = 0.10) and xi is the estimate of the ith indicator. The seven parameters for this analysis were attained by on-site observation, literature review, and estimation, in order to calculate the Zi of each breed and genetic resource in the next 500 years. For the convenience of calculation, Zia of each breed or genetic resource was corrected to 0.1–0.9, according to the formula below22.

$$Zi=\frac{0.8}{1.2}\times \mathop{\sum }\limits_{a=1}^{7}Zia+0.1$$
(2)
Table 2 Influencing factors and criteria of extinction probability.

For the set (S) containing a certain number (N) of breeds and genetic resources, and a breed i, the distance of j ∈ S can be expressed as dij. According to Weitzman’s recursive algorithm, the diversity variable D (S) can be calculated by an N × N distance matrix. The probability of a breed’s existence in 500 years is 1-Zi, if Z is an N-dimensional vector containing Zi of N sets. K is an N-dimensional vector containing the indicator variable Ki (i = 1, 2, … N). Ki = 1 if the set i exists, whereas Ki = 0 if the set i is extinct. Therefore, K represents an overview of the status in which a subset of breeds exists and its complementary subset is extinct. The formula of the existence probability of a subset of breeds is as follows:

$$P(K)=\prod _{{\rm{i}}}({\rm{Ki}}+{(-1)}^{{{\rm{k}}}_{{\rm{i}}}}{{\rm{z}}}_{{\rm{i}}})$$
(3)

DK is the diversity of the subsets safe from extinction. The expected diversity at the end of the time horizon (500 years) is calculated as:

$$E(D)=\sum _{\forall K}P(K){D}_{K}$$
(4)

The variance of the expected diversity is:

$${{\rm{D}}}_{{\rm{i}}}^{^{\prime} }=\frac{\partial E(D)}{\partial {z}_{i}}V{\rm{ar}}({\rm{D}})=\sum _{\forall K}P(K){D}_{K}^{2}-{[\sum _{K}P(K)DK]}^{2}$$
(5)

The marginal diversity of a breed or genetic resource reflects the variation of the expected diversity when the extinction probability is increased by one unit. The marginal diversity is calculated as follows:

$${{\rm{D}}}_{{\rm{i}}}^{^{\prime} }=\frac{\partial E(D)}{\partial {z}_{i}}$$
(6)

Based on the extinction probability and expected diversity of a breed or genetic resource, Weitzman suggested conservation potency as the optimal parameter to assess the genetic diversity over a given time horizon. The conservation potency (CP) is calculated as follows:

$$C{P}_{i}={z}_{i}\times {D}_{i}^{^{\prime} }$$
(7)

CPi represents a possible increase in the expected diversity of a breed or genetic resource when the threat is completely removed. According to previous work by Simianer et al.10, CPi is the optimal parameter for determining conservation schemes, with the highest CPi requiring a minimum amount of capital required for the protection scheme22. The breed or genetic resource with the highest CPi should be allocated the least funds in breeding conservation efforts10,23.

Results

Genetic distance

Twelve simple sequence repeats were detected in the 21 duck breeds. Nei’s standard genetic distance was estimated using Microsatellite-Toolkit and Dispan24. According to the genetic distance matrix (Table 3) and cluster analysis by the unweighted pair-group method with arithmetic means (Fig. 1)25, FY belonging to the Cairina breed forms a single set, whereas the other 20 duck breeds (Anas) form three large sets. The distance between GY and JA was the shortest, and the distance between FY and GX was the longest First bullet.

Table 3 Nei’s standard genetic distance between 21 Chinese domestic duck breeds.
Figure 1
figure 1

Two-dimensional scatter plot of the first and second factors for 21 duck populations. Note: Breed abbreviations are defined in Table 1.

PCA and population structure analysis for all breeds

PCA was performed on the gene frequencies of all alleles detected in 12 SSR seats in 21 populations. The plane distance graph constructed according to the first two principal components was shown in Fig. 1. 21 duck breeds were divided into three large groups. Among them, BJ, ZS, SS, JC, PT, JS, LC were relatively close to each other. JJ, DY and CH were close to each other, forming another group. In addition, 9 breeds including JA, GY, SM, JX, JD, GX, YX, LW constituted a group. The distances between FY and other 20 domestic duck breeds were relatively large.

Structure 2.0 program can group individuals with similar genotypes on multiple SSR seats without prior knowledge of the fusion or evolutionary history of the populations (Fig. 2). When K = 2, no single population was isolated; Of these, 9 breeds such as BJ, JC, JS, LC, MW, PT, SS, TW, ZS were grouped together, and the rest of 12 breeds were clustered into one group. When K = 3 and K = 4, no single group was isolated. FY, LC, PT were clustered into one group. FY ducks were isolated as a single group until K = 5. When K = 7, BJ, JC, JS and ZS still formed a group.CH, DY and JX still gathered into one group: GX, GY, JA, JD CLUSTERED into the last one group.

Figure 2
figure 2

Population structure of 21 populations by the individual Q matrix structure. (Running Structure 1000 times from K = 2 to 7). Note: Breed abbreviations are defined in Table 1.

Extinction probability

The average extinction probability for the 21 Chinese domestic duck breeds and genetic resources was 0.38% (Table 4). ZS, GX, and JA had the largest extinction probabilities, whereas LW, JD, and GY had the smallest.

Table 4 Extinction factor weighting and extinction probability correction of each population.

Current and expected future diversity

The current diversity of the 21 breeds and genetic resources was determined to be 7.72, and the expected diversity of all sets in 500 years is 5.14 ± 1.15. Therefore, an overall decrease of 2.58 (33.43%) is anticipated.

Contributions and marginal diversities of each breed

The term “contributions” is defined as the percentage of contribution of each breed to overall diversity. The contributions and marginal diversities of each of the 21 breeds are shown in Table 5 and Fig. 3. The contribution of FY was the largest (Table 3) followed by GX, whereas JD had the smallest contribution. Similarly, in terms of marginal diversity, FY showed the largest (Table 5), followed by GX, whereas JD has the smallest (−0.12, 2.32). Finally, FY had the highest conservation potency, followed by GX, and JD had the lowest.

Table 5 Marginal diversity of 21 Chinese domestic duck breeds.
Figure 3
figure 3

A maximum-likelihood tree showing the marginal diversity, contribution, and conservation potency of each breed. Note: Breed abbreviations are defined in Table 1.

Discussion

There are numerous domestic duck breeds in China including 27 indigenous breeds, two introduced breeds, and a few developing breeds. However, with the introduction and promotion of cherry valley duck, the number of local duck species in China has dropped sharply, and many species are facing the danger of extinction26. Conservation of genetic diversity plays an important role in sustaining the livestock breeds27. At present, Weitzman marginal diversity method has attracted more and more attention in the research on rational allocation of livestock and poultry resources protection funds, and has become one of the most dynamic theories in the field of livestock genetic resources protection and utilization28,29. There were many researches on animal genetic diversity in the world30,31,32, but few of them analyze the application of marginal diversity method To our knowledge, Reist-Marti et al.(2003) have estimated extinction probability in livestock breeds11. Bennewitz (2005) estimated the extinction probabilities of 5 German dual-purpose cattle breeds by population viability analysis33. And then (2006) he analyzed 44 North Eurasian cattle breeds using simplified determined extinction probabilities. The results show that the expected loss of diversity within the next 50 years is between 1 and 3% of the actual diversity34. The marginal diversity analysis of goat29, sheep35, cattle36, pigs37 has been completed in China, which provides a reliable data reference for the division of conservation funds. After years of investigation of domestic duck resources and collection of blood samples, this paper analyzed the marginal diversity and extinction probability of local duck breeds in China for the first time.

In this paper, the PCA was used to explain the molecular genetic relationships among the populations and a plane distance map was constructed, reflecting the real genetic information and genetic relationships of the 21 populations. Structure cluster analysis use allelic and genotype data from multiple loci, such as SSR loci, to construct a cluster model. Structure 2.0 program was based on Bayesian probability theory, adopt Markov-Monte Carlo simulation algorithm, and used mixed model when running the program to reveal the unknown population genetic relationship and potential population Structure from all population levels38. The expected number of classification (K value) of the detected group was set at runtime, which can be used to divide all individuals and reflect the genetic structure of the group. It is especially suitable for the study of the genetic structure, the differentiation and migration of individuals. In this paper, the population Structure diagram and the maximum-likelihood tree39 obtained based on Structure 2.0 program were consistent with the results from PCA, verifying the accuracy of population Structure inference.

The calculation for extinction probability considers all factors that might cause change in a breed or genetic resource, making it an accurate and reliable estimation40. However, due to the political and economic situation in China, as well as the distribution of indigenous duck breeds and resources, some factors were not considered in this study, such as natural disasters, reliability of the information source, and development of reasonable storage approaches. For this measurement, seven variables (POS, CHA, DIS, CRO, ORG, SPE, and TRA) were assigned to different weights as major factors. These variables have been proved to be important factors reflecting population diversity. Here, we calculated the current and expected population diversities of a total of 21 Chinese duck breeds, respectively. Importantly, we found that the expected diversity (within 500 years) were 33.43% lower than current diversity.

Some variables for calculating the extinction probability, such as CHA and CRO, only consider the conservation of a single breed or genetic resource, and do not account for the effect of the conserved breed on the genetic diversity of the entire population41. If limited breeding conservation funding is allocated based on extinction probability parameters, it may not be the most beneficial solution for the entire population, especially if that population includes numerous breeds and strains. Instead, the breed with the largest contribution should be given the highest priority42,43. In this study, the largest contributor was FY, followed by GX.

However, breed contribution is not the only consideration for conservation planning efforts, and its calculation does not consider extinction probability. In contrast, marginal diversity considers both contribution and the extinction probability, and can therefore act as a comprehensive measurement of the importance of each breed. According to Weitzman, marginal diversity parameters should be considered the preferred reference during breeding conversation planning44. Conserving the breed with the largest conservation potency is the most effective way to maintain overall genetic diversity. Therefore, the first two breeds prioritized should be FY and GX, followed by CH, JC, JS, JX, JA, PT, TW, LC, DY, SS, ZS, JJ, SM, MW, BJ, YX, LW, GY, and JD.

Based on the marginal diversity parameters, we identified the conservation priority of 21 local duck breeds and genetic resources. FY and GX are the first two breeds that should be protected. The conservation priority in this study can provide a reference for breed conservation planning.