Background

India is known as a mega biodiversity nation for its abundant wild flora and fauna, but the country is not untouched by the problem of biodiversity loss and the need to implement conservation efforts to protect its vulnerable biodiversity. Although Haryana, a state in the northern part of India, is deficient in natural forests, but it has a rich biodiversity of some wild animals, especially Blackbuck antelopes (Antilope cervicapra). The Blackbuck is native to the Indian subcontinent, and it is protected as a scheduled-I animal under the 1972 Wildlife Protection Act, enacted by the Indian government (Kumar et al. 2017). Once, Blackbuck antelopes were abundant and ranged throughout Haryana, but now their population has been marginalized and confined to the southwestern part of Haryana, and are nearly to extinct in the rest of the state due to increases in wildlife poaching, infrastructure development, and biotic interference. The alarming rate of the population depletion of the Blackbuck has brought it to the verge of extinction, and this is a matter of concern for the state.

Forensic approaches have been adopted to identify wildlife animal species in need of conservation and to prevent their fraudulent misidentification, especially from samples of animal residues and the products made from processing their parts (Iyengar 2014; Tobe & Linacre 2010; Hsieh et al. 2011; Ogden et al. 2009). Advances in molecular science have launched a new era in the taxonomy of species identification, even with a bit of biological sample which, sometimes, is not possible to identify using conventional morphological methods. DNA barcoding is a molecular taxonomic idea initiated by Hebert et al. (2003) to transform conventional taxonomy by digitizing the identity of a given species (Hebert et al. 2003). Thus, a DNA barcode is a standardized approach used to identify species from partial sequences of DNA (Hebert and Gregory, 2005; Kumar et al. 2012). This technique uses the mitochondrial region of cytochrome C oxidase subunit I (COI) to provide a strong species-level resolution for diverse wildlife (Linacre & Tobe 2011). The DNA barcode region contains highly conserved sites for primer binding, and it can be used to show interspecific variations (Ferri et al. 2009; Eaton et al. 2010). This method has been attracting international attention due to its significant role in advancing the taxonomy of life forms and their forensic implications.

The successful amplification for the DNA barcode region requires cross-species primers, which are the universal primers designed to amplify homologous sequences in other closely related species (Housley et al. 2006; Ghosh et al., 2013; Lucio et al. 2011). In the present research study, we selected the universal primers of the COI gene by checking their feasibility and specificity to develop a species-specific DNA sequence as a barcode sequence for the Blackbuck species. The first specific objective of the present study was to develop and enrich the DNA barcode data of the Blackbuck from Haryana, India, found in the Barcode of Life Database (BOLD) and National Center for Biotechnology Information (NCBI) database. Secondly, we evaluated the ability of the COI barcoding gene to identify the Blackbuck species from its sympatric species and to elucidate its species diversity.

Material and methods

Sample collection

Around 25 blood and tissue samples of the Blackbuck animal were collected from different part from Haryana, India. Blood samples were collected during a veterinary checkup and tissue samples from Pathology Department, LUVAS University, Hisar, during the postmortem analysis, with the permission of the Wildlife Department of Haryana, in the presence of wildlife staff and veterinary doctors. Blood samples were taken and preserved in EDTA tubes and finally stored at 4 °C and tissue samples were preserved at -80 °C. Permission for the sample collection for the scientific research is granted by the National (MoEF&CC) and State government body vide letter no 1–56/2016 and WL-87/11–04-16 respectively.

Primer selection

A universal set of Primer was selected for this study from the primer database available in BOLD system to target the barcode COI sequence of the Antilope cervicapra. In details, of primer were VF1 (5′-ttctcaaccaaccacaaagacattgg-3′) and VR1 (5′-tagacttctgggtggccaaagaatca-3′).

DNA extraction, PCR amplification, and sequencing

A good quality of 22 samples of DNA (50 ng) were extracted using DNeasy blood and tissue Kit (Qiagen, Valencia, CA, USA) following manufacturer’s instruction. All PCR reaction for the amplifications of the COI region were carried out using 1X PCR Buffer, 2.5 mM MgCl2, 0.4 mM of each of dNTPs, 10 pM each of forward and reverse primers and one units of Taq polymerase in a 50 μl final reaction volume. The amplification were carried out as follow: initial denaturation at 95 °C for 5 min, followed by 35 cycles at 95 °C for 45 s, 54 °C for 45 s and 72 °C for 1 min, and a final elongation for 10 min at 72 °C in a Takara PCR System 9700. The Amplification results of PCR products were checked by electrophoresis in 1.2% agarose gel stained with ethidium bromide and visualized in Gel-doc system.

The successful amplified PCR products were purified using QIAquick PCR Purification Kit Qiagen. The purified PCR products were sequenced (Eurofins Genomics India Pvt. Ltd) on both strands. The obtained chromatograms were analyzed and manually edited in the Chromas 2.4 software (Technelysium Pty Ltd., South Brisbane, Australia). The terminal ends of the sequence were trimmed to obtain the consensus sequences of 637 bp. Polymorphic bases were checked using the original chromatograms in MEGA v7 (Kumar et al. 2016).

Barcode tree analysis

The DNA barcoding primers successfully amplified the desired sequence of the COI region of Blackbuck yielding an approximately 640–720 bp size product. The results were blasted for similarity search against non-redundant sequence database showing high degree of similarity (99–100%) with the corresponding sequences of Blackbuck both in BOLD and GenBank database, confirmed the successful amplification and could be used as a DNA barcode for species identification (Johnson et al. 2008; Dalton & Kotze 2011).

A dataset of 43 sequences were used for molecular analysis and to construct the Neighbour-Joining phylogenetic tree. The data comprised of generated sequences of the Blackbuck (n = 22) and with additional COI sequences of the related species (n = 21) showing maximum homology were used and downloaded from NCBI-GenBank for wide coverage of inter and intra-specific nucleotide diversity (Table 1). From the developed sequences of this study, those sequences showing intraspecific variation were accepted by both NCBI-GenBank and BOLD and are published in their database. All the 43 sequences representing 13 species from 4 genera of Antilopinae subfamily, were aligned in MUSSCLE program in MEGA v7 software to yield an equal length of 636 bp with no gaps and no indels. MEGA7 software was used to calculate nucleotide diversity and to construct a phylogenetic trees based on Neighbour-joining method incorporated with Kimura 2 parameter (K2P) model (Kumar et al. 2016). Bootstrap values (1000 replicates) for the internal topology were assessed using Giraffa camelopardalis (NCBI accession no. AP003424.1) as an out-group animal to root the phylogenetic tree. In addition, the species identification were also performed in the BOLD search engine and phylogenetic cluster was assessed with the closest matches with using BINs (Barcode Index Number system) assigned by BOLD. BIN is an online platform of the BOLD system that cluster barcode sequences algorithmically to creating a web page for each cluster.

Table 1 Showing partial COI sequence of mtDNA used in this study from NCBI-GenBank

Database submission

The generated barcode sequences were submitted to BOLD and the NCBI-GenBank database under the project: “DNA Barcoding of Blackbuck, from Haryana, India”. The BOLD-IDs and the NCBI Accession No. of the submitted sequence are shown in Table 2. The Supporting Information files of the submitted sequences are available for each record in the Public Data Portal in the BOLD system, with all relevant information, i.e., specimen information, collection date, collector name, specimen images, primer sequences, global positioning system (GPS) coordinates, and trace files for all the specimen collection localities.

Table 2 Showing submitted list of DNA Barcoding Sequences of Antilope cervicapra in BOLD and NCBI BOLD database with BOLD IDs and Accession No respectively

Utility of the blackbuck DNA barcodes

To solve crimes related to Blackbuck wildlife poaching around the world, the utility of these DNA barcodes will play an important role in the wildlife forensics because they provide a reliable and cost-efficient tool for species identification. Although identification through BOLD and the NCBI-GenBank shows 99–100% matching with Antilope cervicapra sequences, the low number of sequence availability reflects a serious limitation in the database-based identity. Prior to this study, only four or five COI sequences were available in the NCBI-GenBank, and only one submission was available in the Public Data Portal of the COI record in the BOLD system. After submitting the sequences to both databases, a comparatively rich database is now available for Blackbuck, showing its high potential for use in wildlife forensic investigation; this will further help conserve this species.

Results

The Antilope cervicapra DNA sequences provide 100% support for the clade in the phylogenetic tree, showing a strong species distinction with other species related to Antilopinae. This implies that four genera were clustered in the appropriate branch (Fig. 1). The NJ-phylogenetic tree were strongly supported the genera and subfamily relationship with high bootstrap values: 100% for genus Antilope (Blackbuck or Antilope cervicapra), and genus Eudorcas (Eudorcus rufifrons), 99% and 98% for genus Nanger (Nanger dama, Nager granti, Nanger soemmerringii) and genus Gazella (Gazella subgutturosa, Eudorcas rufifrons, Gazella dorcas, Gazella leptoceros, Gazella cuvieri, Gazella spekei, Gazella erlangeri) respectively. The phylogenetic tree was also generated through BOLD, showing a 99.37% match with the Antilope cervicapra sequence. However, only one COI sequence was available for identification, thus demonstrating the limitation of this study. After the sequences were submitted, a relatively rich database is now available for comparison. This demonstrates the utility of this study (Fig. 2).

Fig. 1
figure 1

NJ-Phylogenetic tree (K2P) inferred from 636 base-pair 43 partial COI sequences for 13 species belonging to 4 genus of Antilopinae sub-family. Bootstrap values are shown above the species nodes and for higher taxonomic node. Giraffa camelopardalis was used as an out-group animal to root the phylogenetic tree

Fig. 2
figure 2

Neighbour-joining phylogenetic tree (K2P distance) based on COI barcode generated in BOLD database (BOLD, www.barcodinglife.org). Highlighted yellow color shows the submitted sequences of Blackbuck from the present study in BOLD

The generated sequences of the study animal (n = 22) and the downloaded sequences of the related species (n = 21) from the GenBank were aligned to yield an equal length of 636 bp without gaps and indels. The results revealed that the 637 bp region within Antilope cervicapra (n = 24) had 11variable sites, 625 invariable or conserved sites, 6 Singleton variable sites and 5 parsimony informative sites. Whereas interspecific (n = 43) results showing that 636 bp COI sequences had 141 polymorphic sites, 495 monomorphic sites, 23 Singleton variable sites and 118 positions parsimony informative sites. The average of nucleotide composition in the 636 bp partial COI region of 13 Antilopinae species representing 4 genera were: T (30.4%) C (26.3%) A (27.1%), and G (16.2%). This is the consent with the fact that the nucleotide G frequency is lowest amongst all the four nucleotide bases in most of the mammalian species (Lin et al., 1999). Table 3 showing the intraspecific and interspecific genetic distance matrix. Low intraspecific nucleotide diversity was observed within subfamily Antilopinae with an average of 0.47% (<1%).

Table 3 Pairwise genetic distance using the K2P distance model between 13 Antilopinae species (below diagonal) and mean genetic distance within species represented with two or more specimens (bold numbers on diagonal)

The mean Intra-specific distance was observed in Antilope cervicapra species (0.3%) and in Gazella_subgutturosa (0.3%). The interspecific K2P distance was observed with an average of 6.6% ranged from 0.3% to 12.6% (Table 3) The lowest interspecies divergence was observed among Gazella_leptoceros and Gazella_cuvieri (0.3%) and while the highest interspecific divergence observed in between Antiolpe_cervicapra and Gazella_erlangeri (12.6%) followed by Antilope_cervicapra and Gazella_spekei (11.8%).

Discussion and conclusion

This study showed strong ability of the COI barcoding gene to discriminate the Blackbuck (Antilope cervicapra) with their sympatric species. This study showed that, the intraspecific diversity among 4 genera of sub-family Antilopinae was observed with an average < 1% and specifically for the studied genus Antilope it was observed 0.3%. Such a low level intraspecific divergence observed in this study conforms to among other previous COI based studies such as in Tanzanian antelopes (0.2%)(Bitanyi et al. 2011), Suidae (0.32%)(Ahanthem et al. 2013), 0.62% in Bovidae species ((Cai et al. 2011) 0.60% in Neotropical bats (Clare et al. 2007), 1.1% in primates (Lorenz et al. 2005), in Birds (<1%) (Hebert et al.; 2004), 0.2–0.3% in fish (Lakra et al. 2009, Hubert et al. 2008), and 1% in small mammal communities (Borisenko et al. 2008).

The interspecific diversity observed in this study ranged 0.3% to 12.7%, correspond well with other COI based previous research such as in Tanzanian Antelopes (6.3% to 22%) (Bitanyi et al. 2011) Suidae family (9.9% to 14%) (Ahanthem et al. 2013), Bovidae species (mean 12.44%); Cai et al. 2011) Neotropical bats (mean 7.8%) (Clare et al. 2007), Fish (2.5%–5.9%) Lakra et al. 2009; Hubert et al. 2008).

The phylogenetic tree clades gave the same results as those obtained from measuring the intra- and interspecific divergences with strong bootstrap values. The same species sequences were clade together with low divergence (100% bootstrap supports), and closely related species from the same sub-family and genus were clearly separated in the study. The phylogenetic analysis of this study supports other cladistics studies conducted on antelopes and Bovidae using barcoding COI genes (Bitanyi et al. 2011; Cai et al. 2011).

In summary, this present study developed a species-specific barcoding (COI) sequence for Indian Blackbuck (Antilope cervicapra) from Haryana, India. It demonstrated a high potential to identify Blackbuck antelopes from their sympatric species in wildlife-related (poaching) crimes and in conservation activities. Published barcode sequences in both NCBI and BOLD databases serve as a wildlife forensic tool that can be used to solve Blackbuck-related crimes. The study’s results reaffirmed the significance of the DNA barcode gene in wildlife forensics; the findings can be used to discriminate species within the Bovidae family as well as other animals that have been recognized through prior taxonomic work.