The genome of the Black Bengal goat (Capra hircus)

Siddiki, Amam Zonaed; Baten, Abdul; Billah, Masum; Alam, Mohammad Atique Ul; Shawrob, Kazi Shefaul Mulk; Saha, Sourav; Chowdhury, Muntaha; Rahman, Atif Hasan; Stear, Michael; Miah, Gous; Kumkum, Mahadia; Islam, Mohammad Sirazul; Hossain, Mohammad Alamgir; Mollah, A. K. M. Moniruzzaman; Khan, Md. Kabirul Islam

doi:10.1186/s13104-019-4400-3

The genome of the Black Bengal goat (Capra hircus)

Data note
Open access
Published: 27 June 2019

Volume 12, article number 362, (2019)
Cite this article

Download PDF

You have full access to this open access article

BMC Research Notes Aims and scope Submit manuscript

The genome of the Black Bengal goat (Capra hircus)

Download PDF

Amam Zonaed Siddiki ORCID: orcid.org/0000-0003-2990-4022¹,
Abdul Baten^2,7,
Masum Billah¹,
Mohammad Atique Ul Alam¹,
Kazi Shefaul Mulk Shawrob¹,
Sourav Saha¹,
Muntaha Chowdhury^1,6,
Atif Hasan Rahman³,
Michael Stear⁴,
Gous Miah⁵,
Mahadia Kumkum¹,
Mohammad Sirazul Islam¹,
Mohammad Alamgir Hossain¹,
A. K. M. Moniruzzaman Mollah⁶ &
…
Md. Kabirul Islam Khan⁵

4807 Accesses
9 Citations
5 Altmetric
Explore all metrics

Abstract

Objectives

Black Bengal goat (Capra hircus), a member of the Bovidae family with the unique traits of high prolificacy, skin quality and low demand for food is the most socioeconomically significant goat breed in Bangladesh. Furthermore, the aptitude of adaptation and disease resistance capacity of it is highly notable which makes its whole genome information an area of research interest.

Data description

The genomic DNA of a local (Chattogram, Bangladesh) healthy male Black Bengal goat (Capra hircus) was extracted and then sequenced. Sequencing was completed using the Illumina HiSeq 2500 sequencing platform and the draft assembly was generated using the “ARS1” genome as the reference. MAKER gene annotation pipeline was utilized to annotate 26,458 gene models. Genome completeness was assessed using BUSCO (Benchmarking Universal Single-Copy Orthologs) which showed 82.5% completeness of the assembled genome.

Objective

Black Bengal goat (BBG) belongs to the Bovidae family and found throughout Bangladesh, West Bengal, Bihar, and Orissa regions of northeastern India. It is estimated that more than 90% of the goat population in Bangladesh comprised the Black Bengal, the remainder being Jamunapari and their crosses [1]. Higher prolificacy, fertility, resistance against common diseases, adaptability to the adverse environmental condition, early maturity, seasonality and superiority in the litter size are some of the outstanding features of BBG. Besides, it produces excellent quality flavored, tender and delicious meat with low intramuscular fat and fine skin of extraordinary quality for which there is tremendous demand all over the world [1, 2]. Moreover, it plays a vital role in the economy of Bangladesh by contributing 1.66% of the GDP (Gross Domestic Product) (DLS 2017).

Fortunately, the market demand of Black Bengal goat is emerging. This gives breeders of original/rare breeds an opportunity to expand the stock and preserve its genetic diversity. One of the primary goals in managing goat populations is to maintain high-level genetic diversity and low-level inbreeding. To estimate the future breeding potential of a goat breed, it is necessary to characterize the genetic structure and evaluate the level of genetic diversity within the breed. Moreover, a long term genetic approach can be used to improve the spectacular economic characteristics of BBG [3].

Therefore, the genetic characterization of the entire BBG genome is essential in characterizing its economic traits as well as adaptive capability. With the availability of whole genome sequence, the targeted areas for genetic improvements are now: goat prolificacy, growth rate, meat quality, skin quality, disease resistance, and survivability. A complete and accurate reference to the goat genome is an essential component of advanced genomic selection of product characteristics.

Data description

At first, A 3 years old male healthy Black Bengal goat (BBG) without known genetic diseases was selected for blood collection. Genomic DNA from each animal was isolated from the EDTA-blood, using the Addprep genomic DNA extraction kit (South Korea) (detailed methodology in Data file 1—Table 1). The quality and quantity of the DNA were assessed by the Qubit fluorometer (Invitrogen, Carlsbad, CA, USA) and Infinite F200 microplate reader (TECAN), according to the manufacturer’s instruction. The status of the DNA was visually inspected by 0.8% agarose gel electrophoresis. Purified genomic DNA was sent for library preparation (detailed methodology in Data file 1—Table 1) and whole genome sequencing (WGS) at BGI Group (Shenzhen, Guangdong, China). A total of 40 Gb (Gigabase pair) (14-fold) of subread bases with a read length of 150 bp were generated using next-generation sequencing (NGS) technology on an Illumina HiSeq 2500 platform (detailed methodology in Data file 1—Table 1).

Table 1 Overview of data files/data sets

Full size table

After sequencing, quality of the raw sequencing reads were inspected using FastQC version 0.11.8 [4]. Reads were quality controlled including removing adaptor sequences, contamination and low-quality reads from raw reads using Trimmomatic V0.32 [5]. A total of 247,325,362 clean reads were included in the assembly. Subsequently, for de novo assembly we used ABySS v. 2.1.5 assembler [6], which generated 32,94,295 contigs (minimum contig size 200 bp). Next, ABACAS v.1.3.1 pipeline was used with the reference genome ARS1 (GCA_001704415.1) [7] to arranging, ordering, and orientation of the assembled genome [8]. The genome assembly data has been deposited in the NCBI GenBank under the Accession number GCA_001704415.1 (Data file 2—Table 1). The final assembled genome size of BBG is 3.04 Gb with 724.80 Mb (Megabase pair) gaps and GC content of 41.77%. Completeness of the genome was assessed with benchmarking universal single-copy orthologs (BUSCO) version 3.0.2 [9] which showed 82.5% completeness.

Genes were annotated using Maker version 3.0 pipeline [10] which identified 26,458 gene models. RepeatMasker V 4.0.9 [11] using the latest version of the repbase database [12] identified 31.85% repeat elements in the genome. Finally, InterProScan V 5.33–72.0 [13] was used to identify the gene ontology (GO) terms, which identified a total of 12,589 GO terms and 8173 genes have at least 1 associated GO term. The whole genome sequence data has been submitted in the NCBI GenBank under the Accession numbers SMSF01000001–SMSF01003972 (Data file 3—Table 1).

Limitations

The number of unassembled regions in the genome is 3943 and the total number of bases placed in this gap is 724,808,570 bp.

Availability of data materials

The genome sequence information has been accessible at DDBJ/ENA/GenBank under the Accession Numbers SMSF01000001–SMSF01003972 and the assembled genome at GCA_001704415.1. The version reported in this paper is the first version, SMSF00000000.1.

Abbreviations

BBG:: Black Bengal goat
GDP:: gross domestic production
EDTA:: ethylene diamine tetra-acetic acid
DNA:: deoxyribonucleic acid
WGS:: whole genome sequencing
BUSCO:: benchmarking universal single-copy orthologs
ABACAS:: algorithm-based automatic contiguation of assembled sequences
Gb:: giga base pair
Mb:: megabase pair
Kb:: kilobase pair
bp:: base pair
GO:: gene ontology
gDNA:: genomic DNA
PCR:: polymerase chain reaction

References

Husain SS. A study on the productive performance and genetic potentials of Black Bengal goats. A Ph.D. Thesis, Bangladesh Agricultural University, Mymensingh. 1993.
Islam M, Nahar TN, Haq S. Prospect of goat production in Bangladesh. Asian Livestock (FAO). 1991.
Faruque S, Chowdhury SA, Siddiquee NU, Afroz MA. Performance and genetic parameters of economically important traits of Black Bengal goat. J Bangladesh Agric Univ. 2010;8(1):67–78.
Article Google Scholar
FastQC program. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/. Accessed 12 Jan 2017.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Article CAS Google Scholar
Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. ABySS: a parallel assembler for short read sequence data. Genome Res. 2009;19(6):1117–23.
Article CAS Google Scholar
Bickhart DM, Rosen BD, Koren S, Sayre BL, Hastie AR, Chan S, Lee J, Lam ET, Liachko I, Sullivan ST, Burton JN. Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome. Nat Genet. 2017;49(4):643.
Article CAS Google Scholar
Assefa S, Keane TM, Otto TD, Newbold C, Berriman M. ABACAS: algorithm-based automatic contiguation of assembled sequences. Bioinformatics. 2009;25(15):1968–9.
Article CAS Google Scholar
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2.
Article Google Scholar
Cantarel BL, Korf I, Robb SM, Parra G, Ross E, Moore B, Holt C, Alvarado AS, Yandell M. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2008;18(1):188–96.
Article CAS Google Scholar
Smit A, Hubley R, Green P. RepeatMasker open-4.0. 2013–2015. Seattle, WA, USA: Institute for Systems Biology; 2015. https://www.repeatmasker.org/faq.htmlb .
Bao W, Kojima KK, Kohany O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob DNA. 2015;6(1):11.
Article Google Scholar
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. InterProScan: protein domains identifier. Nucleic Acids Res. 2005;33(suppl_2):W116–W12020.
Article CAS Google Scholar

Download references

Acknowledgments

Authors concede the support of BGI Group for the sequencing service and Southern Cross University, Lismore, Australia for the computational support.

Funding

This study is supported by funding from Chattogram Veterinary and Animal Sciences University (CVASU). CVASU is the only veterinary university of Bangladesh and Black Bengal goat is important livestock. The funding authority initiated the project to construct the genome and identify the specific trait-related genes.

The funding authority monitored the whole study.

Author information

Authors and Affiliations

Genomics Research Group, Department of Pathology and Parasitology, Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University (CVASU), Chattogram, 4225, Bangladesh
Amam Zonaed Siddiki, Masum Billah, Mohammad Atique Ul Alam, Kazi Shefaul Mulk Shawrob, Sourav Saha, Muntaha Chowdhury, Mahadia Kumkum, Mohammad Sirazul Islam & Mohammad Alamgir Hossain
AgResearch, Private Bag 11008, Palmerston North, 4410, New Zealand
Abdul Baten
Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka, 1000, Bangladesh
Atif Hasan Rahman
Department of Animal, Plant and Soil Sciences, School of Life Sciences, AgriBio, La Trobe University, Bundoora, VIC, 3083, Australia
Michael Stear
Department of Genetics and Animal Breeding, Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University (CVASU), Chattogram, 4225, Bangladesh
Gous Miah & Md. Kabirul Islam Khan
Department of Biological Sciences, Asian University for Women (AUW), Chattogram, 4000, Bangladesh
Muntaha Chowdhury & A. K. M. Moniruzzaman Mollah
Southern Cross Plant Science, Southern Cross University, Lismore, NSW, 2480, Australia
Abdul Baten

Authors

Amam Zonaed Siddiki
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Baten
View author publications
You can also search for this author in PubMed Google Scholar
Masum Billah
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Atique Ul Alam
View author publications
You can also search for this author in PubMed Google Scholar
Kazi Shefaul Mulk Shawrob
View author publications
You can also search for this author in PubMed Google Scholar
Sourav Saha
View author publications
You can also search for this author in PubMed Google Scholar
Muntaha Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar
Atif Hasan Rahman
View author publications
You can also search for this author in PubMed Google Scholar
Michael Stear
View author publications
You can also search for this author in PubMed Google Scholar
Gous Miah
View author publications
You can also search for this author in PubMed Google Scholar
Mahadia Kumkum
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Sirazul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Alamgir Hossain
View author publications
You can also search for this author in PubMed Google Scholar
A. K. M. Moniruzzaman Mollah
View author publications
You can also search for this author in PubMed Google Scholar
Md. Kabirul Islam Khan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AZS and MKIK conceived the experiment. AB, MB, KSMS, SS performed the data analysis. MAA, MC, MK, MSI, and GM drafted the manuscript. AZS supervised the project and revised the manuscript. MAH, ARH, MS, AMM N reviewed the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Amam Zonaed Siddiki.

Ethics declarations

Ethics approval and consent to participate

The experiments discussed in this investigation were approved by the Institute Review Committee of Chattogram Veterinary and Animal Sciences University.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Siddiki, A.Z., Baten, A., Billah, M. et al. The genome of the Black Bengal goat (Capra hircus). BMC Res Notes 12, 362 (2019). https://doi.org/10.1186/s13104-019-4400-3

Download citation

Received: 17 April 2019
Accepted: 22 June 2019
Published: 27 June 2019
DOI: https://doi.org/10.1186/s13104-019-4400-3

The genome of the Black Bengal goat (Capra hircus)

Abstract

Objectives

Data description

Objective

Data description

Limitations

Availability of data materials

Abbreviations

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The genome of the Black Bengal goat (Capra hircus)

Abstract

Objectives

Data description

Objective

Data description

Limitations

Availability of data materials

Abbreviations

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation