Abstract
The aim of our study was to create a high-quality Holstein cow genome reference sequence and describe the different types of variations in this genome compared to the reference Hereford breed. We generated one fragment and three mate-paired libraries from genomic DNA. Raw files were mapped and paired to the reference cow (Bos taurus) genome assemblies bosTau6/UMD_3.1. BioScope (v1.3) software was used for mapping and variant analysis. Initial sequencing resulted in 2,842,744,008 of 50-bp reads. Average mapping efficiency was 78.4 % and altogether 2,168,425,497 reads and 98,022,357,422 bp were successfully mapped, resulting in 36.7X coverage. Tertiary analysis found 5,923,230 SNPs in the bovine genome, of which 3,833,249 were heterozygous and 2,089,981 were homozygous variants. Annotation revealed that 4,241,000 of all discovered SNPs were annotated in the dbSNP database and 1,682,230 SNPs were considered as novel. Large indel variations accounted for 48,537,190 bp of the entire genome and there were 138,504 of them. The largest deletion was 18,594 bp and the largest insertion was 13,498 bp. Another group of variants, small indels (n = 458,061), accounted for the total variation of 1,839,872 nucleotides in the genome. Only 92,115 small indels were listed in the dbSNP and therefore 365,946 small indels were novel. Finally, we identified 1,876 inversions in the bovine genome. In conclusion, this is another description of the Holstein cow genome and, similar to previous studies, we found a large amount of novel variations. Better knowledge of these variations could explain significant phenotypic differences (e.g., health, production, reproduction) between different breeds.
Similar content being viewed by others
References
Bickhart DM, Hou Y, Schroeder SG, Alkan C, Cardone MF et al (2012) Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res 22:778–790
Burt DW (2009) The cattle genome reveals its secrets. J Biol 8:36
Canavez FC, Luche DD, Stothard P, Leite KR, Sousa-Canavez JM et al (2012) Genome sequence and assembly of Bos indicus. J Hered 103:342–348
Cao W, Bover L, Cho M, Wen X, Hanabuchi S et al (2009) Regulation of TLR7/9 responses in plasmacytoid dendritic cells by BST2 and ILT7 receptor interaction. J Exp Med 206:1603–1614
Coutinho LL, Matukumalli LK, Sonstegard TS, Van Tassell CP, Gasbarre LC et al (2007) Discovery and profiling of bovine microRNAs from immune-related and embryonic tissues. Physiol Genomics 29:35–43
Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA et al (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74
Eck SH, Benet-Pages A, Flisikowski K, Meitinger T, Fries R et al (2009) Whole genome sequencing of a single Bos taurus animal for single nucleotide polymorphism discovery. Genome Biol 10:R82
Ekman A, Niku M, Liljavirta J, Iivanainen A (2009) Bos taurus genome sequence reveals the assortment of immunoglobulin and surrogate light chain genes in domestic cattle. BMC Immunol 10:22
Elsik CG, Tellam RL, Worley KC, Gibbs RA, Muzny DM et al (2009) The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324:522–528
Gibbs RA, Taylor JF, Van Tassell CP, Barendse W, Eversole KA et al (2009) Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324:528–532
Glazov EA, Kongsuwan K, Assavalapsakul W, Horwood PF, Mitter N et al (2009) Repertoire of bovine miRNA and miRNA-like small regulatory RNAs expressed upon viral infection. PLoS One 4:e6349
Jenisch S, Koch S, Henseler T, Nair RP, Elder JT et al (1999) Corneodesmosin gene polymorphism demonstrates strong linkage disequilibrium with HLA and association with psoriasis vulgaris. Tissue Antigens 54:439–449
Jonca N, Guerrin M, Hadjiolova K, Caubet C, Gallinaro H et al (2002) Corneodesmosin, a component of epidermal corneocyte desmosomes, displays homophilic adhesive properties. J Biol Chem 277:5024–5029
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R et al (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639–1645
Levy S, Sutton G, Ng PC, Feuk L, Halpern AL et al (2007) The diploid genome sequence of an individual human. PLoS Biol 5:e254
Lilleoja R, Sarapik A, Reimann E, Reemann P, Jaakma U et al (2011) Sequencing and annotated analysis of an Estonian human genome. Gene 493:69–76
Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB et al (2005) Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438:803–819
Liu GE, Bickhart DM (2012) Copy number variation in the cattle genome. Funct Integr Genomics 12:609–624
Liu MM, Weissman SM, Tang L (2006) Identification of coding single nucleotide polymorphisms and mutations by combination of genome tiling arrays and enrichment/depletion of mismatch cDNAs. Anal Biochem 356:117–124
Liu Y, Qin X, Song XZ, Jiang H, Shen Y et al (2009) Bos taurus genome assembly. BMC Genomics 10:180
Lu HK, Mitchell A, Endoh Y, Hampartzoumian T, Huynh O et al (2012) LILRA2 selectively modulates LPS-mediated cytokine production and inhibits phagocytosis by monocytes. PLoS ONE 7:e33478
McKernan KJ, Peckham HE, Costa GL, McLaughlin SF, Fu Y et al (2009) Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res 19:1527–1541
Parker SC, Gartner J, Cardenas-Navia I, Wei X, Ozel Abaan H et al (2012) Mutational signatures of de-differentiation in functional non-coding regions of melanoma genomes. PLoS Genet 8:e1002871
Schuster SC, Miller W, Ratan A, Tomsho LP, Giardine B et al (2010) Complete Khoisan and Bantu genomes from southern Africa. Nature 463:943–947
Stothard P, Choi JW, Basu U, Sumner-Thomson JM, Meng Y et al (2011) Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery. BMC Genomics 12:559
Strozzi F, Mazza R, Malinverni R, Williams JL (2009) Annotation of 390 bovine miRNA genes by sequence similarity with other species. Anim Genet 40:125
Van Tassell CP, Smith TP, Matukumalli LK, Taylor JF, Schnabel RD et al (2008) SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nat Methods 5:247–252
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ et al (2001) The sequence of the human genome. Science 291:1304–1351
Wang J, Wang W, Li R, Li Y, Tian G et al (2008) The diploid genome sequence of an Asian individual. Nature 456:60–65
Zhan B, Fadista J, Thomsen B, Hedegaard J, Panitz F et al (2011) Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping. BMC Genomics 12:557
Zhou Y, Chaplin DD (1993) Identification in the HLA class I region of a gene expressed late in keratinocyte differentiation. Proc Natl Acad Sci USA 90:9470–9474
Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC et al (2009) A whole-genome assembly of the domestic cow. Bos taurus. Genome Biol 10:R42
Acknowledgments
Support from the High Performance Computing Centre of the University of Tartu provided by Mr. Lauri Anton and Mr. Martin Loginov is highly appreciated. This study was financially supported by P8001VLVL from Estonian University of Life Sciences, by EU30020 from Enterprise Estonia, by the Centre of Translational Genomics of the University of Tartu, and by a grant from the European Regional Development Fund to the Centre of Translational Medicine, University of Tartu.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kõks, S., Lilleoja, R., Reimann, E. et al. Sequencing and annotated analysis of the Holstein cow genome. Mamm Genome 24, 309–321 (2013). https://doi.org/10.1007/s00335-013-9464-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00335-013-9464-0