Skip to main content
Log in

Sequencing and annotated analysis of the Holstein cow genome

  • Published:
Mammalian Genome Aims and scope Submit manuscript

Abstract

The aim of our study was to create a high-quality Holstein cow genome reference sequence and describe the different types of variations in this genome compared to the reference Hereford breed. We generated one fragment and three mate-paired libraries from genomic DNA. Raw files were mapped and paired to the reference cow (Bos taurus) genome assemblies bosTau6/UMD_3.1. BioScope (v1.3) software was used for mapping and variant analysis. Initial sequencing resulted in 2,842,744,008 of 50-bp reads. Average mapping efficiency was 78.4 % and altogether 2,168,425,497 reads and 98,022,357,422 bp were successfully mapped, resulting in 36.7X coverage. Tertiary analysis found 5,923,230 SNPs in the bovine genome, of which 3,833,249 were heterozygous and 2,089,981 were homozygous variants. Annotation revealed that 4,241,000 of all discovered SNPs were annotated in the dbSNP database and 1,682,230 SNPs were considered as novel. Large indel variations accounted for 48,537,190 bp of the entire genome and there were 138,504 of them. The largest deletion was 18,594 bp and the largest insertion was 13,498 bp. Another group of variants, small indels (n = 458,061), accounted for the total variation of 1,839,872 nucleotides in the genome. Only 92,115 small indels were listed in the dbSNP and therefore 365,946 small indels were novel. Finally, we identified 1,876 inversions in the bovine genome. In conclusion, this is another description of the Holstein cow genome and, similar to previous studies, we found a large amount of novel variations. Better knowledge of these variations could explain significant phenotypic differences (e.g., health, production, reproduction) between different breeds.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Bickhart DM, Hou Y, Schroeder SG, Alkan C, Cardone MF et al (2012) Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res 22:778–790

    Article  PubMed  CAS  Google Scholar 

  • Burt DW (2009) The cattle genome reveals its secrets. J Biol 8:36

    Article  PubMed  Google Scholar 

  • Canavez FC, Luche DD, Stothard P, Leite KR, Sousa-Canavez JM et al (2012) Genome sequence and assembly of Bos indicus. J Hered 103:342–348

    Article  PubMed  CAS  Google Scholar 

  • Cao W, Bover L, Cho M, Wen X, Hanabuchi S et al (2009) Regulation of TLR7/9 responses in plasmacytoid dendritic cells by BST2 and ILT7 receptor interaction. J Exp Med 206:1603–1614

    Article  PubMed  CAS  Google Scholar 

  • Coutinho LL, Matukumalli LK, Sonstegard TS, Van Tassell CP, Gasbarre LC et al (2007) Discovery and profiling of bovine microRNAs from immune-related and embryonic tissues. Physiol Genomics 29:35–43

    PubMed  CAS  Google Scholar 

  • Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA et al (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74

    Article  PubMed  CAS  Google Scholar 

  • Eck SH, Benet-Pages A, Flisikowski K, Meitinger T, Fries R et al (2009) Whole genome sequencing of a single Bos taurus animal for single nucleotide polymorphism discovery. Genome Biol 10:R82

    Article  PubMed  Google Scholar 

  • Ekman A, Niku M, Liljavirta J, Iivanainen A (2009) Bos taurus genome sequence reveals the assortment of immunoglobulin and surrogate light chain genes in domestic cattle. BMC Immunol 10:22

    Article  PubMed  Google Scholar 

  • Elsik CG, Tellam RL, Worley KC, Gibbs RA, Muzny DM et al (2009) The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324:522–528

    Article  PubMed  Google Scholar 

  • Gibbs RA, Taylor JF, Van Tassell CP, Barendse W, Eversole KA et al (2009) Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324:528–532

    Article  PubMed  CAS  Google Scholar 

  • Glazov EA, Kongsuwan K, Assavalapsakul W, Horwood PF, Mitter N et al (2009) Repertoire of bovine miRNA and miRNA-like small regulatory RNAs expressed upon viral infection. PLoS One 4:e6349

    Article  PubMed  Google Scholar 

  • Jenisch S, Koch S, Henseler T, Nair RP, Elder JT et al (1999) Corneodesmosin gene polymorphism demonstrates strong linkage disequilibrium with HLA and association with psoriasis vulgaris. Tissue Antigens 54:439–449

    Article  PubMed  CAS  Google Scholar 

  • Jonca N, Guerrin M, Hadjiolova K, Caubet C, Gallinaro H et al (2002) Corneodesmosin, a component of epidermal corneocyte desmosomes, displays homophilic adhesive properties. J Biol Chem 277:5024–5029

    Article  PubMed  CAS  Google Scholar 

  • Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R et al (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639–1645

    Article  PubMed  CAS  Google Scholar 

  • Levy S, Sutton G, Ng PC, Feuk L, Halpern AL et al (2007) The diploid genome sequence of an individual human. PLoS Biol 5:e254

    Article  PubMed  Google Scholar 

  • Lilleoja R, Sarapik A, Reimann E, Reemann P, Jaakma U et al (2011) Sequencing and annotated analysis of an Estonian human genome. Gene 493:69–76

    Article  PubMed  Google Scholar 

  • Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB et al (2005) Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438:803–819

    Article  PubMed  CAS  Google Scholar 

  • Liu GE, Bickhart DM (2012) Copy number variation in the cattle genome. Funct Integr Genomics 12:609–624

    Article  PubMed  CAS  Google Scholar 

  • Liu MM, Weissman SM, Tang L (2006) Identification of coding single nucleotide polymorphisms and mutations by combination of genome tiling arrays and enrichment/depletion of mismatch cDNAs. Anal Biochem 356:117–124

    Article  PubMed  CAS  Google Scholar 

  • Liu Y, Qin X, Song XZ, Jiang H, Shen Y et al (2009) Bos taurus genome assembly. BMC Genomics 10:180

    Article  PubMed  Google Scholar 

  • Lu HK, Mitchell A, Endoh Y, Hampartzoumian T, Huynh O et al (2012) LILRA2 selectively modulates LPS-mediated cytokine production and inhibits phagocytosis by monocytes. PLoS ONE 7:e33478

    Article  PubMed  CAS  Google Scholar 

  • McKernan KJ, Peckham HE, Costa GL, McLaughlin SF, Fu Y et al (2009) Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res 19:1527–1541

    Article  PubMed  CAS  Google Scholar 

  • Parker SC, Gartner J, Cardenas-Navia I, Wei X, Ozel Abaan H et al (2012) Mutational signatures of de-differentiation in functional non-coding regions of melanoma genomes. PLoS Genet 8:e1002871

    Article  PubMed  CAS  Google Scholar 

  • Schuster SC, Miller W, Ratan A, Tomsho LP, Giardine B et al (2010) Complete Khoisan and Bantu genomes from southern Africa. Nature 463:943–947

    Article  PubMed  CAS  Google Scholar 

  • Stothard P, Choi JW, Basu U, Sumner-Thomson JM, Meng Y et al (2011) Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery. BMC Genomics 12:559

    Article  PubMed  CAS  Google Scholar 

  • Strozzi F, Mazza R, Malinverni R, Williams JL (2009) Annotation of 390 bovine miRNA genes by sequence similarity with other species. Anim Genet 40:125

    Article  PubMed  CAS  Google Scholar 

  • Van Tassell CP, Smith TP, Matukumalli LK, Taylor JF, Schnabel RD et al (2008) SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nat Methods 5:247–252

    Article  PubMed  Google Scholar 

  • Venter JC, Adams MD, Myers EW, Li PW, Mural RJ et al (2001) The sequence of the human genome. Science 291:1304–1351

    Article  PubMed  CAS  Google Scholar 

  • Wang J, Wang W, Li R, Li Y, Tian G et al (2008) The diploid genome sequence of an Asian individual. Nature 456:60–65

    Article  PubMed  CAS  Google Scholar 

  • Zhan B, Fadista J, Thomsen B, Hedegaard J, Panitz F et al (2011) Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping. BMC Genomics 12:557

    Article  PubMed  CAS  Google Scholar 

  • Zhou Y, Chaplin DD (1993) Identification in the HLA class I region of a gene expressed late in keratinocyte differentiation. Proc Natl Acad Sci USA 90:9470–9474

    Article  PubMed  CAS  Google Scholar 

  • Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC et al (2009) A whole-genome assembly of the domestic cow. Bos taurus. Genome Biol 10:R42

    Article  Google Scholar 

Download references

Acknowledgments

Support from the High Performance Computing Centre of the University of Tartu provided by Mr. Lauri Anton and Mr. Martin Loginov is highly appreciated. This study was financially supported by P8001VLVL from Estonian University of Life Sciences, by EU30020 from Enterprise Estonia, by the Centre of Translational Genomics of the University of Tartu, and by a grant from the European Regional Development Fund to the Centre of Translational Medicine, University of Tartu.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sulev Kõks.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kõks, S., Lilleoja, R., Reimann, E. et al. Sequencing and annotated analysis of the Holstein cow genome. Mamm Genome 24, 309–321 (2013). https://doi.org/10.1007/s00335-013-9464-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00335-013-9464-0

Keywords

Navigation