Mammalian Genome

, Volume 23, Issue 1, pp 124–131

Annotation of the domestic dog genome sequence: finding the missing genes

  • Thomas Derrien
  • Amaury Vaysse
  • Catherine André
  • Christophe Hitte
Article

DOI: 10.1007/s00335-011-9372-0

Cite this article as:
Derrien, T., Vaysse, A., André, C. et al. Mamm Genome (2012) 23: 124. doi:10.1007/s00335-011-9372-0

Abstract

There are over 350 genetically distinct breeds of domestic dog that present considerable variation in morphology, physiology, and disease susceptibility. The genome sequence of the domestic dog was assembled and released in 2005, providing an estimated 20,000 protein-coding genes that are a great asset to the scientific community that uses the dog system as a genetic biomedical model and for comparative and evolutionary studies. Although the canine gene set had been predicted using a combination of ab initio methods, homology studies, motif analysis, and similarity-based programs, it still requires a deep annotation of noncoding genes, alternative splicing, pseudogenes, regulatory regions, and gain and loss events. Such analyses could benefit from new sequencing technologies (RNA-Seq) to better exploit the advantages of the canine genetic system in tracking disease genes. Here, we review the catalog of canine protein-coding genes and the search for missing genes, and we propose rationales for an accurate identification of noncoding genes though next-generation sequencing.

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • Thomas Derrien
    • 1
  • Amaury Vaysse
    • 1
  • Catherine André
    • 1
  • Christophe Hitte
    • 1
  1. 1.Institut de Génétique et Développement de Rennes, CNRS-UMR6061Université de Rennes 1RennesFrance