Abstract
The Mouse Genomes Project was initiated in 2009 with the goal of using next-generation sequencing technologies to catalogue molecular variation in the common laboratory mouse strains, and a selected set of wild-derived inbred strains. The initial sequencing and survey of sequence variation in 17 inbred strains was completed in 2011 and included comprehensive catalogue of single nucleotide polymorphisms, short insertion/deletions, larger structural variants including their fine scale architecture and landscape of transposable element variation, and genomic sites subject to post-transcriptional alteration of RNA. From this beginning, the resource has expanded significantly to include 36 fully sequenced inbred laboratory mouse strains, a refined and updated data processing pipeline, and new variation querying and data visualisation tools which are available on the project’s website (http://www.sanger.ac.uk/resources/mouse/genomes/). The focus of the project is now the completion of de novo assembled chromosome sequences and strain-specific gene structures for the core strains. We discuss how the assembled chromosomes will power comparative analysis, data access tools and future directions of mouse genetics.
Similar content being viewed by others
References
Bahn JH, Lee J-H, Li G, Greer C, Peng G, Xiao X (2012) Accurate identification of A-to-I RNA editing in human by transcriptome sequencing. Genome Res 22:142–150
Bass BL, Nishikura K, Keller W, Seeburg PH, Emeson RB, O’Connell MA, Samuel CE, Herbert A (1997) A standardized nomenclature for adenosine deaminases that act on RNA. RNA 3:947–949
Blanc V, Park E, Schaefer S, Miller M, Lin Y, Kennedy S, Billing AM, Hamidane HB, Graumann J, Mortazavi A et al (2014) Genome-wide identification and functional analysis of Apobec-1-mediated C-to-U RNA editing in mouse small intestine and liver. Genome Biol 15:R79
Boyden ED, Dietrich WF (2006) Nalp1b controls mouse macrophage susceptibility to anthrax lethal toxin. Nat Genet 38:240–244
Chinwalla AT, Cook LL, Delehaunty KD, Fewell GA, Fulton LA, Fulton RS, Graves TA, Hillier LW, Mardis ER, McPherson JD et al (2002) Initial sequencing and comparative analysis of the mouse genome. Nature 420:520–562
Church DM, Goodstadt L, Hillier LW, Zody MC, Goldstein S, She X, Bult CJ, Agarwala R, Cherry JL, DiCuccio M et al (2009) Lineage-specific biology revealed by a finished genome assembly of the mouse. PLoS Biol 7:e1000112
Church DM, Schneider VA, Graves T, Auger K, Cunningham F, Bouk N, Chen H-C, Agarwala R, McLaren WM, Ritchie GRS et al (2011) Modernizing reference genome assemblies. PLoS Biol 9:e1001091
Danecek P, Nellåker C, McIntyre RE, Buendia-Buendia JE, Bumpstead S, Ponting CP, Flint J, Durbin R, Keane TM, Adams DJ (2012) High levels of RNA-editing site conservation amongst 15 laboratory mouse strains. Genome Biol 13:r26
Down TA, Piipari M, Hubbard TJP (2011) Dalliance: interactive genome viewing on the web. Bioinformatics 27:889–890
Eilbeck K, Lewis SE, Mungall CJ, Yandell M, Stein L, Durbin R, Ashburner M (2005) The sequence ontology: a tool for the unification of genome annotations. Genome Biol 6:R44
Frazer KA, Eskin E, Kang HM, Bogue MA, Hinds DA, Beilharz EJ, Gupta RV, Montgomery J, Morenzoni MM, Nilsen GB et al (2007) A sequence-based variation map of 8.27 million SNPs in inbred mouse strains. Nature 448:1050–1053
Gu T, Buaas FW, Simons AK, Ackert-Bicknell CL, Braun RE, Hibbs MA (2012) Canonical A-to-I and C-to-U RNA editing is enriched at 3’UTRs and microRNA target sites in multiple mouse tissues. PLoS One 7:e33720
Haldane JBS, Sprunt AD, Haldane NM (1915) Reduplication in mice (preliminary communication). J. Genet. 5:133–135
Harrow J, Denoeud F, Frankish A, Reymond A, Chen C-K, Chrast J, Lagarde J, Gilbert JG, Storey R, Swarbreck D et al (2006) GENCODE: producing a reference annotation for ENCODE. Genome Biol 7:S4
Keane TM, Goodstadt L, Danecek P, White MA, Wong K, Yalcin B, Heger A, Agam A, Slater G, Goodson M et al (2011) Mouse genomic variation and its effect on phenotypes and gene regulation. Nature 477:289–294
Kirby A, Kang HM, Wade CM, Cotsapas C, Kostem E, Han B, Furlotte N, Kang EY, Rivas M, Bogue MA et al (2010) Fine mapping in 94 inbred mouse strains using a high-density haplotype resource. Genetics 185:1081–1095
Klein J, Figueroa F (1981) Polymorphism of the mouse H-2 loci. Immunol Rev 60:23–57
Lagarrigue S, Hormozdiari F, Martin LJ, Lecerf F, Hasin Y, Rau C, Hagopian R, Xiao Y, Yan J, Drake TA et al (2013) Limited RNA editing in exons of mouse liver and adipose. Genetics 193:1107–1115
Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25
Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv13033997 Q-Bio
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25:1754–1760
Li H, Ruan J, Durbin R (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res 18:1851–1858
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079
Liao P, Yong TF, Liang MC, Yue DT, Soong TW (2005) Splicing for alternative structures of Cav1.2 Ca2+ channels in cardiac and smooth muscles. Cardiovasc Res 68:197–203
Lilue J, Müller UB, Steinfeldt T, Howard JC (2013) Reciprocal virulence and resistance polymorphism in the relationship between Toxoplasma gondii and the house mouse. Elife 2:e01298
Lindner R, Friedel CC (2012) A comprehensive evaluation of alignment algorithms in the context of RNA-seq. PLoS One 7:e52403
McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F (2010) Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26:2069–2070
Mendelowitz L, Pop M (2014) Computational methods for optical mapping. GigaScience 3:33
Mudge JM, Armstrong SD, McLaren K, Beynon RJ, Hurst JL, Nicholson C, Robertson DH, Wilming LG, Harrow JL (2008) Dynamic instability of the major urinary protein gene family revealed by genomic and phenotypic comparisons between C57 and 129 strain mice. Genome Biol 9:R91
Neeman Y, Levanon EY, Jantsch MF, Eisenberg E (2006) RNA editing level in the mouse is determined by the genomic repeat repertoire. RNA 12:1802–1809
Paigen K (2003) One hundred years of mouse genetics: an intellectual history. I. The classical period (1902-1980). Genetics 163:1–7
Paul MS, Bass BL (1998) Inosine exists in mRNA at tissue-specific levels and is most abundant in brain mRNA. EMBO J 17:1120–1127
Peng Z, Cheng Y, Tan BC-M, Kang L, Tian Z, Zhu Y, Zhang W, Liang Y, Hu X, Tan X et al (2012) Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome. Nat Biotechnol 30:253–260
Powell LM, Wallis SC, Pease RJ, Edwards YH, Knott TJ, Scott J (1987) A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine. Cell 50:831–840
Ramaswami G, Lin W, Piskol R, Tan MH, Davis C, Li JB (2012) Accurate identification of human Alu and non-Alu RNA editing sites. Nat Methods 9:579–581
Schadt EE, Turner S, Kasarskis A (2010) A window into third-generation sequencing. Hum Mol Genet 19:R227–R240
Sherry ST, Ward M-H, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K (2001) dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 29:308–311
Skarnes WC, Rosen B, West AP, Koutsourakis M, Bushell W, Iyer V, Mujica AO, Thomas M, Harrow J, Cox T et al (2011) A conditional knockout resource for the genome-wide study of mouse gene function. Nature 474:337–342
Steward CA, Gonzalez JM, Trevanion S, Sheppard D, Kerry G, Gilbert JGR, Wicker LS, Rogers J, Harrow JL (2013) The non-obese diabetic mouse sequence, annotation and variation resource: an aid for investigating type 1 diabetes. Database 2013:bat032
Sudbery I, Stalker J, Simpson JT, Keane T, Rust AG, Hurles ME, Walter K, Lynch D, Teboul L, Brown SD et al (2009) Deep short-read sequencing of chromosome 17 from the mouse strains A/J and CAST/Ei identifies significant germline variation and candidate genes that regulate liver triglyceride levels. Genome Biol 10:R112
Van der Weyden L, Adams DJ, Bradley A (2002) Tools for targeted manipulation of the mouse genome. Physiol Genomics 11:133–164
Wong K, Bumpstead S, Weyden LVD, Reinholdt LG, Wilming LG, Adams DJ, Keane TM (2012) Sequencing and characterization of the FVB/NJ mouse genome. Genome Biol 13:R72
Yalcin B, Wong K, Agam A, Goodson M, Keane TM, Gan X, Nellaker C, Goodstadt L, Nicod J, Bhomra A et al (2011) Sequence-based characterization of structural variation in the mouse genome. Nature 477:326–329
Yalcin B, Wong K, Bhomra A, Goodson M, Keane TM, Adams DJ, Flint J (2012) The fine-scale architecture of structural variants in 17 mouse genomes. Genome Biol 13:R18
Zhang X, Firestein S (2002) The olfactory receptor gene superfamily of the mouse. Nat Neurosci 5:124–133
Author information
Authors and Affiliations
Corresponding author
Additional information
David J. Adams and Anthony G. Doran are joint first authors.
Rights and permissions
About this article
Cite this article
Adams, D.J., Doran, A.G., Lilue, J. et al. The Mouse Genomes Project: a repository of inbred laboratory mouse strain genomes. Mamm Genome 26, 403–412 (2015). https://doi.org/10.1007/s00335-015-9579-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00335-015-9579-6