Mammalian Genome

, Volume 23, Issue 9, pp 550–558

BioGPS and GXD: mouse gene expression data—the benefits and challenges of data integration


DOI: 10.1007/s00335-012-9408-0

Cite this article as:
Ringwald, M., Wu, C. & Su, A.I. Mamm Genome (2012) 23: 550. doi:10.1007/s00335-012-9408-0


Mouse gene expression data are complex and voluminous. To maximize the utility of these data, they must be made readily accessible through databases, and those resources need to place the expression data in the larger biological context. Here we describe two community resources that approach these problems in different but complementary ways: BioGPS and the Mouse Gene Expression Database (GXD). BioGPS connects its large and homogeneous microarray gene expression reference data sets via plugins with a heterogeneous collection of external gene centric resources, thus casting a wide but loose net. GXD acquires different types of expression data from many sources and integrates these data tightly with other types of data in the Mouse Genome Informatics (MGI) resource, with a strong emphasis on consistency checks and manual curation. We describe and contrast the “loose” and “tight” data integration strategies employed by BioGPS and GXD, respectively, and discuss the challenges and benefits of data integration. BioGPS is freely available at GXD is freely available through the MGI web site ( or directly at

Copyright information

© Springer Science+Business Media, LLC 2012

Authors and Affiliations

  1. 1.The Jackson LaboratoryBar HarborUSA
  2. 2.The Scripps Research InstituteLa JollaUSA