Abstract
Gene expression patterns can reflect gene regulations in human tissues under normal or pathologic conditions. Gene expression profiling data from studies of primary human disease samples are particularly valuable since these studies often span many years in order to collect patient clinical information and achieve a large sample size. Disease-to-Gene Expression Mapper (DGEM) provides a beneficial community resource to access and analyze these data; it currently includes Affymetrix oligonucleotide array datasets for more than 40 human diseases and 1400 samples. The data are normalized to the same scale and stored in a relational database. A statistical-analysis pipeline was implemented to identify genes abnormally expressed in disease tissues or genes whose expressions are associated with clinical parameters such as cancer patient survival. Data-mining results can be queried through a web-based interface at http://dgem.dhcp.iupui.edu/. The query tool enables dynamic generation of graphs and tables that are further linked to major gene and pathway resources that connect the data to relevant biology, including Entrez Gene and Kyoto Encyclopedia of Genes and Genomes (KEGG). In summary, DGEM provides scientists and physicians a valuable tool to study disease mechanisms, to discover potential disease biomarkers for diagnosis and prognosis, and to identify novel gene targets for drug discovery. The source code is freely available for non-profit use, on request to the authors.
Similar content being viewed by others
References
Parkinson H, Sarkans U, Shojatalab M, et al. ArrayExpress: a public repository for microarray gene expression data at the EBI. Nucleic Acids Res 2005; 33 (database issue): D553–5
Barrett T, Suzek TO, Troup DB, et al. NCBI GEO: mining millions of expression profiles: database and tools. Nucleic Acids Res 2005; 33 (database issue): D562–6
Ball CA, Awad IA, Demeter J, et al. The Stanford Microarray Database accommodates additional microarray platforms and data formats. Nucleic Acids Res 2005; 33 (database issue): D580–2
Chen N, Harris TW, Antoshechkin I, et al. WormBase: a comprehensive data resource for Caenorhabditis biology and genomics. Nucleic Acids Res 2005; 33 (database issue): D383–9
Grumbling G, Strelets V. FlyBase: anatomical data, images and queries. Nucleic Acids Res 2006; 34 (database issue): D484–8
Pollet N, Schmidt HA, Gawantka V, et al. Axeldb: a Xenopus laevis database focusing on gene expression. Nucleic Acids Res 2000; 28(1): 139–40
Craigon DJ, James N, Okyere J, et al. NASCArrays: a repository for microarray data generated by NASC’s transcriptomics service. Nucleic Acids Res 2004; 32 (database issue): D575–7
Henrich T, Ramialison M, Quiring R, et al. MEPD: a Medaka gene expression pattern database. Nucleic Acids Res 2003; 31(1): 72–4
Hill DP, Begley DA, Finger JH, et al. The mouse Gene Expression Database (GXD): updates and enhancements. Nucleic Acids Res 2004; 32 (database issue): D568–71
Shah V, Sridhar S, Beane J, et al. SIEGE: Smoking Induced Epithelial Gene Expression Database. Nucleic Acids Res 2005; 33 (database issue): D573–9
Rhodes DR, Yu J, Shanker K, et al. ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia 2004; 6(1): 1–6
Kato K, Yamashita R, Matoba R, et al. Cancer Gene Expression Database (CGED): a database for gene expression profiling with accompanying clinical information of human cancer tissues. Nucleic Acids Res 2005; 33 (database issue): D533–6
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Proc R Stat Soc B 1995; 57: 289–300
Lawless JF. Statistical models and methods for lifetime data. New York: John Wiley & Sons, 1982
Acknowledgments
We thank Dr John Calley for his critical review of the manuscript. The authors have no conflicts of interest that are directly relevant to the content of this article.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xia, Y., Campen, A., Rigsby, D. et al. DGEM — A Microarray Gene Expression Database for Primary Human Disease Tissues. Mol Diag Ther 11, 145–149 (2007). https://doi.org/10.1007/BF03256235
Published:
Issue Date:
DOI: https://doi.org/10.1007/BF03256235