Network-Based Analysis of Multivariate Gene Expression Data
Multivariate microarray gene expression data are commonly collected to study the genomic responses under ordered conditions such as over increasing/decreasing dose levels or over time during biological processes, where the expression levels of a give gene are expected to be dependent. One important question from such multivariate gene expression experiments is to identify genes that show different expression patterns over treatment dosages or over time; these genes can also point to the pathways that are perturbed during a given biological process. Several empirical Bayes approaches have been developed for identifying the differentially expressed genes in order to account for the parallel structure of the data and to borrow information across all the genes. However, these methods assume that the genes are independent. In this paper, we introduce an alternative empirical Bayes approach for analysis of multivariate gene expression data by assuming a discrete Markov random field (MRF) prior, where the dependency of the differential expression patterns of genes on the networks are modeled by a Markov random field. Simulation studies indicated that the method is quite effective in identifying genes and the modified subnetworks and has higher sensitivity than the commonly used procedures that do not use the pathway information, with similar observed false discovery rates. We applied the proposed methods for analysis of a microarray time course gene expression study of TrkA- and TrkB-transfected neuroblastoma cell lines and identified genes and subnetworks on MAPK, focal adhesion, and prion disease pathways that may explain cell differentiation in TrkA-transfected cell lines.
Key wordsMarkov random field Empirical Bayes KEGG pathways
This research was supported by NIH grants R01-CA127334 and P01-CA097323. We thank Mr. Edmund Weisberg, MS at Penn CCEB for editorial assistance.
- 9.Besag J (1974) Spatial interaction and the statistical analysis of lattice systems. J R Stat Soc Ser B 36:192–225Google Scholar
- 10.Besag J (1986) On the statistical analysis of dirty pictures. J R Stat Soc Ser B 48:259–302Google Scholar
- 12.Smyth GK (2004) Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3(1):Article 3Google Scholar
- 16.Tombes RM, Auer KL, Mikkelsen R, et al (1998) The mitogenactivated protein (MAP) kinase cascade can either stimulate or inhibit DNA synthesis in primary cultures of rat hepatocytes depending upon whether its activation is acute/phasic or chronic. J biochemistry 330(Pt 3):1451–1460PubMedCrossRefGoogle Scholar