Identifying Differentially Expressed Genes in Time Course Microarray Data

Ma, Ping; Zhong, Wenxuan; Liu, Jun S.

doi:10.1007/s12561-009-9014-1

Identifying Differentially Expressed Genes in Time Course Microarray Data

Published: 10 October 2009

Volume 1, pages 144–159, (2009)
Cite this article

Statistics in Biosciences Aims and scope Submit manuscript

Ping Ma¹,
Wenxuan Zhong¹ &
Jun S. Liu²

218 Accesses
15 Citations
Explore all metrics

Abstract

Identifying differentially expressed (DE) genes across conditions or treatments is a typical problem in microarray experiments. In time course microarray experiments (under two or more conditions/treatments), it is sometimes of interest to identify two classes of DE genes: those with no time-condition interactions (called parallel DE genes, or PDE), and those with time-condition interactions (nonparallel DE genes, NPDE). Although many methods have been proposed for identifying DE genes in time course experiments, methods for discerning NPDE genes from the general DE genes are still lacking. We propose a functional ANOVA mixed-effect model to model time course gene expression observations. The fixed effect of (the mean curve) of the model decomposes bivariate functions of time and treatments (or experimental conditions) as in the classic ANOVA method and provides the associated notions of main effects and interactions. Random effects capture time-dependent correlation structures. In this model, identifying NPDE genes is equivalent to testing the significance of the time-condition interaction, for which an approximate F-test is suggested. We examined the performance of the proposed method on simulated datasets in comparison with some existing methods, and applied the method to a study of human reaction to the endotoxin stimulation, as well as to a cell cycle expression data set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Calvano S, Xiao W, Richards D et al. (2005) A network-based analysis of systemic inflammation in humans. Nature 437:1032–1037
Article Google Scholar
Cantoni E, Hastie T (2002) Degrees-of-freedom tests for smoothing splines. Biometrika 89:251–263
Article MathSciNet MATH Google Scholar
Castillo-Davis C, Hartl D (2003) Genemerge: post-genomic analysis, data-mining and hypothesis. Bioinformatics 19:891–892
Article Google Scholar
Crainiceanu CM, Ruppert D (2004) Restricted likelihood ratio tests in nonparametric longitudinal models. Stat Sin 14(3):713–729
MathSciNet MATH Google Scholar
Craven P, Wahba G (1979) Smoothing noisy data with spline functions: Estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31:377–403
Article MathSciNet MATH Google Scholar
Davies RB, (1980) [Algorithm AS 155] The distribution of a linear combination of χ ² random variables (AS R53: 84V33 pp 366–369). Appl Stat 29:323–333
Article MATH Google Scholar
Dennis JE, Schnabel RB (1996) Numerical methods for unconstrained optimization and nonlinear equations. SIAM, Philadelphia. Corrected reprint of the 1983 original
MATH Google Scholar
Gu C (2002) Smoothing spline ANOVA models. Springer, New York
MATH Google Scholar
Gu C (2004) Model diagnostics for smoothing spline ANOVA models. Can J Stat 32(4):347–358
Article MATH Google Scholar
Gu C, Ma P (2005) Optimal smoothing in nonparametric mixed-effect models. Ann Stat 33:1357–1379
Article MathSciNet MATH Google Scholar
Guo W (2002) Inference in smoothing spline analysis of variance. J R Stat Soc, Ser B: Stat Methodol 64(4):887–898
Article MathSciNet MATH Google Scholar
Hastie T, Tibshirani R (1990) Generalized additive models. Chapman & Hall, London
MATH Google Scholar
Hogan C, Serpente N, Cogram P, Hosking CR, Bialucha CU, Feller SM, Braga VMM, Birchmeier W, Fujita Y (2004) Rap1 regulates the formation of e-cadherin-based cell–cell contacts. Mol Cell Biol 24:6690–6700
Article Google Scholar
Hong F, Li H (2006) Functional hierarchical models for identifying genes with different time-course expression profiles. Biometrics 62:534–544
Article MathSciNet MATH Google Scholar
Khatri P, Bhavsar P, Bawa G, Draghici S (2004) Onto-tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments. Nucleic Acids Res 32:W449–W456
Article Google Scholar
Kim Y-J, Gu C (2004) Smoothing spline Gaussian regression: More scalable computation via efficient approximation. J Roy Stat Soc Ser B 66:337–356
Article MathSciNet MATH Google Scholar
Kunst CB (2004) Complex genetics of amyotrophic lateral sclerosis. Am J Hum Genet 75:933–947
Article Google Scholar
Leung YF, Ma P, Link BA, Dowling J (2008) Factorial microarray analysis of zebrafish retina development. Proc Natl Acad Sci 105:12909–12914
Article Google Scholar
Li C, Wong WH (2001) Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detection. Proc Natl Acad Sci 98:31–36
Article MATH Google Scholar
Liu A, Wang Y (2004) Hypothesis testing in smoothing spline models. J Stat Comput Simul 74(8):581–597
Article MathSciNet MATH Google Scholar
Ma P, Castillo-Davis CI, Zhong W, Liu JS (2006) A data-driven clustering method for time course gene expression data. Nucleic Acids Res 34:1261–1269
Article Google Scholar
Ma P, Zhong W (2008) Penalized clustering of large scale functional data with multiple covariates. J Amer Stat Assoc 103:625–636
Article MathSciNet MATH Google Scholar
Maglott D, Ostell J, Pruitt KD, Tatusova T (2005) Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res 33(Database Issue):D45–D58
Article Google Scholar
Orlando DA, Lin CY, Bernard A, Wang JY, Socolar JES, Iversen ES, Hartemink AJ, Haase SB (2008) Global control of cell-cycle transcription by coupled CDK and network oscillators. Nature 453:944–947
Article Google Scholar
Robinson GK (1991) That BLUP is a good thing: The estimation of the random effects. Statist Sci 6:15–51 (with discussions)
Article MathSciNet MATH Google Scholar
Self SG, Liang K-Y (1987) Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Am Stat Assoc 82:605–610
Article MathSciNet MATH Google Scholar
Storey JD, Tibshirani R (2003) Statistical significance for genome-wide studies. Proc Natl Acad Sci 100:9440–9445
Article MathSciNet MATH Google Scholar
Storey JD, Xiao W, Leek JT, Tompkins R, Davis G (2005) Significance of time course microarray experiments. Proc Natl Acad Sci 102:12837–12842
Article Google Scholar
Tai YC, Speed TP (2006) A multivariate empirical Bayes statistic for replicated microarray time course data. Ann Stat 34:2387–2412
Article MathSciNet MATH Google Scholar
Tusher VG, Tibshirani R, Chu G (2001) Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci 98:5116–5121
Article MATH Google Scholar
Wahba G (1990) Spline models for observational data. CBMS-NSF regional conference series in applied mathematics, vol. 59. SIAM, Philadelphia
MATH Google Scholar
Yuan M, Kendziorski C (2006) Hidden Markov models for microarray time course data under multiple biological conditions. J Am Stat Assoc 101:1323–1340
Article MathSciNet MATH Google Scholar
Zhang C (2003) Calibrating the degrees of freedom for automatic data smoothing and effective curve checking. J Am Stat Assoc 98(463):609–628
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, IL, 61820, USA
Ping Ma & Wenxuan Zhong
Department of Statistics, Harvard University, Cambridge, MA, 02138, USA
Jun S. Liu

Authors

Ping Ma
View author publications
You can also search for this author in PubMed Google Scholar
Wenxuan Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Jun S. Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ping Ma or Jun S. Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ma, P., Zhong, W. & Liu, J.S. Identifying Differentially Expressed Genes in Time Course Microarray Data. Stat Biosci 1, 144–159 (2009). https://doi.org/10.1007/s12561-009-9014-1

Download citation

Received: 12 August 2009
Accepted: 28 September 2009
Published: 10 October 2009
Issue Date: November 2009
DOI: https://doi.org/10.1007/s12561-009-9014-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Identifying Differentially Expressed Genes in Time Course Microarray Data

Abstract

Access this article

Similar content being viewed by others

From sequence to consequence: Deciphering the complex cis-regulatory landscape

Effective use of the McNemar test

Introduction to Bioinformatics

References

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Identifying Differentially Expressed Genes in Time Course Microarray Data

Abstract

Access this article

Similar content being viewed by others

From sequence to consequence: Deciphering the complex cis-regulatory landscape

Effective use of the McNemar test

Introduction to Bioinformatics

References

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation