Curve Fitting for Short Time Series Data from High Throughput Experiments with Correction for Biological Variation

  • Frank Klawonn
  • Nada Abidi
  • Evelin Berger
  • Lothar Jänsch
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7619)


Modern high-throughput technologies like microarray, mass spectrometry or next generation sequencing enable biologists to measure cell products like metabolites, peptides, proteins or mRNA. With the advance of the technologies there are more and more experiments that do not only compare the cell products under two or more specific conditions, but also track them over time. These experiments usually yield short time series for a large number of cell products, but with only a few replicates. The noise in the measurements, but also the often strong biological variation of the replicates makes a coherent analysis of such data difficult. In this paper, we focus on methods to correct measurement errors or deviations caused by biological variation in terms of a time shift, different reaction speed and different reaction intensity for replicates. We propose a regression model that can estimate corresponding parameters that can be used to correct the data and to obtain better results in the further analysis.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bar-Joseph, Z.: Analyzing time series gene expression data. Bioinformatics 20, 2493–2503 (2004)CrossRefGoogle Scholar
  2. 2.
    Bezdek, J.C., Hathaway, R.J.: Convergence of alternating optimization. Neural, Parallel Sci. Comput. 11(4), 351–368 (2003)MathSciNetzbMATHGoogle Scholar
  3. 3.
    Bar-Joseph, Z., Gerber, G., Jaakkola, T., Gifford, D., Simon, I.: Comparing the continuous representation of time series expression profiles to identify differentially expressed genes. Proc. Natl. Acad. Sci. USA 100, 10146–10151 (2003)MathSciNetzbMATHCrossRefGoogle Scholar
  4. 4.
    Bar-Joseph, Z., Gerber, G., Jaakkola, T., Gifford, D., Simon, I.: Continuous representations of time series gene expression data. J. Computational Biology 3, 341–356 (2003)CrossRefGoogle Scholar
  5. 5.
    Ernst, J., Bar-Joseph, Z.: STEM: a tool for the analysis of short time series gene expression data. BMC Bioinformatics 7 (2006), doi:10.1186/1471–2105–7–191Google Scholar
  6. 6.
    Ndukum, J., Fonseca, L., Santos, H., Voit, E., Datta, S.: Statistical inference methods for sparse biological time series data. BMC Systems Biology 5(1), 57 (2011)CrossRefGoogle Scholar
  7. 7.
    Hoaglin, D., Mosteller, F., Tukey, J.: Understanding Robust and Exploratory Data Analysis. Wiley, New York (2000)zbMATHGoogle Scholar
  8. 8.
    Huber, P.: Robust Statistics. Wiley, New York (2004)Google Scholar
  9. 9.
    Akaike, H.: A new look at the statistical model identification. IEEE Transactions on Automatic Control 19, 716–723 (1974)MathSciNetzbMATHCrossRefGoogle Scholar
  10. 10.
    R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2009)Google Scholar
  11. 11.
    Villén, J., Gygi, S.: The SCX/IMAC enrichment approach for global phosphorylation analysis by mass spectrometry. Nature Protocols 3, 1630–1638 (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Frank Klawonn
    • 1
    • 2
  • Nada Abidi
    • 2
  • Evelin Berger
    • 2
  • Lothar Jänsch
    • 2
  1. 1.Department of Computer ScienceOstfalia University of Applied SciencesWolfenbuettelGermany
  2. 2.Cellular ProteomicsHelmholtz Centre for Infection ResearchBraunschweigGermany

Personalised recommendations