Inhomogeneous Parsimonious Markov Models

  • Ralf Eggeling
  • André Gohr
  • Pierre-Yves Bourguignon
  • Edgar Wingender
  • Ivo Grosse
Conference paper

DOI: 10.1007/978-3-642-40988-2_21

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8188)
Cite this paper as:
Eggeling R., Gohr A., Bourguignon PY., Wingender E., Grosse I. (2013) Inhomogeneous Parsimonious Markov Models. In: Blockeel H., Kersting K., Nijssen S., Železný F. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2013. Lecture Notes in Computer Science, vol 8188. Springer, Berlin, Heidelberg

Abstract

We introduce inhomogeneous parsimonious Markov models for modeling statistical patterns in discrete sequences. These models are based on parsimonious context trees, which are a generalization of context trees, and thus generalize variable order Markov models. We follow a Bayesian approach, consisting of structure and parameter learning. Structure learning is a challenging problem due to an overexponential number of possible tree structures, so we describe an exact and efficient dynamic programming algorithm for finding the optimal tree structures.

We apply model and learning algorithm to the problem of modeling binding sites of the human transcription factor C/EBP, and find an increased prediction performance compared to fixed order and variable order Markov models. We investigate the reason for this improvement and find several instances of context-specific dependences that can be captured by parsimonious context trees but not by traditional context trees.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Ralf Eggeling
    • 1
  • André Gohr
    • 1
  • Pierre-Yves Bourguignon
    • 2
  • Edgar Wingender
    • 3
  • Ivo Grosse
    • 1
    • 4
  1. 1.Institute of Computer ScienceMartin Luther UniversityHalleGermany
  2. 2.Max Planck Institute for Mathematics in the SciencesLeipzigGermany
  3. 3.Institute of BioinformaticsUniversity Medical Center GöttingenGöttingenGermany
  4. 4.German Center of Integrative Biodiversity Research (iDiv) Halle-Jena-LeipzigLeipzigGermany

Personalised recommendations