Nucleosome Occupancy Information Improves de novo Motif Discovery
- Cite this paper as:
- Narlikar L., Gordân R., Hartemink A.J. (2007) Nucleosome Occupancy Information Improves de novo Motif Discovery. In: Speed T., Huang H. (eds) Research in Computational Molecular Biology. RECOMB 2007. Lecture Notes in Computer Science, vol 4453. Springer, Berlin, Heidelberg
A complete understanding of transcriptional regulatory processes in the cell requires identification of transcription factor binding sites on a genome-wide scale. Unfortunately, these binding sites are typically short and degenerate, posing a significant statistical challenge: many more matches to known transcription factor binding sites occur in the genome than are actually functional. Chromatin structure is known to play an important role in guiding transcription factors to those sites that are functional. In particular, it has been shown that active regulatory regions are usually depleted of nucleosomes, thereby enabling transcription factors to bind DNA in those regions . In this paper, we describe a novel algorithm which employs an informative prior over DNA sequence positions based on a discriminative view of nucleosome occupancy; the nucleosome occupancy information comes from a recently published computational model . When a Gibbs sampling algorithm with our informative prior is applied to yeast sequence-sets identified by ChIP-chip , the correct motif is found in 50% more cases than with an uninformative uniform prior. Moreover, if nucleosome occupancy information is not available, our informative prior reduces to a new kind of prior that can exploit discriminative information in a purely generative setting.
Unable to display preview. Download preview PDF.