Identifying Transcription Factor Binding Sites Based on a Neural Network
The identification of regulatory motifs (transcription factor binding sites) in DNA sequences is a difficult pattern recognition problem. Many methods have been developed in the past few years. Although some are better than the others in a sense, yet not a single one is recognized to be the best. Generally, in the case of long and subtle motifs, exhaustive enumeration becomes problematic. In this paper,we present a new method which improves exhaustive enumeration based on a neural network. We test its performance on both synthetic data and realistic biological data. It proved to be successful in identifying very subtle motifs. Experiments also show our method outperforms some popular methods in terms of identifying subtle motifs. We refer to the new method as IMNN (Identifying Motifs based on a Neural Network).
KeywordsHide Layer Transcription Factor Binding Site Synthetic Data Consensus Motif Motif Problem
Unable to display preview. Download preview PDF.
- 3.Bailey, T.L., Elkan, C.: The Value of Prior Knowledge in Discovering Motifs with MEME. In: Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology, pp. 21–29. AAAI Press, Menlo Park (1995)Google Scholar
- 4.Workman, C.T., Stormo, G.D.: ANN-Spec: a Method for Discovering Transcription Factor Binding Sites with Improved Specificity. In: Pacific Symposium on Biocomputing, pp. 467–478. Stanford University, Stanford (2000)Google Scholar
- 5.Pevzner, P.A., Sze, S.-H.: Combinatorial Approaches to Finding Subtle Signals in DNA Sequences. In: Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, pp. 269–278. AAAI Press, Menlo Park (2000)Google Scholar
- 6.Dai, X., Dai, Z.: Finding Transcription Factor Binding Sites Based on Multiple Pairwise Alignment. In: Proceedings of 2005 International Symposium on Intelligence Signal Processing and Communication Systems, pp. 577–580 (2005)Google Scholar