ICIC 2007: Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues pp 775-781 | Cite as
A Two – Block Motif Discovery Method with Improved Accuracy
Abstract
The accuracy of the existing methods for two - block motifs discovery is usually less than 50%, which is difficult to be increased. Based on position weight matrix (PWM) for two - block motif model, this paper proposed an improved Gibbs sampling algorithm to overcome local converged performance of original Gibbs sampling algorithm and increase the predictive accuracy by introducing motif base. The feasibility and the effectiveness of novel algorithm are verified by the real biological data through computer experiments. The results are analyzed and compared with other algorithms such as RSAT and AlignACE. The accuracy of novel algorithm is larger than 55% for two - block motifs, which is superior to that of existing methods.
Keywords
Motif Discovery Pwm Gibbs Sampling Motif Base AccuracyPreview
Unable to display preview. Download preview PDF.
References
- 1.Gert, T.: A Gibbs Sampling Method to Detect Overrepresented Motifs in the Upstream Regions of Co-Expressed Genes. In: Proceedings of Annual Conference on Research in Computational Molecular Biology, pp. 1123–1128 (2002)Google Scholar
- 2.Mark, R.: Improving Computational Predictions of Cis - Regulatory Binding Sites. Pacific Symposium on Biocomputing 11, 391–402 (2006)Google Scholar
- 3.Shane, T.J.: BioOptimizer: A Bayesian Scoring Function Approach to Motif Discovery. Bioinfomatics 20(10), 1557–1564 (2004)CrossRefGoogle Scholar
- 4.Xie, X., Lu, J., Kulbokas, E.J., et al.: Systematic Discovery of Regulatory Motifs in Human Promoters and 3’ UTRs by Comparison of Several Mammals. Nature 434, 338–345 (2005)CrossRefGoogle Scholar
- 5.Benjamin, R.: A Uniform Projection Method for Motif Discovery in DNA Sequences. Proceedings of IEEE/ACM Transactions on Computational Biology and Bioinformatics 1(2), 91–94 (2004)CrossRefMathSciNetGoogle Scholar
- 6.Bailey, T.L., Elkan, C.: Fitting A Mixture Model by Expectation Maximization to Discover Motifs in Biopolymers. In: Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, Menlo Park, California, pp. 28–36. AAAI Press, Stanford, California, USA (1994)Google Scholar
- 7.Hertz, G.Z., Stormo, G.D.: Identifying DNA and Protein Patterns with Statistically Significant Alignments of Multiple Sequences. Bioinfomatics 15, 563–577 (1999)CrossRefGoogle Scholar
- 8.Liu, J.S., Neuwald, A.N., Lawrence, C.E.: Bayesian Models for Multiple Local Sequence Alignment and Gibbs Sampling Strategies. J. Am. Stat. Assoc. 90, 1156–1170 (1995)MATHCrossRefGoogle Scholar
- 9.Neuwald, A.F., Liu, J.S., Lawrence, C.E: Gibbs Motif Sampling: Detection of Bacterial Outer Membrane Protein Repeats. Protein Science 4, 1618–1632 (1995)CrossRefGoogle Scholar
- 10.Liu, X.: BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes. Pac. Symp. Biocomput., 127–138 (2001)Google Scholar
- 11.Favorov, A.V., Gelfand, M.S., Gerasimova, A.V., et al.: A Gibbs Sampler for Identification of Symmetrically Structured, Spaced DNA Motifs with Improved Estimation of the Signal Length. Bioinformatics 21, 2240–2245 (2005)CrossRefGoogle Scholar
- 12.Wu, X.M., et al.: A Combined Model and a Varied Gibbs Sampling Algorithm Used for Motif Discovery. Proceedings of 2nd Asia-Pacific Bioinformatics Conference 55, 1–6 (2004)Google Scholar
- 13.Eric, C.R.: A Brief Overview of Gibbs Sampling. Technique Report, IBC Statistics Study Group (1997)Google Scholar
- 14.Hughes, J.D., Estep, P.W., Tavazoie, S., Church, G.M.: Computational Identification of Cis-Regulatory Elements Associated with Groups of Functionally Related Genes in Sacchaomyces Cerevisiae. J. Mol. Biol. 296(5), 1205–1214 (2000)CrossRefGoogle Scholar
- 15.Henry, C.M., et al.: Finding Exact Optimal Motifs in Matrix Representation by Partitioning. Bioinfomatics 21(Suppl 2), ii86-ii92 (2005)Google Scholar