A Two – Block Motif Discovery Method with Improved Accuracy

  • Bin Kuang
  • Nini Rao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4681)

Abstract

The accuracy of the existing methods for two - block motifs discovery is usually less than 50%, which is difficult to be increased. Based on position weight matrix (PWM) for two - block motif model, this paper proposed an improved Gibbs sampling algorithm to overcome local converged performance of original Gibbs sampling algorithm and increase the predictive accuracy by introducing motif base. The feasibility and the effectiveness of novel algorithm are verified by the real biological data through computer experiments. The results are analyzed and compared with other algorithms such as RSAT and AlignACE. The accuracy of novel algorithm is larger than 55% for two - block motifs, which is superior to that of existing methods.

Keywords

Motif Discovery Pwm Gibbs Sampling Motif Base  Accuracy 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gert, T.: A Gibbs Sampling Method to Detect Overrepresented Motifs in the Upstream Regions of Co-Expressed Genes. In: Proceedings of Annual Conference on Research in Computational Molecular Biology, pp. 1123–1128 (2002)Google Scholar
  2. 2.
    Mark, R.: Improving Computational Predictions of Cis - Regulatory Binding Sites. Pacific Symposium on Biocomputing 11, 391–402 (2006)Google Scholar
  3. 3.
    Shane, T.J.: BioOptimizer: A Bayesian Scoring Function Approach to Motif Discovery. Bioinfomatics 20(10), 1557–1564 (2004)CrossRefGoogle Scholar
  4. 4.
    Xie, X., Lu, J., Kulbokas, E.J., et al.: Systematic Discovery of Regulatory Motifs in Human Promoters and 3’ UTRs by Comparison of Several Mammals. Nature 434, 338–345 (2005)CrossRefGoogle Scholar
  5. 5.
    Benjamin, R.: A Uniform Projection Method for Motif Discovery in DNA Sequences. Proceedings of IEEE/ACM Transactions on Computational Biology and Bioinformatics 1(2), 91–94 (2004)CrossRefMathSciNetGoogle Scholar
  6. 6.
    Bailey, T.L., Elkan, C.: Fitting A Mixture Model by Expectation Maximization to Discover Motifs in Biopolymers. In: Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, Menlo Park, California, pp. 28–36. AAAI Press, Stanford, California, USA (1994)Google Scholar
  7. 7.
    Hertz, G.Z., Stormo, G.D.: Identifying DNA and Protein Patterns with Statistically Significant Alignments of Multiple Sequences. Bioinfomatics 15, 563–577 (1999)CrossRefGoogle Scholar
  8. 8.
    Liu, J.S., Neuwald, A.N., Lawrence, C.E.: Bayesian Models for Multiple Local Sequence Alignment and Gibbs Sampling Strategies. J. Am. Stat. Assoc. 90, 1156–1170 (1995)MATHCrossRefGoogle Scholar
  9. 9.
    Neuwald, A.F., Liu, J.S., Lawrence, C.E: Gibbs Motif Sampling: Detection of Bacterial Outer Membrane Protein Repeats. Protein Science 4, 1618–1632 (1995)CrossRefGoogle Scholar
  10. 10.
    Liu, X.: BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes. Pac. Symp. Biocomput., 127–138 (2001)Google Scholar
  11. 11.
    Favorov, A.V., Gelfand, M.S., Gerasimova, A.V., et al.: A Gibbs Sampler for Identification of Symmetrically Structured, Spaced DNA Motifs with Improved Estimation of the Signal Length. Bioinformatics 21, 2240–2245 (2005)CrossRefGoogle Scholar
  12. 12.
    Wu, X.M., et al.: A Combined Model and a Varied Gibbs Sampling Algorithm Used for Motif Discovery. Proceedings of 2nd Asia-Pacific Bioinformatics Conference 55, 1–6 (2004)Google Scholar
  13. 13.
    Eric, C.R.: A Brief Overview of Gibbs Sampling. Technique Report, IBC Statistics Study Group (1997)Google Scholar
  14. 14.
    Hughes, J.D., Estep, P.W., Tavazoie, S., Church, G.M.: Computational Identification of Cis-Regulatory Elements Associated with Groups of Functionally Related Genes in Sacchaomyces Cerevisiae. J. Mol. Biol. 296(5), 1205–1214 (2000)CrossRefGoogle Scholar
  15. 15.
    Henry, C.M., et al.: Finding Exact Optimal Motifs in Matrix Representation by Partitioning. Bioinfomatics 21(Suppl 2), ii86-ii92 (2005)Google Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Bin Kuang
    • 1
  • Nini Rao
    • 1
  1. 1.School of Life Sciences & Technology, University of Electronic Science & Technology of China, Chengdu 610054P.R. China

Personalised recommendations