How does DNA sequence motif discovery work?

D'haeseleer, Patrik

doi:10.1038/nbt0806-959

Primer
Published: 01 August 2006

Volume 24, pages 959–961 (2006)
Patrik D'haeseleer¹

How can we computationally extract an unknown motif from a set of target sequences? What are the principles behind the major motif discovery algorithms? Which of these should we use, and how do we know we've found a 'real' motif?

**Figure 1: Starting from a single site, expectation maximization algorithms such as MEME⁴ alternate between assigning sites to a motif (left) and updating the motif model (right).**
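
The alternation described in the Figure 1 caption can be sketched in a few lines of Python. This is a minimal illustration, not MEME itself: it assumes exactly one motif occurrence per sequence, a fixed motif width taken from the seed site, a uniform background, and uppercase A/C/G/T input. The function names (`seed_pwm`, `e_step`, `m_step`, `em_motif`) and the pseudocount value are hypothetical choices made for this example.

```python
ALPHABET = "ACGT"

def seed_pwm(seed, pseudo=0.5):
    """Build an initial position weight matrix (PWM) from a single seed site."""
    pwm = []
    for base in seed:
        col = {b: pseudo for b in ALPHABET}
        col[base] += 1.0
        total = sum(col.values())
        pwm.append({b: col[b] / total for b in ALPHABET})
    return pwm

def e_step(seqs, pwm, background):
    """Assign sites: posterior probability of each start offset in each sequence."""
    w = len(pwm)
    posteriors = []
    for seq in seqs:
        ratios = []
        for start in range(len(seq) - w + 1):
            # Likelihood ratio of the window under the motif vs. the background.
            lr = 1.0
            for j in range(w):
                base = seq[start + j]
                lr *= pwm[j][base] / background[base]
            ratios.append(lr)
        total = sum(ratios)
        posteriors.append([r / total for r in ratios])
    return posteriors

def m_step(seqs, posteriors, w, pseudo=0.5):
    """Update the motif model: posterior-weighted base counts per column."""
    counts = [{b: pseudo for b in ALPHABET} for _ in range(w)]
    for seq, post in zip(seqs, posteriors):
        for start, p in enumerate(post):
            for j in range(w):
                counts[j][seq[start + j]] += p
    pwm = []
    for col in counts:
        total = sum(col.values())
        pwm.append({b: col[b] / total for b in ALPHABET})
    return pwm

def em_motif(seqs, seed, n_iter=20):
    """Alternate site assignment and model update, starting from one seed site."""
    background = {b: 0.25 for b in ALPHABET}  # assumed uniform background
    pwm = seed_pwm(seed)
    for _ in range(n_iter):
        posteriors = e_step(seqs, pwm, background)
        pwm = m_step(seqs, posteriors, len(seed))
    posteriors = e_step(seqs, pwm, background)
    # Report the most probable start offset per sequence and the consensus.
    sites = [max(range(len(p)), key=p.__getitem__) for p in posteriors]
    consensus = "".join(max(col, key=col.get) for col in pwm)
    return consensus, sites
```

Calling `em_motif(seqs, seed)` with a list of sequences and a seed substring returns the consensus of the final model and one start offset per sequence. A real tool would typically try many seeds and widths, allow zero or multiple sites per sequence, and score the resulting motifs for statistical significance.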

References

1. D'haeseleer, P. What are DNA sequence motifs? Nat. Biotechnol. 24, 423–425 (2006).
2. Sinha, S. & Tompa, M. YMF: a program for discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acids Res. 31, 3586–3588 (2003).
3. Pavesi, G. et al. Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes. Nucleic Acids Res. 32 (Web Server Issue), W199–W203 (2004).
4. Bailey, T.L. & Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36 (1994).
5. Tompa, M. et al. Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol. 23, 137–144 (2005).
6. Li, N. & Tompa, M. Analysis of computational approaches for motif discovery. Alg. Mol. Biol. 1, 8 (2006).
7. Hu, J., Li, B. & Kihara, D. Limitations and potentials of current motif discovery algorithms. Nucleic Acids Res. 33, 4899–4913 (2005).
8. Thijs, G. et al. A Gibbs sampling method to detect overrepresented motifs in the upstream regions of coexpressed genes. J. Comp. Biol. 9, 447–464 (2002).
9. Huber, B.R. & Bulyk, M.L. Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data. BMC Bioinformatics 7, 229 (2006).
10. Hughes, J.D. et al. Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J. Mol. Biol. 296, 1205–1214 (2000).
11. McGuire, A.M., Hughes, J.D. & Church, G.M. Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes. Genome Res. 10, 744–757 (2000).
12. Huang, H.-D. et al. Identifying transcriptional regulatory sites in the human genome using an integrated system. Nucleic Acids Res. 32, 1948–1956 (2004).

Author information

Authors and Affiliations

Microbial Systems Division, Biosciences Directorate, Lawrence Livermore National Laboratory, 7000 East Ave., PO Box 808, L-448, Livermore, 94551, California, USA
Patrik D'haeseleer

About this article

Cite this article

D'haeseleer, P. How does DNA sequence motif discovery work? Nat Biotechnol 24, 959–961 (2006). https://doi.org/10.1038/nbt0806-959

Issue Date: 01 August 2006
DOI: https://doi.org/10.1038/nbt0806-959
Springer Nature America, Inc.

This article is cited by

biomapp::chip: large-scale motif analysis
- Jader M. Caldonazzo Garbelini
- Danilo S. Sanches
- Aurora T. Ramirez Pozo
BMC Bioinformatics (2024)
Sequence motif finder using memetic algorithm
- Jader M. Caldonazzo Garbelini
- André Y. Kashiwabara
- Danilo S. Sanches
BMC Bioinformatics (2018)
DiNAMO: highly sensitive DNA motif discovery in high-throughput sequencing data
- Chadi Saad
- Laurent Noé
- Martin Figeac
BMC Bioinformatics (2018)
SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets
- Qiang Yu
- Dingbang Wei
- Hongwei Huo
BMC Bioinformatics (2018)
A systematic approach to RNA-associated motif discovery
- Tian Gao
- Jiang Shu
- Juan Cui
BMC Genomics (2018)
