Discovering Motiv Based Association Rules in a Set of DNA Sequences

Kiem, Hoang; Phuc, Do

doi:10.1007/3-540-45554-X_47

Discovering Motiv Based Association Rules in a Set of DNA Sequences

Hoang Kiem² &
Do Phuc²

Conference paper
First Online: 18 December 2001

5093 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2005))

Abstract

The research of similarity between DNA sequences is an important problem in Bio-Informatics. In the traditional approach, the dynamic programming based pair-wise alignment is used for measuring the similarity between two sequences. This method does not work well in a large data set. In this paper, we consider motifs like the phrase of document and use text mining techniques for finding the frequent motifs, maximal frequent motifs, motif based association rules in a group of genes.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A Chouchoulas and Q. Shen, A Rough Set based approach to text classification, In the Proceedings of RFDGRC99 international conference, Yamaguchi-UBE, Japan, 1999.
Google Scholar
Anders Krogh: An introduction to Hidden Markov Models for Biological Sequences, Computer Methods in Molecular Biology, Elservier, 1998
Google Scholar
Hoang Kiem, Do Phuc: Discovering the binary and fuzzy association rules from database: hi the proceedings of the AFSS2000 international conference, Tsukuba, Japan, 2000
Google Scholar
Hoang Kiem, Do Phuc: On the Extension of lower approximation in rough set theory for classification problem in data mining, the WCC2000 conference, Beijing, August 2000 (to be accepted for presentation).
Google Scholar
Timothy L. Bailey: Discovering motifs in DNA and protein sequence: the approximate common sub-string problem: Ph D dissertation, Univ California, San Diego, USA, 1995
Google Scholar
Robert Giegerich and David Wheeler: Pair wise Sequence Alignment, 1996 website: http://www.techfak.uni-bielefeld.de/bcd/Curric/PrwAli/prwali.html
R. Agrawal, R. Srikant, Fast Algorithm for Mining Association Rules in large database, Research report RJ, IBM Almaden Research Center, San Jose, CA,1994
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, University of Natural Sciences, HCMC, 227 Nguyen Van Cu St District 5, HCM city, Vietnam
Hoang Kiem & Do Phuc

Authors

Hoang Kiem
View author publications
You can also search for this author in PubMed Google Scholar
Do Phuc
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Regina Regina, S4S 0A2, Saskatchewan, Canada
Wojciech Ziarko & Yiyu Yao &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kiem, H., Phuc, D. (2001). Discovering Motiv Based Association Rules in a Set of DNA Sequences. In: Ziarko, W., Yao, Y. (eds) Rough Sets and Current Trends in Computing. RSCTC 2000. Lecture Notes in Computer Science(), vol 2005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45554-X_47

Download citation

DOI: https://doi.org/10.1007/3-540-45554-X_47
Published: 18 December 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43074-2
Online ISBN: 978-3-540-45554-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics