A Method to Find Sequentially Separated Motifs in Biological Sequences (SSMBS)
- Cite this paper as:
- Kumar C., Kumar N., Rangarajan S., Balakrishnan N., Sekar K. (2008) A Method to Find Sequentially Separated Motifs in Biological Sequences (SSMBS). In: Chetty M., Ngom A., Ahmad S. (eds) Pattern Recognition in Bioinformatics. PRIB 2008. Lecture Notes in Computer Science, vol 5265. Springer, Berlin, Heidelberg
Sequence motifs occurring in a particular order in proteins or DNA have been proved to be of biological interest. In this paper, a new method to locate the occurrences of up to five user-defined motifs in a specified order in large proteins and in nucleotide sequence databases is proposed. It has been designed using the concept of quantifiers in regular expressions and linked lists for data storage. The application of this method includes the extraction of relevant consensus regions from biological sequences. This might be useful in clustering of protein families as well as to study the correlation between positions of motifs and their functional sites in DNA sequences.
KeywordsRegular expressions protein and nucleotide sequences sequence motifs
Unable to display preview. Download preview PDF.