Finding Nested Common Intervals Efficiently
In this paper, we study the problem of efficiently finding gene clusters formalized by nested common intervals between two genomes represented either as permutations or as sequences. Considering permutations, we give several algorithms whose running time depends on the size of the actual output rather than the output in the worst case. Indeed, we first provide a straightforward O(n^3) time algorithm for finding all nested common intervals. We reduce this complexity by providing an O(n^2) time algorithm computing an irredundant output. Finally, we show, by providing a third algorithm, that finding only the maximal nested common intervals can be done in linear time. Considering sequences, we provide solutions (modifications of previously defined algorithms and a new algorithm) for different variants of the problem, depending on the treatment one wants to apply to duplicated genes.
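To make the permutation case concrete, the following is a minimal brute-force sketch, not the paper's algorithm. It relies on assumptions that the abstract does not state: one genome is taken to be the identity permutation, a common interval is a run of positions whose values form a set of consecutive integers, and a common interval is nested if it has size 1 or contains a nested common interval of size one less. The function name `nested_common_intervals` is hypothetical.

```python
def nested_common_intervals(perm):
    """Return all nested common intervals of `perm` against the identity.

    Intervals are reported as 0-based, inclusive position pairs (i, j).
    Assumed definition: an interval is nested if it has size 1, or if it is
    a common interval containing a nested common interval of size one less.
    This is a naive cubic-time sketch, not an optimized algorithm.
    """
    n = len(perm)
    # Every single element is trivially a nested common interval.
    nested = {(i, i) for i in range(n)}

    # Process intervals by increasing size so smaller results are available.
    for size in range(2, n + 1):
        for i in range(n - size + 1):
            j = i + size - 1
            window = perm[i:j + 1]
            # Distinct values are consecutive integers exactly when
            # max - min equals the number of positions minus one.
            is_common = max(window) - min(window) == j - i
            # The only sub-intervals of size one less are (i+1, j) and (i, j-1).
            if is_common and ((i + 1, j) in nested or (i, j - 1) in nested):
                nested.add((i, j))

    return sorted(nested)


if __name__ == "__main__":
    # The whole permutation is a common interval but not a nested one here.
    print(nested_common_intervals([2, 4, 1, 3]))
    # Here the intervals grow one element at a time up to the full permutation.
    print(nested_common_intervals([3, 1, 2, 4]))
```

For `[2, 4, 1, 3]` only the four singletons are reported, because no interval of size 2 or 3 is common, so the full permutation has nothing to be nested around. Scanning each window for its minimum and maximum is what makes this sketch cubic; the output can itself contain a quadratic number of intervals, as the identity permutation shows.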
Keywords: Gene Cluster, Linear Time, Time Algorithm, Identity Permutation, Consecutive Integer