Abstract
As the need for comparing genomes of different species has grown dramatically with the fast progress of the Human Genome Project, the evolution at the level of whole genomes has attracted more and more attention from both biologists and computer scientists. They are especially interested in the scenarios in which the genome evolves through insertions, deletions, and movements of genes along its chromosomes. Marron et al proposed a polynomial-time approximation algorithm to compute (near) minimum edit distances under inversions, deletions, and unrestricted insertions. Our work is based on their algorithm, which carries out lots of comparisons and sorting to calculate the edit distance. These comparisons and sorting are extremely time-consuming, and they result in decrease of computational efficiency. We believe the time of the algorithm can be improved through parallelization. We parallelize their algorithm via OpenMP using Intel C++ compiler for Linux 7.1, and compare three levels of parallelism: coarse grain, fine grain and combination of both. The experiments are conducted for a varying number of threads and different lengths of the gene sequences. The experimental results show that either coarse grain parallelism or fine grain parallelism alone does not improve the performance of the algorithm very much. However, the use of combination of both fine grain and coarse grain parallelism improves the performance of the algorithm drastically.
Keywords
- Parallel Implementation
- Edit Distance
- Sequential Algorithm
- Average Execution Time
- Chunk Size
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Download conference paper PDF
References
Bader, D.A., Moret, B.M.E., Yan, M.: A fast linear-time algorithm for inversion distance with an experimental comparison. J. Comput. Biol. 8(5), 483–491 (2001)
Marron, M., Sweson, K.M., Moret, B.M.E.: Genomic Distances under Deletions and Insertions. In: Warnow, T.J., Zhu, B. (eds.) COCOON 2003. LNCS, vol. 2697, pp. 537–547. Springer, Heidelberg (2003)
Caprara, A.: Sorting by reversals is difficult. In: Proc. 1st Int’l Conf. on Comput. Mol. Biol. RECOMB 1997, pp. 75–83. ACM Press, New York (1997)
Hannenhalli, S., Pevzner, P.: Transforming cabbage into turnip (polynomial algorithm for sorting signed permutations by reversals). In: Proc. 27th Ann. Symp. Theory of Computing STOC 1995, pp. 178–189. ACM Press, New York (1995)
El-Mabrouk, N.: Genome rearrangement by reversals and insertions/deletions of contiguous segments. In: Giancarlo, R., Sankoff, D. (eds.) CPM 2000. LNCS, vol. 1848, pp. 222–234. Springer, Heidelberg (2000)
Liu, T., Moret, B.M.E., Bader, D.A.: An exact linear-time algorithm for computing genomic distances under inversions and deletions U. New Mexico, TR-CS-2003-31
Tutorial on OpenMP, http://www.llnl.gov/computing/tutorials/openMP/
Quinn, M.: Parallel Programming in C with MPI and OpenMP. The McGraw-Hill Companies, New York (2004)
Intel C++ Compiler for Linux, http://www.intel.com/software/products/compilers/clin/clinux.htm
Kaplan, H., Shamir, R., Tarjan, R.E.: Faster and Simpler Algorithm for Sorting Signed Permutations by Reversals. In: Proc. SODA 1997, pp. 344–351 (1997); SIAM Journal on Computing 29(3), 880–892 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kolli, V.S., Liu, H., Pan, M.H., Pan, Y. (2005). A Parallel Implementation for Determining Genomic Distances Under Deletion and Insertion. In: Sunderam, V.S., van Albada, G.D., Sloot, P.M.A., Dongarra, J.J. (eds) Computational Science – ICCS 2005. ICCS 2005. Lecture Notes in Computer Science, vol 3515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11428848_127
Download citation
DOI: https://doi.org/10.1007/11428848_127
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26043-1
Online ISBN: 978-3-540-32114-9
eBook Packages: Computer ScienceComputer Science (R0)
