Abstract
This paper introduces a novel study on the performance of Stochastic Diffusion Search (SDS)—a swarm intelligence algorithm—to address DNA sequence assembly problem. This is an NP-hard problem and one of the primary problems in computational molecular biology that requires optimisation methodologies to reconstruct the original DNA sequence. In this work, SDS algorithm is adapted for this purpose and several experiments are run in order to evaluate the performance of the presented technique over several frequently used benchmarks. Given the promising results of the newly proposed algorithm and its success in assembling the input fragments, its behaviour is further analysed, thus shedding light on the process through which the algorithm conducts the task. Additionally, the algorithm is applied to overlap score matrices which are generated from the raw input fragments; the algorithm optimises the overlap score matrices to find better results. In these experiments real-world data are used and the performance of SDS is compared with several other algorithms which are used by other researchers in the field, thus demonstrating its weaknesses and strengths in the experiments presented in the paper.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
al-Rifaie, M.M., Bishop, M.: Stochastic diffusion search review. In: Paladyn, Journal of Behavioral Robotics, vol. 4(3), pp. 155–173. Springer, Heidelberg (2013)
Alba, E., Luque, G.: A new local search algorithm for the dna fragment assembly problem. In: Evolutionary Computation in Combinatorial Optimization, pp. 1–12. Springer, Heidelberg (2007)
Alba, E., Luque, G.: A hybrid genetic algorithm for the dna fragment assembly problem. In: Recent Advances in Evolutionary Computation for Combinatorial Optimization, pp. 101–112. Springer, Heidelberg (2008)
Bishop, J.: Stochastic searching networks. In: Proceedings of the 1st IEEE Conference on Artificial Neural Networks. pp. 329–331. London, UK (1989)
Blum, C., Li, X.: Swarm Intelligence in Optimization. Springer, Berlin (2008)
Cotta, C., Fernández, A., Gallardo, J., Luque, G., Alba, E.: Metaheuristics in bioinformatics: DNA sequencing and reconstruction. In: Optimization Techniques for Solving Complex Problems, pp. 265–286 (2009)
Fang, S.C., Wang, Y., Zhong, J.: A genetic algorithm approach to solving dna fragment assembly problem. J. Comput. Theor. Nanosci. 2(4), 499–505 (2005)
Firoz, J.S., Rahman, M.S., Saha, T.K.: Bee algorithms for solving dna fragment assembly problem with noisy and noiseless data. In: Proceedings of the Fourteenth International Conference on Genetic and Evolutionary Computation Conference. pp. 201–208. ACM, New York (2012)
Huang, K.W., Chen, J.L., Yang, C.S., Tsai, C.W.: A memetic particle swarm optimization algorithm for solving the dna fragment assembly problem. Neural Comput. Appl. pp. 1–12 (2014)
Kim, K., Mohan, C.K.: Parallel hierarchical adaptive genetic algorithm for fragment assembly. In: The 2003 Congress on IEEE Evolutionary Computation CEC’03, vol. 1, pp. 600–607. (2003)
Li, L., Khuri, S.: A comparison of dna fragment assembly algorithms. METMBS 4, 329–335 (2004)
Luque, G., Alba, E.: Metaheuristics for the DNA fragment assembly problem. Int. J. Comput. Intell. Res. 1, 98–108 (2005)
Mallén-Fullerton, G.M., Fernández-Anaya, G.: DNA fragment assembly using optimization. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1570–1577 (2013)
Mallén-Fullerton, G.M., Hughes, J.A., Houghten, S., Fernández-Anaya, G.: Benchmark datasets for the dna fragment assembly problem. Int. J. Bio-Inspir. Comput. 5(6), 384–394 (2013)
Meksangsouy, P., Chaiyaratana, N.: DNA fragment assembly using an ant colony system algorithm. In: The 2003 Congress on IEEE Evolutionary Computation CEC’03, vol. 3, pp. 1756–1763 (2003)
Minetti, G., Alba, E.: Metaheuristic assemblers of dna strands: noiseless and noisy cases. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1–8 (2010)
Parsons, R., Johnson, M.E.: DNA sequence assembly and genetic algorithms-new results and puzzling insights. In: ISMB, pp. 277–284 (1995)
Pevzner, P.: Computational molecular biology: an algorithmic approach. MIT press, Cambridge (2000)
Setubal, J.C., Meidanis, J., Setubal-Meidanis.: Introduction to Computational Molecular Biology. PWS, Boston (1997)
Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Mol. Biol. 147(1), 195–197 (1981)
U.S. National Library of Medicine: Cells and DNA. what is DNA? http://ghr.nlm.nih.gov/handbook/basics/dna. Accessed 06 Jan 2015
Verma, R.S., Singh, V., Kumar, S.: DNA sequence assembly using particle swarm optimization. Int. J. Comput. Appl. 28 (2011)
Wieds, G.: Bioinformatics explained: Blast versus smith-waterman. CLCBio. http://www.clcbio.com/index.php. (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Majid al-Rifaie, F., Majid al-Rifaie, M. (2016). Maximising Overlap Score in DNA Sequence Assembly Problem by Stochastic Diffusion Search. In: Bi, Y., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. Studies in Computational Intelligence, vol 650. Springer, Cham. https://doi.org/10.1007/978-3-319-33386-1_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-33386-1_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-33384-7
Online ISBN: 978-3-319-33386-1
eBook Packages: EngineeringEngineering (R0)