Skip to main content

Maximising Overlap Score in DNA Sequence Assembly Problem by Stochastic Diffusion Search

  • Chapter
  • First Online:
  • 829 Accesses

Part of the book series: Studies in Computational Intelligence ((SCI,volume 650))

Abstract

This paper introduces a novel study on the performance of Stochastic Diffusion Search (SDS)—a swarm intelligence algorithm—to address DNA sequence assembly problem. This is an NP-hard problem and one of the primary problems in computational molecular biology that requires optimisation methodologies to reconstruct the original DNA sequence. In this work, SDS algorithm is adapted for this purpose and several experiments are run in order to evaluate the performance of the presented technique over several frequently used benchmarks. Given the promising results of the newly proposed algorithm and its success in assembling the input fragments, its behaviour is further analysed, thus shedding light on the process through which the algorithm conducts the task. Additionally, the algorithm is applied to overlap score matrices which are generated from the raw input fragments; the algorithm optimises the overlap score matrices to find better results. In these experiments real-world data are used and the performance of SDS is compared with several other algorithms which are used by other researchers in the field, thus demonstrating its weaknesses and strengths in the experiments presented in the paper.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    In American English, 1 billion is equated to a thousand million (i.e. 1, 000, 000, 000).

  2. 2.

    The results of these algorithms, other than SDS, are borrowed from [2].

  3. 3.

    These issues are explained in Sect. 3.

  4. 4.

    The results of these algorithms, other than SDS, are borrowed from [14].

References

  1. al-Rifaie, M.M., Bishop, M.: Stochastic diffusion search review. In: Paladyn, Journal of Behavioral Robotics, vol. 4(3), pp. 155–173. Springer, Heidelberg (2013)

    Google Scholar 

  2. Alba, E., Luque, G.: A new local search algorithm for the dna fragment assembly problem. In: Evolutionary Computation in Combinatorial Optimization, pp. 1–12. Springer, Heidelberg (2007)

    Google Scholar 

  3. Alba, E., Luque, G.: A hybrid genetic algorithm for the dna fragment assembly problem. In: Recent Advances in Evolutionary Computation for Combinatorial Optimization, pp. 101–112. Springer, Heidelberg (2008)

    Google Scholar 

  4. Bishop, J.: Stochastic searching networks. In: Proceedings of the 1st IEEE Conference on Artificial Neural Networks. pp. 329–331. London, UK (1989)

    Google Scholar 

  5. Blum, C., Li, X.: Swarm Intelligence in Optimization. Springer, Berlin (2008)

    Google Scholar 

  6. Cotta, C., Fernández, A., Gallardo, J., Luque, G., Alba, E.: Metaheuristics in bioinformatics: DNA sequencing and reconstruction. In: Optimization Techniques for Solving Complex Problems, pp. 265–286 (2009)

    Google Scholar 

  7. Fang, S.C., Wang, Y., Zhong, J.: A genetic algorithm approach to solving dna fragment assembly problem. J. Comput. Theor. Nanosci. 2(4), 499–505 (2005)

    Article  Google Scholar 

  8. Firoz, J.S., Rahman, M.S., Saha, T.K.: Bee algorithms for solving dna fragment assembly problem with noisy and noiseless data. In: Proceedings of the Fourteenth International Conference on Genetic and Evolutionary Computation Conference. pp. 201–208. ACM, New York (2012)

    Google Scholar 

  9. Huang, K.W., Chen, J.L., Yang, C.S., Tsai, C.W.: A memetic particle swarm optimization algorithm for solving the dna fragment assembly problem. Neural Comput. Appl. pp. 1–12 (2014)

    Google Scholar 

  10. Kim, K., Mohan, C.K.: Parallel hierarchical adaptive genetic algorithm for fragment assembly. In: The 2003 Congress on IEEE Evolutionary Computation CEC’03, vol. 1, pp. 600–607. (2003)

    Google Scholar 

  11. Li, L., Khuri, S.: A comparison of dna fragment assembly algorithms. METMBS 4, 329–335 (2004)

    Google Scholar 

  12. Luque, G., Alba, E.: Metaheuristics for the DNA fragment assembly problem. Int. J. Comput. Intell. Res. 1, 98–108 (2005)

    Google Scholar 

  13. Mallén-Fullerton, G.M., Fernández-Anaya, G.: DNA fragment assembly using optimization. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1570–1577 (2013)

    Google Scholar 

  14. Mallén-Fullerton, G.M., Hughes, J.A., Houghten, S., Fernández-Anaya, G.: Benchmark datasets for the dna fragment assembly problem. Int. J. Bio-Inspir. Comput. 5(6), 384–394 (2013)

    Article  Google Scholar 

  15. Meksangsouy, P., Chaiyaratana, N.: DNA fragment assembly using an ant colony system algorithm. In: The 2003 Congress on IEEE Evolutionary Computation CEC’03, vol. 3, pp. 1756–1763 (2003)

    Google Scholar 

  16. Minetti, G., Alba, E.: Metaheuristic assemblers of dna strands: noiseless and noisy cases. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1–8 (2010)

    Google Scholar 

  17. Parsons, R., Johnson, M.E.: DNA sequence assembly and genetic algorithms-new results and puzzling insights. In: ISMB, pp. 277–284 (1995)

    Google Scholar 

  18. Pevzner, P.: Computational molecular biology: an algorithmic approach. MIT press, Cambridge (2000)

    Google Scholar 

  19. Setubal, J.C., Meidanis, J., Setubal-Meidanis.: Introduction to Computational Molecular Biology. PWS, Boston (1997)

    Google Scholar 

  20. Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Mol. Biol. 147(1), 195–197 (1981)

    Article  Google Scholar 

  21. U.S. National Library of Medicine: Cells and DNA. what is DNA? http://ghr.nlm.nih.gov/handbook/basics/dna. Accessed 06 Jan 2015

  22. Verma, R.S., Singh, V., Kumar, S.: DNA sequence assembly using particle swarm optimization. Int. J. Comput. Appl. 28 (2011)

    Google Scholar 

  23. Wieds, G.: Bioinformatics explained: Blast versus smith-waterman. CLCBio. http://www.clcbio.com/index.php. (2007)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fatimah Majid al-Rifaie .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Majid al-Rifaie, F., Majid al-Rifaie, M. (2016). Maximising Overlap Score in DNA Sequence Assembly Problem by Stochastic Diffusion Search. In: Bi, Y., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. Studies in Computational Intelligence, vol 650. Springer, Cham. https://doi.org/10.1007/978-3-319-33386-1_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-33386-1_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-33384-7

  • Online ISBN: 978-3-319-33386-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics