Skip to main content

Advertisement

Log in

A Novel Hypergraph-Based Genetic Algorithm (HGGA) Built on Unimodular and Anti-homomorphism Properties for DNA Sequencing by Hybridization

  • Original Research Article
  • Published:
Interdisciplinary Sciences: Computational Life Sciences Aims and scope Submit manuscript

Abstract

The sequencing by hybridization (SBH) of determining the order in which nucleotides should occur on a DNA string is still under discussion for enhancements on computational intelligence although the next generation of DNA sequencing has come into existence. In the last decade, many works related to graph theory-based DNA sequencing have been carried out in the literature. This paper proposes a method for SBH by integrating hypergraph with genetic algorithm (HGGA) for designing a novel analytic technique to obtain DNA sequence from its spectrum. The paper represents elements of the spectrum and its relation as hypergraph and applies the unimodular property to ensure the compatibility of relations between l-mers. The hypergraph representation and unimodular property are bound with the genetic algorithm that has been customized with a novel selection and crossover operator reducing the computational complexity with accelerated convergence. Subsequently, upon determining the primary strand, an anti-homomorphism is invoked to find the reverse complement of the sequence. The proposed algorithm is implemented in the GenBank BioServer datasets, and the results are found to prove the efficiency of the algorithm. The HGGA is a non-classical algorithm with significant advantages and computationally attractive complexity reductions ranging to \(O(n^{2} )\) with improved accuracy that makes it prominent for applications other than DNA sequencing like image processing, task scheduling and big data processing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Bains W, Smith GC (1988) A novel method for nucleic acid sequence determination. J Theor Biol 135(3):303–307

    Article  CAS  PubMed  Google Scholar 

  2. Drmanac R, Labat I, Brukner I, Crkvenjakov R (1989) Sequencing of megabase plus DNA by hybridization: theory of the method. Genomics 4(2):114–128

    Article  CAS  PubMed  Google Scholar 

  3. Khrapko KR, Lysov YP, Khorlyn AA, Shick VV, Florentiev VL, Mirzabekov AD (1989) An oligonucleotide hybridization approach to DNA sequencing. FEBS Lett 256(1–2):118–122

    Article  CAS  PubMed  Google Scholar 

  4. Błażewicz J, Kasprzak M, Kuroczycki W (2002) Hybrid genetic algorithm for DNA sequencing with errors*. J Heuristics 8:495–502

    Article  Google Scholar 

  5. Blazewicz J, Glover F, Kasprzak M (2005) Evolutionary approaches to DNA sequencing with errors. Ann Oper Res 138:67–78

    Article  Google Scholar 

  6. Błażewicz J et al (2000) Tabu search for DNA sequencing with false negatives and false positives. Eur J Oper Res 125(2):257–265

    Article  Google Scholar 

  7. Błażewicz J, Glover F, Kasprzak M (2004) DNA sequencing—Tabu and Scatter search combined. INFORMS J Comput 16(3):232–240

    Article  Google Scholar 

  8. Caserta M, Vo S (2014) A hybrid algorithm for the DNA sequencing problem. Discret Appl Math 163:87–99

    Article  Google Scholar 

  9. Radom M, Formanowicz P (2017) An Algorithm for Sequencing by Hybridization Based on an Alternating DNA Chip. Interdiscip Sci: Comput Life Sci 1–11

  10. Blazewicz J, Hertz A, Kobler D, De Werra D (1999) On some properties of DNA graphs. Discret Appl Math 98:1–19

    Article  Google Scholar 

  11. Sanger F, Nicklen S, Coulson AR (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA 74(12):5463–5467

    Article  CAS  PubMed  Google Scholar 

  12. Sanger F (1981) Determination of nucleotide sequences in DNA. Science 214(4526):1205–1210

    Article  CAS  PubMed  Google Scholar 

  13. Sutton G, White O, Adams M (1995) TIGR Assembler: A new tool for assembling large shotgun sequencing projects. Genome Sci Technol 1(1):9–19

    Article  CAS  Google Scholar 

  14. Ahmadian A, Ehn M, Hober S (2006) Pyrosequencing: history, biochemistry and future. Clin Chim Acta

  15. Lysov IP, Florent’ev VL, Khorlin AA, Khrapko KR, Shik VV (1988) Determination of the nucleotide sequence of DNA using hybridization with oligonucleotides A new method. Dokl Akad Nauk SSSR 303(6):1508–1511

    CAS  PubMed  Google Scholar 

  16. Bains W (1991) Hybridization methods for DNA sequencing. Genomics 11(2):294–301

    Article  CAS  PubMed  Google Scholar 

  17. Drmanac R, Crkvenjakov R (1993) Method of sequencing of genomes by hybridization of oligonucleotide probes. US Patent 5,202,231

  18. Zhang J-H, Wu L-Y, Zhang X-S (2003) Reconstruction of DNA sequencing by hybridization. Bioinformatics 19(1):14–21

    Article  PubMed  Google Scholar 

  19. Cattaneo G, Chiaselotti G, Ciucci D, Gentile T (2016) On the connection of hypergraph theory with formal concept analysis and rough set theory. Inf Sci (NY) 330:342–357

    Article  Google Scholar 

  20. Hartemink AJ, Gifford DK, Khodor J (1999) Automated constraint-based nucleotide sequence selection for DNA computation. Biosystems 52(1):227–235

    Article  CAS  PubMed  Google Scholar 

  21. Penchovsky R, Ackermann J (2003) DNA library design for molecular computation. J Comput Biol 10(2):215–229

    Article  CAS  PubMed  Google Scholar 

  22. Zhang Z et al (2000) A greedy algorithm for aligning DNA sequences. J Comput biol 7(1–2):203–214

    Article  CAS  PubMed  Google Scholar 

  23. Blum C, Vallès MY, Blesa MJ (2008) An ant colony optimization algorithm for DNA sequencing by hybridization. Comp Oper Res 35(11):3620–3635

    Article  Google Scholar 

  24. Kurniawan T, Khalid N, Ibrahim Z (2008) Evaluation of ordering methods for DNA sequence design based on ant colony system. In: 2008 Second Asia

  25. Mizas C et al (2008) Reconstruction of DNA sequences using genetic algorithms and cellular automata: towards mutation prediction? Biosystems 92(1):61–68

    Article  CAS  PubMed  Google Scholar 

  26. Chaves-González JM, Vega-Rodríguez MA, Granado-Criado JM (2013) A multiobjective swarm intelligence approach based on artificial bee colony for reliable DNA sequence design. Eng Appl Artif Intell 26(9):2045–2057

    Article  Google Scholar 

  27. Cui G, Li X (2010) The optimization of DNA encodings based on modified PSO/GA algorithm. In: 2010 International Conference on Computer Design and Applications (ICCDA), vol 1. IEEE

  28. Khalid NK, Ibrahim Z, Kurniawan TB, Khalid M, Engelbrecht AP (2009) Implementation of binary particle swarm optimization for DNA sequence design. In: International Work-Conference on Artificial Neural Networks. Springer, Heidelberg, pp 450–457

  29. Xiao J, Cheng Z (2011) DNA sequences optimization based on gravitational search algorithm for reliable DNA computing. In: Bio-Inspired Computing: Theories and Applications (BIC-TA), 2011 6th International Conference on IEEE, pp 103–107. IEEE

  30. Blazewicz J et al (2013) A hyper-heuristic approach to sequencing by hybridization of DNA sequences. Ann Oper Res 207(1):27–41

    Article  Google Scholar 

  31. Cuticchia AJ, Arnold J, Timberlake WE (1993) ODS: ordering DNA sequences—a physical mapping algorithm based on simulated annealing. Bioinformatics 9(2):215–219

    Article  CAS  Google Scholar 

  32. Błażewicz J, Formanowicz P, Kasprzak M, Schuurman P, Woeginger GJ (2002) DNA sequencing, eulerian graphs, and the exact perfect matching problem. Springer, Berlin, pp 13–24

    Google Scholar 

  33. Blazewicz J, Kasprzak M (2006) Computational complexity of isothermic DNA sequencing by hybridization. Disc Appl Math 154(5):718–729

    Article  Google Scholar 

  34. Jafarzadeh N, Iranmanesh A (2013) A new graph theoretical approach to DNA sequencing with nanopores. Match-Commun Math Comp Chem 70(1):401–415

    CAS  Google Scholar 

  35. Xie X, Zaitsev Y, Velasquez-Garcia L, Teller S, Livermore C (2014). Compact, scalable, high-resolution, MEMS-enabled tactile displays. In: Proceedings of solid-state sensors, actuators, and microsystems workshop, pp 127–130

  36. Xie X, Livermore C (2017) Passively self-aligned assembly of compact barrel hinges for high-performance, out-of-plane mems actuators. In: Proceedings of IEEE international conference micro electro mechanical system, pp 813–816

  37. Xie X, Livermore C (2016) A pivot-hinged, multilayer SU-8 micro motion amplifier assembled by a self-aligned approach. In: Proceedings of IEEE international conference on micro electro mechanical system, vol. 2016, pp 75–78

  38. Xie X, Zaitsev Y, Velásquez-García LF, Teller SJ, Livermore C (2014) Scalable, MEMS-enabled, vibrational tactile actuators for high resolution tactile displays. J. Micromech Microeng 24(12):125014

    Article  CAS  Google Scholar 

  39. Sheng B et al (2017) AutoPath : harnessing parallel execution paths for efficient resource allocation in multi-stage big data frameworks autopath : harnessing parallel execution paths for efficient resource allocation in multi-stage big data frameworks

  40. Pevzner PA, Lipshutz RJ (1994) Towards DNA sequencing chips. In: International symposium on mathematical foundations of computer science. Springer, Berlin

  41. Xiong S, Ji D (2016) Query-focused multi-document summarization using hypergraph-based ranking. Inf Process Manag 52(4):670–681

    Article  Google Scholar 

  42. Kannan K, Kanna BR, Aravindan C (2010) Root mean square filter for noisy images based on hyper graph model. Image Vis Comput 28(9):1329–1338

    Article  Google Scholar 

  43. Xiao G, Wang H, Lai T, Suter D (2016) Hypergraph modelling for geometric model fitting. Pattern Recognit 60:748–760

    Article  Google Scholar 

  44. Huang S, Elgammal A, Yang D (2017) On the effect of hyperedge weights on hypergraph learning. Image Vision Comput 57:89–101

    Article  Google Scholar 

  45. Berge C, Minieka E (1973) Graphs and hypergraphs, 2nd edn. Amsterdam, North-Holland, p 528

    Google Scholar 

  46. Pandey HM, Shukla A, Chaudhary A, Mehrotra D (2016) Evaluation of Genetic Algorithm’s Selection Methods. In: Information Systems Design and Intelligent Applications. Springer, New Delhi, pp 731–738

    Chapter  Google Scholar 

  47. Parsons R, Forrest S, Burks C (1993) Genetic algorithms for DNA sequence assembly. Proceed Int Conf Intell Syst Mol Biol 1:310–318

    CAS  Google Scholar 

  48. Yang Z, Wang J, Evans D, Mi N (2017) AutoReplica: automatic data replica manager in distributed caching and data processing systems. In: 2016 IEEE 35th international conference on computer communications IPCCC 2016, December

  49. Yang Z, Tai J, Bhimani J, Wang J, Mi N, Sheng B (2017) GReM: dynamic SSD resource allocation in virtualized storage systems with heterogeneous IO workloads. In: 2016 IEEE 35th international conference on computer communications IPCCC 2016

  50. Bhimani J, Mi N, Leeser M (2017) FiM : performance prediction for parallel computation in iterative data processing applications FiM : performance prediction for parallel computation in iterative data processing applications

  51. Yang Z, Awasthi M, Ghosh M, Mi N (2017) A fresh perspective on total cost of ownership models for flash storage in datacenters. In: Proc. Int. Conf. Cloud Comput. Technol. Sci. CloudCom, no. December 2016, pp 245–252

  52. Caserta M, Voß S (2014) A hybrid algorithm for the DNA sequencing problem. Discret Appl Math 163(1):87–99

    Article  Google Scholar 

  53. Illumina (2017) An introduction to next-generation sequencing technology. http://www.illumina.com/technology/next-generation-sequencing.html. Accessed 18 Jun 2017

Download references

Acknowledgements

The authors thank the Department of Science and Technology—Fund for Improvement of S&T Infrastructure in Universities and Higher Educational Institutions Government of India (SR/FST/MSI-107/2015) for their financial support. Authors express their gratefulness to SASTRA University, Thanjavur, for providing the infrastructural facilities and academic support to carry out this research work. We would like to express our gratitude toward the unknown potential reviewers who have agreed to review this paper and provided valuable suggestions to improve the quality of the paper.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Gangothri Rajaram or K. Kannan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Swaminathan, V., Rajaram, G., Abhishek, V. et al. A Novel Hypergraph-Based Genetic Algorithm (HGGA) Built on Unimodular and Anti-homomorphism Properties for DNA Sequencing by Hybridization. Interdiscip Sci Comput Life Sci 11, 397–411 (2019). https://doi.org/10.1007/s12539-017-0267-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12539-017-0267-y

Keywords

Navigation