Abstract
The conserved regions between genomic sequences extracted from the different species give important information about the complex regulatory mechanisms of the transcription and translation processes. However, identification of these regulatory regions that are short DNA segments and called motifs is a major challenge in bioinformatics. With the avalanche of the newly sequenced genomic data and our evolving understanding of the characteristics of the regulatory mechanisms, there is still a need for developing fast and accurate motif discovery techniques or considerably refinement on the existing models. In this paper, we presented two different Artificial Bee Colony algorithm based motif discovery techniques and investigated their serial and parallelized implementations. Experimental studies on the three real data sets showed that the proposed methods outperformed other metaheuristics in terms of similarity values of the predicted motifs.
Similar content being viewed by others
Notes
Please send an e-mail to the corresponding author for the results of the all experimental cases.
References
Akay B, Karaboga D (2010) Artificial bee colony algorithm for large-scale problems and engineering design optimization. J Intell Manuf 23(4):1001. https://doi.org/10.1007/s10845-010-0393-4
Akay B, Karaboga D (2015) A survey on the applications of artificial bee colony in signal, image, and video processing. SIViP 9(4):967. https://doi.org/10.1007/s11760-015-0758-4
Alshamlan HM et al (2015) Genetic Bee Colony (GBC) algorithm: a new gene selection method for microarray cancer classification. Comput Biol Chem 56:49. https://doi.org/10.1016/j.compbiolchem.2015.03.001
Badem H, Basturk A, Caliskan A, Yuksel ME (2017) A new efficient training strategy for deep neural networks by hybridization of artificial bee colony and limited memory BFGS optimization algorithms. Neurocomputing 266(Supplement C). https://doi.org/10.1016/j.neucom.2017.05.061. http://www.sciencedirect.com/science/article/pii/S0925231217309487
Bolaji AL et al (2013) Artificial bee colony algorithm, its variants and applications: a survey. J Theor Appl Inf Technol 47(2)
Bulyk ML et al (2004) Computational prediction of transcription-factor binding site locations. Genome Biol 5(1):201
Cao B et al (2015) MOEPGA: a novel method to detect protein complexes in yeast protein–protein interaction networks based on multiobjective evolutionary programming genetic algorithm. Comput Biol Chem 58:173
Celik M et al (2016) CoABCMiner: an algorithm for cooperative rule classification system based on Artificial Bee Colony. Int J Artif Intell Tools 25(01):1. https://doi.org/10.1142/S0218213015500281
Chan TM, Leung KS, Lee KH (2012) Memetic algorithms for de novo motif discovery. IEEE Trans Evolut Comput 16(5):730
Chang BCH et al (2004) Particle swarm optimisation for protein motif discovery. Genet Program Evolvable Mach 5(2):203. https://doi.org/10.1023/B:GENP.0000023688.42515.92
Che D et al (2005) MDGA: motif discovery using a genetic algorithm. In: Proceedings of the 7th annual conference on genetic and evolutionary computation (ACM), pp 447–452
Das MK, Dai HK (2007) A survey of DNA motif finding algorithms. BMC Bioinform 8(Suppl 7):S21
González-Álvarez DL et al (2010) Solving the motif discovery problem by using differential evolution with pareto tournaments. In: IEEE congress on evolutionary computation (CEC), pp 1–8
González-Álvarez DL et al (2011) Finding motifs in DNA sequences applying a multiobjective Artificial Bee Colony (MOABC) algorithm. In: evolutionary computation, machine learning and data mining in bioinformatics. Springer, Berlin, pp 89–100
González-Álvarez DL et al (2012) Comparing multiobjective Artificial Bee Colony adaptations for discovering DNA motifs. In: Evolutionary computation, machine learning and data mining in bioinformatics. Springer, Berlin, pp 110–121
Huo H et al (2010) Optimizing genetic algorithm for motif discovery. Math Comput Modelling 52(11):2011
Jones NC, Pevzner P (2004) An introduction to bioinformatics algorithms. MIT press, London
Karaboga D, Aslan S (2015) A new emigrant creation strategy for parallel Artificial Bee Colony algorithm. In: IEEEE 2015 9th international conference on electrical and electronics engineering (ELECO), pp. 689–694
Karaboga D, Aslan S (2016a) A discrete Artificial Bee Colony algorithm for detecting transcription factor binding sites in DNA sequences. Genet Mol Res 15(02):1. https://doi.org/10.4238/gmr.15028645
Karaboga D, Aslan S (2016b) Best supported emigrant creation for parallel implementation of Artificial Bee Colony algorithm. IU J Electr Electron Eng 16(2):2055
Karaboga D, Basturk B (2007) A powerful and efficient algorithm for numerical function optimization: Artificial Bee Colony (ABC) algorithm. J Global Optim 39(3):459. https://doi.org/10.1007/s10898-007-9149-x
Karaboga D, Basturk B (2008) On the performance of Artificial Bee Colony (ABC) algorithm. Appl Soft Comput 8(1):687. https://doi.org/10.1016/j.asoc.2007.05.007
Kaya M (2009) MOGAMOD: multi-objective genetic algorithm for motif discovery. Expert Syst Appl 36(2, Part 1): 1039. https://doi.org/10.1016/j.eswa.2007.11.008
Lawrence CE, Reilly AA (1990) An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences, Proteins Struct Funct Bioinform 7(1):41. https://doi.org/10.1002/prot.340070105
Lesk A (2013) Introduction to bioinformatics. Oxford University Press, Oxford
Li N, Tompa M (2006) Analysis of computational approaches for motif discovery. Algorithms Mol Biol 1(1):1. https://doi.org/10.1186/1748-7188-1-8
Li B et al (2015) A balance-evolution artificial bee colony algorithm for protein structure optimization based on a three-dimensional AB off-lattice model. Comput Biol Chem. https://doi.org/10.1016/j.compbiolchem.2014.11.004
Liu FFM et al (2004) FMGA: finding motifs by genetic algorithm. In: Proceedings of the fourth IEEE symposium on bioinformatics and bioengineering (BIBE) 2004, pp 459–466. https://doi.org/10.1109/BIBE.2004.1317378
Liu J et al (2013) Heuristic-based tabu search algorithm for folding two-dimensional AB off-lattice model proteins. Comput Biol Chem 47:142. https://doi.org/10.1016/j.compbiolchem.2013.08.011
Luo Jw, Wang T (2010) Motif discovery using an immune genetic algorithm. J Theor Biol 264(2):319
Martinez E et al (2010) Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. Comput Biol Chem 34(4):244. https://doi.org/10.1016/j.compbiolchem.2010.08.003
Mathe C et al (2002) Current methods of gene prediction, their strengths and weaknesses. Nucleic Acids Res 30(19):4103. https://doi.org/10.1093/nar/gkf543
Matys V et al (2003) TRANSFAC: transcriptional regulation, from patterns to profiles. Nucleic Acids Res 31(1):374. https://doi.org/10.1093/nar/gkg108
Ozturk C, Aslan S (2016) A new artificial bee colony algorithm to solve the multiple sequence alignment problem. Int J Data Min Bioinform 14(4):332
Shao L, Chen Y (2009) Bacterial foraging optimization algorithm integrating tabu search for motif discovery. In: IEEE international conference on bioinformatics and biomedicine BIBM’09 , pp 415–418
Tompa M et al (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol 23(1):137. https://doi.org/10.1038/nbt1053
Wang Z et al (2004) A brief review of computational gene prediction methods. Genom Proteom Bioinform 2(4):216
Zvelebil M, Baum J (2007) Understanding bioinformatics. Garland Science, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Karaboga, D., Aslan, S. Discovery of conserved regions in DNA sequences by Artificial Bee Colony (ABC) algorithm based methods. Nat Comput 18, 333–350 (2019). https://doi.org/10.1007/s11047-018-9674-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11047-018-9674-1