Abstract
Many Metaheuristics solve optimization problems in the continuous domain, so it is necessary to apply binarization schemes to solve binary problems, this selection that is not trivial since it impacts the heart of the search strategy: its ability to explore. This paper proposes a Hyperheuristic Binarization Framework based on a Machine Learning technique of Reinforcement Learning to select the appropriate binarization strategy, which is applied in a Low Level Metaheuristic. The proposed implementation is composed of a High Level Metaheuristic, Ant Colony Optimization, using Q-Learning replacing the pheromone trace component. In the Low Level Metaheuristic, we use a Grey Wolf Optimizer to solve the binary problem with binarization scheme fixed by ants. This framework allowing a better balance between exploration and exploitation, and can be applied selecting others low level components.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Beasley, J.E.: Or-library: distributing test problems by electronic mail. J. Oper. Res. Soc. 41(11), 1069–1072 (1990). http://www.jstor.org/stable/2582903
Bishop, C.M.: Pattern Recoginiton and Machine Learning (2006)
Blum, C., Roli, A.: Metaheuristics in combinatorial optimization: overview and conceptual comparison. ACM Comput. Surv. 35(3), 268–308 (2003). https://doi.org/10.1145/937503.937505
Book, R.V.: Book review: computers and intractability: a guide to the theory of NP-completeness. Bull. Am. Math. Soc. 3(2), 898–905 (1980). https://doi.org/10.1090/s0273-0979-1980-14848-x
Burke, E., Kendall, G., Newall, J., Hart, E., Ross, P., Schulenburg, S.: Hyper-heuristics: an emerging direction in modern search technology. In: Glover, F., Kochenberger, G.A. (eds) Handbook of Metaheuristics. Springer, Boston(2006). https://doi.org/10.1007/0-306-48056-5_16
Celebi, M.E., Aydin, K. (eds.): Unsupervised Learning Algorithms. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-24211-8
Choong, S.S., Wong, L.P., Lim, C.P.: Automatic design of hyper-heuristic based on reinforcement learning. Inf. Sci. (NY). (2018). https://doi.org/10.1016/j.ins.2018.01.005
Cowling, P., Kendall, G., Soubeiga, E.: A hyperheuristic approach to scheduling a sales summit. In: Burke, E., Erben, W. (eds.) PATAT 2000. LNCS, vol. 2079, pp. 176–190. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-44629-X_11
Crawford, B., Soto, R., Astorga, G., García, J.: Constructive metaheuristics for the set covering problem. In: Korošec, P., Melab, N., Talbi, E.-G. (eds.) BIOMA 2018. LNCS, vol. 10835, pp. 88–99. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91641-5_8
Crawford, B., Soto, R., Astorga, G., García, J., Castro, C., Paredes, F.: Putting continuous metaheuristics to work in binary search spaces (2017). https://doi.org/10.1155/2017/8404231
Crawford, B., Soto, R., Olivares, R., Riquelme, L., Astorga, G., Johnson, F., Cortés, E., Castro, C., Paredes, F.: A self-adaptive biogeography-based algorithm to solve the set covering problem. RAIRO - Oper. Res. 53(3), 1033–1059 (2019). https://doi.org/10.1051/ro/2019039
Dorigo, M., Birattari, M., Stützle, T.: Ant colony optimization artificial ants as a computational intelligence technique. IEEE Comput. Intell. Mag. (2006). https://doi.org/10.1109/CI-M.2006.248054
Dorigo, M., Gambardella, L.M.: Ant colony system: A cooperative learning approach to the traveling salesman problem. IEEE Trans. Evol. Comput. (1997). https://doi.org/10.1109/4235.585892
Dorigo, M., Maniezzo, V., Colorni, A., Dorigo, M.: Positive Feedback as a Search Strategy. Technical report, 91-016 (1991)
Eusuff, M., Lansey, K., Pasha, F.: Shuffled frog-leaping algorithm: A memetic meta-heuristic for discrete optimization. Eng. Optim. (2006). https://doi.org/10.1080/03052150500384759
Feo, T.A., Resende, M.G.: A probabilistic heuristic for a computationally difficult set covering problem. Oper. Res. Lett. (1989). https://doi.org/10.1016/0167-6377(89)90002-3
Glover, F.: Tabu search-Part II. ORSA J. Comput. (1990). https://doi.org/10.1287/ijoc.2.1.4
Holland, J.H.: Genetic algorithms. Sci. Am. (1992). https://doi.org/10.1038/scientificamerican0792-66
Hussain, K., Mohd Salleh, M.N., Cheng, S., Shi, Y.: Metaheuristic research: a comprehensive survey. Artif. Intell. Rev. 52(4), 2191–2233 (2018). https://doi.org/10.1007/s10462-017-9605-z
Khamassi, I., Hammami, M., Ghédira, K.: Ant-Q hyper-heuristic approach for solving 2-dimensional Cutting Stock Problem. In: IEEE SSCI 2011 - Symposium Series Computing Intelligent - SIS 2011 2011 IEEE Symposium Swarm Intelligent (2011). https://doi.org/10.1109/SIS.2011.5952530
Kotsiantis, S.B.: Supervised machine learning: a review of classification techniques. In: Proceedings of the 2007 Conference on Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in EHealth, HCI, Information Retrieval and Pervasive Technologies, pp. 3–24. IOS Press, NLD (2007). https://doi.org/10.5555/1566770.1566773
Leguizamon, G., Michalewicz, Z.: A new version of ant system for subset problems. In: Proceedings of the 1999 Congress on Evolutionary Computation, CEC 1999 (1999). https://doi.org/10.1109/CEC.1999.782655
Lones, M.: Sean Luke: essentials of metaheuristics. Genet. Program Evolvable Mach. (2011). https://doi.org/10.1007/s10710-011-9139-0
Mafarja, M., Eleyan, D., Abdullah, S., Mirjalili, S.: S-shaped vs. V-shaped transfer functions for ant lion optimization algorithm in feature selection problem. In: ACM International Conference Proceedings Series (2017). https://doi.org/10.1145/3102304.3102325
Mirjalili, S., Mirjalili, S.M., Lewis, A.: Grey Wolf Optimizer. Adv. Eng. Softw. (2014). https://doi.org/10.1016/j.advengsoft.2013.12.007
Mirjalili, S., Song Dong, J., Lewis, A.: Nature-Inspired Optimizers (2020).https://doi.org/10.1007/978-3-030-12127-3
Müller, A.C., Guido, S.: Introduction to Machine Learning with Python: a guide for data scientists (2016). https://doi.org/10.1017/CBO9781107415324.004
Muncie, H.L., Sobal, J., DeForge, B.: Research methodologies (1989). https://doi.org/10.5040/9781350004900.0008
Solnon, C.: Ants can solve constraint satisfaction problems. IEEE Trans. Evol. Comput. (2002). https://doi.org/10.1109/TEVC.2002.802449
Song, H., Triguero, I., Özcan, E.: A review on the self and dual interactions between machine learning and optimisation. Progress Artif. Intell 8(2), 143–165 (2019). https://doi.org/10.1007/s13748-019-00185-z
Stützle, T., Hoos, H.H.: MAX-MIN ant system. Futur. Gener. Comput. Syst. (2000). https://doi.org/10.1016/S0167-739X(00)00043-1
Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction 2018. Technical report (2017). https://doi.org/10.1109/TNN.1998.712192
Talbi, E.G.: Metaheuristics: From Design to Implementation (2009). https://doi.org/10.1002/9780470496916
Talbi, E.G.: Machine learning into metaheuristics: a survey and taxonomy of data-driven metaheuristics, June 2020. https://hal.inria.fr/hal-02745295, working paper or preprint
Watkins, C.J., Dayan, P.: Technical note: Q-learning. Mach. Learn. (1992). https://doi.org/10.1023/A:1022676722315
Acknowledgements
Felipe Cisternas-Caneo is supported by Grant DI Investigación Interdisciplinaria del Pregrado/VRIEA/PUCV/039.324/2020. Broderick Crawford is supported by Grant CONICYT/FONDECYT/REGULAR/1171243. Ricardo Soto is supported by Grant CONICYT/FONDECYT/REGULAR/1190129. José Lemus-Romani is supported by National Agency for Research and Development (ANID)/Scholarship Program/DOCTORADO NACIONAL/2019 - 21191692. José García is supported by the Grant CONICYT/FONDECYT/INICIACION/11180056.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Tapia, D. et al. (2020). A Q-Learning Hyperheuristic Binarization Framework to Balance Exploration and Exploitation. In: Florez, H., Misra, S. (eds) Applied Informatics. ICAI 2020. Communications in Computer and Information Science, vol 1277. Springer, Cham. https://doi.org/10.1007/978-3-030-61702-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-61702-8_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61701-1
Online ISBN: 978-3-030-61702-8
eBook Packages: Computer ScienceComputer Science (R0)