Abstract
The XCS classifier system evolves solutions that represent complete mappings from state-action pairs to expected returns therefore, in every possible situation, XCS can predict the value of all the available actions. Such complete mapping is sometimes considered redundant as most of the applications (like for instance, classification), usually focus only on the best action. In this paper, we introduce an extension of XCS with an adaptive (state-action) mapping mechanism (or XCSAM) that evolves solutions focused actions with the largest returns. While UCS evolves solutions focused on the best available action but can only solve supervised classification problems, our system can solve both supervised and multi-step problems and, in addition, it can adapt the size of the mapping to the problems: Initially, XCSAM starts building a complete mapping and then it slowly tries to focus on the best actions available. If the problem admits only one optimal action in each niche, XCSAM tends to focus on such an action as the evolution proceeds. If more actions with the same return are available, XCSAM tends to evolve a mapping that includes all of them. We applied XCSAM both to supervised problems (the Boolean multiplexer) and to multi-step maze-like problems. Our experimental results show that XCSAM can reach optimal performance but requires smaller populations than XCS as it evolves solutions focused on the best actions available for each subproblem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bernadó-Mansilla, E., Garrell, J.M.: Accuracy-based learning classifier systems: Models, analysis and applications to classification tasks. Evolutionary Computation 11, 209–238 (2003)
Butz, M.V., Goldberg, D.E., Lanzi, P.L.: Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems. Evolutionary Computation 9(5), 452–473 (2005)
Butz, M.V., Kovacs, T., Lanzi, P.L., Wilson, S.W.: Toward a Theory of Generalization and Learning in XCS. IEEE Transactions on Evolutionary Computation 8(1), 28–46 (2004)
Butz, M.V., Sastry, K., Goldberg, D.E.: Tournament Selection: Stable Fitness Pressure in XCS. In: Cantú-Paz, E., Foster, J.A., Deb, K., Davis, L., Roy, R., O’Reilly, U.-M., Beyer, H.-G., Kendall, G., Wilson, S.W., Harman, M., Wegener, J., Dasgupta, D., Potter, M.A., Schultz, A., Dowsland, K.A., Jonoska, N., Miller, J., Standish, R.K. (eds.) GECCO 2003. LNCS, vol. 2724, pp. 1857–1869. Springer, Heidelberg (2003)
Butz, M.V., Wilson, S.W.: An algorithmic description of xcs. Journal of Soft Computing 6(3-4), 144–153 (2002)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison Wesley (1989)
Holland, J.H.: Escaping Brittleness: The Possibilities of General Purpose Learning Algorithms Applied to Parallel Rule-based system. Machine Learning 2, 593–623 (1986)
Kovacs, T.: Evolving optimal populations with XCS classifier systems. Technical Report CSR-96-17 and CSRP-96-17, School of Computer Science, University of Birmingham, Birmingham, U.K. (1996), Available from the technical report archive, ftp://ftp.cs.bham.ac.uk/pub/tech-reports/1996/CSRP-96-17.ps.gz
Lanzi, P.L.: An Analysis of Generalization in the XCS Classifier System. Evolutionary Computation Journal 7(2), 125–149 (1999)
Lanzi, P.L.: Learning classifier systems from a reinforcement learning perspective. Soft Computing - A Fusion of Foundations, Methodologies and Applications 6(3), 162–170 (2002)
Sutton, R.S., Barto, A.G.: Reinforcement Learning – An Introduction. MIT Press (1998)
Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary Computation 3(2), 149–175 (1995)
Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary Computation 3(2), 149–175 (1995), http://prediction-dynamics.com/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nakata, M., Lanzi, P.L., Takadama, K. (2012). XCS with Adaptive Action Mapping. In: Bui, L.T., Ong, Y.S., Hoai, N.X., Ishibuchi, H., Suganthan, P.N. (eds) Simulated Evolution and Learning. SEAL 2012. Lecture Notes in Computer Science, vol 7673. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34859-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-34859-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34858-7
Online ISBN: 978-3-642-34859-4
eBook Packages: Computer ScienceComputer Science (R0)