XCS with Adaptive Action Mapping

Nakata, Masaya; Lanzi, Pier Luca; Takadama, Keiki

doi:10.1007/978-3-642-34859-4_14

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7673)

Included in the following conference series:

Asia-Pacific Conference on Simulated Evolution and Learning

Abstract

The XCS classifier system evolves solutions that represent complete mappings from state-action pairs to expected returns; therefore, in every possible situation, XCS can predict the value of all the available actions. Such a complete mapping is sometimes considered redundant, as most applications (for instance, classification) usually focus only on the best action. In this paper, we introduce XCSAM, an extension of XCS with an adaptive (state-action) mapping mechanism that evolves solutions focused on the actions with the largest returns. While UCS evolves solutions focused on the best available action but can only solve supervised classification problems, our system can solve both supervised and multi-step problems and, in addition, can adapt the size of the mapping to the problem: initially, XCSAM starts building a complete mapping, and then it slowly tries to focus on the best actions available. If the problem admits only one optimal action in each niche, XCSAM tends to focus on that action as the evolution proceeds. If several actions with the same return are available, XCSAM tends to evolve a mapping that includes all of them. We applied XCSAM both to supervised problems (the Boolean multiplexer) and to multi-step maze-like problems. Our experimental results show that XCSAM can reach optimal performance while requiring smaller populations than XCS, as it evolves solutions focused on the best actions available for each subproblem.
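
The difference between a complete mapping and a mapping focused on the best actions can be illustrated on the 6-bit Boolean multiplexer with a minimal tabular sketch. It is not the XCSAM algorithm, which evolves generalized classifiers rather than a lookup table. The function names and the 1000/0 reward scheme are assumptions made for illustration and are not taken from the paper.

```python
from itertools import product

def multiplexer(bits, k=2):
    """Boolean multiplexer: the first k address bits select one of 2**k data bits."""
    address = int("".join(str(b) for b in bits[:k]), 2)
    return bits[k + address]

def reward(bits, action, k=2):
    # Assumed reward scheme for this sketch: 1000 if the action is correct, else 0.
    return 1000 if action == multiplexer(bits, k) else 0

def complete_map(k=2):
    """Expected return for every (state, action) pair."""
    n = k + 2 ** k
    return {(s, a): reward(s, a, k)
            for s in product((0, 1), repeat=n)
            for a in (0, 1)}

def best_action_map(full):
    """Keep only the actions with the largest return in each state.

    Actions tied for the largest return are all kept.
    """
    best = {}
    for (s, _), r in full.items():
        best[s] = max(best.get(s, r), r)
    return {(s, a): r for (s, a), r in full.items() if r == best[s]}

full = complete_map()
focused = best_action_map(full)
print(len(full), len(focused))  # 128 64
```

With two actions and exactly one correct action per state, the focused mapping has half as many entries as the complete one.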

Author information

Authors and Affiliations

Department of Informatics, The University of Electro-Communications, Tokyo, Japan
Masaya Nakata & Keiki Takadama
Dipartimento di Elettronica e Informazione, Politecnico di Milano, Milano, Italy
Pier Luca Lanzi

Editor information

Editors and Affiliations

Faculty of Information Technology, Le Quy Don Technical University, 100 Hoang Quoc Viet Street, Cau Giay District, Hanoi, Vietnam
Lam Thu Bui
School of Computer Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798, Singapore, Singapore
Yew Soon Ong
Faculty of Information Technology, Hanoi University, km9 Nguyen Trai Road, Hanoi, Vietnam
Nguyen Xuan Hoai
Graduate School of Engineering, Department of Computer Science, Osaka Prefecture University, 1-1 Gakuen-cho, Nakaku, 599-8531, Sakai, Osaka, Japan
Hisao Ishibuchi
School of Electrical and Electronic Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798, Singapore, Singapore
Ponnuthurai Nagaratnam Suganthan

Cite this paper

Nakata, M., Lanzi, P.L., Takadama, K. (2012). XCS with Adaptive Action Mapping. In: Bui, L.T., Ong, Y.S., Hoai, N.X., Ishibuchi, H., Suganthan, P.N. (eds) Simulated Evolution and Learning. SEAL 2012. Lecture Notes in Computer Science, vol 7673. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34859-4_14

DOI: https://doi.org/10.1007/978-3-642-34859-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34858-7
Online ISBN: 978-3-642-34859-4
eBook Packages: Computer Science, Computer Science (R0)
