Enhancing Learning Capabilities by XCS with Best Action Mapping

  • Masaya Nakata
  • Pier Luca Lanzi
  • Keiki Takadama
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7491)

Abstract

This paper proposes a novel approach of XCS called XCS with Best Action Mapping (XCSB) to enhance the learning capabilities of XCS. The feature of XCSB is to learn only best actions having the highest predicted payoff with the high accuracy unlike XCS which learns actions having the highest and lowest predicted payoff with the high accuracy. To investigate the effectiveness of XCSB, we applied XCSB to two benchmark problems: multiplexer problem as a single step problem and maze problem as a multi step problem. The experimental results show that (1) XCSB can solve quickly the problem which has a large state space and (2) XCSB can achieve a high performance with a small max population size.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bernadó-mansilla, E., Garrell-Guij, J.M.: Accuracy-Based Learning Classifier Systems: Models, Analysis and Applications to Classification Tasks. Evolutionary Computation 11, 209–238 (2003)CrossRefGoogle Scholar
  2. 2.
    Butz, M.V., Goldberg, D.E., Lanzi, P.L.: Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems. Evolutionary Computation 9(5), 452–473 (2005)CrossRefGoogle Scholar
  3. 3.
    Butz, M.V., Kovacs, T., Lanzi, P.L., Wilson, S.W.: Toward a Theory of Generalization and Learning in XCS. IEEE Transactions on Evolutionary Computation 8(1), 28–46 (2004)CrossRefGoogle Scholar
  4. 4.
    Butz, M.V., Sastry, K., Goldberg, D.E.: Tournament Selection in XCS. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2003), pp. 1857–1869 (2003)Google Scholar
  5. 5.
    Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison Wesley (1989)Google Scholar
  6. 6.
    Holland, J.H.: Escaping Brittleness: The Possibilities of General Purpose Learning Algorithms Applied to Parallel Rule-based system. Machine Learning 2, 593–623 (1986)Google Scholar
  7. 7.
    Kovacs, T.: Strength or Accuracy? Fitness Calculation in Learning Classifier Systems. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 1999. LNCS (LNAI), vol. 1813, pp. 143–160. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  8. 8.
    Sutton, R.S.: Learning to Predict by the Methods of Temporal Differences. Machine Learning 3(1), 9–44 (1988)Google Scholar
  9. 9.
    Wilson, S.W.: ZCS: A Zeroth Level Classifier System. Evolutionary Computation 2(1), 1–18 (1994)CrossRefGoogle Scholar
  10. 10.
    Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary Computation 3(2), 149–175 (1995)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Masaya Nakata
    • 1
  • Pier Luca Lanzi
    • 2
  • Keiki Takadama
    • 1
  1. 1.Department of InformaticsThe university of Electo-CommunicationsTokyoJapan
  2. 2.Dipartimento di Elettronica e InformazionePolitecnico di MilanoMilanoItaly

Personalised recommendations