Skip to main content

Building Accurate Strategies in Non Markovian Environments without Memory

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6471))

Abstract

This paper focuses on the study of the behavior of a genetic algorithm based classifier system, the Adapted Pittsburgh Classifier System (A.P.C.S), on maze type environments containing aliasing squares. This type of environment is often used in reinforcement learning literature to assess the performances of learning methods when facing problems containing non markovian situations.

Through this study, we discuss on the performance of the APCS upon two mazes (Woods 101 and Maze E2) and also on the efficiency of an improvement of the APCS learning method inspired from the XCS: the covering mechanism. We manage to show that, without any memory mechanism, the APCS is able to build and to keep accurate strategies to produce regular sub-optimal solution to these maze problems. This statement is shown through a comparison between the results obtained by the XCS on two specific maze problems and those obtained by the APCS.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bacardit, J., Garrell-Guiu, J.M.: Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 4399, pp. 59–79. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  2. Bagnall, A.J., Zatuchna, Z.: On the classification of maze problems. In: Bull, L., Kovacs, T. (eds.) Applications of Learning Classifier Systems. Studies in Fuzziness and Soft Computing, vol. 183, pp. 307–316. Springer, Heidelberg (2005)

    Google Scholar 

  3. Bernadó-Mansilla, E., Llorà, X., Garrell-Guiu, J.M.: Xcs and gale: A comparative study of two learning classifier systems on data mining. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2001. LNCS (LNAI), vol. 2321, pp. 115–132. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  4. Bull, L.: Lookahead and latent learning in ZCS. In: GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, New York, July 9-13, pp. 897–904. Morgan Kaufmann Publishers, San Francisco (2002)

    Google Scholar 

  5. Butz, M., Wilson, S.W.: An algorithmic description of XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 253–272. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  6. Butz, M.V.: Documentation of XCS+TS c-code 1.2. IlliGAL Report 2003023, Illinois Genetic Algorithms Laboratory (October 2003)

    Google Scholar 

  7. De Jong, K.A., Spears, W.M., Gordon, D.F.: Using Genetic Algorithms for Concept Learning. Machine Learning 13(3), 161–188

    Google Scholar 

  8. Énée, G.: Systèmes de Classeurs et Communication dans les Systèmes Multi-Agents. PhD thesis, Ecole Doctorale de STIC, Université de Nice Sophia-Antipolis, (Janvier 2003)

    Google Scholar 

  9. Énée, G., Barbaroux, P.: Adapted pittsburgh-style classifier-system: Case-study. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 2661, pp. 30–45. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  10. Holmes, J.H., Lanzi, P.L., Stolzmann, W., Wilson, S.W.: Learning classifier systems: New models, successful applications. Inf. Process. Lett. 82(1), 23–30 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  11. Lanzi, P.L.: Adding Memory to XCS. In: Proceedings of the IEEE Conference on Evolutionary Computation (ICEC 1998), IEEE Press, Los Alamitos (1998), http://ftp.elet.polimi.it/people/lanzi/icec98.ps.gz

    Google Scholar 

  12. Lanzi, P.L.: An analysis of the memory mechanism of XCSM. In: Proceedings of the Third Genetic Programming Conference, pp. 643–651. Morgan Kaufmann, San Francisco (1998), http://ftp.elet.polimi.it/people/lanzi/gp98.ps.gz

    Google Scholar 

  13. Lanzi, P.L., Wilson, S.W.: Optimal classifier system performance in non-markovian environments. Technical Report 99.36, Illinois Genetic Algorithms Laboratory, Milan, Italy (1999)

    Google Scholar 

  14. Sigaud, O.: Les systèmes de classeurs: un état de lárt. Revue d’intelligence Artificielle RSTI série RIA,Lavoisier, vol. 21 (February 2007)

    Google Scholar 

  15. Sigaud, O., Wilson, S.W.: Learning classifier systems: a survey. Soft Comput. 11(11), 1065–1078 (2007)

    Article  MATH  Google Scholar 

  16. Smith, S.F.: A Learning System based on Genetic Adaptive Algorithms. PhD thesis, University of Pittsburgh (1980)

    Google Scholar 

  17. Wilson, S.W.: Classifier fitness based on accuracy. Evolutionary Computation 3(2), 148–175 (1995)

    Article  Google Scholar 

  18. Zatuchna, Z.V.: AgentP: A Learning Classifier System with Associative Perception in Maze Environments. PhD thesis, School of Computing Sciences, UEA (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gilles, É., Mathias, P. (2010). Building Accurate Strategies in Non Markovian Environments without Memory. In: Bacardit, J., Browne, W., Drugowitsch, J., Bernadó-Mansilla, E., Butz, M.V. (eds) Learning Classifier Systems. IWLCS IWLCS 2009 2008. Lecture Notes in Computer Science(), vol 6471. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17508-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17508-4_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17507-7

  • Online ISBN: 978-3-642-17508-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics