Building Accurate Strategies in Non Markovian Environments without Memory

Gilles, Énée; Mathias, Péroumalnaïk

doi:10.1007/978-3-642-17508-4_8

Building Accurate Strategies in Non Markovian Environments without Memory

Énée Gilles²⁴ &
Péroumalnaïk Mathias²⁴

Conference paper

557 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6471))

Abstract

This paper focuses on the study of the behavior of a genetic algorithm based classifier system, the Adapted Pittsburgh Classifier System (A.P.C.S), on maze type environments containing aliasing squares. This type of environment is often used in reinforcement learning literature to assess the performances of learning methods when facing problems containing non markovian situations.

Through this study, we discuss on the performance of the APCS upon two mazes (Woods 101 and Maze E2) and also on the efficiency of an improvement of the APCS learning method inspired from the XCS: the covering mechanism. We manage to show that, without any memory mechanism, the APCS is able to build and to keep accurate strategies to produce regular sub-optimal solution to these maze problems. This statement is shown through a comparison between the results obtained by the XCS on two specific maze problems and those obtained by the APCS.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bacardit, J., Garrell-Guiu, J.M.: Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 4399, pp. 59–79. Springer, Heidelberg (2007)
Chapter Google Scholar
Bagnall, A.J., Zatuchna, Z.: On the classification of maze problems. In: Bull, L., Kovacs, T. (eds.) Applications of Learning Classifier Systems. Studies in Fuzziness and Soft Computing, vol. 183, pp. 307–316. Springer, Heidelberg (2005)
Google Scholar
Bernadó-Mansilla, E., Llorà, X., Garrell-Guiu, J.M.: Xcs and gale: A comparative study of two learning classifier systems on data mining. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2001. LNCS (LNAI), vol. 2321, pp. 115–132. Springer, Heidelberg (2002)
Chapter Google Scholar
Bull, L.: Lookahead and latent learning in ZCS. In: GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, New York, July 9-13, pp. 897–904. Morgan Kaufmann Publishers, San Francisco (2002)
Google Scholar
Butz, M., Wilson, S.W.: An algorithmic description of XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 253–272. Springer, Heidelberg (2001)
Chapter Google Scholar
Butz, M.V.: Documentation of XCS+TS c-code 1.2. IlliGAL Report 2003023, Illinois Genetic Algorithms Laboratory (October 2003)
Google Scholar
De Jong, K.A., Spears, W.M., Gordon, D.F.: Using Genetic Algorithms for Concept Learning. Machine Learning 13(3), 161–188
Google Scholar
Énée, G.: Systèmes de Classeurs et Communication dans les Systèmes Multi-Agents. PhD thesis, Ecole Doctorale de STIC, Université de Nice Sophia-Antipolis, (Janvier 2003)
Google Scholar
Énée, G., Barbaroux, P.: Adapted pittsburgh-style classifier-system: Case-study. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 2661, pp. 30–45. Springer, Heidelberg (2003)
Chapter Google Scholar
Holmes, J.H., Lanzi, P.L., Stolzmann, W., Wilson, S.W.: Learning classifier systems: New models, successful applications. Inf. Process. Lett. 82(1), 23–30 (2002)
Article MathSciNet MATH Google Scholar
Lanzi, P.L.: Adding Memory to XCS. In: Proceedings of the IEEE Conference on Evolutionary Computation (ICEC 1998), IEEE Press, Los Alamitos (1998), http://ftp.elet.polimi.it/people/lanzi/icec98.ps.gz
Google Scholar
Lanzi, P.L.: An analysis of the memory mechanism of XCSM. In: Proceedings of the Third Genetic Programming Conference, pp. 643–651. Morgan Kaufmann, San Francisco (1998), http://ftp.elet.polimi.it/people/lanzi/gp98.ps.gz
Google Scholar
Lanzi, P.L., Wilson, S.W.: Optimal classifier system performance in non-markovian environments. Technical Report 99.36, Illinois Genetic Algorithms Laboratory, Milan, Italy (1999)
Google Scholar
Sigaud, O.: Les systèmes de classeurs: un état de lárt. Revue d’intelligence Artificielle RSTI série RIA,Lavoisier, vol. 21 (February 2007)
Google Scholar
Sigaud, O., Wilson, S.W.: Learning classifier systems: a survey. Soft Comput. 11(11), 1065–1078 (2007)
Article MATH Google Scholar
Smith, S.F.: A Learning System based on Genetic Adaptive Algorithms. PhD thesis, University of Pittsburgh (1980)
Google Scholar
Wilson, S.W.: Classifier fitness based on accuracy. Evolutionary Computation 3(2), 148–175 (1995)
Article Google Scholar
Zatuchna, Z.V.: AgentP: A Learning Classifier System with Associative Perception in Maze Environments. PhD thesis, School of Computing Sciences, UEA (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

LAMIA Laboratory, Université des Antilles Guyane, Campus Fouillole, BP 592, 97157, Pointe à Pitre Cedex, Guadeloupe, France
Énée Gilles & Péroumalnaïk Mathias

Authors

Énée Gilles
View author publications
You can also search for this author in PubMed Google Scholar
Péroumalnaïk Mathias
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, ASAP research group, University of Nottingham, Jubilee Campus, Nottingham, NG8 1BB, and Multidisciplinary Centre for Integrative Biology, School of Biosciences, LE12 5RD, Sutton Bonington, UK
Jaume Bacardit
School of Engineering and Computer Science, Victoria University of Wellington, PO Box 600, 6140, Wellington, New Zealand
Will Browne
Department of Brain and Cognitive Sciences, University of Rochester, Meliora Hall, 14627, Rochester, NY, USA
Jan Drugowitsch
Enginyeria i Arquitectura La Salle, Universitat Ramon Llull, Quatre Camins, 2, 08022, Barcelona, Spain
Ester Bernadó-Mansilla
Department of Psychology III, University of Würzburg, COBOSLAB – Cognitive Bodyspaces: Learning and Behavior,, Röntgenring 11, 97070, Würzburg, Germany
Martin V. Butz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gilles, É., Mathias, P. (2010). Building Accurate Strategies in Non Markovian Environments without Memory. In: Bacardit, J., Browne, W., Drugowitsch, J., Bernadó-Mansilla, E., Butz, M.V. (eds) Learning Classifier Systems. IWLCS IWLCS 2009 2008. Lecture Notes in Computer Science(), vol 6471. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17508-4_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-17508-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17507-7
Online ISBN: 978-3-642-17508-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics