Skip to main content
Log in

A brief history of learning classifier systems: from CS-1 to XCS and its variants

  • Special Issue
  • Published:
Evolutionary Intelligence Aims and scope Submit manuscript

Abstract

The direction set by Wilson’s XCS is that modern Learning Classifier Systems can be characterized by their use of rule accuracy as the utility metric for the search algorithm(s) discovering useful rules. Such searching typically takes place within the restricted space of co-active rules for efficiency. This paper gives an overview of the evolution of Learning Classifier Systems up to XCS, and then of some of the subsequent developments of Wilson’s algorithm to different types of learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Ahluwalia M, Bull L (1999) A genetic programming-based classifier system. In: Banzhaf W et al (eds) GECCO-99: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 11–18

  2. Becker S (2005) A computational principle for hippocampal learning and neurogenesis. Hippocampus 15(6):722–738

    Article  Google Scholar 

  3. Bernado Mansilla E, Garrell J (2003) Accuracy-based learning classifier systems: models, analysis and applications to classification tasks. Evol Comput 11(3):209–238

    Article  Google Scholar 

  4. Bezerra G, Barra T, de Castro L, Von Zuben (2005) Adaptive radius immune algorithm for data clustering. In: Pilat C et al (eds) Proceedings of the 4th international conference on artificial immune systems. Springer, New York, pp 290–303

  5. Bonelli P, Parodi A, Sen S, Wilson SW (1990) NEWBOOLE: a fast GBML system. In: International conference on machine learning. Morgan Kaufmann, Burlington, pp 153–159

  6. Booker L (1982) Intelligent behavior as an adaptation to the task environment. Ph.D. Thesis, the University of Michigan

  7. Booker LB (1985) Improving the performance of genetic algorithms in classifier systems. In: Grefenstette JJ (ed) Proceedings of the first international conference on genetic algorithms and their applications. Lawrence Erlbaum Associates, New York, pp 80–92

  8. Booker L (1988) Classifier systems that learn internal world models. Mach Learn 3:161–192

    Google Scholar 

  9. Booker L (1989) Triggered rule discovery in classifier systems. In Schaffer J (ed) Proceedings of the international conference on genetic algorithms. Morgan Kaufmann, Burlington, pp 265–274

  10. Box G (1957) Evolutionary operation: a method for increasing industrial productivity. J R Stat Soc C 6(2):81–101

    Google Scholar 

  11. Bull L (ed) (2004) Applications of learning classifier systems. Springer, New York

  12. Bull L (2005) Two simple learning classifier systems. In: Bull L, Kovacs T (eds) Foundations of learning classifier systems. Springer, New York, pp 63–90

  13. Bull L (2009) On dynamical genetic programming: simple boolean networks in learning classifier systems. Int J Parallel Emergent Distrib Syst 24(5):421–442

    Article  MATH  MathSciNet  Google Scholar 

  14. Bull L (2011) Towards a mapping of modern AIS and LCS. In: Lio P et al (eds) Proceedings of the tenth international conference on artificial immune systems. Springer, New York pp 371–382

  15. Bull L (2014) Exploiting generalisation symmetries in accuracy-based learning classifier systems: an initial study. http://arxiv.org/abs/1401.2949

  16. Bull L, Hurst J (2002) ZCS Redux Evol Comput 10(2):185–205

    Article  Google Scholar 

  17. Bull L, O’Hara T (2002) Accuracy-based neuro and neuro-fuzzy classifier systems. In: Langdon WB et al (eds) GECCO-2002: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 905–911

  18. Bull L, Kovacs T (eds) (2005) Foundations of learning classifier systems. Springer, New York

  19. Bull L, Lanzi P-L, O’Hara T (2007) Anticipation mappings for learning classifier systems. In: Proceedings of the IEEE congress on evolutionary computation. IEEE Press, Piscataway, pp 2133–2140

  20. Bull L, Studley M, Bagnall A, Whittley I (2007) Learning classifier system ensembles with rule sharing. IEEE Trans Evol Comput 11(4):496–502

    Article  Google Scholar 

  21. Bull L, Bernado Mansilla E, Holmes J (eds) (2008) Learning classifier systems in data mining. Springer, New York

  22. Butz M (2002) Anticipatory learning classifier systems. Kluwer, New Yokr

  23. Butz MV (2006) Rule-based evolutionary online learning systems. Springer, New York

  24. Butz MV, Wilson SW (2002) An algorithmic description of XCS. Soft Comput 6(3–4):144–153

    Article  MATH  Google Scholar 

  25. Butz MV, Goldberg DE (2003) Generalized state values in an anticipatory learning classifier system. In: Butz MV, Sigaud O, Gérard P (eds) Anticipatory behavior in adaptive learning systems. Springer, New York, pp 282–301

  26. Butz MV, Kovacs T, Lanzi P-L, Wilson SW (2004) Toward a theory of generalization and learning in XCS. IEEE Trans Evol Comput 8(1):28–46

    Article  Google Scholar 

  27. Butz MV, Sastry K, Goldberg DE (2005) Strong, stable, and reliable fitness pressure in XCS due to tournament selection. Genet Program Evol Mach 6(1):53–77

    Article  Google Scholar 

  28. Butz MV, Goldberg DE, Lanzi P-L (2005) Gradient descent methods in learning classifier systems: improving XCS performance in multi-step problems. IEEE Trans Evol Comput 9(5):452–473

    Article  Google Scholar 

  29. Butz MV, Pelikan M, Llora X, Goldberg DE (2006) Automated global structure extraction for effective local building block processing in XCS. Evol Comput 14(3):345–380

    Article  Google Scholar 

  30. Butz MV, Goldberg D, Lanzi P-L, Sastry K (2007) Problem solution sustenance in XCS: markov chain analysis of niche support distributions and the impact on computational complexity. Genet Program Evol Mach 8(1):5–37

    Article  Google Scholar 

  31. Butz MV, Lanzi P-L, Wilson SW (2008) Function approximation with XCS: hyperellipsoidal conditions, recursive least squares, and compaction. IEEE Trans Evol Comput 12(3):355–376

    Article  Google Scholar 

  32. Casillas J, Carse B, Bull L (2007) Fuzzy XCS: a michigan genetic fuzzy system. IEEE Trans Fuzzy Syst 15(4):536–550

    Article  Google Scholar 

  33. Cliff D, Ross S (1995) Adding Temporary Memory to ZCS. Adapt Behav 3(2):101–150

    Article  Google Scholar 

  34. Coufal N et al (2009) L1 retrotransposition in human neural progenitor cells. Nature 460:1127–1131

    Article  Google Scholar 

  35. Dam H, Abbass H, Lokan C, Yao X (2008) Neural-based learning classifier systems. IEEE Trans Knowl Data Eng 20(1):26–39

    Article  Google Scholar 

  36. De Castro L, Von Zuben F (2002) Learning and optimization using the clonal selection principle. IEEE Trans Evol Comput 6(3):239–251

    Article  Google Scholar 

  37. Dorigo M, Bersini H (1994) A comparison of Q-learning and classifier systems. In: Cliff D, Husbands P, Meyer J-A, Wilson SW (eds) From animals to animats 3: proceedings of the third international conference on simulation of adaptive behaviour. MIT Press, Cambridge, pp 248–255

  38. Drugowitsch J, Barry A (2005) XCS with eligibility traces. In: Beyer HG et al (eds) GECCO-2005: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 1851–1858

  39. Edakunni N, Brown G, Kovacs T (2011) Online, GA based mixture of experts: a probabilistic model of UCS. In: GECCO-2011: Proceedings of the genetic and evolutionary computation conference. ACM Press, New York, pp 1267–1274

  40. Eiben A, Smith J (2003) Introduction to evolutionary computing. Springer, New York

  41. Farley B, Clark W (1954) Simulation of self-organizing systems by digital computer. IRE Trans Inf Theory 4:76–84

    Article  MathSciNet  Google Scholar 

  42. Farmer JD, Packard N, Perelson A (1986) The immune system, adaptation and machine learning. Phys D 22:187–204

    Article  MathSciNet  Google Scholar 

  43. Fernández A, García S, Luengo J, Bernadó-Mansilla E, Herrera F (2010) Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study. IEEE Trans Evol Comput 14(6):913–941

    Article  Google Scholar 

  44. Fernando C, Szathmary E, Husbands P (2012) Selectionist and evolutionary approaches to brain function: a critical appraisal. Front Comput Neurosci 6:24

    Article  Google Scholar 

  45. Fraser A (1957) Simulation of genetic systems by automatic digital computers. I. Introduction. Aust J Biol Sci 10:484–491

    Google Scholar 

  46. Frey P, Slate D (1991) Letter recognition using holland-style adaptive classifiers. Mach Learn 6:161–182

    Google Scholar 

  47. Gererd P, Sigaud O (2001) YACS: combining dynamic programming with generalization in classifier systems. In: Lanzi P-L, Stolzmann W, Wilson SW (eds) Advances in learning classifier systems: proceedings of the third international workshop on learning classifier systems. Springer, New York, pp 52–69

  48. Giani A, Baiardi F, Starita A (1995) PANIC: a parallel evolutionary rule based system. In: McDonnell J, Reynolds R, Fogel D (eds) Proceedings of the fourth annual conference on evolutionary programming. MIT Press, Cambridge, pp 753–772

  49. Goldberg D (1985) Genetic algorithms and rule learning in dynamic system control. In: Grefenstette JJ (ed) Proceedings of the first international conference on genetic algorithms and their applications. Lawrence Erlbaum Associates, New York, pp 8–15

  50. Greensmith J, Feyereisl J, Aickelin U (2008) DCA: some comparison. Evol Intel 1(2):85–112

    Article  Google Scholar 

  51. Hartley A (1999) Accuracy-based fitness allows similar performance to humans in static and dynamic classification environments. In: Banzhaf W et al (eds) GECCO-99: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 266–273

  52. Hercog L, Fogarty TC (2002) Coevolutionary classifier systems for multi-agent simulation. In: Proceedings of the IEEE congress on evolutionary computation. IEEE Press, Piscataway, pp 1798–1803

  53. Hoffmann J (1993) Vorhersage und Erkenntnis. Hogrefe, Goettingen

    Google Scholar 

  54. Holland JH (1975) Adaptation in natural and artificial systems. University of Michigan Press, Ann Arbor

  55. Holland JH (1976) Adaptation. In: Rosen and Snell (eds) Progress in theoretical biology, vol 4. Plenum, Berlin, pp 263–293

  56. Holland JH (1980) Adaptive algorithms for discovering and using general patterns in growing knowledge bases. Int J Policy Anal Inf Syst 4(3):245–268

    Google Scholar 

  57. Holland JH (1985) Properties of the bucket brigade. In: Grefenstette JJ (ed) Proceedings of the first international conference on genetic algorithms and their applications. Lawrence Erlbaum Associates, New York, pp 1–7

  58. Holland JH (1986) Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In: Michalski R, Carbonell J, Mitchell T (eds) Machine learning, an artificial intelligence approach. Morgan Kaufmann, Burlington, pp 593–623

  59. Holland JH (1990) Concerning the emergence of tag-mediated lookahead in classifier systems. Phys D 42:188–201

    Article  Google Scholar 

  60. Holland JH, Reitman JH (1978) Cognitive systems based in adaptive algorithms. In Waterman DA, Hayes-Roth F (eds) Pattern-directed inference systems. Academic Press, New York, pp 313–329

  61. Holland JH, Holyoak KJ, Nisbett RE, Thagard PR (1986) Induction: processes of inference, learning and discovery. MIT Press, Cambridge

    Google Scholar 

  62. Howard D, Bull L, Lanzi P-L (2009) Continuous actions in continuous space and time using self-adaptive constructivism in neural XCSF. In: GECCO-2009: proceedings of the genetic and evolutionary computation conference. ACM Press, New York, pp 1219–1226

  63. Howard D, Bull L, Lanzi P-L (2010) A spiking neural representation for XCSF. In: Proceedings of the IEEE congress on evolutionary computation. IEEE Press, Piscataway, pp 1–8

  64. Hurst J, Bull L (2002) A self-adaptive XCS. In: Lanzi P-L, Stolzmann W, Wilson SW (eds) Advances in learning classifier systems: proceedings of the 4th international workshop on learning classifier systems. Springer, New York, pp 57–73

  65. Iqbal M, Browne W, Zhang M (2013) Evolving optimum populations with XCS classifier systems—XCS with code fragmented action. Soft Comput 17(3):503–518

    Article  Google Scholar 

  66. Iqbal M, Browne W, Zhang M (2014) Reusing building blocks of extracted knowledge to solve complex, large-scale boolean problems. IEEE Trans Evol Comput 18(4):465–480

    Article  Google Scholar 

  67. Kovacs T (1997) XCS classifier system reliably evolves accurate, complete, and minimal representations for boolean functions. In: Roy, Chawdhry, Pant (eds) Soft computing in engineering design and manufacturing. Springer, New York, pp 59–68

  68. Lanzi P-L (2008) Learning classifier systems: then and now. Evol Intel 1(1):63–82

    Article  Google Scholar 

  69. Lanzi P-L, Perrucci A (1999) Extending the representation of classifier conditions part II: from messy coding to S-expressions. In: Banzhaf W et al (eds) GECCO-99: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 345–352

  70. Lanzi P-L, Riolo R (2000) A roadmap to the last decade of learning classifier system research. In: Lanzi P-L, Stolzmann W, Wilson SW (eds) Learning classifier systems: from foundations to applications. Springer, New York, pp 33–62

  71. Lanzi P-L, Wilson SW (2000) Toward optimal classifier system performance in non-markov environments. Evol Comput 8(4):393–418

    Article  Google Scholar 

  72. Lanzi P-L, Loiacono D, Wilson SW, Goldberg D (2007) Generalization in the XCSF classifier system: analysis, improvement, and extension. Evol Comput 15(2):133–168

    Article  Google Scholar 

  73. Loiacono D, Lanzi P-L (2009) Recursive least squares and quadratic prediction in continuous multistep problems. In: Bacardit J et al (eds) Learning classifier systems: revised selected papers. Springer, New York, pp 70–86

  74. Orriols-Puig A, Bernado Mansilla E (2008) Revisiting UCS: description, fitness sharing, and comparison with XCS. In: Bacardit J et al (eds) Learning classifier systems: revised selected papers. Springer, New York, pp 96–116

  75. Orriols-Puig A, Casillas J, Bernadó Mansilla E (2009) Fuzzy-UCS: a michigan-style learning fuzzy-classifier system for supervised learning. IEEE Trans Evol Comput 13(2):260–283

    Article  Google Scholar 

  76. Preen R, Bull L (2013) Dynamical genetic programming in XCSF. Evol Comput 21(3):361–388

    Article  Google Scholar 

  77. Qian L, Shi Y, Gao Y, Yin H (2013) Voting-XCSc: a consensus clustering method via learning classifier system. In: Yin H et al (eds) Intelligent data engineering and automated learning—IDEAL. Springer, New York, pp 603–610

  78. Riolo R (1991) Lookahead planning and latent learning in a classifier system. In: Meyer J-A, Wilson SW (eds) From animals to animats: proceedings of the first international conference on simulation of adaptive behaviour. MIT Press, Cambridge, pp 316–326

  79. Roberts G (1993) Dynamic planning for classifier systems. In: Forrest S (ed) Proceedings of the 5th international conference on genetic algorithms. Morgan Kaufmann, Burlington, pp 231–237

  80. Samuel AL (1959) Some studies in machine learning using the game of checkers. IBM J Res Dev 3:211–229

    Article  MathSciNet  Google Scholar 

  81. Samuel AL (1967) Some studies in machine learning using the game of checkers. II. Recent progress. IBM J Res Dev 11:601–617

    Article  Google Scholar 

  82. Schultz W (1998) Predictive reward signal of dopamine neurons. J Neurophysiol 68:1190–1208

    Google Scholar 

  83. Seward J (1949) An experimental analysis of latent learning. J Exp Psychol 39:177–186

    Article  Google Scholar 

  84. Shannon C (1950) Programming a computer for playing chess. Phil Mag 41:256–275

    Article  MATH  MathSciNet  Google Scholar 

  85. Smith SF (1980) A learning system based on genetic adaptive algorithms. PhD Thesis, University of Pittsburgh

  86. Smith R, Cribbs H (1994) Is a learning classifier system a type of neural network? Evol Comput 2(1):19–36

    Article  Google Scholar 

  87. Smith R, Jiang M, Bacardit J, Stout M, Krasnogor N, Hirst J (2010) A learning classifier system with mutual-information-based fitness. Evol Intel 3(1):31–50

    Article  Google Scholar 

  88. Stalph P, Llorà X, Goldberg D, Butz MV (2012) Resource management and scalability of the XCSF learning classifier system. Theoret Comput Sci 425:126–141

    Article  MATH  MathSciNet  Google Scholar 

  89. Stalph P, Rubinsztajin J, Sigaud O, Butz MV (2012) Function approximation with LWPR and XCSF: a comparative study. Evol Intel 5(2):103–116

    Article  Google Scholar 

  90. Stolzmann W (1998) Anticipatory classifier systems. In: Koza et al (eds) Genetic programming 1998: proceedings of the third annual conference. Morgan Kaufmann, Burlington, pp 658–654

  91. Stone C, Bull L (2003) For Real! XCS with continuous-valued inputs. Evol Comput 11(3):299–336

    Article  Google Scholar 

  92. Stone C, Bull L (2005) Comparing XCS and ZCS on noisy continuous-valued environments. Technical report: UWELCSG05-002. http://www.cems.uwe.ac.uk/lcsg

  93. Studley M, Bull L (2005) X-TCS: accuracy-based learning classifier system robotics. In: Proceedings of the IEEE congress on evolutionary computation. IEEE, pp 2099–2106

  94. Studley M, Bull L (2006) Using the XCS classifier system for multi-objective reinforcement learning problems. Artif Life 13(1):69–86

    Article  Google Scholar 

  95. Sutton R, Barto A (1981) Toward a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88:135–170

    Article  Google Scholar 

  96. Sutton R, Barto A (1998) Reinforcement learning. MIT Press, Cambridge

  97. Tammee K, Bull L, Ouen P (2006) A learning classifier system approach to clustering. In: Proceedings of the 6th international conference on intelligent systems design and applications. IEEE, pp 621–626

  98. Tammee K, Bull L, Ouen P (2007) Towards clustering with XCS. In: Thierens D et al (eds) GECOO-2007: proceedings of the genetic and evolutionary computation conference. ACM Press, New York, pp 1854–1860

  99. Thorndike E (1911) Animal intelligence. Macmillan Company, New York

  100. Tibshirani R, Walther G, Hastie T (2000) Estimating the number of clusters in a dataset via the gap statistic. J R Stat Soc B 63:411–423

    Article  MathSciNet  Google Scholar 

  101. Timmis J, Andrews P, Owens N, Clark E (2008) An interdisciplinary perspective on artificial immune systems. Evol Intel 1(1):5–26

    Article  Google Scholar 

  102. Tomlinson A, Bull L (2002) An accuracy-based corporate classifier system. Soft Comput 6(3–4):200–215

    Article  MATH  Google Scholar 

  103. Tran T, Sanza C, Duthen Y, Nguyen D (2007) XCSF with computed continuous action. In: Thierens D et al (eds) GECCO-07: proceedings of the genetic and evolutionary computation conference. ACM Press, New York, pp 1861–1868

  104. Turing A (1948) Intelligent machinery. Reprinted in: Copeland J. (2004) The essential turing. Oxford University Press, Oxford, pp 395–432

  105. Urbanowicz R, Moore J (2009) Learning classifier systems: a complete introduction, review and roadmap. J Artif Evol Appl 1:1–25

    Article  Google Scholar 

  106. Valenzuela-Rendón M (1991) The fuzzy classifier system: a classifier system for continuously varying variables. In: Belew R, Booker L (eds) Proceedings of the 4th international conference on genetic algorithms. Morgan Kaufmann, Burlington, pp 346–353

  107. Venturini G (1994) Apprentissage Adaptatif et Apprentissage Supervisé par Algorithme Génétique. Thèse de Docteur en Science (Informatique), Université de Paris-Sud

  108. Vijayakumar S, D’Souza A, Schall S (2005) Incremental on-line learning in high dimensions. Neural Comput 17(12):2602–2634

    Article  MathSciNet  Google Scholar 

  109. Watkins CJ (1989) Learning from delayed rewards. Ph.D. Thesis, Cambridge University

  110. Wilson SW (1985) Knowledge growth in an artificial animal. In: Grefenstette JJ (ed) Proceedings of the first international conference on genetic algorithms and their applications. Lawrence Erlbaum Associates, New York, pp 16–23

  111. Wilson SW (1987) Classifier systems and the animat problem. Mach Learn 2:219–228

  112. Wilson SW (1994) ZCS: a zeroth-level classifier system. Evol Comput 2(1):1–18

    Article  Google Scholar 

  113. Wilson SW (1995) Classifier fitness based on accuracy. Evol Comput 3(2):149–176

    Article  Google Scholar 

  114. Wilson SW (1998) Generalization in the XCS classifier system. In: Koza et al (eds) Genetic programming 1998: proceedings of the 3rd annual conference. Morgan Kaufmann, Burlington, pp 322–334

  115. Wilson SW (2001) Function approximation with a classifier system. In: Spector L et al (eds) GECCO-01: proceedings of the genetic and evolutionary computation conference. Morgan Kaufmann, Burlington, pp 974–981

  116. Wilson SW (2002) Classifiers that approximate functions. Nat Comput 1(1):211–233

    Article  MATH  MathSciNet  Google Scholar 

  117. Wilson SW (2007) Three architectures for continuous action. In: Bacardit J et al (eds) Learning classifier systems: revised selected papers. Springer, New York, pp 239–257

  118. Wilson SW (2008) Classifier conditions using gene expression programming. In Bacardit J et al (eds) Learning classifier systems: revised selected papers. Springer, New York, pp 206–217

  119. Wilson SW, Goldberg DE (1989) A critical review of classifier systems. In: Schaffer J (ed) Proceedings of the 3rd international conference on genetic algorithms. Morgan Kauffman, San Francisco, pp 244–255

  120. Wyatt D, Bull L (2004) A memetic learning classifier system for describing continuous-valued problem spaces. In: Krasnagor N, Hart W, Smith J (eds) Recent advances in memetic algorithms. Springer, New York, pp 355–396

  121. Xu R, Wunsch D (2009) Clustering. IEEE Press, Piscataway

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Larry Bull.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bull, L. A brief history of learning classifier systems: from CS-1 to XCS and its variants. Evol. Intel. 8, 55–70 (2015). https://doi.org/10.1007/s12065-015-0125-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12065-015-0125-y

Keywords

Navigation