A Comparison between Cognitive and AI Models of Blackjack Strategy Learning

  • Marvin R. G. Schiller
  • Fernand R. Gobet
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7526)

Abstract

Cognitive models of blackjack playing are presented and investigated. Blackjack playing is considered a useful test case for theories on human learning. Curiously, despite the existence of a relatively simple, well-known and optimal strategy for blackjack, empirical studies have found that casino players play quite differently from that strategy. The computational models presented here attempt to explain this result by modelling blackjack playing using the cognitive architecture CHREST. Two approaches to modeling are investigated and compared; (i) the combination of classical and operant conditioning, as studied in psychology, and (ii) SARSA, as studied in AI.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gobet, F., Lane, P.C.R., Croker, S., Cheng, P.C.H., Jones, G., Oliver, I., Pine, J.M.: Chunking mechanisms in human learning. Trends in Cognitive Sciences 5, 236–243 (2001)CrossRefGoogle Scholar
  2. 2.
    Perez-Uribe, A., Sanchez, E.: Blackjack as a test bed for learning strategies in neural networks. In: IEEE International Joint Conference on Neural Networks, IJCNN 1998, vol. 3, pp. 2022–2027 (1998)Google Scholar
  3. 3.
    Kendall, G., Smith, C.: The evolution of blackjack strategies. In: The 2003 Congress on Evolutionary Computation, CEC 2003, vol. 4, pp. 2474–2481 (2003)Google Scholar
  4. 4.
    Wagenaar, W.: Paradoxes of gambling behaviour. Erlbaum, Hillsdale (1988)Google Scholar
  5. 5.
    Thorp, E.: Beat the dealer: A winning strategy for the game of twenty-one: A scientific analysis of the world-wide game known variously as blackjack, twenty-one, vingt-et-un, pontoon or Van John. Blaisdell Pub. Co. (1962)Google Scholar
  6. 6.
    Walker, M., Sturevska, S., Turpie, D.: The quality of play in Australian casinos. In: Finding the Edge: Mathematical Analysis of Casino Games. Institute for the Study of Gambling and Commercial Gaming (2000)Google Scholar
  7. 7.
    Chau, A.W.L., Phillips, J.G., Von Baggo, K.L.: Departures from sensible play in computer blackjack. Journal of General Psychology 127(4), 426–438 (2000)CrossRefGoogle Scholar
  8. 8.
    Widrow, B., Gupta, N.K., Maitra, S.: Punish/reward: Learning with a critic in adaptive threshold systems. IEEE Transactions on Systems, Man and Cybernetics 3, 455–465 (1973)MathSciNetMATHCrossRefGoogle Scholar
  9. 9.
    Wetzels, R., Vandekerckhove, J., Tuerlinckx, F., Wagenmakers, E.J.: Bayesian parameter estimation in the expectancy valence model of the Iowa gambling task. Journal of Mathematical Psychology 54, 14–27 (2010)MathSciNetMATHCrossRefGoogle Scholar
  10. 10.
    Busemeyer, J.R., Stout, J.C.: A contribution of cognitive decision models to clinical assessment: Decomposing performance on the Bechara gambling task. Psychological Assessment 14, 253–262 (2002)CrossRefGoogle Scholar
  11. 11.
    Simon, H.A., Chase, W.G.: Skill in chess: Experiments with chess-playing tasks and computer simulation of skilled performance throw light on some human perceptual and memory processes. American Scientist, 394–403 (1973)Google Scholar
  12. 12.
    Gobet, F., Simon, H.A.: Templates in chess memory: A mechanism for recalling several boards. Cognitive Psychology 31, 1–40 (1996)CrossRefGoogle Scholar
  13. 13.
    Ekman, P.: Basic emotions. In: Handbook of Cognition and Emotion. Wiley (1999)Google Scholar
  14. 14.
    Plutchik, R.: Emotion: A psychoevolutionary synthesis. Harper & Row, New York (1980)Google Scholar
  15. 15.
    Rescorla, R.A., Wagner, A.R.: A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In: Black, A.H., Prokasy, W.F. (eds.) Classical Conditioning II: Current Research and Theory, pp. 64–99. Appleton-Century-Crofts, New York (1972)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Marvin R. G. Schiller
    • 1
  • Fernand R. Gobet
    • 1
  1. 1.Brunel UniversityLondonUK

Personalised recommendations