Abstract
Building a strong computer Go player is a longstanding open problem. In this paper we consider the related problem of predicting the moves made by Go experts in professional games. The ability to predict experts’ moves is useful, because it can, in principle, be used to narrow the search done by a computer Go player. We applied an ensemble of convolutional neural networks to this problem. Our main result is that the ensemble learns to predict 36.9% of the moves made in test expert Go games, improving upon the state of the art, and that the best single convolutional neural network of the ensemble achieves 34% accuracy. This network has less than 104 parameters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
van der Werf, E.: AI Techniques for the Game of Go. UPM, Universitaire Pers Maastricht (2004)
Müller, M.: Review: Computer Go 1984-2000. Lecture Notes In Computer Science, 405–413 (2000)
Bouzy, B., Cazenave, T.: Computer Go: An AI oriented survey. Artificial Intelligence 132(1), 39–103 (2001)
Schaeffer, J., Burch, N., Bjornsson, Y., Kishimoto, A., Muller, M., Lake, R., Lu, P., Sutphen, S.: Checkers Is Solved. Science 317(5844), 1518 (2007)
LeCun, Y., Boser, B., Denker, J., Howard, R., Habbard, W., Jackel, L., Henderson, D.: Handwritten digit recognition with a back-propagation network. Advances in neural information processing systems 2 table of contents, 396–404 (1990)
Schraudolph, N., Dayan, P., Sejnowski, T.: Temporal Difference Learning of Position Evaluation in the Game of Go. Advances in Neural Information Processing Systems 6, 817–824 (1994)
Stern, D., Herbrich, R., Graepel, T.: Bayesian pattern ranking for move prediction in the game of Go. In: Proc. of the 23rd international conference on Machine learning, pp. 873–880 (2006)
Hall, M.T., Fairbairn, J.: The Gogod Database and Encyclopaedia (2006), www.gogod.co.uk
Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. Document Analysis and Recognition, 958–963 (2003)
Ranzato, M., LeCun, Y.: A sparse and locally shift invariant feature extractor applied to document images. In: Proc. International Conference on Document Analysis and Recognition (ICDAR) (2007)
LeCun, Y., Huang, F., Bottou, L.: Learning methods for generic object recognition with invariance to pose and lighting. Computer Vision and Pattern Recognition 2 (2004)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEEÂ 86(11) (1998)
van der Werf, E., Uiterwijk, J., Postma, E., van den Herik, J.: Local Move Prediction in Go. Computers and Games (2003)
Enzenberger, M.: Evaluation in Go by a Neural Network using Soft Segmentation. Advances in Computer Games 10 (2003)
Tesauro, G.: Temporal difference learning and TD-Gammon. Communications of the ACM 38(3), 58–68 (1995)
Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Brugmann, B.: Monte Carlo Go (1993)
Gelly, S., Wang, Y.: Exploration exploitation in Go: UCT for Monte-Carlo Go. In: NIPS-2006: On-line trading of Exploration and Exploitation Workshop, Whistler, Canada (2006)
Wu, L., Baldi, P.: A Scalable Machine Learning Approach to Go. Neural Information Processing Systems, 1521–1528 (2007)
Araki, N., Yoshida, K., Tsuruoka, Y., Tsujii, J.: Move Prediction in Go with the Maximum Entropy Method. In: Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sutskever, I., Nair, V. (2008). Mimicking Go Experts with Convolutional Neural Networks. In: KĹŻrková, V., Neruda, R., KoutnĂk, J. (eds) Artificial Neural Networks - ICANN 2008. ICANN 2008. Lecture Notes in Computer Science, vol 5164. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87559-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-87559-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87558-1
Online ISBN: 978-3-540-87559-8
eBook Packages: Computer ScienceComputer Science (R0)