Mimicking Go Experts with Convolutional Neural Networks

Sutskever, Ilya; Nair, Vinod

doi:10.1007/978-3-540-87559-8_11

Ilya Sutskever¹ &
Vinod Nair¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5164))

Included in the following conference series:

International Conference on Artificial Neural Networks

2565 Accesses
13 Citations

Abstract

Building a strong computer Go player is a longstanding open problem. In this paper we consider the related problem of predicting the moves made by Go experts in professional games. The ability to predict experts’ moves is useful, because it can, in principle, be used to narrow the search done by a computer Go player. We applied an ensemble of convolutional neural networks to this problem. Our main result is that the ensemble learns to predict 36.9% of the moves made in test expert Go games, improving upon the state of the art, and that the best single convolutional neural network of the ensemble achieves 34% accuracy. This network has less than 10⁴ parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

van der Werf, E.: AI Techniques for the Game of Go. UPM, Universitaire Pers Maastricht (2004)
Google Scholar
Müller, M.: Review: Computer Go 1984-2000. Lecture Notes In Computer Science, 405–413 (2000)
Google Scholar
Bouzy, B., Cazenave, T.: Computer Go: An AI oriented survey. Artificial Intelligence 132(1), 39–103 (2001)
Article MATH MathSciNet Google Scholar
Schaeffer, J., Burch, N., Bjornsson, Y., Kishimoto, A., Muller, M., Lake, R., Lu, P., Sutphen, S.: Checkers Is Solved. Science 317(5844), 1518 (2007)
Article MathSciNet Google Scholar
LeCun, Y., Boser, B., Denker, J., Howard, R., Habbard, W., Jackel, L., Henderson, D.: Handwritten digit recognition with a back-propagation network. Advances in neural information processing systems 2 table of contents, 396–404 (1990)
Google Scholar
Schraudolph, N., Dayan, P., Sejnowski, T.: Temporal Difference Learning of Position Evaluation in the Game of Go. Advances in Neural Information Processing Systems 6, 817–824 (1994)
Google Scholar
Stern, D., Herbrich, R., Graepel, T.: Bayesian pattern ranking for move prediction in the game of Go. In: Proc. of the 23rd international conference on Machine learning, pp. 873–880 (2006)
Google Scholar
Hall, M.T., Fairbairn, J.: The Gogod Database and Encyclopaedia (2006), www.gogod.co.uk
Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. Document Analysis and Recognition, 958–963 (2003)
Google Scholar
Ranzato, M., LeCun, Y.: A sparse and locally shift invariant feature extractor applied to document images. In: Proc. International Conference on Document Analysis and Recognition (ICDAR) (2007)
Google Scholar
LeCun, Y., Huang, F., Bottou, L.: Learning methods for generic object recognition with invariance to pose and lighting. Computer Vision and Pattern Recognition 2 (2004)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE 86(11) (1998)
Google Scholar
van der Werf, E., Uiterwijk, J., Postma, E., van den Herik, J.: Local Move Prediction in Go. Computers and Games (2003)
Google Scholar
Enzenberger, M.: Evaluation in Go by a Neural Network using Soft Segmentation. Advances in Computer Games 10 (2003)
Google Scholar
Tesauro, G.: Temporal difference learning and TD-Gammon. Communications of the ACM 38(3), 58–68 (1995)
Article Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Brugmann, B.: Monte Carlo Go (1993)
Google Scholar
Gelly, S., Wang, Y.: Exploration exploitation in Go: UCT for Monte-Carlo Go. In: NIPS-2006: On-line trading of Exploration and Exploitation Workshop, Whistler, Canada (2006)
Google Scholar
Wu, L., Baldi, P.: A Scalable Machine Learning Approach to Go. Neural Information Processing Systems, 1521–1528 (2007)
Google Scholar
Araki, N., Yoshida, K., Tsuruoka, Y., Tsujii, J.: Move Prediction in Go with the Maximum Entropy Method. In: Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
Ilya Sutskever & Vinod Nair

Authors

Ilya Sutskever
View author publications
You can also search for this author in PubMed Google Scholar
Vinod Nair
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Véra Kůrková Roman Neruda Jan Koutník

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sutskever, I., Nair, V. (2008). Mimicking Go Experts with Convolutional Neural Networks. In: Kůrková, V., Neruda, R., Koutník, J. (eds) Artificial Neural Networks - ICANN 2008. ICANN 2008. Lecture Notes in Computer Science, vol 5164. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87559-8_11

Download citation

DOI: https://doi.org/10.1007/978-3-540-87559-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87558-1
Online ISBN: 978-3-540-87559-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics