A Fast Indexing Method for Monte-Carlo Go

Chen, Keh-Hsun; Du, Dawei; Zhang, Peigang

doi:10.1007/978-3-540-87608-3_9

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5131)

Included in the following conference series:

International Conference on Computers and Games

Abstract

3×3 patterns are widely used in Monte-Carlo (MC) Go programs to improve performance. In this paper, we propose a direct indexing approach to build and use a complete 3×3 pattern library. The contents of the 8 positions immediately neighboring a board point are coded into a 16-bit string, called the surrounding index. The surrounding indices of all board points can be updated incrementally and efficiently. We propose an effective method to learn the pattern weights from forty thousand professional games. The method converges faster than computing “Elo ratings” [4] and performs equally well or better. The knowledge contained in the pattern library can be applied efficiently both to the MC simulations and to the growth of the MC search tree. Test results showed that our method increased the winning rate of Go Intellect against GNU Go in 9×9 games by over 7%, even after taking the cost in program speed into consideration.
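
The surrounding index and its incremental update can be illustrated with a minimal sketch. The abstract does not give the bit layout, so the code below assumes 2 bits per neighbor (empty, black, white, off-board) packed in a fixed neighbor order; the names `Board`, `set_point`, `OFFSETS`, and the state constants are hypothetical and are not taken from the paper. Captures, move legality, and the pattern weights themselves are omitted.

```python
# Assumed 2-bit codes for the contents of a neighboring position.
EMPTY, BLACK, WHITE, EDGE = 0, 1, 2, 3

# Eight neighbor offsets (row, col). Neighbor k occupies bits 2k and 2k+1.
# The order is chosen so that OFFSETS[7 - k] is the opposite of OFFSETS[k].
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
           (0, 1), (1, -1), (1, 0), (1, 1)]


class Board:
    def __init__(self, size=9):
        self.size = size
        self.cells = [[EMPTY] * size for _ in range(size)]
        # One 16-bit surrounding index per board point.
        self.index = [[self._full_index(r, c) for c in range(size)]
                      for r in range(size)]

    def _get(self, r, c):
        """Contents of (r, c), or EDGE when the point is off the board."""
        if 0 <= r < self.size and 0 <= c < self.size:
            return self.cells[r][c]
        return EDGE

    def _full_index(self, r, c):
        """Compute the surrounding index of (r, c) from scratch."""
        idx = 0
        for k, (dr, dc) in enumerate(OFFSETS):
            idx |= self._get(r + dr, c + dc) << (2 * k)
        return idx

    def set_point(self, r, c, color):
        """Change the contents of (r, c) and patch the 8 affected indices."""
        self.cells[r][c] = color
        for k, (dr, dc) in enumerate(OFFSETS):
            nr, nc = r + dr, c + dc
            if 0 <= nr < self.size and 0 <= nc < self.size:
                # Seen from the neighbor, (r, c) lies at the opposite offset.
                shift = 2 * (7 - k)
                cleared = self.index[nr][nc] & ~(3 << shift)
                self.index[nr][nc] = cleared | (color << shift)
```

Under these assumptions, a change at one point touches at most eight indices, and a pattern weight could then be fetched by direct indexing into a 65,536-entry table, for example `weights[board.index[r][c]]` with a hypothetical `weights` list.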

References

[1] Bouzy, B., Chaslot, G.M.J.-B.: Bayesian Generation and Integration of K-nearest neighbor Patterns for 19×19 Go. In: Kendall, G., Lucas, S. (eds.) IEEE 2005 Symposium on Computational Intelligence in Games, Essex, UK, pp. 176–181 (2005)
[2] Bouzy, B., Chaslot, G.M.J.-B.: Monte-Carlo Go Reinforcement Learning Experiments. In: IEEE 2006 Symposium on Computational Intelligence in Games, Reno, USA, pp. 187–194 (2006)
[3] Chen, K., Zhang, P.: Monte-Carlo Go with Knowledge-guided Simulations. ICGA Journal 31(2), 67–76 (2008)
[4] Coulom, R.: Computing “Elo Ratings” of Move Patterns in the Game of Go. ICGA Journal 30(4), 198–208 (2007)
[5] Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with Patterns in Monte-Carlo Go. Technical Report 6062, INRIA (2006)
[6] Gelly, S., Silver, D.: Combining Online and Offline Knowledge in UCT. In: Ghahramani, Z. (ed.) Proceedings of the International Conference of Machine Learning (ICML 2007), pp. 273–280 (2007)
[7] Kocsis, L., Szepesvári, C.: Bandit Based Monte-Carlo Planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
[8] Silver, D., Sutton, R.S., Müller, M.: Reinforcement Learning of Local Shape in the Game of Go. In: 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), pp. 1053–1058 (2007)
[9] Stern, D., Herbrich, R., Graepel, T.: Bayesian Pattern Ranking for Move Prediction in the Game of Go. In: Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, PA (2006)

Author information

Authors and Affiliations

Department of Computer Science, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
Keh-Hsun Chen, Dawei Du & Peigang Zhang

Editor information

H. Jaap van den Herik, Xinhe Xu, Zongmin Ma, Mark H. M. Winands

Cite this paper

Chen, KH., Du, D., Zhang, P. (2008). A Fast Indexing Method for Monte-Carlo Go. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds) Computers and Games. CG 2008. Lecture Notes in Computer Science, vol 5131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87608-3_9

DOI: https://doi.org/10.1007/978-3-540-87608-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87607-6
Online ISBN: 978-3-540-87608-3
eBook Packages: Computer Science, Computer Science (R0)
