4*4-Pattern and Bayesian Learning in Monte-Carlo Go

Wang, Jiao; Li, Shiyuan; Chen, Jitong; Wei, Xin; Lv, Huizhan; Xu, Xinhe

doi:10.1007/978-3-642-31866-5_10

Jiao Wang¹⁷,
Shiyuan Li¹⁷,
Jitong Chen¹⁷,
Xin Wei¹⁷,
Huizhan Lv¹⁷ &
…
Xinhe Xu¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7168))

Included in the following conference series:

Advances in Computer Games

1830 Accesses
1 Citations

Abstract

The paper proposes a new model of pattern, namely the 4*4-Pattern, to improve MCTS (Monte-Carlo Tree Search) in computer Go. A 4*4-Pattern provides a larger coverage space and more essential information than the original 3*3-Pattern. Nevertheless the latter is currently widely used. Due to the lack of a central symmetry, it takes greater challenges to apply a 4*4-Pattern compared to a 3*3-Pattern. Many details of a 4*4-Pattern implementation are presented, including classification, multiple matching, coding sequences, and fast lookup. Additionally, Bayesian 4*4-Pattern learning is introduced, and 4*4-Pattern libraries are automatically generated from a vast amount of professional game records. The results of our experiments show that the use of 4*4-Patterns can improve MCTS in 19*19 Go to some extent, in particular when supported by 4*4-Pattern libraries generated by Bayesian learning.

The material in this paper is based upon work supported by the NSFC-MSRA Joint Research Fund under Grant 60971057.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bouzy, B., Chaslot, G.: Bayesian generation and integration of k-nearest-neighbor patterns for 19×19 Go. Computational Intelligence in Games, 176–181 (2005)
Google Scholar
Bouzy, B., Cazenave, T.: Computer go: An AI oriented survey. Artificial Intelligence 132(1), 39–103 (2001)
Article MathSciNet MATH Google Scholar
Fuego Developer’s Documentation, http://www.cs.ualberta.ca/~games/go/fuego/fuegodoc/
Gelly, S., Silver, D.: Combining Offline and Online Knowledge in UCT. In: ICML 2007: Proceedings of the 24th International Conference on Machine Learning, pp. 273–280. Association for Computing Machinery (2007)
Google Scholar
Gelly, S., et al.: Modification of UCT with Patterns in Monte-Carlo Go. Technical Report 6062. INRIA,France (2006)
Google Scholar
Gelly, S., Wang, Y.: Exploration exploitation in go: UCT for Monte-Carlo go. In: On-line trading of Exploration and Exploitation Workshop (2006)
Google Scholar
Minka, T.P.: A family of algorithms for approximate Bayesian inference. Massachusetts Institute of Technology (2001)
Google Scholar
Müller, M.: Position Evaluation in Computer Go. ICGA Journal, pp. 219-228 (2002)
Google Scholar
Silver, D., Tesauro, G.: Monte-Carlo Simulation Balancing. In: Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, Quebec, Canada, pp. 954–852 (2009)
Google Scholar
Stern, D., Herbrich, R., Graepel, T.: Bayesian Pattern Ranking for Move Prediction in the Game of Go. In: The 23rd International Conference on Machine Learning, pp.873–880 (2006)
Google Scholar
Stern, D., Graepel, T., MacKay, D.: Modelling Uncertainty in The Game of Go. In: Advances in Neural Information Processing Systems, pp.33–40 (2004)
Google Scholar
Wang, Y., Gelly, S.: Modifications of UCT and sequence-like simulations for Monte-Carlo Go. In: IEEE Symposium on Computational Intelligence and Games, pp. 175–182 (2007)
Google Scholar
Wu, R., Beal, D.F.: A Memory Efficient Retrograde Algorithm and Its Application To Chess Endgames. In: More Games of No Chance, vol. 42. MSRI Publication (2002)
Google Scholar
Zobrist: Feature. Extraction and Representation for Pattern Recognition and the Game of Go. PhD thesis, University of Wisconsin (1970)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Information Science and Engineering, Northeastern University, China
Jiao Wang, Shiyuan Li, Jitong Chen, Xin Wei, Huizhan Lv & Xinhe Xu

Authors

Jiao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shiyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Jitong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xin Wei
View author publications
You can also search for this author in PubMed Google Scholar
Huizhan Lv
View author publications
You can also search for this author in PubMed Google Scholar
Xinhe Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tilburg Institute of Cognition and Communication, Tilburg University, Warandelaan 2, 5037 AB, Tilburg, The Netherlands
H. Jaap van den Herik & Aske Plaat &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, J., Li, S., Chen, J., Wei, X., Lv, H., Xu, X. (2012). 4*4-Pattern and Bayesian Learning in Monte-Carlo Go. In: van den Herik, H.J., Plaat, A. (eds) Advances in Computer Games. ACG 2011. Lecture Notes in Computer Science, vol 7168. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31866-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-31866-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31865-8
Online ISBN: 978-3-642-31866-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics