DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess
We present an end-to-end learning method for chess, relying on deep neural networks. Without any a priori knowledge, in particular without any knowledge regarding the rules of chess, a deep neural network is trained using a combination of unsupervised pretraining and supervised training. The unsupervised training extracts high level features from a given position, and the supervised training learns to compare two chess positions and select the more favorable one. The training relies entirely on datasets of several million chess games, and no further domain specific knowledge is incorporated.
The experiments show that the resulting neural network (referred to as DeepChess) is on a par with state-of-the-art chess playing programs, which have been developed through many years of manual feature selection and tuning. DeepChess is the first end-to-end machine learning-based method that results in a grandmaster-level chess playing performance.
- 2.Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. In: NIPS (2007)Google Scholar
- 3.David, O.E., Koppel, M., Netanyahu, N.S.: Genetic algorithms for mentor-assisted evaluation function optimization. In: GECCO (2008)Google Scholar
- 4.David, O.E., van den Herik, H.J., Koppel, M., Netanyahu, N.S.: Simulating human grandmasters: evolution and coevolution of evaluation functions. In: GECCO (2009)Google Scholar
- 7.Elo, A.E.: The Rating of Chessplayers, Past and Present. Batsford, London (1978)Google Scholar
- 8.Hinton, G., Vinyals, O., Dean, J.: Distilling knowledge in a neural network. In: Deep Learning and Representation Learning Workshop, NIPS (2014)Google Scholar
- 10.Lai, M.: Giraffe: Using deep reinforcement learning to play chess. Master’s Thesis, Imperial College London (2015)Google Scholar
- 11.Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS (2015)Google Scholar
- 12.Romero, A., Ballas, N., Ebrahimi Kahou, S., Chassang, A., Gatta, C., Bengio, Y.: FitNets: hints for thin deep nets. In: ICLR (2015)Google Scholar
- 13.Schaeffer, J., Hlynka, M., Jussila, V.: Temporal difference learning applied to a high-performance game-playing program. In: Joint Conference on Artificial Intelligence (2001)Google Scholar
- 17.Wiering, M.A.: TD learning of game evaluation functions with hierarchical neural architectures. Master’s Thesis, University of Amsterdam (1995)Google Scholar