Reducing the Learning Time of Tetris in Evolution Strategies

Boumaza, Amine

doi:10.1007/978-3-642-35533-2_17

Amine Boumaza^22,23

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7401))

Included in the following conference series:

International Conference on Artificial Evolution (Evolution Artificielle)

737 Accesses

Abstract

Designing artificial players for the game of Tetris is a challenging problem that many authors addressed using different methods. Very performing implementations using evolution strategies have also been proposed. However one drawback of using evolution strategies for this problem can be the cost of evaluations due to the stochastic nature of the fitness function. This paper describes the use of racing algorithms to reduce the amount of evaluations of the fitness function in order to reduce the learning time. Different experiments illustrate the benefits and the limitation of racing in evolution strategies for this problem. Among the benefits is designing artificial players at the level of the top ranked players at a third of the cost.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 72.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Audibert, J.-Y., Munos, R., Szepesvári, C.: Tuning Bandit Algorithms in Stochastic Environments. In: Hutter, M., Servedio, R.A., Takimoto, E. (eds.) ALT 2007. LNCS (LNAI), vol. 4754, pp. 150–165. Springer, Heidelberg (2007)
Chapter Google Scholar
Bertsekas, D., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scientific (1996)
Google Scholar
de Boer, P., Kroese, D., Mannor, S., Rubinstein, R.: A tutorial on the cross-entropy method. Annals of Operations Research 1(134), 19–67 (2004)
Google Scholar
Böhm, N., Kókai, G., Mandl, S.: An Evolutionary Approach to Tetris. In: University of Vienna Faculty of Business; Economics, Statistics (eds.) Proc. of the 6th Metaheuristics International Conference, CDROM (2005)
Google Scholar
Boumaza, A.: On the evolution of artificial tetris players. In: Proc. of the IEEE Symp. on Comp. Intel. and Games, CIG 2009, pp. 387–393. IEEE (June 2009)
Google Scholar
Burgiel, H.: How to lose at Tetris. Mathematical Gazette 81, 194–200 (1997)
Article Google Scholar
Demaine, E.D., Hohenberger, S., Liben-Nowell, D.: Tetris is Hard, Even to Approximate. In: Warnow, T.J., Zhu, B. (eds.) COCOON 2003. LNCS, vol. 2697, pp. 351–363. Springer, Heidelberg (2003)
Chapter Google Scholar
Fahey, C.P.: Tetris AI, Computer plays Tetris (2003), on the web http://colinfahey.com/tetris/tetris_en.html
Farias, V., van Roy, B.: Tetris: A study of randomized constraint sampling. Springer (2006)
Google Scholar
Hansen, N., Müller, S., Koumoutsakos, P.: Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evolutionary Computation 11(1), 1–18 (2003)
Article Google Scholar
Hansen, N., Niederberger, S., Guzzella, L., Koumoutsakos, P.: A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion. IEEE Trans. Evol. Comp. 13(1), 180–197 (2009)
Article Google Scholar
Heidrich-Meisner, V., Igel, C.: Hoeffding and bernstein races for selecting policies in evolutionary direct policy search. In: Proc. of the 26th ICML, pp. 401–408. ACM, New York (2009)
Google Scholar
Maron, O., Moore, A.W.: Hoeffding races: Accelerating model selection search for classification and function approximation. In: Proc. Advances in Neural Information Processing Systems, pp. 59–66. Morgan Kaufmann (1994)
Google Scholar
Ostermeier, A., Gawelczyk, A., Hansen, N.: A derandomized approach to self-adaptation of evolution strategies. Evolutionary Computation 2(4), 369–380 (1994)
Article Google Scholar
Schmidt, C., Branke, J., Chick, S.E.: Integrating Techniques from Statistical Ranking into Evolutionary Algorithms. In: Rothlauf, F., Branke, J., Cagnoni, S., Costa, E., Cotta, C., Drechsler, R., Lutton, E., Machado, P., Moore, J.H., Romero, J., Smith, G.D., Squillero, G., Takagi, H. (eds.) EvoWorkshops 2006. LNCS, vol. 3907, pp. 752–763. Springer, Heidelberg (2006)
Chapter Google Scholar
Siegel, E.V., Chaffee, A.D.: Genetically optimizing the speed of programs evolved to play tetris. In: Angeline, P.J., Kinnear Jr., K.E. (eds.) Advances in Genetic Programming 2, pp. 279–298. MIT Press, Cambridge (1996)
Google Scholar
Stagge, P.: Averaging Efficiently in the Presence of Noise. In: Eiben, A.E., Bäck, T., Schoenauer, M., Schwefel, H.-P. (eds.) PPSN 1998. LNCS, vol. 1498, pp. 188–197. Springer, Heidelberg (1998)
Chapter Google Scholar
Szita, I., Lörincz, A.: Learning tetris using the noisy cross-entropy method. Neural Comput. 18(12), 2936–2941 (2006)
Article MATH Google Scholar
Thiery, C., Scherrer, B.: Building Controllers for Tetris. International Computer Games Association Journal 32, 3–11 (2009)
Google Scholar
Thiery, C., Scherrer, B.: Least-Squares λ Policy Iteration: Bias-Variance Trade-off in Control Problems. In: Proc. ICML, Haifa (2010)
Google Scholar
Tsitsiklis, J.N., van Roy, B.: Feature-based methods for large scale dynamic programming. Machine Learning 22, 59–94 (1996)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Univ. Lille Nord de France, F-59000, Lille, France
Amine Boumaza
ULCO, LISIC, F-62100, Calais, France
Amine Boumaza

Authors

Amine Boumaza
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LERIA, Laboratoire d’Etude et de echerche en Informatique d’Angers, Université d’Angers, 2 Boulevard Lavoisier, 49045, Angers Cedex 01, France
Jin-Kao Hao
Equipe ALEA, INRIA Bordeaux sud-ouest, Institut de Mathématiques de Bordeaux (IMB), UMR CNRS 5251, Université Bordeaux Segalen, 3ter place de la victoire, 33076, Bordeaux, France
Pierrick Legrand
Laboratoire des Sciences de l’Ingénieur, de l’Informatique et de l’Imagerie (ICUBE), Part d’innovation, University of Strasbourg, Boulevard Sébastien Brant, BP 10413, 67412, Illkirch Cedex, France
Pierre Collet
Ecole Polytechnique de l’Université de Tours, Laboratoire d’Informatique de l’Université de Tours, 64 avenue Jean Portalis, 37200, Tours, France
Nicolas Monmarché
INRIA Saclay - Île-de-France, Equipe AVIZ, Bât 490, 91405, Orsay Cedex, France
Evelyne Lutton
INRIA Saclay - Ile de France, Equipe TAO, LRI, Université Paris Sud, Bât 490, 91405, Orsay Cedex, France
Marc Schoenauer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boumaza, A. (2012). Reducing the Learning Time of Tetris in Evolution Strategies. In: Hao, JK., Legrand, P., Collet, P., Monmarché, N., Lutton, E., Schoenauer, M. (eds) Artificial Evolution. EA 2011. Lecture Notes in Computer Science, vol 7401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35533-2_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-35533-2_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35532-5
Online ISBN: 978-3-642-35533-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics