Game Analytics pp 205-253 | Cite as

Game Data Mining

  • Anders DrachenEmail author
  • Christian Thurau
  • Julian Togelius
  • Georgios N. Yannakakis
  • Christian Bauckhage


During the years of the Information Age, technological advances in the computers, satellites, data transfer, optics, and digital storage has led to the collection of an immense mass of data on everything from business to astronomy, counting on the power of digital computing to sort through the amalgam of information and generate meaning from the data. Initially, in the 1970s and 1980s of the previous century, data were stored on disparate structures and very rapidly became overwhelming. The initial chaos led to the creation of structured databases and database management systems to assist with the management of large corpuses of data, and notably, the effective and efficient retrieval of information from databases. The rise of the database management system increased the already rapid pace of information gathering.


Data Mining Association Rule Game Development Player Behavior Online Game 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



  1. Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM-SIGMOD international conference on management of data (SIGMOD) (pp. 207–216). Washington, DC.Google Scholar
  2. Bauckhage, C., Kerstin, C., Sifa, R., Thurau, C., Drachen, A., & Canossa, A. (2012). How players lose interest in playing a game: An empirical study based on distributions of total playing times. In Proceedings of IEEE computational intelligence in games, Granada, Spain.Google Scholar
  3. Berry, M., & Linoff, G. (1999). Mastering data mining: The art and science of customer relationship management. New York: Wiley.Google Scholar
  4. Bishop, C. M. (2006). Pattern recognition and machine learning (Information science and statistics). New York: Springer.zbMATHGoogle Scholar
  5. Bohannon, J. (2010). Game-miners grapple with massive data. Science, 330(6000), 30–31.Google Scholar
  6. Castranova, E. (2001). Virtual worlds: A first-hand account of market and society on the Cyberian frontier (CESifo Working Paper Series no 618). München.Google Scholar
  7. Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinart, T., Shearer, C., & Wirth, R. (2000). Crispdm step-by-step data mining guide.
  8. Charles, D., & Black, M. (2004, November 8–10). Dynamic player modelling: A framework for playercentric digital games. In Proceedings of CGAIDE 2004, 5th international conference on computer games: Artificial intelligence, design and education. Microsoft Campus, Reading, UK. ISBN 09549016-0-6Google Scholar
  9. Chen, M. S., Han, J., & Yu, P. S. (1996). Data mining: An overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering, 8, 866–883.CrossRefGoogle Scholar
  10. Coulton, P., Bamford, W., Cheverst, K., & Rashid, O. (2008). 3D space-time visualization of player behavior in pervasive location-based games. International Journal of Computer Games Technology Volume 2008 (2008), Article ID 192153, 5 pages. http://doi:10.1155/2008/192153Google Scholar
  11. Cutler, A., & Breiman, L. (1994). Archetypal analysis. Technometrics, 36(4), 338–347.MathSciNetzbMATHCrossRefGoogle Scholar
  12. DeRosa, P. (2007, August 7). Tracking player feedback to improve game design. Gamasutra. Available from:
  13. Drachen, A., & Canossa, A. (2009). Towards gameplay analysis via gameplay metrics. In Proceedings of the 13th international MindTrek conference. Tampere: ACM. Google Scholar
  14. Drachen, A., & Canossa, A. (2011). Evaluating motion: Spatial user behavior in virtual environments. International Journal of Arts and Technology, 4, 294–314.CrossRefGoogle Scholar
  15. Drachen, A., Canossa, A., & Yannakakis, G. N. (2009). Player modeling using self- organization in Tomb Raider: Underworld. In Proceedings of the international symposium on Computational Intelligence and Games, CIG’09, Piscataway.Google Scholar
  16. Drachen, A., Sifa, R., Bauckhage, C., & Thurau, C. (2012). Guns, swords and data: Clustering of player behavior in computer games in the wild. In Proceedings of IEEE computational intelligence in games, Granada, Spain.Google Scholar
  17. Ducheneaut, N., & Moore, R. J. (2004). The social side of gaming: A study of interaction patterns in a massively multiplayer online game. In Proceedings of the 2004 ACM conference on computer supported cooperative work, Chicago. Google Scholar
  18. Erfani Joorabchi, M., Seif El-Nasr, M. (2011, October, 5–8). Measuring the impact of knowledge gained from playing FPS and RPG games on gameplay performance. In Proceedings of 10th international conference, ICEC 2011 (Lecture notes in computer science, Vol. 6972, pp. 300–306). Vancouver.Google Scholar
  19. Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P., & Uthurusamy, R. (1996). Advances in knowledge discovery and data mining. Menlo Park: AAAI Press.Google Scholar
  20. Fields, T., & Cotton, B. (2011). Social game design: Monetization methods and mechanics. Waltham: Morgan Kauffman Publishers.Google Scholar
  21. Finesso, L., & Spreij, P. (2004). Approximate nonnegative matrix factorization via alternating minimization. In Proceedings 16th international symposium on mathematical theory of networks and systems, Leuven. Google Scholar
  22. Flood, K. (2012, March 27). Game analytics (series). Kevin’s corner. URL: file:///G:/Work/METRICS/Metrics_references/Kevin%27s%20Corner%20%20Game%20Analytics.htm
  23. Gagné, A., Seif El-Nasr, M., & Shaw, C. (2012). Analysis of telemetry data from a real time ­strategy game: A case study. Computers in Entertainment (CIE) - Theoretical and Practical Computer Applications in Entertainment, 10(3), Article No. 2. New York: ACM. doi:http://10.1145/2381876.2381878Google Scholar
  24. Geng, L., & Hamilton, H. J. (2006). Interestingness measures for data mining: A survey. ACM Computing Surveys, 38(3), Article No. 9. New York: ACM. doi:http://10.1145/1132960.1132963Google Scholar
  25. Golub, G., & van Loan, J. (1996). Matrix computations (3rd ed.). Baltimore: Johns Hopkins University Press.zbMATHGoogle Scholar
  26. Han, J., Pei, J., & Yin, Y. (2000). Mining frequent patterns without candidate generation. In Proceedings of the 2000 ACM-SIGMOD international conference on management of data (SIGMOD) (pp. 1–12). New York.Google Scholar
  27. Han, J., Kamber, M., & Pei, J. (2005). Data mining: Concepts and techniques (Morgan Kaufmann large-scale data mining in games 41 2nd ed.). San Francisco: Morgan Kaufmann Publishers.Google Scholar
  28. Hoobler, N., Humphreys, G., & Agrawala, M. (2004). Visualizing competitive behaviors in multi-user virtual environments. In Proceedings of the conference on visualization. Los Alamitos: IEEE.Google Scholar
  29. Houlette, R. (2004). Player modeling for adaptive games. In S. Rabin (Ed.), AI game programming wisdom II (pp. 557–566). Hingham: Charles River Media.Google Scholar
  30. Isbister, K., & Schaffer, N. (2008). Game usability: Advancing the player experience. San Francisco: Morgan Kaufman.Google Scholar
  31. Jansen, B. J. (2009). Understanding user-web interactions via web analytics. San Rafael: Morgan & Claypool Publishers.Google Scholar
  32. Jolliffe, I. (1986). Principal component analysis. New York: Springer.CrossRefGoogle Scholar
  33. Kastbjerg, E. (2011). Combining sequence mining and heatmaps to visualize game event flows (working title). Master’s thesis, IT University of Copenhagen, Copenhagen.Google Scholar
  34. Kennerly, D. (2003, August 15). Better game design through data mining. Gamasutra. Available from:
  35. Kim, J. H, Gunn, D. V, Phillips, B. C, Pagulayan, R. J, & Wixon, D. (2008). Tracking real-time user experience (TRUE): A comprehensive instrumentation solution for complex systems. In Proceedings of the twenty-sixth annual SIGCHI conference on human factors in computing systems, CHI’08, Florence. Google Scholar
  36. King, D., & Chen, S. (2009). Metrics for social games. Presentation at the social games summit 2009, game developers conference. San Francisco, CA.Google Scholar
  37. Larose, D. T. (2004). Discovering knowledge in data: An introduction to data mining. Hoboken: Wiley.CrossRefGoogle Scholar
  38. Lee, D. D., & Seung, H. S. (1999). Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755), 788–799.CrossRefGoogle Scholar
  39. Mahlman, T., Drachen, A., Canossa, A., Togelius, J., & Yannakakis, G. (2010). Predicting player behavior in Tomb Raider: Underworld. In Proceedings of the international conference on Computational Intelligence and Games, CIG’10, Copenhagen.Google Scholar
  40. Mellon, L. (2009). Applying metrics driven development to MMO costs and risks. White paper, Versant Corporation.Google Scholar
  41. Missura, O., & Gärtner, T (2009). Player modeling for intelligent difficulty adjustment. In Proceedings of the 12th international conference on discovery science, DC’09, Berlin. Google Scholar
  42. Moura, D., Seif El-Nasr, M., & Shaw, C. D. (2011). Visualizing and understanding players’ behavior in video games: Discovering patterns and supporting aggregation and comparison. In Proceedings of the 2011 ACM SIGGRAPH symposium on video games (Sandbox ’11) (pp. 11–15). New York. ISBN:978-1-4503-0775-8, doi:http://10.1145/2018556.2018559.Google Scholar
  43. Nozhnin, D. (2012, May 17). Predicting churn: Data-mining your game. Gamasutra. URL:
  44. Paatero, P., & Tapper, U. (1994). Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics, 5(2), 111–126.CrossRefGoogle Scholar
  45. Pedersen, C., Togelius, J., & Yannakakis, G. N. (2010). Modeling player experience for content creation. Transactions on Computational Intelligence and AI in Games, 2, 54–67.CrossRefGoogle Scholar
  46. Rokach, L., & Maimon, O. (2008). Data mining with decision trees: Theory and applications. New Jersey: World Scientific Publishing.zbMATHGoogle Scholar
  47. Seif El-Nasr, M., & Zammitto, V. (2010). User experience research for sports games. Presentation at the GDC summit on games user research. San Francisco, CA.Google Scholar
  48. Seif El-Nasr, M., Aghabeigi, B., Milam, D., Erfani, M., Lameman, B., Maygoli, H., & Mah, S. (2010). Understanding and evaluating cooperative games. CHI 2010 (pp. 253–262). New York.Google Scholar
  49. Shaker, N., Yannakakis, G., & Togelius, J. (2011). Feature analysis for modeling game content quality. In Proceedings of the 2011 IEEE conference on computational intelligence and games (pp. 126–133). Seoul, KoreaGoogle Scholar
  50. Summit Kohonen, T. (2001). Self-organizing maps. Heidelberg: Springer.CrossRefGoogle Scholar
  51. Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction (Adaptive computation and machine learning). Cambridge: The MIT Press.Google Scholar
  52. Thawonmas, R., & Iizuka, K. (2008). Visualization of online-game players based on their action behaviors. International Journal of Computer Games Technology, 2008, 1–9.CrossRefGoogle Scholar
  53. Thawonmas, R., Kashifuji, Y., & Chen, K. T. (2008, December 3–5). Design of MMORPG Bots based on behavior analysis. In Proceedings of the 2008 international conference on advances in computer entertainment technology, ACE’08, Yokohama, Japan (ACM International Conference Proceeding Series 352, pp. 91–94). doi:10.1145/1501750.1501770, ISBN:978-1-60558-393-8.Google Scholar
  54. Thompson, C. (2007). Halo 3: How Microsoft labs invented a new science of play. Wired Magazine. Google Scholar
  55. Thurau, C., & Bauckhage, C. (2010). Analyzing the evolution of social groups in world of warcraft. In Proceedings of the international conference on Computational Intelligence and Games, IEEE, CIG’10, Copenhagen. Google Scholar
  56. Thurau, C., & Drachen, A. (2011). Introducing archetypal analysis for player classification in games. In Proceedings of the international workshop on evaluating player experience in games (EPEX’11) hosted at the 6th international conference on the foundations of digital games (FDG2011). Bordeaux.Google Scholar
  57. Thurau, C., Bauckhage, C., & Sagerer, G. (2004, July 13–17). Learning human-like movement behavior for computer games. In Proceedings of the 8th international conference on the Simulation of Adaptive Behavior, SAB’04. Los Angeles, USA. ISBN: 9780262693417.Google Scholar
  58. Thurau, C., Paczian, T., Sagerer, G., & Bauckhage, C. (2007). Bayesian imitation learning in game characters. International Journal of Intelligent Systems Technologies and Applications, 2(2–3), 284–295.CrossRefGoogle Scholar
  59. Thurau, C., Kersting, K., & Bauckhage, C. (2009). Convex non-negative matrix factorization in the wild. In Proceedings of the IEEE international conference on data mining, Miami.Google Scholar
  60. Thurau, C., Kersting, K., & Bauckhage, C. (2010). Yes we can – Simplex volume maximization for descriptive web–scale matrix factorization. In Proceedings of the international Conference on Information and Knowledge Management, ACM, CIKM’10, Toronto. Google Scholar
  61. Thurau, C., Kersting, K., Wahabzada, M., & Bauckhage, C. (2011). Descriptive matrix factorization for sustainability: Adopting the principle of opposites. Journal of Data Mining and Knowledge Discovery, 24, 325–354.MathSciNetCrossRefGoogle Scholar
  62. Weber, B., & Mateas, M. (2009). A data mining approach to strategy prediction. In Proceedings of the international symposium on Computational Intelligence and Games, CIG’09, Piscataway.Google Scholar
  63. Weber, B. G. John, M. Mateas, M. & Jhala, A. (2011). Modeling player retention in Madden NFL 11. In Proceedings of the association for the advancement of artificial intelligence conference, San Francisco.Google Scholar
  64. Williams, D., Yee, N., & Caplan, S. E. (2008). Who plays, how much, and why? Debunking the stereotypical gamer profile. Journal of Computer-Mediated Communication, 13, 993–1018.CrossRefGoogle Scholar
  65. Williams, D., Consalvo, M., Caplan, S., & Yee, N. (2009). Looking for gender (LFG): Gender roles and behaviors among online gamers. Journal of Communication, 59, 700–725.CrossRefGoogle Scholar
  66. Witten, I. H., & Frank, E. (2000). Data mining. New York: Morgan-Kaufmann.Google Scholar
  67. Yannakakis, G. A. (2012). Game AI revisited. In Proceedings of the conference on computing frontiers, Caligari.Google Scholar
  68. Yannakakis, G. N., & Hallam, J. (2009). Real-time game adaptation for optimizing player satisfaction. Transactions on Computational Intelligence and AI in Games, 1, 121–133.CrossRefGoogle Scholar
  69. Yannakakis, G. A., & Togelius, J. Experience-driven procedural content generation. IEEE Transactions on Affective Computing, 2 (3), 147–161CrossRefGoogle Scholar
  70. Zaki, M. J. (2001). Spade: An efficient algorithm for mining frequent sequences. Machine Learning, 42, 31–60.zbMATHCrossRefGoogle Scholar
  71. Zoeller, G. (2010). Game development telemetry. Presentation at the game developers conference 2010. Google Scholar

Copyright information

© Springer-Verlag London 2013

Authors and Affiliations

  • Anders Drachen
    • 1
    • 2
    • 3
    Email author
  • Christian Thurau
    • 4
  • Julian Togelius
    • 5
  • Georgios N. Yannakakis
    • 5
    • 6
  • Christian Bauckhage
    • 7
    • 8
  1. 1.PLAIT LabNortheastern UniversityBostonUSA
  2. 2.Department of Communication and PsychologyAalborg UniversityAalborgDenmark
  3. 3.Game AnalyticsCopenhagenDenmark
  4. 4.Game AnalyticsCopenhagenDenmark
  5. 5.Center for Computer Games ResearchIT University of CopenhagenCopenhagenDenmark
  6. 6.Department of Digital GamesUniversity of MaltaMsidaMalta
  7. 7.Fraunhofer Institute Intelligent Analysis and Information Systems IAISSchloss BirlinghovenSankt AugustinGermany
  8. 8.Bonn-Aachen International Center for Information TechnologyB-IT Dahlmannstraße 2BonnGermany

Personalised recommendations