Validating gameplay activity inventory (GAIN) for modeling player profiles

In the present study, we validated Gameplay Activity Inventory (GAIN), a short and psychometrically sound instrument for measuring players’ gameplay preferences and modeling player profiles. In Study 1, participants in Finland ($$N=879$$ N=879 ) responded to a 52-item version of GAIN. An exploratory factor analysis was used to identify five latent factors of gameplay activity appreciation: Aggression, Management, Exploration, Coordination, and Caretaking. In Study 2, respondents in Canada ($$N=1322$$ N=1322 ) and Japan ($$N=1178$$ N=1178 ) responded to GAIN, and the factor structure of a 15-item version was examined using a Confirmatory Factor Analysis. The results showed that the short version of GAIN has good construct validity, convergent validity, and discriminant validity in Japan and in Canada. We demonstrated the usefulness of GAIN by conducting a cluster analysis to identify player types that differ in both demographics and game choice. GAIN can be used in research as a tool for investigating player profiles. Game companies, publishers and analysts can utilize GAIN in player-centric game development and targeted marketing and in generating personalized game recommendations.


Introduction
The rise in popularity of digital games, ranging from console and PC video games to free-to-play mobile games and virtual reality games, calls for better tools for understanding what makes a gameplay experience gratifying to players and what factors are related to game preferences and choices in game consumption. According to some estimations, in 2016 more than 750,000 games were published across gaming platforms (Neogames 2016). Game marketplaces are saturated, and it has become very difficult for game companies and publishers to reach loyal paying customers, and for players to find the games that are the most enjoyable for them.
The objective of investigating preferences in playing digital games is closely related to discussions on game choice and, consequently, to applications of media choice theory. Identifying the factors that have an effect on game choice is an important task for efficient and targeted marketing. Player profiles can be utilized as a tool for identifying patterns in consumption behavior, and a successful player typology could therefore serve as beneficial for game companies and players alike.
Motivations to play and other psychographic segmentations (see Hartmann and Klimmt 2006;Hamari and Tuunanen 2014, p. 34), as well as player behavioral approaches, have been utilized in constructing player typologies and player personas, that is, representative players (Drachen et al. 2009;Vahlo and Koponen 2018). Player typologies based on behavior data (e.g., psychometric measurements, game logs, and telemetrics) aim to scrutinize patterns in dynamic gameplay interaction. These are approaches of player modeling. Studies on motivations to play are instead endeavors of player profiling, which analyze players' sustained game preferences that exist beyond any given act of gameplay (Yannakakis et al. 2013;Cowley and Charles 2016).
Motivations to play studies (e.g., Bateman et al. 2011;Przybylski et al. 2010;Sherry et al. 2006;Yee 2006) aim to track down the reasons why players play digital games in general, or why they favor a particular type of digital game, for example, online multiplayer games (Yee 2006;Yee et al. 2012). Behavioral approaches (e.g., Bartle 2003;Mulligan and Patrovsky 2003;Tseng 2010;Cowley and Charles 2016), on the other hand, ask how players play the game they have chosen to play. For instance, Bartle (1996Bartle ( , 2003 fashioned a player type model by analyzing multi-user dungeon (MUD) users' bulletin-board posts and classifying the players according to their preferred playing styles as achievers, explorers, socializers and killers.
However, neither studies on play motivations nor play behavior provide a sufficient approach for understanding which games players are likely to choose. Studies on motivations to play are very general, while player behavior models tend to be too specific in this regard. Recently, a third approach has been proposed for profiling players which could be used for predicting game choice. Vahlo et al. (2017) call this approach game preference research. Since the third approach aims to bring together characteristics of gameplay and players' sustained preferences for specific kinds of gameplay experiences, this approach can be situated in between player behavior and motivations to play models.
Considerable effort has been put into studies on motivations to play and player behavior, but identifying players' appreciation of various game dynamics has not been studied extensively. The current study focuses on this aspect of digital gaming and thus contributes to the understanding of players' preferences and habits in specific kinds of gameplay. Fleshing out players' preference profiles in gameplay appreciation is an important addition for player type research because it enables us to analyze continuities and emergent trends in gaming cultures in a way that may inform decision-making in game development and marketing.
The purpose of this study is to develop and validate a psychometrically sound short instrument for measuring players' gameplay activity preferences in contemporary digital games, and to demonstrate the potential of the instrument in constructing player profiles and in predicting game choice. The introduced Gameplay Activity Inventory (GAIN) is based on a theoretical model and on the Core Game Dynamics (CGD) scale presented by Vahlo et al. (2017). The original CGD scale is a 28-item Likert-7 measurement, which the authors (ibid.) argue consists of five latent factors. Their argument is based on an exploratory factor analysis (EFA) that the authors conducted with a data sample of 1718 respondents from Finland and Denmark. However, the fivefactor model of the CGD scale has not yet been validated. Furthermore, and as Vahlo et al. (2017) note, more items should be developed and included in the inventory. The inventory should also be balanced and shortened in order to make it more applicable to surveys. All of these shortcomings ought to be addressed before the model can be widely utilized for research purposes.
In this article, we investigate whether the five-factor structure of players' gameplay preferences can be validated and if the instrument can be shortened by conducting a Confirmatory Factor Analysis (CFA) on additional data sets. We begin by presenting our theoretical framework and research questions of the current study. Next, we analyze the Core Game Dynamics (CGD) scale as explored by Vahlo et al. (2017). We proceed to present a data triangulation for the items of the CGD scale by studying a total of 166 game instruction booklets. This is followed by an EFA for a reconsidered Gameplay Activity Inventory (GAIN) with a representative sample of 879 Finnish people. The rationale for another EFA is to screen the GAIN items and to build solid empirical grounds for a shorter and more robust inventory.
We continue then by describing our utilization of structural equation modelling (SEM) to construct a CFA for the argued-for five-factor model of gameplay activity preferences with representative data sets collected from Japan (N = 1178) and Canada (N = 1322). After investigating the construct validity, convergent validity, and discriminant validity of the shortened version of GAIN, we report a cluster analysis aimed at finding out whether player types could be identified by investigating players' preferences in the five gameplay activity factors. We continue then to explore the relationships between the identified player types and their habits to play games of specific genres. By doing so, we discuss how the findings provide new knowledge about which games players are likely to choose. Vahlo et al. (2017) suggest that game dynamics or gameplay activities should be considered as a unit of analysis in game preference research (see also Tondello et al. 2017). In this study, we have chosen to use 'gameplay activity' over 'game dynamics' (cf. Vahlo et al. 2017), because the former concept is readily more comprehensible for players and game designers than the latter, which refers to how interrelated game mechanics (Sicart 2009) are experienced as an emergent whole during gameplay (LeBlanc 2004;Hunicke et al. 2004;Vahlo et al. 2017;Vahlo 2017).

Theoretical framework
Furthermore, gameplay challenges and gameplay activities can both be considered as constituents of game dynamics (see Vahlo et al. 2017;Adams 2014). A player may encounter strategic or tactical challenges in various kinds of gameplay activities, which may or may not be related to, for example, behavioral models, aesthetics and narrative representations of warfare. Similarly, a player may try to overcome challenges that require creative problem-solving skills in very different kinds of activities and fictional settings. Hence, if we were to operate with the concept of game dynamics, the distinction between activities and challenges would be more difficult to establish.
Gameplay challenges and gameplay activities thus refer to separate dimensions of the gameplay experience. Because challenge types are of a higher level of abstraction than activity types, we presume that it is more intuitive for players to reflect on their gameplay activity preferences than gameplay challenge type preferences. Because of these reasons, we have decided to focus only on gameplay activities and exclude gameplay challenge types from the scale development. Therefore, we have relabeled the CGD scale presented by Vahlo et al. (2017) as the Gameplay Activity Inventory (GAIN).
An approach based on investigating gameplay activity preferences is also related to media choice theory, which states that users' preferences are consistently and systematically connected to media content, and that users exemplify their preferences in the patterns of their media consumption (Webster and Wakshlag 1983;Scherer and Naab 2009). Next, we situate our approach to media choice theories, and move then to present the research questions of this study.

Game choice
Media choice, and consequently game choice, has been conceptualized in the literature from the perspectives of need satisfaction, motivations and gratification (e.g., Ruggiero 2000;Krcmar and Strizhakova 2009), moods (e.g., Vorderer et al. 2004), taste (e.g., Peterson 2005, and preference type (e.g., Youn 1994;Prior 2007) factors (see Webster 2011). We argue in this article that dynamical and structural characteristics and resources of game media constitute an important factor in identifying game choice. Our approach is consistent both with the Theory of Subjective Quality Assessment (TSQA) and the structuration theory (Webster 2009(Webster , 2011(Webster , 2014. From a structuration theoretical viewpoint, there are three equally important constituents in media choice: agents, structures, and their duality or reciprocity (Webster 2011). Indeed, structuration approaches can be portrayed as audience-centric instead of media-centric or user-centric (Webster and Ksiasek 2012). The structuration theory differs in this regard from the uses and gratifications paradigm and from the mood management approach, both of which emphasize psychological factors in media choice (see e.g., Ruggiero 2000;Rubin 2002;Sherry 2004;Vorderer et al. 2004;Scherer and Naab 2009). A structural theoretical stance enables us to consider how media prod-ucts shape players' preference patterns, and how players' sustaining game choices influence what kinds of games are produced (Webster 2011;Jöckel and Dogruel 2012).
The model we present also draws from the Theory of Subjective Quality Assessment (TSQA). The TSQA argues that the media selections a person makes are guided by the implicit or explicit features and attributes of the perceived product, and of the desires of the user. In contrast to the uses and gratifications and mood management approaches, the TSQA is, in principle, able to track down the specific characteristics of a media product that have caused a user to experience gratification or inspired a preference change. However, the TSQA is akin to the uses and gratifications approach in its objective to explain long-term and medium-term patterns in media use (Wolling 2009).
In conducting research based on the TSQA, a crucial task is to identify which features of the products the users consider paramount for making quality assessments and value judgements (Wolling 2009, p. 88). Genre is an important factor in how a player's quality assessment transfers from one game product to another. It has been argued that individuals develop sustaining genre preferences as well as an individual taste in games in 'gaming socialization', that is, through experimentation and specialization (see Greenberg et al. 2010;von Salisch et al. 2011;Scharkow et al. 2015). Players' genre preferences have been discussed in previous research mostly from the perspective of gender (e.g., Lucas and Sherry 2004), age (e.g., De Schutter 2011;Quandt et al. 2009), or gratifications and motives (e.g., Hartmann and Klimmt 2006;Greenberg et al. 2010).
Prior research has argued that video game genres are primarily separated from each other based on the kinds of interactivity they afford for the player in gameplay challenges and activities (e.g., Apperley 2006;Arsenault 2009;Landay 2014). However, video game genres are emergent constructs: new subgenres are constantly introduced both by game companies and by player communities themselves (Arsenault 2009(Arsenault , 2014Juul 2014;Clarke et al. 2015). Thus, it is more relevant to investigate which game design constituents (i.e. game mechanics and dynamics) players find attractive rather than operate with established game genre constructs in fashioning players TSQA criteria ).
In the current structuration analytical and TSQA-inclined study, we associate 'structure' with the level of gameplay activities, which is enabled by interrelated game mechanics designed in games . Descriptions of gameplay activities are commonly used in both game journalism and player communities (Kirkpatrick 2012;Vahlo et al. 2017). In the TSQA it is contended that the elements of products to which people constantly refer in conversations form a plausible basis for modeling quality assessment criteria in media choice (Wolling 2009, p. 94).
As argued in activity theory (Kaptelinin and Nardi 2006), activities are purposeful, participatory and object-oriented. As with the agent-structure-duality approach in the structuration theory (Webster 2014), the concept of 'gameplay activity' manages thus to refer simultaneously to the motivated player-subject, to the dynamic game object, and to the reciprocity between these two constituents of the gameplay experience (Juul 2005, pp. 83-88;Björk and Holopainen 2006, p. 411;Leino 2012, pp. 58-59;Vahlo 2017

Research questions
The original Core Game Dynamics (CGD) scale is a 28-item Likert-7 questionnaire in which respondents are asked to rate how pleasant recurrent digital game dynamicssuch as collecting rare items, managing cities, or exploding and destroying-are for them as players (1 = very unpleasant, 7 = very pleasant). Vahlo et al. (2017) identified five latent constructs in the CGD scale by conducting an exploratory factor analysis (EFA) with data from 1718 respondents. They labeled the factors Assault, Manage, Journey, Coordinate, and Care. As specified by Vahlo et al. (2017), the next phase for the inventory is to further develop the survey items and conduct a theory-driven confirmatory analysis with a large data set representing cultural backgrounds other than Finnish and Danish. The first research question of this study can thus be summarized as:

RQ1
Can we confirm that players' gameplay activity preferences consist of five factors? If so, are these factors similar to the five factors presented by Vahlo et al. (2017)? Vahlo and Koponen (2018) studied how being motivated to play, enjoying in-game challenges and in-game activities predict game choice. By analyzing the survey data from 3551 adult respondents, they constructed a SEM model for predicting game choice. This was done by studying the direct and indirect effects of gaming motivations, gameplay challenge appreciation, and gameplay activity preferences on game choice, as measured by survey participants' reported frequency to play digital games of different genres. Motivations to play were measured by a 25-item 5-point Likert scale, gameplay challenge appreciation by a 9-item 5-point Likert scale, and gameplay activity preferences by a 28-item 5-point Likert scale (Fig. 1).
In the study by Vahlo and Koponen (2018), being motivated to play and appreciating in-game challenges predicted weakly game choice (i.e., a habit of playing a variety of digital game genres). In comparison, the effect of gameplay activity preference on game choice was moderate. Indeed, although motivations to play had the largest total effect on game choice, this effect was mostly indirect, which indicates that predictions of players' game choices should be based primarily on gameplay activity preference profiling and modeling. Furthermore, as argued in the TSQA (Wolling 2009), users evaluate and choose media products based on readily observable qualities and features. In contrast to gameplay activities, gameplay challenges are abstract constructs, which, consequently, are not as commonly referred to in game journalism and game-related discussions. For instance, a person may be perfectly cognizant of her preferences to explode and destroy in gameplay but unaware of what kind of physical, cognitive, or emotional challenges she finds enjoyable (see Denisova et al. 2017).
To better understand game choices of latent player segments, a player typology based on gameplay activity preferences is therefore called for. Vahlo, Kaakinen, Holm and Koponen made a cluster analysis to study whether player types can be constructed based on a player's gameplay type (Assault, Manage, Journey, Coordinate, Care) preference scores. The authors identified seven player types or player personas: The Mercenary, The Companion, The Commander, The Adventurer, The Patterner, The Explorer, and The Daredevil (See Vahlo et al. 2017).
To further investigate whether GAIN can be regarded as contingent with the CGD model, a cluster analysis should be conducted. Based on the results presented by Vahlo and Koponen (2018), identified player types can then be utilized in predicting players' game choices. The second and the third research questions of this study are: RQ2 Can the GAIN model be used in identifying player types and constructing player personas? If so, are these player personas similar to the seven presented by Vahlo et al. (2017)?
RQ3 Is it possible to predict players' game choices by analyzing how the identified player types differ in their habits of playing specific game genres?

Developing and confirming the gameplay activity inventory GAIN
We began the validation process of the gameplay activity inventory by collecting more empirical evidence on the dimensionality and construct validity of the original CGD inventory. A single exploratory study on a factor structure is not adequate for making a confirmatory factor analysis (Matsunaga 2010). Thus, we first conducted an additional exploratory factor analysis, and then proceeded to design a theory-driven CFA.

Data triangulation
We included in this study a total of 23 items from the original CGD scale ). These items were kept very similar to the original ones, although several of the items were rephrased and shortened to describe a specific gameplay activity more precisely than the original items arguably do. The original CGD scale included two items which described tactics and strategy. These items were excluded from GAIN because they converge to gameplay challenge types rather than to gameplay activities.
The original CGD item of "Exploring the gameworld and uncovering the game's secrets, mysteries and story" was divided into the items "Exploring the gameworld" and "Investigating the story and its mysteries". The item "Showing affection like flirting, hugging, kissing or making love" was modified into the items of "Flirting, seducing and romantic dating" and "Hugging, kissing and making out". Finally, the item of "Dancing, singing or playing instruments together and staying in rhythm" was split into three items of "Dancing to the music", "Playing musical instruments", and "Moving to a beat and staying in rhythm". Vahlo et al. (2017) suggested that the scale could be developed further by adding items that describe the gameplay activities of decorating, dressing up, customizing appearances, and sports. Along these guidelines, we developed the new items of "Decorating rooms and houses", "Dressing up, applying makeup and choosing looks", "Customizing a character's appearance", "Performing in athletics, gymnastics or other sports", "Doing acrobatic tricks (e.g., in extreme sports)", and "Managing a team and its players". Although the items "Piloting and steering vehicles" and "Racing at a high speed" did not load on any factor in the study by Vahlo et al. (2017), we decided to include them in GAIN due to the popularity of these gameplay activities in contemporary digital games. These changes resulted in a total of 38 inventory items.
Exploratory phases of inventory-development should include an extensive pool of items, which manage to cover all important aspects of the construct under study (Matsunaga 2010). We continued to analyze what kinds of new items could still be added to GAIN by studying a sample of 166 official game instruction booklets. This procedure was a data triangulation for the original content analysis of 700 game review articles reported in Vahlo et al. (2017).
The process included instruction booklets for games published for Playstation, Playstation 2, Playstation 3, Playstation Portable (PSP), Nintendo GameCube, Nintendo Wii, Nintendo WiiU, GameBoy Advance (GBA), Nintendo DS, Nintendo 3DS, Xbox 360, Xbox One and PC in the 1990s and 2000s. The data covered a wide variety of games from sports games and simulation games to strategy games and graphic adventures. The sample size of 166 game instruction manuals was selected based on saturation of the data. As a result of the content analysis, we were able to identify a total of 50 recurrent activities in contemporary digital games. These results were then considered in a focus group meeting with 15 game design course participants and their lecturer who suggested adding two more items to the inventory.
All of the items listed in the original inventory by Vahlo et al. (2017) were supported by the data triangulation process. In addition, the content analysis of 166 game instructions and the results of the focus group meeting suggested that GAIN could include a total of 14 new gameplay activity items that were not identified by Vahlo et al. (2017).

Survey on gameplay activity appreciation and an EFA-based item screening
A survey with the full list of the 52 GAIN items was conducted to explore players' preferences for the recurrent gameplay activities of digital games. Based on the results of the survey, the initial pool of items was to be evaluated and reduced.

Sampling
To screen the items of GAIN, we first recruited 1053 survey respondents in cooperation with an international market research company to obtain a representative sample of the gaming population from Finland (age 18-65 years). The market research company cleaned the data by analyzing participants' response times and by removing cases that replied to the survey too quickly.
At the beginning of the survey, participants were asked to report on a 5-point Likert scale how interested they were in digital games (1 = not at all interested, 5 = very interested). If a participant replied that he or she was not at all interested in digital games, he or she was thanked for participating and instructed to quit the survey. This was an informed decision because our focus was on studying on how players appreciate a variety of gameplay activities, which requires that a person have at least some interest in playing games.

Materials and procedure
The survey participants were instructed to think about their gameplay preferences and specify how pleasant (1 = very unpleasant, 5 = very pleasant, 5-point Likert scale) each of the 52 gameplay activities were for them as players. The survey also included, e.g., a 25-item 5-point Likert inventory for studying participants' motivations to play digital games, a 9-item 5-point Likert inventory on their preferred challenge types in games, and questions regarding participants' age, gender, income, expenditure on games and weekly play time. The data was collected via a web-based survey tool, and it took about 15-20 min to take the whole survey with either a computer or a mobile device.
It is generally recommended that researchers screen their data to exclude cases of inappropriate responses, especially in the factor analytical studies of scale development (Meade and Craig 2012). According to this principle, we removed from the initial data set (N = 1053) participants who showed content nonresponsivity, i.e., who responded in a specific way regardless of the item content. We removed the participants who did not show any or only very minimal variance (only 3 or less responses deviated from the mode response) in their responses. Since the items were randomized by the internet panel provider, we could not check the prevalence of fixed sequence responses.
As a result, the data was cleaned by excluding a total of 174 respondents. The final sample included in the EFA consisted of 879 participants (49.5 % men, mean age of 41.6 years). The careless response rate of 16.5 % is in line with the typical estimations for the prevalence of careless responses (see Meade and Craig 2012).

Results
An exploratory factor analysis using principal axis factors and promax rotation was made with the data of 879 respondents to investigate the latent structures of players' preferences in gameplay activities. Promax rotation was selected over varimax rotation because varimax rotation does not allow correlations between identified factors, but forces them to be orthogonal to each other (Matsunaga 2010, p. 100). There is, however, a well-founded reason to assume that the latent factors of gameplay activity preferences are related to each other, which is why the varimax method could misrepresent the results.
The number of extracted factors was identified by utilizing a parallel analysis (Henson and Roberts 2006). To do so, we first ran an EFA on the data and and then a parallel analysis (PA), which generated an artificial data set for identifying the correct number of factors. The PA test suggested that five factors were to be extracted, and therefore we proceeded to investigate five-factor solutions. We applied two criteria for retaining items in the inventory. First, the factor loading of > 0.50 was used as a criterion to define whether an item loaded on a factor. Secondly, we accepted only those items that had a discrepancy value of over 0.3 (see Matsunaga 2010).
In the first solution, the items of "Helping others in need", "Playing musical instruments", "Matching tiles or shapes together", "Jumping on platforms and bouncing off of walls" and "Befriending in-game characters" had loadings of < 0.50 and these items were dropped. We then repeated the process without these five items. The PA test still suggested a five-factor structure. Since each remaining item showed a loading of > 0.50 on a factor, we continued to investigate the discrepancy values for the remaining 47 items.
A total of five items showed a discrepancy value of under 0.30. These items were "Attacking, defending and casting spells", "Racing at a high speed", "Defending one's territory, city or base", "Riding animals or other creatures", and "Customizing a character's appearance". After removing these items, we ran an EFA again with the five-factor structure, supported by an additional PA test. In the next solution, four additional items showed a discrepancy value of < 0.30, and we therefore excluded the items of "Commanding units or troops", "Managing a team and its players", "Piloting and steering vehicles", and "Dancing to the music". We proceeded to run an EFA again. Another item, "Producing vehicles, units or weaponry", showed a relatively high secondary loading, and so it was dropped.
After running an EFA again with five factors as suggested by the PA test, all of the remaining 37 items showed > 0.50 loading on their primary factor (lowest 0.545) and a discrepancy value of over > 0.30 between the primary and the secondary factor loading (lowest 0.345). The final results and descriptive statistics are shown in Table 1.
Items 1-11 all describe gameplay activities of direct action and aggressive or violent conduct. In the type of gameplay covered by items 12-17, the player directs, commands and generates resources. In the gameplay activities of 18-24, the player is engaged with the inner development of her character and her endeavors in the gameworld and its stories. Items 25-30 encompass activities in which the player must show skill by acting precisely and accurately in relation to the dynamically changing game environment. Finally, items 31-37 represent gameplay activities that are about nurturing and showing affection and tenderness. All of the five factors revealed in the EFA with 879 respondents were consistent with the original factors identified by Vahlo et al. (2017), and therefore, we call the latent factors Aggression, Management, Exploration, Coordination, and Caretaking.
A total of 23 gameplay activities of the 37-item GAIN (Table 1) were highly similar to the original Core Game Dynamics scale introduced by Vahlo et al. (2017). The items describing racing at a high speed and piloting vehicles loaded in this study on Primary and secondary factor loadings, uniqueness, and discrepancy values between the strongest and the second strongest factor loadings for each item. Items marked with ( * ) are similar to the original CGD scale; see Vahlo et al. (2017) Coordination whereas in the study by Vahlo et al. (2017), these items did not show > 0.50 loadings on any of the five factors. However, both of these gameplay activity items showed discrepancy values of < 0.30 and were excluded from the shortlisted 37item version of GAIN. All four of the items that were excluded from the Management factor because they showed a discrepancy of < 0.30 can be interpreted in a broad framework of 'warfare', which has evident connections with both Aggression and Management. This can be exemplified further by considering the item "Waging war and conquering", which showed a strong primary loading on Aggression (0.78) but also a notable secondary loading on Management (0.25). As a result of the EFA-based item screening process, in Aggression, a total of 11 items were retained while Exploration and Caretaking both consisted of 7 items and Coordination and Management of 6 items.

A confirmatory factor analysis of a 15-item GAIN
A theory-driven CFA was designed for investigating whether the five-factor structure of GAIN could be confirmed by analyzing cross-cultural survey data (RQ1). We also asked whether the EFA-screened 37-GAIN items could be validated as indicators of the latent factor they were developed and designed to measure. Based on the theoretical considerations, findings presented by Vahlo et al. (2017) and the EFA reported in this article, we formed the following hypotheses: H1 Players' gameplay activity preferences form five distinctive, but related, dimensions of Aggression, Management, Exploration, Caretaking, and Coordination.

H2
The shortlisted GAIN items are valid indicators for measuring the hypothesized five factors of gameplay activity appreciation.

Sampling
A total of 3000 survey respondents (ages 18-65) were obtained from Japan (N = 1500) and Canada (N = 1500) to construct representative samples from the gaming population of both of these countries. The surveys were conducted simultaneously in January and February 2017 in cooperation with a market-research company by using a web-based tool similar to the one used for recruiting the Finnish sample (N = 1053).

Materials and procedure
The questions in both surveys were kept identical to the survey conducted in Finland. Thus, the surveys conducted in Canada and Japan included all of the 37 items of the Gameplay Activity Inventory, which were accepted into the final five-factor model presented in Table 1. The surveys were translated into Japanese, English, and French and translated back to Finnish to check the quality of these translations. Respondents from Canada had an opportunity to choose whether they would reply to the English or the French version of the survey.
The samples from Japan and Canada were cleaned with a similar procedure as was used with the Finnish sample. As a result, a total of 322 participants were removed from the Japanese sample, which resulted in 1178 respondents (55.0 % men, mean age 41.8 years). From the sample collected in Canada, we removed 178 participants. The Canadian sample included in this study had 1322 respondents (50.4 % men, mean age 40.8 years). The combined data from Japan and Canada had 2500 participants (52.6 % men, mean age 41.3 years).

Results
To conduct a CFA on the five-factor model of gameplay activity preference types, we constructed 3-item scales for each of the five hypothesized factors. The rationale for designing 3-item scales was to develop a psychometrically sound instrument for measuring players' gameplay activity preferences. A 15-item measurement for gameplay activity appreciation would be more easily included in future game research surveys than the full 37-item inventory. Three items per latent construct is generally regarded as a sufficient minimum for constructing CFA models (Brown 2015, pp. 61-62).
The 15 items were selected based on the following criteria: (1) the discrepancy value of the item was high (> 0.30), and (2) the item showed a strong loading on the corresponding factor (> 0.50) in the EFA reported earlier in this article (N = 879). Furthermore, (3) the qualitative aspects the item did not overlap with the other two selected items but rather managed to portray an additional facet of the hypothesized gameplay preference dimension, and (4) each item could be argued to characterize a considerable proportion of contemporary digital games.
The analyses were conducted with structural equation modelling (SEM) and by using the statistical software, Stata 14.2, and the maximum likelihood estimation procedure. We conducted a CFA on the five-factor model of gameplay activity preference factors with the combined data from Canada (N = 1322) and Japan (N = 1178), which resulted in a data sample of 2500 respondents. The measurement model for reporting the CFA is presented in Fig. 2, and the descriptive statistics for the shortlisted 15 GAIN items are reported in Table 2.
The construct validity of the five-factor construct was investigated by calculating the comparative fit index (CFI), the Tucker Lewis Index (TLI), the root mean squared error of approximation (RMSEA), and the standardized root mean squared residual score (SRMR). The chi square test (χ 2 ) was not utilized since this test has been argued as ill-fitting for large sample sizes, especially when the correlations in the model are strong (Russell 2002;Matsunaga 2010, p. 106). RMSEA and SRMR values are generally regarded as a good fit for the data if the values are at or under 0.05 and acceptable with values under 0.08. A CFI score of > 0.90 is considered acceptable and > 0.95 good. (Brown 2015;Acock 2013, pp. 24-31;Schreiber et al. 2006, p. 330;Kline 2010;Marsh et al. 2004;Hu and Bentler 1999) The model we present in Fig. 2 had the following fit to the data: RMSEA 0.050, CFI 0.978, TLI 0.970, and SRMR 0.028. These results suggest construct validity of the model and a close fit to the data of 2500 respondents from Japan and Canada (see Schreiber et al. 2006). We also estimated the goodness-of-fit for the 15-item GAIN separately for the two datasets obtained from Canada and Japan. The model showed a good fit with the data collected from both Canada (N = 1322) and Japan (N = 1178).

Fig. 2
The measurement model reporting confirmatory factor analysis for the five factors of gameplay activity preferences (N = 2500). All loadings for the 15-item version of the scale are significant on the level p < 0.001. See Table 7 in "Appendix" for information about Standardized and standardized factor loadings for the CFA model The goodness of fit values for the data from Canada were RMSEA 0.052, CFI 0.975, TLI 0.967, and SRMR 0.032, and for the data from Japan: RMSEA 0.053, CFI 0.976, TLI 0.969, and SRMR 0.033.

Convergent and discriminant validity
To validate a construct, convergent validity and discriminant validity of the model should be investigated. To show convergence, the factor loadings for all items should be strong, preferably > 0.7. The five-factor model we present in Fig. 2 fulfills this criterion of unidimensionality, since the lowest factor loading was 0.72 for the item of "Developing skills and abilities" (× 7). Depending on the theoretical framework, the correlations between latent constructs in a model can be desired or undesired (Harrington 2009, pp. 6-7;Kline 2010, pp. 71-72). We expected that the correlations between the latent constructs of a CFA model would be at least moderate because players who enjoy gameplay often play a significant variety of games. Moreover, we argue that Aggression, Management, Exploration, Coordination, and Caretaking are all facets of the phenomenon of gameplay activity appreciation, and we expected therefore to find significant correlations between each of the five constructs.
With the exception of the correlation between Aggression and Caretaking, all of the five factors of gameplay activity preferences correlated moderately (> 0.40) or strongly (> 0.60) with each other. The strongest correlation was between the latent constructs of Management and Aggression (0.76), and Management and Exploration (0.75). All the correlations between the five latent factors were statistically significant ( p < 0.001). We continued to test the convergent validity of the model by calculating composite reliability (CR) estimates for each of the five constructs included in the CFA (Fig. 2). It is generally held that an acceptable value for CR is 0.7 (see Zait and Bertea 2011). The CR estimates for the 3-item five factors were: Aggression (0.883), Management (0.879), Exploration (0.841), Coordination (0.852), and Caretaking (0.802).
To establish both the convergent and discriminant validity for the gameplay activity construct (Fig. 2), we conducted an average variance extracted analysis (AVE), which has been recommended for confirmatory stages of research (Farrell 2009;Zait and Bertea 2011). AVE is used to investigate the average amount of variance in the observed variables that is explained by the latent construct. The value of AVE should be > 0.50 or higher for each construct to demonstrate convergent validity (Fornell and Larcker 1981;Farrell 2009).
AVE is then utilized to test whether the square root of the average variance of each construct is larger than the shared variance, i.e., square of the correlation, between The AVE test was calculated according to the formula presented by Fornell and Larcker (1981, p. 46), which takes the measurement error of the variables into account two constructs within the model. If the AVE for each construct exceeds its shared variance with all of the other constructs, the test supports discriminant validity for the model (Farrell 2009;Fornell and Larcker 1981, pp. 45-46). The AVE for each of the five gameplay activity preference factors was > 0.50 and greater than the shared variance between it and other factors. This result together with factor loadings and composite reliability (CR) estimates supports convergent and discriminant validity of the five-factor model of the 15-item GAIN (Fig. 2, Table 3).

Player clusters and game choice
We continued to perform a cluster analysis for the combined data from Finland, Canada, and Japan (N = 3379) to investigate whether player types could be identified based on players' gameplay activity type preferences (RQ2). We computed z-standardized factor scores for each participant by using the confirmed 15-item GAIN. Next, the standardized scores were utilized in a complete linkage cluster analysis. The Calinski-Harabasz pseudo-F stopping rule identified six player clusters in the data. Descriptive statistics of the background variables for each player type are presented in Table 4. We can observe from Table 4 that the six player types have distinctive characteristics in regard to cluster size, weekly play time, gender proportion, motivations, challenge type and gameplay activity preferences. Player Type 1 (447 participants, 57.5% men) showed a high preference for Management, and low for Coordination. In contrast to this, Player Type 2 (279 participants, 60.2% men) strongly showed enjoyment of Coordination but showed a slight dislike for Management and Caretaking. Both of these player types were over-represented in Japan.
Player Type 3 (647 participants, 24.9% men) favored Caretaking but did not enjoy Aggression. Similarly, Player Type 4 (588 participants, 40.3% men) despised Aggression, but this player type showed a moderate preference for Exploration, and also some for Coordination. Player Type 3 was under-represented in Canada. In contrast to this, Player Type 4 was over-represented in Canada and also under-represented in Finland.
Player Type 5 (592 participants, 46.5% men) enjoyed Exploration very much but showed a mild dislike for all other dimensions of gameplay activities. This player Gameplay preference differences between the player types are calculated as d x k =x k −x x · 100%, wherex k is the mean of analyzed variable x (e.g., Aggression, Management) in player clusters k (e.g., Type 1, Type 2) andx the sample mean of analyzed. Challenge preference, motivations to play, and interest in games are reported as mean sums on 5-point Likert scales. Weekly play is reported as mean play hours type was over-represented in Finland. Finally, Player Type 6 (784 participants, 80.8% men), which was under-represented in Japan, favored Aggression and moderately Exploration, but showed dislike for Caretaking. A total of 42 participants were not included in any of the six clusters, because they had replied similarly to all of the 15 items of GAIN.

Comparing GAIN and CGD player types
We then continued to study RQ2 by comparing the six GAIN player types with the seven CGD player types presented by Vahlo et al. (2017). The purpose of this analysis was to explore whether the player types in these two studies were congruent. We were Table 5 GAIN player types as allocated into the CGD player types Vahlo et al. (2017), based on the highest correlation between the z-standardized GAIN factor vectors per each participant and the mean factor scores of the seven CGD player types The Pearson's correlations are are reported in parentheses also interested in identifying whether the CGD player types could be argued to be persistent.
To compare the player types of these two studies, we classified respondents of our current sample into the seven CGD player types presented by Vahlo et al. (2017). This was done by calculating the Pearson's correlations between the vectors of our study participants' z-standardized factor scores and the mean factor scores of the seven player types that Vahlo et al. (2017) reported in their study. Participants of our study were associated with the CGD player types according to the highest revealed correlation (Table 5).
Based on the results we report in Table 5, it can be concluded that GAIN player types share clear similarities with CGD player types. Player Type 6 showed a very high correlation with the mean profile of The Mercenary type. Player Type 2 was very strongly correlation with The Patterner type, and Player Type 4 had a very high correlation with both of The Explorer and The Companion type. Player Type 5 was strongly correlated with The Adventurer type, and Player Type 3 with The Companion type. Finally, Player Type 1 had a moderate correlation with The Commander type and also with The Companion type.
Based on these findings, we decided to label the GAIN player types according to the CGD player personas: Mercenary (Player Type 6), Adventurer (Player Type 5), Explorer (Player Type 4), and Companion (Player Type 3). Since Player Type 1 was correlated not only with The Commander type, but also with The Companion type who dislikes Aggression, we decided to label this player type Supervisor, because the latter term is not associated with military activities. Similarly, Player Type 2 was correlated not only with The Patterner type, but also with The Daredevil type. Since The Daredevil enjoys both Coordination and direct action, including, e.g., racing and sports (see Vahlo et al. 2017), we decided to call this player persona Acrobat.

Game choices of the six GAIN player types
The surveys conducted in Finland, Japan, and Canada included questions that enquired about respondents' habits of playing games by asking how often they had played games of different genres during the previous 12 months (17-item scale of which we report here 11 items on the 5-point Likert scale, 1 = not at all, 5 = very often). To investigate how GAIN can be utilized in predicting game choice (RQ3), we designed a model where the outcome variable was the player type and binary transformations of RQ3 were used as predictor variables. In transformation, a variable gets a value of 0 if the respondent never plays that genre (original value = 1) and 1 if she/he plays it at least sometimes (original value > 1). Gender and age were included in the model as control variables. Predictive margins of each genre and player type are presented in Table 6.
A habit of playing simulation games and strategy games predicted that a person belonged to the Supervisor cluster. Also, playing sports games, racing games, and puzzles negatively predicted the identity of the Supervisor. In contrast to this, playing sports games and party games strongly predicted the Acrobat type, alongside a lower age.
A habit of playing party games also strongly predicted the Companion type, but the single strongest predictor for this player type was being female. Also, playing simulation games predicted belonging to this player type, but playing action games, action-adventure games, role-playing games, and strategy games all predicted negatively that one was of this player type.
The predictors for Explorers were higher age, being female, and playing sports games, party games and platformer games. Playing strategy games and role-playing games negatively predicted this player type. Being of the Adventurer type was predicted by the habit of playing role-playing games and action-adventure games. However, playing action games, racing games, simulations, and sports games were all negative predictors for the Adventurers. Finally, playing action games, racing games, strategy games, and being a younger man all predicted the Mercenary type, while playing puzzle games, simulation games, and party games were negative predictors for this player type.

Discussions and conclusions
The purpose of this study was to develop and validate a psychometrically sound short instrument for modeling player profiles based on gameplay preferences, and demonstrate the usefulness of this approach in predicting players' game choices. This was done by forming an approach to the structuration theoretical and TSQA views on media choice (Webster 2011(Webster , 2014Wolling 2009) and by empirically investigating how players appreciate recurrent gameplay activities in contemporary digital games. Positive values in gender are associated with the female gender. The percentage of the respondents (N = 3337) who reported to play specific game genres at least occasionally are reported under each genre label in parenthesis * p < 0.05, ** p < 0.01, *** p < 0.001 We investigated whether the five-factor structure of gameplay preference dimensions presented by Vahlo et al. (2017) could be validated by developing the gameplay activity inventory (GAIN) and by conducting a CFA with 15-item version of the inventory (RQ1). The CFA consisted of five 3-item measurements for the hypothesized latent constructs of Aggression, Management, Exploration, Coordination, and Caretaking. The CFA showed a close fit to the combined data from Japan and Canada, and also to both of these data sets when investigated separately. By conducting a CFA, we were able to confirm the five distinctive, but related, dimensions of gameplay activity preference types (H1), and we demonstrated that the GAIN items are valid indicators for measuring the five factors of gameplay activity appreciation (H2).
We then conducted a cluster analysis (N = 3379) and observed that the 15-GAIN could be used in constructing player types (RQ2). By comparing the factor score vectors of the respondents of our sample with the mean factor score vector of the CGD study by Vahlo et al. (2017), we were able to conclude that the six GAIN player types were mostly similar with the seven CGD player types. Four of the GAIN player types had a very similar profile to some of the CGD player personas. Therefore, we labeled these types according to the CGD model: Mercenary, Adventurer, Companion, and Explorer. Two of the GAIN player types had notable differences in their qualitative characteristics in comparison to the CGD types. We labeled these two personas as Acrobat and Supervisor.
The six player types differed clearly in their size, gender proportion and gameplay activity type preference profiles (Table 4). Also, it should be noted that the Mercenary type was clearly more typical for men than for women, and that the great majority of Companions were women. However, the Adventurer type and the Supervisor type were almost equally represented by both men and women, and the gender gap between the Acrobat and the Explorer types was only moderate. Mercenaries reported to be more interested in games and motivated to play than the other five player types. This player type was also the only one that had a higher mean preference score for gameplay activities than for gameplay challenges (Table 4). Indeed, Explorers who spent the least time playing digital games, reported a stronger preference in gameplay challenges than Mercenaries. Interestingly, the Mercenary type had the lowest mean age and the Explorer had the highest mean age, which indicates a correlation between higher age and preference in gameplay challenge types.
Adventurers reported a high interest in games, but only from the perspective of the Exploration type of gameplay. This result is in line with what Vahlo (2018) has recently argued: players who are fascinated by the story elements in games find games more meaningful than players who do not appreciate the dimension of fiction and fantasy in gameplay. The GAIN player types demonstrate that stereotypical male and female players may exist in the large player clusters of the Mercenary type and the Companion, but also that most of the players do not fall into these two player clusters but instead into player types that have quite similar proportions of both male and female players.
Finally, we conducted a multinomial logit regression analysis to study whether the GAIN player profiles could be utilized in predicting game choice (RQ3). The main result of this analysis was that players' habits of playing games of different genres do predict their player type, and thus it can be argued that the GAIN model can, in principle, be utilized in targeted marketing and in generating game recommendations for identified player profiles. It is worth noting that all genres other than adventure games were significant predictors for the GAIN player types ( Table 6).
The GAIN player profiles are linked to genres people play, which validates the potential of GAIN in predicting game choices. Importantly, the six player types are associated with a habit (or the lack thereof) of playing multiple game genres. Therefore it cannot be concluded that Mercenaries equate to action game players or that Adventurers are merely RPG players. Indeed, among Mercenaries prevalence of playing puzzle games is higher than playing action games (Table 6). This result shows the limits of using only data on genre playing habits in player profiling and in generating personalized recommendations.
These results (Table 6) can be argued to be partly consistent with the findings by Greenberg et al. (2010), who found that male players preferred physical games (action, racing, and sports) over imaginative role-playing games and puzzles, whereas the latter were the most favored game types by females. Furthermore, Terlecki et al. (2011) found that both men and women enjoy adventure games, which can be argued to include many elements of Exploration.

Limitations of the current study
Similar to the game dynamics approach by Vahlo et al. (2017), the gameplay activity inventory (GAIN) is influenced by the data utilized in developing the inventory items, and by researchers' interpretations. Notably, the data of 166 game instruction manuals did not cover mobile game titles, which may suggest that GAIN is currently best adapted for investigating non-mobile gameplay preferences and player profiles. Relatedly, the inventory may be a better fit for investigating players' preferences in non-mobile games because many mobile games only include a few gameplay activities.
GAIN is primarily an approach for identifying player profiles ), gameplay preferences, and game choice, although still questionable is how well a respondent's GAIN responses predict which games a player chooses to play as she browses through her options, e.g., in game marketplaces. Investigating this question would require that the GAIN model would also be utilized in analyzing behavioral data instead of mere survey data made up of players' reflections on their own gameplay preferences.
Like other applications of the TSQA, the GAIN instrument is perhaps not wellsuited for those individuals who have no or very little experience in playing digital games. A person may find it difficult to reply to a questionnaire about different types of gameplay if one lacks first-hand experience of playing games.
It must also be noted that, although the five-factor structure was confirmed in this study, the player types remain context-dependent and unstable, due to the nature of available clustering methods. In other words, the five orientations of gameplay preferences can be utilized widely in game research, but player clusters should be constructed by taking the unique characteristic of each sample into the consideration.

Directions for future research
Video game genres are primarily separated from each other based on the kinds of interactivity they afford for the player in gameplay (Apperley 2006;Landay 2014). Research on game preferences and gameplay type preferences thus differs from other player profiling methods, e.g., from studies on motivations to play and player behavior research, by providing tools for understanding players' habitual game choices. Because of this, the gameplay type preference approach can inform the game industry in how to better design, develop and market games with better market fit.
The GAIN model could also be utilized in developing game recommendation models based on review-based player profile building, review-based game profile building, or both (see Chen et al. 2015). For this purpose, the GAIN approach could be imple-mented in analyzing users' behavioral data on, e.g., game marketplaces or game streaming services. By combining the GAIN model with meta-data of game titles (e.g., genres and tags) and players' game ratings, the approach could make use of both content-based filtering and collaborative filtering. The GAIN approach may also provide more insight into latent trends in gaming cultures and usages of different game technologies.
In future research, it would be important to investigate how the gameplay type preference approach may open new horizons for understanding expenditure of time and money on digital games. For this purpose, the relationships between gameplay activity preferences, gameplay challenge preferences, and gameplay aesthetics preferences should be studied. It would also be relevant to analyze how the gameplay approach at large relates to motivations to play, expected outcome gratifications (LaRose 2009), and gameplay behavior style types. For instance, GAIN could be used in player modeling studies that aim to capture patterns in dynamic player-game interaction. By doing so, researchers could investigate how players' game choices correlate with their gameplay choices (Cowley and Charles 2016), and how a player's gameplay preference profile relates to her personality traits and temperament type.
Another important theme for future research would be to investigate how distinctive gameplay activity preferences, for instance, a preference for Aggression, correlates with, e.g., appreciating sensory realism (see Shafer et al. 2014;Ribbens et al. 2016), specific motivations such as immersion (Lin and Peng 2015;Yee 2006), and possible positive and negative media effects, such as aggressive outcomes beyond gameplay situations (e.g., Krcmar et al. 2011;Valadez and Ferguson 2012;Jeong et al. 2012;Zumbach et al. 2015;Greitemeyer 2018).
GAIN should be further developed by exploring how the inventory manages to cover mobile gameplay preferences. Also, it would be crucial to investigate whether GAIN could offer an alternative take on game genres. It is well known that established genres such as 'action games' or 'strategy games' are very general, and game companies and consumers alike have difficulties in identifying which game genres represent specific games the best. Future research should study whether GAIN could help with this problem-which could thus help game companies with marketing and providing new tools for game discovery.
We revealed in this study that there are five facets of gameplay appreciation and that these dimensions are cross-culturally shared since similar factor structures were identified in Finnish, Japanese, and Canadian samples. Players representing different cultural backgrounds evaluated qualities of digital gameplay according to the dimensions we call Aggression, Management, Exploration, Coordination, and Caretaking. We propose that both the 15-item and the 37-GAIN are valid instruments for investigating players' gameplay activity preference factors and modeling player types. and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.