Number of Papers
The initial result of the research indicated a total number of 1053 papers, out of which 89 were extracted by reading the title, abstract and keywords. The refining of the paper continued by reading the full text of the studies, and a total number of 32 studies were selected and used as the basis for this SLR. Table 5 shows the number of initial results along with a final number of extracted papers. After the data extraction phase, all of the studies have been given a unique identification number to enable easier referencing for further analysis. In the rest of this paper, the studies are referred to as in the form of their identity key from (S1) to (S32), presented in appendixes.
In this section, we have present the number of papers based on the two following categories:
Number of papers per year.
Number of papers per topic.
The idea here is to understand the age of the research topic in general, and the trends of the subject, being the convergence of machine learning and gamification. This approach of analyzing the study topic trends based on literature is also presented and deployed in other studies, such as Kitchenham et al. (2009) and Sánchez-Gordón and Colomo-Palacios (2019).
In this part, we present the number of studies distributed by their publishing year. In this systematic literature review, we have retrieved the data for all the previous years. Interestingly, the first paper was published in 2014, which shows that our topic of interest emerged recently. In 2018, researches focused more on the topic by presenting 15 studies, followed by 4 papers published in 2019. The number of studies per year is demonstrated in Fig. 1.
The above presentation demonstrated the emergence of the topic in last 6 years. After a slight attention in two years of 2014 and 2015, researches shed more light on the topic in 2016. There was a noticeable increase during this year, after which the hype was again disappeared. However in 2018 more number of studies worked around this topic that can be due to recent improvements in various technologies, such as machine learning, gamification, sensors, mobile devices, etc.
Dispersion of the Topics in Literature
The next investigation carried out was on the dispersion of the topics. The results show that there is a considerable amount of studies focused on learning, with 16 papers comprising more than 40% of the literature.. This introduces the potential of the research topic in learning and education and the interest of corresponding researchers. Personalization is the next topic that has been the focus of the studies. One of the major goals of gamification tasks has been to support human behaviors. But the problem is normally the long-lasting effects of behavioral change stimulants. In this regard, machine learning has contributed to the customization of the user experience in order to encourage the continuation of user engagement. The following topic in the list includes studies that focus on behavioral change. The following interesting topic is crowdsourcing. Interestingly, in these studies the direction of the convergence was in reverse order, that is, from gamification towards the improvement of machine learning. These attempts were to improve the participation rate of the users in labeling the required data for the machine learning training step. Other efforts were summarized in affective computing, sentiment analysis, health and medical activities, and lowering energy consumption. Figure 2 shows this dispersion.
Answer to Research Questions
In this section, we perform a detailed analysis of the studies based on the research questions. Section 4.4.1 investigates the papers to answer the “RQ1: What is the reported usage of machine learning in gamification?”. Next Sect. 4.4.2, scrutinizes the answers to “RQ2: What is the reported usage of Gamification in Machine Learning?”. We then present the answer to “RQ3: What are the reported effects of using Gamification over Machine Learning?” in Sect. 4.4.3. Followed by the response to “RQ4: Which aspects of Gamification are affected by machine learning?” in 4.4.4. Finally, “RQ5: Benefits and challenges in connection with machine learning and gamification?” is addressed in Sect. 4.4.5.
RQ1. What is the Reported Usage of Machine Learning in Gamification?
To answer this research question, we identified the papers being in the direction of machine learning towards gamification. In this context, papers that attempted to use a machine learning technique or approach in order to enhance and support gamification tasks are identified. Following the identification of these studies, we further inspected each paper to find out the corresponding machine learning concept applied.
We have found 3 main areas of application based on which corresponding studies have been analyzed. These areas are presented as follows: 1. Learning, 2. Personalization, 3. Behavioral Change. Table 6 shows the goals of the reviewed papers in each category specific to the RQ1. However, there are papers in other applications that we address in the next research questions.
Affective state recognition is claimed to have an influence over optimization of the learning process and its outcomes by providing learning interaction, for example in a gamified learning that can be personalized. In this regard, authors in (S10) presented a method towards retrieving the affective state of a student while interacting with a serious game learning platform by applying machine learning (Ghaleb et al. 2018). In this work, the so-called model Theory of Flow is utilized to link the affective state of the student to the user-platform interaction. This model presents three states of boredom, engagement, and frustration. A support vector machine algorithm is used to train a classifier to distinguish between different affective states. Authors have reported precision of 67%. Hence, the machine learning technique can be applied to predict the affective state of students when interacting with a gamified learning platform.
In a different scenario, machine learning and gamification can work together to facilitate learning. For instance, authors in (S11) developed an application in which deep learning and gamification are used to assist 3-4 years old children to learn generalizing objects (Suresh et al. 2018). They have used an approach called joint-embedding visual question answering, leveraging on a convolutional neural network (CNN) and a stacked recurrent neural network (RNN) called long short-term memory (LSTM). The application of machine learning in this gamified learning platform helped to extend the learning context by making the game elements dynamic and intelligent, hence optimizing the learning process.
In a similar effort towards the advancement of learning processes, authors in (S12) designed a serious game in order to assess the medical student’s knowledge level (Lima et al. 2016). Various diagnostics were simulated through a gamified virtual reality assisted platform. Gamification features were used to motivate users to work with the system for a longer duration. On the other hand, a machine learning algorithm was employed to develop a disease classification model in order to assist the working of the platform. The researchers of this study used the freely available machine learning API, so-called Weka, which offers a set of machine learning algorithms.
One of the techniques to improve gamified applications is the personalization of the game elements that will be discussed in detail in the next section. With regards to the learning platforms along with the adaptation technique, personalization is claimed to be an effective solution as well. As an instance, authors in (S19) proposed an approach towards systemizing the selection of personalization strategy with the help of machine learning (Knutas et al. 2018). In their demonstration phase, they used a CN2 rule induction algorithm to model a classifier to distinguish between various situations that take place in a computer-supported collaborative learning context (CSCL), followed by the recommendation of a personalized gamification activity.
Furthermore, assessing student performance while interacting with a gamified learning environment can assist professors in dynamically changing the learning environment to adapt the condition concerning student’s performance. (S20) proposed a method to achieve this task with the help of machine learning methods (Barata et al. 2015). The authors in this study used the data from a gamified learning task to perform a clustering technique grouping students’ performance into different categories, namely, achievers, disheartened, underachievers, and late awakeners. They have used an algorithm called expectation–maximization (EM) and claimed that by using this student clustering technique, it is possible to predict student behaviors in the early stages of their interaction and hence, an adaptive and smart learning environment can be developed.
The above-mentioned work was focused on the performance assessment of the students while interacting with the platform to proactively improve the performance of the students, whereas, a more constructive manner of improving gamified learning platforms is to improve the performance using an adaptation mechanism. (S21) presented an approach where machine learning methods and algorithms are applied in order to adapt the game content to specific characteristics of every user, assisting with the learning process (Stefanidis et al. 2019). They have used a variation of a machine learning algorithm ϵ-greedy called, ϵ-decreasing algorithm to obtain the engagement profile of the user.
Last but not least, are some of the specific subject learning platforms that require some type of machine learning methods to implement a required application. For instance, the work presented in (S31) is an attempt towards assisting in music education, air-violin self-learning specifically, (Dalmazzo and Ramirez 2017). Authors in this work deployed two machine learning models created using decision trees and hidden Markovian and developed a fingering recognition model in a gamified virtual violin platform. The study presented in (S38) used gamification and machine learning to track child’s brain development and to participate its improvement (Anparasanesan et al. 2019). Authors of this study provided an approach in which the K-means Clustering algorithm of machine learning is used to identify the initial child’s brain status and suggesting a proportional brain game. The proposed solution starts with an initial evaluation of the child’s brain status and assess if the current brain development is proportional to the child’s age. Then, a data set is created from the initial evaluation which is then undergone the unsupervised learning algorithm of K-means clustering to identify suitable gamified tasks. Various tasks have been designed as memory games, attention games, games for concentration, and games for response time. The study demonstrates that ML can be used to provide suitable gamified tasks to particular target group based on their cognitive abilities.
One of the areas in which machine learning was used to optimize the results of gamification was personalizing the gamified tasks for each user to adapt the gamification aspects. In this regards, the study in (Teasley 2017) presented that the one-size-fit-all feedback system does not always perform satisfactory and can have mixed results, and hence a personalized feedback may moderate the negative effects. Related to gamification, the authors in (S4) used machine learning to predict the performance of each user and to adapt the complexity of the gamification task to the respective user (Lopez and Tucker 2018), they used the facial key point data in conjunction with a support vector machine algorithm and predicted the performance of each user by 76.8% accuracy.
Adding to that, another way of personalizing the game contents using the performance of the users is by understanding the performance profile of each user and customizing the game contents, accordingly. Authors in (S20) proposed a technique towards identifying the performance profile of each user by utilizing a machine learning-based clustering method to personalize the game contents based on the potential of each user (Barata et al. 2015). This approach is also an attempt towards the adaptation of the gamified platforms that can improve the engagement of the users with the gamified task.
Therefore, the adaptation of the gamified platform is another mechanism that tailors the game content with respect to the context’s specific situations. As stated in Knutas et al. (2018), adaptive gamification differs from personalized gamification. In fact, adaptation refers to the gamified system react to different situations, whereas personalized gamification is modifying the game contents based on users’ specific characteristics. Although, the work presented in (S21), developed a system called an adaptation manager that is capable of identifying the player characteristics and adjusts the game contents accordingly with the help of machine learning algorithms (Stefanidis et al. 2019). However, they have also created some kind of adaptable game scenarios and elements that can be selected at some specific points of interaction with the game and are offered by the adaptation manager framework of the system. The adaptation mechanism has two parts, online and offline adaptation mechanisms. The offline mechanism is used to assist the users’ in-game performance, whereas the online mechanism is concerned with the level of engagement of the user.
In a different perspective, authors in (S33) considered the problem of personalizing the mobile crowdsensing processes (Karaliopoulos et al. 2016). They believe that the main concern with the crowdsourcing activities is to increase the level of contribution of the users. This contribution is subject to various types of parameters such as incentives provided to users and it’s proportionality to the practicality of the task. To overcome the challenge of optimizing the task proportionality to the incentives offered to the users and maximizing the chance of user contributions, authors proposed a novel approach of using the machine learning technique of logistic regression. They provided an approach of modeling the past user behaviors in previous crowdsensing applications, trying to predict the optimal pair of (task, incentive) for the current target. The proposed solution was evaluated against the real data of an online questionnaire approach to collect user preferences and the results have shown to be promising in terms of level of contributions. In another effort, (S36) proposed a machine learning based technique to automatically analyze serious games by capturing players behaviors (Palavalli et al. 2014). Authors believe that by identifying and analysis of players activities through a video based technique, it is possible to determine some of the parameters influencing the learning potentials of serious games.
(S42) carried out a study on encouraging children to do more physical activity using a personalized gamified feedback system (Schäfer et al. 2018). The machine learning techniques of SVM and random forest are utilized to classify the initial activity level of a child prior to the usage of the system. The classification models are created by learning over a pre-labeled mobile sensor data comprising children normal activities. Then, based on the users specific activity class, a personalized gamified feedback is provided to the users. The feedback system is based on visualizing the activity level of the user by means of showing an Avatar and a motivating message to the user. At the end of each day a bar chart showing the progress of the user is also provided to increase the awareness. In the experiment conducted in this study, Random Forest outperformed SVM with higher accuracy and made the personalized gamification task more engaging.
Another effort towards optimizing the engagement of the users in a gamified platform is to statically configure the game difficulty. Adjusting the difficulty of the game statically is defined as configuring the game difficulty prior to the start of the game based on previous game play data of various players. This is against the dynamic adjustment of the game difficulty during the game based on the user performance. This approach is carried out in the study presented in (S37) by Khajah et al. (2016). The authors of this study proposed the application of a machine learning technique known as Bayesian optimization to manipulate the game difficulties. They have distinguished between two types of manipulations, namely, overt and covert. Overt manipulations are those that players can feel during the game play, in contrast, covert manipulations are those that are less visible and includes some aspects of the game that the player cannot distinguish. Authors argued that, the overt manipulation does not have any effect on user engagement compared to the covert manipulation that shown improvements in engagement in the experiments they conducted.
Rather than predicting the performance of the users in order to personalize the game content, one can understand the affect of the person interacting with the gamified platform in different stages of the game to learn the reactions of the user in different scenarios. This task is called affective computing and is discussed in a later section. But what is important here is that this approach leads to personalization, as well. In the work presented in (S10), authors performed a subject-based analysis to evaluate the adaptive nature of the learning process and highlighting the employment of interaction features towards creating a customized and personalized learning environment (Ghaleb et al. 2018). They have presented an accuracy of 74% using a support vector machine algorithm in recognition of the corresponding affective states.
(S43) is another effort to presented a framework for developing a language learning platform (Lungu 2016). The study is focusing on learning vocabulary of a new language by combining the free reading exercises and optimal repetition of learned concepts. The system comprises various modules. First, a machine learning agent that evaluates the present knowledge of the learner based on previous interactions of the user with the system. Second, a motivator agent that utilizes gamification elements to provide suitable feedbacks to the learner in order to keep the learner engaging with the system. Therefore, setting the feedback dynamically based on users performances personalizes the user experience and improves the over system efficiency.
Above all of the advantages of personalization, tailoring gamified designs and contents to each of the corresponding users is not an easy task to achieve. Hence, there have been attempts to simplify and structure the process of developing personalized gamification. For instance, (S19) proposed a technique in which personalization is carried out with the help of a machine learning algorithm-based content selection (Knutas et al. 2018). Author suggested that to overcome the difficulties of selecting personalized contents, gamified platforms can benefit from the machine learning-based algorithms to automate personalization. Furthermore, this approach may convert the process of personalized content selection into systematic and repeatable means.
Despite the difficulties of deploying personalization for gamification tasks, it has been one of the most prominent reasons for applying machine learning methods in human–computer interaction systems. Authors in (S29), proposed a system in which machine learning is used to personalize a gamified In-vehicle human–machine interface (Di Lena et al. 2017). In this system, a prototype is developed that is equipped with a dashboard offering personalized challenges to the drivers based on their estimated energy consumption that has been found with the help of machine learning methods.
Nevertheless, verifying a behavioral change success is impractical in the short-term and requires the long-term engagement of users to be able to judge on the behavioral change attainment. To achieve this goal, there have been several attempts to personalize the game content in order to motivate users to continue using the system for a longer duration. One of which used machine learning methods is (S30) that developed a framework known as procedural content generation (PCG), which is a solution to sustain the interest of players by tailoring the game contents based on the specific users’ profile and characteristics (Khoshkangini et al. 2017). The framework uses machine learning methods to improve the challenge selection based on the users’ historical interactions with the system, eventually recommending new challenges that can be suitable, according to the players’ profile and characteristics.
Applying gamification for persuasive technologies to foster behavioral change activities have gained a lot of attention in recent years (Kappen and Orji 2017). Furthermore, promoting environmental sustainability practices taking from energy saving to pollution control has been the focus of many studies and researches as well (Akasiadis et al. 2015; Tserstou et al. 2017; Konstantakopoulos et al. 2019). As an instance, (S17) provided a gamified interface to promote renewable energy usage by residential buildings (Akasiadis et al. 2015). They used machine learning methods to provide a forecast possible electricity consumption rescheduling, hence motivating residents to take appropriate action towards the goal of the gamified task. They have trained various regression algorithms and compared them to choose the best one. They have concluded that Support Vector Regression is the best for their application, since it trains fast and is scalable compared to other regression algorithms and neural networks.
Another attempt towards achieving behavioral change is presented in (S22), where authors used machine learning methods in a serious game mobile application that identifies stairsteps and encourages people to use stairs instead of elevators (Ciman et al. 2016). This work aims to increase peoples’ daily physical activity with the help of a smart serious game. Authors in this work developed a mobile application that holds a game that records and analyzes the data from smartphone sensors. The application also counts the stairsteps taken by the user to provide persuasive game elements. The task of recognizing stairsteps is a classification problem that is implemented using three machine learning algorithms, namely, decision trees, K-nearest neighbors (KNN), and kernel optimization of the margin distribution (KOMID). KOMD combined with smoothing the data, demonstrates a better result comparing to other algorithms. The algorithm shows a precision of 91%.
Machine learning-based classification of a phenomenon from the data retrieved from a gamified user interaction has been simultaneously used as a method in various studies. Serious games normally are used to provoke users to provide some specific type of data that can later be used for the desired application. This approach is also used for the intention of some behavioral changes. As an instance, (S24) used a mobile game application to deliver oropharyngeal exercises to treat snoring (Goswami et al. 2019). A machine learning support vector machine classifier is used to classify the extracted recordings of snore from the participants. The authors in this study conducted a randomized controlled trial over 16 participants with habitual snoring to play the game daily. Results are shown to be successful after 8 weeks of trial, as reported by the bed partners of all the participants.
One of the main contributions of behavioral change studies has been towards provoking buildings’ human occupants to use energy more efficiently and effectively. In this regard, gamification approaches have been used by many researchers to create an interface for occupants to interact with the energy usage of buildings, hence incentivizing the energy-efficient behavior (Konstantakopoulos et al. 2019).
Another area, in which being a behavioral change influencer was the study`s focus, is encouraging Eco-Driving behaviors. As an instance, (S29) fosters Eco-Driving behaviors by proposing an In-vehicle dashboard, functioning based on machine learning and gamification techniques (Di Lena et al. 2017). Researchers in this work provided various gamified challenges based on the driver´s braking style. In other words, the system identifies the user’s driving behavior and predicts future battery usage and offers some choices of battery saving behavioral change activities. Although the results of this work are still under investigation, it demonstrates to have noticeable outcomes. Staying in driving behavioral change, the study presented in (S39) developed a solution for recognizing drivers way of holding the steering wheel and providing them suitable feedback to provide an awareness that leads to behavioral changes (Raptis et al. 2018). They have used a support vector machine to identify drivers attentiveness by classifying them to be either as attentive or inattentive. Finally a gamified feedback system is provided at the end of the driving session to make the users aware of their steering wheel holding habits. Interviews with the users demonstrated that drivers are practicing dangerous behaviors during driving that they are not aware of. This work presented the application of ML to provide suitable gamified feedback to users towards possible behavioral change activities.
RQ2. What is the Reported Usage of Gamification in Machine Learning?
To answer RQ2, we first identified those papers with the direction from gamification towards machine learning. Studies that comprised of using gamification aspects and mechanics in the machine learning process were taken under inspection in this section.
Machine learning methods are well known for their power in learning from the data and predict future values for some desired target variable. However, this process requires a vast amount of labeled data, and in this regard, gamification aspects were used in combination with a crowdsourcing approach, extensively by many researchers, to produce labeled data. One of such is (S2), where a gamification framework for sensor data analytics was proposed (L’Heureux et al. 2017). They used gamification to motivate users to perform targeted action through the use of gaming mechanics. The action is in fact, labeling the sensor data which later was gone under a supervised classification problem by the K-Nearest neighbor algorithm. The authors reported improved sensor data analytics with 88.6% of accuracy.
In another data collection task, authors in (S8) used a serious game for retrieving data regarding cognitive neuroscience (Murphy et al. 2018). Their goal was to quantify cognitive aging and performance in a home situation, where they utilized a game with the purpose to make data analysis possible by collecting desired data at home. Then, they used random forest and linear regression algorithms to analyze the collected data. Other data collection tasks were (S15), where authors utilized gamification aspects by providing a web game, supporting it with a deep learning algorithm to create a facial emotion dataset (Li et al. 2016). They reported a classification accuracy of 80% by deploying the CNN algorithm over the dataset collected through the proposed game with a purpose.
The study presented by (S34) used gamification element of providing users statistics to encourage users to engage with a mobile application (Urh and Pejović 2016). This application collects and labels user’s data that is later utilized by machine learning techniques. Hence, the gamification power of motivating users have been utilized to higher the quality of data collection for the purpose of machine learning. However, the authors of this study did not measured the increased level of engagement properly.
On the other hand, gamification and machine learning were used considerably in the learning and education context. (S3) utilized the power of gamification to improve the engagement of students in the learning process in teaching them machine learning. Gamifying the learning process to translate the complexity and technical knowledge of machine learning was achieved in this study (Sakulkueakulsuk et al. 2018). They reported that students had more fun, engagement and hands-on interactivity while learning ML. In the same criteria, (S5) and (S7) proposed a game with purpose with badge and leaderboards of gamification elements to teach machine learning to students through a gameplay activity. All of these studies gamified the learning process to minimize the hassle in learning technical contents, especially for non-technical students (Anderson et al. 2014; Rattadilok et al. 2018).
From a different viewpoint, authors in (S40) believe that gamification and machine learning can be used together to access users private data (Acharya et al. 2019). They have anticipated that it is possible to put users into situation of providing their private data unintentionally. To support their argument they have developed a mobile application that motivates users to play a game which is getting them into performing particular patterns to collect and create training data tailored to the user. At the same time, the authentication pattern of the mobile phone is recorded every time user log into his phone in background. A possibility that is available in Android devices. Later, ML algorithms, namely support vector machine and logistic Regression is used to predict the users lock pattern. Therefore, once again gamification has been used for data collection required for machine learning activities. However, this time a malicious target have been followed.
Until now, most of the efforts regarding the usage of gamification to assist with machine learning tasks have been around the data collection and labeling tasks which are considered to be based on the supervised machine learning problem. However, other types of machine learning problems, being unsupervised and reinforcement learning can also benefit from the advantages of gamification. Holzinger (2016) presented the concepts of interactive machine learning (iML), where the human agents or human-in-the loop can interact with the algorithm and optimize its learning process through this interaction. Adding to that, he considered the unsupervised learning as an automated ML (aML) approach, since there is no human interaction with the learning process, although he mentioned that, in an unsupervised learning task, the human expert can verify the results of the algorithms at the end of the ML-pipeline. This is exactly where gamification can participate in verifying the results and assisting the optimization of the algorithm, however, to the best of our knowledge, there have not been any studies in the literature deploying this potential. Another situation is where the gamification is used to help human intervention to label parts of the data which turns an unsupervised learning problem into another type of ML problem called semi-supervised learning. However, this is also a topic that is not addressed by the research community as well.
RQ3. What are the Reported Effects of Using Gamification Over Machine Learning?
To answer this research question, we investigated the studies which attempted to use gamification in order to enhance a machine learning process. There have been several studies in this regard, that are discussed in this section.
(S7) is an attempt to create a video recommender system in which authors used sentiment analysis of the user comments about every video (Mulholland et al. 2015). Comments have been under a machine learning method using Google API to carry out a sentiment classification for video recommendation. In this context, gamification aspects, namely, badges and leaderboards have been used to encourage user interaction with the system. Authors reported that using gamification, facilitated the encouragement of user interaction, facilitating in turn the Machine learning sentiment analysis task. In the same context, (S10) reported an optimized level of engagement in an attempt to develop an affect recognition system (Ghaleb et al. 2018). They used machine learning SVM algorithm in combination with a serious game to identify and classify users’ affections. Another study reporting the same effect of gamification over machine learning is (S16), where a higher engagement rate through gamification resulted in a higher detection rate of student engagement recognition in prosocial games (Psaltis et al. 2018).
Moreover, the effect of gamification in improving the prediction accuracy of the machine learning method is reported by some of the studies. For example, (S9) reported such an improvement in the application of predicting student profiles (Barata et al. 2016). They used gamification aspects such as points and leaderboards to make early student profile detection possible, which improves the accuracy of machine learning algorithms. In a different scenario, (S41) introduced a solution for people with deaf blindness that have combined issues with vision and hearing senses (Korn et al. 2018). They have developed a system that uses machine learning for object and face recognition and environmental sensing, which are then provided to the users by means of haptic communication through smart textiles. At the same time, the proposed solution is empowered by gamification techniques to be more engaging and joyful for people with these sever conditions. Here, gamification and machine learning are applied as two separate technologies to provide a beneficial solution to some specific target users.
To conclude, it can be seen in the literature that the usage of gamification provides the required data for later analysis with the help of machine learning. In fact, it is the process of gamification that made machine learning analysis possible. The same is reported by Murphy et al. (2018).
RQ4. Which Aspects of Gamification are Affected by Machine Learning?
This research question is in fact, contrary to the previous one and investigates the contribution of machine learning in enhancing gamification processes. There have been few studies reporting and considering the effect of machine learning in gamification that are discussed in this section.
In general, the common issue with gamification processes is the fact that one game does not fit all. This is where various users have different capabilities, preferences, potentials, and characteristics, and they require the gamification design to be configured for them accordingly. This is the case when gamification is to be used in learning and education applications. Different students have different capabilities and characteristics, making it difficult to have a gamified task that serves all of them equally. Therefore, providing a one-time configured gamified application may result in inefficient interactions with the users, making it a false effort. In this regard, the main approach that can be taken into consideration is to personalize and adapt the gamified application with respect to the user’s characteristics which requires the application to understand and learn the characteristics of its users.
One of the significant influences of applying machine learning in the gamification context is to personalize the gamified tasks for the users, as well as adapting gamification with regards to the user capabilities and domain context that enhances gamification performance to a great extent. (S4) reported an improvement in the adaptation of the gamification task with the help of machine learning (Lopez and Tucker 2018). Authors used machine learning SVM algorithm to help adapting game features and task difficulties based on user performances. They argue that since previous methods of adapting gamification tasks have not been able to obtain the user’s behavior before the end of the interaction, the level of personalization is limited. Therefore, they analyze the interactions of the user dynamically and try to predict future performance with the help of machine learning. As a result, authors also mentioned that this approach can help to understand the relationship between the factors affecting user performance and gamified tasks. The authors of the mentioned study, collected the data of the users’ facial expression to analyze their affective state and to predict their performances, based on which the game elements should be modified. However, the sole usage of the facial expression data may be insufficient for predicting user affective state and given the limitation of users data, it is unverifiable that into what extent the proposed technique is effective. The distinctive feature of their work is that they used the affective state indirectly to modify the game design based on user performance prediction, compared to studies that altered the game elements based on the user affective state directly. Anyhow, machine learning is shown to be useful for personalizing the game design.
Therefore, the same goal can also be achieved with the help of other techniques such as identifying the student type or their affective states. Studies reporting the same personalizing effect with these techniques are presented in (S20) and (S10) by Barata et al. (2015) and Ghaleb et al. (2018). Authors in (S10) developed a gamified engineering course to create a smarter learning environment. Moreover, they utilized the machine learning clustering task to personalize gamification by clustering students based on their specific performances. They reported that the usage of gamification proved rich responses fitted to the characteristics of every student. The study of (S10) is a well reported example for how machine learning can be used to classify students based on their performances in early stages of their interaction with a gamified learning platform and consequently adapting the game features based on that classification. The study shows that how the grouping of students based on their first year performances leaded to making the game components more rewarding for those who were showing less engagement in the first year, and hence the percentage of those dropped in the second year. Hence increasing the engagement of the students with the gamified platform can be achieved with the help of the machine learning.
(S20) on the other hand, used affect recognition with the help of machine learning to improve the adaptiveness parameter of a gamified task in a learning environment. In general, the goal of personalizing the game elements in a gamification strategy for learning and education purposes can be achieved with the help of machine learning. (S20) presented the application of a serious game in delivering learning contents to students. Serious games or games with purpose that employ game design elements for interacting with students are shown to be an effective solution in previous studies. Given that the user reaction to a serious game can be fluctuating based on user attributes, it is constructive to identify the user’s feedback in different stages of the interaction and personalize it accordingly. The effort of this study toward identifying the three user’s affective states of frustration, boredom, and engagement with the help of machine learning helps the task of personalization to be achieved effectively.
Other studies reported some specific improvements in gamified tasks with the help of machine learning as a result of their work. (S12) believes that machine learning can facilitate game content insertion during the gamification process, which can be seen as an adaption context too (Lima et al. 2016). Authors in this study used machine learning to insert real data into the gamified platform, hence, optimizing the gamified learning experience. They proposed a method called intelligent agents to interfere in the gamified learning activity in order to get the virtual learning environment closer to reality. (S22) presented a serious game where users are motivated to use stairs instead of elevators to increase users’ physical activity (Ciman et al. 2016). They used machine learning to recognize whether users are taking stairs or not. They believe machine learning makes the process of gamification to work interactively in real-time. Therefore, one of the applications of machine learning in improvement of gamified platforms is to provide an advantageous functionality to the system. For example, the study of (S12) used gamification to classify diseases based on their symptoms in a medical application. Generally, machine learning has the ability of making the gamified platform to be intelligent and updated by feeding new data to the system continuously. This data can be inserted into the system either by a human agent or by the gamified platform itself. This is the same approach that is also used in (S22) to make the gamified platform changing its behavior dynamically and in real-time.
RQ5. Benefits and Challenges in Connection with Machine Learning and Gamification?
To answer the final research question, we investigated the benefits presented by the papers in connection with converging machine learning and gamification to achieve a particular task. In what follows we first provide the benefits of using machine learning for the gamification community, followed by the benefits of gamification for the machine learning community. Finally we outline some of the identified challenges regarding the combining of machine learning and gamification as a whole.
In particular, machine learning is said to help adapt game features and task difficulties as in (S4), as well as, providing early student profile detection to enhance gamification tasks as in (S9) by Barata et al. (2016) and Lopez and Tucker (2018). Adapting game features based on interaction progress and system goals helps to improve the gamified tasks which are only made possible with the help of machine learning. In other words, this helps the continuation of the user interaction with the platform that is called engagement, as well as updating interaction contents to be more realistic and beneficial. Consequently, the predicting power of machine learning can be used to analyze both the environment and the user data to predict some future events in a gamified platform and adapt the game context accordingly.
Gamification elements, such as points and badges can be used to increase the engagement level of users in a particular application. However, other attributes such as users’ specific situations and their behaviors can also be positively correlated to the level of engagement. Authors in (S35) proposed a machine learning approach to collectively consider both user attributes and gaming measures to evaluate the level of engagement in an application of online emotional support system (Doran et al. 2015). They trained a random forest algorithm to predict the level of engagement over time, both for old and new members. In this study, gamification and machine learning worked cooperatively to measure the level of engagement of users in a crowdsourcing application. In general, early detection of user’s future engagement level can be helpful to prevent users from leaving the gamified application. This can be achieved with the help of machine learning, by which user’s activity level in the future is predicted as a supervised learning problem. The study in (S35) have deployed this application in an emotional support platform and reported positive results. They also found that gamification elements are positively correlated to the user engagement level. But, it is also required to verify how generalizable is this approach to other applications, such as a gamified learning platform that its users require a different level of motivation for continuing their engagement with the system.
Another benefit of using machine learning in gamification tasks is to automate the process of personalization, without additional overwhelming effort by operators. The same is reported in (S19), (Knutas et al. 2018). The personalization of game features based on user capabilities and preferences has always been significant to improve user experience. This is because game features are normally designed and developed once and before the exposure of the system to the real environment when much of the specific user potentials, limits, and behaviors are unknown. Therefore, it is difficult to design a system that is responsive enough to the needs of all the users.
On the other hand, there are benefits of using gamification over machine learning practices, such as improved user interaction, facilitating the machine learning-based learning materials, and higher quality data collection are reported as in (S7), (S11) and (S15), respectively (Mulholland et al. 2015; Suresh et al. 2018; Li et al. 2016). (S7) used leaderboards and badges to motivate individuals to interact with the system in order to collect their sentiments. Collected sentiments are later used for analysis in making an online recommender system. Here, the fact that gamification can add to the engagement of users is to be used in the context of machine learning-based systems that require user interactions. This is the case with every human-in-the-loop application, where human cooperatively work with the machine to either optimize the data collection task for a supervised learning problem or to verify the results of an unsupervised or reinforcement learning problem. In general, given that gamification technology basically deals with optimizing the human interaction with the machine, there is a potential to use it in any machine learning application that requires human intervention to achieve some tasks. This is where the concepts of interactive machine learning (iML) is introduced (Holzinger 2016).
Moreover, sometimes the quality of the data collected through user interaction with the system becomes important. In other words, gamification can enhance the focus of the user while interacting with the system and higher quality data can be collected especially when the data is to be collected from the user itself, such as affect recognition, as provided in (S15). In this scenario, the gamification is used to make the participating users in the data collection task to be more careful about their progress, hence enhancing the quality of the collected data. This data collection can be both a labeling task for a supervised problem and feature data collection for an unsupervised learning problem. This is a great benefit for machine learning community, given that the amount of available data is limited in many applications, such as in the case of medical applications.
In a different perspective, (S11) provides a learning platform that uses machine learning to provide dynamic content to users. The authors of this study have combined their platform with gamification to improve the overall features of the system. Gamification has been deployed to assess the performance of the user that can be used for the evaluation of the machine learning-based contents. Content dynamicity is an important feature for any learning platforms. Different students have different learning capabilities. Providing the right content to each individual student based on their interaction with a gamified learning platform improves their level of engagement, hence, improving the learning process as well.
Regarding the challenges, the papers were inspected for possible limitations of combining these technologies. There is a very limited number of challenges reported by the researchers, as they are normally focusing on the strengths of their work in the reports. (S9) argued that the constraints imposed by the limited datasets are a significant challenge since big datasets are a crucial requirement for machine learning-based gamified platforms (Barata et al. 2016). In general, most of the applications of machine learning techniques in gamified platforms, were attempts toward formulating the problem as a supervised learning, which requires a labeled target variable. Although, considering them as other types of ML problems such as semi-supervised learning can be beneficial by utilizing the available limited amount of data. For example, in the case of clustering the users based on their performances, presented in (S10).
On the other hand, the data may be biased as mentioned in (S16), which can result in unreliable outcomes (Psaltis et al. 2018). Thus, this is important to make sure of the quality of the data. A biased dataset can result in misleading inferences and the quality of the collected data should be taken into consideration prior to any decisions. However, it is a challenging task to ensure the quality of data that is collected through a gamified human interaction.
Last but not least, (S19) reflects that the process of deploying machine learning to personalize gamification tasks requires ML experts (Knutas et al. 2018), which is a known challenge regarding ML developments in every field. Although, in recent years development of cloud based machine learning tools and services, provided the ability to deploy ML methods and techniques much easier and faster.