Introduction

A well-recognized challenge in biological science education is successfully teaching evolutionary concepts (Alters and Nelson 2002). Within this discipline, some topics remain more challenging to teach than others, and the number and efficacy of tools available for instruction varies (Shulman 1987; Ziadie and Andrews 2018). For instance, multiple strategies have been developed to improve the teaching of concepts like natural selection (Ziadie and Andrews 2018) while the best practices for teaching equally important topics such as non-adaptive evolution remain largely understudied (Kalinowski et al. 2013). This is particularly problematic for topics like genetic drift because concepts of adaptive and non-adaptive evolution form independent elements in evolutionary thinking, and a better understanding of one does not necessarily imply a better comprehension of the other (Beggrow and Nehm 2012). To address this, studies devoted to developing, improving, and testing teaching strategies for non-adaptive evolutionary concepts are needed.

Previous studies have created approaches aimed to identify student misconceptions regarding genetic drift (Andrews et al. 2012; Price et al. 2014); in addition, study activities and software have been developed, tested, and made publicly available (Price et al. 2016; Revell 2019; Staub 2002). These serve as indicators that the knowledge gap regarding genetic drift instruction is being addressed. Nonetheless, diverse class environments, student cohorts, and even teaching styles require distinct sets of tools. Furthermore, there is an academic push for improving the teaching strategies currently set in place and to utilize alternative instruction methods (Lee et al. 2017; Nelson 2008; Tanner and Allen 2005). In fact, more holistic strategies such as transformative teaching—centered around helping students master key course concepts as well as developing learning-related values and skills—have been proposed (Slavich and Zimbardo 2012). Similarly, teaching strategies that favor discussing and testing evolutionary concepts among students have been shown to be effective (Nehm and Reilly 2007). As a result, tools that can be used to facilitate free in-class exploration of evolutionary concepts are especially useful since they allow students to both learn these concepts and develop critical thinking skills.

Here, we developed Genie, a web application designed to demonstrate several population genetics and evolutionary notions including genetic drift, gene flow, and random mutation. Genie simulates evolution in real time in a finite population using cellular automata. Our intuition is that spatially explicit, individual-based simulations allow students to visualize genetic drift in a way not available from line graphs of allele frequency changes, including how changes at the individual level of the population translate into fluctuations in allele frequency and fixation or loss of alleles. This web-based software is accessible to students and leads to increased knowledge of genetic-drift concepts, as tested using a Genetic Drift Inventory (Price et al. 2014). These types of assessments have proven to be useful in capturing students’ understanding of other complex evolutionary concepts in the past (Perez et al. 2013). Genie requires no setup other than navigating to a web page, thus making the use of stochastic simulations to demonstrate genetic drift practical to both educators and students.

Methods

Genie simulation program

Genie (https://cartwrig.ht/apps/genie/) is a web-based, stochastic simulation app written in JavaScript. The simulation uses a spatially explicit Moran model (Nei et al. 1976) to describe a finite population of 1,024 individuals on a 32 by 32 grid. Individuals are haploid with a single locus. The locus mutates according to the infinite alleles model (Nei et al. 1976). Genie works as follows:

Population initialization

The simulation begins when a population is randomly initialized according to Hoppe’s Urn (Perez et al. 2013). Briefly, the population is created one individual at a time, and each individual either carries a new, unique allele or is a copy of a previously created individual. The probability that individual i + 1 has a new allele is θ/(θ+i) and the probability that the individual copies an existing allele is (i)/(θ+i), where θ = 2Nμ, N is the population size, and μ is the mutation rate. If an individual copies an allele, it randomly chooses a previously initialized individual uniformly. At initialization μ is = 0.001 to ensure diversity within the initial population, but the mutation rate of each generation can be specified by the user, defaulting to 0.

Algorithm

At each step of the simulation, a randomly selected individual dies, leaving its corresponding cell momentarily empty. A parent allele is then randomly selected from the eight immediate neighboring cells (including adjacent and diagonal). Cells on the edges and corners of the simulation have fewer neighbors than internal cells, causing a small edge effect. The probability that a new individual will have the same allele as its parent is 1-μ, and the probability that an individual has a new, unique allele is μ. Each ‘generation’ consists of 2000 death/birth steps after which the population is redrawn in the visualization window.

Running

The application contains four components: a grid, where the population is displayed (Additional file 1a); a control panel, where users can manipulate the simulation’s mutation parameter (Additional file 1b); an upper graph, where users can see the number of alleles in the population at any given time (Additional file 1c); and a lower graph, where users can see the frequency of different alleles at any given time (Additional file 1d). Both graphs update in real time as the simulation runs. Each initial allele is assigned one of 18 basic colors, while each mutant allele is assigned one of six neon colors. A single button allows users to toggle between starting the simulation or pausing it. A reset button allows users to restart and reinitialize the simulation at any point.

Barriers

Users can create a barrier in the population grid. To do so, users alter a cell by clicking on it or alter a set of cells by clicking and dragging the cursor to select multiple cells. When a barrier is created, the color associated with the cell changes to black. Barriers act neither as parent cells (they are never replicated) nor die. Thus, for each created barrier cell the total population size declines by one. By building barriers, users can construct physical constraints that restrict the movement of alleles between subpopulations. Barriers can be used to create subpopulations of different sizes and shapes, as well as to study the effects of corridors on gene flow. Barriers can be removed by clicking on the chosen cell(s) a second time; this will set the cell color to white and designate the cell as unoccupied. Neighboring cells will replicate into unoccupied cells; unoccupied cells cannot serve as a parent of a neighboring cell.

Forced mutation

Users can force a mutation to occur in a manner similar to creating barriers. Cells can be mutated by holding the SHIFT button while clicking the cell, or while clicking and dragging the cursor across several cells. Forcing a mutation immediately creates a new, unique allele in each of the chosen cell(s).

Instruction

Before each recitation section, all participants received a lecture on genetic drift. Following, all recitation classes received further instruction of genetic drift by their corresponding Graduate Teaching Assistants (TAs). For the participants in the Non-Genie 2017 class (control), the TAs used static images in a worksheet, taken from screenshots of Genie, to explain genetic drift. These images were comparable to textbook images of differences in allele frequencies. For the participants in the Genie 2016 and Genie 2017 classes, the basic features, display, and usability of the Genie software were explained, and questions designed to facilitate discussion and interpretation were provided. The recitation slides (Additional file 2) were made available to all students after all recitation sessions concluded. Four main activities were conducted:

Activity 1: Defaults parameters/settings

Participants ran Genie without modifying any parameters or creating barriers. As the number of generations increased, participants kept track of the changes in the number of alleles in the population and the allele frequencies. Participants made conjectures on the distribution of haplotypes in the population by tracking variations in the colors patterns (alleles) shown in the population grid. The mutation rate was not modified; however, participants recorded new alleles arising at any point of the simulation. The simulation ran until one allele reached fixation, and participants kept track of the number of generations until this occurred.

Activity 2: Effects of absolute barriers on genetic drift and gene flow

The simulation was re started and participants created two barriers reaching opposite borders of the population grid (one horizontal and one vertical). This setup resulted in four completely isolated populations of roughly equal size. No modifications in the mutation rate were introduced. Participants kept track of variations in alleles, changes in number of alleles, and allele frequency. Additionally, participants recorded the allele number and distribution in each of the four independent sections/populations. The simulation continued until one allele became fixed in each section/population. After one allele became fixed in each section/population, participants paused the simulation and created a corridor by removing part of a barrier. Participants recorded the changes in number of alleles and allele frequency, as well as the movement of alleles between connected sections/populations. The simulation ran until one allele became fixed. The number of generations for an allele to become fixed amongst independent sections/populations was recorded.

Activity 3: Effects of partial barriers and corridors on genetic drift and gene flow

Participants restarted the simulation and created barriers that entirely separated the population grid into four sections of roughly equal size. Before the simulation started, participants formed a corridor by removing a portion of the barriers. This setting allowed for gene flow to occur between sections/populations from the beginning of the simulation. Participants recorded the changes in number and allele frequency between (1) completely isolated sections/populations; and (2) sections/populations connected by the corridor. Participants compared the flow of alleles across the corridor with that observed in Activity 2. Additionally, participants also recorded the number of generations until fixation was reached in connected and isolated areas. The mutation rate was not modified in this activity.

Activity 4: Effects of mutation rate on genetic drift

Participants restarted the simulation, increased the mutation rate, and recorded the changes in the population grid and accompanying graphs. Participants repeated the activity reducing the mutation rate. No barriers were created on the population grid.

After the four main activities were completed, participants were allowed to freely explore other potential outcomes of genetic drift. Participants freely modified the population landscape by creating various types of barriers and/or changing the mutation rate. Participants followed series of suggestions activities/questions: Evaluate the effects of creating barriers of different size and shape; assess the effects of genetic drift on different population sizes; discern the effects of genetic drift on allele diversity within a single population, and between isolated populations; observe the effects that creating corridors with different size and shapes have on gene flow; evaluate the effects of creating corridors and barriers at different points of the simulation; and track the effects of modifying the mutation rate at different points of the simulation.

Data collection

Genie’s efficacy as a tool for teaching Genetic Drift concepts was tested in the Evolution (BIO345) class at Arizona State University (ASU). Genie was used during the practical portion (recitation) of the BIO345 course in the Spring 2016 and Spring 2017 classes. All participants in the Spring 2016 class used Genie during practical class sessions. In the Spring 2017 class, half of the participants used the dynamic visualization of Genie while the other half used static illustrations. Participants in both the Spring 2016 and Spring 2017 classes were given the option to opt-in to the study at the end of the semester. In addition, participants were given the option to provide their demographic information: reported gender, reported ethnicity, and first-generation college student status. All research was reviewed and approved by Arizona State University’s IRB protocol STUDY00003707.

The impact of Genie as a tool for teaching concepts of genetic drift was evaluated using the Genetic Drift Inventory (Price et al. 2014). The inventory was used without changes (22 questions assessing different aspects of genetic drift) in pre- and post-recitation assessments. The pre- and post-recitation assessments (considered as homework for the entire class) were individually answered by each participant. The pre-recitation assessment was posted online on Blackboard two days before recitation. Participants were asked to answer all questions by 3:00 pm on the day of the recitation. The post-recitation assessment was posted on Blackboard at 9:00 pm after the last recitation session ended. Participants had two days to individually complete the post-recitation assessment. All participants were allowed the same amount of time to complete both the pre- and post-recitation assessments. Participants’ answers were recorded, and their individual pre- and post-recitation scores were calculated by summing the number of correctly answered questions (value 1 point) out of the 22 questions in the Genetic Drift Inventory.

Genie assessment

The complete dataset was divided into two major groups based on the instruction year. These groups were: the entire Spring 2016 class (henceforth referred to as Genie 2016) and the entire Spring 2017 class. The 2017 class was further subdivided into groups based on the instruction method used during the practical class session. These groups were: participants that used Genie during the recitation session in 2017 (henceforth referred to as Genie 2017) and the participants who did not use Genie during the recitation session in 2017 (henceforth referred to as Non-Genie 2017). The Genie 2016 class was subsequently divided into eight in-class groups of roughly equal size, while each 2017 class was divided into four in-class groups of roughly equal size (two Genie and two Non-Genie). The groups were designated based on recitation start times, TA pairs; and in the case of 2017, on the use of dynamic (Genie) vs. static (Non-Genie) instruction methods. No more than 48 participants participated in each recitation session. All analyses and figures were developed using R v3.2. The code and datasets used are available (Additional files 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, https://github.com/AndreinaCastillo/Genie_manuscript_data_analysis).

The putative relationship between participants’ demographics and the pre- and post-recitation scores was evaluated using a two-way ANOVA. The following demographic parameters were used as explanatory variables: reported gender, reported ethnicity, and first-generation college student status. In the case of 2017, the use of Genie as an instruction tool was also considered as an explanatory variable. The two-way ANOVA was performed independently for Genie 2016, Genie 2017, and Non-Genie 2017. Next, we assessed if the pre- and post-recitation performance varied between the three class groups or among subgroups within each class. To conduct this analysis, the distribution of pre- and post- recitation scores was assessed using the ‘fitdistrplus’ (Delignette-Muller and Dutang 2015) and ‘betareg’ (Cribari-Neto and Zeileis 2010) R packages. Potential differences between pre- and post-recitation scores were evaluated both between classes and within each in-class group. Cohen's d was used to measure the effect size between pre- and post-recitation scores within each class, and to estimate differences in pre- and post- recitation scores between Genie 2017 and Non-Genie 2017. In addition, a paired Student’s t-test was performed between individual participants’ pre- and post-recitation scores within each class.

Finally, question-specific performance was evaluated to determine how Genie aided participants in addressing the specific genetic drift concepts and misconceptions listed in the Genetic Drift Inventory (Price et al. 2014). The number of correct answers in pre- and post-recitation sessions associated with each question were calculated from participants’ individual answers, and the totals were then compiled by class. Differences between pre- and post- recitation scores for each question were assessed using a McNemar’s χ2 test. In addition, the difference in the number of correct answers per question in Genie 2017 vs. Non-Genie 2017 pre- and post-recitation sessions was assessed using a Fisher’s exact test.

Results

Demographic representation varied among cohorts (Table 1). A two-way ANOVA found that most demographic explanatory variables did not affect pre- and post-recitation scores (Table 2) with the exception of ‘First-generation’ college in post-recitation scores (F = 7.955, p-value = 0.005) for Genie 2016, and ‘Genie used’ in pre-recitation scores (F = 6.131, p-value = 0.014) for 2017. In the case of the ‘First-generation’ college students, lower pre-recitations scores were observed in ‘Not First-generation’ students from Genie 2016 and Genie 2017, while the opposite trend was observed in Non-Genie 2017. It should be noted that ‘First-generation’ college students showed slightly less improvement than ‘Not First-generation’ college students despite both groups having higher post- than pre-recitation scores. Overall, pre- and post-recitation scores were different among the classes analyzed. The mean pre- and post-recitation scores for Genie 2016 were lower than in either Genie 2017 or Non-Genie 2017. Differences in post-recitation scores could be largely explained by the initial class performance (Table 3). In-class groups showed similar performance levels in all evaluated groups except for ‘TA Pair1 7:30 pm’ (p-value = 0.017) during Genie 2016, this class was composed exclusively of honor students.

Table 1 Demographic breakdowns of participants in each year and section shows variable representation of different groups
Table 2 Most demographic predictors did not affect performance on the evaluation of genetic drift knowledge
Table 3 Post-recitation scores were mostly influenced by pre-recitation scores, but not class section

Overall, comparisons of pre- and post-recitation scores showed that students performed better in all classes regardless of the instruction method used (Fig. 1). Cohen’s d values (Table 4) showed a moderate improvement in post-recitation scores compared to the pre-recitation scores in Genie 2016 (0.608, CI: 0.408–0.807), Genie 2017 (0.632, CI: 0.410–0.855), and Non-Genie 2017 (0.658, CI: 0.430–0.886). This was also true for most individual participant scores (Table 5, Additional file 14). Understanding of key genetic drift concepts and misconceptions statistically improved after instruction with or without Genie (Table 6). Post-recitation scores were generally higher in Genie 2017 than in Non-Genie 2017 except for two questions (Q10 and Q15, Fig. 2). Fisher's exact test showed that the instruction method (Genie vs. Non-Genie) was not associated with student’s switching answers from correct to incorrect or incorrect to correct between pre- and post-recitation (Table 7). Results were comparable with or without including students within honor sections (Additional file 15).

Fig. 1
figure 1

Students' test scores improved after instruction. Blue dots represent excess improvement in class performance. The presence of blue points in a graph indicates that there were more students whose post-test score was better than their pre-test score. The number of blue points indicates how many more students improved their scores than students whose scores decreased

Table 4 Effect of Genie on learning outcomes
Table 5 Individual participants in all classes showed higher post-recitation scores compared to their pre-recitation scores
Table 6 For most individual questions, participant post-recitation performance improved across classes following instruction either with or without Genie
Fig. 2
figure 2

Post-recitation scores by question (Price et al. 2014) were generally higher in Genie 2017 compared to Non-Genie 2017. A bar plot comparing Genie 2017 (dark green) and Non-Genie 2017 (pale green) is shown. Questions have been grouped according to the classification provided by Price et al. (2014), with questions pertaining to Key concepts and misconceptions (M1-M6) separated by horizontal bars

Table 7 Comparison of performance between Genie 2017 and Non-Genie 2017, controlled by question

Discussion

There are numerous software options capable of generating genetic drift simulations. Some of them can be easily downloaded and installed (Kliman et al. 2008; Revell 2019), others include an ample array of parameters to be modified by the user (http://evolution.gs.washington.edu/popgen/popg.html), and others can be found publicly available online (e.g. the Genetic Drift Simulator (http://www.biology.arizona.edu/evolution/act/drift/drift.html or Phyletica (http://phyletica.org/teaching/drift-simulator/). Some of these software even have a dynamic interface similar to that developed by Genie (http://virtualbiologylab.org/NetWebHTML_FilesJan2016/RandomEffectsModel.html). While this list is not exhaustive, it provides a glimpse on how computational tools, and especially those found freely in web-interfaces, are becoming predominantly used in science teaching. The objective of this paper is not to compare Genie’s performance to all these tools, instead, the authors aim to present an additional teaching tool that can be added to an instructor’s repertoire. As such, we endeavor to show that Genie can be efficiently used alongside other class instruction methods. A comparison was made between Genie-based instruction and instruction using static images (henceforth referred as teacher-centered instruction). The comparison was chosen since teacher-centered methods still are commonly used in science teaching (Tanner and Allen 2004) and have been traditionally used when teaching evolutionary topics in ASU.

There were no significant differences in the performance levels among participants from distinct demographic backgrounds. Despite differences in levels of representation across groups, pre- and post- recitation scores were similar. However, while participant’s performance increased in all methods of instruction, ‘First-generation’ college students showed slightly lower improvement than ‘Non-first generation’ college students. Multiple studies have attempted to address the social class gap among undergraduate students and explain why ‘First-generation’ college students, on occasion, perform more poorly than ‘Non-first generation’ college students (Grineski et al. 2018; Tibbetts et al. 2018). One finding pertinent to our assessment is that ‘First-generation’ college students tend to underperform when they know that their performance is going to be compared to that of other students in the class (Jury et al. 2015). This might be an unintended consequence of the in-class methods used here, which favored in-class discussion and student participation. However, while not possible to address here, these results could point towards the unique disadvantages and social-related pressures that ‘First-generation’ college students face within ASU. These results should be evaluated in more detail in future studies.

Overall, participants’ performance was not affected by the instructor or the participant populations within the group, except for the ‘TA Pair1 7:30 pm’ group during Genie 2016. The ‘TA Pair1 7:30 pm’ group was formed by a small number of honors students; therefore, it is possible that this group performed better compared to the general class population in Genie 2016. Previous studies have found that instructors’ mastery of the content, as well as their overall teaching style play a critical role in students’ learning process (Alsharif and Qi 2014; Maleki et al. 2017). Thus, our results are indicative that Genie performs similarly well even with teachers using diverse teaching styles and having variable levels of expertise.

In that regard, we were unable to control for previous classes that BIO345 students took. Although, all students in BIO345 are required to have passed BIO340 (General Genetics), which typically includes instruction in evolutionary genetics; BIO340 is taught by multiple instructors, who do not teach evolutionary genetics equally. Interestingly, despite these differences, mean scores showed that the increase in performance between pre- and post-recitation was ~ 0.1 regardless of the teaching method used in BIO345. The main distinction were the pre-recitation scores, with some classes initially performing better than others. Pre-course/test assessments are used to contextualize post-course/test scores. Previous data shows that lower pre- scores often result in lower post- scores during midterms and finals (Furrow and Hsu 2019). Interestingly, the pre-recitation scores observed here were larger than in other studies using the Genetic Drift Inventory; in fact, the smallest pre-recitation score reported here (0.645 for Genie 2016) was higher than pre-recitation score reported during the assessment of The Genetic Drift and Bottlenecked Ferrets module (0.58) by Price (2016). This suggests that the starting performance level of the classes was higher than in other studies, which could be due to genetic concepts being introduced in lecture and before recitation. Taken together, these results are indicative that both teacher-centered and Genie-based teaching strategies led to a comparable improvement in participant’s scores, regardless of the initial performance level of the class. Thus, it is possible to conclude that Genie can perform as efficiently as traditionally teacher-centered instruction.

The lack of statistically significant differences between Genie-based and teacher-centered instruction also deserves some note. All students received the same lecture, were taught by the same professor, had access to the same set of slides, and followed the same worksheet instructions. The only difference was that participants in the Non-Genie 2017 class had static images in the worksheet, while participants in the Genie 2017 worked with the simulation software. Therefore, improvements in comprehension of genetic drift and related concepts may originate from any of those common factors and not from the use of Genie itself. Nonetheless, previous studies have demonstrated that cellular automata simulations can be effective tools for teaching evolution in action. In fact, the most well-known of these tools, Avida-ED, has been shown to serve multiple teaching purposes. First, digital simulations like Avida-ED or Genie provide an experimental platform in which students can test the effect of core evolutionary mechanisms; second, these simulations encourage students to use inquiry-based learning; and third, the simulations allow students to learn concepts in a manner that is transferable across levels of biological complexity (Bray Speth et al. 2008; Robert 2007; Smith et al. 2016). Simulations also provide multiple independent instances for students to observe the stochasticity in genetic drift, unlike a lesson that only walks through changes in allele frequencies. In this sense, the simulated individuals undergo real evolution, experiencing mutation, replication, selection, and drift. Avida-ED has detailed exercises and a lab manual describing genetic drift (i.e., https://avida-ed.msu.edu/files/curricula/LabBook/Avida-ED_LabBook_Ex4.pdf).

Overall, understanding of genetic drift key concepts and misconceptions improved following instruction with all teaching strategies. Previous analyses have shown that a combination of traditional teaching-centered methods, with student-centered methods, and active learning strategies results in superior student performance (Dolan and Collins 2015; Shir et al. 2016; Wieman 2014) and that concept inventories are an effective way to support and evaluate undergraduate learning of evolution concepts (Furrow and Hsu 2019). In particular, the Genetic Drift Inventory has been found to be a generally reliable tool for inferring knowledge changes on upper-level undergraduates (Tornabene et al. 2018). However, despite their advantages, previous research also suggests that concept inventories could be used more creatively. For instance, in the present study, the changes in participant’s performance might be related to students becoming familiar with the questions found in the Genetic Drift Inventory. Or in other words, to the study being linked to the exclusive use of the Genetic Drift Inventory as presented by the original authors. While this is undoubtedly a factor, we expect that all questions should be affected equally, which should not largely bias our results. Moreover, we observed that the changes in student performance varied among the Genetic Drift Inventory questions, suggesting that teaching strategies did have an impact on participants’ performance. Nonetheless, future uses of the Genetic Drift Inventory should be enriched by incorporating new multiple choice and open-ended questions in post-recitation assessments.

In the case of evolution teaching, strategies that favor student’s development of critical thinking skills are especially useful. For instance, tools and methods that aid in creating and testing hypotheses have been effective in improving students' understanding and acceptance of evolutionary theory (Lark et al. 2018; Smith et al. 2016). Likewise, instruction using computer simulation programs has proven to be valuable in facilitating student’s recognition of the breadth of evolutionary mechanisms that can act in a population (Kliman 2001). Nonetheless, different students can master the same topic using different paths (Price et al. 2016), and different classroom settings might be more suitable for distinct teaching methods. Simply put, there is no ‘fit all’ teaching strategy that can be universally implemented. Therefore, providing instructors with a broad repertoire of teaching tools can aid them in finding those that better work for the topic being instructed, the specific class needs, and the instructor style. In this regard, we expect Genie can be added to the repertoire of higher education tools to be used for teaching genetic drift and other non-adaptive evolution concepts.

Conclusion

The present study shows that Genie can be successfully used for teaching concepts related to genetic drift and non-adaptive evolution to undergraduate students. Genie performed comparably to traditional teacher-centered methods across all evaluated groups. Moreover, Genie-based and teacher-centered approaches led to participants understanding distinct key concepts and misconceptions of genetic drift. This indicates that Genie can be effectively used alongside other teaching strategies to provide a rounded view of non-adaptive evolution. In a related note, Genie provides a means for participants to develop and test their own hypotheses, which can be useful in practicing critical thinking skills. Despite this positive outcome, it should be noted that this manuscript only describes the first two instances in which Genie was used as a teaching tool in an undergraduate class, and therefore, there is room for improvement on the software’s implementation. For one, Genie could be presented to students before or during lecture as to determine how the software influences their understanding of genetic drift concepts when they are first introduced. In addition, recitations could allocate time for students to freely interact with the software at the start of the session, that way, students could infer what the different feature of the program do, how they affect the population, and how they could use them to test evolutionary hypotheses (this could even be done as a homework). Alternatively, questions allowing students to evaluate their understanding of genetic drift concepts in relation to the usefulness of the tool should be included in future classes. The effectiveness of Genie should also be further evaluated across institutions as well as in classes with higher pre- score performance than obtained here.