Fully Individualized Curriculum with Decaying Knowledge, a New Hard Problem: Investigation and Recommendations

Lebis, Alexis; Humeau, Jérémie; Fleury, Anthony; Lucas, Flavien; Vermeulen, Mathieu

doi:10.1007/s40593-023-00376-9

Fully Individualized Curriculum with Decaying Knowledge, a New Hard Problem: Investigation and Recommendations

Article
Open access
Published: 20 November 2023

(2023)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Artificial Intelligence in Education Aims and scope Submit manuscript

Fully Individualized Curriculum with Decaying Knowledge, a New Hard Problem: Investigation and Recommendations

Download PDF

821 Accesses
Explore all metrics

Abstract

The personalization of curriculum plays a pivotal role in supporting students in achieving their unique learning goals. In recent years, researchers have dedicated efforts to address the challenge of personalizing curriculum through diverse techniques and approaches. However, it is crucial to acknowledge the phenomenon of student forgetting, as individuals exhibit variations in limitations, backgrounds, and goals, as evidenced by studies in the field of learning sciences. This paper introduces the complex issue of fully individualizing a curriculum while considering the impact of student forgetting, presenting a comprehensive framework to tackle this problem. Moreover, we conduct two experiments to explore this issue, aiming to assess the difficulty of identifying relevant curricula within this context and uncover behavioral patterns associated with the problem. The findings from these experiments provide valuable prescriptive recommendations for educational stakeholders seeking to implement personalized approaches. Furthermore, we demonstrate the complexity of this problem, highlighting the need for our framework as an initial decision-making tool to address this challenging endeavor.

Increasing the Sensitivity of a Personalized Educational Data Mining Method for Curriculum Composition

Individualization of Bayesian Knowledge Tracing Through Elo-infusion

A Data-Driven Student Model to Provide Adaptive Support During Video Watching Across MOOCs

Find the latest articles, discoveries, and news in related topics.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction and Context

Education in universities and engineering schools is undertaking quite important changes, mostly to reflect the use of new technologies, breakthroughs in the Technology Enhanced Learning (TEL) field, and ever-changing society’s needs Kryukov and Gorin (2017); Daniela et al. (2018). Therefore, it is important to understand these changes to provide reliable assistance to education stakeholders (i.e. students, teachers, and institutions). A significant observable change in these institutions is the gradual substitution of predefined curricula with more modular alternatives University of Reading (2019).

In this paradigm, emphasis is given to personalizing the learning experience to better match students’ career expectations and goals. Students can then choose courses at each academic term (e.g. semesters) to build their sequence of courses (namely a curriculum) according to their objective (e.g. career goal). Yet, an inadequately structured curriculum presents several challenges to students. The absence of a coherent and adapted progression may hinder their assimilation of skills, competencies, behaviors, attitudes, abilities, or knowledge and impede their effective application Tetzlaff et al. (2021); Aleven et al. (2016). Difficulties stemming from insufficient prerequisites or unpreparedness for advanced coursework may also precipitate disengagement Walkington and Bernacki (2014). Erroneous sequencing can also extend degree completion timelines, thus affecting institutional graduation rates. Considering that the average time-to-degree for a Bachelor’s degree in Europe is approximately 3.5 years Vossensteyn et al. (2015) in a traditional learning environment, these aforementioned challenges could substantially prolong students’ time-to-degree. Furthermore, these badly structured curricula could be hard to identify by teachers and institutions Caputi and Garrido (2015) and may yield inefficiencies in resource allocation, with certain courses witnessing disproportionate demand while others are underutilized.

Currently, to circumvent these risks, in the vast majority of cases, institutions implementing this kind of approach decide which time periods and courses can be personalized by students. Individualizing curricula consists of relaxing these constraints for students: they have to fully define their entire curriculum which will, potentially, better match their objective. We call such curricula fully individualized curricula.

Student forgetting, a phenomenon observed in the field of learning science, refers to the gradual loss or decay of previously acquired knowledge or skills over time. It is influenced by various factors such as the passage of time, lack of reinforcement or practice, and interference from new information Arthur et al. (1998); Ebbinghaus (2013). The decay of knowledge significantly impacts the outcome of a student’s curriculum, as it can lead to the loss of essential prerequisites for future courses within the curriculum determining the success or failure of a student’s educational journey. Recognizing and predicting the potential impact of this decay is needed to optimize the learning experience and ensure students have a strong foundation for continuous academic growth. Yet, to the best of our knowledge, works personalizing curricula do not take into account the decay aspect and its effect on the generation of solutions.

However, educational stakeholders involved in the individualization of curricula may already be confronting challenges stemming from knowledge decay. Students, as an example, are put in a rather challenging situation Daniela et al. (2018) as they must plan courses for the coming years without actual prior knowledge about the courses they have to choose, ensure that the sequencing is well made in terms of prerequisites, and have to assess the relevance of each course concerning their objectives. Furthermore, students should engage in self-examination to recognize the potential decay of their knowledge over time, an inherently difficult task. For teachers, this context makes some practices harder, such as multi-modal teaching, re-take exams, or one-to-one attention, because it tends to favor a considerable heterogeneity of students’ backgrounds.

In such a context, institutions should guarantee the quality and equity of the educational journey of each student, as some curricula could end up being more difficult than others. Consequently, institutions have to assess whether a curriculum is either well-formed or not depending on several factors, such as the fulfillment of course prerequisites, timetable scheduling Loo et al. (1986), or teachers’ availability. Institutions also need to support the individualization of a curriculum according to the student’s profile Klinkenberg et al. (2011); Desmarais and Baker (2012); Papousek et al. (2014). It is also necessary for institutions to ensure that the curriculum aligns with the student’s objective and is attainable. This implies that courses have to be properly cataloged by institutions, including the knowledge they taught and their prerequisites. However, all these tasks are typically carried out manually by institutional staff members, as no study explicitly emphasizes the challenges posed by the decay of knowledge concerning the individualization of curricula.

In this paper, we address the Fully Individualized Curriculum with a Decaying Knowledge Problem (FIC/DK-P). The objectives of this paper are threefold: 1) propose a theoretical and reproducible framework for the problem; 2) study the effect of decaying knowledge in the individualization of a curriculum and its complexity; 3) formulate several actionable recommendations and warnings for education stakeholders who are considering or currently implementing curriculum individualization. The remainder of this paper is structured as follows. In Section 2, we provide an overview of related works on curricula personalization, including the major challenges and techniques employed, and the theory of student forgetting. The Section 3 formalizes the problem by introducing the FIC/DK-P framework. This section is followed by Section 4, which covers the experiments conducted to study the problem, as well as the data generated for these experiments. Based on the analysis of the experimental results, we put forth nine recommendations for education stakeholders in Section 5. We then conclude the paper in Section 6. Finally, in Appendix A, the interested readers can explore the mathematical foundation of the FIC/DK-P framework.

Literature Study

FIC/DK-P consists of recommending to a student a sequence of courses until graduation that matches his/her objective, whether it be personal and/or professional while considering that knowledge can decay over time. Although this issue has not been addressed directly in the literature to our knowledge, it is noteworthy to mention related works in learning path personalization. Learning path personalization refers to approaches that generate learning paths by taking into account the individuality of a student and his/her learning preferences Deng et al. (2017). This personalization operates at various levels, mostly to the learning object level Belacel et al. (2014), at the topic level, the lesson level Nabizadeh et al. (2020) and at the course level Nabizadeh et al. (2017); Parameswaran et al. (2011). Duval and Hodgins (2003) introduced a modular content hierarchy based on these levels used to promote the sequencing of contents. Yet, from an institution’s point of view, there is no consensus over this hierarchy and its explicit or implicit implementation changes across institutions, hence the need for a flexible and adaptable learning path recommendation system.

In the literature, two main methods of personalizing a learning path can be observed. Either 1) computing and recommending an entire learning path for a student (or a group of students), such as in Kardan et al. (2014); Feng et al. (2011); Belacel et al. (2014) or 2) recommending a path for a student learning content by learning content, as shown in Govindarajan et al. (2016); Salahli et al. (2013) for example. The second approach may offer significant computational speed advantages but is inherently limited in capturing certain unique aspects and broader contextual nuances that emerge when taking into account the entirety of a student’s learning journey, including factors like knowledge decay.

The algorithms used in these methods are numerous. Among them, we can cite machine learning techniques, such as clustering or tree classifier Kardan et al. (2014); Lin et al. (2013). Recently, notable studies have incorporated semi-supervised learning and unsupervised learning approaches to analyze student data, aiming to predict students’ performance and provide recommendations for personalized curricula Backenköhler et al. (2018); Wong (2018). However, these machine learning techniques tend to merge student profiles, resulting in a loss of precision concerning the individuality of each student, which is essential in addressing our specific problem. Additionally, they tend to require a large amount of data to be efficient.

Among the other algorithms used are greedy algorithms Durand et al. (2013), graph theory Li et al. (2016); Belacel et al. (2014), Markov decision process Durand et al. (2011) or bayesian network Zhang and Koren (2007). While these algorithms generally yield high-quality results in terms of recommendation, they tend to be highly dependent on the problem and data, often requiring extensive fine-tuning and optimization techniques. As a result, they may not be well-suited for exploring novel problems. In the e-learning literature, we observe that genetic algorithm is a widely used technique Seki et al. (2005); da Silva Lopes et al. (2009); De-Marcos et al. (2009); Al-Muhaideb and Menai (2011); Benmesbah et al. (2021); de-Marcos et al. (2008) that can also produce high-quality, locally optimal, solutions. As a meta-heuristic approach, genetic algorithms demonstrate problem-agnostic characteristics, making them a promising candidate for initial exploration and analysis of our specific problem.

All these above works are implicitly based on the hypothesis of an ideal memory model of students Georghiades (2000). Nevertheless, compelling evidence suggests that students experience forgetting, and their knowledge retention curve exhibits a distinct pattern that is unique to each individual Bahrick (2000). This forgetting process is supposed to be driven by a core set of major factors Arthur et al. (1998); Bacon and Stewart (2006) such as 1) length of the non-use interval, 2) degree of overlearning, 3) task characteristics, 4) cognitive interference, 5) retrieval conditions, 6) training and instructional strategies and methods and 7) spontaneous loss of knowledge. In the forgetting curve theory, which thus considers that students’ knowledge proficiency follows a declining curve, we can observe several works modeling this decay over time.

Nonetheless, the modeling of forgetting remains one of the longstanding unresolved issues in the field of experimental psychology and is not the subject of consensus Klammer and Gueldenberg (2019). In Averell and Heathcote (2011), the authors propose a general memory model based on a study of a large dataset and Bayesian model selection to account for the student’s capacity to forget, where a power function seemed to be favored Anderson and Schunn (2013). Another popular model of forgetting is the exponential forgetting curve of Ebbinghaus Ebbinghaus (2013), yet this model was initially conducted by Ebbinghaus himself in an incomplete study. Nonetheless, Murre and Dros (2015) attempted to replicate Ebbinghaus’ experimentation and findings; eventually they showed that the experimental results were similar to Ebbinghaus’ curve, therefore supporting the relevance of his model.

Hence, a question remains concerning how the decay of knowledge can impact actual learning algorithms, especially recommending systems. The literature gives evidence that taking this decay phenomenon into account can lead to better solutions but tends to make the problem harder. In Lindsey et al. (2014), the authors incorporated memory models into factor analysis (Item Response Theory van der Linden and Hambleton (2013) is a canonical model of factor analysis). The authors’ model performed better compared to models that did not implement memory models. However, optimal solutions for personalized scheduling were found to be intractable. This evidence is also supported by Choffin et al. (2019); Huang et al. (2020).

Considering the challenges associated with manually personalizing a curriculum, which demands extensive technical and pedagogical expertise Vanitha and Krishnan (2019), it would be unwise to expect education stakeholders to undertake such a task without the aid of suitable decision-making tools or recommendations, especially while considering that decay phenomenon has a significant impact on the problem hardness.

Problem Definition

In this section, we model FIC/DK-P. This problem being related to other hard problems, such as scheduling problems, we had to make the four following hypotheses to study it and to give initial points of comparison:

1.
Logistical aspects (e.g. rooms availability, teachers availability) have no impact on the quality of a curriculum;
2.
The course catalog used is considered complete and sound, that all the information is at our disposal and there is no implicit information;
3.
Courses could not overlap two or more academic terms: they are always confined into a single academic term (and last this entire academic term);
4.
The learning process is perfect, meaning that at the end of the course, a student has acquired everything that a course should provide so that we do not introduce probabilistic learning models in the study of FIC/DK-P.

FIC/DK-P is the problem, for a student, of selecting for a specific time range a sequence of courses to acquire the necessary skills, competencies, behaviors, attitudes, abilities, or knowledge such that he/she becomes qualified for his/her objective (most often the objective being a professional one), while these elements being subject to a decay effect. This decay effect makes it more complex to plan a coherent sequence of courses as the student may no longer be qualified to attend specific courses when academic terms are distant. We illustrate FIC/DK-P in Fig. 1. In the following subsection, we model the components of FIC/DK-P.

Modeling Knowledge and Mastery

The essence of the complexity of FIC/DK-P lies in the order in which skills, competencies, behaviors, attitudes, abilities, or knowledge are learned during courses, to what extent, and how they evolve throughout a curriculum. As multiple definitions of these properties exist, we define a surjective mapping of the mastery whether it be of skill, competency, behavior, attitude, ability, or knowledge into a set, that we call Knowledge for convenience.^{Footnote 1} A member of this set encodes the mastery information continuously between [0; 1] for a specific knowledge, where 0 signifies that the corresponding knowledge has not been encountered by a student and 1 indicates complete mastery of the knowledge. Such a set allows for a generic representation of mastery and can be used in vast educational situations and paradigms, such as in more classic learning Bloom et al. (1956); Mandin and Guin (2014), constructivism learning Bada and Olusegun (2015) or with works considering knowledge mastery as a binary property Huang et al. (2020). This set only requires from institutions to agree on a knowledge decomposition according to their epistemological, didactic and/or practical standpoints and agree on the mapping of their knowledge graduation into our interval, which can be quite straightforward (e.g. dividing the [0; 1] interval by the number of possible grades assignable to a student for each knowledge).

Modeling Courses

Courses are the basic building blocks of curriculum, as they can be considered the main vector of knowledge Hill et al. (2005) for students. Consequently, to keep our modeling generic, we consider a course as a macro entity that provides knowledge that can potentially be aligned with any learning material level and hierarchy Duval and Hodgins (2003). Each course is considered to mobilize at least one knowledge to the student; we did not define an upper bound about the number of knowledge that can exist in a course as we did not find a formal threshold in the literature. The amount of knowledge taught by a course is expressed as a mastery value.

Additionally, courses can also have prerequisites. The importance of prerequisites in a curriculum is highlighted in works such as Molontay et al. (2020). Prerequisites ensure that a student has the minimum background to fully acquire the knowledge provided by the course. Attending a course without meeting all the prerequisites can pose risks in a student’s educational pathway, such as failure, cognitive overload, and increased stress. This is an important decision factor regarding the quality of a curriculum. We express these prerequisites as a knowledge mastery value threshold, meaning that a student should have more, or at least equal, knowledge mastery to guarantee success in the concerned courses.

Another property of courses is their temporal availability. Typically, courses within educational institutions adhere to specific scheduling constraints, operating during designated academic terms for various reasons. To capture this characteristic generically, each course is associated with a set of temporal availability terms, designated as academic terms.

Furthermore, we take into account the attendance and the involvement of students with the courses they take. We define a notion of credit, that works both with the American credit system and the European Credit Transfer and Accumulation (ECTS) standard Herrero and Algarrada (2010). Consequently, each course is assigned a credit value which is earned by the student at the end of the course. One can consider a specific threshold of credit value for a student to graduate (e.g. 180 ECTS, which represents a bachelor’s degree).

Modeling Curriculum and Student’s Objective

A fully individualized curriculum is a sequence of courses that spans over one or more academic terms. Each of these academic terms is designed to accommodate a dedicated number of courses, and this number can be different for each academic term. This implies that a student is limited in the number of courses he/she can take both in an academic term and his/her entire curriculum. This limit is theoretically different from one institution to another, which makes the computational nature of the FIC/DK-P more complex. In addition, please remember that, via our third hypothesis, a course cannot overlap two academic terms.

Furthermore, a curriculum should be designed to qualify a student for his/her objective – most often a professional one. Consequently, we had to model the objective of a student. Again, for generic purposes, we based the modeling of student objectives on our knowledge representation. An objective is therefore expressed as a set of knowledge mastery values, indicating which knowledge is expected and to what extent. One can see these objective mastery values as the final requisites of the entire curriculum. In real-life scenarios, the identification of these final requisites will most likely be conducted by the institutions themselves, especially by collaborating with the professional world.

Ideally, the sequence of courses should be defined so that no prerequisites are missed at any time. We consider such a sequence as a good fully individualized curriculum. We also consider that a curriculum only serves the purpose of only one student objective at a time – yet the objective can hold any amount of knowledge.

Modeling Student Profile and Decay

What sets FIC/DK-P apart in the literature dedicated to learning path personalization is its consideration of the decay of knowledge over time when formulating individualized curricula for students. This consideration aligns with findings in educational psychology, underscoring the significance of this factor in students’ educational experiences and their reception of course materials. Several works Arthur et al. (1998); Ebbinghaus (2013); Heller et al. (2006) shown that the mastery of knowledge is not stationary in time: it can decrease when the knowledge is not used over a certain period, and vice-versa, according to the pedagogical context. Predicting such variations is an important challenge as it can greatly improve the learning experience of the students, as shown by the works related to the spaced repetition system (SRS) Settles and Meeder (2016) – even if monitoring the mobilization of knowledge outside a pedagogical context is difficult. Yet, it also adds complexity to the curriculum design process. As the knowledge acquired by a student during a specific academic term can diminish over time, it may reach a point where some of the prerequisites for future courses in subsequent academic terms are no longer fulfilled. Therefore, it becomes essential to predict and mitigate this decay effect to determine the optimal sequence of courses that ensures all prerequisites and the requisites of the objective are satisfied for the student, thus diminishing the risk of failure of the student.

We model the decay of knowledge, given a student, as a function of the elapsed time since the last time this knowledge has been learned. The codomain of the function is [0; 1], representing the amount (i.e. mastery value) of the knowledge lost during this period. This function impacts how the mastery evolve throughout the curriculum. The explicit function’s mapping should be defined according to one’s psychological standing of the decay, for example by using Ebbinghaus’ forgetting curve Ebbinghaus (2013) .

In accordance with prior research works such as Howe (1980), learning is commonly regarded as a cumulative process. In our model, the evolution of mastery is characterized by the accumulation^{Footnote 2} of previous mastery levels and the acquisition of additional knowledge mastery from relevant courses. Alternatively, mastery may experience decay if the acquired knowledge is not utilized during the academic term. Thus, to predict the mastery of a knowledge for a student in an academic year, the amount of decay is subtracted from the accumulated mastery value of this knowledge. Each knowledge within the framework may possess a unique decay function tailored to its characteristics. Additionally, a decay function can change based on various properties such as time or specific knowledge thresholds in order to capture the notion that certain knowledge becomes more resistant to forgetting over time (e.g. riding a bicycle once learned). We give two examples in the Fig. 2 regarding the evolution of the mastery of a knowledge according to a decay function. In our model, it is possible to observe a mastery overflow if, after attending a course, the mastery should be greater than 1 (see Fig. 2b). In that case, the value of one’s knowledge is expressed as $min(1,m_{k,t})$.^{Footnote 3}

Finally, we introduced the concept of student’s profile. At each academic term, knowledge mastery values of a student are stored. In addition, the profile of a student is also composed of a set of decay functions concerning each knowledge. Indeed, knowledge could face differences regarding how they are forgotten by a student: this phenomenon is strongly dependent on the student Brewer and Unsworth (2012); Mozer and Lindsey (2016). These decay functions may also change over time. This allows us to fine-tune the prediction of forgetting if needed, and the individualization: given two students having a different profile but the same objective, the best fully individualized curriculum will potentially be different.

Experimentation

In this section, we outline two experiments that were conducted to address our problem: one utilizing an exact method and the other employing a meta-heuristic approach. A meta-heuristic is an agnostic problem-solving strategy designed to find approximate solutions across a wide range of optimization problems – for a comprehensive view on the subject please refer to Sörensen (2015). The objectives of these experiments were to gain a deeper understanding of the problem, explore its complexity, assess the impact of decay on problem difficulty, establish initial benchmarks for the research community, and derive preliminary recommendations based on our findings. Before presenting the experiments, we provide a comprehensive overview of the experimental context.

Experimental Context

Academic Background

Below, we present the setup we used for instantiating FIC/DK-P from the presented model. The problem assumption is inspired by the academic background of a French engineering school. Since FIC/DK-P modeling is generic, it allows for flexible assumptions to accommodate various backgrounds and requirements.

Assumption 1

An academic year is divided into two academic terms, also known as semesters. Therefore, a five-year curriculum consists of ten academic terms.

Assumption 2

An academic term should always bring to the student 30 credits once completed. These 30 credits represent the ECTS credits earned by students.

Assumption 3

It is not possible for a student to take the same course more than once during his/her entire curriculum.

Assumption 4

We consider the following epistemological model for the decay function $\delta $, inspired by the works done in the neuroscience field Averell and Heathcote (2011); Ebbinghaus (2013):

$$\begin{aligned} \delta (t) = \frac{e^{\frac{t}{s}}+5}{100} \end{aligned}$$

(1)

with t representing the difference between the last time a student saw this knowledge and the current academic term. This function illustrates that, the less a student uses one of his/her knowledge, the greater he/she forgets it. The function was designed mainly for an academic curriculum of 3 and 5 years with two academic terms by year; for any other duration, one should modify the coefficient of memory stability s introduced by Ebbinghaus (here, $s=2$).

It implies that the decay function associated with each knowledge in a student’s profile is the same: each knowledge will develop in the same way.^{Footnote 4}

Assumption 5

There is no decay regarding the mastery of a knowledge within an academic term (here, a semester). This can be expressed as $\delta (0)=0$.

Assumption 6

The student for whom we solve FIC/DK-P has no prior knowledge at the beginning of his/her curriculum. That means every knowledge mastery that composes his/her profile is set to 0. Note that, in real-life applications, it is more than likely that some of his/her knowledge mastery will be different from zero.

Data Generation Background

To the best of our knowledge, no sufficiently comprehensive public catalog of courses exists in our community, at least publicly. This is arguably because the elaboration of such catalogs is an important task for both teachers and institutional stakeholders: each course must be properly described, as well as all its properties. In the absence of strong incentives, their creation seems not to have been a priority. For example, in France, the Commission des Titres d’Ingénieurs (CTI) recently acts for the creation of a detailed syllabus for each course of engineering schools, describing the knowledge taught to the students and the adoption of an approach by competencies Commission des Titres d’Ingénieurs (2023): these institutions have started the elaboration of some kind of course catalog alongside a knowledge catalog.

It is in this context that we produced our datasets for the following two experiments. Having at our disposal the syllabuses of courses taught during the two years of a master’s degree at IMT Nord Europe, a French engineering school, we identified the number of courses available, their temporal availability, the number of knowledge taught in each course, the prerequisites for each one of them, the overall distributions of knowledge and how many courses at average a student should attend. Nonetheless, some information was missing from these syllabuses and we had to presuppose them. This was the case for the knowledge mastery that each course brings to a student. To approximate this value, we use our prior knowledge of the courses we knew and contact some of the referees of the other courses.

Thus, we have extrapolated the gathered data to define a data generation model that is representative of a five-year curriculum to produce simulated datasets. The size of the course catalog $\mathcal {C}$ was set in between [300; 500] courses, and the size of the catalog of knowledge $\mathcal {K}$ was set in between [200; 600]. The maximal amount of prerequisites a course can have $\mathcal {P}$ was set in between [4; 5] and the maximal amount of knowledge taught by a course $\mathcal {T}$ was set up to 5. The selection of knowledge taught and used as prerequisites followed a uniform distribution and the mastery $\mathcal {M}$ was set in between [0.25; 0.75]. Finally, we set the number of courses $\mathcal {S}$ a student has to choose at each academic term in between [10; 20].

A dataset can therefore be expressed as the combination of $\langle \mathcal {C}, \mathcal {K}, \mathcal {T}, \mathcal {P},\mathcal {S}, \mathcal {M}\rangle $, plus the random seed used during computation. We produce, for the same configuration, 30 different datasets by changing the problem seed. To reduce the combinatoric, we select $\mathcal {C}$ in increments of 10, $\mathcal {K}$ in increments of 50 and $\mathcal {S}$ in increments of 1. By doing so, we have produced 124740 different instances of the problem (which is roughly equal to 15 GB of experimental data).

Even if our studies are based on simulated data, we have made them as close as possible to real-life scenarios. Nonetheless, we had no prior knowledge that, given a student profile and an objective, a solution whether exists or not – since this relates to directly solving FIC/DK-P. Please note that there are also some biases using simulated data (e.g. the effect of the distribution used), yet this was an essential first step in order to study FIC/DK-P. We are currently working to elaborate on a real problem instance that could be shared with the community.

We also had to randomly generate the student objective. Essentially, it is defined according to the available dataset: the prerequisites and objective requisites are taken from $\mathcal {K}$, the set of all knowledge that can be taught in the institution. The amount of prerequisites and requisites was set in between [2; 4] and the expected mastery of each of these was picked in between [0.5; 0.95]. We attempted to convey that students should be fairly proficient in the knowledge essential for their future employment.

Exact Solving Method

In this section, we present our experiment implementing an exact method to find the best curriculum, according to an initial student profile and a student objective. One goal of this experimentation was to study how hard the problem of fully individualizing a curriculum is and to identify the moment that educational stakeholders should be assisted in the customization task of curricula. The experimental results tend to show that FIC/DK-P exhibits an important combinatorial explosion, making this problem probably not suited for 1) exact methods – even considering a small set of courses, and for 2) educational stakeholders to manually solve this problem.

An exact method for solving FIC/DK-P consists in finding at least one complete assignment (i.e. a sequence of courses chosen from the catalog of courses) that satisfies all the constraints entailed by our problem. As a reminder, these constraints are: 1) all the courses in the sequence must be different; 2) each course prerequisite must be validated, as well as objective requisites ; 3) a course can only be taken when it is available; 4) the number of courses in an academic term must be equal to the theoretical value used; 5) each academic term must bring at least 30 ECTS.

We do not speak of the quality of a solution nor the optimality of a solution, as they should be left to the discretion of the pedagogical stakeholders. Is a solution maximizing the mastery values of knowledge required by an objective to be considered better than one which maximally diversifies the knowledge seen by a student? We do not know.

Experimental Setup

Software

We implemented in prolog language two constraints-based search algorithms (i.e. solvers) to solve FIC/DK-P. Algorithmic details as well as implementation are available in Appendix B. The first solver – Solver 1 – evaluates the validity of a solution after its full assignment. We suppose that the behavior of Solver 1 is somewhat representative of an educational stakeholder’s behavior trying to solve FIC/DK-P: the evaluation of the solution will be carried out at the end of a full assignment for the sake of convenience.

Yet, we face an important combinatorial explosion that forced us to drastically reduce the dimensions of the datasets used. At the end of the experiment, we used $\mathcal {C} \in [24;100]$, $\mathcal {K} \in \{10;20;30\}$ and $\mathcal {S} \in \{2;3\}$, with the objective of the student expressed as 2 requisites and the number of academic terms T being equal to 6 or 10: above these parameters the problem becomes intractable, exceeding the time limit of 12 hours we fixed (considering that the problem is supposed to be solved for several hundred or thousand students in a real context). Thus, we designed a second solver – Solver 2 – implementing a search heuristic based on the decay prediction (see Appendix B, Algorithm 2). This heuristic does not prune by itself any path of the exploration tree (it does not prevent forward chaining and backward chaining)^{Footnote 5}; it prioritizes courses that maintain the student’s mastery of the learning objective at a level above or equal to the expected value when subjected to the decay effect.

To summarize, Solver 1 is a full exact method that evaluates implicitly all the possible solutions. Solver 2 is built on Solver 1 and uses the heuristic as a predictive model to prioritize some courses over others during the search.

In our different instances configuration, the search always starts at the beginning of the first academic term of the first academic year, and the student profile has all its mastery scores set to zero.

Hardware

This experiment was conducted on a 2.0 GHz Intel i5-8250 laptop with 8 GB of RAM. In the following, we considered logical inference (LI) made by the solvers, rather than time spent, because of more scalable and representative information regarding the computation force required to solve the problem.^{Footnote 6}

Results

The Fig. 3 presents the experimental results obtained from our exact solving attempts. The y-axis is logarithmic and represents the logical inferences (LI) made by the solver for solving the problem. The x-axis represents the number of courses available for each academic term.

First of all, let us note that regarding the LI made, we obtain better results in terms of computation time for all the observed cases with Solver 2 which uses the decay as a selection heuristic than Solver 1. This observation leads that pedagogical models could be useful to design efficient selection heuristics and reduce the computational time of a problem.

In Fig. 3 a), b) and c), we are solving the problem for a bachelor curriculum ($T=6$), where two courses are attended by the student at each semester ($\mathcal {S}=2$). As we increase the pool of courses available at each semester N, we quickly observe a combinatorial explosion: around $N=15$, FIC/DK-P becomes generally not tractable in a reasonable time. We also vary the pool of available knowledge: $\mathcal {K}=10$ for a), $\mathcal {K}=20$ for b) and $\mathcal {K}=30$ for c). Interestingly, the configuration used for b) appears to make the problem more difficult than the configuration used for c). Additionally, in a), we can observe a strong advantage towards Solver 2: it finds an answer to the problem for $N=12$ whereas Solver 1 cannot solve these instances under 12 hours.

In d), we compare similarly configured instances ($\mathcal {S}=2$, $\mathcal {K}=10$) over two different durations: a full bachelor degree ($T=6$) and a full bachelor and master degree ($T=10$). It can be observed that the number of academic terms has a nonnegligible effect regarding the overall tractability of the problem: for $T=10$ the two solvers could not solve the problem above $N=10$ (see the doted plots). Thus it appears that when the number of academic terms increases, so does the difficulty of the problem.

In e), we study the effect of the number of courses $\mathcal {S}$ to be taken each semester. The results are unequivocal: the more $\mathcal {S}$ increases, the more the problem is difficult. For $\mathcal {S}=3$, the problem becomes not tractable in a reasonable time for $N=8$. For $\mathcal {S}=4$ (not plotted), the solver could not solve the problem when $N=7$. It appears that $\mathcal {S}$ has also an important effect on the combinatorics of FIC/DK-P as one would anticipate, maybe more important than T.

Overall, despite the small scale of our experiment, the results show that FIC/DK-P is a very difficult problem to solve, highly demanding computation-wise. Some parameters, such as $\mathcal {S}$ and T, seem to have a strong effect on the complexity of the problem. Additionally, our results are further evidence of the difficulty that educational stakeholders will face in addressing this problem in real-life scenarios. Furthermore, we are inclined to discourage the utilization of exact methods alone to solve FIC/DK-P, unless accompanied by robust heuristics capable of efficiently exploring the state space. Nonetheless, these first results have been useful to establish some prescriptive recommendations for the educational stakeholders, which are discussed in Section 5.

Meta-Heuristic Solving

As we have not been able to scale up to real case scenarios while using an exact solving method, we decided to further study the problem using a meta-heuristic approach. The objective of this experiment was not to find an exact solution, namely a sequence of courses for which all the constraints are satisfied, but good enough solutions where the constraints are violated as little as possible. Considering the extensive use of genetic algorithm (GA) in the e-learning literature to solve problems and its efficiency, such as arranging and delivering e-learning materials Al-Muhaideb and Menai (2011); Benmesbah et al. (2021), we developed a GA to solve FIC/DK-P. By doing so, we provide the very first benchmarks and insights to the community for the full-scale problem, being as typical as real-life scenarios. We hope that these contributions will give researchers in our community a solid foundation for developing new, more efficient algorithms. Before presenting our experimental results, we present the GA parametrization for the sake of reproducibility and discussion.

GA Parametrization and Reproducibility

GA is known to be multi-parametric: a parameter’s value can have a substantial effect on the quality of a solution Eiben et al. (2003). Yet, identifying the best configurations is computationally intensive (e.g. identifying good crossover, mutation, and tournament combinations), and most of the time the values are chosen empirically Eiben et al. (2003). Below we discuss the parametrization of our GA.

Problem Representation

In our implementation, we opted for a widely used representation in e-Learning, especially in learning object recommendation, which is the integer chromosome encoding with fixed length da Silva Lopes et al. (2009); De-Marcos et al. (2009); Al-Muhaideb and Menai (2011); Benmesbah et al. (2021). A single individual represents a curriculum. Each individual’s gene represents a specific course. The size of the genome of an individual equals the quantity of the overall number of courses a student will attend during its entire curriculum (which is the number of academic terms T multiplied by the number of courses that must be attended at each academic term $\mathcal {S}$). The order of the genes within an individual is discretized by an academic term that represents the succession of the courses. Each course encodes its prerequisites, the knowledge it teaches as well as the credit value it is worth. Figure 4 illustrates an individual in our implementation.

In our GA, the initial population is pseudo-randomly generated. Instead of randomly picking a course in the entire catalog for each gene of each individual we create, we verify that 1) a course can effectively be taken in the academic term it is planned and 2) all the courses are different. This integrity check is computationally straightforward and dramatically improves the overall quality of the initial population, making a more efficient convergence. The population size of each generation was set to 100 as it appears to be a good compromise between exploration and computation efficiency; increasing the size could potentially improve the likelihood of finding better solutions for learning paths but at the expense of a higher computational cost^{Footnote 7} Chen (2009).

During the computation of the next generation of individuals, some individual inconsistencies may happen due to the stochastic nature of GA. In such a case, either we re-generate the ill-formed individual with a probability of $p=0.75$ or we replace the faulty course(s) with a valid one with a probability of $p^{'}=1-p$. GA does not guarantee to reach an optimal solution during the generational process. To stop it, we used a common disjunction of case Eiben et al. (2003); Samia and Mostafa (2007); De-Marcos et al. (2009): either reaching a fitness threshold of 0, which means that all the constraints are successfully passed, or reaching the maximum number of generations. We empirically select 10000 generations as a maximum, as we notice strong convergence from the individuals around $8\times 10^{3}$ generations.

Fitness

The fitness function f expresses the quality of an individual. It is based on four $\nu _i \in [0;1]$ metrics. $\nu _1$ expresses the difference of credits between the amount expected and the amount obtained at each academic term. $\nu _2$ expresses the amount of mastery lacking to entirely match the requisites of the student objective. $\nu _3$ expresses the quantity of misallocated courses. $\nu _4$ expresses the amount of mastery lacking to match each of the prerequisites of courses at each academic term. When all the metrics are maxed, $f(x)=0$, meaning that the individual fully passes all the constraints. When $f(x) = 4$, each constraint is violated. f is defined as:

$$\begin{aligned} f=\sum _{i=1}^{4}\nu _{i} \end{aligned}$$

(2)

Crossover, Mutation and Tournament selection

GA is driven by three important operators: tournament, crossover, and mutation. A tournament is the selection of the individuals that will contribute to the new individuals of the next generation. Crossover is the creation of new offspring from the combination of two selected individuals. The mutation is the modification of an individual genotype to introduce some noise in the population. All of these operators are known to have a significant impact on the quality of the solutions found. Consequently, we led several upstream experiments to empirically select the most effective rates.

We used a generational replacement strategy Hovakimyan et al. (2004) coupled with parsimonious elitism by always selecting the best individual to prevent the eventual loss of the best solution. This strategy is driven by a deterministic tournament selection, mostly because it is efficient to code and allows the selection pressure to be easily adjusted Miller et al. (1995). We empirically chose a tournament size of $\tau =2$.

We implement a one-point crossover operation that produces two new individuals: this is a common operation in our domain Hovakimyan et al. (2004); da Silva Lopes et al. (2009). We empirically chose a crossover rate of $X=0.75$. As for mutation, we implement a simple binary mutation operation that changes, for the concerned individual, one of its courses to another one from the course catalog. This binary mutation is coupled with an integrity check regarding the course to pick: we randomly select a course having all its prerequisites met thanks to the previous gene if any such a course exists, otherwise we randomly pick one from the entire catalog. We empirically chose a mutation rate of $M=0.75$.