A Dynamic Mining Algorithm for Multi-granularity User’s Learning Preference Based on Ant Colony Optimization

Liu, Shengjun; Chen, Shengbing; Meng, Hu

doi:10.1007/978-3-319-68121-4_14


Conference paper
First Online: 27 September 2017


Part of the book series: IFIP Advances in Information and Communication Technology (IFIPAICT, volume 510)

Abstract

Mining users’ learning preferences is one of the key issues in personalized online learning systems and is of great significance for modern educational technology. In this paper, using the hierarchical characteristics of the knowledge points in a course domain, we define the equivalence relation and equivalence classes of knowledge points, and the structure of the knowledge point quotient space. The functions of support, pheromone concentration and preference are then defined on each level, and an improved ant colony optimization is proposed to handle the multi-granularity data structure of the quotient space. An algorithm for multi-granularity Learning Preference Mining based on Ant Colony Optimization (ACO-LPM) is proposed to address the problems of too many knowledge points and too little user test data in online personalized learning systems. Because the pheromone evaporates dynamically, the preference patterns mined by ACO-LPM change in real time as the user’s interests change. The experimental results show that the algorithm can mine users’ learning preferences in an online learning system effectively and efficiently.


Online learning, which breaks through the constraints of time and space, provides a convenient and efficient learning platform for learners. It has become an important means of modern education because of three characteristics: varied learning modalities, multiple teacher and student roles, and rich learning resources. Personalized learning is a hotspot of online learning; it aims to let learners achieve the best learning effect and the best learning experience in the minimum time, according to their learning characteristics and preference model. Researchers have done a lot of work in this domain. Jiunn used a neural network to analyze students’ online browsing behavior and obtain their learning styles and learning preferences [1]. Du studied the association between learners’ personality traits and learning behaviors, and used a data mining algorithm to obtain a learner behavior model [2]. Qiu employed the Solomon learning style scale for a pre-test and obtained a user interest model by mining the user’s learning history [3]. Lin and Yan studied news recommendation in the mobile network environment, constructed keyword vectors using the spatial model, and clustered documents by degree of similarity to obtain the gravity vector of each document cluster and build the user preference model [4, 5]. Ren put forward a U-I-C user interest model, which obtains scenario-specific user preferences by adding scene information to the user-project matrix [6]. Wang put forward an idea and method for extracting user preferences based on ontology and labels [7]. Wolfgang studied the scenario information of users in the mobile environment and found user preference information using a collaborative filtering algorithm [8]. Chen presented a logistic curve model and a hyperbolic model to analyze user behavior, and proposed a user preference model based on a multi-vector tree [9]. Pazzani used expected information gain to analyze users’ annotations while they browsed pages and obtain their interest preferences [10]. Adomavicius mined users’ individual access records to construct a user model using association rules and the users’ personal information [11]. These methods have greatly improved the efficiency of mining users’ preferences in different backgrounds and applications. However, because knowledge points are massive in number and test data are few, the problem of mining learning preferences on knowledge points has not yet been solved and has become a hot issue.

Granular computing reduces the complexity of a solution. It is inspired by the human ability to analyze complex problems at different levels and solve them at an appropriate granularity [12]. In recent years, granular computing methods (such as quotient space, rough set and fuzzy set) have been successfully applied to complex problems in many fields, such as industrial control, transportation, graphic and image processing, decision support, and biological information [13]. Because it takes all the relevant factors into account (the universe, structure, projection, etc.), quotient space can meet the needs of an online learning system, such as describing the knowledge domain and analyzing dependencies. In this paper, we use quotient space theory to explore a dynamic mining algorithm for users’ learning preferences based on Ant Colony Optimization.

1 Quotient Space Structure of Knowledge Points

In an online learning system, each knowledge point corresponds to a concept from the domain knowledge ontology, and the knowledge points have a hierarchical structure and complex dependencies. As shown in Fig. 1, there is an inclusion relation between knowledge points on different levels. On the same level, knowledge points have three relationships: pre-order, brotherhood, and equivalence. A knowledge point K is defined as follows:

Definition 1:

A knowledge point K is a triple (C, T, f), where C is the concept corresponding to K; T is the topology of the relationships between knowledge points, which in the online learning system mainly consists of the inclusion and pre-order structures; and f is the property of the knowledge.

Definition 2:

If learning the knowledge point K_i requires the knowledge point K_j, then K_j is pre-order knowledge (also known as preparatory knowledge) of K_i, expressed as K_j < K_i.

Definition 3:

If K_u is the upper-level knowledge point of K_i and K_j, with C_u ⊇ C_i + C_j and T_u ⊇ T_i + T_j, then K_u contains K_i and K_j; K_i and K_j are in a brotherhood relation, and K_u is the father knowledge point of K_i and K_j.

Since K_u is not on the same level as K_i and K_j, the internal relationship between K_i and K_j naturally disappears on the upper level, so T_u ⊇ T_i + T_j is an operation on the level of the parent node K_u.

From the above definitions, if the brotherhood relation between knowledge points is denoted by R, then R is reflexive, symmetric and transitive; that is, R is an equivalence relation. Using R to construct equivalence classes, the new triples ([C], [T], [f]) form a knowledge space of larger granularity, in which each class is identified by its parent knowledge point.

The equivalence class mapping p: (C, T) → ([C], [T]) is the continuous natural projection of the knowledge point concept space. It therefore satisfies the false warranty principle and the fidelity principle (known in quotient space theory as the falsity-preserving and truth-preserving principles), defined as follows:

1. False warranty principle: If a problem has no solution in the quotient space, it has no solution in any finer space either.
2. Fidelity principle: If a problem has a solution in each of the quotient spaces {C1, T1, f1} and {C2, T2, f2}, then it has a solution in their synthetic quotient space {C3, T3, f3}.

Using the false warranty principle and the fidelity principle, we can mine users’ preferences on knowledge points at different granularities. Merging equivalent knowledge points reduces the number of knowledge points. At the same time, it alleviates the data sparsity problem, because each equivalence class aggregates the data of its members.
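To make the sparsity argument concrete, here is a minimal Python sketch of the projection onto a coarser level: leaf knowledge points are grouped by their parent and their access counts are summed. The identifiers, the parent map and the counts are hypothetical and do not come from the paper's data.

```python
from collections import defaultdict

# Hypothetical parent map: leaf knowledge point -> parent (its equivalence class).
PARENT = {
    "K01": "Algebra",
    "K02": "Algebra",
    "K03": "Geometry",
    "K07": "Geometry",
}

# Hypothetical, sparse access counts observed on the leaves.
leaf_counts = {"K01": 2, "K02": 1, "K03": 1, "K07": 0}


def project_counts(counts, parent):
    """Sum the counts of all members of each equivalence class."""
    coarse = defaultdict(int)
    for leaf, n in counts.items():
        coarse[parent[leaf]] += n
    return dict(coarse)


coarse_counts = project_counts(leaf_counts, PARENT)
print(coarse_counts)
```

In this toy case four sparsely observed leaves collapse into two classes, each of which carries the pooled observations of its members.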

The knowledge point space constructed by the brotherhood relation R is shown in Fig. 2. The top is the root node, and its child nodes are called inner nodes (each corresponding to an equivalence class constructed by R). An inner node can contain other inner nodes (finer equivalence classes). The bottom level consists of leaf nodes, which correspond to specific knowledge points. We call the child nodes of the root the 1st level, the child nodes of the 1st-level nodes the 2nd level, and so on; the specific knowledge points form the n-th level.

In the quotient space structure shown in Fig. 2, each level is a set of equivalence classes of a certain granularity. When a user studies a leaf node (a specific knowledge point), each inner node on the path to it is considered to be accessed.
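The rule that studying a leaf also counts as an access to every inner node above it can be sketched as a walk up the tree. The tree below is a hypothetical example, and representing it as a child-to-parent dictionary is an assumption made only for illustration.

```python
# Hypothetical tree stored as child -> parent; "ROOT" has no parent entry.
PARENT = {
    "Math": "ROOT",
    "Algebra": "Math",
    "Geometry": "Math",
    "K01": "Algebra",
    "K02": "Algebra",
    "K03": "Geometry",
}


def accessed_nodes(leaf, parent):
    """Return the leaf and every inner node on its path to the root (root excluded)."""
    path = []
    node = leaf
    while node in parent:  # stops at ROOT, which has no parent entry
        path.append(node)
        node = parent[node]
    return path


print(accessed_nodes("K01", PARENT))
```

Each node returned would have its visit counter incremented, so one learning action contributes evidence on every granularity level at once.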

2 Functions Definition of Multi-granularity Ant Colony Optimization

2.1 Ant Colony Optimization

Ant Colony Optimization (ACO) was proposed by Marco Dorigo, who was inspired by the foraging process of natural ant colonies. Ants can find the shortest path from a food source to the nest without visual cues. During foraging, ants release pheromone in an amount proportional to the quality of the food source, and they also sense the pheromone left by other ants. Ants tend to move in the direction of higher pheromone concentration. The group behavior of the colony thus shows positive feedback: the more ants travel on a path, the more likely later ants are to choose it, so a path of good quality and short distance attracts more ants and its pheromone concentration grows faster [14].

2.2 Functions Definition of ACO in Multi-granularity Data

In order to apply ACO to the mining of hierarchically structured, multi-granularity knowledge points, the support, pheromone concentration and preference functions of ACO are defined as follows:

Definition 4:

The support η_l represents the probability that the user accesses path l, expressed as the access frequency of the path.

Let K_ij and K_im be two knowledge point (equivalence class) nodes on the i-th level, and let n_i be the number of nodes on the i-th level. Path l(i, j, m) denotes a path from node K_ij to node K_im, and C_l(i, j, m) is the number of times the user visits path l(i, j, m). Then, for the hierarchical interest pattern, the support η_l(i, j, m) is:

$$ \eta_{l}(i,j,m) = \frac{C_{l}(i,j,m)}{\sum\nolimits_{k=1}^{n_{i}} C_{l}(i,j,k)} $$

(1)
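As a small illustration of formula (1), the sketch below computes the support of a path on one fixed level from a table of visit counts. The dictionary layout, the function name and the counts are assumptions chosen for the example, and returning zero for a node that was never left is likewise an assumption.

```python
def support(path_counts, j, m):
    """Formula (1): share of the visits leaving node j that go to node m.

    path_counts maps (j, k) to the visit count C_l(i, j, k) for one fixed level i.
    """
    total = sum(c for (src, _), c in path_counts.items() if src == j)
    if total == 0:
        return 0.0  # assumption: a node with no recorded visits has zero support
    return path_counts.get((j, m), 0) / total


# Hypothetical visit counts on one level.
counts = {("A", "B"): 3, ("A", "C"): 1, ("B", "C"): 2}
print(support(counts, "A", "B"))  # 3 / (3 + 1)
```

By construction the supports of all paths leaving the same node sum to 1, which is what allows them to be read as access probabilities.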

Definition 5:

The pheromone concentration τ_l(t) represents the user’s interest in accessing path l. It fades with time and increases when the user accesses path l.

$$ \tau_{l}(t+1) = (1-\rho)\,\tau_{l}(t) + \delta\,\Delta\tau_{l} $$

(2)

In formula (2), ρ is the pheromone volatilization coefficient, δ is the adjustment coefficient of the pheromone concentration increment, and Δτ_l is the pheromone increment, given by:

$$ \Delta\tau_{l} = \frac{\tau_{l}(t) \times t + F_{b}}{t - 1} - \tau_{l}(t) $$

(3)

In formula (3), F_b is the feedback value of the user’s access to the node at time t + 1.
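Formulas (2) and (3) can be read as a single update step, sketched below in Python. Formula (3) is implemented exactly as printed, so it is undefined at t = 1 and the sketch rejects that case. Treating a step without access as a step with zero increment is an assumption, since the text only says that pheromone fades when a path is not visited. The function names are hypothetical.

```python
def delta_tau(tau, t, feedback):
    """Formula (3) as printed; it divides by t - 1, so t must exceed 1."""
    if t <= 1:
        raise ValueError("formula (3) is undefined for t <= 1")
    return (tau * t + feedback) / (t - 1) - tau


def update_pheromone(tau, t, rho, delta, feedback=None):
    """Formula (2). Assumption: no access at t + 1 means a zero increment."""
    deposit = 0.0 if feedback is None else delta_tau(tau, t, feedback)
    return (1.0 - rho) * tau + delta * deposit


tau = 1.0
tau_accessed = update_pheromone(tau, t=2, rho=0.15, delta=0.3, feedback=1.0)
tau_idle = update_pheromone(tau, t=2, rho=0.15, delta=0.3)
print(tau_accessed, tau_idle)
```

With these inputs the accessed path gains pheromone while the idle path only evaporates by the factor 1 − ρ, which is the behavior Definition 5 describes.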

Definition 6:

The preference function P_l(t) represents the user’s preference for path l and combines two factors: pheromone and support. Let τ_w(t) and η_w denote the pheromone and support of the w-th path. The preference function P_l(t) is then defined as:

$$ P_{l}(t) = \frac{[\tau_{l}(t)]^{\alpha} \cdot [\eta_{l}]^{\beta}}{\sum\limits_{w=1}^{n} [\tau_{w}(t)]^{\alpha} \cdot [\eta_{w}]^{\beta}} $$

(4)

In formula (4), n is the total number of paths on this level, and the exponents α and β weight the relative influence of pheromone and support. Given a threshold for the preference function, checking the value of P for every path node with (4) yields the user’s learning preferences on multi-granularity knowledge points.
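The following sketch evaluates formula (4) for all paths on one level and then applies a threshold. The default exponents α = 1 and β = 0.5 are the values reported in the experiments of Section 4; the pheromone values, supports, threshold and function names are hypothetical, and returning all zeros when no path has any weight is an assumption.

```python
def preferences(tau, eta, alpha=1.0, beta=0.5):
    """Formula (4): normalized tau^alpha * eta^beta over all paths of one level."""
    weights = {l: (tau[l] ** alpha) * (eta[l] ** beta) for l in tau}
    total = sum(weights.values())
    if total == 0:
        return {l: 0.0 for l in tau}  # assumption: no evidence means no preference
    return {l: w / total for l, w in weights.items()}


def preferred_paths(tau, eta, threshold, alpha=1.0, beta=0.5):
    """Keep the paths whose preference value reaches the threshold."""
    p = preferences(tau, eta, alpha, beta)
    return [l for l, value in p.items() if value >= threshold]


# Hypothetical pheromone and support values for three paths on one level.
tau = {"l1": 2.0, "l2": 1.0, "l3": 0.0}
eta = {"l1": 0.25, "l2": 0.25, "l3": 0.5}
print(preferences(tau, eta))
print(preferred_paths(tau, eta, threshold=0.5))
```

Path l3 has the highest support but no pheromone, so its preference is zero: the multiplicative form means a path must score on both factors to be selected.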

3 Dynamic Mining Algorithm for Multi-granularity Learning Preferences

In this paper, we use the dynamic volatility of pheromone to dynamically mine learning preferences on multi-granularity knowledge points. The main idea is that a user’s learning activity corresponds to an ant’s foraging behavior, and the process of learning knowledge points corresponds to the ant’s foraging cycle. All of the users’ learning actions are recorded in log files. Based on these log files, we use ACO to compute the preference function values of the path nodes formed by all knowledge points, dynamically mine each user’s learning preferences for each knowledge point, and then offer users the content they need for further study. The Multi-granularity Learning Preference Mining algorithm based on Ant Colony Optimization (ACO-LPM) is described as follows:

In step 4 of the above algorithm, Tree.High represents the height of the tree Tree.

Time complexity of the algorithm: according to the definition of the quotient space structure, if each equivalence class consists of m elements on average, then in the multi-granularity knowledge quotient space the number of nodes on an upper level is m² times smaller than the number of nodes on the level below it. The time complexity of standard ACO is O(I * N² * k), where I is the number of iterations, N is the number of vertices, and k is the number of ants in the colony. The time complexity on the i-th level is therefore O(I * (N/m²ⁱ)² * k), a large reduction compared with the original space. Meanwhile, because the total amount of user learning data stays the same, the training data available to each node on the i-th level increases m²ⁱ times, which effectively alleviates the data sparsity problem in the online learning system.
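A quick numeric check of the stated cost expression is sketched below. It takes the expression I * (N/m²ⁱ)² * k at face value and reads i = 0 as the original space, which is an interpretation rather than something the text states; the parameter values are hypothetical.

```python
def level_cost(iterations, n_vertices, n_ants, m, i):
    """Cost expression I * (N / m**(2*i))**2 * k, taken as stated in the text."""
    return iterations * (n_vertices / m ** (2 * i)) ** 2 * n_ants


# Hypothetical sizes: 1024 vertices, classes of m = 2 members on average.
base = level_cost(iterations=100, n_vertices=1024, n_ants=10, m=2, i=0)
coarse = level_cost(iterations=100, n_vertices=1024, n_ants=10, m=2, i=1)
print(base / coarse)
```

Under this reading, moving up one level divides the cost by m⁴, since the vertex count enters the expression squared.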

4 Experiment and Result

This section verifies the effectiveness of the multi-granularity mining algorithm from two aspects: a demonstration of the dynamic mining process and a practical application system. The experimental data were obtained from an online learning system in which user learning behavior is recorded in log files. To record the learning behavior, we added knowledge point information to the W3C extended log file format (ExLF). The format is: “c-ip date time cs-username cs-method cs-uri-stem cs-user-agent sc-knowledgepoint cs-iscorrect sc-status sc-bytes”. For example: “205.12.15.179 [01/Jul/2015:09:10:12] utest1 ‘GET/index/course/?courseid=10379 HTTP/4.0’ M050311 0 200 598”. Table 1 lists the log identifiers and their descriptions:

Table 1. The log format definition based on W3C extended standard
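A record in this format could be parsed along the following lines. The sketch is fitted to the single example line quoted in the text, which fuses the method and URI inside one quoted request and carries no separate user-agent field, so it is not a parser for the full field list. The typographic quotes are replaced by straight quotes, and reading cs-iscorrect as 1 for a correct answer and 0 for a wrong one is an assumption.

```python
import re
from datetime import datetime

# Pattern fitted to the example line; field names are illustrative.
LOG_PATTERN = re.compile(
    r"(?P<ip>\S+) "
    r"\[(?P<timestamp>[^\]]+)\] "
    r"(?P<username>\S+) "
    r"'(?P<request>[^']*)' "
    r"(?P<knowledge_point>\S+) "
    r"(?P<is_correct>[01]) "
    r"(?P<status>\d{3}) "
    r"(?P<bytes>\d+)"
)


def parse_log_line(line):
    """Return a dict of typed fields, or None if the line does not match."""
    match = LOG_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["timestamp"] = datetime.strptime(record["timestamp"], "%d/%b/%Y:%H:%M:%S")
    record["is_correct"] = record["is_correct"] == "1"  # assumption: 1 means correct
    record["status"] = int(record["status"])
    record["bytes"] = int(record["bytes"])
    return record


example = (
    "205.12.15.179 [01/Jul/2015:09:10:12] utest1 "
    "'GET/index/course/?courseid=10379 HTTP/4.0' M050311 0 200 598"
)
record = parse_log_line(example)
print(record["username"], record["knowledge_point"], record["is_correct"])
```

The knowledge point identifier and the correctness flag are the two fields the mining step needs; the remaining fields are kept only so that the record stays complete.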


4.1 Dynamic Change Process Experiment of User Learning Preference

Dynamic mining is one of the advantages of the ACO-LPM algorithm. We took part of the log information to run a simulation test of changes in user interest. After pre-processing the log records, the knowledge points involved in one learning action, combined with whether the answer was right or wrong (F: wrong, T: right), form one record; for example, “K01F” means that the answer on knowledge point K01 was wrong. Part of the learning behavior data is selected as follows (Table 2):

Table 2. The data of user learning behavior


Assuming that the minimum support is 2, a traditional mining algorithm that takes the correctness of the answers into consideration obtains the frequent itemsets {{K01K02: 3}, {K03K07: 2}} (for convenience, we call itemset {K01K02} itemset1 and itemset {K03K07} itemset2). However, a careful analysis of the data flow reveals two facts. First, in the long run the user has a preference for knowledge points K01K02: itemset1 occurs 3 times, with wrong and correct answers in a ratio of 2:1. Second, the user’s learning has changed recently: itemset1 did not appear in the last 5 learning actions, but itemset2 appeared 2 times, indicating that the user has recently been concerned with itemset2 (comprehensive practice of knowledge points K03 and K07). To show the changes of the frequent itemsets more clearly and intuitively, we adjusted the values of the pheromone volatilization coefficient ρ and the concentration increment coefficient δ; the ACO parameters are set to ρ = 0.15, δ = 0.3, α = 1, β = 0.5. The preference function curve of each item is shown in Fig. 3. At the beginning of the training, the preference function value of K01K02 increases exponentially because of the user’s repeated errors on it, while the preference function values of the other knowledge points are zero. In the second half of the training, the preference value of K01K02 gradually decreases because it is no longer accessed, whereas the preference value of K03K07 gradually increases with access and eventually exceeds that of K01K02. It can be concluded that the algorithm dynamically captures the user’s current preference when that preference changes.
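The qualitative shape of this experiment can be reproduced with a toy simulation. The sketch below is not the paper's experiment: the access sequence is invented because the data of Table 2 are not available here, only the pheromone of formulas (2) and (3) is tracked rather than the full preference function, every access is given the feedback value 1, and a step without access is assumed to add no increment. Only ρ = 0.15 and δ = 0.3 are taken from the text.

```python
RHO, DELTA = 0.15, 0.3  # values reported for this experiment


def step(tau, t, feedback=None):
    """One update of formula (2); the increment of formula (3) applies only on access."""
    deposit = 0.0
    if feedback is not None:
        deposit = (tau * t + feedback) / (t - 1) - tau
    return (1.0 - RHO) * tau + DELTA * deposit


# Hypothetical access sequence: itemset1 three times, then itemset2 seven times.
sequence = ["itemset1"] * 3 + ["itemset2"] * 7

tau = {"itemset1": 0.0, "itemset2": 0.0}
history = []
for t, accessed in enumerate(sequence, start=2):  # start at 2: formula (3) divides by t - 1
    tau = {
        item: step(value, t, feedback=1.0 if item == accessed else None)
        for item, value in tau.items()
    }
    history.append(dict(tau))

for t, snapshot in enumerate(history, start=2):
    print(t, {name: round(value, 3) for name, value in snapshot.items()})
```

In this toy run the pheromone of itemset1 rises during the first three steps and then decays, while that of itemset2 starts at zero and overtakes it near the end of the sequence, mirroring the crossover described for Fig. 3.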

4.2 Experiment Data Analysis of Practical Application System

To verify the effectiveness of the method, we use the learning resources and system logs of the online education system of Anhui Education Publishing Network Company (http://www.timeep.com/cms/index.html), in operation online in 2015. We extracted the user learning logs of junior high school mathematics, physics and chemistry, a total of 12,000 log records, and stored them in three data sets: Math, Physics and Chemistry.

After preprocessing the data, we tested and compared the accuracy of three algorithms: two current mainstream mining algorithms, the BP neural network (referred to as BP-NN) and the FP-growth algorithm for mining frequent items (referred to as FP-growth), and the multi-granularity learning preference mining algorithm ACO-LPM proposed in this paper. To unify the format, ACO-LPM mines only the first three granularity levels. The ACO parameters are set to ρ = 0.05, δ = 0.5, α = 1, β = 0.5. The accuracy of each learning preference mining algorithm is shown in Table 3:

Table 3 shows that the ACO-LPM mining algorithm is superior overall to the BP-NN and FP-growth algorithms, and that mining accuracy increases as the level rises. This is not accidental; it follows from the hierarchical structure of user interest: the higher the level, the more content the user interest model covers and the vaguer the concept, so the higher the hit rate.

Table 3. Accuracy comparison of the ACO-LPM with other algorithms


5 Conclusion

In an online learning system, mining learning preferences on knowledge points is a great challenge because of the massive number of knowledge points, the small amount of test data per user, and changes in users’ learning preferences. Based on the multi-granularity feature of knowledge points, this paper defines a quotient space structure of the knowledge points. On this basis, ACO is introduced into the quotient space, and a multi-granularity dynamic mining algorithm is proposed that exploits the dynamic volatility of pheromone. Experiments show the effectiveness of the method.

References

1. Lo, J.-J., Shu, P.-C.: Identifying learning styles by observing learners’ browsing behavior. Br. J. Educ. Technol. (2008, in press)
2. Jin, D., Qinghua, Z.: The research of mining association rules between personality and behavior of learner under web-based learning environment. In: The 4th International Conference on Web-Based Learning (ICWL 2005), Hong Kong, China, 31 July–3 August 2005
3. Baishuang, Q.: Research on User Model in Adaptive Learning System Based on Semantic Web. Northeast Normal University, Changchun (2008). (in Chinese)
4. Hongfei, L., Yuansheng, Y.: The user table and update mechanism. J. Comput. Res. Dev. 39(7), 844–847 (2002). (in Chinese)
5. Shukui, Y.: Design and Implementation of User Interest Extraction System for Mobile Network News. Beijing University of Posts and Telecommunications, Beijing (2012). (in Chinese)
6. Ziting, R.: Research on Personalized Recommendation User Interest Modeling in Mobile Environment. University of Electronic Science and Technology of China, Chengdu (2015). (in Chinese)
7. Hongming, W.: Design and Implementation of User Preference Extraction System Based on Ontology and Tag. Beijing University of Posts and Telecommunications, Beijing (2011). (in Chinese)
8. Woemd, W., Schueiler, C., Wojtech, R.: A hybrid recommender system for context-aware recommendations of mobile applications. In: Proceedings of the International Conference on Data Engineering, Workshops in Conjunction with the International Conference on Data Engineering, ICDE 2007, pp. 871–878 (2007)
9. Shuran, C.: Research on Personalized Service Oriented User Interest Modeling and Application. Chongqing University (2007). (in Chinese)
10. Pazzani, M., Billsus, D.: Learning and revising user profiles: the identification of interesting web sites. Mach. Learn. 27, 313–331 (1997)
11. Adomavicius, G., Tuzhilin, A.: Using data mining methods to build customer profiles. IEEE Comput. 34, 74–82 (2001)
12. Zhang, L., Zhang, B.: Dynamic quotient space model and its basic properties. Pattern Recogn. Artif. Intell. 25(2), 181–185 (2012). (in Chinese)
13. Wang, G.Y., Zhang, Q.H., Jun, H.U.: An overview of granular computing. CAAI Trans. Intell. Syst. 2(6), 8–26 (2007). (in Chinese)
14. Chen, S., Lv, G., Wang, X.: Offensive strategy in the 2D soccer simulation league using multi-group ant colony optimization. Int. J. Adv. Rob. Syst. 13, 1 (2016)


Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant No. 61672204), Natural Science Foundations of Higher Education Institutions of Anhui Province (Grant Nos. KJ2012B149, KJ2013A226, KJ2015A229, KJ2015A257), Key Projects of Domestic Visiting and Training for Middle-aged and Young Scholar in Anhui Province (Grant No. gxfxZD2016211).

Author information

Authors and Affiliations

Anhui USTC-GZ Information Technology Co., Ltd., Hefei, 230031, China
Shengjun Liu
Key Lab of Network and Intelligent Information Processing, Department of Computer Science and Technology, Hefei University, Hefei, 230601, China
Shengbing Chen
HEFEI City Cloud Data Center Co., Ltd., Hefei, 230094, China
Hu Meng


Corresponding author

Correspondence to Shengjun Liu.

Editor information

Editors and Affiliations

Chinese Academy of Sciences, Beijing, China
Zhongzhi Shi
Machine Intelligence Research Institute, Rockville, Maryland, USA
Ben Goertzel
Shanghai Maritime University, Shanghai, China
Jiali Feng


About this paper

Cite this paper

Liu, S., Chen, S., Meng, H. (2017). A Dynamic Mining Algorithm for Multi-granularity User’s Learning Preference Based on Ant Colony Optimization. In: Shi, Z., Goertzel, B., Feng, J. (eds) Intelligence Science I. ICIS 2017. IFIP Advances in Information and Communication Technology, vol 510. Springer, Cham. https://doi.org/10.1007/978-3-319-68121-4_14


DOI: https://doi.org/10.1007/978-3-319-68121-4_14
Published: 27 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68120-7
Online ISBN: 978-3-319-68121-4
eBook Packages: Computer Science, Computer Science (R0)
