Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades

Abstract

Use of machine learning techniques for educational proposes (or educational data mining) is an emerging field aimed at developing methods of exploring data from computational educational settings and discovering meaningful patterns. The stored data (virtual courses, e-learning log file, demographic and academic data of students, admissions/registration info, and so on) can be useful for machine learning algorithms. In this article, we cite the most current articles that use machine learning techniques for educational proposes and we present a case study for predicting students’ marks. Students’ key demographic characteristics and their marks in a small number of written assignments can constitute the training set for a regression method in order to predict the student’s performance. Finally, a prototype version of software support tool for tutors has been constructed.

This is a preview of subscription content, access via your institution.

References

  1. Amershi S, Conati C (2009) Combining unsupervised and supervised classification to build user models for exploratory learning environments. J Educ Data Min 1(1): 18–71

    Google Scholar 

  2. Anaya AR, Boticario JG (2011) Application of machine learning techniques to analyse student interactions and improve the collaboration process. Expert Syst Appl 38: 1171–1181

    Article  Google Scholar 

  3. Anozie N, Junker BW (2006) Predicting end-of-year accountability assessment scores from monthly student records in an online tutoring system. In Educational data mining AAAI workshop, California, USA, pp 1–6

  4. Atkeson CG, Moore AW, Schaal S (1997) Locally weighted learning. Artificial Intell Rev 11(1–5): 11–73

    Article  Google Scholar 

  5. Baker RSJD, Yacef K (2009) The state of educational data mining in 2009: a review and future visions. J Educ Data Min 1(1): 3–17

    Google Scholar 

  6. Brusilovsky P, Millán E (2007) User models for adaptive hypermedia and adaptive educational systems. In The adaptive web. LNCS 4321, Springer, Berlin, pp 3–53

  7. Buldu A, Üçgün K (2010) Data mining application on students’ data. Procedia Soc Behav Sci 2: 5251–5259

    Article  Google Scholar 

  8. Campbell J (2007) Utilizing student data within the course management system to determine undergraduate student academic success: An exploratory study. Doctoral thesis, Purdue University, Indiana, USA

  9. Chen S-M, Bai S-M (2010) Using data mining techniques to automatically construct concept maps for adaptive learning systems. Expert Syst Appl 37: 4496–4503

    Article  Google Scholar 

  10. Chen CM, Hsieh YL, Hsu SH (2007a) Mining learner profile utilizing association rule for web-based learning diagnosis. Expert Syst Appl 33(1): 6–22

    Article  Google Scholar 

  11. Chen C-M, Chen Y-Y, Liu C-Y (2007b) Learning Performance Assessment Approach Using Web-Based Learning Portfolios for E-learning Systems, IEEE Trans Syst Man Cybern C Appl Rev 37(6) November

  12. Cocea M, Weibelzahl S (2006) Can log files analysis estimate learners’ level of motivation? In: Proceedings of the workshop week Lernen—Wissensentdeckung—Adaptivität, Hildesheim, pp 32–35

  13. Cohen A, Nachmias R (2010) What can instructors and policy makers learn about Web-supported learning through Web-usage mining, Internet and Higher Education. doi:10.1016/j.iheduc.2010.07.008

  14. Cummins D, Yacef K, Koprinska I (2006) A sequence based recommender system for learning resources. Aust J Intell Inform process Syst 9: 49–56

    Google Scholar 

  15. Delen D (2010) A comparative analysis of machine learning techniques for student retention management. Decis Support Syst 49: 498–506

    Article  Google Scholar 

  16. El-Alfy EM, Abdel-Aal RE (2008) Construction and analysis of educational tests using abductive machine learning. Comput Educ 51: 1–16

    Article  Google Scholar 

  17. Frias-Martinez E, Chen SY, Liu X (2006) Survey of data mining approaches to user modeling for adaptive hypermedia. IEEE Trans Syst Man Cybern C Appl Rev 36(2): 734–748

    Article  Google Scholar 

  18. García E, Romero C, Ventura S, de Castro C (2010) A collaborative educational association rule mining tool, A collaborative educational association rule mining tool, Internet and Higher Education. doi:10.1016/j.iheduc.2010.07.006

  19. Guo Q, Zhang M (2009) Implement web learning environment based on data mining. Knowl-Based Syst 22: 439–442

    Article  Google Scholar 

  20. Hamalainen W, Vinni M (2006) Comparison of machine learning methods for intelligent tutoring systems. In: Proceedings of the eighth international conference in intelligent tutoring systems, Taiwan, pp 525–534

  21. Hammouda K, Kamel M (2006) Data mining in e-learning. In: Pierre S (ed), E-learning networked environments and architectures: a knowledge processing perspective, Springer Book Series: Advanced information and knowledge processing, pp 1–28

  22. Hershkovitz A, Nachmias R (2009) Learning about online learning processes and students’ motivation through web usage mining. Interdiscip J E-Learning and Learning Objects, 5,197–215, Special series of Chais Conference 2009 best papers

  23. Hsia T-C, Shie A-J, Chen L-C (2008) Course planning of extension education to meet market demand by using data mining techniques—an example of Chinkuo technology university in Taiwan. Expert Syst Appl 34: 596–602

    Article  Google Scholar 

  24. Hsu MH (2008) A personalized English learning recommender system for ESL students. Expert Syst Appl 34(1): 683–688

    Article  Google Scholar 

  25. Jantan H , Hamdan AR, Othman ZA (2010) Classification and prediction of academic talent using data mining techniques, KES 2010, Part I, LNAI 6276, pp 491–500

  26. Kock M, Paramythis A (2011) Activity sequence modeling and dynamic clustering for personalized e-learning, User Model User-Adap Inter. doi:10.1007/s11257-010-9087-z

  27. Kotsiantis S, Pierrakeas C, Pintelas P (2004) Predicting Students’ Performance in distance learning using machine learning techniques. Appl Artif Intell (AAI) 18(5): 411–426

    Article  Google Scholar 

  28. Lee MW, Chen SY, Chrysostomou K, Liu X (2009) Mining students’ behavior in web-based learning programs. Expert Syst Appl 36: 3459–3464

    Article  Google Scholar 

  29. Lin WT, Wang SJ, Wub YC, Ye TC (2011) An empirical analysis on auto corporation training program planning by data mining techniques. Expert Syst Appl 38: 5841–5850

    Article  Google Scholar 

  30. Lykourentzou I, Giannoukos I, Nikolopoulos V, Mpardis G, Loumos V (2009) Dropout prediction in e-learning courses through the combination of machine learning techniques. Comput and Educ 53: 950–965

    Article  Google Scholar 

  31. Macfadyen LP, Dawson S (2010) Mining LMS data to develop an “early warning system” for educators: a proof of concept. Comput Educ 54: 588–599

    Article  Google Scholar 

  32. Malerba D, Esposito F, Ceci M (2004) Top–down induction of model trees with regression and splitting nodes. IEEE Trans Pattern Anal Mach Intell 26(5): 612–625

    Article  Google Scholar 

  33. Moseley LG, Mead DM (2008) Predicting who will drop out of nursing courses: a machine learning exercise. Nurse Educ Today 28: 469–475

    Article  Google Scholar 

  34. Paliwala M, Kumar UA (2009) Neural networks and statistical techniques: a review of applications. Expert Syst Appl 36(1): 2–17

    Article  Google Scholar 

  35. Perera D, Kay J, Koprinska I, Yacef K, Zaïne OR (2009) Clustering and sequential pattern mining of online collaborative learning data. IEEE Trans Knowl Data Eng 21(6): 759–772

    Article  Google Scholar 

  36. Platt J (1999) Using sparseness and analytic QP to speed training of support vector machines. In: Kearns MS, Solla SA, Cohn DA (eds) Advances in neural information processing systems 11. MIT Press, MA

    Google Scholar 

  37. Robnik-Šikonja M, Kononenko I (2003) Theoretical and empirical analysis of reliefF and RReliefF. Mach Learn 53(1–2): 23–69

    MATH  Article  Google Scholar 

  38. Romero C, Ventura S (2007) Educational data mining: a survey from 1995 to 2005. Expert Syst Appl 33(1): 135–146

    Article  Google Scholar 

  39. Romero C, Ventura S (2010) Educational data mining: a review of the state-of-the-art. IEEE Trans Syst Man Cybernet C Appl Rev 40(6): 601–618

    Article  Google Scholar 

  40. Romero C, Ventura S, García E (2008) Data mining in course management systems: moodle case study and tutorial. Comput Educ 51(1): 368–384

    Article  Google Scholar 

  41. Romero C, Ventura S, Zafra A, de Bra P (2009) Applying Web usage mining for personalizing hyperlinks in Web-based adaptive educational systems. Comput Educ 53: 828–840

    Article  Google Scholar 

  42. Sevindik T, Demirkeser N, Cömert Z (2010) Virtual education environments and web mining. Procedia Soc Behav Sci 2: 5120–5124

    Article  Google Scholar 

  43. Shevade S, Keerthi S, Bhattacharyya C, Murthy K (2000) Improvements to the SMO algorithm for SVM regression. IEEE Trans Neural Netw 11(5): 1183–1188

    Article  Google Scholar 

  44. Soller A (2007) Adaptive support for distributed collaboration. In: Brusilovsky P, Kobsa A, Nejdl W (eds) The AdaptiveWeb, vol 4321 of Lecture Notes in Computer Science. Springer, Berlin, pp 573–595

    Google Scholar 

  45. Tseng SS, Sue PC, Su JM, Weng JF, Tsai WN (2007) A new approach for constructing the concept map. Comput and Educ 49(3): 691–707

    Article  Google Scholar 

  46. Vialardi Sacín C, Bravo Agapito J, Shafti L, Ortigosa A (2009) Recommendation in higher education using data mining techniques. 2nd international conference of educational data mining 2009, Spain, 1–3 July, 2009, 190–199

  47. Wang Y-h, Tseng M-H, Liao H-C (2009) Data mining for adaptive learning sequence in english language instruction. Expert Syst Appl 36: 7681–7686

    Article  Google Scholar 

  48. Wang Y, Witten IH (1997) Induction of model trees for predicting continuous classes, In: Proceedings of the poster papers of the european conference on ML, Prague . Prague: university of economics, Faculty of informatics and statistics, pp 128–137

  49. Weisberg S (2005) Appl Linear Regres, 3rd Edition, ISBN: 978-0-471-66379-9

  50. Weng C-H (2011) Mining fuzzy specific rare itemsets for education data, Knowl-Based Syst. doi:10.1016/j.knosys.2011.02.010

  51. Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques (Third Edition), Morgan Kaufmann, January, ISBN 978-0-12-374856-0

  52. Xenos M, Pierrakeas C, Pintelas P (2002) A survey on student dropout rates and dropout causes concerning the students in the course of informatics of the Hellenic Open University. Comput Educ 39: 361–377

    Article  Google Scholar 

  53. Yudelson MV, Medvedeva O, Legowski E, Castine M, Jukic D, Rebecca C (2006) Mining student learning data to develop high level pedagogic strategy in a medical ITS. In: Proceedings of AAAI workshop on educational data mining, Boston, pp 1–8

  54. Zorrilla ME (2009) Data Warehouse Technology for E-Learning. In: Zakrzewska D et al. (eds) Meth and Support Tech for Data Analys, SCI 225, pp 1–20

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to S. B. Kotsiantis.

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Kotsiantis, S.B. Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades. Artif Intell Rev 37, 331–344 (2012). https://doi.org/10.1007/s10462-011-9234-x

Download citation

Keywords

  • Machine learning
  • Educational data mining
  • Decision support tools