Artificial Intelligence Review

, Volume 37, Issue 4, pp 331–344 | Cite as

Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades

  • S. B. KotsiantisEmail author


Use of machine learning techniques for educational proposes (or educational data mining) is an emerging field aimed at developing methods of exploring data from computational educational settings and discovering meaningful patterns. The stored data (virtual courses, e-learning log file, demographic and academic data of students, admissions/registration info, and so on) can be useful for machine learning algorithms. In this article, we cite the most current articles that use machine learning techniques for educational proposes and we present a case study for predicting students’ marks. Students’ key demographic characteristics and their marks in a small number of written assignments can constitute the training set for a regression method in order to predict the student’s performance. Finally, a prototype version of software support tool for tutors has been constructed.


Machine learning Educational data mining Decision support tools 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Amershi S, Conati C (2009) Combining unsupervised and supervised classification to build user models for exploratory learning environments. J Educ Data Min 1(1): 18–71Google Scholar
  2. Anaya AR, Boticario JG (2011) Application of machine learning techniques to analyse student interactions and improve the collaboration process. Expert Syst Appl 38: 1171–1181CrossRefGoogle Scholar
  3. Anozie N, Junker BW (2006) Predicting end-of-year accountability assessment scores from monthly student records in an online tutoring system. In Educational data mining AAAI workshop, California, USA, pp 1–6Google Scholar
  4. Atkeson CG, Moore AW, Schaal S (1997) Locally weighted learning. Artificial Intell Rev 11(1–5): 11–73CrossRefGoogle Scholar
  5. Baker RSJD, Yacef K (2009) The state of educational data mining in 2009: a review and future visions. J Educ Data Min 1(1): 3–17Google Scholar
  6. Brusilovsky P, Millán E (2007) User models for adaptive hypermedia and adaptive educational systems. In The adaptive web. LNCS 4321, Springer, Berlin, pp 3–53Google Scholar
  7. Buldu A, Üçgün K (2010) Data mining application on students’ data. Procedia Soc Behav Sci 2: 5251–5259CrossRefGoogle Scholar
  8. Campbell J (2007) Utilizing student data within the course management system to determine undergraduate student academic success: An exploratory study. Doctoral thesis, Purdue University, Indiana, USAGoogle Scholar
  9. Chen S-M, Bai S-M (2010) Using data mining techniques to automatically construct concept maps for adaptive learning systems. Expert Syst Appl 37: 4496–4503CrossRefGoogle Scholar
  10. Chen CM, Hsieh YL, Hsu SH (2007a) Mining learner profile utilizing association rule for web-based learning diagnosis. Expert Syst Appl 33(1): 6–22CrossRefGoogle Scholar
  11. Chen C-M, Chen Y-Y, Liu C-Y (2007b) Learning Performance Assessment Approach Using Web-Based Learning Portfolios for E-learning Systems, IEEE Trans Syst Man Cybern C Appl Rev 37(6) NovemberGoogle Scholar
  12. Cocea M, Weibelzahl S (2006) Can log files analysis estimate learners’ level of motivation? In: Proceedings of the workshop week Lernen—Wissensentdeckung—Adaptivität, Hildesheim, pp 32–35Google Scholar
  13. Cohen A, Nachmias R (2010) What can instructors and policy makers learn about Web-supported learning through Web-usage mining, Internet and Higher Education. doi: 10.1016/j.iheduc.2010.07.008
  14. Cummins D, Yacef K, Koprinska I (2006) A sequence based recommender system for learning resources. Aust J Intell Inform process Syst 9: 49–56Google Scholar
  15. Delen D (2010) A comparative analysis of machine learning techniques for student retention management. Decis Support Syst 49: 498–506CrossRefGoogle Scholar
  16. El-Alfy EM, Abdel-Aal RE (2008) Construction and analysis of educational tests using abductive machine learning. Comput Educ 51: 1–16CrossRefGoogle Scholar
  17. Frias-Martinez E, Chen SY, Liu X (2006) Survey of data mining approaches to user modeling for adaptive hypermedia. IEEE Trans Syst Man Cybern C Appl Rev 36(2): 734–748CrossRefGoogle Scholar
  18. García E, Romero C, Ventura S, de Castro C (2010) A collaborative educational association rule mining tool, A collaborative educational association rule mining tool, Internet and Higher Education. doi: 10.1016/j.iheduc.2010.07.006
  19. Guo Q, Zhang M (2009) Implement web learning environment based on data mining. Knowl-Based Syst 22: 439–442CrossRefGoogle Scholar
  20. Hamalainen W, Vinni M (2006) Comparison of machine learning methods for intelligent tutoring systems. In: Proceedings of the eighth international conference in intelligent tutoring systems, Taiwan, pp 525–534Google Scholar
  21. Hammouda K, Kamel M (2006) Data mining in e-learning. In: Pierre S (ed), E-learning networked environments and architectures: a knowledge processing perspective, Springer Book Series: Advanced information and knowledge processing, pp 1–28Google Scholar
  22. Hershkovitz A, Nachmias R (2009) Learning about online learning processes and students’ motivation through web usage mining. Interdiscip J E-Learning and Learning Objects, 5,197–215, Special series of Chais Conference 2009 best papersGoogle Scholar
  23. Hsia T-C, Shie A-J, Chen L-C (2008) Course planning of extension education to meet market demand by using data mining techniques—an example of Chinkuo technology university in Taiwan. Expert Syst Appl 34: 596–602CrossRefGoogle Scholar
  24. Hsu MH (2008) A personalized English learning recommender system for ESL students. Expert Syst Appl 34(1): 683–688CrossRefGoogle Scholar
  25. Jantan H , Hamdan AR, Othman ZA (2010) Classification and prediction of academic talent using data mining techniques, KES 2010, Part I, LNAI 6276, pp 491–500Google Scholar
  26. Kock M, Paramythis A (2011) Activity sequence modeling and dynamic clustering for personalized e-learning, User Model User-Adap Inter. doi: 10.1007/s11257-010-9087-z
  27. Kotsiantis S, Pierrakeas C, Pintelas P (2004) Predicting Students’ Performance in distance learning using machine learning techniques. Appl Artif Intell (AAI) 18(5): 411–426CrossRefGoogle Scholar
  28. Lee MW, Chen SY, Chrysostomou K, Liu X (2009) Mining students’ behavior in web-based learning programs. Expert Syst Appl 36: 3459–3464CrossRefGoogle Scholar
  29. Lin WT, Wang SJ, Wub YC, Ye TC (2011) An empirical analysis on auto corporation training program planning by data mining techniques. Expert Syst Appl 38: 5841–5850CrossRefGoogle Scholar
  30. Lykourentzou I, Giannoukos I, Nikolopoulos V, Mpardis G, Loumos V (2009) Dropout prediction in e-learning courses through the combination of machine learning techniques. Comput and Educ 53: 950–965CrossRefGoogle Scholar
  31. Macfadyen LP, Dawson S (2010) Mining LMS data to develop an “early warning system” for educators: a proof of concept. Comput Educ 54: 588–599CrossRefGoogle Scholar
  32. Malerba D, Esposito F, Ceci M (2004) Top–down induction of model trees with regression and splitting nodes. IEEE Trans Pattern Anal Mach Intell 26(5): 612–625CrossRefGoogle Scholar
  33. Moseley LG, Mead DM (2008) Predicting who will drop out of nursing courses: a machine learning exercise. Nurse Educ Today 28: 469–475CrossRefGoogle Scholar
  34. Paliwala M, Kumar UA (2009) Neural networks and statistical techniques: a review of applications. Expert Syst Appl 36(1): 2–17CrossRefGoogle Scholar
  35. Perera D, Kay J, Koprinska I, Yacef K, Zaïne OR (2009) Clustering and sequential pattern mining of online collaborative learning data. IEEE Trans Knowl Data Eng 21(6): 759–772CrossRefGoogle Scholar
  36. Platt J (1999) Using sparseness and analytic QP to speed training of support vector machines. In: Kearns MS, Solla SA, Cohn DA (eds) Advances in neural information processing systems 11. MIT Press, MAGoogle Scholar
  37. Robnik-Šikonja M, Kononenko I (2003) Theoretical and empirical analysis of reliefF and RReliefF. Mach Learn 53(1–2): 23–69zbMATHCrossRefGoogle Scholar
  38. Romero C, Ventura S (2007) Educational data mining: a survey from 1995 to 2005. Expert Syst Appl 33(1): 135–146CrossRefGoogle Scholar
  39. Romero C, Ventura S (2010) Educational data mining: a review of the state-of-the-art. IEEE Trans Syst Man Cybernet C Appl Rev 40(6): 601–618CrossRefGoogle Scholar
  40. Romero C, Ventura S, García E (2008) Data mining in course management systems: moodle case study and tutorial. Comput Educ 51(1): 368–384CrossRefGoogle Scholar
  41. Romero C, Ventura S, Zafra A, de Bra P (2009) Applying Web usage mining for personalizing hyperlinks in Web-based adaptive educational systems. Comput Educ 53: 828–840CrossRefGoogle Scholar
  42. Sevindik T, Demirkeser N, Cömert Z (2010) Virtual education environments and web mining. Procedia Soc Behav Sci 2: 5120–5124CrossRefGoogle Scholar
  43. Shevade S, Keerthi S, Bhattacharyya C, Murthy K (2000) Improvements to the SMO algorithm for SVM regression. IEEE Trans Neural Netw 11(5): 1183–1188CrossRefGoogle Scholar
  44. Soller A (2007) Adaptive support for distributed collaboration. In: Brusilovsky P, Kobsa A, Nejdl W (eds) The AdaptiveWeb, vol 4321 of Lecture Notes in Computer Science. Springer, Berlin, pp 573–595Google Scholar
  45. Tseng SS, Sue PC, Su JM, Weng JF, Tsai WN (2007) A new approach for constructing the concept map. Comput and Educ 49(3): 691–707CrossRefGoogle Scholar
  46. Vialardi Sacín C, Bravo Agapito J, Shafti L, Ortigosa A (2009) Recommendation in higher education using data mining techniques. 2nd international conference of educational data mining 2009, Spain, 1–3 July, 2009, 190–199Google Scholar
  47. Wang Y-h, Tseng M-H, Liao H-C (2009) Data mining for adaptive learning sequence in english language instruction. Expert Syst Appl 36: 7681–7686CrossRefGoogle Scholar
  48. Wang Y, Witten IH (1997) Induction of model trees for predicting continuous classes, In: Proceedings of the poster papers of the european conference on ML, Prague . Prague: university of economics, Faculty of informatics and statistics, pp 128–137Google Scholar
  49. Weisberg S (2005) Appl Linear Regres, 3rd Edition, ISBN: 978-0-471-66379-9Google Scholar
  50. Weng C-H (2011) Mining fuzzy specific rare itemsets for education data, Knowl-Based Syst. doi: 10.1016/j.knosys.2011.02.010
  51. Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques (Third Edition), Morgan Kaufmann, January, ISBN 978-0-12-374856-0Google Scholar
  52. Xenos M, Pierrakeas C, Pintelas P (2002) A survey on student dropout rates and dropout causes concerning the students in the course of informatics of the Hellenic Open University. Comput Educ 39: 361–377CrossRefGoogle Scholar
  53. Yudelson MV, Medvedeva O, Legowski E, Castine M, Jukic D, Rebecca C (2006) Mining student learning data to develop high level pedagogic strategy in a medical ITS. In: Proceedings of AAAI workshop on educational data mining, Boston, pp 1–8Google Scholar
  54. Zorrilla ME (2009) Data Warehouse Technology for E-Learning. In: Zakrzewska D et al. (eds) Meth and Support Tech for Data Analys, SCI 225, pp 1–20Google Scholar

Copyright information

© Springer Science+Business Media B.V. 2011

Authors and Affiliations

  1. 1.Educational Software Development Laboratory, Department of MathematicsUniversity of PatrasPatrasGreece

Personalised recommendations