Abstract
Virtual Learning Environments (VLEs) are web platforms where educational content is delivered, along with tools to support individual study. Logs that record how students interact with the platform are collected daily, so automated methods can be used to extract useful knowledge from these data. All stakeholders involved in the learning activities of the VLEs, especially students and teachers, can benefit from the insights derived from the educational data and valuable information can be extracted using machine learning algorithms. Usually, educational data are examined as stationary data using conventional batch methods. However, these data are non-stationary by nature and could be better treated as data streams. This paper reports the results of a classification study in which Random Forests, applied in both batch and adaptive mode, are used to build a model for predicting student exam failure/success. In addition, an analysis of the most important features is performed to detect the most discriminating attributes related to the student’s result. Experiments conducted on a subset of the Open University Learning Analytics (OULAD) dataset demonstrate the reliability of the adaptive version of Random Forest in accurately classifying the evolving educational data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Open University website: http://www.open.ac.uk.
- 2.
Freely available data from Open University: https://analyse.kmi.open.ac.uk/open_dataset#data.
- 3.
Student oriented subset of the Open University Learning Analytics dataset: https://zenodo.org/record/4264397#.X60DEkJKj8E.
References
Al-Shabandar, R., Hussain, A.J., Liatsis, P., Keight, R.: Detecting at-risk students with early interventions using machine learning techniques. IEEE Access 7, 149464–149478 (2019)
Aljohani, T., Pereira, F.D., Cristea, A.I., Oliveira, E.: Prediction of users’ professional profile in MOOCs Only by utilising learners’ written texts. In: Kumar, V., Troussas, C. (eds.) ITS 2020. LNCS, vol. 12149, pp. 163–173. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49663-0_20
Alonso, J.M., Casalino, G.: Explainable artificial intelligence for human-centric data analysis in virtual learning environments. In: Burgos, D., et al. (eds.) HELMeTO 2019. CCIS, vol. 1091, pp. 125–138. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-31284-8_10
Ardimento, P., Bernardi, M.L., Cimitile, M.: Software analytics to support students in object-oriented programming tasks: an empirical study. IEEE Access 8, 132171–132187 (2020)
Arrigo, M., et al.: HeARt mobile learning. In: 10th Annual International Conference on Education and New Learning Technologies (EDULEARN 2018), pp. 10899–10905. IATED Academy (2018)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Casalino, G., Castellano, G., Mannavola, A., Vessio, G.: Educational stream data analysis: a case study. In: IEEE Mediterranean Eletrotechnical Conference (MELECON 2020) (2020)
Casalino, G., Castellano, G., Mencar, C.: Incremental and adaptive fuzzy clustering for virtual learning environments data analysis. In: 23rd International Conference on Information Visualisation, pp. 382–387. IEEE (2019)
Casalino, G., Castellano, G., Vessio, G.: Student oriented subset of the Open University Learning Analytics dataset (2020). https://doi.org/10.5281/zenodo.4264397
Casalino, G., Castiello, C., Del Buono, N., Esposito, F., Mencar, C.: Q-matrix extraction from real response data using nonnegative matrix factorizations. In: Gervasi, O., et al. (eds.) ICCSA 2017. LNCS, vol. 10404, pp. 203–216. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-62392-4_15
Castellano, G., Fanelli, A., Roselli, T.: Mining categories of learners by a competitive neural network. In: IJCNN 2001. International Joint Conference on Neural Networks. Proceedings (Cat. No. 01CH37222), vol. 2, pp. 945–950. IEEE (2001)
Coussement, K., Phan, M., De Caigny, A., Benoit, D.F., Raes, A.: Predicting student dropout in subscription-based online learning environments: the beneficial impact of the logit leaf model. Decis. Supp. Syst. 135, 113325 (2020)
De Carolis, B., D’Errico, F., Macchiarulo, N., Palestra, G.: Engaged faces: measuring and monitoring student engagement from face and gaze behavior. In: IEEE/WIC/ACM International Conference on Web Intelligence-Companion Volume, pp. 80–85 (2019)
Diaz, M., Ferrer, M.A., Impedovo, D., Pirlo, G., Vessio, G.: Dynamically enhanced static handwriting representation for Parkinson’s disease detection. Pattern Recogn. Lett. 128, 204–210 (2019)
Ducange, P., Pecori, R., Sarti, L., Vecchio, M.: Educational big data mining: how to enhance virtual learning environments. In: Graña, M., López-Guede, J.M., Etxaniz, O., Herrero, Á., Quintián, H., Corchado, E. (eds.) SOCO/CISIS/ICEUTE -2016. AISC, vol. 527, pp. 681–690. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-47364-2_66
Fischer, C., et al.: Mining big data in education: affordances and challenges. Rev. Res. Educ. 44(1), 130–160 (2020)
Gomes, H.M., et al.: Adaptive random forests for evolving data stream classification. Mach. Learn. 106(9–10), 1469–1495 (2017)
Guest, W., Wild, F., Di Mitri, D., Klemke, R., Karjalainen, J., Helin, K.: Architecture and design patterns for distributed, scalable augmented reality and wearable technology systems. In: IEEE International Conference on Engineering, Technology and Education, IEEE TALE 2019: Creative & Innovative Education to Enhance the Quality of Life (2019)
Impedovo, D., Pirlo, G., Vessio, G., Angelillo, M.T.: A handwriting-based protocol for assessing neurodegenerative dementia. Cogn. Comput. 11(4), 576–586 (2019). https://doi.org/10.1007/s12559-019-09642-2
Kuzilek, J., Hlosta, M., Zdrahal, Z.: Open university learning analytics dataset. Sci. Data 4, 170171 (2017)
Leite, D., Škrjanc, I., Gomide, F.: An overview on evolving systems and learning from stream data. Evol. Syst. 11, 1–18 (2020). https://doi.org/10.1007/s12530-020-09334-5
Maggipinto, T., et al.: DTI measurements for Alzheimer’s classification. Phys. Med. Biol. 62(6), 2361 (2017)
Nakayama, M., Sciarrone, F., Uto, M., Temperini, M.: Estimating student’s performance based on item response theory in a MOOC environment with peer assessment. In: Kubincová, Z., Lancia, L., Popescu, E., Nakayama, M., Scarano, V., Gil, A.B. (eds.) MIS4TEL 2020. AISC, vol. 1236, pp. 25–35. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-52287-2_3
Pecori, R.: A virtual learning architecture enhanced by fog computing and big data streams. Future Internet 10(1), 4 (2018)
Picerno, P., Pecori, R., Raviolo, P., Ducange, P.: Smartphones and exergame controllers as byod solutions for the e-tivities of an online sport and exercise sciences university program. In: Burgos, D., et al. (eds.) HELMeTO 2019. CCIS, vol. 1091, pp. 217–227. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-31284-8_17
Qu, S., Li, K., Wu, B., Zhang, S., Wang, Y.: Predicting student achievement based on temporal learning behavior in MOOCs. Appl. Sci. 9(24), 5539 (2019)
Romero, C., Ventura, S.: Educational data mining and learning analytics: an updated survey. Wiley Interdisc. Rev.: Data Min. Knowl. Discov. 10(3), e1355 (2020)
Rooein, D.: Data-driven EDU chatbots. In: Companion Proceedings of the 2019 World Wide Web Conference, pp. 46–49 (2019)
Rossano, V., Lanzilotti, R., Cazzolla, A., Roselli, T.: Augmented reality to support geometry learning. IEEE Access 8, 107772–107780 (2020)
Scalera, M., Gentile, E., Plantamura, P., Dimauro, G.: A systematic mapping study in cloud for educational innovation. Appl. Sci. 10(13), 4531 (2020)
Tripathi, G., Ahad, M.A.: IoT in education: an integration of educator community to promote holistic teaching and learning. In: Nayak, J., Abraham, A., Krishna, B.M., Chandra Sekhar, G.T., Das, A.K. (eds.) Soft Computing in Data Analytics. AISC, vol. 758, pp. 675–683. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-0514-6_64
Vitanza, A., Rossetti, P., Mondada, F., Trianni, V.: Robot swarms as an educational tool: the Thymio’s way. Int. J. Adv. Rob. Syst. 16(1), 1729881418825186 (2019)
Acknowledgements
Gabriella Casalino acknowledges funding from the Italian Ministry of Education, University and Research through the European PON project AIM (Attraction and International Mobility), nr. 1852414, activity 2, line 1. All authors are members of the INdAM GNCS research group.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Casalino, G., Castellano, G., Vessio, G. (2021). Exploiting Time in Adaptive Learning from Educational Data. In: Agrati, L.S., et al. Bridges and Mediation in Higher Distance Education. HELMeTO 2020. Communications in Computer and Information Science, vol 1344. Springer, Cham. https://doi.org/10.1007/978-3-030-67435-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-67435-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67434-2
Online ISBN: 978-3-030-67435-9
eBook Packages: Computer ScienceComputer Science (R0)