Abstract
Today, the main aim of educational institutes is to provide a high level of education to students, as career selection is one of the most important and quite difficult decisions for learners, so it is essential to examine students' capabilities and interests. Higher education institutions frequently face higher dropout rates, low academic achievement, and graduation delays. One potential answer to these issues is to better leverage student data stored in institutional databases and online learning platforms to forecast students' academic achievements early by using artificial intelligence and advanced computer algorithms. Several research projects have been launched with the goal of building systems that can predict student performance. However, a system that can forecast student performance and identify the various factors that directly impact it is required. The purpose of this research work is to create a model that correctly identifies students who are in danger of low performance, as well as to identify the factors that contribute to this phenomenon and suggesting the remedial actions so as to reduce dropout rate and low performance among students. The emphasis of this study is to explore various factors that may affect mental health which lead to low performance or loss of interest in studies. The developed model can accurately identify at-risk students with over 96.5% accuracy using Machine learning techniques. This study focuses extensively on various factors apart from academics, such as personal and family factors and their association with student performance. To increase the accuracy of performance predictions, the model combines explainable Machine learning techniques to outline the factors associated with poor performance and discusses a novel framework that will help to increase the accuracy of prediction of the established prediction system. This assists low-performing students in improving their academic metrics by executing corrective actions that address the issues. The proposed novel framework, with the help of a mapping table, will recommend corrective actions along with visualization using the heatmap technique which may help the students to perform better in exams, increase the institution's effectiveness, and improves any country’s economic growth and stability.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-023-12072-1/MediaObjects/10639_2023_12072_Fig6_HTML.png)
Similar content being viewed by others
Data availability
I declare that data collected by authors for purpose of study are original & data of this study “Stu_perf” is available from correspondence author upon reasonable request.
Abbreviations
- ML:
-
Machine learning
- AI:
-
Artificial Intelligence
- RA:
-
Remedial Actions
- EQ:
-
Equations
- AUC:
-
Area under the curve
- EDM:
-
Educational data mining
- CSV:
-
Comma-separated value
- ITS:
-
Intelligent tutoring systems
- LMS:
-
Learning management systems
- TG/TS:
-
Target grade/Target score
- XGB:
-
EXtreme gradient boosting
- LGB:
-
Light gradient boosting
References
Abu-Oda, G. S., & El-Halees, A. M. (2015). Data mining in higher education : university student dropout case study. International Journal of Data Mining & Knowledge Management Process (IJDKP), 5(1). https://doi.org/10.5121/ijdkp.2015.5102
Ahmed, A. B. E. D., & Elaraby, I. S. (2014). Data mining: A prediction for student’s performance using classification method. World Journal of Computer Application and Technology (CEASE PUBLICATION), 2(2), 43–47. https://doi.org/10.13189/WJCAT.2014.020203
al Karim, M., Masnad, M. M., Ara, M. Y., Rasel, M., & Nandi, D. (2022). A Comprehensive study to investigate student performance in online education during Covid-19. International Journal of Modern Education and Computer Science, 14(3), 1–25. https://doi.org/10.5815/IJMECS.2022.03.01
Alboaneen, D., Almelihi, M., Alsubaie, R., Alghamdi, R., Alshehri, L., & Alharthi, R. (2022). Development of a web-based prediction system for students’ academic performance. Data, 7, 21. https://doi.org/10.3390/data7020021
Albreiki, B. (2022). Framework for automatically suggesting remedial actions to help students at risk based on explainable ML and rule-based models. International Journal of Educational Technology in Higher Education, 19(1), 1–26. https://doi.org/10.1186/S41239-022-00354-6/TABLES/9
Albreiki, B., Habuza, T., Shuqfa, Z., Serhani, M. A., Zaki, N., & Harous, S. (2021). Customized rule-based model to identify at-risk students and propose rational remedial actions. Big Data and Cognitive Computing, 5, 71. https://doi.org/10.3390/bdcc5040071
Alhassan, A., Zafar, B., & Mueen, A. (2020). Predict students’ academic performance based on their assessment grades and online activity data. International Journal of Advanced Computer Science and Applications, 11(4). https://doi.org/10.14569/IJACSA.2020.0110425
Al-Rahmi, W., Aldraiweesh, A., Yahaya, N., Kamin, Y. B., & Zeki, A. M. (2019). Massive open online courses (moocs): Data on higher education. Data in Brief, 22, 118–125.
Altujjar, Y., Altamimi, W., Al-Turaiki, I., & Al-Razgan, M. (2016). Predicting critical courses affecting Students performance: A case study. Procedia Computer Science, 82, 65–71.
Agrawal, R. (2017). Data-Driven Education: Technologies and Directions.
Aydin, B., & Demirer, V. (2022). Are flipped classrooms less stressful and more successful? An experimental study on college students. International Journal of Educational Technology in Higher Education, 19(1), 1–17. https://doi.org/10.1186/S41239-022-00360-8/TABLES/4
Bağrıacık Yılmaz, A., & Karataş, S. (2022). Why do open and distance education students drop out? Views from various stakeholders. International Journal of Educational Technology in Higher Education, 19(1), 1–22. https://doi.org/10.1186/S41239-022-00333-X/TABLES/1
Bañeres, D., Rodríguez-González, M. E., Guerrero-Roldán, A. E., & Cortadas, P. (2023). An early warning system to identify and intervene online dropout learners. International Journal of Educational Technology in Higher Education, 20(1), 1–25. https://doi.org/10.1186/S41239-022-00371-5/TABLES/8
Bengio, Y., Lecun, Y., & Hinton, G. (2021). Deep learning for AI. Communications of the ACM, 64(7), 58–65. https://doi.org/10.1145/3448250
Binh, H. T., & Duy, B. T. (2017). Predicting students' performance based on learning style by using Artificial Neural Networks. 2017 9th International Conference on Knowledge and Systems Engineering (KSE). https://doi.org/10.1109/kse.2017.8119433
Borrella, I., Caballero-Caballero, S., & Ponce-Cueto, E. (2022). Taking action to reduce dropout in MOOCs. Computers and Education, 179. https://doi.org/10.1016/J.COMPEDU.2021.104412
Buenaño-Fernández, D., Gil, D., & Luján-Mora, S. (2019). Application of machine learning in predicting performance for computer engineering students: A case study. Sustainability, 11, 2833. https://doi.org/10.3390/su11102833
Carlos Muñoz-Carpio, J., Jan, Z., & Saavedra, A. (2021). Machine learning for learning personalization to enhance student academic performance.
Chinaveh, M. (2013). The effectiveness of problem-solving on coping skills and psychological adjustment. Procedia - Social and Behavioral Sciences, 84, 4–9. https://doi.org/10.1016/J.SBSPRO.2013.06.499
Costa, E. B., Fonseca, B., Santana, M. A., de Araújo, F. F., & Rego, J. (2017). Evaluating the effectiveness of educational data mining techniques for early prediction of students’ academic failure in introductory programming courses. Comput- Ers in Human Behavior, 73, 247–256.
Crompton, H., & Burke, D. (2023). Artificial intelligence in higher education: The state of the field. International Journal of Educational Technology in Higher Education, 20, 22. https://doi.org/10.1186/s41239-023-00392-8
David Kolo, K., Adepoju, S. A., & KoloAlhassan, J. (2015). A decision tree approach for predicting Students academic performance. International Journal of Education and Management Engineering, 5(5), 12–19. https://doi.org/10.5815/IJEME.2015.05.02
Dekker, I., De Jong, E. M., Schippers, M. C., Bruijn-Smolders, D., Alexiou, A., Giesbers, B., et al. (2020). Optimizing students’ mental health and academic performance: AI-enhanced life crafting. Frontiers in Psychology, 11, 1063.
Despujol, I., Castañeda, L., Marín, V. I., & Turró, C. (2022). Correction: What do we want to know about MOOCs? Results from a machine learning approach to a systematic literature mapping review (International Journal of Educational Technology in Higher Education, (2022), 19, 1, (53), 10.1186/s41239-022-00359-1). International Journal of Educational Technology in Higher Education, 19(1), 1–1. https://doi.org/10.1186/S41239-022-00370-6/METRICS
Engel, O., Zimmer, L. M., Lörz, M., & Mayweg-Paus, E. (2023). Digital studying in times of COVID-19: Teacher- and student-related aspects of learning success in German higher education. International Journal of Educational Technology in Higher Education, 20(1), 1–20. https://doi.org/10.1186/S41239-023-00382-W/FIGURES/2
Evangelista, E. D. (2021). A hybrid machine learning framework for predicting students’ performance in virtual learning environment. International Journal of Emerging Technologies in Learning (IJET), 16(24), 255–272. https://doi.org/10.3991/IJET.V16I24.26151
Fahd, K., Venkatraman, S., Miah, S. J., & Ahmed, K. (2022). Application of machine learning in higher education to assess student academic performance, at-risk, and attrition: A meta-analysis of literature. Education and Information Technologies, 27(3), 3743–3775. https://doi.org/10.1007/S10639-021-10741-7/METRICS
Flanagan, B., Majumdar, R., & Ogata, H. (2022). Early-warning prediction of student performance and engagement in open book assessment by reading behavior analysis. International Journal of Educational Technology in Higher Education, 19(1), 1–23. https://doi.org/10.1186/S41239-022-00348-4/TABLES/11
Goga, M., Kuyoro, S., & Goga, N. (2015). A recommender for improving the student academic performance. Procedia - Social and Behavioral Sciences, 180, 1481–1488. https://doi.org/10.1016/J.SBSPRO.2015.02.296
Guo, Bo, et al. (2015) Predicting students performance in educational data mining. In International symposium on educational technology (ISET). IEEE, 2015.
Gong, B., Nugent, J. P., Guest, W., Parker, W., Chang, P. J., Khosa, F., & Nicolaou, S. (2019). Influence of artificial intelligence on canadian medical students’ preference for radiology specialty: Anational survey study. Academic Radiology, 26(4), 566–577.
Guo, B., Zhang, R., Xu, G., Shi, C., & Yang, L. (2016). Predicting Students Performance in Educational Data Mining. Proceedings - 2015 International Symposium on Educational Technology, ISET 2015, 125–128. https://doi.org/10.1109/ISET.2015.33
Gupta, S. K., Antony, J., Lacher, F., & Douglas, J. (2018). Lean Six Sigma for reducing student dropouts in higher education – an exploratory study. 31(1–2), 178–193. https://doi.org/10.1080/14783363.2017.1422710
Ha, I., & Kim, C. (2014) The Research Trends and the Effectiveness of Smart Learning. International Journal of Distributed Sensor Networks, 10(5). https://doi.org/10.1155/2014/537346
Hawes, D., & Arya, A. (2023). Technology solutions to reduce anxiety and increase cognitive availability in students. IEEE Transactions on Learning Technologies. https://doi.org/10.1109/TLT.2023.3239985
Hernández-Blanco, A., Herrera-Flores, B., Tomás, D., & Navarro-Colorado, B. (2019). A systematic review of deep learning approaches to educational data mining. Complexity, 2019(1), 1–22.
Hew, K. F., Hu, X., Qiao, C., & Tang, Y. (2020). What predicts student satisfaction with MOOCs: Agradient boosting trees supervised machine learning and sentiment analysis approach. Computers & Education, 145, 103724. https://doi.org/10.1016/J.COMPEDU.2019.103724
Iatrellis, O., Savvas, I. K., Fitsilis, P., & Gerogiannis, V. C. (2021). A two-phase machine learning approach for predicting student outcomes. Education and Information Technologies, 26(1), 69–88.
Kaminskienė, L., Järvelä, S., & Lehtinen, E. (2022). How does technology challenge teacher education? International Journal of Educational Technology in Higher Education, 19(1), 1–9. https://doi.org/10.1186/S41239-022-00375-1
Karalar, H., Kapucu, C., & Gürüler, H. (2021). Predicting students at risk of academic failure using ensemble model during pandemic in a distance learning system. International Journal of Educational Technology in Higher Education, 18(1), 1–18. https://doi.org/10.1186/S41239-021-00300-Y/FIGURES/5
Kessler, R. C., van Loo, H. M., Wardenaar, K. J., Bossarte, R. M., Brenner, L. A., Cai, T., Ebert, D. D., Hwang, I., Li, J., de Jonge, P., Nierenberg, A. A., Petukhova, M. V., Rosellini, A. J., Sampson, N. A., Schoevers, R. A., Wilcox, M. A., & Zaslavsky, A. M. (2016). Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Molecular Psychiatry, 21(10), 1366–1371. https://doi.org/10.1038/MP.2015.198
Koprinska, I., Stretton, J., & Yacef, K. (2015). Predicti ng student performance from multiple data sources. In Conati, C., Heffernan, N., Mitrovic, A., & Verdejo, M. (Eds.), Artificial Intelligence in Educati on. AIED 2015. Lecture Notes in Computer Science (vol 9112). Springer, Cham. https://doi.org/10.1007/978-3-319-19773-9_90
Kortemeyer, G., Dittmann-Domenichini, N., Schlienger, C., Spilling, E., Yaroshchuk, A., & Dissertori, G. (2023). Attending lectures in person, hybrid or online—how do students choose, and what about the outcome? International Journal of Educational Technology in Higher Education, 20(1), 1–24. https://doi.org/10.1186/S41239-023-00387-5/FIGURES/9
Kruck, S. E. & Lending, D. (2003). “Predicti ng academic performance in an introductory college-level is course”. Informati on Technology, Learning, and Performance Journal, 21(2), 9–15
Liao, S. N., Zingaro, D., Thai, K., Alvarado, C., Griswold, W. G., & Porter, L. (2019). A robust machine learning technique to predict low-performing students. ACM Transactions on Computing Education, 19(3), 1–19. https://doi.org/10.1145/3277569
Li, Q., Li, Z., & Han, J. (2021). A hybrid learning pedagogy for surmounting the challenges of the COVID-19 pandemic in the performing arts education. Education and Information Technologies, 26(6), 7635–7655.
Lykourentzou, I., Giannoukos, I., Mpardis, G., Nikolopoulos, V., & Loumos, V. (2009). Early and dynamic student achievement prediction in e-learning courses using neural networks. Journal of the American Society for Information Science and Technology, 60(2), 372–380. https://doi.org/10.1002/ASI.20970
Marbouti, F., Diefes-Dux, H. A., & Madhavan, K. (2016). Models for early prediction of at-risk students in a course using standards-based grading. Computers & Education, 103, 1–15.
Martin, F., & Bolliger, D. U. (2022). Developing an online learner satisfaction framework in higher education through a systematic review of research. International Journal of Educational Technology in Higher Education, 19(1), 1–21. https://doi.org/10.1186/S41239-022-00355-5/FIGURES/3
Mishra, L., Gupta, T., & Shree, A. (2020). Online teaching-learning in higher education during lockdown period of COVID-19 pandemic. International journal of educational research open, 1, 100012.
Mousavinasab, E., Zarifsanaiey, N., NiakanKalhori, R. S., Rakhshan, M., Keikha, L., & Ghazi, S. M. (2021). Intelligent tutoring systems: A systematic review of characteristics, applications, and evaluation methods. Interactive Learning Environments, 29(1), 142–163.
Namoun, A., & Alshanqiti, A. (2021). Predicting student performance using data mining and learning analytics techniques: A systematic literature review. Applied Sciences, 11(1), 237.
Nayak, P., Vaheed, S., Gupta, S., & Mohan, N. (2023). Predicting students’ academic performance by mining the educational data through machine learning-based classification model. Education and Information Technologies, 1–27. https://doi.org/10.1007/S10639-023-11706-8/METRICS
Okubo, F., Shimada, A., Yamashita, T., & Ogata, H. (2017). A neural network approach for students’ performance prediction. ACM International Conference Proceeding Series, 598–599. https://doi.org/10.1145/3027385.3029479
Parhizkar, A., Tejeddin, G., & Khatibi, T. (2023). Student performance prediction using datamining classification algorithms: Evaluating generalizability of models from geographical aspect. Education and Information Technologies, 1–19. https://doi.org/10.1007/S10639-022-11560-0/METRICS
Peña-Ayala, A. (2014). Educational data mining: A survey and a data mining-based analysis of recent works. Expert Systems with Applications, 41(4), 1432–1462. https://doi.org/10.1016/J.ESWA.2013.08.042
Prenkaj, B., Velardi, P., Stilo, G., Distante, D., & Faralli, S. (2020). A survey of machine learning approaches for student drop- out prediction in online courses. ACM Computing Surveys (CSUR), 53(3), 1–34.
Purwaningsih, N., & Arief, D. R. (2018). Predicting students’ performance in English class. AIP Conference Proceedings, 1977. https://doi.org/10.1063/1.5042876
Romero, C., & Ventura, S. (2013). Data mining in education. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 3(1), 12–27. https://doi.org/10.1002/WIDM.1075
Ribeiro, J. D., Franklin, J. C., Fox, K. R., Bentley, K. H., Kleiman, E. M., Chang, B. P., & Nock, M. K. (2016). Self-injurious thoughts and behaviors as risk factors for future suicide ideation, attempts, and death: a meta-analysis of longitudinal studies. Psychol Med, 46(2), 225–36. https://doi.org/10.1017/S0033291715001804
Saa, A. A., Al-Emran, M., & Shaalan, K. (2020). Mining student information system records to predict students’ academic performance. Advances in Intelligent Systems and Computing, 921, 229–239. https://doi.org/10.1007/978-3-030-14118-9_23
Schaefer, J. D., Caspi, A., Belsky, D. W., Harrington, H., Houts, R., Horwood, L. J., Hussong, A., Ramrakha, S., Poulton, R., & Moffitt, T. E. (2017). Enduring mental health: Prevalence and prediction. Journal of Abnormal Psychology, 126(2), 212–224. https://doi.org/10.1037/ABN0000232
Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A review on predicting student’s performance using data mining techniques. Procedia Computer Science, 72, 414–422. https://doi.org/10.1016/J.PROCS.2015.12.157
Smets, E., Casale, P., Großekathöfer, U., Lamichhane, B., de Raedt, W., Bogaerts, K., van Diest, I., & van Hoof, C. (2016). Comparison of machine learning techniques for psychophysiological stress detection. Communications in Computer and Information Science, 604, 13–22. https://doi.org/10.1007/978-3-319-32270-4_2
Sripath Roy, K., Roopkanth, K., UdayTeja, V., Bhavana, V., & Priyanka, J. (2018). Student career prediction using advanced machine learning techniques. International Journal of Engineering and Technology, 7(2), 26–29. https://doi.org/10.14419/IJET.V7I2.20.11738
Thathsarani, H., Ariyananda, D. K., Jayakody, C., Manoharan, K., Munasinghe, A. A. S. N., & Rathnayake, N. (2023). How successful the online assessment techniques in distance learning have been, in contributing to academic achievements of management undergraduates? Education and Information Technologies, 1–25. https://doi.org/10.1007/S10639-023-11715-7/TABLES/4
Tomasevic, N., Gvozdenovic, N., & Vranes, S. (2020). An overview and comparison of supervised data mining techniques for student exam performance prediction. Computers & Education, 143, 103676. https://doi.org/10.1016/J.COMPEDU.2019.103676
Waheed, H., Hassan, S. U., Aljohani, N. R., Hardman, J., Alelyani, S., & Nawaz, R. (2020). Predicting academic performance of students from vle big data using deep learning models. Computers in Human Behavior, 104(106), 189.
Wang, M., Yu, H., Bell, Z., & Chu, X. (2022). Constructing an Edu-Metaverse ecosystem: A new and innovative framework. IEEE Transactions on Learning Technologies, 15(6), 685–696. https://doi.org/10.1109/TLT.2022.3210828
Xhafa, V. H. (2021). Perceptions of students for sudden movement from face-to-face teaching to online learning environment: A regional study in conditions affected by the COVID-19 pandemic. European Journal of Education, 4(2), 62–77.
Xiao, H., Hu, W., & Liu, G. P. (2022). Students’ online laboratory work assessment based on learning task lists and behavior data. IEEE Transactions on Learning Technologies. https://doi.org/10.1109/TLT.2022.3213751
Xiao, M., Tian, Z., & Xu, W. (2023). Impact of teacher-student interaction on students’ classroom well-being under online education environment. Education and Information Technologies, 1–23. https://doi.org/10.1007/S10639-023-11681-0/FIGURES/4
Xing, Z., & Qi, Y. (2023). Development of creativity in physical education teachers using interactive technologies: Involvement and collaboration. Education and information technologies, 28(5), 5763–5777.
Xing, W., Guo, R., Petakovic, E., & Goggins, S. (2015). Participation-based student final performance prediction model through interpretable Genetic Programming: Integrating learning analytics, educational data mining and theory. Computers in Human Behavior, 47, 168–181. https://doi.org/10.1016/J.CHB.2014.09.034
Yadav, S. K., Bharadwaj, B., & Pal, S. (2012). Data mining applications: A comparative Study for Predicting Student’s performance. International journal of innovative technology & creative engineering, 1(12), 2045–2711. https://doi.org/10.48550/arxiv.1202.4815
Ye, C., & Biswas, G. (2014). Early prediction of student dropout and performance in MOOCs using higher granularity temporal information. Journal of Learning Analytics, 1(3), 169–172. https://doi.org/10.18608/JLA.2014.13.14
Zaldívar-Colado, A., Aguilar-Calderón, J., Garcia-Sanchez, O., Zurita-Cruz, C., Moncada-Estrada, M., & Bernal-Guadiana, R. (2014). Artificial neural networks for the prediction of Students academic performance.
Zhao, Q., Wang, J. L., Pao, T. L., & Wang, L. Y. (2020). Modified fuzzy rule-based classification system for early warning of student learning. Journal of Educational Technology Systems, 48(3), 385–406.
Author information
Authors and Affiliations
Contributions
Conceptualization, methodology, software, statistical analysis, writing—original draft preparation, Data curation, writing—review and editing, visualization, Data analysis, Investigation, literature review:
Harsimran Singh1, Banipreet Kaur2.
Resources Tools: Ajeet Singh4.
Supervision: Arun Sharma2; Discussion: all authors.
Corresponding author
Ethics declarations
Declarations
I the undersigned declare that the manuscript is original research work, has not been published before and is not under any other for publication elsewhere. All authors have read and agreed to the published version of the manuscript.
Conflicts of interest/competing interest
All the authors declare that there is not conflict of interest associated with this publication.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, H., Kaur, B., Sharma, A. et al. Framework for suggesting corrective actions to help students intended at risk of low performance based on experimental study of college students using explainable machine learning model. Educ Inf Technol 29, 7997–8034 (2024). https://doi.org/10.1007/s10639-023-12072-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10639-023-12072-1