Abstract
In today’s data-driven world, the abundance of information provides us with opportunities to explore the relationships between various data points, leading to progress in multiple domains. For instance, in the field of education, we can leverage students’ past course performance and academic records to offer tailored guidance, allowing them to concentrate their efforts on specific areas for academic growth. By employing machine learning techniques, we can analyze data relations and predict future events based on historical data. In this study, we utilized machine learning techniques on the educational dataset from NeurIPS 2020. We aimed to improve the prediction of upcoming student performance by adding valuable features. To accomplish this, we explored several classification algorithms, including SVM, Naive Bayes, Logistic Regression, and Decision Tree. Additionally, we considered Ensemble methods such as Boosting, Bagging, and Voting. By assessing the optimal hyperparameter values for these algorithms, we aimed to optimize their performance. Our findings revealed that augmenting the dataset with more correlated features significantly improved prediction accuracy. Among the classifiers examined, Decision Tree, XG Boost, and Voting exhibited the best performance, achieving an accuracy rate of 74%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Brighouse, H.: Education. Routledge (2012)
Orr, D.: What is education for. Context 27(53), 52–58 (1991)
Kučak, D., Juričić, V., Dambić, G.: Machine learning in education-a survey of current research trends. In: Annals of DAAAM & Proceedings, vol. 29 (2018)
Holmes, W., Bialik, M., Fadel, C.: Artificial intelligence in education. In: Boston: Center for Curriculum Redesign, vol. 2019, pp. 1–35 (2019)
Géron, A.: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O’Reilly Media (2019)
De La Hoz, E.J., Fontalvo, T.: Methodology of machine learning for the classification and prediction of users in virtual education environments (2019)
Abdullah, M., Al-Ayyoub, M., AlRawashdeh, S., Shatnawi, F.: E-learningDJUST: E-learning dataset from Jordan university of science and technology toward investigating the impact of Covid-19 pandemic on education. Neural Comput. Appl. 1–15 (2021)
Abdullah, M., Al-Ayyoub, M., Shatnawi, F., Rawashdeh, S., Abbott, R.: Predicting students’ academic performance using e-learning logs. IAES Int. J. Artif. Intell. 12(2), 831 (2023)
Woolf, B.P., Lane, H.C., Chaudhri, V.K., Kolodner, J.L.: AI grand challenges for education. AI Mag. 34(4), 66–84 (2013)
Woolf, B.P.: AI and education: celebrating 30 years of marriage. In: AIED Workshops. Citeseer (2015)
Sekeroglu, B., Dimililer, K., Tuncal, K.: Student performance prediction and classification using machine learning algorithms. In: Proceedings of the 2019 8th International Conference on Educational and Information Technology, pp. 7–11 (2019)
Jalota, C., Agrawal, R.: Analysis of educational data mining using classification. In: 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), pp. 243–247. IEEE (2019)
Darmayanti, I., Subarkah, P., Anunggilarso, L.R., Suhaman, J.: Prediksi potensi siswa putus sekolah akibat pandemi covid-19 menggunakan algoritme k-nearest neighbor. JST (Jurnal Sains dan Teknologi) 10(2), 230–238 (2021)
Amra, I.A.A., Maghari, A.Y.: Students performance prediction using KNN and Naïve Bayesian. In: 2017 8th International Conference on Information Technology (ICIT), pp. 909–913. IEEE (2017)
Mythili, M., Shanavas, A.M.: An analysis of students’ performance using classification algorithms. IOSR J. Comput. Eng. 16(1) (2014)
Wang, Z., et al.: Diagnostic questions: the neurips 2020 education challenge. arXiv preprint arXiv:2007.12061 (2020)
Gunawardana, A., Shani, G.: A survey of accuracy evaluation metrics of recommendation tasks. J. Mach. Learn. Res. 10(12) (2009)
Hossin, M., Sulaiman, M.N.: A review on evaluation metrics for data classification evaluations. Int. J. Data Min. Knowl. Manag. Process 5(2), 1 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Abdullah, M., Yaseen, N.B., Makahleh, M. (2024). Predicting Students Answers Using Data Science: An Experimental Study with Machine Learning. In: García Márquez, F.P., Jamil, A., Hameed, A.A., Segovia Ramírez, I. (eds) Emerging Trends and Applications in Artificial Intelligence. ICETAI 2023. Lecture Notes in Networks and Systems, vol 960. Springer, Cham. https://doi.org/10.1007/978-3-031-56728-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-56728-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56727-8
Online ISBN: 978-3-031-56728-5
eBook Packages: EngineeringEngineering (R0)