Predicting Students Answers Using Data Science: An Experimental Study with Machine Learning

Abdullah, Malak; Yaseen, Naba Bani; Makahleh, Mohammad

doi:10.1007/978-3-031-56728-5_10

Malak Abdullah¹³,
Naba Bani Yaseen¹³ &
Mohammad Makahleh¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 960))

Included in the following conference series:

International Conference on Emerging Trends and Applications in Artificial Intelligence

29 Accesses

Abstract

In today’s data-driven world, the abundance of information provides us with opportunities to explore the relationships between various data points, leading to progress in multiple domains. For instance, in the field of education, we can leverage students’ past course performance and academic records to offer tailored guidance, allowing them to concentrate their efforts on specific areas for academic growth. By employing machine learning techniques, we can analyze data relations and predict future events based on historical data. In this study, we utilized machine learning techniques on the educational dataset from NeurIPS 2020. We aimed to improve the prediction of upcoming student performance by adding valuable features. To accomplish this, we explored several classification algorithms, including SVM, Naive Bayes, Logistic Regression, and Decision Tree. Additionally, we considered Ensemble methods such as Boosting, Bagging, and Voting. By assessing the optimal hyperparameter values for these algorithms, we aimed to optimize their performance. Our findings revealed that augmenting the dataset with more correlated features significantly improved prediction accuracy. Among the classifiers examined, Decision Tree, XG Boost, and Voting exhibited the best performance, achieving an accuracy rate of 74%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Brighouse, H.: Education. Routledge (2012)
Google Scholar
Orr, D.: What is education for. Context 27(53), 52–58 (1991)
Google Scholar
Kučak, D., Juričić, V., Dambić, G.: Machine learning in education-a survey of current research trends. In: Annals of DAAAM & Proceedings, vol. 29 (2018)
Google Scholar
Holmes, W., Bialik, M., Fadel, C.: Artificial intelligence in education. In: Boston: Center for Curriculum Redesign, vol. 2019, pp. 1–35 (2019)
Google Scholar
Géron, A.: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O’Reilly Media (2019)
Google Scholar
De La Hoz, E.J., Fontalvo, T.: Methodology of machine learning for the classification and prediction of users in virtual education environments (2019)
Google Scholar
Abdullah, M., Al-Ayyoub, M., AlRawashdeh, S., Shatnawi, F.: E-learningDJUST: E-learning dataset from Jordan university of science and technology toward investigating the impact of Covid-19 pandemic on education. Neural Comput. Appl. 1–15 (2021)
Google Scholar
Abdullah, M., Al-Ayyoub, M., Shatnawi, F., Rawashdeh, S., Abbott, R.: Predicting students’ academic performance using e-learning logs. IAES Int. J. Artif. Intell. 12(2), 831 (2023)
Google Scholar
Woolf, B.P., Lane, H.C., Chaudhri, V.K., Kolodner, J.L.: AI grand challenges for education. AI Mag. 34(4), 66–84 (2013)
Google Scholar
Woolf, B.P.: AI and education: celebrating 30 years of marriage. In: AIED Workshops. Citeseer (2015)
Google Scholar
Sekeroglu, B., Dimililer, K., Tuncal, K.: Student performance prediction and classification using machine learning algorithms. In: Proceedings of the 2019 8th International Conference on Educational and Information Technology, pp. 7–11 (2019)
Google Scholar
Jalota, C., Agrawal, R.: Analysis of educational data mining using classification. In: 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), pp. 243–247. IEEE (2019)
Google Scholar
Darmayanti, I., Subarkah, P., Anunggilarso, L.R., Suhaman, J.: Prediksi potensi siswa putus sekolah akibat pandemi covid-19 menggunakan algoritme k-nearest neighbor. JST (Jurnal Sains dan Teknologi) 10(2), 230–238 (2021)
Article Google Scholar
Amra, I.A.A., Maghari, A.Y.: Students performance prediction using KNN and Naïve Bayesian. In: 2017 8th International Conference on Information Technology (ICIT), pp. 909–913. IEEE (2017)
Google Scholar
Mythili, M., Shanavas, A.M.: An analysis of students’ performance using classification algorithms. IOSR J. Comput. Eng. 16(1) (2014)
Google Scholar
Wang, Z., et al.: Diagnostic questions: the neurips 2020 education challenge. arXiv preprint arXiv:2007.12061 (2020)
Gunawardana, A., Shani, G.: A survey of accuracy evaluation metrics of recommendation tasks. J. Mach. Learn. Res. 10(12) (2009)
Google Scholar
Hossin, M., Sulaiman, M.N.: A review on evaluation metrics for data classification evaluations. Int. J. Data Min. Knowl. Manag. Process 5(2), 1 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Jordan University of Science and Technology, Irbid, Jordan
Malak Abdullah, Naba Bani Yaseen & Mohammad Makahleh

Authors

Malak Abdullah
View author publications
You can also search for this author in PubMed Google Scholar
Naba Bani Yaseen
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Makahleh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Malak Abdullah .

Editor information

Editors and Affiliations

Ingenium Research Group, University of Castilla-La Mancha, Ciudad Real, Spain
Fausto Pedro García Márquez
National University of Computer and Emerging Sciences, Islamabad, Pakistan
Akhtar Jamil
Department of Computer Engineering, Istinye University, Istanbul, Türkiye
Alaa Ali Hameed
Ingenium Research Group, University of Castilla-La Mancha (UCLM), Ciudad Real, Spain
Isaac Segovia Ramírez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abdullah, M., Yaseen, N.B., Makahleh, M. (2024). Predicting Students Answers Using Data Science: An Experimental Study with Machine Learning. In: García Márquez, F.P., Jamil, A., Hameed, A.A., Segovia Ramírez, I. (eds) Emerging Trends and Applications in Artificial Intelligence. ICETAI 2023. Lecture Notes in Networks and Systems, vol 960. Springer, Cham. https://doi.org/10.1007/978-3-031-56728-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-031-56728-5_10
Published: 30 April 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56727-8
Online ISBN: 978-3-031-56728-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Predicting Students Answers Using Data Science: An Experimental Study with Machine Learning