Abstract
In this research, we compared the accuracy of machine learning algorithms that could be used for predictive analytics in higher education. The proposed experiment combines classic machine learning algorithms, such as Naive Bayes and Random Forest, with various ensemble methods, such as Stochastic, Linear Discriminant Analysis (LDA), Tree model (C5.0), Bagged CART (treebag) and K Nearest Neighbors (KNN). We applied traditional classification methods to classify students’ performance and to determine the independent variables that yield the highest accuracy. Our results show that random forest applied to the data with 11 features produced the best accuracy, 0.7333. We then revised the experiment with ensemble algorithms to reduce variance (bagging), to reduce bias (boosting) and to improve prediction accuracy (stacking). In the revised experiment, the bagging random forest outperformed the other methods with an accuracy of 0.7959.
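To make the bagging idea mentioned in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes synthetic two-feature data with hypothetical "pass"/"fail" labels, and it swaps in simple decision stumps as base learners in place of random forest or CART. All function names (`fit_stump`, `fit_bagging`, `predict_bagging`, `make_data`) are illustrative. Each base learner is trained on a bootstrap resample, and the ensemble predicts by majority vote, which is the mechanism that tends to reduce variance.

```python
import random
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def fit_stump(X, y):
    """Fit a one-feature threshold classifier by exhaustive search."""
    labels = sorted(set(y))
    best = None  # (training accuracy, feature, threshold, label_low, label_high)
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X}):
            for low in labels:
                for high in labels:
                    pred = [low if row[j] <= thr else high for row in X]
                    acc = accuracy(y, pred)
                    if best is None or acc > best[0]:
                        best = (acc, j, thr, low, high)
    _, j, thr, low, high = best
    return lambda row: low if row[j] <= thr else high


def fit_bagging(X, y, n_models=25, seed=0):
    """Train n_models stumps, each on a bootstrap resample of the data."""
    rng = random.Random(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        models.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    return models


def predict_bagging(models, row):
    """Majority vote over the base learners."""
    votes = Counter(model(row) for model in models)
    return votes.most_common(1)[0][0]


def make_data(n, rng, noise=0.1):
    """Hypothetical data: two scores in [0, 1), label flipped with prob. noise."""
    X, y = [], []
    for _ in range(n):
        a, b = rng.random(), rng.random()
        label = "pass" if a + b > 1.0 else "fail"
        if rng.random() < noise:
            label = "fail" if label == "pass" else "pass"
        X.append([a, b])
        y.append(label)
    return X, y


rng = random.Random(42)
X_train, y_train = make_data(120, rng)
X_test, y_test = make_data(60, rng)

single = fit_stump(X_train, y_train)
ensemble = fit_bagging(X_train, y_train, n_models=25, seed=1)

single_acc = accuracy(y_test, [single(row) for row in X_test])
bagged_acc = accuracy(y_test, [predict_bagging(ensemble, row) for row in X_test])
print(f"single stump accuracy: {single_acc:.4f}")
print(f"bagged stumps accuracy: {bagged_acc:.4f}")
```

The accuracies printed here depend entirely on the synthetic data and say nothing about the 0.7333 and 0.7959 figures reported in the abstract; the sketch only illustrates how a held-out accuracy comparison between a single model and a bagged ensemble can be set up.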
Copyright information
© 2019 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Brohi, S.N., Pillai, T.R., Kaur, S., Kaur, H., Sukumaran, S., Asirvatham, D. (2019). Accuracy Comparison of Machine Learning Algorithms for Predictive Analytics in Higher Education. In: Miraz, M., Excell, P., Ware, A., Soomro, S., Ali, M. (eds) Emerging Technologies in Computing. iCETiC 2019. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 285. Springer, Cham. https://doi.org/10.1007/978-3-030-23943-5_19
DOI: https://doi.org/10.1007/978-3-030-23943-5_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-23942-8
Online ISBN: 978-3-030-23943-5
eBook Packages: Computer Science, Computer Science (R0)