Skip to main content

Advertisement

Log in

Mining educational data to predict students performance

A comparative study of data mining techniques

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

Information is everywhere in a hidden and scattered way. It becomes useful when we apply Data mining to extracts the hidden, meaningful, and potentially useful patterns from these vast data resources. Educational data mining ensures a quality education by analyzing educational data based on various aspects. In this paper, we have analyzed the academic results and behavior of some engineering students. For this study, we collect data from 80 students from the CSE department. We gather data from mark sheets and other relevant factors that accelerate the results, collected through a survey. Our main goal is to predict the students’ performance. According to this prediction, the counseling department will guide them in advance so that those who are likely to have bad results can do better. The classification can be based on various aspects, as many factors improve the educational system. We have created two datasets focusing on two different angles. Our first dataset classifies and predicts the category of a student (good, bad, medium) on a specific course based on their prerequisite course performance. We have implemented this in the artificial intelligence course. Our second dataset also classifies and predicts the final grade (A, B, C) of any random subject, here we organize our data such a way where it will only focus on how their performance was till the midterm exam. We analyze and compare six classification algorithms. We have focused on all aspects of an algorithm, not only the accuracy level but also the complexity and cost. We have built two final models for two of our datasets based on a decision tree and the naive Bayes algorithms accordingly.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Ahmed, S., Paul, R., & Hoque, A.S.Md.L. (2014). Knowledge discovery from academic data using Association Rule Mining. In 2014 17th International Conference on Computer and Information Technology (ICCIT: IEEE.

  • Alasadi, S.A., & Bhaya, W.S. (2017). Review of data preprocessing techniques in data mining. Journal of Engineering and Applied Sciences, 12.16, 4102–4107.

    Google Scholar 

  • Bhardwaj, B.K., & Pal, S. (2012). Data Mining: A prediction for performance improvement using classification. arXiv preprint arXiv:1201.3418.

  • Bhargavi, P., & Jyothi, S. (2009). Applying naive bayes data mining technique for classification of agricultural land soils. International Journal of Computer Science and Network Security, 9.8, 117–122.

    Google Scholar 

  • Cao, Y., & Wu, J. (2004). Dynamics of projective adaptive resonance theory model: the founda-tion of part algorithm. IEEE Transactions on Neural Networks, 15(2), 245–260.

    Article  Google Scholar 

  • Francis, B.K., & Babu, S.S. (2019). Predicting academic performance of students using a hybrid data mining approach. Journal of Medical Systems, 43.6, 162.

    Article  Google Scholar 

  • García, S., & Luengo, J. (2015). And Francisco Herrera. Data preprocessing in data mining. Cham: Springer International Publishing.

    Google Scholar 

  • Hussain, S., & et al. (2018). Educational data mining and analysis of students’ academic performance using WEKA. Indonesian Journal of Electrical Engineering and Computer Science, 9.2, 447–459.

    Article  Google Scholar 

  • Kabra, R.R., & Bichkar, R.S. (2011). Performance prediction of engineering students using decision trees. International Journal of Computer Applications, 36.11, 8–12.

    Google Scholar 

  • Pathan, A.A., & et al. (2014). Educational data mining: A mining model fordeveloping students’ programming skills. In The 8th International Conference onSoftware, Knowledge, Information Management and Applications (SKIMA 2014): IEEE.

  • Priyam, A., & et al. (2013). Comparative analysis of decision tree classification algorithms. International Journal of Current Engineering and Technology, 3.2, 334–337.

    Google Scholar 

  • Rahman, Md.H., & Islam, Md.R. (2017). Predict Student’s Academic Performance and Evaluate the Impact of Different Attributes on the Performance Using Data Mining Techniques. In 2017 2nd International Conference on Electrical and Electronic Engineering (ICEEE): IEEE.

  • Rashu, R.I., Haq, N., & Rahman, R.M. (2014). Data mining approaches to predict final grade by overcoming class imbalance problem. In 2014 17th International Conference on Computer and Information Technology (ICCIT): IEEE.

  • Tan, P.-N., Steinbach, M., & Kumar, V. (2016). Introduction to data mining. Pearson Education India.

  • Veeramuthu, P., & Periasamy, R. (2014). Application of higher education system for predicting student using data mining techniques. International Journal of Innovative Research in Advanced Engineering (IJIRAE), 1.5, 36–38.

    Google Scholar 

  • Zhou, Z.-H. (2009). Ensemble learning. Encyclopedia of Biometrics, 1, 270–273.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Khaledun Nahar.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

  • ANN: ANN or Artificial Neural Network is a computational algorithm. This algorithm has been designed by using the concept of human neuron. It process information as like human brain analyze and processes information. It has the capability of self-learning that enables it to produce better results.

  • KNN: KNN stands for K-Nearest Neighbour. It is a simple Machine Learning algorithm based on Supervised Learning technique. This algorithm find outs the similarity between new and existing data and puts the new data into the most category which is most similar with the available categories.

  • Weka: An open source machine learning software with a collection of machine learning algorithms and data preprocessing tools. By using this software users can try out existing machine learning methods on their datasets in a flexible way.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nahar, K., Shova, B.I., Ria, T. et al. Mining educational data to predict students performance. Educ Inf Technol 26, 6051–6067 (2021). https://doi.org/10.1007/s10639-021-10575-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-021-10575-3

Keywords

Navigation