Skip to main content

An Exploratory Study on Students’ Performance Classification Using Hybrid of Decision Tree and Naïve Bayes Approaches

  • Conference paper
  • First Online:
Advances in Information and Communication Technology (ICTA 2016)

Abstract

Students’ performance prediction can give a prior approximate knowledge of the students’ performance in future academic to the educators. However, it is not any easy task to perform prediction due to the poor identification of parameters and the lack of prediction techniques. In this paper, few parameters will be proposed and the most influenced parameters on students’ performance will be identified using chi squared. The hybrid of Decision Tree and Naïve Bayes algorithms, NBTree will be used to classify the performance of new students. NBTree classifier undergoes the training and testing process using 10-folds cross validation technique and obtained the classification accuracy of 85.9 %, which is better than the accuracy of Decision Tree and Naïve Bayes classifiers which are having 63.7 % and 72.6 % respectively. The classified performance result can be used by the educators to improve the teaching and learning process by developing new teaching methods and new teaching styles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Romero, C., Ventura, S.: Educational data mining-a review of the state of the art. IEEE Trans. Syst., Man, Cybern.- Part C: Appl. Rev. 40(6), 601–619 (2010)

    Article  Google Scholar 

  2. Farid, D.M., et al.: Hybrid decision tree and Naïve Bayes classifiers for multi-class classification tasks. Expert Syst. Appl. 41(4), 1937–1946 (2014)

    Article  MathSciNet  Google Scholar 

  3. What is the difference between supervised learning, unsupervised learning? Stackoverflow.com (2016). http://stackoverflow.com/questions/1832076/what-is-the-difference-between-supervised-learning-and-unsupervised-learning

  4. What is a Decision Tree? -Examples, Advantages & Role in Management - Video & Lesson Transcript \(|\) Study.com, Study.com (2015). http://study.com/academy/lesson/what-is-a-decision-tree-examples-advantages-role-in-management.html

  5. Quadril, M.N., Kalyankar, N.V.: Drop out feature of student data for academic performance using decision tree techniques. Glob. J. Comput. Sci. Technol. 10(2), 1–5 (2010)

    Google Scholar 

  6. Osmanbegovic, E., Suljic, M.: Data mining approach for predicting student performance. J. Econ. Bus. 10(1), 51–56 (2012)

    Google Scholar 

  7. Naïve Bayes for Dummies a Simple Explanation, Text Analysis Blog \(|\) Aylien (2015). http://blog.aylien.com/post/120703930533/naive-bayes-for-dummies-a-simple-explanation

  8. Dangi, A., Srivastava, S.: Educational data classification using selective Naive Bayes for quota categorization. In: IEEE International Conference on MOOC, Innovation, Technology in Education (MITE), pp. 118–121 (2014)

    Google Scholar 

  9. Shana, J., Venkatachalam, T.: Identifying key performance indicators and predicting the result from student data. Int. J. Comput. Appl. 25(9), 45–48 (2011)

    Google Scholar 

  10. Qasem, A.A.-R., Emad, M.A.-S., Mustafa, I.A.-N.: Mining student data using decision trees. In: The 2006 International Arab Conference on Information Technology (ACIT 2006), pp. 1–5 (2006)

    Google Scholar 

  11. Kabra, R.R., Bichkar, R.S.: Performance prediction of engineering students using decision trees. Int. J. Comput. Appl. 36(11), 8–12 (2011)

    Google Scholar 

  12. Mishra, T., Kumar, D., Gupta, S.: Mining students’ data for prediction performance. In: Fourth International Conference on Advanced Computing & Communication Technologies, pp. 255–262 (2014)

    Google Scholar 

  13. Yadav, S.K., Pal, S.: Data mining- a prediciton of performance improvement using classification. World Comput. Sci. Inf. Technol. J. (WCSIT) 2(2), 51–56 (2012)

    Google Scholar 

  14. Superby, J.F., Vandamme, J.P., Meskens, N.: Determination of factors influencing the achievement of the first-year university students using data mining methods. In: Workshop on Educational Data Mining, pp. 37–44 (2006)

    Google Scholar 

  15. Ibrahim, Z., Rusli, D.: Predicting students’ academic performance-Comparing artificial neural network, decision tree, linear regression. In: 21st Annual SAS Malaysia Forum, pp. 1–6 (2007)

    Google Scholar 

  16. Ramaswami, M., Bhaskaran, R.: A CHAID based performance prediction model in educational data mining. Int. J. Comput. Sci. Issues, IJCSI 7(1), 10–19 (2010)

    Google Scholar 

  17. Baradwaj, B.K., Pal, S.: Mining educational data to analyze students’ performance. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 2(6), 63–69 (2011)

    Google Scholar 

  18. Bresfelean, V.P., Bresfelean, M., Ghisoiu, N.: Determining students’ academic failure profile founded on data mining methods. In: 30th International Conference on Information Technology Interfaces (ITI), pp. 317–322 (2008)

    Google Scholar 

  19. Shah, N.S.: Predicting factors that affect students’ academic performance by using data mining techniques. Pak. Bus. Rev., 631–668 (2012)

    Google Scholar 

  20. Kovcic, Z.J.: Early prediction of student success- mining students’ enrolment data. In: Proceedings of Informing Science & IT Education Conference (InSITE), pp. 647–665 (2010)

    Google Scholar 

  21. Cortez, P., Silva, A.: Using data mining to predict secondary school student performance. In: Brito, A., Teixeira, J. (eds.) Proceedings of 5th Future Business Technology Conference, pp. 5–12 (2008)

    Google Scholar 

  22. Ishitani, T.T.: A longitudinal approach to assessing attrition behavior among first-generation students: time-varying effects of pre-college characteristic. Res. High. Educ. 44(4), 433–449 (2003)

    Article  Google Scholar 

  23. Shahiri, A.M., Husain, W., Rashid, N.A.: A review on predicting student’s performance using data mining techniques. In: The Third Information Systems International Conference, pp. 1–8 (2015)

    Google Scholar 

  24. Kabakchieva, D.: Predicting student performance by using data mining methods for classification. Cybern. Inf. Technol. 13(1), 61–72 (2013)

    Google Scholar 

  25. Ramesh, V., Parkavi, P., Ramar, K.: Predicting student performance-a statistical and data mining approach. Int. J. Comput. Appl. 63(8), 35–39 (2013)

    Google Scholar 

  26. Hijazi, S.T., Naqvi, S.M.M.R.: Factors affecting students’ performance: a case of private colleges. Bangladesh e-J. Sociol. 3(1), 90–100 (2006)

    Google Scholar 

  27. Khan, Z.N.: Scholastic achievement of higher secondary students in science stream. J. Soc. Sci. 1(2), 84–87 (2005)

    Google Scholar 

  28. Romero, C., et al.: Data mining algorithms to classify students. In: Ist International Conference on Educational Data Mining Montreal, Quebec, Canada, pp. 8–17 (2008)

    Google Scholar 

  29. Jishan, S.T., et al.: Improving accuracy of students’ final grade prediction model using optimal equal width binning and synthetic minority over-sampling technique. Dec. Analytics 2(1), 1–25 (2015)

    Article  Google Scholar 

  30. Shyamala, D., et al.: A study on feature selection, classification techniques for automatic genre classification of traditional malay music. In: ISMIR- Section 3A- Content Based Retrieval, Categorization, Similarity, pp. 331–336 (2011)

    Google Scholar 

  31. Shana, J., Venkatachalam, T.: Identifying key performance indicators and predicting the result from student data. Int. J. Comput. Appl. 25(9), 45–48 (2011)

    Google Scholar 

  32. Dinakaran, S., Ranjit, J.T.P.: Role of attribute selection in classification algorithms. Int. J. Sci. Eng. Res. 4(6), 67–71 (2013)

    Google Scholar 

  33. Kohavi, R.: Scaling up the accuracy of Naïve-Bayes classifiers: a decision-tree hybrid. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 202–207 (1996)

    Google Scholar 

  34. Jiang, L., Li, C.: Scaling up the accuracy of decision-tree classifiers: a Naïve-Bayes combination. J. Comput. 6(7), 1325–1331 (2011)

    Google Scholar 

Download references

Acknowledgments

We would like to show our gratitude to Universiti Sains Malaysia (USM) for supporting this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoong Yen Chuan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Chuan, Y.Y., Husain, W., Shahiri, A.M. (2017). An Exploratory Study on Students’ Performance Classification Using Hybrid of Decision Tree and Naïve Bayes Approaches. In: Akagi, M., Nguyen, TT., Vu, DT., Phung, TN., Huynh, VN. (eds) Advances in Information and Communication Technology. ICTA 2016. Advances in Intelligent Systems and Computing, vol 538. Springer, Cham. https://doi.org/10.1007/978-3-319-49073-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-49073-1_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49072-4

  • Online ISBN: 978-3-319-49073-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics