Efficiency of Multi-instance Learning in Educational Data Mining

  • S. Anupama Kumar
  • M. N. Vijayalakshmi


Educational data mining (EDM) is one of the emerging technologies in recent years. The various changes in the process of teaching and learning have brought in a lot of challenges to the stakeholders to understand the learners toward the different methods of teaching and the way they perform in various teaching environments. This chapter is an application of Baker’s taxonomy in an educational dataset to predict course outcome of the learners during the middle of the course. The experiment is conducted using different single and multi-instance-based learning algorithms. The efficiency of the single and multi-instance learning algorithms was measured using the accuracy rates and the time taken to build the model. In single instance algorithm, decision stump tree was found very effective and in multi-instance learning, the Simple MI method was found very effective. The precision of the instance-based learning algorithms is calculated using Wilcoxon rank method, and multi-instance learning algorithm is found to be more accurate than the single instance learning techniques.


Prediction Single instance learning Multi-instance learning Accuracy ROC PRC MCC Rank 


  1. 1.
    Quadri, M. N., & Kalyankar, N. V. (2010). Drop out feature of student data for academic performance using decision tree techniques, GJCST computing classification H.2.8 & K.3.m. Global Journal of Computer Science and Technology, 10(2).Google Scholar
  2. 2.
    Marsh, P. A., & Poepsel, D. L. (n.d.). Perceived usefulness of learning outcomes predicts ratings of departmental helpfulness. Teaching of Psychology, 1532–8023. 0098-6283.
  3. 3.
    Jaggars, S. S., & Xu, D. (n.d.). Predicting online student outcomes from a measure of course quality. CCRC Working Paper, 57. Retrieved from
  4. 4.
    Mitchell, T. M. (2010). Generative and discriminative classifiers: Naive bayes and logistic regression. In Machine learning. Mcgraw Hill.Google Scholar
  5. 5.
    Kumar, A. S., & Vijayalakshmi, M. N. (2011). Efficiency of decision trees in predicting student’s academic performance. In International Conference on Computer Science, Engineering and Applications (CCSEA 2011), Chennai (pp. 335–341). 2231-5403.Google Scholar
  6. 6.
    Pagallo, G., & Haussler, D. (1990). Boolean feature discovery in empirical learning. Machine Learning, 5, 71–99.CrossRefGoogle Scholar
  7. 7.
    Weiss, S. M., & Indurkhya, N. (1991). Reduced complexity rule induction. In International Joint Conference on Artificial Intelligence, 678–684.Google Scholar
  8. 8.
    Kumar, A. S., & Vijayalakshmi, M. N. (2012). Inference of naive bayes techniques on student assessment data (pp. 186–191). Berlin Heidelberg: Springer.Google Scholar
  9. 9.
    Kumar, A. S. (2016). Edifice an educational framework using educational data mining and visual analytics. I.J. Education and Management Engineering, 2, 24–30. 2305-3623.Google Scholar
  10. 10.
    Principles of data mining and knowledge discovery. (1999, September). In Third European Conference, PKDD’99, Prague, Czech Republic (pp. 15–18). 978-3-540-66490-1.Google Scholar
  11. 11.
    Qasem, A. A., Emad, M., & Mustafa, A. I. (2006). Mining student data using decision trees. In International Arab Conference on Information Technology, ACIT.Google Scholar
  12. 12.
    Crain-Dorough, M. L. (2003). A study of dropout characteristics and school-level effects on dropout prevention (Unpublished doctoral dissertation). A Dissertation Submitted to the Graduate Faculty of the Louisiana State University and Agricultural and Mechanical College.Google Scholar
  13. 13.
    Ayesha, S. (2010). Data mining model for higher education system. European Journal of Scientific Research, 43(1), 24–29. 1450-2165.Google Scholar
  14. 14.
    Danso, O. S. An exploration of classification prediction techniques in data mining: The insurance domain (Unpublished doctoral dissertation). A Dissertation Presented to the School of Design, Engineering, and Computing. Bournemouth University.
  15. 15.
    Kaufmann, M. (1993). C4.5: Programs for machine learning. San Francisco, CA, USA: Inc.Google Scholar
  16. 16.
    Ramesh, V., Thenmozhi, P., & Ramar, K. (2012). Study of influencing factors of academic performance of students: A data mining Approach. International Journal of Scientific & Engineering Research, 3(7).Google Scholar
  17. 17.
    Rajeshinigo, D., & Jebmalar, P. J. (2017). Educational mining: A comparative study of classification algorithms using WEAK. International Journal of Innovative Research in Computer and Communication Engineering, 5(3), 5583–5589.Google Scholar
  18. 18.
    Zafra, A., Romero, C., & Ventura, S. (2010, October 25). Multi-instance learning versus single-instance learning for predicting the student’s performance. In Handbook on educational data mining. CRC.Google Scholar
  19. 19.
    Frederick, U. N., & Christiana, O. C. (2016). Evaluation of data mining classification algorithms for predicting students performance in technical trades. International Journal of Engineering and Computer Science, 5(8), 17593–17601. 2319-7242.Google Scholar
  20. 20.
    Kotsiantis, S. B., Pierrakeas, C. J., Zaharakis, I. D., & Pintelas, P. E. (n.d.). Efficiency of machine learning techniques in predicting students’ performance in distance learning systems. Recent Advances in Mechanics and Related Fields, 297–305.Google Scholar
  21. 21.
    Shah, N. S. (2012, January). Predicting factors that affect students’ academic performance by using data mining techniques. Review of Business Pakistan, 631–668.Google Scholar
  22. 22.
    Dole, L., & Rajurkar, J. (2013). A decision support system to predict student performance. International Journal of Innovative Research in Computer and Communication Engineering, 2(13), 7237–7273.Google Scholar
  23. 23.
    Dietterich, T. G., Lathrop, R. H., & Lozano Perez, T. (n.d.). Solving the multiple instance problem with axis parallel rectangles. Artificial Intelligence, 89, 31–71 (Elsevier Science).Google Scholar
  24. 24.
    Shah, N. S. (2012). Predicting factors that affect students’ academic performance by using data mining techniques. Pakistan Business Review, 631–668.Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.Department of MCAR V College of EngineeringBengaluruIndia

Personalised recommendations