Skip to main content

Applying Data Mining Techniques to Identify Success Factors in Students Enrolled in Distance Learning: A Case Study

  • Conference paper
  • First Online:
Advances in Artificial Intelligence and Its Applications (MICAI 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9414))

Included in the following conference series:

Abstract

Distance learning is now a key component in higher level education. Given the high dropout rates and the important investments in distance learning it is of utmost concern to determine the most critical data in the success and failure of students. In this article we data mine enrollment profiles, educational background and students´ data from the Open University System and Distance Learning of the National Autonomous University of Mexico to determine the key factors that drive success and failure, creating a relevant predictive model using a Naive Bayes classifier. We have found that the number of subjects approved and their average qualification in the first semester are part of the most interesting predictors of student success.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Each instance refers to a student with his/her data entrance profile, educational background and enrollment data.

  2. 2.

    The graph was generated with the web calculator ROC analysis of the school of medicine at Johns Hopkins University. URL: http://bit.ly/1eWFAnC.

References

  1. Zhao, C.-M., Luan, J.: Data mining: going beyond traditional statistics. New Dir. Inst. Res. 131(2), 7–16 (2006)

    Google Scholar 

  2. Yukselturk, E., Ozekes, S., Türel, Y.: Predicting dropout student: an application of data mining methods in an online education program. Eur. J. Open Distance E-Learn. 17(1), 118–133 (2014)

    Google Scholar 

  3. Lykourentzou, I., Giannoukos, I., Nikolopoulos, V., Mpardis, G., Loumos, V.: Dropout prediction in e-learning courses through the combination of machine learning techniques. Comput. Educ. 53(3), 950–965 (2009)

    Article  Google Scholar 

  4. Willging, P.A., Johnson, S.D.: Factors that influence students’ decision to dropout of online courses. J. Asynchronous Learn. Netw. 13(3), 115–127 (2004)

    Google Scholar 

  5. Lile, A.: Analyzing E-learning systems using educational data mining techniques. Mediterranean J. Soc. Sci. 2(3), 403–419 (2011)

    Google Scholar 

  6. Kotsiantis, S., Pierrakeas, C., Pintelas, P.: Preventing student dropout in distance learning using machine learning techniques. Knowl.-Based Intell. Inf. Eng. Syst. 2774, 267–274 (2003)

    Google Scholar 

  7. Zang, W., Lin, F.: Investigation of web-based teaching and learning by boosting algorithms. In: Proceedings of IEEE International Conference on Information Technology: Research and Education (ITRE 2003), pp. 445–449 (2003)

    Google Scholar 

  8. Dekker, G., Pechenizkiy, M., Vleeshouwers, J.: Predicting student drop out: a case study. In: Barnes, T., Desmarais, M., Romero, C., Ventura, S. (eds.), Proceedings of the 2nd International Conference on Educational Data Mining, (EDM 2009), pp. 41–50 (2009)

    Google Scholar 

  9. Stephens, C.R., Heau, J.G., González, C., Ibarra-Cerdeña, C.N., Sánchez-Cordero, V., et al.: Using biotic interaction networks for prediction in biodiversity and emerging diseases. PLoS ONE 4(5), e5725 (2009). doi:10.1371/journal.pone.0005725

    Article  Google Scholar 

  10. Mitchell, T., Machine Learning, Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression (Draft Version). McGraw Hill (2005)

    Google Scholar 

  11. Swet, J.A.: Measuring the accuracy of diagnostic systems. Science 240, 1285–1293 (1988)

    Article  MathSciNet  Google Scholar 

  12. Fawcett, T.: An introduction to ROC analysis. Pattern Recogn. Lett. 27(8), 861–874 (2006)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José Gerardo Moreno Salinas .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Salinas, J.G.M., Stephens, C.R. (2015). Applying Data Mining Techniques to Identify Success Factors in Students Enrolled in Distance Learning: A Case Study. In: Pichardo Lagunas, O., Herrera Alcántara, O., Arroyo Figueroa, G. (eds) Advances in Artificial Intelligence and Its Applications. MICAI 2015. Lecture Notes in Computer Science(), vol 9414. Springer, Cham. https://doi.org/10.1007/978-3-319-27101-9_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27101-9_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27100-2

  • Online ISBN: 978-3-319-27101-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics