Skip to main content

Early Dropout Prediction for Programming Courses Supported by Online Judges

  • Conference paper
  • First Online:
Artificial Intelligence in Education (AIED 2019)

Abstract

Many educational institutions have been using online judges in programming classes, amongst others, to provide faster feedback for students and to reduce the teacher’s workload. There is some evidence that online judges also help in reducing dropout. Nevertheless, there is still a high level of dropout noticeable in introductory programming classes. In this sense, the objective of this work is to develop and validate a method for predicting student dropout using data from the first two weeks of study, to allow for early intervention. Instead of the classical questionnaire-based method, we opted for a non-subjective, data-driven approach. However, such approaches are known to suffer from a potential overload of factors, which may not all be relevant to the prediction task. As a result, we reached a very promising 80% of accuracy, and performed explicit extraction of the main factors leading to student dropout.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Number of log lines on attempt to solve problems. To illustrate, each time the student presses a button in the embedded IDE of the ‘online judge’, this event is stored as a line in a log file (adapted from [1, 3]).

References

  1. Ahadi, A., Lister, R., Haapala, H., Vihavainen, A.: Exploring machine learning methods to automatically identify students in need of assistance. In: ICER 2015, pp. 121–130 (2015)

    Google Scholar 

  2. Alamri, A., et al.: Predicting MOOCs dropout using only two easily obtainable features from the first week’s activities. In: Coy, A., Hayashi, Y., Chang, M. (eds.) ITS 2019. LNCS, vol. 11528, pp. 163–173. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22244-4_20

    Chapter  Google Scholar 

  3. Castro-Wunsch, K., Ahadi, A., Petersen, A.: Evaluating neural networks as a method for identifying students in need of assistance. In: Proceedings of the 2017 ACM SIGCSE Technical Symposium on Computer Science Education, pp. 111–116. ACM (2017)

    Google Scholar 

  4. Chen, W., Brinton, C.G., Cao, D., Mason-Singh, A., Lu, C., Chiang, M.: Earlydetection prediction of learning outcomes in online short-courses vialearning behaviors. IEEE Trans. Learn. Technol. (2018)

    Google Scholar 

  5. Cristea, A.I., Alamri, A., Kayama, M., Stewart, C., Alsheri, M., Shi, L.: Earliest predictor of dropout in moocs: a longitudinal study of futurelearn courses. In: 27th International Conference on Information Systems Development (ISD2018), Lund, Sweden. Association for Information Systems (2018)

    Google Scholar 

  6. Dwan, F., Oliveira, E., Fernandes, D.: Predição de zona de aprendizagem de alunos de introdução à programação em ambientes de correção automática de código. In: Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), vol. 28, p. 1507 (2017)

    Google Scholar 

  7. Guyon, I., Weston, J., Barnhill, S., Vapnik, V.: Gene selection for cancer classification using support vector machines. Mach. Learn. J. 46(2), 389–422 (2002)

    Article  Google Scholar 

  8. Ihantola, P., et al.: Educational data mining and learning analytics in programming: literature review and case studies. In: Proceedings of the 2015 ITiCSE on Working Group Reports, pp. 41–63. ACM (2015)

    Google Scholar 

  9. Otero, J., Junco, L., Suarez, R., Palacios, A., Couso, I., Sanchez, L.: Finding informative code metrics under uncertainty for predicting the pass rate of online courses. Inf. Sci. 373, 42–56 (2016)

    Article  Google Scholar 

  10. Pereira, F.D., Oliveira, E., Fernandes, D., Cristea, A.: Early performance prediction for CS1 course students using a combination of machine learning and an evolutionary algorithm. In: The 19th IEEE International Conference on Advanced Learning Technologies (ICALT 2019) (2019)

    Google Scholar 

  11. Quinlan, J.R.: C4. 5: Programming for Machine Learning, vol. 38, p. 48. Morgan Kauffmann (1993)

    Google Scholar 

  12. Vivian, R., Falkner, K., Falkner, N.: Addressing the challenges of a new digital technologies curriculum: MOOCs as a scalable solution for teacher professional development (2014)

    Google Scholar 

  13. Wasik, S., Antczak, M., Laskowski, A., Sternal, T., et al.: A survey on online judge systems and their applications. ACM Comput. Surv. (CSUR) 51(1), 3 (2018)

    Article  Google Scholar 

  14. Whitehill, J., Mohan, K., Seaton, D., Rosen, Y., Tingley, D.: Delving deeper into MOOC student dropout prediction. arXiv preprint arXiv:1702.06404 (2017)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Filipe D. Pereira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pereira, F.D. et al. (2019). Early Dropout Prediction for Programming Courses Supported by Online Judges. In: Isotani, S., Millán, E., Ogan, A., Hastings, P., McLaren, B., Luckin, R. (eds) Artificial Intelligence in Education. AIED 2019. Lecture Notes in Computer Science(), vol 11626. Springer, Cham. https://doi.org/10.1007/978-3-030-23207-8_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-23207-8_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-23206-1

  • Online ISBN: 978-3-030-23207-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics