Skip to main content

Data Analysis and Machine Learning for MOOC Optimization

  • Conference paper
  • First Online:
Innovations in Smart Cities Applications Volume 7 (SCA 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 938))

  • 105 Accesses

Abstract

Our study analyses learning data in the context of massive open online courses (MOOCs) that are growing in popularity, but their effectiveness and learner completion rate are often criticized. Our goal is to improve students’ completion rate by analyzing their behaviors and level of engagement. Log data generated by students’ interactions with didactic activities are used to predict student dropout, and thus improve pedagogical quality. Different machine learning approaches, artificial neural networks, SVMs, and decision trees are used for this analysis. The use of artificial intelligence models makes it possible to personalize the learning experience, detect anomalies, and take appropriate action. The three classification models used show a high accuracy, reaching almost 99% and the mean square error is very low.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fei, M., Yeung, D.Y.: Temporal models for predicting student dropout in massive open online courses. In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW), pp. 256–263. IEEE (2015)

    Google Scholar 

  2. Gardner, J., Brooks, C.: Student success prediction in MOOCs. User Model. User-Adap. Inter. 28, 127–203 (2018)

    Article  Google Scholar 

  3. Marbouti, F., Diefes-Dux, H.A., Madhavan, K.: Models for early prediction of at-risk students in a course using standards-based grading. Comput. Educ. 103, 1–15 (2016)

    Article  Google Scholar 

  4. Al-Musharraf, A., Alkhattabi, M.: An educational data mining approach to explore the effect of using interactive supporting features in an LMS for overall performance within an online learning environment. Int. J. Comput. Sci. Netw. Secur. (IJCSNS) 16(3), 1 (2016)

    Google Scholar 

  5. Conijn, R., Snijders, C., Kleingeld, A., Matzat, U.: Predicting student performance from LMS data: a comparison of 17 blended courses using Moodle LMS. IEEE Trans. Learn. Technol. 10(1), 17–29 (2016)

    Article  Google Scholar 

  6. Le, M.D., Nguyen, H.H., Nguyen, D.L., Nguyen, V.A.: How to forecast the students’ learning outcomes based on factors of interactive activities in a blended learning course. In: Proceedings of the 6th International Conference on Frontiers of Educational Technologies, pp. 11–15 (2020)

    Google Scholar 

  7. Chamizo-Gonzalez, J., Cano-Montero, E.I., Urquia-Grande, E., Muñoz-Colomina, C.I.: Educational data mining for improving learning outcomes in teaching accounting within higher education. Int. J. Inf. Learn. Technol. 32(5), 272–285 (2015)

    Article  Google Scholar 

  8. Jo, Y., Maki, K., Tomar, G.: Time series analysis of clickstream logs from online courses. arXiv preprint arXiv:1809.04177 (2018)

  9. Chung, J., Gulcehre, C., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014)

  10. Zhang, K., Xu, J., Min, M.R., Jiang, G., Pelechrinis, K., Zhang, H.: Automated IT system failure prediction: a deep learning approach. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 1291–1300. IEEE (2016)

    Google Scholar 

  11. Du, M., Li, F., Zheng, G., Srikumar, V.: DeepLog: anomaly detection and diagnosis from system logs through deep learning. In: Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, pp. 1285–1298 (2017)

    Google Scholar 

  12. Hagan, M.T., Demuth, H.B., Beale, M.H., De Jesús, O.: Neural Network Design, vol. 20. Pws Pub, Worcester (1996)

    Google Scholar 

  13. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1999). https://doi.org/10.1007/978-1-4757-3264-1

  14. Ozkan, I.A., Koklu, M.: Skin lesion classification using machine learning algorithms. Int. J. Intell. Syst. Appl. Eng. 5(4), 285–289 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to El Ghali Mohamed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mohamed, E.G., Issam, A., Mohamed, T. (2024). Data Analysis and Machine Learning for MOOC Optimization. In: Ben Ahmed, M., Boudhir, A.A., El Meouche, R., Karaș, İ.R. (eds) Innovations in Smart Cities Applications Volume 7. SCA 2023. Lecture Notes in Networks and Systems, vol 938. Springer, Cham. https://doi.org/10.1007/978-3-031-54376-0_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-54376-0_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-54375-3

  • Online ISBN: 978-3-031-54376-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics