Abstract
Massive Open Online Courses (MOOCs) have dramatically changed how people access education. Though substantial research works have been carried out to improve students’ learning experiences, very little attention was directed to the characterization and identification of quality MOOCs for students to undertake (e.g., those with a large enrolment of students), which, we argue, is vital to empower students to make use of MOOCs to reskill and upskill. To fill the gap, this study aimed to investigate the extent to which ML models can be used to automatically identify the popularity of a MOOC before or upon its publication. Specifically, we collected data about more than 50K courses from Udemy, based on which we engineered a total of 21 features as input to four widely-used ML models for MOOC popularity prediction, namely Linear Regression, Random Forests, XGBoost, and Multi-Layer Perceptron Neural Network. Through extensive evaluations, we demonstrated that (i) XGBoost gave the best performance in predicting MOOC popularity; (ii) features like the number of captions and enrolment fee were strongly correlated with MOOC popularity; (iii) the prediction results were mostly inferior to those reported on predicting the popularity of social media posts and news articles, and thus more research effort is needed to boost the prediction performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bandari, R., Asur, S., Huberman, B.: The pulse of news in social media: forecasting popularity. In: ICWSM, vol. 6 (2012)
Borges, H., Hora, A., Valente, M.T.: Predicting the popularity of github repositories. In: PROMISE, pp. 1–10 (2016)
Christensen, G., Steinmetz, A., Alcorn, B., Bennett, A., Woods, D., Emanuel, E.: The MOOC phenomenon: who takes massive open online courses and why? Available at SSRN 2350964 (2013)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Gayberi, M., Oguducu, S.G.: Popularity prediction of posts in social networks based on user, post and image features. In: Proceedings of the 11th International Conference on Management of Digital EcoSystems, pp. 9–15 (2019)
Guo, P.J., Reinecke, K.: Demographic differences in how students navigate through MOOCs. In: L@S, pp. 21–30 (2014)
Kizilcec, R.F., Piech, C., Schneider, E.: Deconstructing disengagement: analyzing learner subpopulations in massive open online courses. In: LAK, pp. 170–179 (2013)
Mazloom, M., Pappi, I., Worring, M.: Category specific post popularity prediction. In: Schoeffmann, K., et al. (eds.) MMM 2018. LNCS, vol. 10704, pp. 594–607. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73603-7_48
Moniz, N., Torgo, L.: A review on web content popularity prediction: issues and open challenges. Online Soc. Netw. Media 12, 1–20 (2019)
Tatar, A., de Amorim, M.D., Fdida, S., Antoniadis, P.: A survey on predicting the popularity of web content. J. Internet Serv. Appl. 5(1), 1–20 (2014). https://doi.org/10.1186/s13174-014-0008-y
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemom. Intell. Lab. Syst. 2(1–3), 37–52 (1987)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, L., Swiecki, Z., Gašević, D., Chen, G. (2022). Popularity Prediction in MOOCs: A Case Study on Udemy. In: Rodrigo, M.M., Matsuda, N., Cristea, A.I., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2022. Lecture Notes in Computer Science, vol 13355. Springer, Cham. https://doi.org/10.1007/978-3-031-11644-5_56
Download citation
DOI: https://doi.org/10.1007/978-3-031-11644-5_56
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11643-8
Online ISBN: 978-3-031-11644-5
eBook Packages: Computer ScienceComputer Science (R0)