Popularity Prediction in MOOCs: A Case Study on Udemy

Li, Lin; Swiecki, Zachari; Gašević, Dragan; Chen, Guanliang

doi:10.1007/978-3-031-11644-5_56

Lin Li¹¹,
Zachari Swiecki¹¹,
Dragan Gašević¹¹ &
…
Guanliang Chen¹¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13355))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

3830 Accesses

Abstract

Massive Open Online Courses (MOOCs) have dramatically changed how people access education. Though substantial research works have been carried out to improve students’ learning experiences, very little attention was directed to the characterization and identification of quality MOOCs for students to undertake (e.g., those with a large enrolment of students), which, we argue, is vital to empower students to make use of MOOCs to reskill and upskill. To fill the gap, this study aimed to investigate the extent to which ML models can be used to automatically identify the popularity of a MOOC before or upon its publication. Specifically, we collected data about more than 50K courses from Udemy, based on which we engineered a total of 21 features as input to four widely-used ML models for MOOC popularity prediction, namely Linear Regression, Random Forests, XGBoost, and Multi-Layer Perceptron Neural Network. Through extensive evaluations, we demonstrated that (i) XGBoost gave the best performance in predicting MOOC popularity; (ii) features like the number of captions and enrolment fee were strongly correlated with MOOC popularity; (iii) the prediction results were mostly inferior to those reported on predicting the popularity of social media posts and news articles, and thus more research effort is needed to boost the prediction performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Bandari, R., Asur, S., Huberman, B.: The pulse of news in social media: forecasting popularity. In: ICWSM, vol. 6 (2012)
Google Scholar
Borges, H., Hora, A., Valente, M.T.: Predicting the popularity of github repositories. In: PROMISE, pp. 1–10 (2016)
Google Scholar
Christensen, G., Steinmetz, A., Alcorn, B., Bennett, A., Woods, D., Emanuel, E.: The MOOC phenomenon: who takes massive open online courses and why? Available at SSRN 2350964 (2013)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Gayberi, M., Oguducu, S.G.: Popularity prediction of posts in social networks based on user, post and image features. In: Proceedings of the 11th International Conference on Management of Digital EcoSystems, pp. 9–15 (2019)
Google Scholar
Guo, P.J., Reinecke, K.: Demographic differences in how students navigate through MOOCs. In: L@S, pp. 21–30 (2014)
Google Scholar
Kizilcec, R.F., Piech, C., Schneider, E.: Deconstructing disengagement: analyzing learner subpopulations in massive open online courses. In: LAK, pp. 170–179 (2013)
Google Scholar
Mazloom, M., Pappi, I., Worring, M.: Category specific post popularity prediction. In: Schoeffmann, K., et al. (eds.) MMM 2018. LNCS, vol. 10704, pp. 594–607. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73603-7_48
Chapter Google Scholar
Moniz, N., Torgo, L.: A review on web content popularity prediction: issues and open challenges. Online Soc. Netw. Media 12, 1–20 (2019)
Article Google Scholar
Tatar, A., de Amorim, M.D., Fdida, S., Antoniadis, P.: A survey on predicting the popularity of web content. J. Internet Serv. Appl. 5(1), 1–20 (2014). https://doi.org/10.1186/s13174-014-0008-y
Article Google Scholar
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemom. Intell. Lab. Syst. 2(1–3), 37–52 (1987)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Learning Analytics, Monash University, Melbourne, Australia
Lin Li, Zachari Swiecki, Dragan Gašević & Guanliang Chen

Authors

Lin Li
View author publications
You can also search for this author in PubMed Google Scholar
Zachari Swiecki
View author publications
You can also search for this author in PubMed Google Scholar
Dragan Gašević
View author publications
You can also search for this author in PubMed Google Scholar
Guanliang Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guanliang Chen .

Editor information

Editors and Affiliations

Ateneo De Manila University, Quezon, Philippines
Maria Mercedes Rodrigo
Department of Computer Science, North Carolina State University, Raleigh, NC, USA
Noburu Matsuda
Durham University, Durham, UK
Alexandra I. Cristea
University of Leeds, Leeds, UK
Vania Dimitrova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, L., Swiecki, Z., Gašević, D., Chen, G. (2022). Popularity Prediction in MOOCs: A Case Study on Udemy. In: Rodrigo, M.M., Matsuda, N., Cristea, A.I., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2022. Lecture Notes in Computer Science, vol 13355. Springer, Cham. https://doi.org/10.1007/978-3-031-11644-5_56

Download citation

DOI: https://doi.org/10.1007/978-3-031-11644-5_56
Published: 27 July 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11643-8
Online ISBN: 978-3-031-11644-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Popularity Prediction in MOOCs: A Case Study on Udemy