Abstract
Massive Open Online Courses (MOOC) became popular and they posted great impact to education. Students could enroll and attend any MOOC anytime and anywhere according to their interest, schedule and learning pace. However, the dropout rate of MOOC was known to be very high in practice. It is desirable to discover students who have high chance to dropout in MOOC in early stage, and the course leader could impose intervention immediately in order to reduce the dropout rate. In this paper, we proposed a framework that applies big data methods to identify the students who are likely to dropout in MOOC. Real-world data were collected for the evaluation of our proposed framework. The results demonstrated that our framework is effective and helpful.
The original version of this chapter was revised. An erratum to this chapter can be found at 10.1007/978-3-662-48978-9_27
An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-662-48978-9_27
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
https://www.coursera.org/about/community, retrieved on 1 September 2015.
- 5.
We employed Weka, which is an open-source data mining tools to implement our framework. (URL: http://www.cs.waikato.ac.nz/ml/weka/).
References
Alraimi, K.M., Zo, H., Ciganek, A.P.: Understanding the MOOCs continuance: the role of openness and reputation. Comput. Educ. 80, 28–38 (2015)
Baker, R.S., Yacef, K.: The state of educational data mining in 2009: a review and future visions. J. Educ. Data Min. 1(1), 3–17 (2009)
Chaturvedi, S., Goldwasser, D., Daume III, H.: Predicting instructor’s intervention in mooc forums. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 1501–1511. ACL (2014)
Daniel, B., Butson, R.: Foundations of big data and analytics in higher education. In: International Conference on Analytics Driven Solutions: ICAS 2014, p. 39. Academic Conferences Limited (2014)
Freund, Y., Mason, L.: The alternating decision tree learning algorithm. In: Proceeding of the Sixteenth International Conference on Machine Learning, pp. 124–133 (1999)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2005)
Kloft, M., Stiehler, F., Zheng, Z., Pinkwart, N.: Predicting MOOC dropout over weeks using machine learning methods. In: Proceedings of the EMNLP Workshop on Modeling Large Scale Social Interaction in Massively Open Online Courses, pp. 60–65 (2014)
Lin, F.-R., Hsieh, L.-S., Chuang, F.-T.: Discovering genres of online discussion threads via text mining. Comput. Educ. 52(2), 481–495 (2009)
Margaryan, A., Bianco, M., Littlejohn, A.: Instructional quality of massive open online courses (MOOCs). Comput. Educ. 80, 77–83 (2015)
MITx and HarvardX. HarvardX-MITx Person-Course Academic Year 2013 De-Identified dataset, version 2.0 (2014). http://dx.doi.org/10.7910/DVN/26147
Onah, D.F.O., Sinclair, J., Boyatt, R.: Dropout rates of massive open online courses: behavioural patterns. In: Proceedings of the Sixth International Conference on Education and New Learning Technologies, pp. 5825–5834 (2014)
Rabbany, R., Takaffoli, M., Zaiane, O.: Analyzing participation of students in online courses using social network analysis techniques. In: Proceedings of Educational Data Mining, pp. 21–30 (2011)
Romero, C.: Educational data mining: a review of the state of the art. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 60(6), 601–618 (2010)
Romero, C., Ventura, S., Garcia, E.: Data mining in course management systems: moodle case study and tutorial. Comput. Educ. 51(1), 368–384 (2008)
Zhang, Y., Chen, M., Mao, S., Hu, L., Leung, V.: CAP: crowd activity prediction based on big data analysis. IEEE Netw. 28(4), 52–57 (2014)
Acknowledgement
This work is partially supported by the Small Research Grant (MIT/SRG10/14-15) and the Internal Research Grant (IRG 30/2014-2015) of the Hong Kong Institute of Education.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tang, J.K.T., Xie, H., Wong, TL. (2015). A Big Data Framework for Early Identification of Dropout Students in MOOC. In: Lam, J., Ng, K., Cheung, S., Wong, T., Li, K., Wang, F. (eds) Technology in Education. Technology-Mediated Proactive Learning. ICTE 2015. Communications in Computer and Information Science, vol 559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48978-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-662-48978-9_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48977-2
Online ISBN: 978-3-662-48978-9
eBook Packages: Computer ScienceComputer Science (R0)