Skip to main content

A Big Data Framework for Early Identification of Dropout Students in MOOC

  • Conference paper
  • First Online:
Technology in Education. Technology-Mediated Proactive Learning (ICTE 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 559))

Included in the following conference series:

Abstract

Massive Open Online Courses (MOOC) became popular and they posted great impact to education. Students could enroll and attend any MOOC anytime and anywhere according to their interest, schedule and learning pace. However, the dropout rate of MOOC was known to be very high in practice. It is desirable to discover students who have high chance to dropout in MOOC in early stage, and the course leader could impose intervention immediately in order to reduce the dropout rate. In this paper, we proposed a framework that applies big data methods to identify the students who are likely to dropout in MOOC. Real-world data were collected for the evaluation of our proposed framework. The results demonstrated that our framework is effective and helpful.

The original version of this chapter was revised. An erratum to this chapter can be found at 10.1007/978-3-662-48978-9_27

An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-662-48978-9_27

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    www.coursera.org.

  2. 2.

    www.edx.org.

  3. 3.

    www.khanacademy.org.

  4. 4.

    https://www.coursera.org/about/community, retrieved on 1 September 2015.

  5. 5.

    We employed Weka, which is an open-source data mining tools to implement our framework. (URL: http://www.cs.waikato.ac.nz/ml/weka/).

References

  1. Alraimi, K.M., Zo, H., Ciganek, A.P.: Understanding the MOOCs continuance: the role of openness and reputation. Comput. Educ. 80, 28–38 (2015)

    Article  Google Scholar 

  2. Baker, R.S., Yacef, K.: The state of educational data mining in 2009: a review and future visions. J. Educ. Data Min. 1(1), 3–17 (2009)

    Google Scholar 

  3. Chaturvedi, S., Goldwasser, D., Daume III, H.: Predicting instructor’s intervention in mooc forums. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 1501–1511. ACL (2014)

    Google Scholar 

  4. Daniel, B., Butson, R.: Foundations of big data and analytics in higher education. In: International Conference on Analytics Driven Solutions: ICAS 2014, p. 39. Academic Conferences Limited (2014)

    Google Scholar 

  5. Freund, Y., Mason, L.: The alternating decision tree learning algorithm. In: Proceeding of the Sixteenth International Conference on Machine Learning, pp. 124–133 (1999)

    Google Scholar 

  6. Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  7. Kloft, M., Stiehler, F., Zheng, Z., Pinkwart, N.: Predicting MOOC dropout over weeks using machine learning methods. In: Proceedings of the EMNLP Workshop on Modeling Large Scale Social Interaction in Massively Open Online Courses, pp. 60–65 (2014)

    Google Scholar 

  8. Lin, F.-R., Hsieh, L.-S., Chuang, F.-T.: Discovering genres of online discussion threads via text mining. Comput. Educ. 52(2), 481–495 (2009)

    Article  Google Scholar 

  9. Margaryan, A., Bianco, M., Littlejohn, A.: Instructional quality of massive open online courses (MOOCs). Comput. Educ. 80, 77–83 (2015)

    Article  Google Scholar 

  10. MITx and HarvardX. HarvardX-MITx Person-Course Academic Year 2013 De-Identified dataset, version 2.0 (2014). http://dx.doi.org/10.7910/DVN/26147

  11. Onah, D.F.O., Sinclair, J., Boyatt, R.: Dropout rates of massive open online courses: behavioural patterns. In: Proceedings of the Sixth International Conference on Education and New Learning Technologies, pp. 5825–5834 (2014)

    Google Scholar 

  12. Rabbany, R., Takaffoli, M., Zaiane, O.: Analyzing participation of students in online courses using social network analysis techniques. In: Proceedings of Educational Data Mining, pp. 21–30 (2011)

    Google Scholar 

  13. Romero, C.: Educational data mining: a review of the state of the art. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 60(6), 601–618 (2010)

    Article  Google Scholar 

  14. Romero, C., Ventura, S., Garcia, E.: Data mining in course management systems: moodle case study and tutorial. Comput. Educ. 51(1), 368–384 (2008)

    Article  Google Scholar 

  15. Zhang, Y., Chen, M., Mao, S., Hu, L., Leung, V.: CAP: crowd activity prediction based on big data analysis. IEEE Netw. 28(4), 52–57 (2014)

    Article  Google Scholar 

Download references

Acknowledgement

This work is partially supported by the Small Research Grant (MIT/SRG10/14-15) and the Internal Research Grant (IRG 30/2014-2015) of the Hong Kong Institute of Education.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tak-Lam Wong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tang, J.K.T., Xie, H., Wong, TL. (2015). A Big Data Framework for Early Identification of Dropout Students in MOOC. In: Lam, J., Ng, K., Cheung, S., Wong, T., Li, K., Wang, F. (eds) Technology in Education. Technology-Mediated Proactive Learning. ICTE 2015. Communications in Computer and Information Science, vol 559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48978-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-48978-9_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-48977-2

  • Online ISBN: 978-3-662-48978-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics