Skip to main content

Advertisement

Log in

Deciphering the attributes of student retention in massive open online courses using data mining techniques

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

Aimed at a massive outreach and open access education, Massive Open Online Courses (MOOC) has evolved incredibly engaging millions of learners’ over the years. These courses provide an opportunity for learning analytics with respect to the diversity in learning activity. Inspite of its growth, high dropout rate of the learners’, it is examined to be a paramount factor that may obstruct the development of the e-learning platforms. Fabricating on the existing efforts of retaining learners’ engagement prior to learning, the study explores to decipher the attributes of student retention in e- learning. The study proposes a clear rationale of significant attributes using classification algorithms (Decision Tree) in order to improve course design and delivery for different MOOC providers and learners’. Using the three MOOC datasets, this research work analyses the approach and results of applying the data mining techniques to online learners’, based on their in-course behaviour. Finally, it predicts the attributes that lead to minimise attrition rate and analyse the different cohort behaviour and its impacts for dropouts using data mining technique. It focuses to build a more integrated environment for these learners’.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Adamopoulos, P. (2013). What makes a great MOOC? An interdisciplinary analysis of student retention in online courses.

  • Agarwal, S. (2013). Data mining: Data mining concepts and techniques. In Machine Intelligence and Research Advancement (ICMIRA), 2013 International Conference on (pp. 203–207). IEEE.

  • Al-Shabandar, R., Hussain, A., Laws, A., Keight, R., Lunn, J., & Radi, N. (2017). Machine Learning approaches to predict learning outcomes in Massive open online courses. In Neural Networks (IJCNN), 2017 International Joint Conference on (pp. 713–720). IEEE.

  • Arora, S., Goel, M., Sabitha, A. S., & Mehrotra, D. (2017). Learner groups in massive open online courses. American Journal of Distance Education, 31(2), 80–97.

    Article  Google Scholar 

  • Bassi, R., Daradoumis, T., Xhafa, F., Caballé, S., & Sula, A. (2014). Software agents in large scale open e-learning: a critical component for the future of massive online courses (MOOCs). In Intelligent Networking and Collaborative Systems (INCoS), 2014 International Conference on (pp. 184–188). IEEE.

  • Bates, T. (2013). Look back in anger? A review of online learning in 2013.

  • Bharara, S., Sabitha, S., & Bansal, A. (2017). Application of learning analytics using clustering data Mining for Students’ disposition analysis. Education and Information Technologies, 1–28.

  • Bharara, S., Sabitha, S., & Bansal, A. (2018). Application of learning analytics using clustering data Mining for Students’ disposition analysis. Education and Information Technologies, 23(2), 957–984.

    Article  Google Scholar 

  • Boyer, S., & Veeramachaneni, K. (2015). Transfer learning for predictive models in massive open online courses. In International Conference on Artificial Intelligence in Education (pp. 54–63). Springer, Cham.

  • Castro, E. G., & Tsuzuki, M. S. (2015). Churn prediction in online games using players’ login records: A frequency analysis approach. IEEE Transactions on Computational Intelligence and AI and Games, 7(3), 255–265.

    Article  Google Scholar 

  • Chen, Y., Chen, Q., Zhao, M., Boyer, S., Veeramachaneni, K., & Qu, H. (2016). DropoutSeer: Visualizing learning patterns in Massive Open Online Courses for dropout reasoning and prediction. In Visual Analytics Science and Technology (VAST), 2016 IEEE Conference on (pp. 111–120). IEEE.

  • Dataverse. (2014). HarvardX-MITx Person-Course Academic Year 2013 De-identified Dataset, Version 2.0, https://dataverse.harvard.edu/file.xhtml?fileId=2468954&version=RELEASED&version=.0.

  • Gallén, R.C., & Caro, E.T. (2017). An exploratory analysis of why a person enrolls in a massive open online course within MOOCKnowledge data collection. In Global Engineering Education Conference (EDUCON), 2017 IEEE (pp. 1600–1605). IEEE.

  • Gamage, D., Fernando, S., & Perera, I. (2015) August. Factors leading to an effective MOOC from participiants perspective. In Ubi-Media Computing (UMEDIA), 2015 8th International Conference on (pp. 230–235). IEEE.

  • Hegyesi, F., Kártyás, G., & Gáti, J. (2017). Answers to the 21st century challenges at a university with technical training. In Intelligent Systems and Informatics (SISY), 2017 IEEE 15th International Symposium on (pp. 000365–000368). IEEE.

  • Huang, N.F., Hsu, H.H., Chen, S.C., Lee, C.A., Huang, Y.W., Ou, P.W., & Tzeng, J.W. (2017). VideoMark: A video-based learning analytic technique for MOOCs. In Big Data Analysis (ICBDA), 2017 IEEE 2nd International Conference on(pp. 753–757). IEEE.

  • Kaggle. (2017a). big_student_clear_third_version, https://www.kaggle.com/kanikanarang94/mooc-dataset/data.

  • Kaggle. (2017b). cs_mitx, MOOC Dataset, https://www.kaggle.com/chellaindu/mooc-dataset/data.

  • Kaveri, A., Gunasekar, S., Gupta, D., & Pratap, M. (2016). Decoding Engagement in MOOCs: An Indian Learner Perspective. In Technology for Education (T4E), 2016 IEEE Eighth International Conference on (pp. 100–105). IEEE.

  • Khalil, H., & Ebner, M. (2014). MOOCs completion rates and possible methods to improve retention-a literature review. In EdMedia: World Conference on Educational Media and Technology (pp. 1305–1313). Association for the Advancement of Computing in Education (AACE).

  • Kloft, M., Stiehler, F., Zheng, Z. & Pinkwart, N. (2014). Predicting MOOC dropout over weeks using machine learning methods. In Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs (pp. 60–65).

  • Liyanagunawardena, T.R., Parslow, P., & Williams, S. (2014). Dropout: MOOC participants’ perspective.

  • Machado, N.L., & Ruiz, D.D. (2017). Customer: A novel customer churn prediction method based on mobile application usage. In Wireless Communications and Mobile Computing Conference (IWCMC), 2017 13th International (pp. 2146–2151). IEEE.

  • Mulik, S., Yajnik, N., & Godse, M. (2016). Determinants of acceptance of massive open online courses. In Technology for Education (T4E), 2016 IEEE Eighth International Conference on (pp. 124–127). IEEE.

  • Onah, D.F., Sinclair, J., & Boyatt, R. (2014). Dropout rates of massive open online courses: behavioural patterns. EDULEARN14 proceedings (pp. 5825–5834).

  • Rodriguez, C. O. (2012). MOOCs and the AI-Stanford like courses: Two successful and distinct course formats for massive open online courses. European Journal of Open, Distance and E-Learning, 15(2).

  • Rosé, C.P., Carlson, R., Yang, D., Wen, M., Resnick, L., Goldman, P., & Sherer, J. (2014). Social factors that contribute to attrition in MOOCs. In Proceedings of the first ACM conference on Learning@ scale conference (pp. 197–198). ACM.

  • Sabitha, A. S., Mehrotra, D., Bansal, A., & Sharma, B. K. (2016). A naive bayes approach for converging learning objects with open educational resources. Education and Information Technologies, 21(6), 1753–1767.

    Article  Google Scholar 

  • Sandanayake, T.C., & Madurapperuma, A.P. (2013). Computational model for affective e-Learning: Developing a model for recognising E-Learner's emotions. In Innovation and Technology in Education (MITE), 2013 IEEE International Conference in MOOC (pp. 174–179). IEEE.

  • Schaffer, J., Huynh, B., O'Donovan, J., Höllerer, T., Xia, Y., & Lin, S. (2016). An analysis of student behavior in two massive open online courses. In Advances in Social Networks Analysis and Mining (ASONAM), 2016 IEEE/ACM International Conference on (pp. 380–385). IEEE.

  • Shah, D. (2015). By the numbers: MOOCs in 2015. Class Central.

  • Sharkey, M., & Sanders, R. (2014). A process for predicting MOOC attrition. In Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs (pp. 50–54).

  • Shen, L., Wang, M., & Shen, R. (2009). Affective e-learning: Using “emotional” data to improve learning in pervasive learning environment. Journal of Educational Technology & Society, 12(2), 176.

    Google Scholar 

  • Shi, C., Fu, S., Chen, Q., & Qu, H. (2015). VisMOOC: Visualizing video clickstream data from massive open online courses. In Visualization Symposium (PacificVis), 2015 IEEE Pacific (pp. 159–166). IEEE.

  • Siemens, G. (2005). Connectivism: A learning theory for the digital age. International Journal of Instructional Technology and Distance Learning, 2(1), 3–10.

    Google Scholar 

  • Sooryanarayan, D.G., & Gupta, D. (2015). Impact of learner motivation on mooc preferences: Transfer vs. made moocs. In Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on (pp. 929–934). IEEE.

  • Sunar, A., White, S., Abdullah, N., & Davis, H. (2016). How learners’ interactions sustain engagement: a MOOC case study. IEEE Transactions on Learning Technologies.

  • Wu, Y., Pitipornvivat, N., Zhao, J., Yang, S., Huang, G., & Qu, H. (2016). Egoslider: visual analysis of egocentric network evolution. IEEE Transactions on Visualization and Computer Graphics, 22(1), 260–269.

    Article  Google Scholar 

  • Yousef, A.M.F., Chatti, M.A., Schroeder, U., & Wosnitza, M. (2014). What drives a successful MOOC? An empirical examination of criteria to assure design quality of MOOCs. In Advanced Learning Technologies (ICALT), 2014 IEEE 14th International Conference on (pp. 44–48). IEEE.

  • Zhou, N., Gifford, W.M., Yan, J., & Li, H. (2016). End-to-end solution with clustering method for attrition analysis. In Services Computing (SCC), 2016 IEEE International Conference on (pp. 363–370). IEEE.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shivangi Gupta.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gupta, S., Sabitha, A.S. Deciphering the attributes of student retention in massive open online courses using data mining techniques. Educ Inf Technol 24, 1973–1994 (2019). https://doi.org/10.1007/s10639-018-9829-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-018-9829-9

Keywords

Navigation