Abstract
Educational data mining (EDM) is an applied field of research that combines data mining, machine learning, and statistics in the educational setting at, but not limited to, schools, universities and intelligent tutoring systems, and MOOCs. Methods are developing and improving for analyzing the vast amount of data available in education to better understand learning behaviors and pedagogical outcomes by applying theories of educational psychology in order to improve the learning environment.
In this chapter, the authors explore the significance of data mining in the online education setting and how it can improve the student learning experience. We first review some commonly used data mining techniques that have been applied to education data, such as classification, clustering, and association rule mining. The new developments in knowledge tracing for modeling student data are also briefly described. We follow with an elaboration on the goals of educational data mining, for example, using the data related to student performance in an MOOC setting, some goals for data mining could be to predict how well the student will do on a new class activity or the probability of the student dropping the course. And we conclude with a look at how EDM applies to online business education and how learning systems can adapt to findings of this data analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agarwal, S., Pandey, G. N., & Tiwari, M. D. (2012). Data mining in education: Data classification and decision tree approach. International Journal of e-Education, e-Business, e-Management and e-Learning, 2(2), 140–144.
Al-Barrak, M. A., & Al-Razgan, M. (2016). Predicting students final GPA using decision trees: A case study. International Journal of Information and Education Technology, 6(7), 528–533.
Baepler, P., & Murdoch, C. J. (2010). Academic analytics and data mining in higher education. International Journal for the Scholarship of Teaching and Learning, 4(2), 17–25.
Baker, R. S. J. D. (2010). Data mining for education. International encyclopedia of education, 7, 112–118.
Baker, R. S., & Yacef, K. (2009). The state of educational data mining in 2009: A review and future visions. JEDM-Journal of Educational Data Mining, 1(1), 3–17.
Beck, J., Ostrow, K. & Wang, Y. (2016). Students vs. skills: Partitioning variance explained in learner models. Student modeling from different aspects, pp. 2–9.
Bienkowski, M., Feng, M., & Means, B. (2012). Enhancing teaching and learning through educational data mining and learning analytics: An issue brief. US Department of Education, Office of Educational Technology, pp. 1–57.
Brooks, C. (2013). A data-assisted approach to supporting instructional interventions in technology enhanced learning environments (PhD dissertation). University of Saskatchewan, Saskatoon.
Chang, H. M., Kuo, T. M. L., Chen, S. C., Li, C. A., Huang, Y. W., Cheng, Y. C., et al. (2016, July). Developing a data-driven learning interest recommendation system to promoting self-paced learning on MOOCs. In Advanced Learning Technologies (ICALT), 2016 IEEE 16th International Conference on (pp. 23–25). IEEE.
Corbett, A. T., & Anderson, J. R. (1995). Knowledge tracing: Modeling the acquisition of procedural knowledge. User Modeling and User-Adapted Interaction, 4(4), 253–278.
Creswell, J. W. (2014). Educational research: Planning, conducting, and evaluating quantitative and qualitative research (5th ed.). Saddle River, NJ: Prentice Hall.
David, Y. B., Segal, A., & Gal, Y. A. K. (2016, April). Sequencing educational content in classrooms using Bayesian knowledge tracing. In Proceedings of the sixth international conference on Learning Analytics & Knowledge (pp. 354–363). ACM.
Dutt, A., Aghabozrgi, S., Ismail, M. A. B., & Mahroeian, H. (2015). Clustering algorithms applied in educational data mining. International Journal of Information and Electronics Engineering, 5(2), 112–116.
Fanfarelli, J., & Vie, S. (2015). Medulla: A 2D sidescrolling platformer game that teaches basic brain structure and function. Well Played, 4(2), 7–29.
García, E., Romero, C., Ventura, S., & Calders, T. (2007). Drawbacks and solutions of applying association rule mining in learning management systems. In Proceedings of the international workshop on Applying Data Mining in e-Learning (ADML 2007), Crete, Greece, pp. 13–22.
Gers, F. A., Schmidhuber, J., & Cummins, F. (2000). Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10), 2451–2471.
Gong, Y., Rai, D., Beck, J., & Heffernan, N. (2009). Does self-discipline impact students’ knowledge and learning? In Proceedings of the 2nd international conference on Educational Data Mining, pp. 61–70.
Gong, Y., Wang, Y., Beck, J. (2016). How long must we spin our wheels? Analysis of student time and classifier inaccuracy. Student modeling from different aspects, pp. 32–38.
Gutierrez, F., & Atkinson, J. (2011). Adaptive feedback selection for intelligent tutoring systems. Expert Systems with Applications, 38(5), 6146–6152.
Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. San Francisco: Morgan Kaufmann Publishers.
Hernández-García, Á., González-González, I., Jiménez Zarco, A. I., & Chaparro-Peláez, J. (2016). Visualizations of online course interactions for social network learning analytics. International Journal of Emerging Technologies in Learning, 11(7), 6–15.
Hochreiter, S. (1998). The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(02), 107–116.
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.
Kapa, E. (2007). Transfer from structured to open-ended problem solving in a computerized metacognitive environment. Learning and Instruction, 17(6), 688–707.
Kelly, K., Wang, Y., Thompson, T., &Heffernan, N. (2015). Defining mastery: Knowledge tracing versus N-consecutive correct responses. In The proceedings of the 8th international conference on Educational Data Mining (pp. 630–631). ACM.
Khajah, M., Lindsey, R. V., & Mozer, M.C. (2016). How deep is knowledge tracing?. In The proceedings of the 9th International Conference on Educational Data Mining (pp. 94–101). ACM.
Lee, Y. J. (2016). Predicting students’ problem solving performance using support vector machine. Journal of Data Science, 14, 231–244.
Liang, J., Yang, J., Wu, Y., Li, C., & Zheng, L. (2016, April). Big data application in education: Dropout prediction in Edx MOOCs. In Multimedia Big Data (BigMM), 2016 IEEE second international conference on (pp. 440–443). IEEE.
Ma, Y., Liu, B., Wong, C. K., Yu, P. S., & Lee, S. M. (2000). Targeting the right students using data mining. In Proceedings of the 6th ACM SIGKDD international conference on Knowledge Discovery and Data Mining, KDD’00, pp. 457–464, New York, USA, 2000. ACM.
MacHardy, Z., & Pardos, Z. A. (2015). Evaluating the relevance of educational videos using BKT and big data. In the proceedings of the 8th international conference on Educational Data Mining (pp. 424–427). ACM.
Madhyastha, T., & Tanimoto, S. (2009). Student Consistency and implications for feedback in online assessment systems. In Proceedings of the 2nd international conference on Educational Data Mining, pp 81–90.
Marshall, S. (2014). Exploring the ethical implications of MOOCs. Distance Education, 35(2), 250–262.
Martin, F., & Ndoye, A. (2016). Using learning analytics to assess student learning in online courses. Journal of University Teaching & Learning Practice, 13(3), 1–20.
Min, W., Vail, A. K., Frankosky, M. H., Wiggins, J. B., Boyer, K. E., Wiebe, E. N., et al. (2016). Predicting dialogue acts for intelligent virtual agents with multimodal student interaction data. In Proceedings of the 9th international conference on Educational Data Mining, pp. 454–459.
Mitrovic, A., & Ohlsson, S. (1999). Evaluation of a constraint-based tutor for a database language. International Journal of Artificial Intelligence in Education, 10, 238–256.
Mitrovic, A., Ohlsson, S., & Barrow, D. K. (2013). The effect of positive feedback in a constraint-based intelligent tutoring system. Computers & Education, 60(1), 264–272.
Montague, M. (1992). The effects of cognitive and metacognitive strategy instruction on the mathematical problem solving of middle school students with learning disabilities. Journal of Learning Disabilities, 25(4), 230–248.
Pardo, A., & Siemens, G. (2014). Ethical and privacy principles for learning analytics. British Journal of Educational Technology, 45(3), 438–450.
Pavlik, P. I., Cen, H., & Koedinger, K. R. (2009, July). Performance factors analysis – A new alternative to knowledge tracing. In Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling (pp. 531–538). IOS Press.
Piech, C., Bassen, J., Huang, J., Ganguli, S., Sahami, M., Guibas, L. J., & Sohl-Dickstein, J. (2015). Deep knowledge tracing. In Advances in neural information processing systems (pp. 505–513).
Ritter, S., Harris, T. K., Nixon, T., Dickison, D., Murray, R. C., & Towle, B. (2009). Reducing the knowledge tracing space. International Working Group on Educational Data Mining.
Romero, C., & Ventura, S. (2007). Educational data mining: A survey from 1995 to 2005. Expert Systems with Applications, 33(1), 135–146.
Romero, C., & Ventura, S. (2010). Educational data mining: A review of the state of the art. IEEE Transactions on Systems, Man & Cybernetics: Part C – Applications & Reviews, 40(6), 601–618.
Rus, V., Graesser, A., Moldovan, C., & Niraula, N. (2012). Automatic discovery of speech act categories in educational games. In The proceedings of the 5th International Conference on Educational Data Mining (pp. 25–32). ACM.
Seo, Y. J., & Bryant, D. (2012). Multimedia CAI program for students with mathematics difficulties. Remedial and Special Education, 33(4), 217–225.
Siemens, G. (2010). What are learning analytics? Available http://www.elearnspace.org/blog/2010/08/25/what-are-learning-analytics/. Accessed 8 Jan 2017.
Sweeney, M., Lester, J., & Rangwala, H. (2015, October). Next-term student grade prediction. In Big data (Big Data), 2015 IEEE international conference on (pp. 970–975). IEEE.
Sweeney, M., Rangwala, H., Lester, J., & Johri, A. (2016). Next-term student performance prediction: A recommender systems approach. JEDM-Journal of Educational Data Mining, 8(1), 22–51.
Wang, F., & Chen, L. (2016). A nonlinear state space model for identifying at-risk students in open online courses. In The proceedings of the 9th international conference on Educational Data Mining (pp. 527–532). ACM.
Wang, S., Walker, E., & Wylie, R. (2016). Analyzing frequent sequential patterns of learning behaviors in concept mapping. EDM.
Winne, P. H., & Baker, R. S. (2013). The potentials of educational data mining for researching metacognition, motivation and self-regulated learning. JEDM-Journal of Educational Data Mining, 5(1), 1–8.
Xiong, X., Zhao, S., Van Inwegen, E. G., & Beck, J. E. (2016). Going deeper with deep knowledge tracing. In Proceedings of the 9th international conference on Educational Data Mining (EDM 2016) (pp. 545–550).
Xu, B., Recker, M., Qi, X., Flann, N., & Ye, L. (2013). Clustering educational digital library usage data: Comparisons of latent class analysis and K-means algorithms. Journal of Educational Data Mining, 5(2), 38–68.
Yudelson, M. V., Medvedeva, O. P., & Crowley, R. S. (2008). A multifactor approach to student model evaluation. User Modeling and User-Adapted Interaction, 18(4), 349–382.
Zhang, Y., Shah, R., & Chi, M. (2016). Deep learning+ student modeling+ clustering: A recipe for effective automatic short answer grading. In The proceedings of the 9th international conference on Educational Data Mining (pp. 562–567). ACM.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Khare, K., Lam, H., Khare, A. (2018). Educational Data Mining (EDM): Researching Impact on Online Business Education. In: Khare, A., Hurst, D. (eds) On the Line. Springer, Cham. https://doi.org/10.1007/978-3-319-62776-2_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-62776-2_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62775-5
Online ISBN: 978-3-319-62776-2
eBook Packages: Business and ManagementBusiness and Management (R0)