Abstract
We present an adaptive learning Intelligent Tutoring System, which uses model-based reinforcement learning in the form of contextual bandits to assign learning activities to students. The model is trained on the trajectories of thousands of students in order to maximize their exercise completion rates and continues to learn online, automatically adjusting itself to new activities. A randomized controlled trial with students shows that our model leads to superior completion rates and significantly improved student engagement when compared to other approaches. Our approach is fully-automated unlocking new opportunities for learning experience personalization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Albacete, P., Jordan, P., Katz, S., Chounta, I.-A., McLaren, B.M.: The impact of student model updates on contingent scaffolding in a natural-language tutoring system. In: Isotani, S., Millán, E., Ogan, A., Hastings, P., McLaren, B., Luckin, R. (eds.) AIED 2019. LNCS (LNAI), vol. 11625, pp. 37–47. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-23204-7_4
Anania, J.: The influence of instructional conditions on student learning and achievement. Eval. Educ. Int. Rev. Ser. 7(1), 3–76 (1983)
Auer, P.: Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res. 3, 397–422 (2002)
Bloom, B.S.: The 2 sigma problem: the search for methods of group instruction as effective as one-to-one tutoring. Educ. Res. 13(6), 4–16 (1984)
Brunskill, E., Mu, T., Goel, K., Bragg, J.: Automatic curriculum generation applied to teaching novices a short Bach Piano segment. In: NeurIPS Demonstrations (2018)
Chi, M., VanLehn, K., Litman, D., Jordan, P.: Empirically evaluating the application of reinforcement learning to the induction of effective and adaptive pedagogical strategies. User Model. User-Adap. Inter. 21(1), 137–180 (2011)
Doroudi, S., Aleven, V., Brunskill, E.: Where’s the reward? Int. J. Artifi. Intell. Educ. 29(4), 568–620 (2019)
Kulik, J.A., Fletcher, J.: Effectiveness of intelligent tutoring systems: a meta-analytic review. Rev. Educ. Res. 86(1), 42–78 (2016)
Lan, A.S., Baraniuk, R.G.: A contextual bandits framework for personalized learning action selection. In: EDM, pp. 424–429 (2016)
Li, L., Chu, W., Langford, J., Schapire, R.E.: A contextual-bandit approach to personalized news article recommendation. In: Proceedings of the 19th International Conference on World Wide Web - WWW 2010 (2010)
Liu, Y.E., Mandel, T., Brunskill, E., Popovic, Z.: Trading off scientific knowledge and user learning with multi-armed bandits. In: EDM (2014)
Lopes, M., Clement, B., Roy, D., Oudeyer, P.: Multi-armed bandits for intelligent tutoring systems (2013). http://arxiv.org/abs/1310.3174
Mu, T., Wang, S., Andersen, E., Brunskill, E.: Combining adaptivity with progression ordering for intelligent tutoring systems. In: Proceedings of the Fifth Annual ACM Conference on Learning at Scale, pp. 1–4 (2018)
Nye, B.D., Graesser, A.C., Hu, X.: AutoTutor and family: a review of 17 years of natural language tutoring. IJAIED 24(4), 427–469 (2014)
Rowe, J.P., Lester, J.C.: Improving student problem solving in narrative-centered learning environments: a modular reinforcement learning Framework. In: Conati, C., Heffernan, N., Mitrovic, A., Verdejo, M.F. (eds.) AIED 2015. LNCS (LNAI), vol. 9112, pp. 419–428. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19773-9_42
Rus, V., Stefanescu, D., Niraula, N., Graesser, A.C.: DeepTutor: towards macro-and micro-adaptive conversational intelligent tutoring at scale. In: Proceedings of the First ACM Conference on Learning@ Scale Conference, pp. 209–210 (2014)
Serban, I.V., et al.: A large-Scale, open-domain, mixed-interface dialogue-based ITS for STEM. In: Bittencourt, I.I., Cukurova, M., Muldner, K., Luckin, R., Millán, E. (eds.) AIED 2020. LNCS (LNAI), vol. 12164, pp. 387–392. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52240-7_70
VanLehn, K.: The relative effectiveness of human tutoring, intelligent tutoring systems, and other tutoring systems. Educ. Psychol. 46(4), 197–221 (2011)
Whitehill, J., Movellan, J.: Approximately optimal teaching of approximately optimal learners. IEEE Trans. Learn. Technol. 11(2), 152–164 (2017)
Williams, J.J., Rafferty, A.N., Tingley, D., Ang, A., Lasecki, W.S., Kim, J.: Enhancing Online Problems Through Instructor-Centered Tools for Randomized Experiments, p. 1–12. Association for Computing Machinery, New York (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Belfer, R., Kochmar, E., Serban, I.V. (2022). Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits. In: Rodrigo, M.M., Matsuda, N., Cristea, A.I., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2022. Lecture Notes in Computer Science, vol 13355. Springer, Cham. https://doi.org/10.1007/978-3-031-11644-5_74
Download citation
DOI: https://doi.org/10.1007/978-3-031-11644-5_74
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11643-8
Online ISBN: 978-3-031-11644-5
eBook Packages: Computer ScienceComputer Science (R0)