Abstract
Classroom observation methods are fundamental tools for improving the quality of education and students’ academic achievement. However, they traditionally require participation of trained observers, making them expensive, prone to rater bias and time consuming. Hence, to address these challenges we present a cost-effective and non-intrusive method that automatically detects different teaching practices. In particular, we extracted acoustic features and transcriptions from teachers’ talk recordings to train a multimodal learning model called Teacher Activity Recognizer from Transcriptions and Audio (TARTA), which detects three categories derived from the Classroom Observation Protocol for Undergraduate STEM (COPUS), namely Presenting, Administration, and Guiding. We found that by combining acoustic features and transcriptions, our model outperforms separate acoustic- and transcription-based models at the task of predicting teachers’ activities along the lessons. In fact, TARTA can predict with high accuracy and discriminative power the presence of these teaching practices, achieving over 88\(\%\) of accuracy and 92% AUC for all three categories. Our work presents improvements with respect to previous studies since (1) we focus on classifying what teachers do according to a validated protocol instead of discerning whether they or their students are speaking and (2) our model does not rely on expensive or third party equipment, making it easier to scale to large volumes of lessons. This approach represents a useful tool for stakeholders and researchers who intend to analyze teaching practices on a large scale, but also for teachers to receive effective and continuous feedback.
Support from ANID/ PIA/ Basal Funds for Centers of Excellence FB0003 and ANID-FONDECYT grant N\(^\circ \) 3180590 are gratefully acknowledged.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Speech-to-text: automatic speech recognition; google cloud, https://cloud.google.com/speech/
Akiha, K., et al.: What types of instructional shifts do students experience? Investigating active learning in science, technology, engineering, and math classes across key transition points from middle school to the university level. Front. Educ. 2, 68 (2018)
Brian, K.: OECD Insights Human Capital How what you know shapes your life: how what you know shapes your life. OECD publishing (2007)
Canete, J., Chaperon, G., Fuentes, R., Pérez, J.: Spanish pre-trained bert model and evaluation data. PML4DC at ICLR 2020 (2020)
Cosbey, R., Wusterbarth, A., Hutchinson, B.: Deep learning for classroom activity detection from audio. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3727–3731. IEEE (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Donnelly, P.J., Blanchard, N., Olney, A.M., Kelly, S., Nystrand, M., D’Mello, S.K.: Words matter: automatic detection of teacher questions in live classroom discourse using linguistics, acoustics, and context. In: Proceedings of the Seventh International Learning Analytics & Knowledge Conference, pp. 218–227 (2017)
Ford, M., Baer, C.T., Xu, D., Yapanel, U., Gray, S.: The lenatm language environment analysis system (2008)
Hill, H., Grossman, P.: Learning from teacher observations: challenges and opportunities posed by new teacher evaluation systems. Harv. Educ. Rev. 83(2), 371–384 (2013)
Hill, H.C., Charalambous, C.Y., Kraft, M.A.: When rater reliability is not enough: teacher observation systems and a case for the generalizability study. Educ. Res. 41(2), 56–64 (2012)
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning, pp. 448–456. PMLR (2015)
James, A., et al.: Automated classification of classroom climate by audio analysis. In: D’Haro, L.F., Banchs, R.E., Li, H. (eds.) 9th International Workshop on Spoken Dialogue System Technology. LNEE, vol. 579, pp. 41–49. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-9443-0_4
Kelly, S., Olney, A.M., Donnelly, P., Nystrand, M., D’Mello, S.K.: Automatically measuring question authenticity in real-world classrooms. Educ. Res. 47(7), 451–464 (2018)
Kohavi, R., et al.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Ijcai, vol. 14, pp. 1137–1145. Montreal, Canada (1995)
Kronholm, H., Caballero, D., Araya, R., Viiri, J.: A smartphone application for ASR and observation of classroom interactions. In: Finnish Mathematics and Science Education Research Association (FMSERA) Annual Symposium (2016)
Li, H., et al.: Multimodal learning for classroom activity detection. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 9234–9238. IEEE (2020)
McDonald, M., Kazemi, E., Kavanagh, S.S.: Core practices and pedagogies of teacher education: a call for a common language and collective activity. J. Teach. Educ. 64(5), 378–386 (2013)
McIntyre, D.J.: Teacher evaluation and the observer effect. NASSP Bull. 64(434), 36–40 (1980)
Owens, M.T., et al.: Classroom sound can be used to classify teaching practices in college science courses. Proc. Natl. Acad. Sci. 114(12), 3085–3090 (2017)
Samph, T.: Observer effects on teacher behavior (1968)
Schlotterbeck, D., Uribe, P., Araya, R., Jimenez, A., Caballero, D.: What classroom audio tells about teaching: a cost-effective approach for detection of teaching practices using spectral audio features. In: LAK21: 11th International Learning Analytics and Knowledge Conference, pp. 132–140 (2021)
Smith, M.K., Jones, F.H., Gilbert, S.L., Wieman, C.E.: The classroom observation protocol for undergraduate stem (COPUS): a new instrument to characterize university stem classroom practices. CBE-Life Sci. Educ. 12(4), 618–627 (2013)
Smith, M.K., Vinson, E.L., Smith, J.A., Lewin, J.D., Stetzer, M.R.: A campus-wide study of stem courses: new perspectives on teaching practices and perceptions. CBE-Life Sci. Educ. 13(4), 624–635 (2014)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
Wang, Z., Pan, X., Miller, K.F., Cortina, K.S.: Automatic classification of activities in classroom discourse. Comput. Educ. 78, 115–123 (2014)
Wolf, T., et al.: Huggingface’s transformers: State-of-the-art natural language processing (2019). arXiv preprint arXiv:1910.03771
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Schlotterbeck, D., Uribe, P., Jiménez, A., Araya, R., van der Molen Moris, J., Caballero, D. (2021). TARTA: Teacher Activity Recognizer from Transcriptions and Audio. In: Roll, I., McNamara, D., Sosnovsky, S., Luckin, R., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2021. Lecture Notes in Computer Science(), vol 12748. Springer, Cham. https://doi.org/10.1007/978-3-030-78292-4_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-78292-4_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78291-7
Online ISBN: 978-3-030-78292-4
eBook Packages: Computer ScienceComputer Science (R0)