TARTA: Teacher Activity Recognizer from Transcriptions and Audio

Schlotterbeck, Danner; Uribe, Pablo; Jiménez, Abelino; Araya, Roberto; van der Molen Moris, Johan; Caballero, Daniela

doi:10.1007/978-3-030-78292-4_30

Danner Schlotterbeck¹³,
Pablo Uribe¹³,
Abelino Jiménez¹³,
Roberto Araya¹³,
Johan van der Molen Moris¹³ &
…
Daniela Caballero¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12748))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

3141 Accesses
5 Citations

Abstract

Classroom observation methods are fundamental tools for improving the quality of education and students’ academic achievement. However, they traditionally require participation of trained observers, making them expensive, prone to rater bias and time consuming. Hence, to address these challenges we present a cost-effective and non-intrusive method that automatically detects different teaching practices. In particular, we extracted acoustic features and transcriptions from teachers’ talk recordings to train a multimodal learning model called Teacher Activity Recognizer from Transcriptions and Audio (TARTA), which detects three categories derived from the Classroom Observation Protocol for Undergraduate STEM (COPUS), namely Presenting, Administration, and Guiding. We found that by combining acoustic features and transcriptions, our model outperforms separate acoustic- and transcription-based models at the task of predicting teachers’ activities along the lessons. In fact, TARTA can predict with high accuracy and discriminative power the presence of these teaching practices, achieving over 88\(\%\) of accuracy and 92% AUC for all three categories. Our work presents improvements with respect to previous studies since (1) we focus on classifying what teachers do according to a validated protocol instead of discerning whether they or their students are speaking and (2) our model does not rely on expensive or third party equipment, making it easier to scale to large volumes of lessons. This approach represents a useful tool for stakeholders and researchers who intend to analyze teaching practices on a large scale, but also for teachers to receive effective and continuous feedback.

Support from ANID/ PIA/ Basal Funds for Centers of Excellence FB0003 and ANID-FONDECYT grant N\(^\circ \) 3180590 are gratefully acknowledged.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recognition of Teaching Activities from University Lecture Transcriptions

Toward an Automatic Speech Classifier for the Teacher

“Teacher, Can You Say It Again?" Improving Automatic Speech Recognition Performance over Classroom Environments with Limited Data

References

Speech-to-text: automatic speech recognition; google cloud, https://cloud.google.com/speech/
Akiha, K., et al.: What types of instructional shifts do students experience? Investigating active learning in science, technology, engineering, and math classes across key transition points from middle school to the university level. Front. Educ. 2, 68 (2018)
Google Scholar
Brian, K.: OECD Insights Human Capital How what you know shapes your life: how what you know shapes your life. OECD publishing (2007)
Google Scholar
Canete, J., Chaperon, G., Fuentes, R., Pérez, J.: Spanish pre-trained bert model and evaluation data. PML4DC at ICLR 2020 (2020)
Google Scholar
Cosbey, R., Wusterbarth, A., Hutchinson, B.: Deep learning for classroom activity detection from audio. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3727–3731. IEEE (2019)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Donnelly, P.J., Blanchard, N., Olney, A.M., Kelly, S., Nystrand, M., D’Mello, S.K.: Words matter: automatic detection of teacher questions in live classroom discourse using linguistics, acoustics, and context. In: Proceedings of the Seventh International Learning Analytics & Knowledge Conference, pp. 218–227 (2017)
Google Scholar
Ford, M., Baer, C.T., Xu, D., Yapanel, U., Gray, S.: The lenatm language environment analysis system (2008)
Google Scholar
Hill, H., Grossman, P.: Learning from teacher observations: challenges and opportunities posed by new teacher evaluation systems. Harv. Educ. Rev. 83(2), 371–384 (2013)
Article Google Scholar
Hill, H.C., Charalambous, C.Y., Kraft, M.A.: When rater reliability is not enough: teacher observation systems and a case for the generalizability study. Educ. Res. 41(2), 56–64 (2012)
Article Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning, pp. 448–456. PMLR (2015)
Google Scholar
James, A., et al.: Automated classification of classroom climate by audio analysis. In: D’Haro, L.F., Banchs, R.E., Li, H. (eds.) 9th International Workshop on Spoken Dialogue System Technology. LNEE, vol. 579, pp. 41–49. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-9443-0_4
Chapter Google Scholar
Kelly, S., Olney, A.M., Donnelly, P., Nystrand, M., D’Mello, S.K.: Automatically measuring question authenticity in real-world classrooms. Educ. Res. 47(7), 451–464 (2018)
Article Google Scholar
Kohavi, R., et al.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Ijcai, vol. 14, pp. 1137–1145. Montreal, Canada (1995)
Google Scholar
Kronholm, H., Caballero, D., Araya, R., Viiri, J.: A smartphone application for ASR and observation of classroom interactions. In: Finnish Mathematics and Science Education Research Association (FMSERA) Annual Symposium (2016)
Google Scholar
Li, H., et al.: Multimodal learning for classroom activity detection. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 9234–9238. IEEE (2020)
Google Scholar
McDonald, M., Kazemi, E., Kavanagh, S.S.: Core practices and pedagogies of teacher education: a call for a common language and collective activity. J. Teach. Educ. 64(5), 378–386 (2013)
Article Google Scholar
McIntyre, D.J.: Teacher evaluation and the observer effect. NASSP Bull. 64(434), 36–40 (1980)
Article Google Scholar
Owens, M.T., et al.: Classroom sound can be used to classify teaching practices in college science courses. Proc. Natl. Acad. Sci. 114(12), 3085–3090 (2017)
Article Google Scholar
Samph, T.: Observer effects on teacher behavior (1968)
Google Scholar
Schlotterbeck, D., Uribe, P., Araya, R., Jimenez, A., Caballero, D.: What classroom audio tells about teaching: a cost-effective approach for detection of teaching practices using spectral audio features. In: LAK21: 11th International Learning Analytics and Knowledge Conference, pp. 132–140 (2021)
Google Scholar
Smith, M.K., Jones, F.H., Gilbert, S.L., Wieman, C.E.: The classroom observation protocol for undergraduate stem (COPUS): a new instrument to characterize university stem classroom practices. CBE-Life Sci. Educ. 12(4), 618–627 (2013)
Article Google Scholar
Smith, M.K., Vinson, E.L., Smith, J.A., Lewin, J.D., Stetzer, M.R.: A campus-wide study of stem courses: new perspectives on teaching practices and perceptions. CBE-Life Sci. Educ. 13(4), 624–635 (2014)
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar
Wang, Z., Pan, X., Miller, K.F., Cortina, K.S.: Automatic classification of activities in classroom discourse. Comput. Educ. 78, 115–123 (2014)
Article Google Scholar
Wolf, T., et al.: Huggingface’s transformers: State-of-the-art natural language processing (2019). arXiv preprint arXiv:1910.03771

Download references

Author information

Authors and Affiliations

Center for Advanced Research in Education, University of Chile, Santiago, Chile
Danner Schlotterbeck, Pablo Uribe, Abelino Jiménez, Roberto Araya, Johan van der Molen Moris & Daniela Caballero

Authors

Danner Schlotterbeck
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Uribe
View author publications
You can also search for this author in PubMed Google Scholar
Abelino Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Araya
View author publications
You can also search for this author in PubMed Google Scholar
Johan van der Molen Moris
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Caballero
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Haifa, Israel
Ido Roll
Arizona State University, Tempe, AZ, USA
Danielle McNamara
Utrecht University, Utrecht, The Netherlands
Sergey Sosnovsky
London Knowledge Lab, London, UK
Rose Luckin
University of Leeds, Leeds, UK
Vania Dimitrova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schlotterbeck, D., Uribe, P., Jiménez, A., Araya, R., van der Molen Moris, J., Caballero, D. (2021). TARTA: Teacher Activity Recognizer from Transcriptions and Audio. In: Roll, I., McNamara, D., Sosnovsky, S., Luckin, R., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2021. Lecture Notes in Computer Science(), vol 12748. Springer, Cham. https://doi.org/10.1007/978-3-030-78292-4_30

Download citation

DOI: https://doi.org/10.1007/978-3-030-78292-4_30
Published: 11 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78291-7
Online ISBN: 978-3-030-78292-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

TARTA: Teacher Activity Recognizer from Transcriptions and Audio

Abstract

Access this chapter

Similar content being viewed by others

Recognition of Teaching Activities from University Lecture Transcriptions

Toward an Automatic Speech Classifier for the Teacher

“Teacher, Can You Say It Again?" Improving Automatic Speech Recognition Performance over Classroom Environments with Limited Data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

TARTA: Teacher Activity Recognizer from Transcriptions and Audio

Abstract

Access this chapter

Similar content being viewed by others

Recognition of Teaching Activities from University Lecture Transcriptions

Toward an Automatic Speech Classifier for the Teacher

“Teacher, Can You Say It Again?" Improving Automatic Speech Recognition Performance over Classroom Environments with Limited Data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation