Skip to main content

TARTA: Teacher Activity Recognizer from Transcriptions and Audio

  • Conference paper
  • First Online:
Artificial Intelligence in Education (AIED 2021)

Abstract

Classroom observation methods are fundamental tools for improving the quality of education and students’ academic achievement. However, they traditionally require participation of trained observers, making them expensive, prone to rater bias and time consuming. Hence, to address these challenges we present a cost-effective and non-intrusive method that automatically detects different teaching practices. In particular, we extracted acoustic features and transcriptions from teachers’ talk recordings to train a multimodal learning model called Teacher Activity Recognizer from Transcriptions and Audio (TARTA), which detects three categories derived from the Classroom Observation Protocol for Undergraduate STEM (COPUS), namely Presenting, Administration, and Guiding. We found that by combining acoustic features and transcriptions, our model outperforms separate acoustic- and transcription-based models at the task of predicting teachers’ activities along the lessons. In fact, TARTA can predict with high accuracy and discriminative power the presence of these teaching practices, achieving over 88\(\%\) of accuracy and 92% AUC for all three categories. Our work presents improvements with respect to previous studies since (1) we focus on classifying what teachers do according to a validated protocol instead of discerning whether they or their students are speaking and (2) our model does not rely on expensive or third party equipment, making it easier to scale to large volumes of lessons. This approach represents a useful tool for stakeholders and researchers who intend to analyze teaching practices on a large scale, but also for teachers to receive effective and continuous feedback.

Support from ANID/ PIA/ Basal Funds for Centers of Excellence FB0003 and ANID-FONDECYT grant N\(^\circ \) 3180590 are gratefully acknowledged.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Speech-to-text: automatic speech recognition; google cloud, https://cloud.google.com/speech/

  2. Akiha, K., et al.: What types of instructional shifts do students experience? Investigating active learning in science, technology, engineering, and math classes across key transition points from middle school to the university level. Front. Educ. 2, 68 (2018)

    Google Scholar 

  3. Brian, K.: OECD Insights Human Capital How what you know shapes your life: how what you know shapes your life. OECD publishing (2007)

    Google Scholar 

  4. Canete, J., Chaperon, G., Fuentes, R., Pérez, J.: Spanish pre-trained bert model and evaluation data. PML4DC at ICLR 2020 (2020)

    Google Scholar 

  5. Cosbey, R., Wusterbarth, A., Hutchinson, B.: Deep learning for classroom activity detection from audio. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3727–3731. IEEE (2019)

    Google Scholar 

  6. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  7. Donnelly, P.J., Blanchard, N., Olney, A.M., Kelly, S., Nystrand, M., D’Mello, S.K.: Words matter: automatic detection of teacher questions in live classroom discourse using linguistics, acoustics, and context. In: Proceedings of the Seventh International Learning Analytics & Knowledge Conference, pp. 218–227 (2017)

    Google Scholar 

  8. Ford, M., Baer, C.T., Xu, D., Yapanel, U., Gray, S.: The lenatm language environment analysis system (2008)

    Google Scholar 

  9. Hill, H., Grossman, P.: Learning from teacher observations: challenges and opportunities posed by new teacher evaluation systems. Harv. Educ. Rev. 83(2), 371–384 (2013)

    Article  Google Scholar 

  10. Hill, H.C., Charalambous, C.Y., Kraft, M.A.: When rater reliability is not enough: teacher observation systems and a case for the generalizability study. Educ. Res. 41(2), 56–64 (2012)

    Article  Google Scholar 

  11. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning, pp. 448–456. PMLR (2015)

    Google Scholar 

  12. James, A., et al.: Automated classification of classroom climate by audio analysis. In: D’Haro, L.F., Banchs, R.E., Li, H. (eds.) 9th International Workshop on Spoken Dialogue System Technology. LNEE, vol. 579, pp. 41–49. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-9443-0_4

    Chapter  Google Scholar 

  13. Kelly, S., Olney, A.M., Donnelly, P., Nystrand, M., D’Mello, S.K.: Automatically measuring question authenticity in real-world classrooms. Educ. Res. 47(7), 451–464 (2018)

    Article  Google Scholar 

  14. Kohavi, R., et al.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Ijcai, vol. 14, pp. 1137–1145. Montreal, Canada (1995)

    Google Scholar 

  15. Kronholm, H., Caballero, D., Araya, R., Viiri, J.: A smartphone application for ASR and observation of classroom interactions. In: Finnish Mathematics and Science Education Research Association (FMSERA) Annual Symposium (2016)

    Google Scholar 

  16. Li, H., et al.: Multimodal learning for classroom activity detection. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 9234–9238. IEEE (2020)

    Google Scholar 

  17. McDonald, M., Kazemi, E., Kavanagh, S.S.: Core practices and pedagogies of teacher education: a call for a common language and collective activity. J. Teach. Educ. 64(5), 378–386 (2013)

    Article  Google Scholar 

  18. McIntyre, D.J.: Teacher evaluation and the observer effect. NASSP Bull. 64(434), 36–40 (1980)

    Article  Google Scholar 

  19. Owens, M.T., et al.: Classroom sound can be used to classify teaching practices in college science courses. Proc. Natl. Acad. Sci. 114(12), 3085–3090 (2017)

    Article  Google Scholar 

  20. Samph, T.: Observer effects on teacher behavior (1968)

    Google Scholar 

  21. Schlotterbeck, D., Uribe, P., Araya, R., Jimenez, A., Caballero, D.: What classroom audio tells about teaching: a cost-effective approach for detection of teaching practices using spectral audio features. In: LAK21: 11th International Learning Analytics and Knowledge Conference, pp. 132–140 (2021)

    Google Scholar 

  22. Smith, M.K., Jones, F.H., Gilbert, S.L., Wieman, C.E.: The classroom observation protocol for undergraduate stem (COPUS): a new instrument to characterize university stem classroom practices. CBE-Life Sci. Educ. 12(4), 618–627 (2013)

    Article  Google Scholar 

  23. Smith, M.K., Vinson, E.L., Smith, J.A., Lewin, J.D., Stetzer, M.R.: A campus-wide study of stem courses: new perspectives on teaching practices and perceptions. CBE-Life Sci. Educ. 13(4), 624–635 (2014)

    Article  Google Scholar 

  24. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)

    MathSciNet  MATH  Google Scholar 

  25. Wang, Z., Pan, X., Miller, K.F., Cortina, K.S.: Automatic classification of activities in classroom discourse. Comput. Educ. 78, 115–123 (2014)

    Article  Google Scholar 

  26. Wolf, T., et al.: Huggingface’s transformers: State-of-the-art natural language processing (2019). arXiv preprint arXiv:1910.03771

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Schlotterbeck, D., Uribe, P., Jiménez, A., Araya, R., van der Molen Moris, J., Caballero, D. (2021). TARTA: Teacher Activity Recognizer from Transcriptions and Audio. In: Roll, I., McNamara, D., Sosnovsky, S., Luckin, R., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2021. Lecture Notes in Computer Science(), vol 12748. Springer, Cham. https://doi.org/10.1007/978-3-030-78292-4_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-78292-4_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-78291-7

  • Online ISBN: 978-3-030-78292-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics