A simple teacher behavior recognition method for massive teaching videos based on teacher set

Gang, Zhao; Wenjuan, Zhu; Biling, Hu; Jie, Chu; Hui, He; Qing, Xia

doi:10.1007/s10489-021-02329-y

A simple teacher behavior recognition method for massive teaching videos based on teacher set

Published: 14 April 2021

Volume 51, pages 8828–8849, (2021)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Zhao Gang¹,
Zhu Wenjuan ORCID: orcid.org/0000-0003-0987-6730¹,
Hu Biling¹,
Chu Jie¹,
He Hui¹ &
…
Xia Qing¹

1364 Accesses
14 Citations
Explore all metrics

Abstract

The analysis of teacher behavior of massive teaching videos has become a surge of research interest recently. Traditional methods rely on accurate manual analysis, which is extremely complex and time-consuming for analyzing massive teaching videos. However, existing works on action recognition are difficultly transplanted to the teacher behavior recognition, because it is difficult to extract teacher’s behavior from complex teaching scenario, and teacher’s behaviors are given professional educational semantics. These methods are not adequate for the need of the teacher behavior recognition. Thus, a novel and simple recognition method of teacher behavior in the actual teaching scene for massive teaching videos is proposed, which can provide technical assistance for analyzing teacher behavior and fill the blank of automatic recognition of teacher behavior in actual teaching scene. Firstly, we discover the educational pattern which it be named “teacher set”, that is, the spatial region of the video of the whole class where teachers should exist. Based on this, the algorithm of teacher set identification and extraction (Teacher-set IE algorithm) is studied to identify the teacher in the teaching video, and reduce the interference factors of classroom background. Then, an improved behavior recognition network based on 3D bilinear pooling (3D BP-TBR) is presented to enhance fusion representation of three-dimensional features thus identifying the categories of teacher behavior, and experiments show that 3D BP-TBR can achieve better performance on public and self-built dataset (TAD-08). Hence, our whole approach can increase recognition accuracy of teacher behavior in the actual teaching scene to utilize the deep integration of educational characteristics and action recognition technology.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Student Classroom Behavior Detection Based on YOLOv7+BRA and Multi-model Fusion

Bag of Deep Features for Instructor Activity Recognition in Lecture Room

Multimodal behavior analysis in computer-enabled laboratories using nonverbal cues

Article 29 May 2020

References

Van den Hurk HTG, Houtveen AAM, Van de Grift WJCM (2016) Fostering effective teaching behavior through the use of data-feedback. Teach Teach Educ60:444–451
Hadie SNH, Hassan A, Talip SB et al (2018) The Teacher Behavior Inventory: validation of teacher behavior in an interactive lecture environment. Teacher Development, pp 1–14
Gebhard JG (1998) Teaching English as a foreign or second language: A teacher self-development and methodology guide. University of Michigan Press, Michigan
Google Scholar
Cheng K H, Tsai C C (2019) A Case Study of Immersive Virtual Field Trips in an Elementary Classroom: Students’ Learning Experience and Teacher-student Interaction Behaviors. Comput Educ 140:103600
Nagro S A, Cornelius K E (2013) Evaluating the evidence base of video analysis: a special education teacher development tool. Teach Educ Special Educ 36(4):312–329
Article Google Scholar
Mintzes J J (1982) Relationships between student perceptions of teaching behavior and learning outcomes in college biology. J Res Sci Teach 19(9):789–794
Article Google Scholar
Flanders N A (1961) Analyzing teacher behavior. Educ Leadersh 19(3):173
Google Scholar
Kucuk S, Sisman B (2017) Behavioral Patterns of Elementary Students and Teachers in one-to-one Robotics Instruction. Comput Educ 111:31–43
Article Google Scholar
Zhang J, Zhu K (2012) The analytical research on teaching behavior based on classroom observation. Mod Educ Technol 22(4):25–28
Google Scholar
Man X (2018) An Analysis of Japanese Teaching Behavior Based on the Combination Membership Function. In: International Conference on Intelligent Transportation, Big Data & Smart City, pp 258–261
Simonyan K, Zisserman A Two-stream Convolutional Networks for Action Recognition in Videos. arXiv:1406.2199
Tran D, Bourdev L, Fergus R, Torresani L, Paluri M (2015) Learning Spatiotemporal Features with 3D Convolutional Networks. In: IEEE International Conference on Computer Vision, pp 4489–4497
Wang L, Xiong Y, Wang Z, Qiao Y, Lin D et al Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. arXiv:1608.00859
Zhou B, Andonian A, Oliva A, Torralba A Temporal Relational Reasoning in Videos. arXiv:1711.08496
Zolfaghari M, Singh K, Brox T (2018) ECO: Efficient Convolutional Network for Online Video Understanding. In: Lecture Notes in Computer Science, pp 713–730
Qiu Z, Yao T, Mei T (2017) Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks. In: IEEE International Conference on Computer Vision, pp 5534– 5542
Diba A, Fayyaz M, Sharma V et al Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classiffcation. arXiv:1711.08200
Carreira J, Zisserman A (2017) Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 4724–4733
Ren H, Xu G (2002) Human Action Recognition in Smart Classroom. In: IEEE International Conference on Automatic Face and Gesture Recognition, pp 417–422
Raza A, Yousaf M H, Sial H A, Raja G (2015) HMM-Based Scheme for Smart Instructor Activity Recognition in a Lecture Room Environment. Smart Comput Rev 5(6):578–590
Article Google Scholar
Nida N, Yousaf M H, Irtaza A, Velastin S A (2019) Instructor activity recognition through deep spatiotemporal features and feedforward extreme learning machines. Math Probl Eng:1–13
Reinke WM, Herman KC, Newcomer L (2016) The Brief Student–Teacher Classroom Interaction Observation: Using Dynamic Indicators of Behaviors in the Classroom to Predict Outcomes And Inform Practice. Assessment for Effective Intervention, pp 1–11
Flanders N A (1963) Intent, action and feedback: a preparation for teaching. J Teach Educ 14 (3):251–260
Article Google Scholar
Kiemer K, Gröschner A, Pehmer A K, Seidel T (2015) Effects of a classroom discourse intervention on teachers’ practice and students’ motivation to learn mathematics and science. Learn Instr 35(1):94–103
Article Google Scholar
Wang H, Schmid C (2013) Action Recognition with Improved Trajectories. In: IEEE International Conference on Computer Vision, pp 3551–3558
Mahjoub A B, Atri M (2019) An Efficient end-to-end Deep Learning Architecture for Activity Classification. Analog Integr Circ Sig Process 99:23–32
Article Google Scholar
Wang X, Gao L, Song J, Shen H (2017) Beyond Frame-level CNN: Saliency-aware 3D CNN with LSTM for Video Action Recognition. IEEE Signal Process Lett 24(4):510–514
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 770–778
Gao H, Liu Z, Laurens VDM, Kilian QW (2017) Densely Connected Convolutional Networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2261–2269
Xiong X, Min W, Zheng W, et al. (2020) S3d-CNN: Skeleton-based 3D Consecutive-low-pooling Neural Network for Fall Detection. Appl Intell 50:3521–3534
Article Google Scholar
Song H, Wu X, Zhu B, Wu Y, Chen M, Jia Y (2019) Temporal action localization in untrimmed videos using action pattern trees. IEEE Trans Multimed 21(3):717–730
Article Google Scholar
Purwanto D, Pramono R R A, Chen Y T, Fang W H (2019) Three-Stream Network with bidirectional Self-Attention for action recognition in extreme Low-Resolution videos. IEEE Signal Process Lett 26 (8):1187–1191
Article Google Scholar
Li Z, Gavrilyuk K, Gavves E, Jain M, Snoek C G M (2017) VideoLSTM Convolves, Attends and Flows for Action Recognition. Comput Vis Image Underst 166:41–50
Article Google Scholar
Soomro K, Zamir A R, Shah M (2012) UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild. arXiv:1212.0402
Kuehne H, Jhuang H, Garrote E, Poggio T, Serre T (2011) HMDB: A Large Video Database for Human Motion Recognition. In: International Conference on Computer Vision, pp 2556–2563
Heilbron FC, Escorcia V, Ghanem B, Niebles JC (2015) ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 961–970
Gu C, Chen S, David R et al (2018) AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. In: IEEE International Conference on Computer Vision, pp 6047–6056
Pan J, Chen S, Shou Z, Shao J, Li H (2020) Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization. arXiv:2006.07976
Linstone H, Turoff M (1975) The Delphi Method. Techniques and Applications
Okoli C, Pawlowski SD (2004) The Delphi Method as A Research Tool: An Example, Design Considerations and Applications - Sciencedirect. Inf Manag 42(1):15–29
Belton I, Macdonald A, Wright G, Hamlin I (2019) Improving the Practical Application of The Delphi Method in Group-based Judgment: A Six-step Prescription for A Well-founded and Defensible Process. Technol Forecast Soc Change 147:72–82
Valtonen T, Sointu E, Kukkonen J, Kontkanen S et al (2017) TPACK Updated to Measure Pre-service Teachers’ Twenty-first Century Skills. Austral J Educ Technol 33(3):15–31
Liu Q, Zhang N, Chen W, Wang Q, Yuan Y, Xie K (2020) Categorizing Teachers’ Gestures in Classroom Teaching: From the Perspective of Multiple Representations. Social Semiotics, pp 1–21
He K, Gkioxari G, Dollar P, Girshick R (2017) Mask R-CNN. IEEE Trans Pattern Anal Mach Intell 42(2):386–397
Article Google Scholar
Wojke N, Bewley A, Paulus D (2017) Simple Online and Realtime Tracking with a Deep Association Metric. In: IEEE International Conference on Image Processing, pp 3645–3649
Lin TY, RoyChowdhury A, Maji S (2015) Bilinear CNN Models for Fine-Grained Visual Recognition. In: IEEE International Conference on Computer Vision, pp 1449–1457
Yu C, Zhao X, Zheng Q, Zhang P, You X (2018) Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition. In: European Conference on Computer Vision, pp 595– 610
Szegedy C, Ioffe S, Vanhoucke V. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. arXiv:1602.07261
Majd M, Safabakhsh R (2019) A Motion-aware convLSTM Network for Action Recognition. Appl Intell 49(7):2515– 2521
Article Google Scholar
Ray J, Chang S F, Paluri M ConvNet Architecture Search for Spatiotemporal Feature Learning. arXiv:1708.05038
Liu Z, Li Z, Wang R, Zong M, Ji W (2020) Spatiotemporal Saliency-based Multi-stream Networks with Attention-aware LSTM for Action Recognition. Neural Computing & Application (11)
Khowaja S A, Lee S (2020) Semantic image networks for human action recognition. Int J Comput Vis 128:393–419
Article Google Scholar
Zhang Z, Lv Z, Gan C, Zhu Q (2020) Human Action Recognition using Convolutional LSTM and Fully-connected LSTM with Different Attentions. Neurocomputing 410:304–316
Article Google Scholar
Zong M, Wang R, Chen Z, et al. (2020) Multi-cue based 3D Residual Network for Action Recognition. Neural Comput Appl:1–15
Zheng Z, An G, Wu D, Ruan Q (2019) Spatial-temporal Pyramid based Convolutional Neural Network for Action Recognition. Neurocomputing 358:446–455
Article Google Scholar
Qiu ZF, Yao T, Ngo CW, Tian XM, Mei T (2019) Learning Spatio-Temporal Representation With Local and Global Diffusion. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 12056–12065
Yao G, Lei T, Zhong J, et al. (2019) Learning Multi-temporal-scale deep Information for Action Recognition. Appl Intell 49:2017–2029
Article Google Scholar
Zhu Y, Liu G (2020) Fine-grained Action Recognition using Multi-view Attentions. Vis Comput 36:1771–1781
Article Google Scholar
Fang M, Bai X, Zhao J, et al. (2020) Integrating gaussian mixture model and dilated residual network for action recognition in videos. Multimed Syst 26:715–725
Article Google Scholar
Li J, Liu X, Zhang M, Wang D (2020) Spatio-temporal Deformable 3D ConvNets with Attention for Action Recognition. Pattern Recogn 98(2020):107037

Download references

Acknowledgements

This work was supported by the Research on Automatic Segmentation and Recognition of Teaching Scene with the Characteristics of Teaching Behavior of National Natural Science Foundation of China [61977034]; and the Project named Research on Outdoor Experiential Learning Environment Construction Method Based on Scene Perception granted by the Humanities and Social Science project of Chinese Ministry of Education[17YJA880104]; and the Research on Key Technology of Intelligent Education Evaluation and Service Based on Blockchain Technology (CCNU20ZN004) financially supported by self-determined research funds of CCNU from the colleges basic research and operation of MOE. We also thank the anonymous reviewers for their valuable comments and suggestions.

Author information

Authors and Affiliations

School of Educational Information Technology, Faculty of Artificial Intelligence Education, Central China Normal University, Wuhan, China
Zhao Gang, Zhu Wenjuan, Hu Biling, Chu Jie, He Hui & Xia Qing

Authors

Zhao Gang
View author publications
You can also search for this author in PubMed Google Scholar
Zhu Wenjuan
View author publications
You can also search for this author in PubMed Google Scholar
Hu Biling
View author publications
You can also search for this author in PubMed Google Scholar
Chu Jie
View author publications
You can also search for this author in PubMed Google Scholar
He Hui
View author publications
You can also search for this author in PubMed Google Scholar
Xia Qing
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhu Wenjuan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gang, Z., Wenjuan, Z., Biling, H. et al. A simple teacher behavior recognition method for massive teaching videos based on teacher set. Appl Intell 51, 8828–8849 (2021). https://doi.org/10.1007/s10489-021-02329-y

Download citation

Accepted: 06 March 2021
Published: 14 April 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s10489-021-02329-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A simple teacher behavior recognition method for massive teaching videos based on teacher set

Abstract

Access this article

Similar content being viewed by others

Student Classroom Behavior Detection Based on YOLOv7+BRA and Multi-model Fusion

Bag of Deep Features for Instructor Activity Recognition in Lecture Room

Multimodal behavior analysis in computer-enabled laboratories using nonverbal cues

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Declaration of interests

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A simple teacher behavior recognition method for massive teaching videos based on teacher set

Abstract

Access this article

Similar content being viewed by others

Student Classroom Behavior Detection Based on YOLOv7+BRA and Multi-model Fusion

Bag of Deep Features for Instructor Activity Recognition in Lecture Room

Multimodal behavior analysis in computer-enabled laboratories using nonverbal cues

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Declaration of interests

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation