Abstract
We address the task of automatically scoring the competency of candidates based on textual features, from the automatic speech recognition transcriptions in the asynchronous video job interviews. The key challenge is to construct the dependency relations and semantic level interaction over each question–answer (QA) pair. However, most recent studies focus on the representation of questions and answers, but ignore the dependency information and interaction between them, which is critical for QA evaluation. In this work, we propose a hierarchical reasoning graph neural network for the automatic assessment of question–answer pairs. Specifically, we construct a sentence-level relational graph neural network to capture the dependency information of sentences in or between the question and the answer. Based on these graphs, we employ a semantic-level reasoning graph attention network to model the interaction states of the current QA session. Finally, we propose a gated recurrent unit encoder to represent the temporal question–answer pairs for the final prediction. Empirical results on CHNAT (a real-world dataset) validate that our proposed model significantly outperforms matching-based benchmark models. Ablation studies and experimental results with 10 random seeds also show the effectiveness and stability of our models.
Similar content being viewed by others
References
Battaglia PW, Hamrick JB, Bapst V, Sanchez-Gonzalez A, Zambaldi V, Malinowski M, Tacchetti A, Raposo D, Santoro A, Faulkner R, Gulcehre C, Song F, Ballard A, Gilmer J, Dahl G, Vaswani A, Allen K, Nash C, Langston V, Dyer C, Heess N, Wierstra D, Kohli P, Botvinick M, Vinyals O, Li Y, Pascanu R (2018) Relational inductive biases, deep learning, and graph networks. arXiv:1806.01261
Bian S, Xin ZW, Song Y, Zhang T, Wen J-R (2019) Domain adaptation for person-job fit with transferable deep global match network. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4812–4822
Chen Y, Wu L, Zaki MJ (2020) Toward subgraph guided knowledge graph question generation with graph neural networks. arXiv:2004.06015
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR:1412.3555
L CLAUDIA and MARTIN CC-rater (2003) Automated scoring of short-answer questions. Comput Human 37:92–96
Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. EMNLP, pp 670–680
Dai H, Dai B, Song L (2016) Discriminative embeddings of latent variable models for structured data. In: International conference on machine learning, pp 2702–2711
Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805
Richard DA, Paula D, Donald DL (1991) Reproducibility and responsiveness of health status measures statistics and strategies for evaluation. Control Clin Trials 12(4):S142–S158
Ghosal D, Majumder N, Poria S, Chhaya N, Gelbukh A (2019) Dialoguegcn: a graph convolutional neural network for emotion recognition in conversation. CoRR, abs/1908.11540
Goenka P, Piplani M, Sawhney R, Mathur P, Shah RR (2020) Esas: towards practical and explainable short answer scoring (student abstract). In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 13797–13798
Hemamou Léo FG, Martin J-C, Clavel C (2019) Slices of attention in asynchronous video job interviews. In: 2019 8th international conference on affective computing and intelligent interaction (ACII). IEEE, pp 1–7
Hemamou Léo FG, Vandenbussche V, Martin J-C, Clavel C (2019) Hirenet: A hierarchical attention model for the automatic analysis of asynchronous video job interviews. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 573–581
Yunseok J, Yale S, Dongjoo KC, Youngjae Y, Youngjin K, Gunhee K (2019) Video question answering with spatio-temporal reasoning. Int J Comput Vis 127(10):1385–1412
Jiang J-Y, Zhang M, Li C, Bendersky M, Golbandi N, Najork M (2019) Semantic text matching for long-form documents. In: The world wide web conference, pp 795–806
Jiang P, Han Y (2020) Reasoning with heterogeneous graph alignment for video question answering. In: AAAI, pp 11109–11116
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In ICLR, pp 1324–1339
Lawrence I, Lin K (1989) A concordance correlation coefficient to evaluate reproducibility. Biometrics, pp 255–268
Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196
Liu X, Chen Q, Liu Y, Siebert J, Baotian H, Xiangping W, Tang B (2021) Decomposing word embedding with the capsule network. Knowl Based Syst 212:106611
Lun J, Zhu J, Tang Y, Yang M (2020) Multiple data augmentation strategies for improving performance on automatic short answer scoring. In: AAAI, pp 13389–13396
Luo W, Zhang C, Zhang X, Wu H (2019) Improving action recognition with the graph-neural-network-based interaction reasoning. In: 2019 IEEE visual communications and image processing (VCIP). IEEE, pp 1–4
Luo Y, Zhang H, Wen Y, Zhang X (2019) Resumegan: an optimized deep representation learning framework for talent-job fit via adversarial learning. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 1101–1110
Marcheggiani D, Bastings J, Titov I (2018) Exploiting semantics in neural machine translation with graph convolutional networks. In: Proceedings of NAACL, pp 486–492
Mueller J, Thyagarajan A (2016) Siamese recurrent architectures for learning sentence similarity. In: Thirtieth AAAI conference on artificial intelligence
Pan L, Xie Y, Feng Y, Chua T-S, Kan M-Y (2020) Semantic graphs for generating deep questions. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 1463–1475
Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
Qin C, Zhu H, Xu T, Zhu C, Jiang L, Chen E, Xiong H (2018) Enhancing person-job fit for talent recruitment: An ability-aware neural network approach. In: The 41st international ACM SIGIR conference on research & development in information retrieval, pp 25–34
Riordan B, Horbach A, Cahill A, Zesch T, Lee C (2017) Investigating neural architectures for short answer scoring. In: Proceedings of the 12th workshop on innovative use of NLP for building educational applications, pp 159–168
Saha S, Dhamecha TI, Marvaniya S, Foltz P, Sindhgatta R, Sengupta B (2019) Joint multi-domain learning for automatic short answer grading. arXiv:1902.09183
Schlichtkrull M, Kipf TN, Bloem P, Den Berg RV, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks. In: European semantic web conference. Springer, pp 593–607
Shen D, Zhu H, Zhu C, Xu T, Ma C, Xiong H (2018) A joint learning approach to intelligent job interview assessment. In: IJCAI, pp 3542–3548
Suen H-Y, Hung K-E, Lin C-L (2019) Tensorflow-based automatic personality recognition used in asynchronous video interviews. IEEE Access 7:61018–61023
Panagiotis T, George T, Nicolaou MA, Schuller BW, Stefanos Z (2017) End-to-end multimodal emotion recognition using deep neural networks. IEEE J Sel Top Signal Process 11(8):1301–1309
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations
Wang D, Liu P, Zheng Y, Qiu X, Huang X (2020) Heterogeneous graph neural networks for extractive document summarization. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 6209–6219
Xu T, Zhu H, Zhu C, Li P, Xiong H (2018) Measuring the popularity of job skills in recruitment market: a multi-criteria approach. In: Proceedings of the AAAI conference on artificial intelligence, vol 32
Yao L, Mao C, Luo Y (2019) Graph convolutional networks for text classification. In: Proceedings of the AAAI conference on artificial intelligence, pp 7370–7377
Zhang Y, Yu X, Cui Z, Wu S, Wen Z, Wang L (2020) Every document owns its structure: inductive text classification via graph neural networks. In: Association for computational linguistics, pp 334–339
Zhang Y, Chen X, Yang Y, Ramamurthy A, Li B, Qi Y, Song L (2020) Efficient probabilistic logic reasoning with graph neural networks. ICLR
Zhao S, Zhang Y, Xiong X, Botelho A, Heffernan N (2017) A memory-augmented neural model for automated grading. In: Proceedings of the fourth (2017) ACM conference on learning@ scale, pp 189–192
Zhou J, Han X, Yang C, Liu Z, Wang L, Li C, Sun M (2019) Gear: graph-based evidence aggregating and reasoning for fact verification. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 892–901
Acknowledgements
This work is supported by Natural Science Foundation of China (Grant No. 61872113, 61573118, U1813215, 61876052), Special Foundation for Technology Research Program of Guangdong Province (Grant No. 2015B010131010), Strategic Emerging Industry Development Special Funds of Shenzhen (Grant No. JCYJ20170307150528934, JCYJ2017 0811153836555, JCYJ20180306172232154), Innovation Fund of Harbin Institute of Technology (Grant No. HIT. NSRIF.2017052).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, K., Niu, M. & Chen, Q. A hierarchical reasoning graph neural network for the automatic scoring of answer transcriptions in video job interviews. Int. J. Mach. Learn. & Cyber. 13, 2507–2517 (2022). https://doi.org/10.1007/s13042-022-01540-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-022-01540-8