Abstract
Addressing the challenges of wild facial expression datasets being affected by illumination and pose variations, and the expression features being dispersed across multiple easily overlooked facial regions, this paper proposes a facial expression recognition network called Binary and Joint Attention Network. Firstly, The multi-scale binary convolution module integrates texture features of different granularities. Subsequently, the proposed Multi-head Joint Attention module focusing on multiple distinct facial areas through multiple attention heads. Lastly, the inter-attention map loss is designed to prevent attention overlap while assisting in classification. Experimental results on multiple datasets, including SFEW2.0, RAF-DB, FER2013, demonstrate that the proposed network can effectively recognize various facial expressions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lyons, M., Akamatsu, S., Kamachi, M., Gyoba, J.: Coding facial expressions with Gabor wavelets. In: Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, pp. 200â205. IEEE computer society Nara, Japan (1998). https://doi.org/10.1109/AFGR.1998.670949
Calder, A.J., Burton, A.M., Miller, P., Young, A.W., Akamatsu, S.: A principal component analysis of facial expressions. Vision Res. 41, 1179â1208 (2001). https://doi.org/10.1016/S0042-6989(01)00002-5, https://doi.org/10.1007/11823285_121
Juefei-Xu, F., Boddeti, V.N., Savvides, M.: Local Binary Convolutional Neural Networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4284â4293. IEEE, Honolulu, HI (2017)
Wang, K., Peng, X., Yang, J., Lu, S., Qiao, Y.: Suppressing uncertainties for large-scale facial expression recognition. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6896â6905. IEEE, Seattle, WA, USA (2020). https://doi.org/10.1109/CVPR42600.2020.00693
Wang, K., Peng, X., Yang, J., Meng, D., Qiao, Y.: Region attention networks for pose and occlusion robust facial expression recognition. IEEE Trans. Image Process. 29, 4057â4069 (2020). https://doi.org/10.1109/TIP.2019.2956143
Li, Y., Zeng, J., Shan, S., Chen, X.: Patch-gated CNN for occlusion-aware facial expression recognition. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 2209â2214 (2018). https://doi.org/10.1109/ICPR.2018.8545853
Zhao, Z., Liu, Q., Wang, S.: Learning deep global multi-scale and local attention features for facial expression recognition in the wild. IEEE Trans. Image Process. 30, 6544-6556 (2021). https://doi.org/10.1109/TIP.2021.3093397
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770â778. IEEE, Las Vegas, NV, USA (2016). https://doi.org/10.1109/CVPR.2016.90
Ding, X., Guo, Y., Ding, G., Han, J.: ACNet: strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1911â1920. IEEE, Seoul, Korea (South) (2019)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: Cbam: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3â19 (2018)
Teng, X., Deng, X.: Optimization of a helical flow inducer of endovascular stent based on the principle of swirling flow in arterial system. J. Biomed. Eng. 27(2):429â434 (2010)
Teng, X., Hwang, W.: Chapter 4. Structural and dynamical hierarchy of fibrillar collagen. In: Kaunas, R., Zemel, A., (eds) Cell and matrix mechanics, pp. 101â118. Taylor and Francis (2014)
Dhall, A., Goecke, R., Lucey, S., Gedeon, T.: Static facial expression analysis in tough conditions: Data, evaluation protocol and benchmark. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 2106â2112. IEEE, Barcelona, Spain (2011). https://doi.org/10.1109/ICCVW.2011.6130508
Li, S., Deng, W., Du, J.: Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2584â2593. IEEE, Honolulu, HI (2017). https://doi.org/10.1109/CVPR.2017.277
Teng, X., Hwang, W.: Chain registry and load-dependent conformational dynamics of collagen. Biomacromolecules. 15, 3019â3029 (2014). https://doi.org/10.1021/bm500641f
Li, Y., Zeng, J., Shan, S., Chen, X.: Occlusion aware facial expression recognition using CNN with attention mechanism. IEEE Trans. Image Process. 28(5), 2439â2450 (2018)
Florea, C., Florea, L., Badea, M.S., Vertan, C., Racoviteanu, A. (Sept 2019) Annealed label transfer for face expression recognition. In: BMVC, p. 104
Li, Y., Zeng, J., Shan, S., Chen, X.: Occlusion aware facial expression recognition using CNN with attention mechanism. IEEE Trans. Image Process. 28, 2439â2450 (2019). https://doi.org/10.1109/TIP.2018.2886767
Li, Y., Lu, Y., Li, J., Lu, G.: Separate loss for basic and compound facial expression recognition in the wild. In: Proceedings of The Eleventh Asian Conference on Machine Learning, pp. 897â911. PMLR (2019)
Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Zeng, J., Shan, S., Chen, X., (eds).: Facial expression recognition with inconsistently annotated datasets. In: ECCV, pp 222â237. Springer International Publishing, Cham (2018). https://doi.org/10.1007/978-3-030-01261-8
Farzaneh, A.H., Qi, X.: Facial expression recognition in the wild via deep attentive center loss. In: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 2401â2410 (2021). https://doi.org/10.1109/WACV48630.2021.00245
Deep disturbance-disentangled learning for facial expression recognition. In: Proceedings of the 28th ACM International Conference on Multimedia, https://dl.acm.org/doi/10.1145/3394171.3413907. Accessed 20 March 2023
Ruan, D., Yan, Y., Lai, S., Chai, Z., Shen, C., Wang, H.: Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition (2021) http://arxiv.org/abs/2104.05160
Sadeghi, H., Raie, A.-A.: HistNet: histogram-based convolutional neural network with Chi-squared deep metric learning for facial expression recognition. Inf. Sci. 608, 472â488 (2022). https://doi.org/10.1016/j.ins.2022.06.092
Ruan, D., Mo, R., Yan, Y., Chen, S., Xue, J.-H., Wang, H.: Adaptive deep disturbance-disentangled learning for facial expression recognition. Int. J. Comput. Vis. 130, 455â477 (2022). https://doi.org/10.1007/s11263-021-01556-7
Farzaneh, A.H., Qi, X.: Discriminant distribution-agnostic loss for facial expression recognition in the wild. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1631â1639 (2020). https://doi.org/10.1109/CVPRW50498.2020.00211
Xie, S., Hu, H., Wu, Y.: Deep multi-path convolutional neural network joint with salient region attention for facial expression recognition. Pattern Recognit. 92, 177â191 (2019). https://doi.org/10.1016/j.patcog.2019.03.019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Âİ 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Qin, M., Li, L. (2023). Fusing Multi-scale Binary Convolution with Joint Attention Face Expression Recognition Algorithm. In: Jia, Y., Zhang, W., Fu, Y., Wang, J. (eds) Proceedings of 2023 Chinese Intelligent Systems Conference. CISC 2023. Lecture Notes in Electrical Engineering, vol 1091. Springer, Singapore. https://doi.org/10.1007/978-981-99-6886-2_34
Download citation
DOI: https://doi.org/10.1007/978-981-99-6886-2_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-6885-5
Online ISBN: 978-981-99-6886-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)