Facial expression recognition using iterative fusion of MO-HOG and deep features

The Journal of Supercomputing

Abstract

Facial expression recognition is a challenging problem in computer vision. Because a single feature descriptor has limited feature extraction capability, this paper proposes a facial expression recognition method that iteratively fuses classifiers based on multi-orientation gradient calculated HOG (MO-HOG) features and deep-learned features. Diagonal orientation gradient calculated HOG (D-HOG) is proposed as a complement to the histogram of oriented gradients (HOG): it captures diagonal gradient information and is combined with HOG to form a novel feature descriptor, MO-HOG. Our method extracts MO-HOG features from whole facial images and from expression-rich local facial images. Deep-learned features, meanwhile, are not reliable enough on small databases but contain high-level semantic information, so a deep network is designed to extract effective deep-learned features. In addition, a classifier fusion method based on an optimization algorithm is proposed, and the best fused classifier is obtained through iteration. The method is evaluated on two public databases, CK+ and JAFFE. The results show that the proposed method is effective for facial expression recognition and outperforms the state-of-the-art methods, with a recognition accuracy of 97.70% on the CK+ database and 97.64% on the JAFFE database.
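
The abstract does not spell out how D-HOG is computed, so the following is only a minimal sketch of the general idea of pairing an axis-aligned orientation histogram with a diagonal one. It assumes that the diagonal gradients are central differences along the two image diagonals and that each histogram is L2-normalized per cell; the function names (`orientation_histogram`, `axis_gradients`, `diagonal_gradients`, `mo_hog_cell`) and the 9-bin default are hypothetical choices for illustration, not the authors' implementation.

```python
import numpy as np

def orientation_histogram(g1, g2, n_bins=9):
    """Magnitude-weighted histogram of unsigned orientations, L2-normalized."""
    magnitude = np.hypot(g1, g2)
    angle = np.mod(np.arctan2(g2, g1), np.pi)  # fold orientations into [0, pi)
    bins = np.minimum((angle / np.pi * n_bins).astype(int), n_bins - 1)
    hist = np.bincount(bins.ravel(), weights=magnitude.ravel(), minlength=n_bins)
    norm = np.linalg.norm(hist)
    return hist / norm if norm > 0 else hist

def axis_gradients(cell):
    """Central differences along x and y, as in standard HOG."""
    gx = np.zeros_like(cell)
    gy = np.zeros_like(cell)
    gx[:, 1:-1] = cell[:, 2:] - cell[:, :-2]
    gy[1:-1, :] = cell[2:, :] - cell[:-2, :]
    return gx, gy

def diagonal_gradients(cell):
    """Central differences along the two diagonals (an assumed D-HOG gradient)."""
    d1 = np.zeros_like(cell)
    d2 = np.zeros_like(cell)
    d1[1:-1, 1:-1] = cell[2:, 2:] - cell[:-2, :-2]  # top-left to bottom-right
    d2[1:-1, 1:-1] = cell[2:, :-2] - cell[:-2, 2:]  # top-right to bottom-left
    return d1, d2

def mo_hog_cell(cell, n_bins=9):
    """Concatenate the axis-aligned and diagonal histograms for one cell."""
    cell = np.asarray(cell, dtype=float)
    hog = orientation_histogram(*axis_gradients(cell), n_bins=n_bins)
    d_hog = orientation_histogram(*diagonal_gradients(cell), n_bins=n_bins)
    return np.concatenate([hog, d_hog])

# Usage: an 8x8 cell whose intensity increases along the main diagonal.
cell = np.add.outer(np.arange(8), np.arange(8))
feature = mo_hog_cell(cell)
print(feature.shape)
```

In this sketch the descriptor for a cell is simply twice as long as a plain orientation histogram. A full pipeline would also need block normalization and region selection, which the abstract mentions only at a high level.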

Acknowledgements

This work is supported by the Natural Science Foundation of Anhui Province (1708085MF146), Science and Technology Support Project of Sichuan Province (2016GZ0389), Project of Innovation Team of Ministry of Education of China (IRT17R32), and the Fundamental Research Funds for the Central Universities (No. PA2018GDQT0011).

Author information

Corresponding author

Correspondence to Baofu Fang.

About this article

Cite this article

Wang, H., Wei, S. & Fang, B. Facial expression recognition using iterative fusion of MO-HOG and deep features. J Supercomput 76, 3211–3221 (2020). https://doi.org/10.1007/s11227-018-2554-8

  • DOI: https://doi.org/10.1007/s11227-018-2554-8
