Skip to main content
Log in

Rotation-reversal invariant HOG cascade for facial expression recognition

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

This paper presents a novel classification framework derived from AdaBoost to classify facial expressions. The proposed framework adopts rotation-reversal invariant HOG as features. The framework is implemented by configuring the area under receiver operating characteristic curve of the weak classifier with HOG, which is a discriminative classification framework. The proposed classification framework is evaluated with three very popular and representative public databases: CK+, MMI, and AFEW. The results showed that the proposed classification framework outperforms the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Ashir, A.M., Eleyan, A.: Facial expression recognition based on image pyramid and single-branch decision tree. Signal Image Video Process (2017). doi:10.1007/s11760-016-1052-9

  2. Bourdev, L., Brandt, J.: Robust object detection via soft cascade. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 236–243 (2005)

  3. Brubaker, S., Wu, J., Sun, J., Mullin, M., Rehg, J.: On the design of cascades of boosted ensembles for face detection. Int. J. Comput. Vis. (IJCV) 77(1–3), 65–86 (2008)

    Article  Google Scholar 

  4. Chen, D., Ren, S., Wei, Y., Cao, X., Sun, J.: Joint cascade face detection and alignment. In: Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 109–122 (2014)

  5. Chen, J., Ariki, Y., Takiguchi, T.: Robust facial expressions recognition using 3 D average face and ameliorated adaboost. In: Proc. ACM Multimedia Conf. (MM), pp. 661–664 (2013)

  6. Chen, J., Luo, Z., Takiguchi, T., Ariki, Y.: Multithreading cascade of SURF for facial expression recognition. EURASIP J. Image Video Process. 2016(1), 1–13 (2016)

    Article  Google Scholar 

  7. Chen, J., Nakashika, T., Takiguchi, T., Ariki, Y.: Content-based image retrieval using rotation-invariant histograms of oriented gradients. In: Proc. Int. Conf. on Multimedia Retrieval (ICMR), pp. 443–446 (2015)

  8. Chen, J., Takiguchi, T., Ariki, Y.: Facial expression recognition with multithreaded cascade of rotation-invariant hog. In: Proc. Int. Conf. on Affective Comput. and Intelligent Interaction (ACII), ACII ’15, pp. 636–642 (2015)

  9. Chew, S., Lucey, P., Lucey, S., Saragih, J., Cohn, J., Sridharan, S.: Person-independent facial expression detection using constrained local models. In: FG, pp. 915–920 (2011)

  10. Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 23(6), 681–685 (2001). doi:10.1109/34.927467

    Article  Google Scholar 

  11. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 1, pp. 886–893 (2005)

  12. Dhall, A., Goecke, R., Lucey, S., Gedeon, T.: Collecting large, richly annotated facial-expression databases from movies. IEEE MultiMed. 19(3), 34–41 (2012)

    Article  Google Scholar 

  13. Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)

    MATH  Google Scholar 

  14. Ferri, C., Flach, P.A., Hernández-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Proc. Int. Conf. Machine Learn. (ICML), pp. 139–146 (2002)

  15. Klaeser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: Proc. British Machine Vis. Conf. (BMVC), pp. 99.1–99.10 (2008)

  16. Li, J., Wang, T., Zhang, Y.: Face detection using SURF cascade. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV) Workshops, pp. 2183–2190 (2011)

  17. Li, J., Zhang, Y.: Learning SURF cascade for fast and accurate object detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 3468–3475 (2013)

  18. Li, S.Z., Zhang, Z., Shum, H.Y., Zhang, H.: FloatBoost learning for classification. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 993–1000 (2002)

  19. Liu, M., Li, S., Shan, S., Wang, R., Chen, X.: Deeply learning deformable facial action parts model for dynamic expression analysis. In: Proc. Asia Conf. Comput. Vis. (ACCV), vol. 9006, pp. 143–157 (2014)

  20. Liu, M., Shan, S., Wang, R., Chen, X.: Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 1749–1756 (2014). doi:10.1109/CVPR.2014.226

  21. Long, P., Servedio, R.: Boosting the area under the ROC curve. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 945–952 (2007)

  22. Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR) Workshops, pp. 94–101 (2010)

  23. Pantic, M., Valstar, M.F., Rademaker, R., Maat, L.: Web-based database for facial expression analysis. In: Proc. IEEE Int. Conf. on Multimedia and Expo (ICME), pp. 317–321 (2005)

  24. Rudovic, O., Pavlovic, V., Pantic, M.: Multi-output Laplacian dynamic ordinal regression for facial expression recognition and intensity estimation. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2634–2641 (2012)

  25. Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proc. ACM Multimedia Conf. (MM), pp. 357–360 (2007)

  26. Sochman, J., Matas, J.: WaldBoost-learning for time constrained sequential detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 150–156 (2005)

  27. Takacs, G., Chandrasekhar, V., Tsai, S., Chen, D., Grzeszczuk, R., Girod, B.: Fast computation of rotation-invariant image features by an approximate radial gradient transform. IEEE Trans. Image Proc. (TIP) 22(8), 2970–2982 (2013)

    Article  MathSciNet  Google Scholar 

  28. Trzcinski, T., Christoudias, M., Lepetit, V.: Learning image descriptors with boosting. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 37(3), 597–610 (2015)

    Article  Google Scholar 

  29. Viola, P., Jones, M.: Robust real-time face detection. Int. J. Comput. Vis. (IJCV) 57(2), 137–154 (2004)

    Article  Google Scholar 

  30. Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3D parts for human motion recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2674–2681 (2013)

  31. Xiao, R., Zhu, H., Sun, H., Tang, X.: Dynamic cascades for face detection. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV), pp. 1–8 (2007)

  32. Zhalehpour, S., Akhtar, Z., Eroglu Erdem, C.: Multimodal emotion recognition based on peak frame selection from video. Signal Image Video Process. (SIViP) 10(5), 827–834 (2016)

    Article  Google Scholar 

  33. Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Patt. Anal. Mach. Intell. (TPAMI) 29(6), 915–928 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jinhui Chen.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, J., Takiguchi, T. & Ariki, Y. Rotation-reversal invariant HOG cascade for facial expression recognition. SIViP 11, 1485–1492 (2017). https://doi.org/10.1007/s11760-017-1111-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-017-1111-x

Keywords

Navigation