Rotation-reversal invariant HOG cascade for facial expression recognition

Chen, Jinhui; Takiguchi, Tetsuya; Ariki, Yasuo

doi:10.1007/s11760-017-1111-x

Rotation-reversal invariant HOG cascade for facial expression recognition

Original Paper
Published: 12 May 2017

Volume 11, pages 1485–1492, (2017)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

469 Accesses
33 Citations
Explore all metrics

Abstract

This paper presents a novel classification framework derived from AdaBoost to classify facial expressions. The proposed framework adopts rotation-reversal invariant HOG as features. The framework is implemented by configuring the area under receiver operating characteristic curve of the weak classifier with HOG, which is a discriminative classification framework. The proposed classification framework is evaluated with three very popular and representative public databases: CK+, MMI, and AFEW. The results showed that the proposed classification framework outperforms the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ashir, A.M., Eleyan, A.: Facial expression recognition based on image pyramid and single-branch decision tree. Signal Image Video Process (2017). doi:10.1007/s11760-016-1052-9
Bourdev, L., Brandt, J.: Robust object detection via soft cascade. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 236–243 (2005)
Brubaker, S., Wu, J., Sun, J., Mullin, M., Rehg, J.: On the design of cascades of boosted ensembles for face detection. Int. J. Comput. Vis. (IJCV) 77(1–3), 65–86 (2008)
Article Google Scholar
Chen, D., Ren, S., Wei, Y., Cao, X., Sun, J.: Joint cascade face detection and alignment. In: Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 109–122 (2014)
Chen, J., Ariki, Y., Takiguchi, T.: Robust facial expressions recognition using 3 D average face and ameliorated adaboost. In: Proc. ACM Multimedia Conf. (MM), pp. 661–664 (2013)
Chen, J., Luo, Z., Takiguchi, T., Ariki, Y.: Multithreading cascade of SURF for facial expression recognition. EURASIP J. Image Video Process. 2016(1), 1–13 (2016)
Article Google Scholar
Chen, J., Nakashika, T., Takiguchi, T., Ariki, Y.: Content-based image retrieval using rotation-invariant histograms of oriented gradients. In: Proc. Int. Conf. on Multimedia Retrieval (ICMR), pp. 443–446 (2015)
Chen, J., Takiguchi, T., Ariki, Y.: Facial expression recognition with multithreaded cascade of rotation-invariant hog. In: Proc. Int. Conf. on Affective Comput. and Intelligent Interaction (ACII), ACII ’15, pp. 636–642 (2015)
Chew, S., Lucey, P., Lucey, S., Saragih, J., Cohn, J., Sridharan, S.: Person-independent facial expression detection using constrained local models. In: FG, pp. 915–920 (2011)
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 23(6), 681–685 (2001). doi:10.1109/34.927467
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 1, pp. 886–893 (2005)
Dhall, A., Goecke, R., Lucey, S., Gedeon, T.: Collecting large, richly annotated facial-expression databases from movies. IEEE MultiMed. 19(3), 34–41 (2012)
Article Google Scholar
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
MATH Google Scholar
Ferri, C., Flach, P.A., Hernández-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Proc. Int. Conf. Machine Learn. (ICML), pp. 139–146 (2002)
Klaeser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: Proc. British Machine Vis. Conf. (BMVC), pp. 99.1–99.10 (2008)
Li, J., Wang, T., Zhang, Y.: Face detection using SURF cascade. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV) Workshops, pp. 2183–2190 (2011)
Li, J., Zhang, Y.: Learning SURF cascade for fast and accurate object detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 3468–3475 (2013)
Li, S.Z., Zhang, Z., Shum, H.Y., Zhang, H.: FloatBoost learning for classification. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 993–1000 (2002)
Liu, M., Li, S., Shan, S., Wang, R., Chen, X.: Deeply learning deformable facial action parts model for dynamic expression analysis. In: Proc. Asia Conf. Comput. Vis. (ACCV), vol. 9006, pp. 143–157 (2014)
Liu, M., Shan, S., Wang, R., Chen, X.: Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 1749–1756 (2014). doi:10.1109/CVPR.2014.226
Long, P., Servedio, R.: Boosting the area under the ROC curve. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 945–952 (2007)
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR) Workshops, pp. 94–101 (2010)
Pantic, M., Valstar, M.F., Rademaker, R., Maat, L.: Web-based database for facial expression analysis. In: Proc. IEEE Int. Conf. on Multimedia and Expo (ICME), pp. 317–321 (2005)
Rudovic, O., Pavlovic, V., Pantic, M.: Multi-output Laplacian dynamic ordinal regression for facial expression recognition and intensity estimation. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2634–2641 (2012)
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proc. ACM Multimedia Conf. (MM), pp. 357–360 (2007)
Sochman, J., Matas, J.: WaldBoost-learning for time constrained sequential detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 150–156 (2005)
Takacs, G., Chandrasekhar, V., Tsai, S., Chen, D., Grzeszczuk, R., Girod, B.: Fast computation of rotation-invariant image features by an approximate radial gradient transform. IEEE Trans. Image Proc. (TIP) 22(8), 2970–2982 (2013)
Article MathSciNet Google Scholar
Trzcinski, T., Christoudias, M., Lepetit, V.: Learning image descriptors with boosting. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 37(3), 597–610 (2015)
Article Google Scholar
Viola, P., Jones, M.: Robust real-time face detection. Int. J. Comput. Vis. (IJCV) 57(2), 137–154 (2004)
Article Google Scholar
Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3D parts for human motion recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2674–2681 (2013)
Xiao, R., Zhu, H., Sun, H., Tang, X.: Dynamic cascades for face detection. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV), pp. 1–8 (2007)
Zhalehpour, S., Akhtar, Z., Eroglu Erdem, C.: Multimodal emotion recognition based on peak frame selection from video. Signal Image Video Process. (SIViP) 10(5), 827–834 (2016)
Article Google Scholar
Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Patt. Anal. Mach. Intell. (TPAMI) 29(6), 915–928 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

RIEB, Kobe University, Kobe, 657-8501, Japan
Jinhui Chen
The Organization of Advanced Science and Technology, Kobe University, Kobe, 657-8501, Japan
Tetsuya Takiguchi & Yasuo Ariki

Authors

Jinhui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuya Takiguchi
View author publications
You can also search for this author in PubMed Google Scholar
Yasuo Ariki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jinhui Chen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, J., Takiguchi, T. & Ariki, Y. Rotation-reversal invariant HOG cascade for facial expression recognition. SIViP 11, 1485–1492 (2017). https://doi.org/10.1007/s11760-017-1111-x

Download citation

Received: 16 November 2016
Revised: 22 March 2017
Accepted: 03 May 2017
Published: 12 May 2017
Issue Date: November 2017
DOI: https://doi.org/10.1007/s11760-017-1111-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Rotation-reversal invariant HOG cascade for facial expression recognition

Abstract

Access this article

Similar content being viewed by others

Expression Recognition with Ri-HOG Cascade

Fast facial expression recognition using Boosted Histogram of Oriented Gradient (BHOG) features

Facial Expression Recognition Based on Classification Tree

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Rotation-reversal invariant HOG cascade for facial expression recognition

Abstract

Access this article

Similar content being viewed by others

Expression Recognition with Ri-HOG Cascade

Fast facial expression recognition using Boosted Histogram of Oriented Gradient (BHOG) features

Facial Expression Recognition Based on Classification Tree

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation