Abstract
This paper presents a novel classification framework derived from AdaBoost to classify facial expressions. The proposed framework adopts rotation-reversal invariant HOG as features. The framework is implemented by configuring the area under receiver operating characteristic curve of the weak classifier with HOG, which is a discriminative classification framework. The proposed classification framework is evaluated with three very popular and representative public databases: CK+, MMI, and AFEW. The results showed that the proposed classification framework outperforms the state-of-the-art methods.
Similar content being viewed by others
References
Ashir, A.M., Eleyan, A.: Facial expression recognition based on image pyramid and single-branch decision tree. Signal Image Video Process (2017). doi:10.1007/s11760-016-1052-9
Bourdev, L., Brandt, J.: Robust object detection via soft cascade. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 236–243 (2005)
Brubaker, S., Wu, J., Sun, J., Mullin, M., Rehg, J.: On the design of cascades of boosted ensembles for face detection. Int. J. Comput. Vis. (IJCV) 77(1–3), 65–86 (2008)
Chen, D., Ren, S., Wei, Y., Cao, X., Sun, J.: Joint cascade face detection and alignment. In: Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 109–122 (2014)
Chen, J., Ariki, Y., Takiguchi, T.: Robust facial expressions recognition using 3 D average face and ameliorated adaboost. In: Proc. ACM Multimedia Conf. (MM), pp. 661–664 (2013)
Chen, J., Luo, Z., Takiguchi, T., Ariki, Y.: Multithreading cascade of SURF for facial expression recognition. EURASIP J. Image Video Process. 2016(1), 1–13 (2016)
Chen, J., Nakashika, T., Takiguchi, T., Ariki, Y.: Content-based image retrieval using rotation-invariant histograms of oriented gradients. In: Proc. Int. Conf. on Multimedia Retrieval (ICMR), pp. 443–446 (2015)
Chen, J., Takiguchi, T., Ariki, Y.: Facial expression recognition with multithreaded cascade of rotation-invariant hog. In: Proc. Int. Conf. on Affective Comput. and Intelligent Interaction (ACII), ACII ’15, pp. 636–642 (2015)
Chew, S., Lucey, P., Lucey, S., Saragih, J., Cohn, J., Sridharan, S.: Person-independent facial expression detection using constrained local models. In: FG, pp. 915–920 (2011)
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 23(6), 681–685 (2001). doi:10.1109/34.927467
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 1, pp. 886–893 (2005)
Dhall, A., Goecke, R., Lucey, S., Gedeon, T.: Collecting large, richly annotated facial-expression databases from movies. IEEE MultiMed. 19(3), 34–41 (2012)
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
Ferri, C., Flach, P.A., Hernández-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Proc. Int. Conf. Machine Learn. (ICML), pp. 139–146 (2002)
Klaeser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: Proc. British Machine Vis. Conf. (BMVC), pp. 99.1–99.10 (2008)
Li, J., Wang, T., Zhang, Y.: Face detection using SURF cascade. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV) Workshops, pp. 2183–2190 (2011)
Li, J., Zhang, Y.: Learning SURF cascade for fast and accurate object detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 3468–3475 (2013)
Li, S.Z., Zhang, Z., Shum, H.Y., Zhang, H.: FloatBoost learning for classification. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 993–1000 (2002)
Liu, M., Li, S., Shan, S., Wang, R., Chen, X.: Deeply learning deformable facial action parts model for dynamic expression analysis. In: Proc. Asia Conf. Comput. Vis. (ACCV), vol. 9006, pp. 143–157 (2014)
Liu, M., Shan, S., Wang, R., Chen, X.: Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 1749–1756 (2014). doi:10.1109/CVPR.2014.226
Long, P., Servedio, R.: Boosting the area under the ROC curve. In: Proc. Adv. Neural Inf. Proc. Syst. (NIPS), pp. 945–952 (2007)
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR) Workshops, pp. 94–101 (2010)
Pantic, M., Valstar, M.F., Rademaker, R., Maat, L.: Web-based database for facial expression analysis. In: Proc. IEEE Int. Conf. on Multimedia and Expo (ICME), pp. 317–321 (2005)
Rudovic, O., Pavlovic, V., Pantic, M.: Multi-output Laplacian dynamic ordinal regression for facial expression recognition and intensity estimation. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2634–2641 (2012)
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proc. ACM Multimedia Conf. (MM), pp. 357–360 (2007)
Sochman, J., Matas, J.: WaldBoost-learning for time constrained sequential detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), vol. 2, pp. 150–156 (2005)
Takacs, G., Chandrasekhar, V., Tsai, S., Chen, D., Grzeszczuk, R., Girod, B.: Fast computation of rotation-invariant image features by an approximate radial gradient transform. IEEE Trans. Image Proc. (TIP) 22(8), 2970–2982 (2013)
Trzcinski, T., Christoudias, M., Lepetit, V.: Learning image descriptors with boosting. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 37(3), 597–610 (2015)
Viola, P., Jones, M.: Robust real-time face detection. Int. J. Comput. Vis. (IJCV) 57(2), 137–154 (2004)
Wang, L., Qiao, Y., Tang, X.: Motionlets: mid-level 3D parts for human motion recognition. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 2674–2681 (2013)
Xiao, R., Zhu, H., Sun, H., Tang, X.: Dynamic cascades for face detection. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV), pp. 1–8 (2007)
Zhalehpour, S., Akhtar, Z., Eroglu Erdem, C.: Multimodal emotion recognition based on peak frame selection from video. Signal Image Video Process. (SIViP) 10(5), 827–834 (2016)
Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Patt. Anal. Mach. Intell. (TPAMI) 29(6), 915–928 (2007)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, J., Takiguchi, T. & Ariki, Y. Rotation-reversal invariant HOG cascade for facial expression recognition. SIViP 11, 1485–1492 (2017). https://doi.org/10.1007/s11760-017-1111-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-017-1111-x