Abstract
Gender recognition from images is generally approached by extracting the salient visual features of the observed subject, either focusing on the facial appearance or by analyzing the full body. In real-world scenarios, image-based gender recognition approaches tend to fail, providing unreliable results. Face-based methods are compromised by environmental conditions, occlusions (presence of glasses, masks, hair), and poor resolution. Using a full-body perspective leads to other downsides: clothing and hairstyle may not be discriminative enough for classification, and background cluttering could be problematic. We propose a novel approach for body-shape-based gender classification. Our contribution consists in introducing the so-called Skinned Multi-Person Linear model (SMPL) as 3D human mesh. The proposed solution is robust to poor image resolution and the number of features for the classification is limited, making the recognition task computationally affordable, especially in the classification stage, where less complex learning architectures can be easily trained. The obtained information is fed to an SVM classifier, trained and tested using three different datasets, namely (i) FVG, containing videos of walking subjects (ii) AMASS, collected by converting MOCAP data of people performing different activities into realistic 3D human meshes, and (iii) SURREAL, characterized by synthetic human body models. Additionally, we demonstrate that our approach leads to reliable results even when the parametric 3D mesh is extracted from a single image. Considering the lack of benchmarks in this area, we trained and tested the FVG dataset with a pre-trained Resnet50, for comparing our model-based method with an image-based approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Poser - 3d character art and animation software. https://www.posersoftware.com/
Blackwell, S., et al.: Civilian american and european surface anthropometry resource (caesar), Descriptions, vol. 2, p. 192, June 2002
Bogo, F., Kanazawa, A., Lassner, C., Gehler, P., Romero, J., Black, M.J.: Keep it SMPL: automatic estimation of 3d human pose and shape from a single image. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9909, pp. 561ā578. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46454-1_34
Burger, J., Henderson, J., Kim, G., Zarrella, G.: Discriminating gender on twitter, pp. 1301ā1309, January 2011
Deng, Y., Luo, P., Loy, C.C., Tang, X.: Pedestrian attribute recognition at far distance, pp. 789ā792, November 2014. https://doi.org/10.1145/2647868.2654966
Dhomne, A., Kumar, R., Bhan, V.: Gender recognition through face using deep learning. Proc. Comput. Sci. 132, 2ā10 (2018). https://doi.org/10.1016/j.procs.2018.05.053
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition (2015)
Kanazawa, A., Black, M.J., Jacobs, D.W., Malik, J.: End-to-end recovery of human shape and pose (2018)
Kastaniotis, D., Theodorakopoulos, I., Economou, G., Fotopoulos, S.: Gait-based gender recognition using pose information for real time applications. In: 2013 18th International Conference on Digital Signal Processing (DSP), pp. 1ā6 (2013). https://doi.org/10.1109/ICDSP.2013.6622766
Kolotouros, N., Pavlakos, G., Black, M.J., Daniilidis, K.: Learning to reconstruct 3d human pose and shape via model-fitting in the loop. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1ā10 (2019)
Loper, M., Mahmood, N., Romero, J., Pons-Moll, G., Black, M.J.: SMPL: a skinned multi-person linear model. ACM Trans. Graphics (Proc. SIGGRAPH Asia) 34(6), 248:1ā248:16 (2015)
Mahmood, N., Ghorbani, N., Troje, N.F., Pons-Moll, G., Black, M.J.: AMASS: Archive of motion capture as surface shapes. In: International Conference on Computer Vision, pp. 5442ā5451, Oct 2019
Ng, C.-B., Tay, Y.-H., Goi, B.-M.: Pedestrian gender classification using combined global and local parts-based convolutional neural networks. Pattern Anal. Appl. 22(4), 1469ā1480 (2018). https://doi.org/10.1007/s10044-018-0725-0
Omran, M., Lassner, C., Pons-Moll, G., Gehler, P.V., Schiele, B.: Neural body fitting: Unifying deep learning and model-based human pose and shape estimation (2018)
Oren, M., Papageorgiou, C., Sinha, P., Osuna, E., Poggio, T.: Pedestrian detection using wavelet templates, pp. 193ā199, July 1997. https://doi.org/10.1109/CVPR.1997.609319
Pablo, N., Bruno, P., Celia, C., Virginia, R., Rolando, G.J., Claudio, D.: Gender recognition using 3d human body scans. In: 2018 IEEE Biennial Congress of Argentina (ARGENCON), pp. 1ā6 (2018). https://doi.org/10.1109/ARGENCON.2018.8646293
Pavlakos, G., Zhu, L., Zhou, X., Daniilidis, K.: Learning to estimate 3d human pose and shape from a single color image (2018)
Raza, M., Sharif, M., Yasmin, M., Khan, M., Saba, T., Fernandes, S.: Appearance based pedestriansā gender recognition by employing stacked auto encoders in deep learning. Future Gener. Comput. Syst. 88, 28ā39 (2018). https://doi.org/10.1016/j.future.2018.05.002
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection (2016)
Tan, V., Budvytis, I., Cipolla, R.: Indirect deep structured learning for 3d human body shape and pose prediction. In: BMVC (2017)
Tang, J., Liu, X., Cheng, H., Robinette, K.M.: Gender recognition using 3-d human body shapes. IEEE Trans. Syst. Man Cybern. Part C (Appli. Rev.) 41(6), 898ā908 (2011). https://doi.org/10.1109/TSMCC.2011.2104950
Tang, J., Liu, X., Cheng, H., Robinette, K.M.: Gender recognition with limited feature points from 3-d human body shapes. In: 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 2481ā2484 (2012). https://doi.org/10.1109/ICSMC.2012.6378116
Tung, H.Y.F., Tung, H.W., Yumer, E., Fragkiadaki, K.: Self-supervised learning of motion capture (2017)
Varol, G., et al.: Learning from synthetic humans. In: CVPR (2017)
Wuhrer, S., Shu, C., Rioux, M.: Posture invariant gender classification for 3d human models. In: 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 33ā38 (2009). https://doi.org/10.1109/CVPRW.2009.5204295
Zhang, Z., et al.: Gait recognition via disentangled representation learning. In: Proceeding of IEEE Computer Vision and Pattern Recognition, Long Beach, CA, June 2019
Zhu, J., Liao, S., Lei, Z., Yi, D., Li, S.: Pedestrian attribute classification in surveillance: database and evaluation, pp. 331ā338, Dec 2013. https://doi.org/10.1109/ICCVW.2013.51
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Martinelli, G., Garau, N., Conci, N. (2022). Gender Recognition fromĀ 3D Shape Parameters. In: Mazzeo, P.L., Frontoni, E., Sclaroff, S., Distante, C. (eds) Image Analysis and Processing. ICIAP 2022 Workshops. ICIAP 2022. Lecture Notes in Computer Science, vol 13374. Springer, Cham. https://doi.org/10.1007/978-3-031-13324-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-031-13324-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13323-7
Online ISBN: 978-3-031-13324-4
eBook Packages: Computer ScienceComputer Science (R0)