REALY: Rethinking the Evaluation of 3D Face Reconstruction

Chai, Zenghao; Zhang, Haoxian; Ren, Jing; Kang, Di; Xu, Zhengzhuo; Zhe, Xuefei; Yuan, Chun; Bao, Linchao

doi:10.1007/978-3-031-20074-8_5

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13668))

Included in the following conference series:

European Conference on Computer Vision

1805 Accesses
7 Citations

Abstract

The evaluation of 3D face reconstruction results typically relies on a rigid shape alignment between the estimated 3D model and the ground-truth scan. We observe that aligning two shapes with different reference points can largely affect the evaluation results. This poses difficulties for precisely diagnosing and improving a 3D face reconstruction method. In this paper, we propose a novel evaluation approach with a new benchmark REALY, consists of 100 globally aligned face scans with accurate facial keypoints, high-quality region masks, and topology-consistent meshes. Our approach performs region-wise shape alignment and leads to more accurate, bidirectional correspondences during computing the shape errors. The fine-grained, region-wise evaluation results provide us detailed understandings about the performance of state-of-the-art 3D face reconstruction methods. For example, our experiments on single-image based reconstruction methods reveal that DECA performs the best on nose regions, while GANFit performs better on cheek regions. Besides, a new and high-quality 3DMM basis, HIFI3D\(^{\pmb {+}\pmb {+}}\), is further derived using the same procedure as we construct REALY to align and retopologize several 3D face datasets. We will release REALY, HIFI3D\(^{\pmb {+}\pmb {+}}\), and our new evaluation pipeline at https://realy3dface.com.

Z. Chai and H. Zhang—Equal Contributions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amberg, B., Romdhani, S., Vetter, T.: Optimal step nonrigid ICP algorithms for surface registration. In: CVPR (2007)
Google Scholar
Bagdanov, A.D., Bimbo, A.D., Masi, I.: The Florence 2d/3d hybrid face dataset. In: J-HGBU@MM (2011)
Google Scholar
Bai, Z., Cui, Z., Liu, X., Tan, P.: Riggable 3d face reconstruction via in-network optimization. In: CVPR (2021)
Google Scholar
Bao, L., et al.: High-fidelity 3d digital human head creation from RGB-D selfies. TOG (2021)
Google Scholar
Besl, P.J., McKay, N.D.: A method for registration of 3d shapes. TPAMI (1992)
Google Scholar
Blanz, V., Vetter, T.: A morphable model for the synthesis of 3d faces. In: SIGGRAPH (1999)
Google Scholar
Blanz, V., Vetter, T.: Face recognition based on fitting a 3d morphable model. TPAMI (2003)
Google Scholar
Booth, J., Roussos, A., Zafeiriou, S., Ponniah, A., Dunaway, D.: A 3d morphable model learnt from 10,000 faces. In: CVPR (2016)
Google Scholar
Brunton, A., Salazar, A., Bolkart, T., Wuhrer, S.: Review of statistical shape spaces for 3d data with comparative analysis for human faces. In: CVIU (2014)
Google Scholar
Cao, C., Weng, Y., Lin, S., Zhou, K.: 3d shape regression for real-time facial animation. TOG 32, 1 (2013)
Article MATH Google Scholar
Cao, C., Weng, Y., Zhou, S., Tong, Y., Zhou, K.: Facewarehouse: a 3d facial expression database for visual computing. TVCG 20, 413–425 (2014)
Google Scholar
Cao, C., Wu, H., Weng, Y., Shao, T., Zhou, K.: Real-time facial animation with image-based dynamic avatars. TOG 35, 1–13 (2016)
Google Scholar
Cao, K., Rong, Y., Li, C., Tang, X., Loy, C.C.: Pose-robust face recognition via deep residual equivariant mapping. In: CVPR (2018)
Google Scholar
Chang, F., Tran, A.T., Hassner, T., Masi, I., Nevatia, R., Medioni, G.G.: Faceposenet: making a case for landmark-free face alignment. In: ICCV Workshops (2017)
Google Scholar
Chang, F., Tran, A.T., Hassner, T., Masi, I., Nevatia, R., Medioni, G.G.: ExpNet: landmark-free, deep, 3d facial expressions. In: FG (2018)
Google Scholar
Chaudhuri, B., Vesdapunt, N., Shapiro, L., Wang, B.: Personalized face modeling for improved face reconstruction and motion retargeting. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12350, pp. 142–160. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58558-7_9
Chapter Google Scholar
Chen, Y., Wu, F., Wang, Z., Song, Y., Ling, Y., Bao, L.: Self-supervised learning of detailed 3d face reconstruction. TIP 29, 8696–8705 (2020)
MATH Google Scholar
Dai, H., Pears, N.E., Smith, W.A.P., Duncan, C.: Statistical modeling of craniofacial shape and texture. IJCV 128, 547–571 (2020)
Article Google Scholar
Deng, Y., Yang, J., Xu, S., Chen, D., Jia, Y., Tong, X.: Accurate 3d face reconstruction with weakly-supervised learning: From single image to image set. In: CVPR Workshops (2019)
Google Scholar
Dib, A., Thebault, C., Ahn, J., Gosselin, P., Theobalt, C., Chevallier, L.: Towards high fidelity monocular face reconstruction with rich reflectance using self-supervised learning and ray tracing. In: ICCV (2021)
Google Scholar
Egger, B., et al.: 3d morphable face models - past, present, and future. TOG. 39, 1–38 (2020)
Article Google Scholar
Feng, Y., Feng, H., Black, M.J., Bolkart, T.: Learning an animatable detailed 3d face model from in-the-wild images. In: SIGGRAPH (2021)
Google Scholar
Feng, Y., Wu, F., Shao, X., Wang, Y., Zhou, X.: Joint 3D face reconstruction and dense alignment with position map regression network. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 557–574. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_33
Chapter Google Scholar
Feng, Z., et al.: Evaluation of dense 3d reconstruction from 2d face images in the wild. In: FG (2018)
Google Scholar
Gao, Z., Zhang, J., Guo, Y., Ma, C., Zhai, G., Yang, X.: Semi-supervised 3d face representation learning from unconstrained photo collections. In: CVPR Workshops (2020)
Google Scholar
Gecer, B., Ploumpis, S., Kotsia, I., Zafeiriou, S.: GANFIT: generative adversarial network fitting for high fidelity 3d face reconstruction. In: CVPR (2019)
Google Scholar
Gecer, B., Ploumpis, S., Kotsia, I., Zafeiriou, S.: Fast-GANFIT: generative adversarial network for high fidelity 3d face reconstruction. TPAMI (2021)
Google Scholar
Genova, K., Cole, F., Maschinot, A., Sarna, A., Vlasic, D., Freeman, W.T.: Unsupervised training for 3d morphable model regression. In: CVPR (2018)
Google Scholar
Guo, J., Zhu, X., Yang, Y., Yang, F., Lei, Z., Li, S.Z.: Towards Fast, Accurate and Stable 3D Dense Face Alignment. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12364, pp. 152–168. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58529-7_10
Chapter Google Scholar
Hu, L., et al.: Avatar digitization from a single image for real-time rendering. TOG 36, 1–4 (2017)
Article Google Scholar
Jiang, D., et al.: Reconstructing recognizable 3d face shapes based on 3d morphable models. CoRR, abs/2104.03515 (2021)
Google Scholar
Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks. In: CVPR (2019)
Google Scholar
Lattas, A., et al.: Avatarme: Realistically renderable 3d facial reconstruction “in-the-wild”. In: CVPR (2020)
Google Scholar
Lee, G., Lee, S.: Uncertainty-aware mesh decoder for high fidelity 3d face reconstruction. In: CVPR (2020)
Google Scholar
Li, T., Bolkart, T., Black, M.J., Li, H., Romero, J.: Learning a model of facial shape and expression from 4d scans. TOG (2017)
Google Scholar
Lin, J., Yuan, Y., Shao, T., Zhou, K.: Towards high-fidelity 3d face reconstruction from in-the-wild images using graph convolutional networks. In: CVPR (2020)
Google Scholar
Lin, J., Yuan, Y., Zou, Z.: Meingame: create a game character face from a single portrait. In: AAAI (2021)
Google Scholar
Liu, F., Zhu, R., Zeng, D., Zhao, Q., Liu, X.: Disentangling features in 3d face shapes for joint face reconstruction and recognition. In: CVPR (2018)
Google Scholar
Liu, P., Han, X., Lyu, M.R., King, I., Xu, J.: Learning 3d face reconstruction with a pose guidance network. In: ACCV (2020)
Google Scholar
Liu, Z., Luo, P., Wang, X., Tang, X.: Deep learning face attributes in the wild. In: ICCV (2015)
Google Scholar
Luo, H., et al.: Normalized avatar synthesis using styleGAN and perceptual refinement. In: CVPR (2021)
Google Scholar
Lyu, J., Li, X., Zhu, X., Cheng, C.: Pixel-face: A large-scale, high-resolution benchmark for 3d face reconstruction. arXiv preprint arXiv:2008.12444 (2020)
Ma, S., et al.: Pixel codec avatars. In: CVPR (2021)
Google Scholar
Pan, X., Dai, B., Liu, Z., Chen, C.L., Luo, P.: Do 2d GANs know 3d shape? Unsupervised 3d shape reconstruction from 2d image GANs. In: ICLR (2021)
Google Scholar
Paysan, P., Knothe, R., Amberg, B., Romdhani, S., Vetter, T.: A 3d face model for pose and illumination invariant face recognition. In: AVSS (2009)
Google Scholar
Phillips, P.J., et al.: Overview of the face recognition grand challenge. In: CVPR (2005)
Google Scholar
Piao, J., Sun, K., Wang, Q., Lin, K., Li, H.: Inverting generative adversarial renderer for face reconstruction. In: CVPR (2021)
Google Scholar
Ploumpis, S., Wang, H., Pears, N.E., Smith, W.A.P., Zafeiriou, S.: Combining 3d morphable models: a large scale face-and-head model. In: CVPR (2019)
Google Scholar
R, M.B., Tewari, A., Seidel, H., Elgharib, M., Theobalt, C.: Learning complete 3d morphable face models from images and videos. In: CVPR (2021)
Google Scholar
Ramon, E., et al.: H3D-Net: few-shot high-fidelity 3d head reconstruction. In: ICCV (2021)
Google Scholar
Richardson, E., Sela, M., Kimmel, R.: 3d face reconstruction by learning from synthetic data. In: 3DV (2016)
Google Scholar
Sanyal, S., Bolkart, T., Feng, H., Black, M.J.: Learning to regress 3d face shape and expression from an image without 3d supervision. In: CVPR (2019)
Google Scholar
Sela, M., Richardson, E., Kimmel, R.: Unrestricted facial geometry reconstruction using image-to-image translation. In: ICCV (2017)
Google Scholar
Shang, J., et al.: Self-supervised monocular 3d face reconstruction by occlusion-aware multi-view geometry consistency. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12360, pp. 53–70. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58555-6_4
Chapter Google Scholar
Smith, W.A.P., Seck, A., Dee, H., Tiddeman, B., Tenenbaum, J.B., Egger, B.: A morphable face albedo model. In: CVPR (2020)
Google Scholar
Stratou, G., Ghosh, A., Debevec, P.E., Morency, L.P.: Effect of illumination on automatic expression recognition: a novel 3d relightable facial database. In: FG (2011)
Google Scholar
Tewari, A., et al.: MoFA: Model-based deep convolutional face autoencoder for unsupervised monocular reconstruction. In: ICCV (2017)
Google Scholar
Thies, J., Zollhöfer, M., Stamminger, M., Theobalt, C., Nießner, M.: Face2face: Real-time face capture and reenactment of RGB videos. In: CVPR (2016)
Google Scholar
Tran, A.T., Hassner, T., Masi, I., Medioni, G.G.: Regressing robust and discriminative 3d morphable models with a very deep neural network. In: CVPR (2017)
Google Scholar
Tran, L., Liu, F., Liu, X.: Towards high-fidelity nonlinear 3d face morphable model. In: CVPR (2019)
Google Scholar
Tran, L., Liu, X.: Nonlinear 3d face morphable model. In: CVPR (2018)
Google Scholar
Tran, L., Liu, X.: On learning 3d face morphable model from in-the-wild images. TPAMI 43, 157–171 (2021)
Google Scholar
Wen, Y., Liu, W., Raj, B., Singh, R.: Self-supervised 3d face reconstruction via conditional estimation. In: ICCV (2021)
Google Scholar
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. TPAMI (2009)
Google Scholar
Wu, F., et al.: MVF-Net: multi-view 3d face morphable model regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 959–968 (2019)
Google Scholar
Wu, S., Rupprecht, C., Vedaldi, A.: Unsupervised learning of probably symmetric deformable 3d objects from images in the wild. In: CVPR (2020)
Google Scholar
Yamaguchi, S., et al.: High-fidelity facial reflectance and geometry inference from an unconstrained image. TOG. 37, 1–4 (2018)
Article Google Scholar
Yang, H., et al.: Facescape: a large-scale high quality 3d face dataset and detailed riggable 3d face prediction. In: CVPR (2020)
Google Scholar
Yenamandra, T., et al.: i3DMM: deep implicit 3d morphable model of human heads. In: CVPR (2021)
Google Scholar
Yin, L., Wei, X., Sun, Y., Wang, J., Rosato, M.J.: A 3d facial expression database for facial behavior research. In: FG (2006)
Google Scholar
Zeng, X., Peng, X., Qiao, Y.: DF2Net: a dense-fine-finer network for detailed 3d face reconstruction. In: ICCV (2019)
Google Scholar
Zhang, Z., et al.: Learning to aggregate and personalize 3d face from in-the-wild photo collection. In: CVPR (2021)
Google Scholar
Zhu, X., Liu, X., Lei, Z., Li, S.Z.: Face alignment in full pose range: a 3d total solution. TPAMI. 41, 18–92 (2019)
Article Google Scholar
Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: CVPR (2012)
Google Scholar
Zhu, X., et al.: Beyond 3DMM space: towards fine-grained 3d face reconstruction. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12353, pp. 343–358. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58598-3_21
Chapter Google Scholar
Zollhöfer, M., et al.: State of the art on monocular 3d face reconstruction, tracking, and applications. In: CGF (2018)
Google Scholar

Download references

Acknowledgment

This work was supported by SZSTC Grant No. JCYJ20190 809172201639 and WDZC20200820200655001, Shenzhen Key Laboratory ZDSY S20210623092001004.

Author information

Authors and Affiliations

Shenzhen International Graduate School, Tsinghua University, Beijing, China
Zenghao Chai, Zhengzhuo Xu & Chun Yuan
Tencent AI Lab, Bellevue, China
Haoxian Zhang, Jing Ren, Di Kang, Xuefei Zhe & Linchao Bao
Peng Cheng National Laboratory, Shenzhen, China
Chun Yuan

Authors

Zenghao Chai
View author publications
You can also search for this author in PubMed Google Scholar
Haoxian Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Ren
View author publications
You can also search for this author in PubMed Google Scholar
Di Kang
View author publications
You can also search for this author in PubMed Google Scholar
Zhengzhuo Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xuefei Zhe
View author publications
You can also search for this author in PubMed Google Scholar
Chun Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Linchao Bao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chun Yuan or Linchao Bao .

Editor information

Editors and Affiliations

Tel Aviv University, Tel Aviv, Israel
Shai Avidan
University College London, London, UK
Gabriel Brostow
Google AI, Accra, Ghana
Moustapha Cissé
University of Catania, Catania, Italy
Giovanni Maria Farinella
Facebook (United States), Menlo Park, CA, USA
Tal Hassner

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 18358 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chai, Z. et al. (2022). REALY: Rethinking the Evaluation of 3D Face Reconstruction. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds) Computer Vision – ECCV 2022. ECCV 2022. Lecture Notes in Computer Science, vol 13668. Springer, Cham. https://doi.org/10.1007/978-3-031-20074-8_5

Download citation

DOI: https://doi.org/10.1007/978-3-031-20074-8_5
Published: 12 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20073-1
Online ISBN: 978-3-031-20074-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

REALY: Rethinking the Evaluation of 3D Face Reconstruction