Learning to Detect Deepfakes via Adaptive Attention and Constrained Difference

Su, Lichao; Wu, Bin; Dai, Chenwei; Luo, Huan; Chen, Jian

doi:10.1007/978-981-99-7356-9_31


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14252)

Included in the following conference series:

International Conference on Information and Communications Security


Abstract

Facial forgery techniques have made remarkable progress, and forgery detection has attracted significant attention because of the security concerns these techniques raise. Existing methods use convolutional neural networks (CNNs) to mine discriminative clues for forgery detection. However, most of these methods are coarse-grained and rely on vanilla architectures, so they struggle to extract subtle and multiscale clues. To address these problems, we propose a deep learning framework, named SCA-Net, that exploits subtle, multiscale and multiview clues. Specifically, our framework consists of a skipped channel attention module (SCM), a constrained difference module (CDM) and an adaptive attention module (AAM). First, the skipped channel attention module serves as the backbone and extracts both low-level and high-level features. Then, the constrained difference module captures manipulation clues from the input image based on constrained characteristics. Finally, the adaptive attention module captures the multiscale features exhibited by facial forgeries. Moreover, we introduce a combined loss to ease the training of our framework. Experimental results show that, in most cases, the proposed model compares favorably with other face forgery detection methods.
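The abstract does not spell out how the constrained difference module is built. As a rough illustration of what "constrained characteristics" can mean, the sketch below assumes a constrained convolution in the style of Bayar and Stamm (2018): the centre weight of each filter is fixed to -1 and the remaining weights are normalised to sum to 1, so the filter outputs a prediction residual instead of image content. The function names `project_constrained` and `residual` are hypothetical, and the CDM in SCA-Net may differ from this.

```python
import numpy as np


def project_constrained(kernel, eps=1e-8):
    """Project a square, odd-sized 2-D kernel onto the constraint set:
    centre weight = -1, all other weights sum to 1 (assumed constraint)."""
    k = np.array(kernel, dtype=np.float64)  # copy so the input is untouched
    c = k.shape[0] // 2
    k[c, c] = 0.0                 # exclude the centre from the normalisation
    total = k.sum()
    if abs(total) < eps:
        raise ValueError("off-centre weights sum to ~0; cannot normalise")
    k /= total                    # off-centre weights now sum to 1
    k[c, c] = -1.0                # centre weight fixed to -1
    return k


def residual(image, kernel):
    """Valid-mode cross-correlation of a 2-D image with the kernel.
    With a constrained kernel, each output is (prediction from the
    neighbours) minus (the centre pixel)."""
    img = np.asarray(image, dtype=np.float64)
    windows = np.lib.stride_tricks.sliding_window_view(img, kernel.shape)
    return np.einsum("ijkl,kl->ij", windows, kernel)


# Illustrative data only: a random 5x5 kernel and a flat 8x8 image.
rng = np.random.default_rng(0)
kernel = project_constrained(rng.uniform(0.1, 1.0, size=(5, 5)))
flat = np.full((8, 8), 0.5)
flat_residual = residual(flat, kernel)  # a flat region leaves no residual
```

Because the constrained weights sum to zero, smooth image content is suppressed and only local deviations survive, which is why filters of this kind are used to expose manipulation traces. In a trainable network the projection would typically be re-applied after each optimiser step.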



Author information

Authors and Affiliations

College of Computer and Data Science, Fuzhou University, Fujian, China
Lichao Su, Bin Wu, Chenwei Dai & Huan Luo
College of Physics and Information Engineering, Fuzhou University, Fujian, China
Jian Chen


Corresponding author

Correspondence to Bin Wu.

Editor information

Editors and Affiliations

Nankai University, Tianjin, China
Ding Wang
Columbia University, New York, NY, USA
Moti Yung
Nankai University, Tianjin, China
Zheli Liu
Xidian University, Xi’an, China
Xiaofeng Chen


About this paper

Cite this paper

Su, L., Wu, B., Dai, C., Luo, H., Chen, J. (2023). Learning to Detect Deepfakes via Adaptive Attention and Constrained Difference. In: Wang, D., Yung, M., Liu, Z., Chen, X. (eds) Information and Communications Security. ICICS 2023. Lecture Notes in Computer Science, vol 14252. Springer, Singapore. https://doi.org/10.1007/978-981-99-7356-9_31


DOI: https://doi.org/10.1007/978-981-99-7356-9_31
Published: 20 October 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7355-2
Online ISBN: 978-981-99-7356-9
eBook Packages: Computer Science, Computer Science (R0)
