Skip to main content
Log in

Deepfake generation and detection, a survey

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Deepfake refers to realistic, but fake images, sounds, and videos generated by articial intelligence methods. Recent advances in deepfake generation make deepfake more realistic and easier to make. Deepfake has been a signicant threat to national security, democracy, society, and our privacy, which calls for deepfake detection methods to combat potential threats. In the paper, we make a survey on state-ofthe-art deepfake generation methods, detection methods, and existing datasets. Current deepfake generation methods can be classified into face swapping and facial reenactment. Deepfake detection methods are mainly based features and machine learning methods. There are still some challenges for deepfake detection, such as progress on deepfake generation, lack of high quality datasets and benchmark. Future trends on deepfake detection can be efficient, robust and systematical detection methods and high quality datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. (2018) Malicious deep fake prohibition act of 2018. https://www.congress.gov/bill/115th-congress/senate-bill/3805/text

  2. (2019) Defending each and every person from false appearances by keeping exploitation subject to accountability Act of 2019. https://www.congress.gov/bill/116th-congress/house-bill/3230/text

  3. Afchar D, et al (2018) Mesonet: a compact facial video forgery detection network. In: 2018 IEEE international workshop on information forensics and security (WIFS). IEEE

  4. Agarwal S, et al (2019) Protecting world leaders against deep fakes. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops

  5. Ajder H, Cavalli F, Patrini G, Cullen L (2019) The state of Deepfakes: Landscape, threats, and impact

  6. Amerini I, Caldelli R (2020) Exploiting prediction error inconsistencies through LSTM-based classifiers to detect deepfake videos. In: Proceedings of the 2020 ACM workshop on information hiding and multimedia security. Association for Computing Machinery: Denver, CO, USA, pp 97–102

  7. Amerini I, et al (2019) Deepfake video detection through optical flow based CNN. In: Proceedings of the IEEE international conference on computer vision workshops

  8. Bayar B, Stamm MC (2016) A deep learning approach to universal image manipulation detection using a new convolutional layer, pp 5–10

  9. Carlini N, Farid H (2020) Evading deepfake-image detectors with white- and black-box attacks. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)

  10. Chan C, et al (2018) Everybody dance now. arXiv:1808.07371

  11. Chang X, et al (2020) Deepfake face image detection based on improved VGG convolutional neural network. In: 2020 39th chinese control conference (CCC)

  12. Chen D et al (2019) Face swapping: realistic image synthesis based on facial landmarks alignment. Mathematical Problems in Engineering 2019:1–11

    Google Scholar 

  13. Chesney R, Citron DK (2018) Deep fakes: a looming challenge for privacy, democracy, and national security

  14. Chintha A et al (2020) Recurrent convolutional structures for audio spoof and video deepfake detection. IEEE Journal of Selected Topics in Signal Processing 14(5):1024–1037

    Article  Google Scholar 

  15. Chintha A, et al (2020) Leveraging edges and optical flow on faces for Deepfake detection. In: 2020 IEEE international joint conference on biometrics (IJCB)

  16. Choi Y, et al (2018) Stargan: Unified generative adversarial networks for multi-domain image-to-image translation. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  17. Ciftci UA, Demir I, Yin L (2020) How do the hearts of deep fakes beat? Deep fake source detection via interpreting residuals with biological signals. In: 2020 IEEE international joint conference on biometrics (IJCB)

  18. Dang L et al (2018) Deep learning based computer generated face identification using convolutional neural network. Applied Sciences 8(12):2610

    Article  Google Scholar 

  19. DeepFaceLab. Available from: https://github.com/iperov/DeepFaceLab

  20. Deepfake reddit. Available from: https://www.reddit.com/user/Deepfakes/

  21. DeepFakeDetection data. https://github.com/ondyari/FaceForensics/blob/master/dataset/README.md

  22. Deepfakes. Available from: https://github.com/Deepfakes/

  23. Ding X, et al (2019) Swapped face detection using deep learning and subjective assessment. arXiv:1909.04217

  24. Dolhansky B, et al (2019) The deepfake detection challenge (DFDC) preview dataset. arXiv:1910.08854

  25. Dolhansky B, et al (2020) The deepfake detection challenge (DFDC) dataset. arXiv:2006.07397

  26. Du CXT, et al (2020) Efficient-frequency: a hybrid visual forensic framework for facial forgery detection. In: 2020 IEEE symposium series on computational intelligence (SSCI)

  27. Faceapp. Available from: http://www.faceapp.com/

  28. Farid H (2009) Image forgery detection. IEEE Signal Processing Magazine 26(2):16–25

    Article  Google Scholar 

  29. Fernandes S, et al (2019) Predicting heart rate variations of deepfake videos using neural ODE. In: Proceedings of the IEEE international conference on computer vision workshops

  30. Gandhi A, Jain S (2020) Adversarial perturbations fool deepfake detectors. In: 2020 international joint conference on neural networks (IJCNN)

  31. Geng Z, Cao C, Tulyakov S (2019) 3d guided fine-grained face manipulation. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  32. Gera D, Delp EJ (2018) Deepfake video detection using recurrent neural networks. In: 2018 15th IEEE international conference on advanced video and signal based surveillance (AVSS)

  33. Goodfellow I, et al (2014) Generative adversarial nets. In: Advances in neural information processing systems

  34. Gouhara K, Watanabe T, Uchikawa Y (1991) Learning process of recurrent neural networks. In: [Proceedings] 1991 IEEE international joint conference on neural networks

  35. Guan H, et al (2019) MFC datasets: Large-scale benchmark datasets for media forensic challenge evaluation. In: 2019 IEEE winter applications of computer vision workshops (WACVW). IEEE

  36. Guarnera L, Giudice O, Battiato S (2020) Deepfake detection by analyzing convolutional traces. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)

  37. Gupta P, et al (2020) The eyes know it: FakeET-an eye-tracking database to understand Deepfake perception, In: Proceedings of the 2020 international conference on multimodal interaction, Association for Computing Machinery: Virtual Event, Netherlands, pp 519–527

  38. Hearst MA et al (1998) Support vector machines. IEEE Intelligent Systems and their Applications 13(4):18–28

    Article  Google Scholar 

  39. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Computation 9(8):1735–1780

    Article  Google Scholar 

  40. Hsu C-C, Lee C-Y, Zhuang Y-X (2018) Learning to detect fake face images in the wild. In: 2018 international symposium on computer, consumer and control (IS3C). IEEE

  41. Huang Y, et al (2020) FakePolisher: making deepfakes more detection-evasive by shallow reconstruction. In: Proceedings of the 28th ACM international conference on multimedia. Association for Computing Machinery, pp 1217–1226

  42. Huh M, et al (2018) Fighting fake news: Image splice detection via learned self-consistency. In: Proceedings of the European conference on computer vision (ECCV)

  43. Isola P, et al (2017) Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  44. Jung T, Kim S, Kim K (2020) DeepVision: Deepfakes detection using human eye blinking pattern. IEEE Access 8:83144–83154

    Article  Google Scholar 

  45. Karras T, et al (2017) Progressive growing of gans for improved quality, stability, and variation. arXiv:1710.10196

  46. Karras T, Laine S, Aila T (2019) A style-based generator architecture for generative adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  47. Khalid H, Woo SS (2020) OC-FakeDect: classifying deepfakes using one-class variational autoencoder. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)

  48. Kharbat FF, et al (2019) Image feature detectors for deepfake video detection. In: 2019 IEEE/ACS 16th international conference on computer systems and applications (AICCSA)

  49. Khodabakhsh A, Busch C (2020) A generalizable deepfake detector based on neural conditional distribution modelling. In: 2020 international conference of the biometrics special interest group (BIOSIG)

  50. Khodabakhsh A, et al (2018) Fake face detection methods: can they be generalized? In: 2018 international conference of the biometrics special interest group (BIOSIG). IEEE

  51. Kim H et al (2018) Deep video portraits. ACM Transactions on Graphics (TOG) 37(4):163

    Google Scholar 

  52. Korshunova I, et al (2017) Fast face-swap using convolutional neural networks. In: Proceedings of the IEEE international conference on computer vision

  53. Korshunov P, Marcel S (2018) Deepfakes: a new threat to face recognition? assessment and detection. arXiv:1812.08685

  54. Korshunov P, Marcel S (2018) Speaker inconsistency detection in tampered video. In: 2018 26th european signal processing conference (EUSIPCO). IEEE

  55. Korshunov P, Marcel S (2019) Vulnerability assessment and detection of Deepfake videos. In: The 12th IAPR international conference on biometrics (ICB)

  56. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Article  Google Scholar 

  57. Liang T, et al (2020) SDHF: spotting deepfakes with hierarchical features. In: 2020 IEEE 32nd international conference on tools with artificial intelligence (ICTAI)

  58. Li Y, Chang M-C, Lyu S (2018) In ictu oculi: Exposing ai created fake videos by detecting eye blinking. In: 2018 IEEE international workshop on information forensics and security (WIFS). IEEE

  59. Li H, et al (2018) Can forensic detectors identify GAN generated images? In: 2018 Asia-pacific signal and information processing association annual summit and conference (APSIPA ASC)

  60. Li Y, et al (2020) Celeb-DF: A large-scale challenging dataset for deepfake forensics. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  61. Li Y, Lyu S (2018) Exposing Deepfake videos by detecting face warping artifacts. arXiv:1811.00656

  62. Matern F, Riess C, Stamminger M (2019) Exploiting visual artifacts to expose Deepfakes and face manipulations. In: 2019 IEEE winter applications of computer vision workshops (WACVW)

  63. Mittal T, et al (2020) Emotions Don’t Lie: An audio-visual deepfake detection method using affective cues. In: Proceedings of the 28th ACM international conference on multimedia. Association for Computing Machinery, pp 2823–2832

  64. Mo H, Chen B, Luo W (2018) Fake faces identification via convolutional neural network. In: Proceedings of the 6th ACM workshop on information hiding and multimedia security. ACM

  65. Montserrat DM, et al (2020) Deepfakes detection with automatic face weighting. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)

  66. Nataraj L, et al (2019) Detecting GAN generated fake images using co-occurrence matrices. arXiv:1903.06836

  67. Natsume R, Yatagawa T, Morishima S (2018) RSGAN: face swapping and editing using face and hair representation in latent spaces. arXiv:1804.03447

  68. Neves JC et al (2020) GANprintR: improved fakes and evaluation of the state of the art in face manipulation detection. IEEE Journal of Selected Topics in Signal Processing 14(5):1038–1048

    Article  Google Scholar 

  69. Nguyen HM, Derakhshani R (2020) Eyebrow recognition for identifying Deepfake videos. In: 2020 international conference of the biometrics special interest group (BIOSIG)

  70. Nguyen HH, Yamagishi J, Echizen I (2019) Capsule-forensics: using capsule networks to detect forged images and videos. In: ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE

  71. Nirkin Y, Keller Y, Hassner T (2019) FSGAN: Subject agnostic face swapping and reenactment. arXiv:1908.05932

  72. Oord Avd, et al (2016) Wavenet: A generative model for raw audio. arXiv:1609.03499

  73. Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22(10):1345–1359

    Article  Google Scholar 

  74. Patel M, et al (2020) Trans-DF: a transfer learning-based end-to-end deepfake detector. In: 2020 IEEE 5th international conference on computing communication and automation (ICCCA)

  75. Pu J, et al (2020) NoiseScope: detecting deepfake images in a blind setting. In: Annual computer security applications conference. Association for Computing Machinery, Austin, pp 913–927

  76. Pumarola A, et al (2018) Ganimation: Anatomically-aware facial animation from a single image. In: Proceedings of the european conference on computer vision (ECCV)

  77. Rahmouni N, et al (2017) Distinguishing computer graphics from natural images using convolution neural networks. In: 2017 IEEE workshop on information forensics and security (WIFS). IEEE

  78. Rana MS, Sung AH (2020) DeepfakeStack: a deep ensemble-based learning technique for deepfake detection. In: 2020 7th IEEE international conference on cyber security and cloud computing (CSCloud)/2020 6th IEEE international conference on edge computing and scalable cloud (EdgeCom)

  79. Ranjan P, Patil S, Kazi F (2020) Improved generalizability of deep-fakes detection using transfer learning based CNN framework. In: 2020 3rd international conference on information and computer technologies (ICICT)

  80. Rssler A, et al (2018) Faceforensics: A large-scale video dataset for forgery detection in human faces. arXiv:1803.09179

  81. Rssler A, et al (2019) FaceForensics++: learning to detect manipulated facial images. In: 2019 IEEE/CVF international conference on computer vision (ICCV)

  82. Sabir E et al (2019) Recurrent convolutional strategies for face manipulation detection in videos. Interfaces (GUI) 3:1

    Google Scholar 

  83. Thies J et al (2018) Headon: Real-time reenactment of human portrait videos. ACM Transactions on Graphics (TOG) 37(4):164

    Article  Google Scholar 

  84. Thies J, Zollhfer M, Niener M (2019) Deferred neural rendering: image synthesis using neural textures. ACM Trans. Graph 38(4):66

    Article  Google Scholar 

  85. Thies J, et al (2016) Face2face: Real-time face capture and reenactment of rgb videos. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  86. Tursman E, et al (2020) Towards untrusted social video verification to combat deepfakes via face geometry consistency. In: 2020 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)

  87. Wang S-Y, et al (2019) Detecting photoshopped faces by scripting photoshop. arXiv:1906.05856

  88. Xie D, et al (2020) Deepfake detection on publicly available datasets using modified AlexNet. In: 2020 IEEE symposium series on computational intelligence (SSCI)

  89. Yang T, et al (2020) VTD-Net: depth face forgery oriented video tampering detection based on convolutional neural network. In: 2020 39th chinese control conference (CCC)

  90. Yang X, Li Y, Lyu S (2019) Exposing deep fakes using inconsistent head poses. In: 2019 IEEE international conference on acoustics, speech and signal processing (ICASSP)

  91. Yu N, Davis L, Fritz M (2018) Attributing fake images to GANs: Analyzing fingerprints in generated images. arXiv:1811.08180

  92. ZAO. Available from: https://www.zaoapp.net

  93. Zakharov E, et al (2019) Few-shot adversarial learning of realistic neural talking head models. arXiv:1905.08233

  94. Zhao Y et al (2020) Capturing the persistence of facial expression features for deepfake video detection. Springer International Publishing, Cham

    Book  Google Scholar 

  95. Zhao Z, Wang P, Lu W (2020) Detecting deepfake video by learning two-level features with two-stream convolutional neural network. In: Proceedings of the 2020 6th international conference on computing and artificial intelligence. Association for Computing Machinery, pp 291–297

  96. Zhou P, et al (2017) Two-stream neural networks for tampered face detection. In: 2017 IEEE conference on computer vision and pattern recognition workshops (CVPRW). IEEE

  97. Zhuang Y-X, Hsu C-C (2019) Detecting generated image based on a coupled network with two-step pairwise learning. In: 2019 IEEE international conference on image processing (ICIP). IEEE

  98. Zhu K, Wu B, Wang B (2020) Deepfake detection with clustering-based embedding regularization. In: 2020 IEEE fifth international conference on data science in cyberspace (DSC)

  99. Zi B, et al (2020) WildDeepfake: a challenging real-world dataset for deepfake detection. In: Proceedings of the 28th ACM international conference on multimedia. Association for Computing Machinery, pp 2382–2390

Download references

Acknowledgements

This work was supported by Guangxi Key Laboratory of Cryptography and Information Security (GCIS201806), Guangxi Key Laboratory of Trusted Software (No. kx202016), Key Lab of Film and TV Media Technology of Zhejiang Province (No.2020E10015), Guangxi Key Laboratory of Hybrid Computation and IC Design Analysis, Guangxi University for Nationalities(GXIC20-03), Key Laboratory of Oceanographic Big Data Mining & Application of Zhejiang Province(obdma202001). We’d like to thank Zelei Cheng from Purdue University and Yingjie Wang from Virginia Tech for writing assistance, language editing, and proofreading.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tao Zhang.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, T. Deepfake generation and detection, a survey. Multimed Tools Appl 81, 6259–6276 (2022). https://doi.org/10.1007/s11042-021-11733-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11733-y

Keywords

Navigation