Detect Video Forgery by Performing Transfer Learning on Deep Neural Network

  • Zhaohe Zhang
  • Qingzhong LiuEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1075)


Nowadays, the authenticity of digital image and videos becomes hard while the forgery techniques are more advanced. Given the recent progress on Generative Neural Network (GNN) development that may generate realistic images and videos, it becomes more difficult to detect the authenticity. In this paper, we expose a popular open source video forgery library called “DeepFaceLab” by making use of deep learning. We retrain the existing state-of-the-art image classification neural networks to capture the features from manipulated video frames. After passing various sets of forgery video frames through a well-trained neural network, a bottleneck layer is created for each image, this layer contains compact information for all images, and exposes the artifacts in forgery videos. We obtained above 99% accuracy when testing on DeepFake videos. In addition, we tested our method on FaceForensics dataset and achieved good detection accuracy.


Forgery detection Transfer learning Deep neural network Face forensics DeepFake FaceForensics 


  1. 1.
    Buzzfeed: You Won’t Believe What Obama Says in This Video! [video] (2019).
  2. 2.
    Facebook: Systems and Methods for Dynamically Generating Emojis Based on Image Analysis of Facial Features. US 20170140214A1 (2017)Google Scholar
  3. 3.
    Thies, J., Zollhofer, M., Stamminger, M., Theobalt, C., Niessner, M.: Face2Face: real-time face capture and reenactment of RGB videos. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2387–2395 (2016)Google Scholar
  4. 4.
    Thies, J., Zollhöfer, M., Theobalt, C., Stamminger, M., Niessner, M.: Headon: real-time reenactment of human portrait videos. ACM Trans. Graph. 37(4), 1–13 (2018)CrossRefGoogle Scholar
  5. 5.
    Suwajanakorn, S., Seitz, S., Kemelmacher-Shlizerman, I.: Synthesizing Obama: learning lip sync from audio. ACM Trans. Graph. 36(4), 1–13 (2017)CrossRefGoogle Scholar
  6. 6.
    Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks, arXiv:1812.04948 [cs, stat] (2018)
  7. 7.
    Lample, G., Zeghidour, N., Usunier, N., Bordes, A., Denoyer, L., Ranzato, M.: Fader networks: manipulating images by sliding attributes, arXiv:1706.00409 [cs] (2017)
  8. 8.
    Lu,Y., Tai, Y.-W., Tang, C.-K.: Attribute-guided face generation using conditional CycleGAN, arXiv:1705.09966 [cs, stat] (2017)
  9. 9.
  10. 10.
    Liu, Z., Luo, P., Wang, X., Tang, X.: Deep learning face attributes in the wild. In: 2015 IEEE International Conference on Computer Vision (2015)Google Scholar
  11. 11.
    Seibold, C., Samek, W., Hilsmann, A., Eisert, P.: Detection of face morphing attacks by deep learning. In: Kraetzer, C., Shi, Y.-Q., Dittmann, J., Kim, H.J. (eds.) Digital Forensics and Watermarking, vol. 10431. Springer International Publishing, pp. 107–120 (2017)Google Scholar
  12. 12.
    Guera, D., Delp, E.J.: DeepFake video detection using recurrent neural networks. In: Proceedings 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Auckland, New Zealand, pp. 1–6 (2018)Google Scholar
  13. 13.
    Li, Y., Lyu, S.: Exposing DeepFake Videos By Detecting Face Warping Artifacts, arXiv:1811.00656 [cs] (2018)
  14. 14.
    Hui, J.: How deep learning fakes videos (DeepFake) and how to detect it? (2018).
  15. 15.
    Kazemi, V., Sullivan, J.: One millisecond face alignment with an ensemble of regression trees. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (2014).
  16. 16.
  17. 17.
    Li, C., Balaban, S.: Transfer Learning with TensorFlow Tutorial: Image Classification Example (2018).
  18. 18.
    TensorFlow Hub. How to Retrain an Image Classifier for NewCategories (2019):
  19. 19.
    Rössler, A., Cozzolino, D., Verdoliva, L., Riess, C., Thies, J., Nießner, M.: FaceForensics: a large-scale video dataset for forgery detection in human faces, arXiv:1803.09179 [cs] (2018)
  20. 20.
    Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications, arXiv:1704.04861 [cs] (2017)
  21. 21.
    Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision, arXiv:1512.00567 [cs] (2015)

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Department of Computer ScienceSam Houston State UniversityHuntsvilleUSA

Personalised recommendations