Abstract
With the large-scale and pervading social media platforms and the recent advances in generative deep learning techniques, it is nowadays quite common to forge highly-realistic and credible misleading videos known as DeepFakes. These videos mean to alter the original intention behind the video to put forth their hidden ploys. In this work, a simple yet effective lightweight time distributed (LW-DeepFakeNet) model that uses both spatial and temporal information to determine whether the video has been altered is proposed. The model utilizes a transfer learning approach with pre-trained convolutional networks for spatial feature extraction, topped up with LSTMs for temporal information extraction, requiring little training data and time. This research also considers a special use case of DeepFake where a particular video sequence has a scene change and proposes a way to counter the class-imbalance present in the dataset. The resulting model is much lighter with up to 152x times reduction in parameter count while achieving a significant accuracy of 99.24% at a remarkable rate of 80 fps.
Similar content being viewed by others
Data availability
All data will be available from the corresponding author on reasonable request.
References
Juefei-Xu, F., Wang, R., Huang, Y., Guo, Q., Ma, L., Liu, Y.: Countering malicious deepfakes: survey, battleground, and horizon (2021). https://doi.org/10.48550/ARXIV.2103.00218. arxiv:https://arxiv.org/abs/2103.00218
Zhang, Y., Zheng, L., Thing, V.L.L.: Automated face swapping and its detection. In: 2017 IEEE 2nd International Conference on Signal and Image Processing (ICSIP), pp. 15–19 (2017). https://doi.org/10.1109/SIPROCESS.2017.8124497
Zhou, P., Han, X., Morariu, V.I., Davis, L.S.: Two-stream neural networks for tampered face detection (2018). https://doi.org/10.48550/ARXIV.1803.11276 . arXiv:1803.11276
Marra, F., Gragnaniello, D., Cozzolino, D., Verdoliva, L.: Detection of GAN-generated fake images over social networks. In: IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), vol. 2018, pp. 384–389 (2018). https://doi.org/10.1109/MIPR.2018.00084
Li, Y., Chang, M.-C., Lyu, S.: In Ictu Oculi: exposing ai created fake videos by detecting eye blinking. In: IEEE International Workshop on Information Forensics and Security (WIFS), vol. 2018, pp. 1–7 (2018). https://doi.org/10.1109/WIFS.2018.8630787
Mo, H., Chen, B., Luo, W.: Fake faces identification via convolutional neural network. In: IH &MMSec ’18, Association for Computing Machinery, New York, NY, USA, pp. 43–47 (2018). https://doi.org/10.1145/3206004.3206009
Koopman, M., Macarulla Rodriguez, A., Geradts, Z.: Detection of deepfake video manipulation (2018)
Güera, D., Delp, E.J.: Deepfake video detection using recurrent neural networks. In: 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 1–6 (2018). https://doi.org/10.1109/AVSS.2018.8639163
Matern, F., Riess, C., Stamminger, M.: Exploiting visual artifacts to expose deepfakes and face manipulations. In: IEEE Winter Applications of Computer Vision Workshops (WACVW), vol. 2019, pp. 83–92 (2019). https://doi.org/10.1109/WACVW.2019.00020
Xuan, X., Peng, B., Wang, W., Dong, J.: On the generalization of GAN image forensics (2019). https://doi.org/10.48550/ARXIV.1902.11153. arXiv:1902.11153
Tarasiou, M., Zafeiriou, S.: Extracting deep local features to detect manipulated images of human faces (2019). https://doi.org/10.48550/ARXIV.1911.13269. arXiv:1911.13269
Li, Y., Yang, X., Sun, P., Qi, H., Lyu, S.: Celeb-df: A large-scale challenging dataset for deepfake forensics (2019). https://doi.org/10.48550/ARXIV.1909.12962. arXiv:1909.12962
Kumar, P., Vatsa, M., Singh, R.: Detecting face2face facial reenactment in videos (2020). https://doi.org/10.48550/ARXIV.2001.07444. arXiv:2001.07444
Li, L., Bao, J., Zhang, T., Yang, H., Chen, D., Wen, F., Guo, B.: Face x-ray for more general face forgery detection (2019). https://doi.org/10.48550/ARXIV.1912.13458. arXiv:1912.13458
Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation (2019). https://doi.org/10.48550/ARXIV.1902.09212. arXiv:1902.09212
Sun, K., Zhao, Y., Jiang, B., Cheng, T., Xiao, B., Liu, D., Mu, Y., Wang, X., Liu, W., Wang, J.: High-resolution representations for labeling pixels and regions (2019). https://doi.org/10.48550/ARXIV.1904.04514. arXiv:1904.04514
Lima, O. de., Franklin, S., Basu, S., Karwoski, B., George, A.: Deepfake detection using spatiotemporal convolutional networks (2020). https://doi.org/10.48550/ARXIV.2006.14749. arXiv:2006.14749
Hernandez-Ortega, J., Tolosana, R., Fierrez, J., Morales, A.: Deepfakeson-phys: Deepfakes detection based on heart rate estimation (2020). https://doi.org/10.48550/ARXIV.2010.00400. arXiv:2010.00400
Schwarcz, S., Chellappa, R.: Finding facial forgery artifacts with parts-based detectors (2021). https://doi.org/10.48550/ARXIV.2109.10688. arXiv:2109.10688
Saif, S., Tehseen, S., Ali, S.S., Kausar, S., Jameel, A.: Generalized deepfake video detection through time-distribution and metric learning. IT Prof. 24(2), 38–44 (2022). https://doi.org/10.1109/MITP.2022.3168351
Saikia, P., Dholaria, D., Yadav, P., Patel, V., Roy, M.: A hybrid CNN-LSTM model for video deepfake detection by leveraging optical flow features (2022)
Ke, J., Wang, L.: Df-udetector: an effective method towards robust deepfake detection via feature restoration. Neural Netw. 160, 216–226 (2023). https://doi.org/10.1016/j.neunet.2023.01.001
Dolhansky, B., Bitton, J., Pflaum, B., Lu, J., Howes, R., Wang, M., Ferrer, C.C.: The deepfake detection challenge (DFDC) dataset (2020). https://doi.org/10.48550/ARXIV.2006.07397. arXiv:2006.07397
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). https://doi.org/10.48550/ARXIV.1409.1556. arXiv:1409.1556
Vamsi, V.V.V.N.S., Shet, S.S., Reddy, S.S.M., Rose, S.S., Shetty, S.R., Sathvika, S., Shankar, S.M.S., S.P.: Deepfake detection in digital media forensics. Glob. Transit. Proc. 3(1), 74–79 (2022). International Conference on Intelligent Engineering Approach (ICIEA-2022). https://doi.org/10.1016/j.gltp.2022.04.017
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2009, pp. 248–255 (2009). https://doi.org/10.1109/CVPR.2009.5206848
Acknowledgements
This work was supported by the EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia.
Funding
No funding was obtained for this study.
Author information
Authors and Affiliations
Contributions
Conceptualization, UM, MS, SM, MA, and AAAE-L; methodology, UM, MS, SM; software, UM, and MS; validation, MA, and AAAE-L; formal analysis, UM, and MA; writing-original draft preparation, UM, MS, and SM; writing-review and editing, MA, and AAAE-L; All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no known financial or conflicts of interest associated with this article.
Ethical approval
Ethical approval is not applicable for this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Masud, U., Sadiq, M., Masood, S. et al. LW-DeepFakeNet: a lightweight time distributed CNN-LSTM network for real-time DeepFake video detection. SIViP 17, 4029–4037 (2023). https://doi.org/10.1007/s11760-023-02633-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02633-9