Skip to main content

Video Analysis System Using Deep Learning Algorithms

  • Conference paper
  • First Online:
Ambient Intelligence – Software and Applications (ISAmI 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1239))

Included in the following conference series:

Abstract

Detection of video duplicates is an active field of research, motivated by the protection of intellectual property, the fight against piracy or the tracing of the origin of reused video segments.

In this work, a method for the detection of duplicate videos is proposed and implemented, making use of deep learning methods and techniques typical of the field of information recovery. This method has been evaluated with a data set usually used in the field, with which high average accuracies, above 85%, have been obtained. The effect of the different layers of the convolutional neural network used by the algorithm, the aggregation mechanisms that can be used on them, and the influence of the recovery model have been studied, finding a set of parameters that optimize the overall accuracy of the system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.scopus.com.

  2. 2.

    https://github.com/BVLC/caffe/tree/master/models/bvlc_alexnet.

  3. 3.

    Although in our case this is not possible due to the use of the code book.

References

  1. Chou, C.-L., Chen, H.-T., Lee, S.-Y.: Pattern-based near-duplicate video retrieval and localization on web-scale videos. IEEE Trans. Multimed. 17(3), 382–395 (2015)

    Article  Google Scholar 

  2. Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimed. Tools Appl. 76(22), 24435–24456 (2017)

    Article  Google Scholar 

  3. Wu, X., Ngo, C.-W., Hauptmann, A., Tan, H.-K.: Real-time near-duplicate elimination for web video search with content and context. IEEE Trans. Multimed. 11(2), 196–207 (2009). https://doi.org/10.1109/TMM.2008.2009673

    Article  Google Scholar 

  4. Wu, X., Hauptmann, A.G., Ngo, C.-W.: Practical elimination of near-duplicates from web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 218–227. ACM (2007)

    Google Scholar 

  5. Hernandez, G.: Sistema de análisis de vídeo mediante la utilización del marco metodológico de los sistemas de razonamiento basados en casos y el uso de algoritmos de aprendizaje profundo. In: Avances en Informática y Automática - Décimotercer Workshop (2019)

    Google Scholar 

  6. Garcıa-Peñalvo, F.: Revisiones y mapeos sistemáticos de literatura (2019). https://doi.org/10.5281/zenodo.2586725

  7. Kitchenham, B., Charters, S.: Guidelines for performing systematic literature reviews in software engineering (2007)

    Google Scholar 

  8. Hu, Y., Lu, X.: Learning spatial-temporal features for video copy detection by the combination of CNN and RNN. J. Vis. Commun. Image Represent. 55, 21–29 (2018). https://doi.org/10.1016/j.jvcir.2018.05.013

    Article  Google Scholar 

  9. Zhang, X., Xie, Y., Luan, X., He, J., Zhang, L., Wu, L.: Video copy detection based on deep CNN features and graph-based sequence matching. Wirel. Pers. Commun. 103(1), 401–416 (2018). https://doi.org/10.1007/s11277-018-5450-x

    Article  Google Scholar 

  10. Law-To, J., Buisson, O., Gouet-Brunet, V., Boujemaa, N.: ViCopT: a robust system for content-based video copy detection in large databases. Multimed. Syst. 15(6), 337–353 (2009). https://doi.org/10.1007/s00530-009-0164-2

    Article  Google Scholar 

  11. Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimed. Tools Appl. 76(22), 24435–24456 (2017). https://doi.org/10.1007/s11042-016-4176-6

    Article  Google Scholar 

  12. Liao, K., Liu, G.: An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering. J. Intell. Inf. Syst. 44(1), 133–158 (2014). https://doi.org/10.1007/s10844-014-0332-5

    Article  Google Scholar 

  13. Su, P.-C., Wu, C.-S.: Efficient copy detection for compressed digital videos by spatial and temporal feature extraction. Multimed. Tools Appl. 76(1), 1331–1353 (2017). https://doi.org/10.1007/s11042-015-3132-1

    Article  Google Scholar 

  14. Guzman-Zavaleta, Z., Feregrino-Uribe, C., Morales-Sandoval, M., Menendez-Ortiz, A.: A robust and low-cost video fingerprint extraction method for copy detection. Multimed. Tools Appl. 76(22), 24143–24163 (2017). https://doi.org/10.1007/s11042-016-4168-6

    Article  Google Scholar 

  15. Boukhari, A., Serir, A.: Weber Binarized Statistical Image Features (WBSIF) based video copy detection. J. Vis. Commun. Image Represent. 34, 50–64 (2016). https://doi.org/10.1016/j.jvcir.2015.10.015

    Article  Google Scholar 

  16. Chamoso, P., González-Briones, A., Rodrguez, S., Corchado, J.M.: Tendencies of technologies and platforms in smart cities: a state-of-the art review. Wirel. Commun. Mob. Comput. 2018, 17 (2018)

    Article  Google Scholar 

  17. Li, T., Sun, S., Bolić, M., Corchado, J.M.: Algorithm design for parallel implementation of the SMC-PHD filter. Sig. Process. 119, 115–127 (2016)

    Article  Google Scholar 

  18. Coria, J.A.G., Castellanos-Garzón, J.A., Corchado, J.M.: Intelligent business processes composition based on multi-agent systems. Expert Syst. Appl. 41(4), 1189–1205 (2014)

    Article  Google Scholar 

  19. Bullón, J., González Arrieta, A., Hernández Encinas, A., Queiruga Dios, A., et al.: Manufacturing processes in the textile industry. Expert Syst. Fabrics Prod. 6(1), 41–50 (2017)

    Google Scholar 

  20. Casado-Vara, R., Martin-del Rey, A., Affes, S., Prieto, J., Corchado, J.M.: IoT network slicing on virtual layers of homogeneous data for improved algorithm operation in smart buildings. Future Gener. Comput. Syst. 102, 965–977 (2020)

    Article  Google Scholar 

  21. Chiu, C.-Y., Wang, H.-M.: Time-series linear search for video copies based on compact signature manipulation and containment relation modeling. IEEE Trans. Circuits Syst. Video Technol. 20(11), 1603–1613 (2010). https://doi.org/10.1109/TCSVT.2010.2087471

    Article  Google Scholar 

  22. Chiu, C.-Y., Tsai, T.-H., Liou, Y.-C., Han, G.-W., Chang, H.-S.: Near-duplicate subsequence matching between the continuous stream and large video dataset. IEEE Trans. Multimed. 16(7), 1952–1962 (2014). https://doi.org/10.1109/TMM.2014.2342668

    Article  Google Scholar 

  23. Kordopatis-Zilos, G., Papadopoulos, S., Patras, I., Kompatsiaris, Y.: Near-duplicate video retrieval by aggregating intermediate CNN layers. In: International Conference on Multimedia Modeling, pp. 251–263. Springer (2017)

    Google Scholar 

  24. Panagiotakis, C., Doulamis, A., Tziritas, G.: Equivalent key frames selection based on ISO-content principles. IEEE Trans. Circuits Syst. Video Technol. 19(3), 447–451 (2009)

    Article  Google Scholar 

  25. Paul, M.K.A., Kavitha, J., Rani, P.A.J.: Key-frame extraction techniques: a review. Recent Pat. Comput. Sci. 11(1), 3–16 (2018)

    Article  Google Scholar 

  26. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  27. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)

    Google Scholar 

  28. Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)

  29. Sculley, D.: Web-scale k-means clustering. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1177–1178. ACM (2010)

    Google Scholar 

  30. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  31. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Boston (1999)

    Google Scholar 

  32. Massey Jr., F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)

    Article  MATH  Google Scholar 

  33. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  34. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)

    Google Scholar 

Download references

Acknowledgments

This research has been supported by the project “Intelligent and sustainable mobility supported by multi-agent systems and edge computing (InEDGEMobility): Towards Sustainable Intelligent Mobility: Blockchain-based framework for IoT Security”, Ref.: RTI2018-095390-B-C32, (MCIU/AEI/FEDER, UE).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sara Rodríguez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hernández, G., Rodríguez, S., González, A., Corchado, J.M., Prieto, J. (2021). Video Analysis System Using Deep Learning Algorithms. In: Novais, P., Vercelli, G., Larriba-Pey, J.L., Herrera, F., Chamoso, P. (eds) Ambient Intelligence – Software and Applications . ISAmI 2020. Advances in Intelligent Systems and Computing, vol 1239. Springer, Cham. https://doi.org/10.1007/978-3-030-58356-9_19

Download citation

Publish with us

Policies and ethics