Video Analysis System Using Deep Learning Algorithms

Hernández, Guillermo; Rodríguez, Sara; González, Angélica; Corchado, Juan Manuel; Prieto, Javier

doi:10.1007/978-3-030-58356-9_19

Guillermo Hernández¹⁹,
Sara Rodríguez¹⁹,
Angélica González¹⁹,
Juan Manuel Corchado¹⁹ &
…
Javier Prieto¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1239))

Included in the following conference series:

International Symposium on Ambient Intelligence

603 Accesses
6 Citations

Abstract

Detection of video duplicates is an active field of research, motivated by the protection of intellectual property, the fight against piracy or the tracing of the origin of reused video segments.

In this work, a method for the detection of duplicate videos is proposed and implemented, making use of deep learning methods and techniques typical of the field of information recovery. This method has been evaluated with a data set usually used in the field, with which high average accuracies, above 85%, have been obtained. The effect of the different layers of the convolutional neural network used by the algorithm, the aggregation mechanisms that can be used on them, and the influence of the recovery model have been studied, finding a set of parameters that optimize the overall accuracy of the system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.scopus.com.
2.
https://github.com/BVLC/caffe/tree/master/models/bvlc_alexnet.
3.
Although in our case this is not possible due to the use of the code book.

References

Chou, C.-L., Chen, H.-T., Lee, S.-Y.: Pattern-based near-duplicate video retrieval and localization on web-scale videos. IEEE Trans. Multimed. 17(3), 382–395 (2015)
Article Google Scholar
Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimed. Tools Appl. 76(22), 24435–24456 (2017)
Article Google Scholar
Wu, X., Ngo, C.-W., Hauptmann, A., Tan, H.-K.: Real-time near-duplicate elimination for web video search with content and context. IEEE Trans. Multimed. 11(2), 196–207 (2009). https://doi.org/10.1109/TMM.2008.2009673
Article Google Scholar
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Practical elimination of near-duplicates from web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 218–227. ACM (2007)
Google Scholar
Hernandez, G.: Sistema de análisis de vídeo mediante la utilización del marco metodológico de los sistemas de razonamiento basados en casos y el uso de algoritmos de aprendizaje profundo. In: Avances en Informática y Automática - Décimotercer Workshop (2019)
Google Scholar
Garcıa-Peñalvo, F.: Revisiones y mapeos sistemáticos de literatura (2019). https://doi.org/10.5281/zenodo.2586725
Kitchenham, B., Charters, S.: Guidelines for performing systematic literature reviews in software engineering (2007)
Google Scholar
Hu, Y., Lu, X.: Learning spatial-temporal features for video copy detection by the combination of CNN and RNN. J. Vis. Commun. Image Represent. 55, 21–29 (2018). https://doi.org/10.1016/j.jvcir.2018.05.013
Article Google Scholar
Zhang, X., Xie, Y., Luan, X., He, J., Zhang, L., Wu, L.: Video copy detection based on deep CNN features and graph-based sequence matching. Wirel. Pers. Commun. 103(1), 401–416 (2018). https://doi.org/10.1007/s11277-018-5450-x
Article Google Scholar
Law-To, J., Buisson, O., Gouet-Brunet, V., Boujemaa, N.: ViCopT: a robust system for content-based video copy detection in large databases. Multimed. Syst. 15(6), 337–353 (2009). https://doi.org/10.1007/s00530-009-0164-2
Article Google Scholar
Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimed. Tools Appl. 76(22), 24435–24456 (2017). https://doi.org/10.1007/s11042-016-4176-6
Article Google Scholar
Liao, K., Liu, G.: An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering. J. Intell. Inf. Syst. 44(1), 133–158 (2014). https://doi.org/10.1007/s10844-014-0332-5
Article Google Scholar
Su, P.-C., Wu, C.-S.: Efficient copy detection for compressed digital videos by spatial and temporal feature extraction. Multimed. Tools Appl. 76(1), 1331–1353 (2017). https://doi.org/10.1007/s11042-015-3132-1
Article Google Scholar
Guzman-Zavaleta, Z., Feregrino-Uribe, C., Morales-Sandoval, M., Menendez-Ortiz, A.: A robust and low-cost video fingerprint extraction method for copy detection. Multimed. Tools Appl. 76(22), 24143–24163 (2017). https://doi.org/10.1007/s11042-016-4168-6
Article Google Scholar
Boukhari, A., Serir, A.: Weber Binarized Statistical Image Features (WBSIF) based video copy detection. J. Vis. Commun. Image Represent. 34, 50–64 (2016). https://doi.org/10.1016/j.jvcir.2015.10.015
Article Google Scholar
Chamoso, P., González-Briones, A., Rodrguez, S., Corchado, J.M.: Tendencies of technologies and platforms in smart cities: a state-of-the art review. Wirel. Commun. Mob. Comput. 2018, 17 (2018)
Article Google Scholar
Li, T., Sun, S., Bolić, M., Corchado, J.M.: Algorithm design for parallel implementation of the SMC-PHD filter. Sig. Process. 119, 115–127 (2016)
Article Google Scholar
Coria, J.A.G., Castellanos-Garzón, J.A., Corchado, J.M.: Intelligent business processes composition based on multi-agent systems. Expert Syst. Appl. 41(4), 1189–1205 (2014)
Article Google Scholar
Bullón, J., González Arrieta, A., Hernández Encinas, A., Queiruga Dios, A., et al.: Manufacturing processes in the textile industry. Expert Syst. Fabrics Prod. 6(1), 41–50 (2017)
Google Scholar
Casado-Vara, R., Martin-del Rey, A., Affes, S., Prieto, J., Corchado, J.M.: IoT network slicing on virtual layers of homogeneous data for improved algorithm operation in smart buildings. Future Gener. Comput. Syst. 102, 965–977 (2020)
Article Google Scholar
Chiu, C.-Y., Wang, H.-M.: Time-series linear search for video copies based on compact signature manipulation and containment relation modeling. IEEE Trans. Circuits Syst. Video Technol. 20(11), 1603–1613 (2010). https://doi.org/10.1109/TCSVT.2010.2087471
Article Google Scholar
Chiu, C.-Y., Tsai, T.-H., Liou, Y.-C., Han, G.-W., Chang, H.-S.: Near-duplicate subsequence matching between the continuous stream and large video dataset. IEEE Trans. Multimed. 16(7), 1952–1962 (2014). https://doi.org/10.1109/TMM.2014.2342668
Article Google Scholar
Kordopatis-Zilos, G., Papadopoulos, S., Patras, I., Kompatsiaris, Y.: Near-duplicate video retrieval by aggregating intermediate CNN layers. In: International Conference on Multimedia Modeling, pp. 251–263. Springer (2017)
Google Scholar
Panagiotakis, C., Doulamis, A., Tziritas, G.: Equivalent key frames selection based on ISO-content principles. IEEE Trans. Circuits Syst. Video Technol. 19(3), 447–451 (2009)
Article Google Scholar
Paul, M.K.A., Kavitha, J., Rani, P.A.J.: Key-frame extraction techniques: a review. Recent Pat. Comput. Sci. 11(1), 3–16 (2018)
Article Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Google Scholar
Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
Sculley, D.: Web-scale k-means clustering. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1177–1178. ACM (2010)
Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)
Article Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Boston (1999)
Google Scholar
Massey Jr., F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)
Article MATH Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Google Scholar

Download references

Acknowledgments

This research has been supported by the project “Intelligent and sustainable mobility supported by multi-agent systems and edge computing (InEDGEMobility): Towards Sustainable Intelligent Mobility: Blockchain-based framework for IoT Security”, Ref.: RTI2018-095390-B-C32, (MCIU/AEI/FEDER, UE).

Author information

Authors and Affiliations

BISITE Research Group, University of Salamanca, Edificio Multiusos I+D+i Calle Espejo 2, Salamanca, Spain
Guillermo Hernández, Sara Rodríguez, Angélica González, Juan Manuel Corchado & Javier Prieto

Authors

Guillermo Hernández
View author publications
You can also search for this author in PubMed Google Scholar
Sara Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Angélica González
View author publications
You can also search for this author in PubMed Google Scholar
Juan Manuel Corchado
View author publications
You can also search for this author in PubMed Google Scholar
Javier Prieto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sara Rodríguez .

Editor information

Editors and Affiliations

Departamento de Informática, University of Minho, ALGORITMI Center, Braga, Portugal
Paulo Novais
DIBRIS, University of Genoa, Genoa, Italy
Gianni Vercelli
Data Management Group, Technical University of Catalonia, Barcelona, Barcelona, Spain
Josep L. Larriba-Pey
Department Computer Science and Artificial Intelligence, ETS de Ingenierias Informática y de Telecomunicación, University of Granada, Granada, Spain
Francisco Herrera
BISITE Research Group, University of Salamanca, Salamanca, Salamanca, Spain
Pablo Chamoso

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hernández, G., Rodríguez, S., González, A., Corchado, J.M., Prieto, J. (2021). Video Analysis System Using Deep Learning Algorithms. In: Novais, P., Vercelli, G., Larriba-Pey, J.L., Herrera, F., Chamoso, P. (eds) Ambient Intelligence – Software and Applications . ISAmI 2020. Advances in Intelligent Systems and Computing, vol 1239. Springer, Cham. https://doi.org/10.1007/978-3-030-58356-9_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-58356-9_19
Published: 10 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58355-2
Online ISBN: 978-3-030-58356-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics