Abstract
Feature indexing for video retrieval poses a significant hurdle for indexing due to three significant challenges. First, there are different types of features in varying nature, such as deep Convolutional Neural Network (CNN) features, handcrafted features, recognized text from the videos, and audio features, etc. Secondly, feature matching for those varying types of features requires different similarity measure methods. And thirdly, considering the Big-Data era the number of features to be indexed is enormous. To address these issues, in this paper, we present a lambda style distributed in-memory scale-out inverted-index based feature indexing framework for video retrieval, which operates as SaaS in the cloud. First, the video features are acquired, decoupled, and the visual features are encoded using an adaptation of an existing feature encoder with improvements. Secondly, the visual encoded features and the textual features are aggregated. Finally, the aggregated features are indexed and readily available for retrieval. Our framework supports incremental updates without the need to re-index the data and can serve enormous concurrent queries. Experimental results show that our framework performs reasonably well in terms of, accuracy, precision, and efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adamu, F.B., Habbal, A., Hassan, S., Cottrell, R.L., White, B., Abdullah, I.: A survey on big data indexing strategies. Tech. rep., SLAC National Accelerator Laboratory (2015)
Alvarez, V., Richter, S., Chen, X., Dittrich, J.: A comparison of adaptive radix trees and hash tables. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 1227–1238 (2015). https://doi.org/10.1109/ICDE.2015.7113370
Amato, F., Santo, A.D., Gargiulo, F., Moscato, V., Persia, F., Picariello, A., Poccia, S.R.: Semtree: an index for supporting semantic retrieval of documents. In: 2015 31st IEEE International Conference on Data Engineering Workshops, pp. 62–67 (2015). https://doi.org/10.1109/ICDEW.2015.7129546
Amato, G., Bolettieri, P., Carrara, F., Falchi, F., Gennaro, C.: Large-scale image retrieval with elasticsearch. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, SIGIR 2018, pp. 925–928. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3209978.3210089
Amato, G., Debole, F., Falchi, F., Gennaro, C., Rabitti, F.: Large scale indexing and searching deep convolutional neural network features. In: International Conference on Big Data Analytics and Knowledge Discovery, pp. 213–224. Springer (2016)
Chatterjee, K., Chen, S.C.: Hah-tree: towards a multidimensional index structure supporting different video modelling approaches in a video database management system. Int. J. Inf. Decis. Sci. 2(2), 188–207 (2010)
Chen, X., Zhang, C., Chen, S.C., Chen, M.: A latent semantic indexing based method for solving multiple instance learning problem in region-based image retrieval. In: Seventh IEEE International Symposium on Multimedia (ISM 2005), p. 8 (2005). https://doi.org/10.1109/ISM.2005.10
Eldawy, A., Mokbel, M.F.: Spatialhadoop: a mapreduce framework for spatial data. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 1352–1363 (2015). https://doi.org/10.1109/ICDE.2015.7113382
Fleites, F.C., Chen, S.: Efficient content-based multimedia retrieval using novel indexing structure in PostgreSQL. In: 2013 IEEE International Symposium on Multimedia, pp. 500–501 (2013). https://doi.org/10.1109/ISM.2013.96
Fleites, F.C., Chen, S.C., Chatterjee, K.: A semantic index structure for multimedia retrieval. Int. J. Semant. Comput. 6(02), 155–178 (2012)
Gani, A., Siddiqa, A., Shamshirband, S., Hanum, F.: A survey on indexing techniques for big data: taxonomy and performance evaluation. Knowl. Inf. Syst. 46(2), 241–284 (2016). https://doi.org/10.1007/s10115-015-0830-y
Giangreco, I., Kabary, I.A., Schuldt, H.: Adam - a database and information retrieval system for big multimedia collections. In: 2014 IEEE International Congress on Big Data, pp. 406–413 (2014). https://doi.org/10.1109/BigData.Congress.2014.66
Gollub, T., Völske, M., Hagen, M., Stein, B.: Dynamic taxonomy composition via keyqueries. In: IEEE/ACM Joint Conference on Digital Libraries, pp. 39–48 (2014). https://doi.org/10.1109/JCDL.2014.6970148
Hellerstein, J.M., Naughton, J.F., Pfeffer, A.: Generalized search trees for database systems (September 1995)
Khan, M.N., Alam, A., Lee, Y.: Falkon: large-scale content-based video retrieval utilizing deep-features and distributed in-memory computing. In: 2020 IEEE International Conference on Big Data and Smart Computing (BigComp), pp. 36–43 (2020). https://doi.org/10.1109/BigComp48618.2020.0-102
Khan, M.N., Alam, A., Uddin, M.A., Lee, Y.K.: SurVRet: distributed surveillance video retrieval on large-scale video data using deep learning tt - survret: , pp. 892–894. The Korean Institute of Information Scientists and Engineers (June 2019). http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE08763364
Liu, R., Wei, S., Zhao, Y., Yang, Y.: Indexing of the CNN features for the large scale image search. Multimed. Tools Appl. 77(24), 32107–32131 (2018). https://doi.org/10.1007/s11042-018-6210-3
Matsui, A., Nishimura, S., Katsura, S.: A classification method of motion database using hidden Markov model. In: 2014 IEEE 23rd International Symposium on Industrial Electronics (ISIE), pp. 2232–2237 (2014). https://doi.org/10.1109/ISIE.2014.6864965
Khan, M.N., Alam, A., Islam, M.A., Khan, J., Lee, Y.K.: DISIVR: distributed deep feature indexer for video big dataretrieval on spark. , pp. 118–120 (2020). http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE09874358
Pouyanfar, S., Yang, Y., Chen, S.C., Shyu, M.L., Iyengar, S.S.: Multimedia big data analytics: a survey. ACM Comput. Surv. 51(1), 1–34 (2018). https://doi.org/10.1145/3150226
Seres, A.: Three database management systems (DBMS) compared. Open Source Sci. J. 2(4), 65–82 (2010)
Shahi, D.: Apache Solr. Springer (2016)
Soomro, K., Zamir, A.R., Shah, M.: Ucf101: a dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012)
Wei, S., Wu, X., Xu, D.: Partitioned k-means clustering for fast construction of unbiased visual vocabulary. In: The Era of Interactive Media, pp. 483–493. Springer, New York (2013)
White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc. (2012)
Xu, H., Yao, N., Hu, W., Pan, H., Gao, X.: The design and implementation of image information retrieval. In: 2012 International Conference on Computer Science and Service System, pp. 1547–1550 (2012). https://doi.org/10.1109/CSSS.2012.387
Zaharia, M., Xin, R., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M., Gonzalez, J., Shenker, S., Stoica, I.: Apache spark: a unified engine for big data processing. Commun. ACM 59, 56–65 (2016). https://doi.org/10.1145/2934664
Zhang, C., Lin, Y., Zhu, L., Liu, A., Zhang, Z., Huang, F.: CNN-VWII: an efficient approach for large-scale video retrieval by image queries. Pattern Recogn. Lett. 123, 82–88 (2019). https://doi.org/10.1016/j.patrec.2019.03.015. http://www.sciencedirect.com/science/article/pii/S0167865518308341
Acknowledgment
This work was supported by the Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2016-0-00406, SIAT CCTV Cloud Platform).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Khan, M.N., Alam, A., Afridi, T.H., Khalid, S., Lee, YK. (2021). Lambda-IVR: An Indexing Framework for Video Big Data Retrieval Using Distributed In-memory Computation in the Cloud. In: Ben Ahmed, M., Rakıp Karaș, İ., Santos, D., Sergeyeva, O., Boudhir, A.A. (eds) Innovations in Smart Cities Applications Volume 4. SCA 2020. Lecture Notes in Networks and Systems, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-030-66840-2_105
Download citation
DOI: https://doi.org/10.1007/978-3-030-66840-2_105
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-66839-6
Online ISBN: 978-3-030-66840-2
eBook Packages: EngineeringEngineering (R0)