Skip to main content

Lambda-IVR: An Indexing Framework for Video Big Data Retrieval Using Distributed In-memory Computation in the Cloud

  • Conference paper
  • First Online:
Innovations in Smart Cities Applications Volume 4 (SCA 2020)

Abstract

Feature indexing for video retrieval poses a significant hurdle for indexing due to three significant challenges. First, there are different types of features in varying nature, such as deep Convolutional Neural Network (CNN) features, handcrafted features, recognized text from the videos, and audio features, etc. Secondly, feature matching for those varying types of features requires different similarity measure methods. And thirdly, considering the Big-Data era the number of features to be indexed is enormous. To address these issues, in this paper, we present a lambda style distributed in-memory scale-out inverted-index based feature indexing framework for video retrieval, which operates as SaaS in the cloud. First, the video features are acquired, decoupled, and the visual features are encoded using an adaptation of an existing feature encoder with improvements. Secondly, the visual encoded features and the textual features are aggregated. Finally, the aggregated features are indexed and readily available for retrieval. Our framework supports incremental updates without the need to re-index the data and can serve enormous concurrent queries. Experimental results show that our framework performs reasonably well in terms of, accuracy, precision, and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Adamu, F.B., Habbal, A., Hassan, S., Cottrell, R.L., White, B., Abdullah, I.: A survey on big data indexing strategies. Tech. rep., SLAC National Accelerator Laboratory (2015)

    Google Scholar 

  2. Alvarez, V., Richter, S., Chen, X., Dittrich, J.: A comparison of adaptive radix trees and hash tables. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 1227–1238 (2015). https://doi.org/10.1109/ICDE.2015.7113370

  3. Amato, F., Santo, A.D., Gargiulo, F., Moscato, V., Persia, F., Picariello, A., Poccia, S.R.: Semtree: an index for supporting semantic retrieval of documents. In: 2015 31st IEEE International Conference on Data Engineering Workshops, pp. 62–67 (2015). https://doi.org/10.1109/ICDEW.2015.7129546

  4. Amato, G., Bolettieri, P., Carrara, F., Falchi, F., Gennaro, C.: Large-scale image retrieval with elasticsearch. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, SIGIR 2018, pp. 925–928. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3209978.3210089

  5. Amato, G., Debole, F., Falchi, F., Gennaro, C., Rabitti, F.: Large scale indexing and searching deep convolutional neural network features. In: International Conference on Big Data Analytics and Knowledge Discovery, pp. 213–224. Springer (2016)

    Google Scholar 

  6. Chatterjee, K., Chen, S.C.: Hah-tree: towards a multidimensional index structure supporting different video modelling approaches in a video database management system. Int. J. Inf. Decis. Sci. 2(2), 188–207 (2010)

    Google Scholar 

  7. Chen, X., Zhang, C., Chen, S.C., Chen, M.: A latent semantic indexing based method for solving multiple instance learning problem in region-based image retrieval. In: Seventh IEEE International Symposium on Multimedia (ISM 2005), p. 8 (2005). https://doi.org/10.1109/ISM.2005.10

  8. Eldawy, A., Mokbel, M.F.: Spatialhadoop: a mapreduce framework for spatial data. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 1352–1363 (2015). https://doi.org/10.1109/ICDE.2015.7113382

  9. Fleites, F.C., Chen, S.: Efficient content-based multimedia retrieval using novel indexing structure in PostgreSQL. In: 2013 IEEE International Symposium on Multimedia, pp. 500–501 (2013). https://doi.org/10.1109/ISM.2013.96

  10. Fleites, F.C., Chen, S.C., Chatterjee, K.: A semantic index structure for multimedia retrieval. Int. J. Semant. Comput. 6(02), 155–178 (2012)

    Article  Google Scholar 

  11. Gani, A., Siddiqa, A., Shamshirband, S., Hanum, F.: A survey on indexing techniques for big data: taxonomy and performance evaluation. Knowl. Inf. Syst. 46(2), 241–284 (2016). https://doi.org/10.1007/s10115-015-0830-y

    Article  Google Scholar 

  12. Giangreco, I., Kabary, I.A., Schuldt, H.: Adam - a database and information retrieval system for big multimedia collections. In: 2014 IEEE International Congress on Big Data, pp. 406–413 (2014). https://doi.org/10.1109/BigData.Congress.2014.66

  13. Gollub, T., Völske, M., Hagen, M., Stein, B.: Dynamic taxonomy composition via keyqueries. In: IEEE/ACM Joint Conference on Digital Libraries, pp. 39–48 (2014). https://doi.org/10.1109/JCDL.2014.6970148

  14. Hellerstein, J.M., Naughton, J.F., Pfeffer, A.: Generalized search trees for database systems (September 1995)

    Google Scholar 

  15. Khan, M.N., Alam, A., Lee, Y.: Falkon: large-scale content-based video retrieval utilizing deep-features and distributed in-memory computing. In: 2020 IEEE International Conference on Big Data and Smart Computing (BigComp), pp. 36–43 (2020). https://doi.org/10.1109/BigComp48618.2020.0-102

  16. Khan, M.N., Alam, A., Uddin, M.A., Lee, Y.K.: SurVRet: distributed surveillance video retrieval on large-scale video data using deep learning tt - survret: , pp. 892–894. The Korean Institute of Information Scientists and Engineers (June 2019). http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE08763364

  17. Liu, R., Wei, S., Zhao, Y., Yang, Y.: Indexing of the CNN features for the large scale image search. Multimed. Tools Appl. 77(24), 32107–32131 (2018). https://doi.org/10.1007/s11042-018-6210-3

    Article  Google Scholar 

  18. Matsui, A., Nishimura, S., Katsura, S.: A classification method of motion database using hidden Markov model. In: 2014 IEEE 23rd International Symposium on Industrial Electronics (ISIE), pp. 2232–2237 (2014). https://doi.org/10.1109/ISIE.2014.6864965

  19. Khan, M.N., Alam, A., Islam, M.A., Khan, J., Lee, Y.K.: DISIVR: distributed deep feature indexer for video big dataretrieval on spark. , pp. 118–120 (2020). http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE09874358

  20. Pouyanfar, S., Yang, Y., Chen, S.C., Shyu, M.L., Iyengar, S.S.: Multimedia big data analytics: a survey. ACM Comput. Surv. 51(1), 1–34 (2018). https://doi.org/10.1145/3150226

    Article  Google Scholar 

  21. Seres, A.: Three database management systems (DBMS) compared. Open Source Sci. J. 2(4), 65–82 (2010)

    Google Scholar 

  22. Shahi, D.: Apache Solr. Springer (2016)

    Google Scholar 

  23. Soomro, K., Zamir, A.R., Shah, M.: Ucf101: a dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012)

  24. Wei, S., Wu, X., Xu, D.: Partitioned k-means clustering for fast construction of unbiased visual vocabulary. In: The Era of Interactive Media, pp. 483–493. Springer, New York (2013)

    Google Scholar 

  25. White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc. (2012)

    Google Scholar 

  26. Xu, H., Yao, N., Hu, W., Pan, H., Gao, X.: The design and implementation of image information retrieval. In: 2012 International Conference on Computer Science and Service System, pp. 1547–1550 (2012). https://doi.org/10.1109/CSSS.2012.387

  27. Zaharia, M., Xin, R., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M., Gonzalez, J., Shenker, S., Stoica, I.: Apache spark: a unified engine for big data processing. Commun. ACM 59, 56–65 (2016). https://doi.org/10.1145/2934664

    Article  Google Scholar 

  28. Zhang, C., Lin, Y., Zhu, L., Liu, A., Zhang, Z., Huang, F.: CNN-VWII: an efficient approach for large-scale video retrieval by image queries. Pattern Recogn. Lett. 123, 82–88 (2019). https://doi.org/10.1016/j.patrec.2019.03.015. http://www.sciencedirect.com/science/article/pii/S0167865518308341

    Article  Google Scholar 

Download references

Acknowledgment

This work was supported by the Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2016-0-00406, SIAT CCTV Cloud Platform).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Young-Koo Lee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Khan, M.N., Alam, A., Afridi, T.H., Khalid, S., Lee, YK. (2021). Lambda-IVR: An Indexing Framework for Video Big Data Retrieval Using Distributed In-memory Computation in the Cloud. In: Ben Ahmed, M., Rakıp Karaș, İ., Santos, D., Sergeyeva, O., Boudhir, A.A. (eds) Innovations in Smart Cities Applications Volume 4. SCA 2020. Lecture Notes in Networks and Systems, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-030-66840-2_105

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-66840-2_105

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-66839-6

  • Online ISBN: 978-3-030-66840-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics