Skip to main content
Log in

An efficient access method for multimodal video retrieval

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper presents the Slim 2-tree, an efficient and effective content-based video retrieval technique allowing the use of multiple modalities within a single index structure. Slim 2-tree is capable of dealing with different distance measures for the modalities and can perform both multimodal and unimodal searches using the same tree structure. Experimental studies on a large real dataset show the video similarity search performance of the proposed technique. Additionally, we present experiments comparing our method against state-of-the-art of multimodal solutions. Comparative test results demonstrate that our technique improves the performance of video similarity queries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. Available at http://lear.inrialpes.fr/src/lear_gist-1.1.tgz

References

  • Almeida J, Valle E, Torres RS, Leite NJ (2010) DAHC-tree: an effective index for approximate search in high-dimensional metric spaces. JIDM 1(3):375–390

    Google Scholar 

  • Atrey PK, Hossain MA, El Saddik A, Kankanhalli MS (2010) Multimodal fusion for multimedia analysis: a survey. Multimedia Systems 16(6):345–379

    Article  Google Scholar 

  • Bustos B, Kreft S, Skopal T (2012) Adapting metric indexes for searching in multi-metric spaces. Multimed Tools Appl 58(3):467–496

    Article  Google Scholar 

  • Chávez E, Navarro G, Baeza-Yates R, Marroquín JL (2001) Searching in metric spaces. ACM Comput Surv 33(3):273–321

    Article  Google Scholar 

  • Ciaccia P, Patella M (2000) The M2-tree: processing complex multi-feature queries with just one index. In: 1st DELOS workshop: ISSQDL

  • Ciaccia P, Patella M, Zezula P (1997) M-tree: an efficient access method for similarity search in metric spaces. In: Proc. 23rd VLDB97. pp 426–435

  • Döller M, Stegmaier F, Jans S, Kosch H (2012) TempoM2: a multi feature index structure for temporal video search. In: AMM, vol 7131, pp 323–333. Springer

  • Douze M, Jégou H, Sandhawalia H, Amsaleg L, Schmid C (2009) Evaluation of gist descriptors for web-scale image search. In: Proc. ACM CIVR’09, pp 19:1–19:8. ACM, New York. doi:10.1145/1.646396.1646421

  • Gaede V, Günther O (1998) Multidimensional access methods. ACM Comput Surv 30(2):170–231

    Article  Google Scholar 

  • Ganchev T., Fakotakis N., Kokkinakis G. Comparative evaluation of various MFCC implementations on the speaker verification task. In: Proc. SPECOM. pp 191–194

  • Goh ST, Tan KL (2000) MOSAIC: a fast multi-feature image retrieval system. Data Knowl Eng 33(3):219–239

    Article  MATH  Google Scholar 

  • He Y, Yu J (2010) MFI-tree: a effective multi-feature index structure for weighted query application. Comput Sci Inf Syst 7(1):139–152

    Article  Google Scholar 

  • Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. IJCV 42(3):145–175

    Article  MATH  Google Scholar 

  • Shao J, Shen HT, Zhou X (2008) Challenges and techniques for effective and efficient similarity search in large video databases. PLDV 1(2):1598–1603

    Google Scholar 

  • Traina C, Traina A, Seeger B, Faloutsos C (2000) Slim-trees: high performance metric trees minimizing overlap between nodes. In: 7th EDBT. pp 51–65

  • Yan R, Hauptmann AG (2007) A review of text and image retrieval approaches for broadcast news video. Inf Retr 10(4–5):445–484

    Article  Google Scholar 

  • Zezula P, Amato G, Dohnal V, Batko M (2010) Similarity search: the metric space approach. Advances in database systems. Springer

Download references

Acknowledgments

The authors are grateful to PUC Minas, CNPq, CAPES and FAPEMIG for the financial support of this work. The authors also thank to the anonymous reviewers for their valuable comments and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zenilton K. G. Patrocínio Jr..

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sperandio, R.C., Patrocínio, Z.K.G., de Paula, H.B. et al. An efficient access method for multimodal video retrieval. Multimed Tools Appl 74, 1357–1375 (2015). https://doi.org/10.1007/s11042-014-1917-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-1917-2

Keywords

Navigation