Skip to main content

Videofall - A Hierarchical Search Engine for VBS2022

  • Conference paper
  • First Online:
MultiMedia Modeling (MMM 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13142))

Included in the following conference series:

Abstract

In this paper, we introduce a multi-user hierarchical video search tool called Videofall. Our objective, in the Video Browser Showdown (VBS) 2022, is to explore if Videofall interactive video retrieval under time constraints is a useful approach to take, given the overhead of requiring multiple users. It is our conjecture that combining different skills of normal users can support a master user to retrieve target videos efficiently. The system is designed on top of the CLIP pre-trained model and the video keyframes are embedded into a vector space in which queries would also be encoded to facilitate retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amato, G., et al.: VISIONE at video browser showdown 2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 473–478. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_47

    Chapter  Google Scholar 

  2. Berns, F., Rossetto, L., Schoeffmann, K., Beecks, C., Awad, G.: V3C1 dataset: an evaluation of content characteristics. In: Proceedings of the 2019 on International Conference on Multimedia Retrieval, ICMR 2019, pp. 334–338. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3323873.3325051

  3. Heller, S., et al.: Towards explainable interactive multi-modal video retrieval with vitrivr. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 435–440. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_41

    Chapter  Google Scholar 

  4. Radford, A., et al.: Learning transferable visual models from natural language supervision. CoRR arXiv:2103.00020 (2021)

  5. Rossetto, L., Schoeffmann, K., Bernstein, A.: Insights on the V3C2 dataset. CoRR arXiv:2105.01475 (2021)

  6. Rossetto, L., Schuldt, H., Awad, G., Butt, A.A.: V3C - a research video collection. CoRR arXiv:1810.04401 (2018)

  7. Schoeffmann, K., et al.: Collaborative feature maps for interactive video search. In: Amsaleg, L., Guðmundsson, G.Þ, Gurrin, C., Jónsson, B.Þ, Satoh, S. (eds.) MMM 2017. LNCS, vol. 10133, pp. 457–462. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51814-5_41

    Chapter  Google Scholar 

  8. Tran, L., et al.: A VR interface for browsing visual spaces at VBS2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 490–495. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_50

    Chapter  Google Scholar 

  9. Veselý, P., Mejzlík, F., Lokoč, J.: SOMHunter V2 at video browser showdown 2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 461–466. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_45

    Chapter  Google Scholar 

Download references

Acknowledgments

This research was conducted with the financial support of Science Foundation Ireland under Grant Agreement No. 18/CRT/6223, and 13/RC/2106_P2 at the ADAPT SFI Research Centre at DCU. ADAPT, the SFI Research Centre for AI-Driven Digital Content Technology, is funded by Science Foundation Ireland through the SFI Research Centres Programme.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thao-Nhu Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nguyen, TN., Puangthamawathanakun, B., Healy, G., Nguyen, B.T., Gurrin, C., Caputo, A. (2022). Videofall - A Hierarchical Search Engine for VBS2022. In: Þór Jónsson, B., et al. MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science, vol 13142. Springer, Cham. https://doi.org/10.1007/978-3-030-98355-0_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-98355-0_48

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-98354-3

  • Online ISBN: 978-3-030-98355-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics