Abstract
In this paper, we introduce a multi-user hierarchical video search tool called Videofall. Our objective, in the Video Browser Showdown (VBS) 2022, is to explore if Videofall interactive video retrieval under time constraints is a useful approach to take, given the overhead of requiring multiple users. It is our conjecture that combining different skills of normal users can support a master user to retrieve target videos efficiently. The system is designed on top of the CLIP pre-trained model and the video keyframes are embedded into a vector space in which queries would also be encoded to facilitate retrieval.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amato, G., et al.: VISIONE at video browser showdown 2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 473–478. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_47
Berns, F., Rossetto, L., Schoeffmann, K., Beecks, C., Awad, G.: V3C1 dataset: an evaluation of content characteristics. In: Proceedings of the 2019 on International Conference on Multimedia Retrieval, ICMR 2019, pp. 334–338. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3323873.3325051
Heller, S., et al.: Towards explainable interactive multi-modal video retrieval with vitrivr. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 435–440. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_41
Radford, A., et al.: Learning transferable visual models from natural language supervision. CoRR arXiv:2103.00020 (2021)
Rossetto, L., Schoeffmann, K., Bernstein, A.: Insights on the V3C2 dataset. CoRR arXiv:2105.01475 (2021)
Rossetto, L., Schuldt, H., Awad, G., Butt, A.A.: V3C - a research video collection. CoRR arXiv:1810.04401 (2018)
Schoeffmann, K., et al.: Collaborative feature maps for interactive video search. In: Amsaleg, L., Guðmundsson, G.Þ, Gurrin, C., Jónsson, B.Þ, Satoh, S. (eds.) MMM 2017. LNCS, vol. 10133, pp. 457–462. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51814-5_41
Tran, L., et al.: A VR interface for browsing visual spaces at VBS2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 490–495. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_50
Veselý, P., Mejzlík, F., Lokoč, J.: SOMHunter V2 at video browser showdown 2021. In: Lokoč, J., et al. (eds.) MMM 2021. LNCS, vol. 12573, pp. 461–466. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67835-7_45
Acknowledgments
This research was conducted with the financial support of Science Foundation Ireland under Grant Agreement No. 18/CRT/6223, and 13/RC/2106_P2 at the ADAPT SFI Research Centre at DCU. ADAPT, the SFI Research Centre for AI-Driven Digital Content Technology, is funded by Science Foundation Ireland through the SFI Research Centres Programme.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Nguyen, TN., Puangthamawathanakun, B., Healy, G., Nguyen, B.T., Gurrin, C., Caputo, A. (2022). Videofall - A Hierarchical Search Engine for VBS2022. In: Þór Jónsson, B., et al. MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science, vol 13142. Springer, Cham. https://doi.org/10.1007/978-3-030-98355-0_48
Download citation
DOI: https://doi.org/10.1007/978-3-030-98355-0_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98354-3
Online ISBN: 978-3-030-98355-0
eBook Packages: Computer ScienceComputer Science (R0)