
VERGE in VBS 2020

Conference paper in MultiMedia Modeling (MMM 2020)

Abstract

This paper demonstrates VERGE, an interactive video retrieval engine for browsing a collection of images or videos and searching for specific content. The engine integrates a multitude of retrieval methodologies, including visual and textual search, along with further capabilities such as fusion and reranking. All search options and results are presented in a web application designed for a user-friendly experience.
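As a rough illustration of the fusion and reranking capability, the sketch below fuses the ranked result lists returned by two search modalities into a single reranked list. It is a minimal sketch, not VERGE's documented method: reciprocal rank fusion is assumed here purely as a stand-in, and the names reciprocal_rank_fusion, visual_hits, and textual_hits are hypothetical.

    # Minimal late-fusion sketch (an assumption, not VERGE's actual method).
    # Each modality (e.g., visual similarity, textual search) is assumed to
    # return a best-first ranked list of shot IDs.
    from collections import defaultdict

    def reciprocal_rank_fusion(ranked_lists, k=60):
        """Fuse several ranked lists into one reranked list via standard RRF."""
        scores = defaultdict(float)
        for ranking in ranked_lists:
            for rank, item in enumerate(ranking, start=1):
                scores[item] += 1.0 / (k + rank)  # better ranks contribute more
        return sorted(scores, key=scores.get, reverse=True)

    # Hypothetical usage: items retrieved by both modalities rise to the top.
    visual_hits = ["shot_42", "shot_7", "shot_13"]
    textual_hits = ["shot_7", "shot_99", "shot_42"]
    print(reciprocal_rank_fusion([visual_hits, textual_hits]))
    # -> ['shot_7', 'shot_42', 'shot_99', 'shot_13']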



Acknowledgements

This work was supported by the EU’s Horizon 2020 research and innovation programme under grant agreements H2020-779962 V4Design, H2020-786731 CONNEXIONs and H2020-780656 ReTV.

Author information

Correspondence to Stelios Andreadis.


Copyright information

© 2020 Springer Nature Switzerland AG

About this paper


Cite this paper

Andreadis, S., et al. (2020). VERGE in VBS 2020. In: Ro, Y., et al. (eds.) MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science, vol. 11962. Springer, Cham. https://doi.org/10.1007/978-3-030-37734-2_69


  • DOI: https://doi.org/10.1007/978-3-030-37734-2_69

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37733-5

  • Online ISBN: 978-3-030-37734-2

  • eBook Packages: Computer Science (R0)
