Explicit Performance Metric Optimization for Fusion-Based Video Retrieval

Kim, Ilseo; Oh, Sangmin; Byun, Byungki; Perera, A. G. Amitha; Lee, Chin-Hui

doi:10.1007/978-3-642-33885-4_40

Ilseo Kim¹⁹,
Sangmin Oh²⁰,
Byungki Byun²¹,
A. G. Amitha Perera²⁰ &
…
Chin-Hui Lee¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7585))

Included in the following conference series:

European Conference on Computer Vision

4092 Accesses
7 Citations

Abstract

We present a learning framework for fusion-based video retrieval system, which explicitly optimizes given performance metrics. Real-world computer vision systems serve sophisticated user needs, and domain-specific performance metrics are used to monitor the success of such systems. However, the conventional approach for learning under such circumstances is to blindly minimize standard error rates and hope the targeted performance metrics improve, which is clearly suboptimal. In this work, a novel scheme to directly optimize such targeted performance metrics during learning is developed and presented. Our experimental results on two large consumer video archives are promising and showcase the benefits of the proposed approach.

Download to read the full chapter text

Chapter PDF

Efficient, robust and divisible paired comparison for subjective quality assessment

Article 05 July 2017

Rui Song, Yunsong Li, … Peng Rao

Survey on the State-Of-The-Art Methods for Objective Video Quality Assessment in Recognition Tasks

Toward an objective benchmark for video completion

Article 09 November 2018

Alexander Bokov, Dmitriy Vatolin, … Yury Gitman

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Toderici, G., Aradhye, H., Pasca, M., Sbaiz, L., Yagnik, J.: Finding meaning on youtube: Tag recommendation and category discovery. In: CVPR (2010)
Google Scholar
Jiang, Y.G., Ye, G., Chang, S.F., Ellis, D., Loui, A.C.: Consumer video understanding: A benchmark database and an evaluation of human and machine performance. In: ACM ICMR (2011)
Google Scholar
Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and trecvid. In: ACM MIR (2006)
Google Scholar
Wang, Z., Zhao, M., Song, Y., Kumar, S., Li, B.: Youtubecat: Learning to categorize wild web videos. In: CVPR (2010)
Google Scholar
Yang, W., Toderici, G.: Discriminative tag learning on youtube videos with latent sub-tags. In: CVPR (2011)
Google Scholar
Joachims, T.: A support vector method for multivariate performance measures. In: ICML (2005)
Google Scholar
Calonder, M., Lepetit, V., Fua, P.: Pareto-optimal Dictionaries for Signatures. In: CVPR (2010)
Google Scholar
Gao, S., Wu, W., Lee, C.H., Chua, T.S.: A mfom learning approach to robust multiclass multi-label text categorization. In: ICML (2004)
Google Scholar
Varma, M., Ray, D.: Learning the discriminative power-invariance trade-off. In: ICCV (2007)
Google Scholar
Gehler, P.V., Nowozin, S.: On feature combination for multiclass object classification. In: IEEE International Conference on Computer Vision, ICCV (2009)
Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos ïn the wild.̈ In: CVPR (2009)
Google Scholar
Katagiri, S., Juang, B.H., Lee, C.H.: Pattern recognition using a family of design algorithm based upon the generalized probabilistic descent method. Proc. of the IEEE, 2345–2373 (1998)
Google Scholar
Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: CVPR (2008)
Google Scholar
Kläser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: BMVC (2008)
Google Scholar
Li, L.J., Su, H., Xing, E.P., Fei-Fei, L.: Object bank: A high-level image representation for scene classification and semantic feature sparsification. In: Proceedings of the Neural Information Processing Systems, NIPS (2010)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope 42, 145–175 (2001)
Google Scholar
Lee, C.H., Soong, F., Juan, B.H.: A segment model based approach to speech recognition. In: ICASSP (1988)
Google Scholar
Martin, A.F., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The DET curve in assessment of detection task performance. In: Eurospeech (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Georgia Institute of Technology, USA
Ilseo Kim & Chin-Hui Lee
Kitware Inc., USA
Sangmin Oh & A. G. Amitha Perera
Microsoft, USA
Byungki Byun

Authors

Ilseo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sangmin Oh
View author publications
You can also search for this author in PubMed Google Scholar
Byungki Byun
View author publications
You can also search for this author in PubMed Google Scholar
A. G. Amitha Perera
View author publications
You can also search for this author in PubMed Google Scholar
Chin-Hui Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Ingegneria Elettrica, Gestionale e Meccanica (DIEGM), Università degli Studi di Udine, Via delle Scienze, 208, 33100, Udine, Italy
Andrea Fusiello
IIT Istituto Italiano di Tecnologia, Via Morego 30, 16163, Genoa, Italy
Vittorio Murino
Dipartimento di Ingegneria dell’Informazione, Università degli Studi di Modena e Reggio Emilia, Strada Vignolege, 905, 41125, Modena, Italy
Rita Cucchiara

Additional information

This work was supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number D11PC20069. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DOI/NBC, or the U.S. Government.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, I., Oh, S., Byun, B., Perera, A.G.A., Lee, CH. (2012). Explicit Performance Metric Optimization for Fusion-Based Video Retrieval. In: Fusiello, A., Murino, V., Cucchiara, R. (eds) Computer Vision – ECCV 2012. Workshops and Demonstrations. ECCV 2012. Lecture Notes in Computer Science, vol 7585. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33885-4_40

Download citation

DOI: https://doi.org/10.1007/978-3-642-33885-4_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33884-7
Online ISBN: 978-3-642-33885-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Explicit Performance Metric Optimization for Fusion-Based Video Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

Efficient, robust and divisible paired comparison for subjective quality assessment

Survey on the State-Of-The-Art Methods for Objective Video Quality Assessment in Recognition Tasks

Toward an objective benchmark for video completion

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Additional information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Explicit Performance Metric Optimization for Fusion-Based Video Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

Efficient, robust and divisible paired comparison for subjective quality assessment

Survey on the State-Of-The-Art Methods for Objective Video Quality Assessment in Recognition Tasks

Toward an objective benchmark for video completion

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Additional information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation