Classification with Global, Local and Shared Features

Bilen, Hakan; Namboodiri, Vinay P.; Van Gool, Luc J.

doi:10.1007/978-3-642-32717-9_14

Hakan Bilen¹⁸,
Vinay P. Namboodiri¹⁹ &
Luc J. Van Gool^18,20

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7476))

Included in the following conference series:

Joint DAGM (German Association for Pattern Recognition) and OAGM Symposium

4131 Accesses
3 Citations

Abstract

We present a framework that jointly learns and then uses multiple image windows for improved classification. Apart from using the entire image content as context, class-specific windows are added, as well as windows that target class pairs. The location and extent of the windows are set automatically by handling the window parameters as latent variables. This framework makes the following contributions: a) the addition of localized information through the class-specific windows improves classification, b) windows introduced for the classification of class pairs further improve the results, c) the windows and classification parameters can be effectively learnt using a discriminative max-margin approach with latent variables, and d) the same framework is suited for multiple visual tasks such as classifying objects, scenes and actions. Experiments demonstrate the aforementioned claims.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bilen, H., Namboodiri, V.P., Van Gool, L.J.: Object and Action Classification with Latent Variables. In: BMVC (2011)
Google Scholar
Boureau, Y., Le Roux, N., Bach, F., Ponce, J., LeCun, Y.: Ask the locals: multi-way local pooling for image recognition. In: ICCV. IEEE (2011)
Google Scholar
Dekel, O., Keshet, J., Singer, Y.: Large margin hierarchical classification. In: International Conference on Machine Learning (ICML), pp. 27–35 (2004)
Google Scholar
Everingham, M., Zisserman, A., Williams, C.K.I., Van Gool, L.: The PASCAL Visual Object Classes Challenge 2006 (VOC 2006) Results (2006), http://www.pascal-network.org/challenges/VOC/voc2006/results.pdf
Fergus, R., Bernal, H., Weiss, Y., Torralba, A.: Semantic Label Sharing for Learning with Many Categories. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 762–775. Springer, Heidelberg (2010)
Chapter Google Scholar
Gehler, P.V., Nowozin, S.: On feature combination for multiclass object classification. In: ICCV, pp. 221–228 (2009)
Google Scholar
Hoai, M., Lan, Z.Z., De la Torre, F.: Joint segmentation and classification of human actions in video. In: CVPR (2011)
Google Scholar
Lampert, C., Austria, I.: Maximum margin multi-label structured prediction (2011)
Google Scholar
Laptev, I., Lindeberg, T.: Space-time interest points. In: ICCV, pp. 432–439 (2003)
Google Scholar
Laptev, I., Marszałek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: CVPR (2008)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)
Google Scholar
Li, L.-J., Su, H., Xing, E.P., Fei-Fei, L.: Object bank: A high-level image representation for scene classification & semantic feature sparsification. In: Advances in Neural Information Processing Systems, NIPS (2010)
Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: ICCV, p. 1150 (1999)
Google Scholar
Marszałek, M., Schmid, C.: Semantic hierarchies for visual object recognition. In: CVPR (2007)
Google Scholar
Nguyen, M.H., Torresani, L., De la Torre, F., Rother, C.: Weakly supervised discriminative localization and classification: a joint learning process. In: ICCV (2009)
Google Scholar
Nilsback, M.E., Zisserman, A.: A visual vocabulary for flower classification. In: CVPR, vol. 2, pp. 1447–1454 (2006)
Google Scholar
Opelt, A., Pinz, A., Zisserman, A.: Incremental learning of object detectors using a visual shape alphabet. In: CVPR, pp. 3–10 (2006)
Google Scholar
Pandey, M., Lazebnik, S.: Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV (2011)
Google Scholar
Patron, A., Marszalek, M., Zisserman, A., Reid, I.D.: High five: Recognising human interactions in tv shows. In: BMVC, pp. 1–11 (2010)
Google Scholar
Pinz, A.: Object categorization. Foundations and Trends in Computer Graphics and Vision 1(4) (2005)
Google Scholar
Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: CVPR (2009)
Google Scholar
Sadeghi, M.A., Farhadi, A.: Recognition using visual phrases. In: CVPR (2011)
Google Scholar
Salakhutdinov, R., Torralba, A., Tenenbaum, J.: Learning to share visual appearance for multiclass object detection. In: CVPR (2011)
Google Scholar
Torresani, L., Szummer, M., Fitzgibbon, A.: Efficient Object Category Recognition Using Classemes. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 776–789. Springer, Heidelberg (2010)
Chapter Google Scholar
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: International Conference on Machine Learning (ICML), pp. 104–112 (2004)
Google Scholar
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: ICCV, pp. 606–613 (2009)
Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: CVPR, pp. 3360–3367 (2010)
Google Scholar
Yu, C.N.J., Joachims, T.: Learning structural svms with latent variables. In: International Conference on Machine Learning (ICML), pp. 1169–1176. ACM (2009)
Google Scholar
Yuille, A., Rangarajan, A.: The concave-convex procedure. Neural Computation 15(4), 915–936 (2003)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

ESAT-PSI/IBBT,VISICS/KU Leuven, Belgium
Hakan Bilen & Luc J. Van Gool
Alcatel-Lucent Bell Labs, Antwerp, Belgium
Vinay P. Namboodiri
Computer Vision Laboratory, BIWI/ETH Zürich, Switzerland
Luc J. Van Gool

Authors

Hakan Bilen
View author publications
You can also search for this author in PubMed Google Scholar
Vinay P. Namboodiri
View author publications
You can also search for this author in PubMed Google Scholar
Luc J. Van Gool
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Electrical Measurement and Measurement Signal Processing, Graz University of Technology, Kronesgasse 5, 8010, Graz, Austria
Axel Pinz
Institute for Computer Graphics and Vision, Graz University of Technology, Inffeldgasse 16, 8010, Graz, Austria
Thomas Pock , Horst Bischof & Franz Leberl , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bilen, H., Namboodiri, V.P., Van Gool, L.J. (2012). Classification with Global, Local and Shared Features. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds) Pattern Recognition. DAGM/OAGM 2012. Lecture Notes in Computer Science, vol 7476. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32717-9_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-32717-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32716-2
Online ISBN: 978-3-642-32717-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics