Abstract
We observe that everyday images contain dozens of objects, and that humans, in describing these images, give different priority to these objects. We argue that a goal of visual recognition is, therefore, not only to detect and classify objects but also to associate with each a level of priority which we call ‘importance’. We propose a definition of importance and show how this may be estimated reliably from data harvested from human observers. We conclude by showing that a first-order estimate of importance may be computed from a number of simple image region measurements and does not require access to image meaning.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision (2004)
Oliva, A., Torralba, A.B.: Scene-centered description from spatial envelope properties. In: Biologically Motivated Computer Vision, pp. 263–272 (2002)
Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 18–32. Springer, Heidelberg (2000)
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR, vol. 2, pp. 264–271 (2003)
Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering objects and their localization in images. In: ICCV, 370–377 (2005)
Grauman, K., Darrell, T.: Efficient image matching with distributions of local invariant features. In: CVPR, vol. 2, pp. 627–634 (2005)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR, vol. 2, pp. 2169–2178 (2006)
Barnard, K., Forsyth, D.A.: Learning the semantics of words and pictures. In: ICCV, pp. 408–415 (2001)
Russell, B.C., Efros, A.A., Sivic, J., Freeman, W.T., Zisserman, A.: Using multiple segmentations to discover objects and their extent in image collections. In: Proceedings of CVPR (2006)
Andreetto, M., Zelnik-Manor, L., Perona, P.: Unsupervised learning of categorical segments in image collections. In: Computer Vision and Pattern Recognition (CVPR 2008) (2008)
Todorovic, S., Ahuja, N.: Extracting texels in 2.5d natural textures. In: Proceeddings of ICCV (2007)
von Ahn, L., Dabbish, L.: Labeling images with a computer game. In: CHI, pp. 319–326 (2004)
Russell, B.C., Torralba, A., Murphy, K.P., Freeman, W.T.: Labelme: a database and web-based tool for image annotation. Technical report (2005)
Elazary, L., Itti, L.: Interesting objects are visually salient. Journal of Vision 8, 1–15 (2008)
Mayer, M., Switkes, E.: Spatial frequency taxonomy of the visual environment. Investigative Ophthalmology and Visual Science 26 (1985)
Shore, S.: Stephen Shore: American Surfaces. Phaidon Press (2005)
Shore, S., Tillman, L., Schmidt-Wulffen, S.: Uncommon Places: The Complete Works. Aperture (2005)
(Wordnet)
Fog, A.: Calculation methods for wallenius’ noncentral hypergeometric distribution. Communications In statictics, Simulation and Computation 37, 258–273 (2008)
Manly, B.F.J.: A model for certain types of selection experiments. Biometrics 30, 281–294 (1974)
Kullback, S., Leibler, R.A.: On information and sufficiency. Annals of Mathematical Statistics 22, 79–86 (1951)
Angelova, A., Matthies, L., Helmick, D.M., Perona, P.: Fast terrain classification using variable-length representation for autonomous navigation. In: CVPR (2007)
Walther, D., Koch, C.: Modeling attention to salient proto-objects. Neural Networks 19, 1395–1407 (2006)
Yarbus, A.: Eye movements and vision. Plenum Press, New York (1967)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Spain, M., Perona, P. (2008). Some Objects Are More Equal Than Others: Measuring and Predicting Importance. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5302. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88682-2_40
Download citation
DOI: https://doi.org/10.1007/978-3-540-88682-2_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88681-5
Online ISBN: 978-3-540-88682-2
eBook Packages: Computer ScienceComputer Science (R0)