Abstract
We propose in this chapter to highlight the impact of visual saliency information in Content Based Image Retrieval (CBIR) systems. We firstly present results of subjective evaluations for emotion analysis with and without use of saliency to reduce the image size and conclude that image reduction to more salient regions implies a better evaluation of emotional impact. We also test eye-tracking methods to validate our results and conclusions. Those experiments lead us to study saliency to improve the image description for indexing purpose. We first show the influence of selecting salient features for relevant image indexing and retrieval. Then, we propose a novel approach that makes use of saliency in an information gain criterion to improve the selection of a visual dictionary in the well-known Bags of Visual Words approach. Our experiments will underline the effectiveness of the proposal. Finally, we present some results on emotional impact recognition using CBIR descriptors and Bags of Visual Words approach with image saliency information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Our saliency values are computed using the Graph-Based Visual Saliency (GBVS) software http://www.klab.caltech.edu/~harel/share/gbvs.php which implements also Itti et al.’s algorithm.
- 2.
Those from the chosen detectors.
- 3.
We used the descriptors provided by Jegou et al. available at http://lear.inrialpes.fr/people/jegou/data.php.
References
Abdel-Hakim, A.E., Farag, A.A.: CSIFT: A SIFT Descriptor with color invariant characteristics. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2006)
Bay, H., Tuytelaars, T., Van Gool, L.: Surf: speeded up robust features. Lecture Notes in Computer Science, vol. 3951, pp. 404–417. Springer, Berlin (2006)
Beke, L., Kutas, G., Kwak, Y., Sung, G.Y., Park, D., Bodrogi, P.: Color preference of aged observers compared to young observers. Color. Res. Appl. 33(5), 381–394 (2008)
Borji, A., Sihite, D., Itti, L.: Quantitative analysis of human-model agreement in visual saliency modeling: A comparative study. IEEE Trans. Image Process. 22(1), 55–69 (2013)
Boyatziz, C., Varghese, R.: Children’s emotional associations with colors. J. Gen. Psychol. 155, 77–85 (1993)
Bradley, M.M., Codispoti, M., Sabatinelli, D., Lang, P.J.: Emotion and motivation ii: sex differences in picture processing. Emotion 1(3), 300–319 (2001)
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp. 1–22 (2004)
Denis, P., Courboulay, V., Revel, A., Gbehounou, S., Lecellier, F., Fernandez-Maloigne, C.: Improvement of natural image search engines results by emotional filtering. EAI Endorsed Trans. Creative Technologies 3(6), e4 (2016). https://hal.archives-ouvertes.fr/hal-01261237
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)
Gao, K., Lin, S., Zhang, Y., Tang, S., Ren, H.: Attention model based sift keypoints filtration for image retrieval. In: Proceedings of IEEE International Conference on Computer and Information Science, pp. 191–196 (2008)
Gbèhounou, S., Lecellier, F., Fernandez-Maloigne, C., Courboulay, V.: Can Salient Interest Regions Resume Emotional Impact of an Image?, pp. 515–522 Springer, Berlin (2013). doi:10.1007/978-3-642-40261-6_62. http://dx.doi.org/10.1007/978-3-642-40261-6_62
Gbehounou, S., Lecellier, F., Fernandez-Maloigne, C.: Evaluation of local and global descriptors for emotional impact recognition. J. Vis. Commun. Image Represent. 38, 276–283 (2016)
Gbèhounou, S., Lecellier, F., Fernandez-Maloigne, C.: Evaluation of local and global descriptors for emotional impact recognition. J. Vis. Commun. Image Represent. 38(C), 276–283 (2016). doi:10.1016/j.jvcir.2016.03.009. http://dx.doi.org/10.1016/j.jvcir.2016.03.009
González-Díaz, I., Buso, V., Benois-Pineau, J.: Perceptual modeling in the problem of active object recognition in visual scenes. Pattern Recognition 56, 129–141 (2016). doi:10.1016/j.patcog.2016.03.007. http://dx.doi.org/10.1016/j.patcog.2016.03.007
Gordoa, A., Rodriguez-Serrano, J.A., Perronnin, F., Valveny, E.: Leveraging category-level labels for instance-level image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3045–3052 (2012)
Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems, pp. 545–552. MIT Press, Cambridge (2007)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the 4th Alvey Vision Conference, pp. 147–151 (1988)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(11), 1254–1259 (1998)
Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Proceedings of the 10th European Conference on Computer Vision: Part I, ECCV’08, pp. 304–317. Springer, Berlin (2008)
Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: Proceedings of the 23rd IEEE Conference on Computer Vision & Pattern Recognition, pp. 3304–3311. IEEE Computer Society, New York (2010)
Kaya, N., Epps, H.H.: Color-emotion associations: Past experience and personal preference. In: AIC Colors and Paints, Interim Meeting of the International Color Association (2004)
Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 506–513 (2004)
Kootstra, G., de Boer, B., Schomaker, L.: Predicting eye fixations on complex visual stimuli using local symmetry. Cogn. Comput. 3(1), 223–240 (2011)
Lang, P.J., Bradley, M.M., Cuthbert, B.N.: International affective picture system (IAPS): affective ratings of pictures and instruction manual. technical report A-8. Technical Report, University of Florida (2008)
Le Meur, O., Le Callet, P., Barba, D., Thoreau, D.: A coherent computational approach to model bottom-up visual attention. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 802–817 (2006)
Li, Y., Zhou, Y., Yan, J., Niu, Z., Yang, J.: Visual saliency based on conditional entropy. Lecture Notes in Computer Science, vol. 5994, pp. 246–257. Springer, Berlin (2010)
Liu, W., Xu, W., Li, L.: A tentative study of visual attention-based salient features for image retrieval. In: Proceedings of the 7th World Congress on Intelligent Control and Automation, pp. 7635–7639 (2008)
Liu, N., Dellandréa, E., Chen, L.: Evaluation of features and combination approaches for the classification of emotional semantics in images. In: International Conference on Computer Vision Theory and Applications (2011)
Lowe, D.G.: Object recognition from local scale-invariant features. In: International Conference on Computer Vision, vol. 2, pp. 1150–1157 (1999)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Lucassen, M.P., Gevers, T., Gijsenij, A.: Adding texture to color: quantitative analysis of color emotions. In: Proceedings of CGIV (2010)
Machajdik, J., Hanbury, A.: Affective image classification using features inspired by psychology and art theory. In: Proceedings of the international conference on Multimedia, pp. 83–92 (2010)
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: Proceedings of the 8th IEEE International Conference on Computer Vision, vol. 1, pp. 525–531 (2001)
Mikolajczyk, K., Schmid, C.: An affine invariant interest point detector. In: Computer Vision-ECCV. Lecture Notes in Computer Science, vol. 2350, pp. 128–142. Springer, Berlin (2002)
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Van Gool, L.: A comparison of affine region detectors. Int. J. Comput. Vision 65(1-2), 43–72 (2005)
Mindru, F., Tuytelaars, T., Van Gool, L., Moons, T.: Moment invariants for recognition under changing viewpoint and illumination. Comput. Vis. Image Underst. 94(1–3), 3–27 (2004)
Moravec, H.P.: Towards automatic visual obstacle avoidance. In: Proceedings of the 5th International Joint Conference on Artificial Intelligence, vol. 2, pp. 584–584. Morgan Kaufmann, San Francisco (1977)
Nauge, M., Larabi, M.C., Fernandez-Maloigne, C.: A statistical study of the correlation between interest points and gaze points. In: Human Vision and Electronic Imaging, p. 12. Burlingame (2012)
Nistér, D., Stewénius, H.: Scalable recognition with a vocabulary tree. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168 (2006)
Ou, L.C., Luo, M.R., Woodcock, A., Wright, A.: A study of colour emotion and colour preference. part i: Colour emotions for single colours. Color. Res. Appl. 29(3), 232–240 (2004)
Ou, L.C., Luo, M.R., Woodcock, A., Wright, A.: A study of colour emotion and colour preference. part ii: Colour emotions for two-colour combinations. Color. Res. Appl. 29(4), 292–298 (2004)
Ou, L.C., Luo, M.R., Woodcock, A., Wright, A.: A study of colour emotion and colour preference. Part iii: colour preference modeling. Color. Res. Appl. 29(5), 381–389 (2004)
Paleari, M., Huet, B.: Toward emotion indexing of multimedia excerpts. In: Proceedings on Content-Based Multimedia Indexing, International Workshop, pp. 425–432 (2008)
Perreira Da Silva, M., Courboulay, V., Prigent, A., Estraillier, P.: Evaluation of preys/predators systems for visual attention simulation. In: Proceedings of the International Conference on Computer Vision Theory and Applications, pp. 275–282, INSTICC (2010)
Perronnin, F.: Universal and adapted vocabularies for generic visual categorization. IEEE Trans. Pattern Anal. Mach. Intell. 30(7), 1243–1256 (2008)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Minneapolis, MI (2007)
Rosten, E., Drummond, T.: Fusing points and lines for high performance tracking. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2, pp. 1508–1511 (2005)
Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Proceedings of the European Conference on Computer Vision, vol. 1, pp. 430–443 (2006)
Schmid, C., Mohr, R., Bauckhage, C.: Evaluation of interest point detectors. Int. J. Comput. Vision 37(2), 151–172 (2000)
Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision, pp. 1470–1477 (2003)
Smith, S.M., Brady, J.M.: Susan—a new approach to low level image processing. Int. J. Comput. Vision 23(1), 45–78 (1997)
Solli, M., Lenz, R.: Color harmony for image indexing. In: Proceedings of the 12th International Conference on Computer Vision Workshops, pp. 1885–1892 (2009)
Solli, M., Lenz, R.: Emotion related structures in large image databases. In: Proceedings of the ACM International Conference on Image and Video Retrieval, pp. 398–405. ACM, New York (2010)
Tuytelaars, T., Mikolajczyk, K.: Local invariant feature detectors: a survey. Found. Trends Comput. Graph. Vis. 3(3), 177–280 (2008)
Urruty, T., Gbèhounou, S., Le, T.L., Martinet, J., Fernandez-Maloigne, C.: Iterative random visual words selection. In: Proceedings of International Conference on Multimedia Retrieval, ICMR’14, pp. 249–256. ACM, New York (2014)
van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1582–1596 (2010)
Wang, W., Yu, Y.: Image emotional semantic query based on color semantic description. In: Proceedings of the The 4th International Conference on Machine Leraning and Cybernectics, vol. 7, pp. 4571–4576 (2005)
Wei, K., He, B., Zhang, T., He, W.: Image Emotional classification based on color semantic description. Lecture Notes in Computer Science, vol. 5139, pp. 485–491. Springer, Berlin (2008)
Yanulevskaya, V., Van Gemert, J.C., Roth, K., Herbold, A.K., Sebe, N., Geusebroek, J.M.: Emotional valence categorization using holistic image features. In: Proceedings of the 15th IEEE International Conference on Image Processing, pp. 101–104 (2008)
Zdziarski, Z., Dahyot, R.: Feature selection using visual saliency for content-based image retrieval. In: Proceedings of the IET Irish Signals and Systems Conference, pp. 1–6 (2012)
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. Int. J. Comput. Vis. 73(2), 213–238 (2007)
Zhang, L., Tong, M.H., Marks, T.K., Shan, H., Cottrell, G.W.: Sun: A Bayesian framework for saliency using natural statistics. J. Vis. 8(7), 1–20 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Gbehounou, S., Urruty, T., Lecellier, F., Fernandez-Maloigne, C. (2017). Introducing Image Saliency Information into Content Based Indexing and Emotional Impact Analysis. In: Benois-Pineau, J., Le Callet, P. (eds) Visual Content Indexing and Retrieval with Psycho-Visual Models. Multimedia Systems and Applications. Springer, Cham. https://doi.org/10.1007/978-3-319-57687-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-57687-9_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57686-2
Online ISBN: 978-3-319-57687-9
eBook Packages: Computer ScienceComputer Science (R0)