Tell Me What You Like and I’ll Tell You What You Are: Discriminating Visual Preferences on Flickr Data
John Ruskin’s 19th-century adage suggests that personal taste is not merely an absolute set of aesthetic principles valid for everyone: it is a process of interpretation that is also rooted in one’s life experiences. This aspect is today a major obstacle to automatically inferring the quality of a picture. In this paper, instead of trying to solve this age-old problem, we consider an intriguing, orthogonal direction, aimed at discovering how personal tastes differ. Given a set of a user’s preferred images, obtained from Flickr, we extract a pool of low- and high-level features; LASSO regression is then used to learn the most discriminative ones, considering a group of 200 random Flickr users. These features can be easily recovered, making it possible to understand which aspects of “what we like” distinguish us from others. We then perform multi-class classification, where a test sample is a set of preferred pictures of an unknown user, and the classes are all the users. The results are surprising: given only 1 test image, we can match the user’s preferences well above chance, and with 20 images we reach an nAUC of 91% on the cumulative matching characteristic curve. Extensive experiments support our approach, suggesting intriguing new perspectives in the study of computational aesthetics.
Keywords: Training Image, Visual Preference, Personal Taste, Regression Score, Lasso Regression
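The evaluation mentioned in the abstract, a cumulative matching characteristic (CMC) curve summarized by its normalized area (nAUC), can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the toy score matrix, the tie handling, and the choice to normalize the area by averaging the curve over all ranks are assumptions made for illustration; the abstract does not specify these details.

```python
from typing import List, Sequence


def cmc_curve(scores: Sequence[Sequence[float]], true_ids: Sequence[int]) -> List[float]:
    """Cumulative matching characteristic curve (illustrative sketch).

    scores[i][j] is the matching score of test sample i against user j
    (higher means a better match). Entry k-1 of the result is the fraction
    of test samples whose true user appears among the top-k ranked users.
    """
    n_users = len(scores[0])
    hits_at_rank = [0] * n_users
    for row, true_id in zip(scores, true_ids):
        # Rank of the true user: number of users with a strictly higher score.
        # Assumption: ties are resolved in favor of the true user.
        rank = sum(1 for s in row if s > row[true_id])
        hits_at_rank[rank] += 1

    curve = []
    running = 0
    for hits in hits_at_rank:
        running += hits
        curve.append(running / len(scores))
    return curve


def normalized_auc(curve: Sequence[float]) -> float:
    """Area under the CMC curve, normalized to [0, 1] by the number of ranks.

    Assumption: the normalization is a plain average over all ranks.
    """
    return sum(curve) / len(curve)


# Hypothetical toy data: 3 test samples scored against 3 users.
toy_scores = [
    [0.9, 0.1, 0.0],  # true user 0 is ranked first
    [0.2, 0.3, 0.5],  # true user 1 is ranked second
    [0.6, 0.1, 0.3],  # true user 2 is ranked second
]
toy_true_ids = [0, 1, 2]

curve = cmc_curve(toy_scores, toy_true_ids)
nauc = normalized_auc(curve)
print(curve, nauc)
```

In the setting of the abstract, each row would hold the scores of one unknown user's test set (1 to 20 preferred images) against all 200 users, and the nAUC summarizes how early the correct user tends to appear in the ranking.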