Abstract
In this research we have developed and evaluated a system that uses the image compositional metric called ‘Rule of Thirds’ used by photographers to grade visual aesthetics of an image. The novel aspect of the work is that it combines quantitative and qualitative aspects of research by taking human psychology into account. The core idea is to identify how similar the perception of a ‘good image’ and ‘bad image’ is by machines versus humans (through a user study based on 255 participants on 5000 images from the standard MIRFLICKR database [9]). We have considered the compositional norm, namely ‘rule of thirds’ used by photographers and inspired by the golden ratio that states that - if an image is segmented on a 3 × 3 grid, then it is appealing to the eye when the most salient object(s) or ‘subject(s)’ of the image is located precisely on or aligned on the middle grid lines [11]. First, we preprocess the input image by labeling the regions of attraction for human eye using two saliency algorithms namely Graph-Based Visual Saliency (GBVS) [3] and Itti-Koch [4]. Next, we quantify the rule of thirds property in images by mathematically considering the location of salient region(s) adhering to rule of thirds. This is then used to rank or score an input image. To validate, we conducted a user study where 255 human subjects ranked the images and compared our algorithmic results, making it a both a quantitative and qualitative research. We have also analyzed and presented the performance differences between two saliency algorithms and presented ROC plots along with similarity quantification between algorithms and human subjects. Our massive user study and experimental results provides the evidence of modern machine’s ability to mimic human-like behavior. Along with it, results computationally prove significance of rule of thirds.
Keywords
- Image processing
- Visual perception
- Image saliency
- Rule of thirds
- Photography
- Golden ratio
- Computer vision
- Image score
- Image composition
- Image
- Flickr
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Mai, L., Le, H., Niu, Y., Liu, F.: Rule of thirds detection from photograph. In: Proceedings of the IEEE International Symposium on Multimedia (ISM), Dana Point, CA, USA, pp. 91–96 (2011)
Peterson, B.: Learning to See Creatively, 1st edn, pp. 92–93. Amphoto Books, New York (2003)
Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems, pp. 545–552 (2006)
Itti, L., Koch, C.: A saliency-based search mechanism for overt and covert shifts of visual attention. Vis. Res. 40(10–12), 1489–1506 (2000)
Mai, L., Le, H., Niu, Y., Liu, F.: Rule of thirds detection from photograph. In: IEEE International Symposium on Multimedia, Portland (2011)
Amirshahi, S.A., Hayn-Leichsenring, G.U., Denzler, J., Redies, C.: Evaluating the rule of thirds in photographs and paintings. Art Percept. 2, 163–182 (2014)
Maleš, M., Heđi, A., Grgić, M.: Compositional rule of thirds detection. In: IEEE International Symposium on ELMAR 2011, Dana Point (2011)
Murillo, A., Košecká, J., Guerrero, J., Sagüés, C.: Visual door detection integrating appearance and shape cues. Rob. Auton. Syst. 56(6), 512–521 (2008)
Huiskes, M.J., Lew, M.S.: The MIR flickr retrieval evaluation. In: ACM International Conference on Multimedia Information Retrieval (MIR 2008), Vancouver, Canada (2008)
Weisstein, E.W.: Golden Ratio. From MathWorld–A Wolfram Web Resource. http://mathworld.wolfram.com/GoldenRatio.html
McCurry, S.: 9 Photo Composition Tips (feat. Steve McCurry). Youtube (2015). https://youtu.be/7ZVyNjKSr0M. Accessed 2 March 2016
Acknowledgments
This work was funded by North South University’s annual research grant for the fiscal year 2017–18. We would like to thank Professor John Kender (http://www.cs.columbia.edu/~jrk/), Professor of Computer Science at Columbia University for his insights.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Firoze, A., Osman, T., Psyche, S.S., Rahman, R.M. (2018). Scoring Photographic Rule of Thirds in a Large MIRFLICKR Dataset: A Showdown Between Machine Perception and Human Perception of Image Aesthetics. In: Nguyen, N., Hoang, D., Hong, TP., Pham, H., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2018. Lecture Notes in Computer Science(), vol 10751. Springer, Cham. https://doi.org/10.1007/978-3-319-75417-8_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-75417-8_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-75416-1
Online ISBN: 978-3-319-75417-8
eBook Packages: Computer ScienceComputer Science (R0)