Machine learning revealed symbolism, emotionality, and imaginativeness as primary predictors of creativity evaluations of western art paintings

Spee, Blanca T. M.; Mikuni, Jan; Leder, Helmut; Scharnowski, Frank; Pelowski, Matthew; Steyrl, David

doi:10.1038/s41598-023-39865-1

Machine learning revealed symbolism, emotionality, and imaginativeness as primary predictors of creativity evaluations of western art paintings

Article
Open access
Published: 10 August 2023

Volume 13, article number 12966, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Machine learning revealed symbolism, emotionality, and imaginativeness as primary predictors of creativity evaluations of western art paintings

Download PDF

Blanca T. M. Spee^1,2,3,
Jan Mikuni³,
Helmut Leder^1,3,
Frank Scharnowski^3,4,
Matthew Pelowski^1,3 &
…
David Steyrl^3,4

1908 Accesses
5 Citations
2 Altmetric
Explore all metrics

Abstract

Creativity is a compelling yet elusive phenomenon, especially when manifested in visual art, where its evaluation is often a subjective and complex process. Understanding how individuals judge creativity in visual art is a particularly intriguing question. Conventional linear approaches often fail to capture the intricate nature of human behavior underlying such judgments. Therefore, in this study, we employed interpretable machine learning to probe complex associations between 17 subjective art-attributes and creativity judgments across a diverse range of artworks. A cohort of 78 non-art expert participants assessed 54 artworks varying in styles and motifs. The applied Random Forests regressor models accounted for 30% of the variability in creativity judgments given our set of art-attributes. Our analyses revealed symbolism, emotionality, and imaginativeness as the primary attributes influencing creativity judgments. Abstractness, valence, and complexity also had an impact, albeit to a lesser degree. Notably, we observed non-linearity in the relationship between art-attribute scores and creativity judgments, indicating that changes in art-attributes did not consistently correspond to changes in creativity judgments. Employing statistical learning, this investigation presents the first attribute-integrating quantitative model of factors that contribute to creativity judgments in visual art among novice raters. Our research represents a significant stride forward building the groundwork for first causal models for future investigations in art and creativity research and offering implications for diverse practical applications. Beyond enhancing comprehension of the intricate interplay and specificity of attributes used in evaluating creativity, this work introduces machine learning as an innovative approach in the field of subjective judgment.

Cross-cultural comparison of beauty judgments in visual art using machine learning analysis of art attribute predictors among Japanese and German speakers

Article Open access 10 July 2024

Quantitative Analysis of Artists’ Characteristic Styles through Biologically-Motivated Image Processing Techniques: Uncovering a Mentor to Johannes Vermeer

Computational Beauty: Aesthetic Judgment at the Intersection of Art and Science

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Introduction

Creativity can be found in many domains and within various human endeavors, ranging from art and science to social gaming and creativity in daily life^1,2,3. Almost indisputably, creativity is strongly associated with the fine arts, particularly visual arts⁴. This association is not only valid under consideration of sociological aspects⁵ but is also scientifically substantiated. Representative here is the collection of 225 creativity tests by Torrance and Goff⁶, where a significant portion of tests fall into the category of “figural image-making”. Creativity is also esteemed as an estimation marker for often pragmatic ulterior motives, such as the selection of artworks made by museum curators to attract museum visitors⁷, scoring individuals on creativity as a parameter to attend art schools, and the assessment of artworks in terms of their aesthetic and qualitative value^4,8. In addition—and looking at what impact creative engagements may have on people—creative art interactions are considered pathways to exploration, problem-solving^9,10, and communication channels for individuals and societal discourse⁵. Thus, artworks are valued as creative icons not only for their beauty or elegance, but paintings, drawings, and even scribbles from famous artists—like the one-line animal sketches by Pablo Picasso—are also appreciated because they are judged as creative.

Artworks as creative products in all their diversity have become ubiquitous in human societies and serve as central commodities for inter-social and cultural comparisons and benchmarks^5,11,12,13. The judgment of creativity plays hereby a pivotal role in establishing such benchmarks¹⁴, where the degree of creativity determines the status of artists or geniuses, the famousness of an artwork¹⁵, its monetary value and the degree of copyright protections¹⁶. The relevance of creativity becomes even more evident when considering its consistent presence in the discourse surrounding the conceptualization of art across cultures and historical periods, where creativity appears to be universally applied as an art judgment for all kinds of visual art forms, innovations, and at different historical time-points³.

However, despite its fundamental importance in artistic and societal domains, creativity remains a polysemous concept lacking a clear definition, even in terms of its meaning as an art judgment¹⁷. Consequently, due to its multifaceted nature, researchers have employed various approaches, both objective and subjective, to measure creativity. On the one hand, studies have focused on objective formal-perceptual and design-oriented features, such as the Golden Ratio and luminance, comparing them to subjective preference ratings using classical and statistical learning methods^18,19,20,21. Computational aesthetics has also evaluated aesthetic measures and quantification to generate designs (e.g.^22,23). However, to the best of our knowledge, none of these studies, also the ones using statistical learning methods for objective measures, have centrally investigated creativity as a judgment and examined subjectively perceived attribute structures along this art evaluation.

On the other hand, researchers in visual arts, creativity, and psychology typically measure creativity levels through subjective assessments of the final products using mainly linear statistical methods (³, see for review^24,25). Such assessments have been employed in art viewing studies, including, e.g., standardized tests where participants complete drawings and provide subsequent scores or ratings²⁶. These assessments gained prominence following the seminal work of Daniel Berlyne^27,28,29, a central figure in art research who primarily studied art-specific properties through preference judgments. Subsequent studies have focused on composition comparisons³⁰, evaluation of realistic copying²⁵, differences between trained artists and amateurs, and distinctions in technique, styles, artists’ traits, states, and skill sets. For example, Hager et al.³¹ included creativity in their art reception survey connecting through principal component analysis artistic quality and creativity with five personality factors encompassing cognitive engagement, comprehension, affective appraisal, self-referential, and knowledge aspects. Hence, creativity judgments in art and within other creative domains are often centered around studying exceptionally creative people along their accomplishments³². Another example is the Consensual Assessment Technique (CAT)^30,33,34, which involves participants sorting creative products into groups based on their perceived levels of creativity. Notably, studies applying the CAT, also to Picasso’s work³⁵, have found high coherence even among non-art experts (see also Amabile³⁰ reporting a high inter-rater-reliability ranging from 0.72 to 0.93 in 20 studies in the visual art domain).

In all these studies, creativity judgments rely on people’s ability and coherence to recognize and judge what is creative or not and to explain their decision along attributes^30,31,34. This interactive use of terminology to communicate opinions and justification using attributes appears to be a result of social-cultural and educational training in interacting with aesthetic goods³⁶, and certainly might differ depending on social-cultural background and expertise^37,38. Similarly, in scientific psychological assessments, there is a common practice of establishing correspondence between the concept of a term and behavioral outcomes^3,4. Hence, studying the complex associations between judgment and attributes that can potentially be incorporated into numeric predicting patterns^4,5,36,39, is intriguing considering that individuals appear to be adept at recognizing creativity.

However, previous studies have not addressed the underlying question of how individuals identify creative art based on specific attributes. Neither of the studies used subjectively perceived content-representational and formal-perceptual attributes predicting creativity judgements (see “Materials and methods” for attribute list). In addition, most studies used statistical models assuming linear associations between independent and dependent variable, i.e., changes in art-attribute scores would correspond directly to changes in creativity judgments with a constant factor. Human behavior, however, is most likely not based on such linear associations and such a simplification can lead potentially to irrelevant theories and questionable scientific conclusions⁴⁰. Exemplary here is, again, Berlyne’s and later research findings indicating that preferences, such as liking or appreciation, are not linearly associated with attributes like complexity; instead, at a specific threshold, judgments exhibit a sudden shift, leading to a pronounced increase or decrease in liking^{29,41,42,43,44}. This phenomenon, which might also extend to the realm of creativity, suggests the association of unknown attributes influencing the perception and evaluation of creative works might also not be linear. These issues led us to the following research question: Which subjectively perceived art-attributes contribute to the judgment of an artwork’s level of creativity?

To address this gap, we conducted a study using 17 visual art-attributes assessed through semantic differential scales⁴⁵. These attributes, such as warm/cold, simple/complex, emotionless/emotionally loaded⁴⁶, have been identified in art research as influential factors in rating artworks^{14,28,47,48,49,50}. We employed Random Forest machine learning regression models, which can learn non-linear association patterns and interactions from data³⁹, to predict creativity judgments based on the aforementioned attributes. To analyze the importance of each individual art-attribute in predicting creativity judgments, we utilized permutation importance, a method from the interpretable machine learning field⁶. Our hypothesis is that art-attributes enable the prediction of creativity judgments, and our analysis of the model will identify art-attributes that contribute significantly to the prediction. By adopting this exploratory approach that encompasses multiple attributes, we broaden the scope of statistical designs typically employed in art research, opening new avenues to delve into the intricacies of judgment behavior, particularly within the domain of creativity in visual art. Furthermore, our study presents a quantitative and testable model that elucidates the statistical interplay between art-attributes and assessments of creativity. As a result, our findings have implications not only for practical applications but also for socio-cultural understanding across a diverse range of scientific fields. Moreover, this methodological framework can be adapted for investigating other types of judgments and participant samples, further expanding its utility and applicability.

Materials and methods

Sample size

Precise sample size justification (power analysis) for complex machine learning-based data analysis methods is still an open matter, and to the best of our knowledge, no standards have been established. Therefore, we followed a series of available suggestions regarding a reasonable sample size. First, a commonly disputed suggestion is that 50 samples are required to start any meaningful machine learning-based data analysis (scikit-learn 2021⁵¹). Second, a controversial suggestion is that 10 to 20 samples per degree of freedom (independent variable, art-attribute) is reasonable, particularly for logistic regression, which would result in a total of 170 to 340 samples needed for our study⁵². Third, a traditional power analysis using the two-tailed Student’s t-test, with an alpha of 0.05 and a power of 0.8, suggests that 394 samples would be required to detect small effects (Cohen’s d 0.2⁵³). Generally, a larger sample size allows for the detection of smaller effects, as is the case with other statistical data analysis methods. Based on these considerations and available resources, we collected a total of 4206 samples (78 participants × 54 images, with 6 ratings missing due to recording issues). This number adheres to best practices in artwork rating studies^26,54.

Participants

Ratings were made by a final sample of 78 psychology students recruited at the University of Vienna (55 female, M_age = 24.23, SD = 3.45, ranged from 19 to 35). All participants signed an informed consent. All participants were fluent in German (surveyed via mother tongue), were informed of the purpose of the study, and participated for course credits. Raters were all assessed in terms of their self-reported art expert level (M = 2.23, SD = 1.27), their art education on art theory (M = 1.14, SD = 0.59), on art history (M = 1.13, SD = 0.55), on studio art (M = 1.13, SD = 0.48), their self-reported art-making expertise (M = 1.27, SD = 0.47) with 7-point Likert visual scales (1: does not at all apply to me, 7: completely applies to me), and answered the Vienna Art Interest and Art Knowledge (VAIAK) Questionnaire to assess their art interest and knowledge⁵⁵. The art interest scores ranged from 12 to 71 across participants; the mean art interest score across all participants was 40.00 (SD = 13.90). Note that the possible range of the art interest score is from 7 to 71. The art knowledge scores ranged from 2 to 20 across participants; the mean art knowledge score across all participants was 7.03 (SD = 3.75). Note also that the possible range of the art knowledge score is from 0 to 36. Given all ratings and scores from our sample, it is plausible to assume that they can be identified as general art novices (see⁵⁵ for rationale). This study was conducted in accordance with the guidelines and regulations outlined by the Ethics Committee of the University of Vienna (reference number ethics committee 00256), which approved the research involving human participants. Informed consent was obtained from all participants, all methods were performed in accordance with the relevant guidelines and regulations approved by the Ethics Committee, adhering to the principles set forth in the Declaration of Helsinki.

Procedure

The testing was conducted in a laboratory room with four separate workstations, and participants were positioned approximately 50 cm from the screen. Participants received written instructions stating that they would be shown a series of visual artworks and were asked to provide ratings for each image using a set of attribute items, including the creativity judgment. It is important to note that in this paper, we focused our analysis solely on the creativity judgments along the attributes. Participants were allowed to take as much time as they needed to complete the ratings and could take short breaks if necessary. The artworks were displayed on a 19ʹʹ Iiyama ProLite B1906S monitor, with the longest dimension of each artwork fixed at a maximum of 500 pixels (1280 × 1024, 60 Hz resolution).

To evaluate the artworks, the images were shown individually and centered on the screen, with a top margin distance of 1 inch and a white background. Below each image, the attribute items and judgments were presented, and participants could scroll down using the mouse while keeping the artwork image fixed on the screen. Each artwork was presented once to each participant. Each participant rated the full set of artworks, resulting in a total of 4206 ratings (including 17 art-attributes and creativity judgments, with 6 ratings missing due to recording issues). The order of the rating items and artworks was randomized between participants, and the rating items were randomized for each image presentation (per trial) to avoid rating sequence effects.

The set of art-attribute items was presented as bipolar scales (semantic differentials), with a slider positioned in the middle between the two poles on a 100-point Likert scale. The full list of items for the attributes, along with their dimensions and the exact wording in German, is provided in Supplementary Information Table S1, and the English version is described in the following subsection (also see Table 1).

Table 1 Items of art-attributes along their semantic differential dimension poles (independent variables used in machine learning analysis).

Full size table

At the end of the testing session, all participants answered questions about their art expertise, education, and experience in making art. Additionally, they responded to the questions included in VAIAK⁵⁵ to measure their art interest and knowledge scores.

Art-attribute selection for rater assessments

Our focus was on the judgment of an artwork’s creativity, by asking: “How creative do you find the artwork?”. To evaluate the independent variables for this judgment we used art-attributes representing formal perceptual and content-representational dimension poles. We based our art-attribute set on previous literature, where the Assessment of Art Attributes battery (“AAA”⁵⁶) presented a most suitable and well-educated set of attributes.

The AAA covers six formal-perceptual attributes (balance, color saturation, color temperature, depth, complexity, brushstroke) and six conceptual-representational attributes (abstractness, vividness, emotionality, imaginativeness, factual accuracy, symbolism), which we also used, in a slightly adapted version, in our study. We adapted the scales suitable for German terminology usage. We also used coherent semantic differentials (word-pair pole) introduced by Osgood et al.⁵⁷, an approach commonly applied in many research fields⁵⁸.

We further added the following art-attributes: in the color sector for completeness we added color variety with the opposite poles few and many colors and color world with dark/light color as opposite poles^59,60,61; we added utilization of drawing area (little/full utilization) and focus (much context/focused content) as two attributes respecting different cultural background (⁶⁰, e.g., Asian vs. European foci⁶²); last, we included valence as one of the most commonly used art-attributes in empirical art research^8,54. Our art-attributes including opposite attributes pole dimensions are listed in Table 1 (see for German version Supplementary Information Table S1).

Stimuli

The stimuli were a selection of 54 images (2D static images of visual artworks) chosen from the Vienna Art Picture System (VAPS), which is a dataset of 999 fine art paintings and subjective ratings for art research⁶⁶. The artworks differ in the depicted motif (genre categories) and stylistic characteristics (style, historical period subcategories) and were developed to be applicable for research purposes assuring high-quality images. All artworks are fine art originated in a Western culture. We used three genres (1) portrait, (2) landscape, and (3) still life, and three style-categorization: (1) representative (Baroque and Rococo images), (2) impressionistic (Impressionsism and Post Impressionism), and (3) abstract art (Cubism and Surrealism; for full list of artworks Table S2 see Supplementary Information; see for pre-ratings and stimulus selection Supplementary Information text and Table S3). We showed the same number of images in terms of depicted motif and style.

Machine learning based data analysis approach

The machine learning-based regression analysis was performed in Python v3.8.5 using scikit-learn library v0.23.2⁵¹. In the analysis, the independent variables were visual harmony (balanced), depth, complexity, color saturation, color variety, color temperature, color world, brushwork, utilization of drawing area, abstraction, imaginativeness, symbolism (ambiguity), accurate object representation, liveliness/animation, emotionality, valence, and focus, whereas the creative judgment was the dependent variable.

Random Forests (RF) ensemble models were chosen to perform the actual regression task because this machine-learning method is computationally efficient and highly accurate³⁹. It can inherently model associations between independent and dependent variables that go beyond a constant factor (that is non-linear associations) as well as variable interactions³⁹. Furthermore, RF ensemble models are robust against multicollinearity in data³⁹. Regression performance—the prediction accuracy—was assessed using a nested cross-validation (CV) procedure⁶⁷. CV implements repeated train-test splits of the data to assess the generalizability of a model to new unseen data. A shuffle-split scheme with 128 repetitions (20% of the participants in the testing-set, 80% of the participants in the training-set) was applied in the main (outer) CV loop. In each repetition, the training-set was used for data scaling (z-scoring) and model complexity optimization. Model complexity optimization was carried out in a nested (inner) CV procedure using a sequential Bayesian optimization procedure in combination with again a shuffle-split scheme (20% testing, 80% training, 128 repetitions, 96 initial points) to find the best-performing complexity parameters (BayesSearchCV, scikit-optimize, v0.8.1, min_samples_split 2 to 128, min_samples_leaf 1 to 128, max_features 1 to total number of features). The complexity parameters that led to the highest prediction accuracy in the training-set of the inner CV were subsequently used along with the constant parameters n_estimators = 256, criterion = friedman_mse and all other parameters left to default, to train RF regressor models in the main CV loop using the training dataset only. The models were subsequently tested on the respective testing-set of the main CV loop. The testing-set was explicitly not used in the inner CV loop. Regression performance was measured with (1) the prediction coefficient of determination (prediction R²), and (2) the mean absolute error⁶⁸. R² scores are scaled that 0 equals a performance as good as using the average value of the dependent variable as predictor (the trivial predictor), and that 1 means no error at all. However, R² scores can have values between minus infinity (a model that performs worse than using the mean value of the dependent variable as prediction) and 1 (perfect model, no error at all). Notably, the prediction R² will be smaller than R² values of conventional statistical models because the prediction R² measures prediction performance for unknown data and not post hoc model fit. The mean absolute error reflects the average error that is made at each prediction and is not normalized; hence, it is measured in the scale of the dependent variable, the creativity rating (1–101).

The importance of single independent variables for the prediction (regression performance) was assessed with a permutation procedure⁴⁶. For that, the change in regression performance was measured between original data and when single independent variables values were permuted between data instances. The drop in regression performance (drop in prediction R²) is a direct measure for the importance of the permuted independent variable⁴⁶.

Statistical significance of the prediction R² metric and of the independent variables’ importance were assessed using a shuffle (randomization) testing procedure a.k.a. exact test^71,72. After evaluating the regression performance, the model was refitted using the same training data, but with shuffled dependent variable values. The shuffling was repeated 64 times in each main CV loop—16,384 times in total—to assess how likely it is that the obtained prediction R² (original data) and the associated independent variable importance’s appear by chance^71,72. The Null hypothesis was that there is no association between independent and dependent variables, and therefore, the obtained R² and independent variable’s importance’s appear by chance. To reject the Null hypothesis at 95% confidence, it is not allowed that more than 5% of the metrics obtained with shuffle data are more extreme than the metric obtained by the original data^71,72.

The influence of independent variables on the prediction can vary across the range of these variables due to the capacity of RF models to capture non-linear associations between independent and dependent variables. Therefore, the strength of these associations can exhibit variability across different levels of the independent variables. Partial dependence analysis was applied to assess these non-linear associations⁴⁶. Partial dependence analysis computes the expected (average) model prediction for an independent variable’s range.

Results

We predicted creativity judgement ratings of artworks based on ratings of 17 visual art-attributes using machine learning. We computed multivariate Random Forest (RF) regressor models and determined their prediction performances. On average the RF model’s predicting creativity judgements were off by 17.5 points from the actual judgements, within a creativity judgement range of 1 to 101 (see Table 2). The coefficient of determination (R²) between the predicted creativity judgements and the actual creativity judgements was on average 0.30, which indicates that the RF models explain on average 30% of the total variance in creativity judgement ratings.

Table 2 Performance of Random Forests regressor predicting creativity ratings.

Full size table

We analyzed the contributions of single visual art-attributes (independent variables) to the RF model prediction performance, indicating which visual art-attributes are important to predict the creativity judgment (see Fig. 1). We used a permutation-based variable importance measure and shuffle (randomization) significance tests.

The most important (statistically significant incl. Bonferroni correction) visual art-attributes sorted by their effect size were^63,73: 1. symbolisms with the two poles distinct/clear versus symbolic (Fig. 2a) and a median effect size of 0.10, meaning that the RF prediction performance dropped from a R² of 0.3 to 0.2 as the symbolisms values were shuffled; 2. emotionality with emotionless and emotionally loaded (Fig. 2b) and a median effect size of 0.07, hence, the RF prediction performance dropped from a R² of 0.3 to 0.23; and 3. imaginativeness with realistic content versus imaginary, unreal, fantastic content (Fig. 2c) and an median effect size 0.05, accordingly, the RF prediction performance dropped from a R² of 0.3 to 0.25. Effect sizes of ≥ 0.04 are usually considered practically relevant if reported in R²⁷³. Less important (statistically significant excl. Bonferroni correction) were complexity (simple/complex), abstraction (representative /abstract), and valence (negative/ positive). The important art-attributes show a positive association with creativity ratings, thus, a higher art-attribute rating results in higher creativity judgments. Partial dependency plots show, however, that this association cannot be described via a constant factor (Fig. 2, for partial dependency plots for complexity, abstraction and valence see also Fig. S1 in Supplementary Information). Instead, sudden non-linear changes become apparent. No other art-attribute contributed significantly (see Fig. 1). Some examples of artworks rated high versus low in symbolism and imaginativeness as well as high versus low emotionality can be found in the Supplementary Information Fig. S2.

The analysis of our data uncovered an interplay among the relevant predictors, i.e., the attributes, and the target variable, creativity. Further analysis revealed, that while no significant correlation was found between the predictors and creativity, there were some observable correlations among the predictors themselves. This pattern is depicted in a correlation heatmap (see Fig. S3 Supplementary Information). Specifically, the correlation coefficients between symbolism and emotionality, and between imaginativeness and emotionality were low, with values of 0.22 and 0.16 respectively. A high correlation was found between imaginativeness and symbolism with a coefficient of 0.78, but no correlations above that. These finding still align with our initial hypothesis as the predictors were deliberately chosen to encapsulate the multifaceted and interrelated attributes of artistic creativity^3,4,64.

The Principal Component Analysis (PCA, see Fig. S4 in Supplementary Information) showed that all but one of the components were necessary to account for more than 99% of the variance, suggesting that the original variables cannot be reduced linearly into a lower-dimensional space without a significant loss of information. These results suggests that even though predictors are correlated, they each contain unique aspects of information, instrumental in our exploration of creativity in visual art.

Discussion

In our study, we examined the criteria for evaluating an artwork as creative using machine learning models. These models analyzed 17 art-attributes and revealed that artworks with higher levels of symbolism, emotionality, and imaginativeness were perceived as more creative by our sample of novice art participants. Attributes such as complexity, abstraction, and valence held less significance in determining creativity. None of the remaining art-attributes had a significant impact. Figure 3a visually presents these findings in the form of a directed acyclic graph.

The subsequent paragraphs delve into the interpretation of our findings from the perspectives of diverse research fields, including the humanities, philosophy, art history, and psychology, which have all focused on the concept of creativity. In addition to our discussion, we propose a model for future research that explores potential associations between art-attributes and creativity judgment behavior (see Fig. 3b). We also address the implications of our findings for future research, as well as the merits and limitations of our machine learning approach.

The general correlation between predictors and target were evaluated by creating a heatmap depicting all possible correlations (see Fig. S3 in Supplementary Information). This heatmap shows that there was no significant correlation between the predictors and the target variable (creativity). Our Principal Component Analysis (see Fig. S4 in Supplementary Information) further suggests that while these predictors are somewhat interrelated, they each contribute unique information to the understanding of creativity.

The most influential art-attribute was symbolism. As depicted in Fig. 2a, ratings of symbolism were associated with an increase in creativity judgments scores, particularly noticeable at a medium level of symbolism rating scores, increasing steadily even for the highest ratings. Symbolism was represented by the dimensions distinct/clear (unambiguous interpretation) versus symbolic (more space for interpretation, ambiguous). Ambiguity within symbolic presentation is an essential characteristic of art^74,75; an attribute that has generated much discussion and research since the beginning of empirical art research about 100 years ago^49,76. This distinction between artists presenting clear interpretation or using ambiguous more symbolic elements entails the ability of the artist to translate symbols (or on a meta-level, analogies) artistically and masterfully into a work of art. This distinction finds it roots in philosophical discourses by Heidegger⁷⁷, Dewey⁷⁸, and other philosophers (e.g.⁷⁹) and is most prominent contrasting different styles. For instance, realism often showcases the artist’s creative abilities through the symbolic representation of meanings⁸⁰, expressionism, however, seeks to convey emotional content through the creative use of provocative or perceptually dissonant symbols^41,81, while abstract art tries to tease the viewer’s imaginative ideas^80,82. Within all these styles, certain aspects (e.g., the specific color usage, the kinds of brushstrokes, aspects/preference of beauty, and aesthetic appeal) might lose relevance or vary depending on temporal and societal context, nonetheless, the interplay between the level of symbolism (ambiguity) and the perceived creativity appears to remain a consistent factor. This interplay is supported by studies demonstrating that great—creative—art maintains a level of enduring meaning diversity⁸³, even after the ambiguity of the depicted symbolism has been resolved^74,77. Note, the same seems true for innovative design⁸⁴.

The second most influential art-attribute is emotionality. Figure 2b shows a similar association as with symbolism: high emotionality art-ratings are associated with higher creativity judgments ratings. This association exhibits an even sharper increase at the medium level of emotionality ratings. The role of perceived emotionality presents a complex facet of study: we had asked whether the artists managed to express emotion within the artwork. However, there might be many other attributes mediating emotionality^{14,48,75,85,86,87,88}. Firstly, etymologically intertwined with emotionality is its relationship with symbolism, abstraction, and imaginativeness^85,86,87. These three attributes within art representations allow individuals to be inspired by the artwork and impinge one’s personal meaning of emotional expression, drawing on personal experiences and value systems^50,89,90. Consequently, emotional associations (experience) are probably part of this emotionality rating⁶². However, we want to note that emotionality had a very distinct impact: we found only low correlation between symbolism and emotionality (r = 0.22), and between imaginativeness and emotionality (r = 0.16).

Secondly, the quality of emotionality, might be determined by valence. Valence likely emerges from the presented content in conjunction with attributes such as symbolism, abstraction, and imaginativeness (⁴⁰, see Fig. 3b for potential associations). However, emotionality and valence (see Fig. S3 in Supplementary Information) showed very low correlations with the other attributes in general. This leaves open questions and implications for future research seeking to deepening the understanding of the concept of emotionality as creative expression in art itself.

The third most influential predictor is associated with the human faculty of imagination (Fig. 2c). Our results indicate that artworks higher judged on this variable, thus with more imaginative, fantastical, or surreal content were judged as more creative by participants, while on the other end, more realistic artworks received lower creativity scores. This association is, again, not linear and shows a rather sudden change in influence at medium rating scores. Though seldom addressed in the context of creativity judgments in art research⁴, imagination holds a pivotal role in creativity research more broadly (referring especially to Torrance and colleagues work⁶). It is a critical component for divergent thinking^65,91, for innovation⁹², driving new designs and developments⁸⁴. Imagination is also a central element in various creativity scales^1,2. Consequently, this attribute might even be crucial as a factor across different domains of creativity¹, potentially bridging the experience of viewing art with the act of creating it. Interestingly, imaginativeness and symbolism had a comparatively high correlation coefficient of 0.78 in our study. Thus, it cannot be excluded that imaginativeness might turn out to be a common denominator between creative domains, however, in the combination with symbolism might specifically address the art sector. This might account especially for non-experts³⁷. As such, the role of imagination merits further exploration in future research.

In addition to symbolism, emotionality, and imaginativeness, also the attributes complexity, abstractness, and valence predicted creativity judgments to a lesser extent, all showing a positive association with judged creativity (see Fig. S1a–c in Supplementary Information).

Complexity has been a topic in many art-research studies (e.g.^42,93,94), and was discussed as driving attribute for preference and beauty aspects. Herein, empirical art research studies found medium levels of complexity to be most preferred and liked by especially laymen²⁹. Our results regarding creativity judgments, however, show a rather steady increase in creativity judgments ratings along increasing medium levels of complexity ratings scores (see Fig. S1a in Supplementary Information). Interestingly, complexity also contributes to the aspects of ambiguity^70,74. Our correlation heatmap shows a low-level association with symbolism (r = 0.38) and imaginativeness (r = 0.36). This might be because as complexity increases, so do visual perceptual factors, which can lead to a multilayered ambiguous impression and thereby also increasing the impression of symbolism, imaginativeness, and abstractness and vice versa⁹⁵. However, as some research also claimed that a very high level of complexity might be perceived as uncomfortable and thus negative^43,44, we did not find a correlation between complexity and valence. As our main results showed that complexity is not as important as other factors, one might conclude that this formal-perceptual feature is partially ingrained in content-representational features and thereby mediating the process (see Fig. 3b). This idea needs to be tested in in-depth research; however, our research might deliver a new basis to solve the many controversial hypotheses and findings regarding the effects of complexity on art judgments^93,96, including the aspect of valence. Note, that the average level of complexity in relation to higher creativity, might differ between different art expertise levels⁹⁷, which suggest testing the model for different levels of expertise.

A similar argumentation might hold for abstractness. Abstractness is a content stylistic aspect that is qualitatively deeply connected with symbolism and imaginativeness^8,14. And, indeed, in our study there were high correlations between abstractness and symbolism (r = 0.72), and imaginativeness (r = 0.78) but not emotionality (r = 0.22). Interestingly, as abstract art has been discussed as a universally readable attribute of art⁸², abstractness is also often not liked by non-experts; hence, abstractness offers interesting research questions for expertise-related, but also cross-cultural comparisons^37,54,55.

Finally, valence, as influential predictor, with a range from negative to positive valence, is a standard variable employed in art research¹⁴. Art that evokes positive or negative valence, can be appealing and hereby influence the perception of emotionality⁹⁸. Our results focusing on creativity, nonetheless, showed a positive association between valence scores with creativity judgments, hence, more positive valence scores were associated with higher creativity at a nearly constant rate. This result might be explained by the low level of art expertise of our participants, favoring mainly positive art⁷⁵. Art experts compared to laymen, however, appreciate both positive and negative art, thus, judging art more independently of the level of valence³⁷. In addition to investigating the model in different cohorts of expertise, an interesting future viewpoint should also consider that all mentioned predictors (except for emotionality) might mediate valence and the judged creativity (Fig. 3b)^29,64. Our results furthermore indicate that for creativity, emotionality is a stronger predictor than whether the artwork was negative or positive in valence⁹⁸. Hence emotionality could be a much more central factor, where valence might be less important for the evaluation of creativity. However, again, one can speculate that this depends on the current context and person-related factors.

As mentioned already several times throughout the paper, in our study with non-art expert participants, it is crucial to consider the potential differences in rating behavior between art experts and non-experts^37,54,97. Prior literature has highlighted substantial variations in art judgments between these two groups. Non-experts tend to place greater emphasis on the content of artworks, as reflected in our findings where content-driven attributes, such as symbolism, emotionality, and imaginativeness, played significant roles in predicting creativity judgments^55,100. However, it is plausible that an analysis of expert judges’ ratings using the same art-attributes of our study could yield a different pattern of results. Considering past literature, we would assume that art experts may use more formal-perceptual attributes to evaluate an artwork, such as specific color usage or technical skill requirements like brushstroke or visualization of depths^37,101,102. As mentioned before, also the interplay of complexity and valence direction could differ between art novices and art experts, as they engage an artwork with different knowledge seeing the skill in depicting, for example, negative art or less emotional expressive art. In addition, we would expect that experts will use more art-attributes for their evaluation in general.

As for our main attributes of symbolism, emotionality, and imaginativeness, it is uncertain if they would retain the same significance in an art experts’ group, although they probably represent general markers of creativity in visual art that are independent of expertise. Further research is needed to test these assumptions and explore the systematic differences in attribute significance between art experts and non-experts.

Although our machine learning-based analysis contained many attributes that could have contributed to creativity, only few attributes explained most of the variance. Additionally, five of six significant attributes—including all three main predictors—are attributes that belong to the content-representational category (see Table 1). We would like to emphasize this since so far as machine learning has only been applied with objective attributes gained from image analysis, which rather correspond to formal-perceptive attributes^18,19,20,21. According to our results, however, humans appear to judge artwork’s creativity according to the level of content and aspects of meaning.

Our approach intends to show that art-novice behavior, using the evaluation of creativity of visual art, is inherently complex and non-linear due to the multitude of factors influencing each decision, many of which are interrelated and affect each other in a non-sequential manner. Individuals’ perceptions of art can be influenced by a range of factors including personal experiences, cultural background, emotional state, and inherent biases, all of which can change over time and in different contexts. This dynamism and its non-linearity make it challenging to accurately predict using traditional linear models. Machine learning models, on the other hand, excels in handling such complexities. Its ability to model intricate patterns and interrelationships in high-dimensional space allows for a more nuanced understanding and prediction of non-linear human behavior, making it a powerful tool in art research.

The observed stepwise shape in the graphs, especially for emotionality and imaginativeness could indicate a threshold phenomenon, a concept reminiscent of findings in psychophysics where perceptual changes do not scale linearly with stimulus variations. At these transition points, further increases in these attributes might be less efficient. This suggests that artworks may be deemed equally creative beyond a certain attribute intensity, potentially indicative of an aesthetic threshold. Previous research in the art domain has indeed suggested the existence of non-linear relationships^{3,4,29,44,103}, although a comprehensive investigation into this aspect remains under-explored, especially in the field of creativity and psychological empirical art theories. Our machine learning approach holds promise in delving deeper into the concept of an aesthetic threshold. It remains possible that these patterns may reflect inherent aspects of human aesthetic perception, and if so, our method could potentially illuminate its characteristics and implications, thereby enriching the understanding of aesthetic evaluations in art and other creative traditions.

Regarding the methods employed, our approach was a combination of RF ensemble regression³⁹ with techniques from the field of interpretable machine learning to gain insights into the associations learned by the model⁴⁶. With the prediction of creativity judgements ratings as a target of art-attributes, we introduce a comprehensive method and a newly established initial model for art judgment analysis. The introduction of machine learning into art research is a consequent development of various methods to identify predictors of aesthetic appreciation and art evaluations, beginning with pure correlation or regression analyses^27,28,29 and more recent approaches using machine learning with objective values^8,19,20,21.

However, machine learning also faces general limitations of data-driven modelling method. These comprise aspects such as no causal associations, unexplained variance in data (in our study 70% on the variance stays unexplained), unclear transferability of associations learned by models to human behavior, model selection bias (other algorithms are available), data sample bias (limited variability, e.g., age, culture, etc.), variables selection bias (including other confounding variables may lead to different results), and stimulus material selection bias. Nevertheless, interpretable machine learning overcomes some limitations of traditional data modelling. That is, interpretable machine learning offers flexible, complex, yet robust, credible, and assessable models³⁹. These models are not limited or biased by problematic assumptions on pre-determined variable interactions, by variable scales, or by model oversimplification³⁹. Such assumptions and simplifications are problematic since they can lead to irrelevant or questionable scientific theories and conclusions (e.g., on causality³⁹). This approach delivers the first steps towards establishing the basis for new hypothesis-driven research³⁹, where quantitative statistical-learning models are the basis for future research on causal relationships.

Furthermore, by working with those methods that use less assumptions about the data, we were able to identify a complex interplay of variables beyond associations that can be described by a constant factor^39,46. This is relevant for the social psychological field^30,34 looking at judgment pattern behavior and penetrating the existing approaches of art research (see for review^8,14): namely, that art judgments are made along complex patterns including sudden changes and that only a limited number of variables learn a quantitative prediction model.

While our study aimed to introduce a comprehensive approach by considering a diverse range of attributes, it needs to be acknowledged that our list of attributes is not exhaustive. There may be additional relevant factors that were not included in our analysis. Furthermore, it is crucial to note that our focus was primarily on Western art, which may limit the generalizability of our findings to other artistic traditions and cultural contexts. Additionally, our participant sample consisted solely of laymen and psychology students, which may restrict the representativeness of the results. Therefore, further research is warranted to address these limitations as implications for future research: expanding the attribute set, encompassing a broader range of artistic traditions and cultures, and incorporating diverse participant samples (e.g., art experts).

Furthermore, a potential constraint concerns the distribution of our data. The characteristics of our data distributions might have influenced the form of the predictors’ impact, leading to a step function-like shape (see Fig. S5 in Supplementary Information). This distribution pattern could have implications for the interpretation of our results and should be taken into consideration. In future studies, it would be beneficial to further explore the influence of data distribution, possibly by applying different statistical methods or transformations to ascertain the robustness of our findings.

The present study underlines the potential of machine learning in art research, and more specifically, in the study of creativity judgements⁵⁴. In doing so, we fundamentally and conceptually intervene in the question of which attribute dimensions contribute to the judgment of art as creative. We also intervene in social-cultural and communication aspects: the attributes (see Table 1) used in our study are all attributes that are also used as interpersonal linguistic tool to justify judgments^5,89, in the field of fine art and other creative areas. Judgments where we humans seem to trust ourselves to judge how creative a product or work of art is^3,30,31.

Finally, as described in the Introduction, creativity is relevant in many areas^1,2. Therefore, our study might be a promising approach for the future of creativity research in general^1,2,3. It could be applied to studying traits, states, skills, which in turn is tied to psychological phenomena and outstanding personalities such as geniuses^32,104; also, creative design and product development (non-virtual and virtual) outside the visual art domain could employ consumer-based judgment analysis. Again, interpretable machine learning⁴⁶ can reveal a more comprehensive picture of what creativity entails in individual domains; through comparison it can uncover how specific attributes are generalizable across domains in the future and how these might be universal between individuals, traditions, and cultures¹⁰⁵. Indeed, as Immanuel Kant proposed, visual art is subject to the subjective perception of each viewer while simultaneously encapsulating universal aspects of human experience.

Data availability

The datasets generated and/or analysed during the current study are available in the Figshare repository, https://doi.org/10.6084/m9.figshare.19097099.

References

Silvia, P. J., Kaufman, J. C. & Pretz, J. E. Is creativity domain-specific? Latent class models of creative accomplishments and creative self-descriptions. Psychol. Aesthet. Creat. Arts 3, 139–148 (2009).
Google Scholar
Rodriguez, R. M. et al. Taking Inventory of the Creative Behavior Inventory: An Item Response Theory Analysis of the CBI. https://doi.org/10.31234/osf.io/b7cfd (2021).
Sternberg, R. J. Handbook of Creativity (Cambridge University Press, 1999).
Google Scholar
Pelowski, M., Leder, H. & Tinio, P. P. L. Creativity in the visual arts. In The Cambridge Handbook of Creativity across Domains (eds Kaufman, J. C. et al.) 80–109 (Cambridge University Press, 2017).
Google Scholar
Becker, H. S. Art as collective action. Am. Sociol. Rev. 39, 767–776 (1974).
Google Scholar
Torrance, E. P. & Goff, K. A quiet revolution. J. Creat. Behav. 23, 136 (1989).
Google Scholar
Hanquinet, L. Visitors to modern and contemporary art museums: Towards a new sociology of ‘cultural profiles’. Sociol. Rev. 61, 790 (2013).
Google Scholar
Pelowski, M., Markey, P. S., Lauring, J. O. & Leder, H. Visualizing the impact of art: An update and comparison of current psychological models of art experience. Front. Hum. Neurosci. 10, 1–21 (2016).
Google Scholar
Beaty, R. E., Benedek, M., Silvia, P. J. & Schacter, D. L. Creative cognition and brain network dynamics. Trends Cogn. Sci. 20, 87–95 (2016).
PubMed Google Scholar
Silvia, P. J. Cognitive appraisals and interest in visual art: Exploring an appraisal theory of aesthetic emotions. Empir. Stud. Arts 23, 119 (2005).
Google Scholar
Dissanayake, E. The artification hypothesis and its relevance to cognitive science, evolutionary aesthetics, and neuroaesthetics. Cogn. Semiot. 5, 136–191 (2009).
Google Scholar
Fingerhut, J. Habits and the enculturated mind: Pervasive artifacts, predictive processing, and expansive habits. In Habits: Pragmatist Approaches from Cognitive Neuroscience to Social Science (ed. Fingerhut, J.) 352–375 (Cambridge University Press, 2020).
Google Scholar
Stamkou, E. & Keltner, D. Aesthetic revolution: The role of art in culture and social change. SSRN Electron. J. https://doi.org/10.2139/ssrn.3578575 (2020).
Article Google Scholar
Leder, H. & Nadal, M. Ten years of a model of aesthetic appreciation and aesthetic judgments: The aesthetic episode—Developments and challenges in empirical aesthetics. Br. J. Psychol. 105, 443–464 (2014).
PubMed Google Scholar
Newman, G. E. & Bloom, P. Art and authenticity: The importance of originals in judgments of value. J. Exp. Psychol. Gen. 141, 558 (2012).
PubMed Google Scholar
Cohen, J. Creativity and culture in copyright theory. UC Law Davies Rev. 40, 1151–1205 (2006).
Google Scholar
Plucker, J. A., Beghetto, R. A. & Dow, G. T. Why isn’t creativity more important to educational psychologists? Potentials, pitfalls, and future directions in creativity research. Educ. Psychol. 39, 83 (2004).
Google Scholar
Li, C. & Chen, T. Aesthetic visual quality assessment of paintings. IEEE J. Sel Top Signal Process 3, 236 (2009).
ADS Google Scholar
Iigaya, K., Fonseca, M. S., Murakami, M., Mainen, Z. F. & Dayan, P. An effect of serotonergic stimulation on learning rates for rewards apparent after long intertrial intervals. Nat. Commun. 9, 2 (2018).
Google Scholar
Iigaya, K., Yi, S., Wahle, I. A., Tanwisuth, K. & O’Doherty, J. P. Aesthetic preference for art emerges from a weighted integration over hierarchically structured visual features in the brain. BioRxiv. https://doi.org/10.1101/2020.02.09.940353 (2020).
Article Google Scholar
Iigaya, K., Yi, S., Wahle, I. A., Tanwisuth, K. & O’Doherty, J. P. Aesthetic preference for art can be predicted from a mixture of low- and high-level visual features. Nat. Hum. Behav. 5, 743 (2021).
PubMed Central PubMed Google Scholar
Bo, Y., Yu, J. & Zhang, K. Computational aesthetics and applications. Vis. Comput. Ind. Biomed. Art 1, 1–19 (2018).
Google Scholar
Greenfield, G. On the origins of the term “computational aesthetics”. In Proc. First Eurographics Conference on Computational Aesthetics in Graphics, Visualization and Imaging 9–12 (2005).
Kozbelt, A. & Seeley, W. P. Integrating art historical, psychological, and neuroscientific explanations of artists’ advantages in drawing and perception. Psychol. Aesthet. Creat. Arts 1, 80 (2007).
Google Scholar
Pelowski, M., Markey, P. S., Goller, J., Förster, E. L. & Leder, H. But, how can we make “art?” artistic production versus realistic copying and perceptual advantages of artists. Psychol. Aesthet. Creat. Arts. https://doi.org/10.1037/aca0000178 (2018).
Article Google Scholar
Pelowski, M., Gerger, G., Chetouani, Y., Markey, P. S. & Leder, H. But is it really art? The classification of images as “art”/"not art" and correlation with appraisal and viewer interpersonal differences. Front. Psychol. 8, 1729 (2017).
PubMed Central PubMed Google Scholar
Berlyne, D. E. Studies in the New Experimental Aesthetics: Steps Toward an Objective Psychology of Aesthetic Appreciation (Hemisphere, 1974).
Google Scholar
Berlyne, D. E. Dimensions of perception of exotic and pre-Renaissance paintings. Can. J. Psychol. 29, 151 (1975).
CAS PubMed Google Scholar
Cupchik, G. C. & Berlyne, D. E. The perception of collative properties in visual stimuli. Scand. J. Psychol. 20, 93–104 (1979).
CAS PubMed Google Scholar
Amabile, T. M. A consensual technique for creativity assessment. In The Social Psychology of Creativity (ed. Amabile, T. M.) (Springer, 1983).
Google Scholar
Hager, M., Hagemann, D., Danner, D. & Schankin, A. Assessing aesthetic appreciation of visual artworks—The construction of the art reception survey (ARS). Psychol. Aesthet. Creat. Arts 6, 1 (2012).
Google Scholar
Eysenck, H. J. Genius: A Natural History of Creativity (Cambridge University Press, 1995).
Google Scholar
Baer, J. & McKool, S. Assessing creativity using the consensual assessment. In Handbook of Assessment Technologies, Methods, and Applications in Higher Education (ed. Schreiner, C. S.) (IGI Global, 2009).
Google Scholar
Amabile, T. M. Social psychology of creativity: A consensual assessment technique. J. Pers. Soc. Psychol. 43, 997 (1982).
Google Scholar
Hennessey, B. A. The consensual assessment technique: An examination of the relationship between ratings of product and process creativity. Creat. Res. J. 7, 193 (1994).
Google Scholar
Fingerhut, J. Enacting media. An embodied account of enculturation between neuromediality and new cognitive media theory. Front. Psychol. 12, 993 (2021).
Google Scholar
Leder, H., Gerger, G., Brieber, D. & Schwarz, N. What makes an art expert? Emotion and evaluation in art appreciation. Cogn. Emot. 28, 1137–1147 (2014).
PubMed Google Scholar
Che, J., Sun, X., Gallardo, V. & Nadal, M. Cross-cultural empirical aesthetics. In Progress in Brain Research (eds Che, J. et al.) 77–103 (Elsevier B.V, 2018).
Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
MATH Google Scholar
Breiman, L. Statistical modeling: The two cultures. Stat. Sci. 16, 726 (2001).
MathSciNet MATH Google Scholar
Jakesch, M. & Leder, H. The qualitative side of complexity: Testing effects of ambiguity on complexity judgments. Psychol. Aesthet. Creat. Arts 9, 200–205 (2015).
Google Scholar
Van Geert, E. & Wagemans, J. Order, complexity, and aesthetic appreciation. Psychol. Aesthet. Creat. Arts 14, 135 (2020).
Google Scholar
Marin, M. M., Lampatz, A., Wandl, M. & Leder, H. Berlyne revisited: Evidence for the multifaceted nature of hedonic tone in the appreciation of paintings and music. Front. Hum. Neurosci. 10, 1–20 (2016).
Google Scholar
Berlyne, D. E. Conflict, Arousal, and Curiosity (McGraw-Hill, 1960).
Google Scholar
Osgood, C. E. Semantic differential technique in the comparative study of cultures. Am. Anthropol. 66, 171–200 (1964).
Google Scholar
Molnar, C. Interpretable Machine Learning. A Guide for Making Black Box Models Explainable. https://christophm.github.io/interpretable-ml-book/ (2019).
Chatterjee, A., Widick, P., Sternschein, R., Smith, W. & Bromberger, B. The assessment of art attributes. Empir. Stud. Arts 28, 207–222 (2010).
Google Scholar
Pelowski, M., Markey, P. S., Forster, M., Gerger, G. & Leder, H. Move me, astonish me… delight my eyes and brain: The Vienna integrated model of top-down and bottom-up processes in art perception (VIMAP) and corresponding affective, evaluative, and neurophysiological correlates. Phys. Life Rev. 21, 80–125 (2017).
ADS PubMed Google Scholar
Nadal, M. & Ureña, E. One Hundred Years of Empirical Aesthetics: Fechner to Berlyne (1876–1976) (2021).
Arnheim, R. Art and Visual Perception: A Psychology of the Creative Eye (University of California Press, 1954).
Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Vittinghoff, E. & McCulloch, C. E. Relaxing the rule of ten events per variable in logistic and cox regression. Am. J. Epidemiol. 165, 710 (2007).
PubMed Google Scholar
Cohen, J. Statistical Power Analysis for the Behavioural Sciences (Laurence Erlbaum Associates, 1988).
MATH Google Scholar
Leder, H., Gerger, G., Dressler, S. G. & Schabmann, A. How art is appreciated. Psychol. Aesthet. Creat. Arts 6, 2–10 (2012).
Google Scholar
Specker, E. et al. The Vienna art interest and art knowledge questionnaire (VAIAK): A unified and validated measure of art interest and art knowledge. Psychol. Aesthet. Creat. Arts. https://doi.org/10.1037/aca0000205 (2018).
Article Google Scholar
Chatterjee, A. The neuropsychology of visual artistic production. Neuropsychologia 42, 1568–1583 (2004).
PubMed Google Scholar
Osgood, C. E., Suci, G. J. & Tannenbaum, P. H. The Measurement of Meaning (University of Illinois Press, 1957).
Google Scholar
Ploder, A. & Eder, A. Semantic differential. In International Encyclopedia of the Social & Behavioral Sciences 2nd edn (ed. Wright, J. D.) 563–571 (Elsevier Inc., 2015).
Google Scholar
Adams, F. M. & Osgood, C. E. A cross-cultural study of the affective meanings of color. J. Cross Cult. Psychol. 4, 135 (1973).
Google Scholar
Masuda, T., Gonzalez, R., Kwan, L., & Nisbett, R. E. Culture and aesthetic preference: Comparing the attention to context of East Asians and Americans. Pers. Soc. Psychol. Bull. 34(9), 1260–1275. https://doi.org/10.1177/0146167208320555 (2008).
Article PubMed Google Scholar
Palmer, S. E., Schloss, K. B. & Sammartino, J. Hidden knowledge in asthetic judgments: Preference for color and spatial composition. In Aesthetic Science. Connecting Minds, Brains and Experience (eds Shimamura, A. P. & Palmer, S. E.) 189–222 (Oxford University Press, 2012).
Google Scholar
Brinkmann, H. et al. Cultural diversity in oculometric parameters when viewing art and non-art. Psychol. Aesthet. Creat. Arts. https://doi.org/10.1037/aca0000563 (2023).
Article Google Scholar
Lovakov, A. & Agadullina, E. R. Empirically derived guidelines for effect size interpretation in social psychology. Eur. J. Soc. Psychol. https://doi.org/10.1002/ejsp.2752 (2021).
Article Google Scholar
Beghetto, R. A. & Kaufman, J. C. Toward a broader conception of creativity: A case for “mini-c” creativity. Psychol. Aesthet. Creat. Arts 1, 73–79 (2007).
Google Scholar
Baer, J. Creativity and Divergent Thinking (Psychology Press, 2014).
Google Scholar
Fekete, A. et al. The Vienna art picture system (VAPS): A data set of 999 paintings and subjective ratings for art and aesthetics research. Psychol. Aesthet. Creat. Arts. https://doi.org/10.1037/aca0000460 (2022).
Article Google Scholar
Cawley, G. C. & Talbot, N. L. C. On over-fitting in model selection and subsequent selection bias in performance evaluation. J. Mach. Learn. Res. 11, 2079–2107 (2010).
MathSciNet MATH Google Scholar
Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning Data Mining, Inference, and Prediction 2nd edn. (Springer, 2009).
MATH Google Scholar
Redies, C. Combining universal beauty and cultural context in a unifying model of visual aesthetic experience. Front. Hum. Neurosci. 9, 1–20 (2015).
Google Scholar
Martindale, C. Recent trends in the psychological study of aesthetics, creativity, and the arts. Empir. Stud. Arts 25, 121–141 (2007).
Google Scholar
Ojala, M. & Garriga, G. C. Permutation tests for studying classifier performance. J. Mach. Learn. Res. 11, 1833–1863 (2010).
MathSciNet MATH Google Scholar
Edgington, E. & Onghena, P. Randomization Tests (CRC Press, 2007).
MATH Google Scholar
Ferguson, C. J. An effect size primer: A guide for clinicians and researchers. Prof. Psychol. Res. Pr. 40, 532 (2009).
Google Scholar
Muth, C., Hesslinger, V. M. & Carbon, C. C. The appeal of challenge in the perception of art: How ambiguity, solvability of ambiguity, and the opportunity for insight affect appreciation. Psychol. Aesthet. Creat Arts 9, 206–216 (2015).
Google Scholar
Leder, H., Belke, B., Oeberst, A. & Augustin, D. A Model of Aesthetic Appreciation and Aesthetic Judgments. Br. J. Psychol. 95, 489 (2004).
PubMed Google Scholar
Fechner, G. T. Vorschule der Aesthetik (Breikopf & Haertel, 1876).
Google Scholar
Heidegger, M. Der Ursprung des Kunstwerks (The Origin of the Work of Art) 1st edn. (Holzwege, 1971).
Google Scholar
Dewey, J. Art as Experience 1st edn. (Perigee, 1980).
Google Scholar
Benjamin, W. Das Kunstwerk im Zeitalter seiner technischen Reproduzierbarkeit: Drei Studien zur Kunstsoziologie 22nd edn. (Suhrkamp Verlag, 1996).
Google Scholar
Sidhu, D. M., McDougall, K. H., Jalava, S. T. & Bodner, G. E. Prediction of beauty and liking ratings for abstract and representational paintings using subjective and objective measures. PLoS ONE 13, 1–15 (2018).
Google Scholar
Van de Cruys, S. & Wagemans, J. Putting reward in art: A tentative prediction error account of visual art. Iperception 2, 1035–1062 (2011).
PubMed Central PubMed Google Scholar
Brinkmann, H., Commare, L., Leder, H. & Rosenberg, R. Abstract art as a universal language? Leonardo 47, 256 (2014).
Google Scholar
Jakesch, M. & Leder, H. Finding meaning in art: Preferred levels of ambiguity in art appreciation. Q. J. Exp. Psychol. 62, 2105–2112 (2009).
Google Scholar
Hekkert, P., Snelders, D. & Wieringen, P. C. W. ‘Most advanced, yet acceptable’: Typicality and novelty as joint predictors of aesthetic preference in industrial design. Br. J. Psychol. 94, 111 (2003).
PubMed Google Scholar
Schindler, I. et al. Measuring aesthetic emotions: A review of the literature and a new assessment tool. PLoS ONE 12, e0178899 (2017).
PubMed Central PubMed Google Scholar
Cupchik, G. C. Emotion in aesthetics: Reactive and reflective models. Poetics 23, 177 (1995).
Google Scholar
Marković, S. Experience and the emotional content of paintings. Psihologija 43, 47–64 (2010).
Google Scholar
Pelowski, M. Tears and transformation: Feeling like crying as an indicator of insightful or “aesthetic” experience with art. Front. Psychol. 6, 1006 (2015).
PubMed Central PubMed Google Scholar
Bourdieu, P. Distinction: A Social Critique of the Judgement of Taste (Harvard University Press, 1984).
Google Scholar
Fingerhut, J. & Prinz, J. J. Wonder, appreciation, and the value of art. In The Arts and The Brain—Psychology and Physiology Beyond Pleasure (eds Fingerhut, J. & Prinz, J. J.) 107–128 (Elsevier, 2018).
Google Scholar
Runco, M. A. Divergent thinking, creativity, and ideation. In The Cambridge Handbook of Creativity (eds Kaufman, J. C. & Sternberg, R. J.) (Cambridge University Press, 2019).
Google Scholar
Cupchik, G. C. & Hilscher, M. C. Holistic perspectives on the design of experience. In Product Experience (eds Cupchik, G. C. & Hilscher, M. C.) (Elsevier, 2008).
Google Scholar
Marin, M. M. & Leder, H. Effects of presentation duration on measures of complexity in affective environmental scenes and representational paintings. Acta Psychol. (Amst.) 163, 38–58 (2016).
PubMed Google Scholar
Clemente, A., Pearce, M. T., Skov, M. & Nadal, M. Evaluative judgment across domains: Liking balance, contour, symmetry and complexity in melodies and visual designs. Brain Cogn. 151, 105729 (2021).
PubMed Google Scholar
Commare, L., Rosenberg, R. & Leder, H. More than the sum of its parts: Perceiving complexity in painting. Psychol. Aesthet. Creat. Arts 12, 380–391 (2018).
Google Scholar
Martindale, C., Moore, K. & Borkum, J. Aesthetic preference: Anomalous findings for Berlyne’s psychobiological theory. Am. J. Psychol. 103, 53–80 (1990).
Google Scholar
Van Paasschen, J., Bacci, F. & Melcher, D. P. The influence of art expertise and training on emotion and preference ratings for representational and abstract artworks. PLoS ONE 10, 1–21 (2015).
Google Scholar
Van de Cruys, S., Chamberlain, R. & Wagemans, J. Tuning in to art: A predictive processing account of negative emotion in art. Behav. Brain Sci. 40, e377 (2017).
PubMed Google Scholar
Schepman, A., Rodway, P., Pullen, S. J. & Kirkham, J. Shared liking and association valence for representational art but not abstract art. J. Vis. 15, 11 (2015).
PubMed Google Scholar
Pelowski, M., Forster, M., Tinio, P. P. L., Scholl, M. & Leder, H. Beyond the lab: An examination of key factors influencing interaction with “real” and museum-based art. Psychol. Aesthet. Creat. Arts 11, 245–264 (2017).
Google Scholar
Leder, H. Mastering Style-Effects of Explicit Style-Related Information, Art Knowledge and Affective State on Appreciation of Abstract Paintings (2006).
Hekkert, P. & van Wieringen, P. C. W. The impact of level of expertise on the evaluation of original and altered versions of post-impressionistic paintings. Acta Psychol. (Amst.) 94, 117–131 (1996).
Google Scholar
Amabile, T. M. Creativity in Context: Update to the Social Psychology of Creativity (Westview Press, 1996).
Google Scholar
Simonton, D. K. Origins of Genius: Darwinian Perspectives on Creativity (Oxford University Press, 1999).
Google Scholar
Kant, I. Critique of Judgment 1st edn. (Hackett Publishing Company, 1987).
Google Scholar

Download references

Author information

Authors and Affiliations

Vienna Cognitive Science Hub, University of Vienna, 1010, Vienna, Austria
Blanca T. M. Spee, Helmut Leder & Matthew Pelowski
Donders Institute for Brain, Cognition and Behavior, Department of Neurology, Center of Expertise for Parkinson & Movement Disorders, Radboud University Medical Center, 6525 GC, Nijmegen, The Netherlands
Blanca T. M. Spee
Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of Vienna, 1010, Vienna, Austria
Blanca T. M. Spee, Jan Mikuni, Helmut Leder, Frank Scharnowski, Matthew Pelowski & David Steyrl
Department of Psychiatry, Psychotherapy and Psychosomatics, Psychiatric University Hospital, University of Zurich, 8008, Zurich, Switzerland
Frank Scharnowski & David Steyrl

Authors

Blanca T. M. Spee
View author publications
You can also search for this author in PubMed Google Scholar
Jan Mikuni
View author publications
You can also search for this author in PubMed Google Scholar
Helmut Leder
View author publications
You can also search for this author in PubMed Google Scholar
Frank Scharnowski
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Pelowski
View author publications
You can also search for this author in PubMed Google Scholar
David Steyrl
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.T.M.S., H.L., F.S., M.P., D.S. wrote the main manuscript text; B.T.M.S., J.M., H.L., F.S., D.S. designed the study. H.L., F.S., and M.P. supervised the study. B.T.M.S. was the lead of the study. B.T.M.S. and J.M. prepared data set, analysed rating, conducted the experiment, pre-processed the dataset. B.T.M.S. and D.S. prepared all figures and tables. D.S. conducted the machine learning analysis. All authors reviewed the manuscript.

Corresponding author

Correspondence to Blanca T. M. Spee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Spee, B.T.M., Mikuni, J., Leder, H. et al. Machine learning revealed symbolism, emotionality, and imaginativeness as primary predictors of creativity evaluations of western art paintings. Sci Rep 13, 12966 (2023). https://doi.org/10.1038/s41598-023-39865-1

Download citation

Received: 22 January 2022
Accepted: 01 August 2023
Published: 10 August 2023
DOI: https://doi.org/10.1038/s41598-023-39865-1
Springer Nature Limited

This article is cited by

Determinants of multimodal fake review generation in China’s E-commerce platforms
- Chunnian Liu
- Xutao He
- Lan Yi
Scientific Reports (2024)
Cross-cultural comparison of beauty judgments in visual art using machine learning analysis of art attribute predictors among Japanese and German speakers
- Jan Mikuni
- Blanca T. M. Spee
- David Steyrl
Scientific Reports (2024)
Using machine learning to predict artistic styles: an analysis of trends and the research agenda
- Jackeline Valencia
- Geraldine García Pineda
- Renata Teodori de la Puente
Artificial Intelligence Review (2024)

Machine learning revealed symbolism, emotionality, and imaginativeness as primary predictors of creativity evaluations of western art paintings

Abstract

Similar content being viewed by others

Cross-cultural comparison of beauty judgments in visual art using machine learning analysis of art attribute predictors among Japanese and German speakers

Quantitative Analysis of Artists’ Characteristic Styles through Biologically-Motivated Image Processing Techniques: Uncovering a Mentor to Johannes Vermeer

Computational Beauty: Aesthetic Judgment at the Intersection of Art and Science

Introduction