The aim of Study 1 was to examine to which degree gender-role expectations influence self-reports of empathy and objective performance in emotion recognition. Inasmuch as the stereotype depicts women as more empathetic and better mind-readers, we presumed that females have a motivational advantage when it comes to displaying empathic behaviors.
To test this assumption, participants were randomly assigned to two experimental conditions. In both conditions, self-reports of empathy and performance in emotion recognition were assessed. Whereas participants in the empathy condition were explicitly informed about the fact that the tasks assessed empathy (“in the subsequent tasks, your empathic capacity will be assessed”), participants in the second condition were informed that the tasks assessed ‘social-analytic abilities’ (“in the subsequent tasks, your social-analytic capacity will be assessed”), because we expected this term to be less deterrent for male participants. Based on the results of Eisenberg and Lennon (1983), we predicted that gender differences in self-reported empathic capacity (Hypothesis 1a) and objective performance in emotion recognition (Hypothesis 1b) would be smaller when individuals are not aware that empathy is measured. In both conditions, verbal intelligence was assessed as a neutral capacity and was not expected to have a reliable association with empathy (e.g., Koch et al. 2007). Whereas participants in the social-analytic condition were informed about the true nature of this task, participants in the empathy condition were told that empathy and verbal intelligence had a strong positive association. We expected to observe differences in gender-specific performance between the two conditions if women indeed have a higher motivation in proving their empathic capacity (Hypothesis 1c). To examine the possible mediating influence of gender-role expectations, we assessed the traditional gender-role orientation of all participants.
Method
Participants
In Study 1, 80 participants were excluded from the analyses because they were non-native speakers of German—a condition that had to be fulfilled to complete the verbal intelligence task. Further, nine participants were excluded because they defined their gender as neither male nor female. The final sample comprised 736 participants (494 females, 242 males; mean age = 25.5, SD = 8.3).
Procedure and Materials
Participants were randomly assigned to either be instructed that their empathic capacity or their social-analytic capacity would be assessed. Subsequently, empathy was assessed on a self-report scale. To this end, the German short version (Samson and Huber 2010) of the Empathy Quotient (EQ) by Baron-Cohen and Wheelwright (2004) was employed. It contained 13 items (α = .85). Sample item: “I can easily tell if someone else wants to enter a conversation.” All items were assessed on a scale from 1 (“strongly agree”) to 4 (“strongly disagree”). According to Baron-Cohen and Wheelwright (2004), “strongly agree” responses scored 2 points, “slightly agree” responses scored 1 point, and the remaining options scored 0 points. These scores were then summed. Performance in emotion recognition was assessed with the German version of the “Reading the Mind in the Eyes” Test (Baron-Cohen et al. 1997; revised version Baron-Cohen et al. 2001; German version Bölte 2005). It contains 36 black and white photographs of the eye region (α = .61), providing four different mental state terms to select the correct answer from (e.g., “playful”, “comforting”, “irritated”, “bored”).
As a manipulation check, participants were asked to report if either empathic capacity or social-analytic capacity had been assessed in the previous tasks. The manipulation check was partially successful, in that 66.8% in the empathy condition and 69.6% in the social-analytic condition were able to provide the expected answer. Given that the pattern of the main findings remained unchanged when excluding participants that had not passed the manipulation check, the following analyses are based on all participants.
Subsequently, participants were again provided with condition-specific information. Whereas participants in the empathy condition were informed that the following task would assess their verbal intelligence and there would be a strong association between empathy and verbal intelligence, participants in the social-analytic condition were told that the following task assessed verbal intelligence. Subsequently, verbal intelligence was assessed with the Mehrfach-Wortschatz-Intelligenztest (MWT-B) by Lehrl (2005), which is exclusively available in German. This performance test originally contains 37 items in the course of which the degree of difficulty rises gradually (α = .64). To keep the task as short as possible, only the last 15 items were used in the present research. Every item in the MWT-B consists of five words (e.g., “Tuhl – Lar – Lest – Dall – Lid“), from which only one word is an existing term in German; the other four are nonsense words. Participants are required to identify the true term, in this case, “Lid” (also translated as a/the lid). For the subsequent assessment of gender role orientation, the Traditional Masculinity and Femininity Scale (TMF) by Kachel et al. (2016) was employed. Published both in English and in German, this scale contains 6 items (α = .94), which can be completed on a scale from 1 (“very masculine”) to 7 (“very feminine”). Sample item: “I consider myself as…” After providing their demographic information, participants were asked to guess the purpose of the study. (None of the participants was able.) They were also asked to voluntarily leave their e-mail addresses for the prize draw and pointed to the possibility of contacting the study leader.
Results
Descriptive statistics and intercorrelations of the measures are reported in Table 1. Overall, across both conditions, gender was weakly negatively associated with self-reported empathy and emotion recognition, suggesting a female superiority. In contrast, verbal intelligence was positively associated with gender, suggesting a higher male performance. As expected, gender-role orientation showed a strong negative association with gender. Self-reported empathy was positively associated with emotion recognition and feminine gender-role orientation, whereas emotion recognition was associated with verbal intelligence and, again, feminine gender-role orientation, although this second correlation was smaller than the first one. In turn, gender-role orientation was negatively associated with verbal intelligence, suggesting an association with masculinity.
Table 1 Means, Standard Deviations, Bivariate Correlations with internal consistency reliabilities (Cronbach’s Alpha) in the diagonal (Study 1) To examine whether gender effects in self-reported empathy would be smaller when empathy was assessed as a social-analytic capacity (Hypothesis 1a), a two-way ANOVA was performed on the data. There was a significant main effect of gender, F(1, 732) = 14.69, p = .000, \( {\upeta}_p^2 \) = .02; female participants (N = 494, M = 14.05, SD = 5.32) reported higher empathic capacities than male participants (N = 242, M = 12.44, SD = 5.13). The main effect of condition was non-significant, F(1, 732) = .16, p = .688, \( {\upeta}_p^2 \) = .00, reflecting that the empathic condition (N = 371, M = 13.60, SD = 5.31) and the social-analytic condition (N = 365, M = 13.44, SD = 5.31) did not differ, as well as there was a non-significant interaction effect between gender and condition, F(1, 732) = .64, p = .425, \( {\upeta}_p^2 \) = .00.
As noted above, the statistically more powerful planned comparisons provide a better test of our hypotheses (Rosenthal and Rosnow 1985; Steiger 2004). To this end, we computed a new variable, distinguishing between the groups female/empathic (N = 262, M = 13.97, SD = 5.35), male/empathic (N = 109, M = 12.72, SD = 5.13), female/social-analytic (N = 232, M = 14.14, SD = 5.30), and male/social-analytic (N = 133, M = 12.22, SD = 5.13). There was homogeneity of the error variances, as assessed by Levene’s test (p > .05). The four groups significantly differed, F(3, 732) = 5.28, p = .001, η2 = .02. Planned contrasts demonstrated a significant difference between female/empathic and male/empathic, t(732) = 2.10, p = .036, r = .12. We also found a significant difference between female/social-analytic and male/social-analytic, t(732) = 3.35, p = .001, r = .18. Therefore, women rated themselves significantly higher in empathy across both conditions. Unexpectedly, however, when participants believed that their social-analytic capacities were assessed, gender differences tended to be more pronounced (see Fig. 1).
To further examine whether gender effects on performance in emotion recognition would be smaller when empathy was assessed as a social-analytic capacity (Hypothesis 1b), a two-way ANOVA was conducted. Female participants (N = 494, M = 24.62, SD = 4.00) had higher scores than male participants (N = 242, M = 23.47, SD = 4.56), F(1, 732) = 11.54, p = .001, \( {\upeta}_p^2 \) = .02. There was a non-significant main effect of condition, F(1, 732) = 0.57, p = .449, \( {\upeta}_p^2 \) = .00, reflecting that the empathic condition (N = 371, M = 24.33, SD = 4.14) and the social-analytic condition (N = 365, M = 24.14, SD = 4.31) did not differ. The interaction effect between gender and condition was non-significant, F(1, 732) = 1.61, p = .205, \( {\upeta}_p^2 \) = .00.
To further test for possible mean differences across conditions, planned contrasts were performed. Again, we distinguished between the four groups female/empathic (N = 262, M = 24.54, SD = 3.90), male/empathic (N = 109, M = 23.83, SD = 4.65), female/social-analytic (N = 232, M = 24.71, SD = 4.12), and male/social-analytic (N = 133, M = 23.17, SD = 4.48). There was homogeneity of the error variances, as assessed by Levene’s test (p > .05). The four groups significantly differed, F(3, 732) = 4.65, p = .003, η2 = .02. Planned contrasts demonstrated a non-significant difference between female/empathic and male/empathic, t(732) = 1.47, p = .142, r = .08. In contrast, there was a significant difference between female/social-analytic and male/social-analytic, t(732) = 3.38, p = .001, r = .18. Accordingly, the pattern of the first analysis, evaluating gender effects on self-reported empathy (Hypothesis 1a), was replicated for the performance in emotion recognition. In this case, there was no significant difference in the performance of males and females when participants were informed that their empathic capacities would be assessed, whereas women performed significantly better in emotion recognition than men when they believed that their social-analytic capacities were assessed. Therefore, contrary to our expectations, gender differences were not smaller when empathy was assessed as a social-analytic capacity, but actually more pronounced (Fig. 1).
To determine gender effects on a neutral performance task (verbal intelligence) when participants believed that this task was strongly associated with empathy (Hypothesis 1c), a further two-way ANOVA was performed on the data. There was a significant main effect of gender, F(1, 732) = 12.51, p = .000, \( {\upeta}_p^2 \) = .02; female participants (N = 494, M = 8.06, SD = 2.59) had lower scores than male participants (N = 242, M = 8.76, SD = 2.59). The main effect of condition was non-significant, F(1, 732) = 1.13, p = .288, \( {\upeta}_p^2 \) = .00, reflecting that the empathy condition (N = 371, M = 8.30, SD = 2.71) and the verbal intelligence condition (N = 365, M = 8.28, SD = 2.50) did not differ. There was a significant interaction effect between gender and condition, F(1, 732) = 4.06, p = .044, \( {\upeta}_p^2 \) = .01.
To further test for possible mean differences across conditions, planned contrasts were analyzed. We distinguished between the groups female/empathy (N = 262, M = 7.97, SD = 2.66), male/empathy (N = 109, M = 9.10, SD = 2.68), female/verbal intelligence (N = 232, M = 8.16, SD = 2.51), and male/verbal intelligence (N = 133, M = 8.47, SD = 2.49). There was homogeneity of the error variances, as assessed by Levene’s test (p > .05). The four groups significantly differed, F(3, 732) = 5.32, p = .001, η2 = .02. Planned contrasts demonstrated a significant difference between female/empathy and male/empathy, t(732) = −3.84, p = .000, r = .21. In contrast, there was a non-significant difference between female/verbal intelligence and male/verbal intelligence, t(732) = −1.10, p = .271, r = .06. Hence, male participants performed better than female participants on the verbal intelligence task when they were led to believe that empathy and verbal intelligence were associated. On the other hand, there were no significant gender differences when participants were solely told that we would assess their verbal intelligence. While we originally expected women to outperform men when they believed that empathy and verbal intelligence were associated, men actually achieved higher results in this condition (Fig. 1).
Analyses for the possible mediating impact of traditional gender-role expectations (Hypothesis 3a, Hypothesis 3b) will be reported for Studies 1 and 2 combined (see below).
Discussion
Study 1 lends initial support for the hypothesis that self-reported empathy and even performance in emotion recognition are subject to contextual influences. While women rated themselves significantly higher in empathic capacity, there was no significant gender difference in performance in emotion recognition when participants were told that empathy was measured. However, when they were told that their social-analytic capacity would be assessed, we observed more pronounced gender differences both in self-reported empathy and performance in emotion recognition, indicating a female superiority. This result is remarkable, insofar as we expected the term social-analytic to be less deterrent for male participants and less influenced by the subtle gender stereotype that is associated with the term empathy. In fact, using the word social-analytic increased and even created gender differences not only in self-reported capacity but also in performance in emotion recognition. In addition, we exclusively detected gender differences in the neutral task (verbal intelligence) when participants believed that it was linked to empathy. Unexpectedly, in this case, men outperformed women by solving significantly more items on the verbal intelligence task. The fact that verbal intelligence was weakly associated with masculine gender role orientation across the full sample cannot provide an explanation for this effect because gender differences were only evident in the condition that had received the manipulation. Overall, Study 1 provides evidence that both self-reported empathy and objective performance in emotion recognition can be influenced through a subtle experimental variation and that even a presumed association with the concept of empathy might induce gender differences.