Emotional information is an intrinsic part of social communication and social interactions. During social interactions, emotions are conveyed either by suprasegmental aspects of speech (e.g., prosody) and/or by the emotional semantic content of verbal expressions such as “I am feeling so excited” (e.g., Beaucousin et al., 2007; Pell, Jaywant, Monetta, & Kotz, 2011). Facial expressions, voice modulations, gestures, and even posture may also effectively communicate emotions (e.g., Ekman, 1993; Juslin & Laukka, 2003; Scherer, 1995; Wallbott, 1998). Therefore, listeners constantly face the challenge of effectively and rapidly decoding the affective states of social partners on the basis of verbal and nonverbal cues, to ensure success in social interactions (e.g., Frith & Frith, 1999; Lopes et al., 2004).

In the last decades, robust evidence has accumulated showing that we are especially adept at and tuned to decode emotional information from numerous types of signals (e.g., Kousta, Vinson, & Vigliocco, 2009; Liu et al., 2012; Pinheiro, del Re, Mezin, et al., 2013; Pinheiro et al., 2014). This is not surprising, considering the role played by the fast detection of emotional salience in approach versus avoidance behaviors, and its relevance for survival or attainment of goals. For example, listening to a pleasant birdsong in a garden with blooming flowers or being informed that “The garden is beautiful” may increase the tendency to approach such a place. Instead, listening to unexpected footstep sounds in the middle of the night or becoming aware that “The burglar is invading the house” may lead to a fight-or-flight (avoidance) response as a result of perceived danger. It is generally accepted that the detection of an emotionally salient stimulus may result in the activation of two distinct motivational systems: The defensive system is activated by threat-related stimuli, and is associated with withdrawal or attack behaviors; the appetitive system is activated by stimuli that are related to well-being and survival, and is associated with approach behaviors (M. M. Bradley, Codispoti, Sabatinelli, & Lang, 2001; M. M. Bradley & Lang, 1994).

In spite of the vast number of studies dedicated to probing the behavioral and neural correlates of emotion, as well as to exploring its philosophical and social aspects (a current PubMed search revealed 193,089 published studies on this topic), there is not a consensual definition of emotion. Nondiscrete approaches propose that emotion derives from the evaluation of a stimulus or situation along a small number of dimensions (e.g., M. M. Bradley & Lang, 2000; Russell, 2003). These dimensions include valence (the perceived pleasantness or unpleasantness of a stimulus or situation, ranging from unpleasant to pleasant), arousal (the degree of alertness or excitement elicited by the stimulus or situation, ranging from calm to arousing), and dominance (the degree of perceived control over the stimulus or situation, ranging from controlled to in control) (e.g., Bradley & Lang, 1994, 2000; Osgood, Suci, & Tannenbaum, 1957). These dimensions have been often assessed using a 9-point nonverbal pictographic self-report measure—the Self-Assessment Manikin (SAM; M. M. Bradley & Lang, 1994). This approach may be better suited to characterize stimuli for which it is difficult to attribute a primary emotional category (Kanske & Kotz, 2011). Recent studies also suggest that the representations of different affective dimensions rely on distinct cognitive and neural substrates (e.g., Dresler, Mériau, Heekeren, & van der Meer, 2009). For example, whereas the amygdala tends to respond more intensely as a function of stimulus arousal, the prefrontal cortex seems to be more sensitive to the valence of the stimulus (e.g., Kensinger & Schacter, 2006; Lewis, Critchley, Rotshtein, & Dolan, 2007; Posner et al., 2009).

On the other hand, discrete approaches to emotion propose the existence of discrete affective categories, which are triggered by specific conditions, elicit specific behavioral and biological patterns, and are considered more or less universal (e.g., Ekman, 1992; LeDoux, 1996; Öhman & Wiens, 2004; Scherer & Ellgring, 2007). Most supporters of this account agree that at least five basic emotions should be considered: anger, happiness, fear, sadness, and disgust (e.g., Ekman, 1992). In behavioral studies, these categories are typically assessed with forced choice identification tasks (in which participants are required to choose a specific emotional label, among other options, to better characterize a given stimulus), or with intensity scales (in which participants are instructed to rate the intensity of a given stimulus on several intensity scales representing different emotions). Studies probing discrete emotional categories have suggested that these categories have different neurobiological substrates, which cannot be explained by dimensional models (e.g., Buchanan et al., 2000; Grandjean et al., 2005; S. K. Scott et al., 1997).

Studies that have tested the predictive value of discrete categories on dimensional ratings (and vice versa) have demonstrated that the two types of assessments offer different information about the affective stimuli. For example, each discrete emotion may be identified by a specific set of coordinates in the multidimensional affective space, such as valence and arousal (e.g., Stevenson & James, 2008; Stevenson, Mikels, & James, 2007; Wierzba et al., 2015). Additionally, the effects of categories on affective dimensions are heterogeneous, supporting the need to reconsider dimensional and categorical approaches as independent perspectives (e.g., Stevenson et al., 2007). Therefore, dimensional and discrete approaches are not mutually exclusive: When combined, they may provide complementary information, and both should be taken into account when conducting experimental studies of emotion (e.g., Christie & Friedman, 2004; Russell, 2003; Stevenson & James, 2008; Stevenson et al., 2007; Wierzba et al., 2015). Recognizing the advantages of the combination of dimensional and categorical accounts, with the present study we aimed to collect categorical and dimensional affective ratings for 192 European Portuguese sentences, to provide a deeper understanding of the emotional features of the stimuli under study. To the best of our knowledge, this is the first battery to present emotional ratings for sentences, which is surprising, considering the critical role that sentences play in the expression of emotional meaning during social communication.

In spite of the high number of studies investigating the behavioral and neurofunctional correlates of emotional verbal stimuli, most of these studies have used single words instead of sentences. These studies have demonstrated that words such as “stupid,” “beautiful,” or “round” are processed differently, even when they are task-irrelevant, and that neutral and emotional words are associated with distinct neurofunctional mechanisms. For example, as compared with neutral information, emotional words tend to attract more attention (e.g., Anderson, 2005; Dresler et al., 2009), to be remembered better (e.g., Hadley & MacKay, 2006; Kensinger & Corkin, 2003; Kissler, Herbert, Peyk, & Junghofer, 2007; Monnier & Syssau, 2008), and to be associated with enhanced source memory (Doerksen & Shimamura, 2001)Footnote 1 and faster reaction times (e.g., G. G. Scott, O’Donnell, Leuthold, & Sereno, 2009; Stormark, Nordby, & Hugdahl, 1995). Support for the differential processing of neutral versus emotional words has also been provided by event-related potential (ERP) and functional neuroimaging studies. For example, ERPs have revealed early and later effects of emotionality, indexed, respectively, by enhanced early posterior negativity and late positive components to emotional relative to neutral words (e.g., Herbert, Junghofer, & Kissler, 2008; Kissler et al., 2007; Kissler, Herbert, Winkler, & Junghofer, 2009). The neutral–emotional differentiation may already be observed within 100 ms after stimulus onset (G. G. Scott et al., 2009). Faster processing of emotional words, reflected in earlier latency effects, has also been demonstrated (Schacht & Sommer, 2009). Some studies have shown a processing advantage for pleasant words, in support of a positivity offset account (e.g., Herbert et al., 2008; Palazova, Mantwill, Sommer, & Schacht, 2011). Furthermore, neuroimaging studies have shown that, compared to neutral words, emotional words elicit increased activation in the amygdala (e.g., Kensinger & Schacter, 2006; Lewis et al., 2007), in the left orbitofrontal gyrus and bilateral inferior frontal gyrus (Kuchinke et al., 2005), or in the left subgenual cingulate cortex (Maddock, Buonocore, Kile, & Garrett, 2003). Other studies have demonstrated that discrete emotion information (e.g., sadness) affects the processing of single words earlier than do broader affective dimensions (e.g., negativity). For example, Briesemeister, Kuchinke, and Jacobs (2014) found that the emotional category of words (happiness) affected the early N1 ERP component, whereas the valence of the words (positivity) affected later ERP components—an N400-like component and the late positive complex. Discrete categories and affective dimensions were also found to be dissociable in the brain, as suggested by different patterns of brain activation in response to, for example, happiness versus the overall positivity of words (Briesemeister, Kuchinke, Jacobs, & Braun, 2015). Examples of standardized sets of single words include the Affective Norms for English Words (ANEW; M. M. Bradley & Lang, 1999) and associated discrete norms (Stevenson et al., 2007), the Berlin Affective Word List (Võ et al., 2009) and associated discrete norms for nouns (e.g., DENN-BAWL; Briesemeister, Kuchinke, & Jacobs, 2011), the Leipzig Affective Norms for German (Kanske & Kotz, 2010), and the Nencki Affective Word List (Riegel et al., 2015).

The few studies that have probed emotional processing using more complex linguistic units have mainly focused on emotional prosody, relying on sentences with neutral semantic content and spoken with different emotional intonations (e.g., Kotz et al., 2003; Lima & Castro, 2011; Lima, Garrett, & Castro, 2013; Paulmann & Kotz, 2008a, 2008b; Pinheiro, del Re, Mezin, et al., 2013; Pinheiro, Vasconcelos, Dias, Arrais, & Gonçalves, 2015) or on pseudosentences (e.g., Paulmann & Kotz, 2008a, 2008b; Pinheiro et al., 2015; Pinheiro, del Re, Mezin, et al., 2013). Other studies have attempted to study affective processing in more natural contexts by embedding affective words in sentence contexts (e.g., Bayer, Sommer, & Schacht, 2010; Delaney-Busch & Kuperberg, 2013; Ding, Wang, & Yang, 2014; Holt, Lynn, & Kuperberg, 2009; Lüdtke & Jacobs, 2015; Martín-Loeches et al., 2012). More recently, the capacity to compute implied emotion in sentences has also been investigated (e.g., Lai, Willems, & Hagoort, 2015; Willems, Clevis, & Hagoort, 2011). This new line of research is especially important, considering that sentences represent more ecological stimuli than single words and provide contextual information that is critical for emotional meaning comprehension. However, standardized stimulus sets with normative values for affective sentences are needed. Indeed, emotion research has been hampered by the lack of stimuli controlled for both psycholinguistic and affective properties, and validated for a given cultural context. One of the greatest advantages of verbal stimuli is that they can be more precisely controlled for physical features (e.g., size) or frequency of occurrence in daily life (Kanske & Kotz, 2011). However, to the best of our knowledge, no normative ratings have been provided for affective sentences. The relevance of experimentally controlled sentences fits with recent attempts at using more natural approaches to language use (e.g., Lai et al., 2015; Willems et al., 2011). The simultaneous control of the valence, arousal, and dominance properties of sentences, as well as the accuracy in recognizing the underlying discrete categories, is crucial for experimental studies of verbal emotional processing, using behavioral and/or neuroscience methodologies. Therefore, a sentences battery stands out as a valuable resource for emotion research. It can be used, for example, to probe how semantic and prosodic types of information are integrated when they are congruent or incongruent (e.g., Wittfoth et al., 2010), or to probe the comprehension of emotional meaning in healthy subjects, in patients with brain lesions (e.g., Lalande, Braun, Charlebois, & Whitaker, 1992), or in psychopathological disorders characterized by altered emotional processing, such as schizophrenia (e.g., Pinheiro, del Re, Mezin, et al., 2013; Pinheiro et al., 2014).

The role of interindividual differences in the response to emotional verbal stimuli

It is well known that individual responses to emotional stimuli, such as sentences, may vary across individuals. Evidence from emotion research lends support to sex differences in the processing of verbal emotional information (e.g., Hamann & Canli, 2004), and particularly to a female advantage in emotion recognition (e.g., Hampson, van Anders, & Mullin, 2006; Li, Yuan, & Lin, 2008). Specifically, women tend to rate unpleasant words as more arousing and more unpleasant than do men (e.g., Kanske & Kotz, 2012; Soares, Comesaña, Pinheiro, Simões, & Frade, 2012). These studies suggest that the selection of experimental stimuli should consider sex-related affective norms (i.e., affective norms for female and male participants separately). Sex differences have also been demonstrated by studies probing either prosodic or semantic processes. Investigating the effects of emotional prosody on the processing of subsequently presented positive and negative words, Schirmer, Kotz, and Friederici (2002) found that the effects of prosodic primes on visual target words occurred earlier in women than in men. Using semantically related and unrelated word pairs, Wirth et al. (2007) provided evidence for deeper semantic analysis in female than in male participants, reflected in an earlier and longer N400 effect. These studies indicate that women and men differ in the ways they perceive verbal and emotional information.

Besides sex differences, it is coming to light that interindividual differences influence the perception and response to emotional stimuli, including words and sentences. For example, the emotional states of participants have been shown to modulate the processing of sentences: where positive mood leads to increased semantic activation, negative mood results in semantic inhibition (Federmeier, Kirson, Moreno, & Kutas, 2001; Pinheiro, del Re, Nestor, et al., 2013). Mood-congruency effects were also observed, suggesting that individuals preferentially process emotional information that is congruent with their current mood state (e.g., Bower, 1981; Mayer, McCormick, & Strong, 1995; Rusting, 1998). Along these lines, subclinical levels of anxiety and depression were also found to modulate the affective ratings of verbal stimuli (Kanske & Kotz, 2012). Moreover, some evidence supports the modulatory role of personality traits and social orientation measures (i.e., interest and concern for other individuals) in the response to emotional stimuli (Schirmer et al., 2008). The Big Five model of personality traits has been widely accepted to account for broad individual differences in the social and emotional domains, organizing these differences in five categories: extraversion, neuroticism, conscientiousness, agreeableness, and openness to experience (e.g., McCrae & Costa, 2008). Studies that have probed the links between personality and emotional processing indicate that personality dimensions associated with positive (e.g., extraversion) and negative (e.g., neuroticism) moods may effectively predict the processing and response to emotional information (e.g., B. P. Bradley & Mogg, 1994; De Pascalis & Speranza, 2000; Rusting & Larsen, 1998). Whereas the behavioral inhibition system (e.g., neuroticism) is associated with the processing of unpleasant information, the behavioral approach system (e.g., extraversion) is associated with the processing of pleasant emotional information (e.g., Gomez & Gomez, 2002). For example, levels of extraversion were positively associated with the amygdala response to happy faces (e.g., Canli, Sivers, Whitfield, Gotlib, & Gabrieli, 2002), whereas neuroticism was positively associated with amygdala activation in response to film clips of snakes (Fischer, Tillfors, Furmark, & Fredrikson, 2001). Therefore, it is expected that individuals preferentially process emotional stimuli that are congruent with their stable personality traits (trait-congruency effects; Rusting, 1998).

Furthermore, individual differences in empathy also seem to modulate the response to emotional stimuli. The empathy construct involves at least the capacities to feel what another person is feeling (affective dimension), to know what the other person is feeling (cognitive dimension), and to react to the experience of the other person (behavioral dimension; Decety & Jackson, 2006; Derntl et al., 2010; Shamay-Tsoory, Aharon-Peretz, & Perry, 2009). For example, Singer and collaborators (2004) demonstrated that, when viewing signals indicating that a loved one was experiencing pain, women with higher levels of empathy showed more activation in brain regions involved in the affective experience of pain than did women with lower empathy levels.

As such, the role of interindividual differences in influencing the response to affective sentences needs particular clarification. In an attempt to shed light on this research question, we additionally probed how interindividual differences in empathy, mood, and personality influence the ways that affective sentences are assessed both dimensionally and categorically.

The present studies and hypotheses

In our first study, we probed how individuals rated the affective properties of sentences. To investigate this question, we designed a set of 192 simple neutral, positive, and negative declarative sentences—the Minho Affective Sentences (MAS), which were strictly controlled for psycholinguistic properties such as the numbers of words and letters in the sentence, and per million word frequency. The sentences were designed to represent examples of each of the five basic emotions (anger, sadness, disgust, fear, and happiness; Ekman, 1992) and of neutral situations. These sentences were presented to 536 participants who rated the stimuli using both dimensional and categorical measures of emotions. For the dimensional assessment, participants were instructed to rate the valence, arousal, and dominance of the sentences using the SAM scale. For the categorical assessment, they were instructed to rate the intensity of each sentence on six intensity scales: anger, sadness, disgust, fear, happiness, and neutrality. As in single-word datasets such as the ANEW, we hypothesized higher arousal, lower valence, and lower dominance ratings for negative than for positive and neutral sentences, and higher valence ratings for positive than for negative and neutral sentences (e.g., Soares et al., 2012). Also, we expected that sentences expressing a given emotion would be rated with higher intensity ratings in the corresponding emotion scale. Because previous studies had indicated that the effects of emotion categories on affective dimensional ratings are heterogeneous (i.e., each discrete category may influence affective dimensions in unique ways; Stevenson et al., 2007), we also hypothesized that the predictive value of each affective dimension (valence, arousal, or dominance) for the categorical ratings would differ as a function of the discrete emotion type (e.g., Stevenson et al., 2007).

On the basis of a robust body of evidence demonstrating differences between women and men in ratings of affective stimuli (e.g., Soares et al., 2012; Soares et al., 2013, 2015), we additionally probed the influence of participants’ sex on the sentences evaluations. We expected differences in affective ratings between male and female participants. Specifically, we hypothesized that women would rate sentences as more arousing, and that they would use more extreme valence scores to rate affective sentences, particularly those expressing negative content, in keeping with previous studies on single-word affective ratings in Portuguese (e.g., Soares et al., 2012) and in other languages (e.g., Spanish; Redondo, Fraga, Padrón, & Comesaña, 2007). We also expected increased intensity ratings on the scale reflecting the a priori MAS classification in female relative to male participants (e.g., Hampson et al., 2006; Li et al., 2008).

In our second study, we collected information regarding personality, empathy, and mood from a subset of 40 participants, to investigate whether and how the affective ratings of the sentences would be related to these interindividual measures. On the basis of previous findings (e.g., Schirmer et al., 2008), we hypothesized that personality traits such as extraversion (reflecting the tendency to approach other individuals and to be engaged in social interactions) or empathy levels (reflecting the capacity to adopt and understand somebody else’s point of view) would be variables of interest for the present study, by modulating the ways that individuals would react to the MAS affective sentences. In addition, we hypothesized that the affective normative data would be influenced by the affective states of the participants at the time of the assessment (e.g., Pinheiro, del Re, Nestor, et al., 2013). Specifically, we expected that higher levels of extraversion would be associated with higher valence ratings of positive sentences, whereas neuroticism would be associated with lower valence ratings of negative sentences (e.g., B. P. Bradley & Mogg, 1994; Canli et al., 2002; De Pascalis & Speranza, 2000; Gomez & Gomez, 2002; Rusting & Larsen, 1998). Moreover, we hypothesized that higher empathy levels would be associated with more extreme valence scores (i.e., negative sentences would be rated as more unpleasant and positive sentences would be rated as more pleasant) and arousal ratings (i.e., emotional sentences would be rated as more arousing) (e.g., Singer et al., 2004).

Finally, we hypothesized that positive mood states would be associated with increased perceived pleasantness and arousal of the positive sentences. On the other hand, negative mood states would be associated with a bias for unpleasant or negative information, reflected in increased perceived unpleasantness and arousal of the negative sentences. This would be in line with observations of mood-congruency effects (e.g., Bower, 1981).

Study 1: MAS dimensional and categorical norms

The aim of our first study was to provide norms for dimensional (valence, arousal, and dominance) and categorical (intensity on five emotional scales and one neutral scale) ratings of 192 affective sentences. An additional goal was to probe whether and how female and male participants differed in the affective ratings of sentences. In addition, we aimed to test the predictive value of the affective dimensions on discrete ratings of the MAS sentences.

Method

Participants

A sample of 536 European Portuguese healthy subjects (299 female, 237 male; mean age = 23.29 years, SD = 6.02, age range 14–64 years) participated in the experiment. All participants were European Portuguese native speakers and reported normal or corrected-to-normal visual acuity: 54.7 % had completed 12 years of education, an additional 24.6 % had a bachelor’s degree, another 14.9 % had a master’s degree, 4.1 % had completed only 9 years of education, and 1.7 % had a PhD degree.

Participants whose responses were illustrative of nondiscriminative ratings or suggestive of inattention (e.g., repeatedly using the same values across dimensions to rate different sentences) were excluded from the total sample (nine were excluded and not considered for the analyses).

Stimuli

A set of 192 sentences in European Portuguese was developed to represent five basic emotions (happiness, sadness, anger, fear, and disgust) and a sixth neutral (control) category. These sentences covered the six categories usually considered in emotion research. Overall, the sentences were intended to convey neutral (n = 64), positive (n = 64), or negative (n = 64) semantic content. Specifically, the positive sentences were designed to represent the emotion happiness, in line with the basic-emotions account (Ekman, 1992), and the negative sentences covered the four basic negative categories: anger (n = 16), fear (n = 16), disgust (n = 16), or sadness (n = 16). The affective content of the MAS sentences was previously assessed by two judges with expertise in language and emotion research. While creating these sentences, the authors focused on the simplicity and familiarity of their semantic content. All exemplars portrayed common daily activities and situations that were intentionally chosen to evoke specific emotional reactions (anger, fear, disgust, sadness, and happiness) and, additionally, the absence of emotion (neutral). Thus, the MAS corpus can be used across a broad range of populations (clinical and nonclinical) and age groups.

The sentences in each emotional category were matched for length (mean number of words and mean number of letters within words in the sentence) and per-million word frequency, considering the part of speech (PoS) of each word in the sentence, obtained from the Procura-PALavras lexical database (P-PAL; Soares et al., 2014, available at http://p-pal.di.uminho.pt/tools). Compound nouns were excluded. Table 1 illustrates the psycholinguistics characteristics of the MAS sentences by emotional category.

Table 1 Psycholinguistic characterization of the MAS sentences, according to emotional category

As expected, when comparing neutral, positive, and negative sentences, no differences were observed between the three MAS valence types in the mean number of words, F(2, 189) = 1.274, p > .05, mean number of letters, F(2, 189) = 5.474, p > .05, or per-million word frequency, F(2, 189) = 0.775, p > .05. Similarly, when comparing the four negative categories (anger, fear, disgust, and sadness), no differences were observed regarding the mean number of words, F(3, 60) = 0.916, p > .05, mean number of letters, F(3, 60) = 2.216, p > .05, and mean word frequency, F(3, 60) = 0.194, p > .05. Thus, the different MAS sentence categories were equivalent in their underlying psycholinguistic properties, which is critical for avoiding potential confounds in the emotional ratings.

Procedure

A Web survey procedure was used for data collection, due to advantages such as speed, accuracy, and low cost, and particularly the easy access provided to a larger number of participants (see, e.g., Couper, 2000). Data were collected using a website in HTML format that was specifically developed for this experiment, following the recommendations of Burke and James (2006): (1) clearly describing the aims of the research; (2) identifying the research team by providing e-mail contacts; (3) including clear instructions and informing about the estimated time to complete the survey; and (4) emphasizing privacy policy and data confidentiality. The experiment was advertised by e-mail or through the website of the research group (www.nerp-um.com). Students were given course credit for their participation. For the rest of the participants, a lottery of a gift voucher (value: €50) to a national bookstore chain was used as an incentive.

The online survey began with general and specific instructions for performing the experiment. Participants were also reminded about the volunteer nature of their participation and the confidentiality of the results. After providing socio-demographic information (age, educational level, and linguistic skills), participants had to rate a maximum of 28 neutral sentences, 28 positive sentences, and 28 negative sentences (representing the four negative categories—seven from each category), randomly selected from the full set of sentences (192). This was done to reduce fatigue and distraction. The order of the sentences was pseudorandomized across participants.

Participants were first asked to rate each sentence, in terms of valence (completely unpleasant to completely pleasant), arousal (completely calm to completely excited), and dominance (completely controlled by to completely in control), using the 9-point SAM scale (M. M. Bradley & Lang, 1994). After rating the three affective dimensions (valence, arousal, and dominance) using the SAM, participants were asked to rate the intensity of each sentence in six emotional categories (anger, fear, disgust, sadness, happiness, and neutrality) using a 5-point intensity scale, ranging from (1) low intensity to (5) high intensity (following Stevenson et al., 2007). The order of the tasks (dimensional and categorical assessments) was the same for all participants, following previous studies that had implemented dimensional and categorical stimulus ratings (e.g., Belin, Fillion-Bilodeau, & Gosselin, 2008). Examples of the sentences (not used in the experiment) were provided at the beginning of each assessment, to help participants get familiarized with the rating procedure and the scales. Participants were not only reminded that a personal, subjective rating was required (and, therefore, that there were not correct or false answers), but also encouraged to rate the sentences according to their first impressions. During the assessment, each sentence remained at the center of the computer screen until a response was made. Sentences were presented in Arial 30-point font. Participants used the numbers on the keyboard to indicate their response. Answers were automatically saved after the participant’s response, and the next sentence was presented. No time limit was imposed, but participants were encouraged to answer as quickly as they could. On average, the procedure took about 30 min to complete. Each sentence in the dataset was rated by at least 97 participants (M = 122.88, SD = 9.96, range = 97–145).

Results

In this first study, we aimed to collect normative values for the valence, arousal, dominance, and intensity of the MAS sentences. The norms for each of the 192 MAS sentences are presented in the supplemental materials. Mean values (M) and standard deviations (SD) for the valence, arousal, dominance, and intensity of each of the abovementioned categories are presented for each of the 192 sentences.

The mean values and SDs for neutral, positive, and negative sentences, and for each negative category (anger, sadness, disgust, and fear) in the total sample, are provided in Table 2. Following the procedure adopted by single-word normative studies (e.g., ANEW; Soares et al., 2012), we also present means and SDs for each sentence category considering female and male participants separately.

Table 2 Mean ratings of valence, arousal and dominance for each MAS sentence category in the total sample, and in female and male participants separately

Dimensional assessment

Figure 1 shows the bivariate distribution of valence and arousal ratings averaged across all participants. The distribution confirms the typical boomerang-shaped distribution reported by previous studies that have probed affective ratings for verbal (e.g., M. M. Bradley & Lang, 1999; Soares et al., 2012) and nonverbal (e.g., M. M. Bradley & Lang, 2007; Lang, Bradley, & Cuthbert, 2008; Soares et al., 2013, 2015) stimuli. Similarly to pictures (e.g., Soares et al., 2015), sounds (e.g., Soares et al., 2013), and words (e.g., Soares et al., 2012), sentences rated as either highly pleasant or highly unpleasant were simultaneously rated as more arousing. The quadratic relationship (r quad = .89, p < .001) explained 78.8 % of the variance. We also probed the relationships between valence and arousal in female and male participants separately. The percentage of variance explained by the quadratic relationship was higher in female (r quad = .90, p < .001, R 2 = 81.5 %) than in male (r quad = .82, p < .001, R 2 = 66.4 %) participants.

Fig. 1
figure 1

Relationship between valence and arousal in the total sample (A), and in female (B) and male (C) participants separately

Using repeated measures analyses of variance (ANOVAs), we tested the effects of MAS sentence valence (neutral, positive, or negative, based on the a priori sentence classifications) on the valence, arousal, and dominance ratings. All analyses using repeated measures ANOVA were corrected for nonsphericity using the Greenhouse–Geisser method (the original df is reported). All significance levels are two-tailed, with the present significance alpha level of p < .05. Main effects and interactions were followed by pairwise comparisons between conditions, using the Bonferroni adjustment for multiple comparisons. The SPSS 22.00 (SPSS Corp., USA) software package was used for the statistical analyses.

In a subsequent analysis, we tested the effects of negative category type on these dimensional ratings, considering the four negative categories separately, as each includes the same number of sentences (n = 16). For each analysis, participants’ sex was added as a between-subjects factor to assess sex differences.

Valence

Valence ratings differed as a function of MAS sentence valence (i.e., neutral, positive, or negative), F(2, 1070) = 4149.78, p < .001, η p 2 = .886 (Table 2). Planned pairwise comparisons revealed that positive sentences were rated as more pleasant than both neutral (p < .001) and negative (p < .001) sentences, and that neutral sentences were rated as more pleasant than negative sentences (p < .001). A significant interaction between valence and participant’s sex, F(2, 1068) = 58.75, p < .001, η p 2 = .099, indicated that negative sentences were rated as more negative (p < .001) and positive sentences were rated as more positive (p < .001) by female than by male participants, but the sexes did not differ in their valence ratings of neutral sentences (p > .05).

In a second ANOVA testing the four negative categories only, a main effect of negative category indicated differences between anger, fear, disgust, and sadness sentences, F(3, 1605) = 48.10, p < .001, η p 2 = .082 (Table 2). Sentences expressing fear were rated as less unpleasant than sentences expressing disgust (p < .001), anger (p < .001), and sadness (p < .001), whereas sentences expressing disgust were rated as less unpleasant than sentences expressing anger (p < .001) and sadness (p < .001). Anger and sadness sentences did not differ in valence ratings (p > .05), being perceived as more unpleasant than sentences expressing fear (p < .001) and disgust (p < .001). A main effect of participant’s sex, F(1, 534) = 87.47, p < .001, η p 2 = .141, revealed that women rated all types of negative sentences as more unpleasant than did men.

Arousal

The ANOVA comparing arousal ratings between neutral, positive, and negative sentences indicated a main effect of MAS sentence valence, F(2, 1070) = 418.11, p < .001, η p 2 = .439 (Table 2). Specifically, negative sentences were rated as more arousing than positive (p < .001) and neutral (p < .001) stimuli, whereas positive sentences were rated as more arousing than neutral sentences (p < .001).

The second ANOVA comparing anger, fear, disgust, and sadness sentences revealed differences between the negative emotional categories, F(3, 1605) = 54.88, p < .001, η p 2 = .093 (Table 2): Both fear and anger sentences were perceived as more arousing than sadness (p < .001) and disgust (p < .001) sentences, whereas fear and anger sentences did not differ in arousal ratings (p > .05). A significant main effect of participant’s sex, F(1, 534) = 31.42, p < .001, η p 2 = 0.056, indicated that women rated all types of negative sentences as more arousing than did men.

Dominance

The ANOVA testing the effects of MAS sentence valence (neutral, positive, and negative) on dominance ratings yielded a significant effect of valence, F(2, 1070) = 1,133.20, p < .001, η p 2 = .679 (Table 2). Negative sentences were rated with the lowest dominance ratings when compared to both neutral (p < .001) and positive (p < .001) sentences; neutral sentences received lower dominance scores than positive sentences (p < .001). The significant interaction between valence and participant’s sex, F(2, 1068) = 9.03, p < .001, η p 2 = .017, showed that female participants used lower dominance ratings than did men when rating negative (p < .001) and neutral sentences (p = .037), but not with positive sentences (p > .05).

In the second ANOVA, comparing the four negative categories, a main effect of sentence category, F(3, 1605) = 33.74, p < .001, η p 2 = .059 (Table 2), indicated that disgust sentences received higher dominance ratings than did anger, sadness, and fear sentences (ps < .001 for all comparisons). A main effect of participant’s sex, F(1, 534) = 17.44, p < .001, η p 2 = .032, revealed that women rated all negative categories with lower dominance ratings than did men.

Categorical assessment

Table 3 shows the mean intensity ratings for each sentence category in the six intensity scales.

Table 3 Mean intensity ratings per MAS emotional category in the total sample, and in female and male participants separately

To probe which intensity scale received the highest score in the assessment of each MAS category (neutral, happiness, anger, sadness, fear, and disgust) and to probe sex differences in intensity ratings, we ran a separate repeated measures ANOVA for each MAS category, with scale (neutral, happiness, anger, sadness, fear, or disgust) as a within-subjects factor and sex as a between-subjects factor.

Intensity ratings were the highest in the scale corresponding to the a priori MAS sentence classification (e.g., neutral intensity scale for neutral MAS sentences), as revealed by a main effect of scale: neutral, F(5, 2675) = 1,458.91, p < .001, η p 2 = .732 (ps < .001 for all comparisons); positive/happiness, F(5, 2675) = 1,941.42, p < .001, η p 2 = .784 (ps < .001 for all comparisons); anger, F(5, 2675) = 559.51, p < .001, η p 2 = .586 (ps < .001 for all comparisons); fear, F(5, 2675) = 607.00, p < .001, η p 2 = .532 (ps < .001 for all comparisons); disgust, F(5, 2675) = 779.83, p < .001, η p 2 = .593 (ps < .001 for all comparisons); and sadness, F(5, 2675) = 959.33, p < .001, η p 2 = .642 (ps < .001 for all comparisons). Furthermore, with the exception of anger, a significant intensity scale and sex interaction indicated that female participants rated the sentences with higher intensity values on the scale corresponding to the a priori MAS classification, when compared with men [neutral, F(5, 2670) = 8.53, p < .002, η p 2 = .016; positive/happiness, F(5, 2670) = 16.77, p < .001, η p 2 = .030; fear, F(5, 2670) = 27.05, p < .001, η p 2 = .048; disgust, F(5, 2670) = 18.12, p < .001, η p 2 = .033; sadness, F(5, 2670) = 15.62, p < .001, η p 2 = .028].

Relationship between dimensional and categorical ratings

Multiple linear regressions were run to predict dimensional ratings from categorical intensity ratings, following Stevenson and collaborators (2007). This was done for neutral, positive, and negative sentences separately, and, subsequently, for the four types of negative categories (anger, fear, sadness, and disgust). Dimensional ratings were taken as predictors, and the dependent variable was the intensity ratings for each MAS category on the corresponding intensity scale. Table 4 shows standardized β coefficients for the three affective dimensions.

Table 4 Regressions of dimensional ratings predicting discrete emotional categories

The regressions showed that the categorical intensity ratings were significantly predicted by the three affective dimensions. This was the case for neutral sentences, F(3, 188) = 188.332, p < . 001, R 2 = .750; positive sentences, F(3, 188) = 817.507, p < .001, R 2 = .929; and negative sentences, F(3, 188) = 927.942, p < .001, R 2 = .937. This was also observed for all types of negative categories: anger, F(3, 188) = 96.303, p < .001, R 2 = .606; disgust, F(3, 188) = 43.102, p < .001, R 2 = .408; sadness, F(3, 188) = 165.627, p < .001, R 2 = .725; and fear, F(3, 188) = 104.919, p < .001, R 2 = .626. However, in the cases of anger and happiness, dominance did not add statistically significantly to the prediction (p > .05); in the case of sadness, valence did not add statistically significantly to the prediction (p > .05); in the case of fear, arousal did not add statistically significantly to the prediction (p > .05).

Interim summary

The results from Study 1 provided support for the a priori MAS classifications and indicated that the sentences from each category were reliably distinguished in both the dimensional and categorical ratings. They additionally revealed that women and men reacted differently to the affective sentences. Specifically, women used more extreme valence scores, rating negative sentences as more unpleasant and positive sentences as more pleasant than did men. They also rated negative sentences as more arousing, and both negative and neutral sentences with lower dominance scores than did men. Furthermore, women judged the sentences as being more intense on the corresponding intensity scale, with the exception of anger.

Study 2: Relationships between affective ratings and empathy, mood, and personality

Considering the role played by differences in empathy, personality traits, and mood in modulating the response to emotionally salient stimuli (e.g., Canli et al., 2002; Federmeier et al., 2001; Fischer et al., 2001; Pinheiro, del Re, Nestor, et al., 2013; Schirmer et al., 2008; Singer et al., 2004), in our second study we probed the relationship between individual differences on these variables and the affective ratings of the MAS sentences. A subsample of 40 participants was selected from the original sample that had assessed the MAS sentences both dimensionally and categorically. These participants completed empathy, personality, and mood questionnaires. A correlational analysis tested the associations between these measures and the MAS affective ratings.

Method

Participants

A subset of 40 participants from the original sample (20 female, 20 male; mean age = 21.88 years, SD = 3.22, age range 18–30 years) completed empathy, personality, and mood questionnaires (see Table 5). All participants provided written informed consent for the experimental protocol approved by a local Ethical Committee. The study was conducted according to the principles expressed in the Declaration of Helsinki.

Table 5 Participants’ scores on empathy (IRI), personality (NEO-FFI), and mood (PANAS) assessments

Measures

The NEO Five-Factor Inventory (NEO-FFI; Costa & McCrae, 1992; adapted to European Portuguese by Magalhães et al., 2014), composed of 60 items, was used to assess five personality dimensions: conscientiousness, extraversion, neuroticism, openness to experience, and agreeableness. This inventory has been consistently used in different cultural contexts to assess personality (e.g., Costa & Mccrae, 1994; Costa, Terracciano, & McCrae, 2001; McCrae et al., 1999). The Portuguese adaptation of the NEO-FFI revealed high levels of internal consistency and supported its validity and reliability (Magalhães et al., 2014).

The interpersonal reactivity index (IRI; Davis, 1983; adapted by Limpo, Alves, & Castro, 2010) was used to measure empathy. The IRI is based on a multidimensional perspective of empathy. As such, this index is composed of four subscales: perspective taking (i.e., the tendency to adopt somebody else’s perspective), empathic concern (i.e., the tendency to experience compassion and concern for others), personal discomfort (i.e., experience of anxiety and discomfort in tense interpersonal contexts), and fantasy (i.e., the tendency to imagine oneself in fictional situations). The IRI is one of the most widely used measures to assess empathy (e.g., Cliffordson, 2001; Haker & Rössler, 2009; Raposo, Vicens, Clithero, Dobbins, & Huettel, 2011). The validation to Portuguese confirmed the adequate internal consistency and good reliability of this instrument in the assessment of empathy (Limpo et al., 2010).

The Positive and Negative Affect Schedule (PANAS) was used to assess the participants’ current mood state (Watson & Tellegen, 1985; adapted by Galinha & Ribeiro, 2005). The PANAS has been widely used in emotion and mood research (e.g., Brown, Chorpita, & Barlow, 1998; Crawford & Henry, 2004; Fredrickson & Joiner, 2002; Ostir, Smith, Smith, & Ottenbacher, 2005). It includes ten adjectives covering positive mood (e.g., “excited,” “aroused”) and ten adjectives covering negative mood states (e.g., “nervous,” “scared”). Participants indicate the extent to which they have felt each emotion during the past two weeks. Higher scores reflect higher positive and/or negative affect.

Procedure

Participants were tested individually. First, they rated the valence, arousal, dominance, and intensity of the 192 sentences (Study 1). After, they were asked to complete the empathy (IRI), personality (NEO-FFI), and mood (PANAS) questionnaires.

Results

First, we tested whether male and female participants differed in personality (five factors—conscientiousness, extraversion, neuroticism, openness to experience, and agreeableness), empathy (four subscales—perspective taking, empathic concern, personal discomfort, and fantasy), and mood (two dimensions—positive and negative affect) scores. Independent-sample t tests were conducted to probe sex differences. Empathic concern (IRI) was higher in female than in male participants, t(38) = –4.666, p < .001, as was personal discomfort (IRI), t(38) = 2.678, p = .013. No other sex differences were observed (p > .05). On the basis of these findings, correlations were tested in the total sample for all measures, with the exception of empathic concern and personal discomfort: For these variables, the association with affective ratings was tested in female and male participants separately. Pearson’s r correlation coefficient was calculated, to test the relationships between affective ratings (valence, arousal, dominance, and intensity)Footnote 2 and empathy, personality, and mood scores.

Personality measures were not significantly associated with the affective ratings (p > .05). Furthermore, no significant association was found between ratings of neutral sentences and the individual measures (p > .05). The Pearson’s r coefficients describing the associations between the ratings of positive and negative sentences and the empathy and mood scores are presented in Table 6.Footnote 3

Table 6 Correlations between affective ratings of the positive and negative sentences and participants’ empathy and mood scores

Positive sentences

When considering the total sample, increased valence (p = .035) and intensity (p = .024) scores were associated with higher IRI perspective taking. Furthermore, increased arousal ratings were associated with higher positive affect in the total sample (p = .008). No significant correlations were observed for dominance ratings (p > .05). In males only, increased valence scores were associated with higher IRI empathic concern (p = .012). In females, no association was found between the affective ratings of positive sentences and either empathic concern or personal discomfort (p > .05).

Negative categories (anger, fear, sadness, and disgust)

When considering the four negative categories separately, we observed that increased empathy scores were associated with increased perceived unpleasantness and arousal of the distinct types of negative sentences in the total sample. Specifically, increased empathic perspective taking was associated with less pleasant (p = .027) and more arousing (p = .036) ratings of sadness sentences.

In male participants, increased IRI empathic concern was associated with lower valence ratings (p = .009) and higher arousal ratings (p = .005) of sadness sentences, and with lower valence ratings of fear sentences (p = .04). In females, no association was found between affective ratings of the negative sentences and either empathic concern or personal discomfort (p > .05).

Interim summary

The results from Study 2 indicated that empathy and mood, but not personality traits, modulated the affective ratings of positive and negative sentences only. Empathic concern and fantasy were higher in female than in male participants, but no sex differences were observed when contrasting the scores for the other variables.

Overall, ratings of negative and positive sentences, but not of neutral sentences, were associated with differences in empathy and mood scores, when considering the total sample. In particular, in males only, the higher the empathic concern, the more unpleasant and arousing negative sentences were perceived to be, and the more pleasant positive sentences were perceived to be.

General discussion

In the present study, we aimed to probe how the MAS sentences, designed to represent different emotions, are rated both dimensionally and categorically. An additional aim was to explore how sex and individual variables, such as personality, empathy, and mood, modulate the affective ratings of the MAS sentences. These norms provide a means to investigate the two approaches to emotion with a set of more ecologic verbal stimuli. Moreover, they corroborate the emotional potential of simple sentences (e.g., Lüdtke & Jacobs, 2015) and, in particular, the observation that emotion may be derived from word combinations in sentences (Lai et al., 2015).

Our first study confirmed that the affective norms of the MAS sentences are valid measures to guide the selection of stimuli for experimental studies of emotion. The dimensional results corroborated the a priori sentence categorizations: negative sentences (e.g., “The boss humiliated the team”) were classified as the most unpleasant and arousing verbal stimuli, whereas positive sentences (e.g., “The actor is hilarious”) were classified as the most pleasant verbal stimuli. In addition, we confirmed a tendency previously observed when investigating the affective ratings of pictures (e.g., Soares et al., 2015), sounds (e.g., Soares et al., 2013), and single words (e.g., Soares et al., 2012): the quadratic function underlying valence and arousal ratings indicated that highly pleasant and unpleasant sentences were more likely to be rated as highly arousing. This finding demonstrates the comparability of the present findings to previously published affective norms for single words, pictures, and sounds (e.g., M. M. Bradley & Lang, 1999; Grühn & Scheibe, 2008; Moors et al., 2013; Redondo et al., 2007; Soares et al., 2012; Soares et al., 2013, 2015; Võ et al., 2009; Warriner, Kuperman, & Brysbaert, 2013).

Our results for the discrete assessment revealed that all sentences were classified with the highest intensity scores on the scale corresponding to the a priori sentence classification (e.g., positive sentences received the highest intensity ratings on the happiness scale). Both dimensional and categorical ratings indicated that neutral (e.g., “The ink is blue”) and emotional (e.g., “The garden is beautiful”) sentences are reliably distinguished, and suggested that these sentences are effective stimuli that can be used in experimental research aiming to probe verbal emotional processing of sentence contexts.

In keeping with previous studies (e.g., Stevenson et al., 2007), the regression analysis demonstrated that the information provided by dimensional ratings is not the same affective information provided by categorical ratings. Even though categorical intensity ratings were significantly predicted by the dimensional data for all categories, we observed that specific dimensions did not have a significant impact on intensity ratings of the specific categories. These findings indicate that discrete emotions are associated with different weights of the three affective dimensions, at least when they are expressed verbally: where anger and happiness may be better conveyed through the modulation of valence and arousal, sadness may be better conveyed through the modulation of arousal and dominance, and fear may be better conveyed through the modulation of valence and dominance. In other words, the relevance of specific affective dimensions seems to vary as a function of emotional category type. Furthermore, we observed that the level of prediction was the highest for sentences conveying happiness, and lowest for sentences conveying disgust. We should note that the sign of the β coefficient was positive in the case of valence predicting fear, unlike with the other negative categories, indicating that an increase in valence corresponded to an increase in the intensity ratings of fear sentences. Because sentences expressing fear were perceived as less unpleasant than the other three negative categories, this may explain the specific difference. It is possible that fear is less effectively conveyed through sentences than through pictures or sounds, pointing to the need to consider the effects of stimulus type in the explanation of emotional effects. Although a discussion of the processes or mechanisms underlying the difference between fear and the other negative pictures goes beyond the scope of this article, another possibility is that fear represents a special case of a negative emotion, in line with the assumption that distinct emotional categories are processed by distinct neural substrates (e.g., Calder, Lawrence, & Young, 2001). This hypothesis should be addressed in future studies.

The heterogeneous effects of the affective dimensions on specific categories lend support to the idea that dimensional and categorical approaches to emotion are not independent, and therefore should be integrated (e.g., Stevenson & James, 2008; Stevenson et al., 2007). In an attempt to control for as many variables as possible, researchers should take both perspectives into consideration when selecting stimuli for studies on emotion.

Our results also confirmed a consistent finding in the emotion literature—that is, that men and women tend to perceive affective stimuli differently. In keeping with our previous findings (e.g., Soares et al., 2012; Soares et al., 2013, 2015), women rated negative sentences as more pleasant and more arousing, and positive sentences as more pleasant than did men. Also, women provided lower dominance ratings than did men when rating neutral and negative sentences. Furthermore, female participants perceived the sentences as more intense than did male participants, with the exception of angry sentences. In particular, the U-shaped association between valence and arousal was stronger in female than in male participants, showing that women had a stronger disposition to use more extreme arousal scores to rate emotional (vs. neutral) sentences than did men. This finding supports the observation that women tend to show increased defensive and appetitive activation in response to emotional stimuli, with the exception of erotica (e.g., M. M. Bradley et al., 2001; Soares et al., 2015). Possible biological and sociocultural factors may account for the sex differences in the emotional responses to salient stimuli (e.g., Archer, 1996; Ratner, 2000). The observed sex differences suggest that sex-based norms may be more effective when planning stimulus selection in emotion research, and as such, norms for female and male participants separately are provided in the supplementary materials.

Our second study demonstrated that, besides sex, interindividual variability in affective ratings of sentences was also explained by mood and empathy, but not by personality traits. In particular, higher empathy scores were related to more extreme valence scores of both negative and positive, but not of neutral, sentences. Higher empathy scores were also related to increases in the perceived arousal of negative sentences and in the intensity of positive sentences. On the other hand, positive affect correlated with increased arousal of positive sentences. These results suggest that mood and empathy modulate the emotional response to verbal signals, by enhancing the salience or relevance of those signals (see also Schirmer et al., 2008). For example, mood has been shown to act as a modulator of cognitive processes, including how semantic memory is used online (Pinheiro, del Re, Nestor, et al., 2013). Effects of mood congruency may explain the pattern of correlations observed here, that positive mood states were associated with increased perceived arousal of positive (congruent) sentences. This finding is in line with previous studies showing that positive mood states have facilitatory effects on semantic processing (e.g., Chwilla, Virgillito, & Vissers, 2011; Pinheiro, del Re, Nestor, et al., 2013).

Furthermore, we corroborated the observation that empathy mediates judgments of the emotional states of others in response to both positive and negative events, but not to nonemotional events. The association between empathy scores and the affective ratings of the MAS sentences suggests that comprehending emotional meaning from verbal cues may also involve simulating emotion-expressing actions, or in other words, sharing the joy or distress of others (Perry, Hendler, & Shamay-Tsoory, 2012), which fits well with embodied theories (e.g., Glenberg, Havas, Becker, & Rinck, 2005). In keeping with this finding, previous studies have demonstrated that, by mapping the bodily feelings of others onto one’s own internal bodily states, increased empathy scores were found to predict reactivity to both positive and negative emotional expressions (e.g., Jabbi, Swart, & Keysers, 2007; Perry et al., 2012). Furthermore, others have demonstrated that inferring the meaning conveyed by a speaker involves empathizing feelings about the emotional scenarios described in the sentences (e.g., Hagoort, 2013). The association observed in our study may indicate that variability in the responses to emotional verbal meaning reflects differences in empathy.

We also observed that empathic concern and fantasy—measuring the affective dimension of empathy—were higher in female than in male participants, consistent with the previous literature (e.g., Chakrabarti, Bullmore, & Baron-Cohen, 2006). In particular, we observed that different levels of empathic concern modulated the affective ratings of sentences expressing happiness, sadness, and disgust, but in male participants only. The correlational results raise the interesting possibility that the observed sex differences in affective ratings may be secondary to differences in mood and empathy. Moreover, they underscore the importance of taking into consideration the influence of interindividual differences in affective rating studies, and of checking whether the affective norms are well suited to the participant samples (see also Kanske & Kotz, 2012). In other words, researchers should include measures of individual differences in their analysis of responses to verbal emotional stimuli (e.g., as covariates), rather than relying exclusively on group-averaged responses to these stimuli (e.g., Hamann & Canli, 2004).

Although our study provided support for mood-congruency effects, there was little support for personality-congruency effects. This may suggest that the relationship between personality traits and emotional processing is mediated by mood states. Future studies should clarify which aspects of emotion processing are relatively invariant across individuals and which are modulated by individual differences (e.g., Hamann & Canli, 2004).

Conclusions

We expect the MAS norms to allow a more controlled and refined selection of sentences as a function of an intended manipulation (e.g., Valence × Arousal) in emotion research. Similar efforts in characterizing and controlling stimuli according to both dimensional and discrete theories of emotion will contribute to a more thorough understanding of how emotions influence human perception, cognition, and behavior. We believe these norms are a starting point for systematic research on verbal emotional processing with Portuguese-speaking participants, as well as for cross-language comparisons aiming to probe whether and how verbal emotion recognition is shaped by language and culture.

Furthermore, our findings indicate that, in their quest to reveal the neurofunctional underpinnings of emotion processing, researchers should not only consider the influence of sex in the responses to an emotional stimulus, but also of other individual differences that are somewhat modulated by the context, such as empathy and mood states. These two variables should be added to the discussion of how emotionally salient stimuli are perceived and assessed.