Introduction

Recognizing other people’s emotions is crucial for successful social interactions and for establishing intimate relationships throughout the lifespan1,2. This ability emerges early on: within the first half-year of life, infants can respond to basic emotions from non-verbal cues. Yet infants’ reading of emotions is still rudimentary3, and it continues to be refined throughout childhood4,5 and even adolescence e.g.6,7,8,9,10,11. Moreover, during late childhood and adolescence, the ability to identify emotions from non-verbal behavior (i.e. facial expression, body posture or tone of voice in speech) is related to academic achievement12, peer-rated popularity and quality of relationships with adults13,14, while being independent of IQ. Alterations of emotion processing in this age range have also been linked to a variety of developmental disorders such as autism15, attention deficit and hyperactivity disorder16, social anxiety17, psychopathy18, or conduct disorder19. It is thus of pivotal relevance to characterize the normal developmental trajectory of emotion perception during childhood and adolescence.

Most of the existing research on this topic, however, has focused on the perception of static facial expressions, presumably due to the assumption that faces are the most universal and reliable carriers of emotional signals. Some discrepancy exists between studies, but most show substantial improvement in facial expression decoding skills until at least 10 years of age, with a number of studies indicating that, in more subtle detection tasks, 14- or 15-year-old teenagers still do not perform as well as adults4,7,9. This line of research also points towards a small advantage of girls, across ages, in identifying nonverbal emotional cues rev. in8,10,20, with possibly an increase in the size of this advantage from childhood to early adulthood21.

In contrast, although the importance of voice as a medium of affective communication has been documented throughout history rev. in22, few studies have explored the development of emotion recognition in the auditory modality. This is surprising given the importance of voice in parent-child or peer-to-peer interactions, especially those taking place from a distance. Data indicate that infants can discriminate affect from tone of voice or from composite face-voice stimuli earlier than from faces alone23,24,25, leading researchers to suggest that in infancy voice is the primary channel for understanding others’ emotions. This is consistent with the earlier maturation of the auditory as compared to the visual system. Studies focusing on childhood, relying on different methodologies, have shown that children as young as four years old can label the emotions expressed by unfamiliar adults from their tone of voice, and have revealed substantial development between 4 and 11 years of age for the perception of basic emotions from the prosody of sentences5,26,27,28,29,30,31,32. In this age range, the literature suggests that vocal expressions are recognized as well as, or with more difficulty than, facial expressions5,27,33,34,35. Moreover, especially in pre-adolescent girls, the ability to process affective vocal expressions predicts social competence better than facial expression recognition scores14. Given the extensive literature documenting a protracted development of face processing into adolescence, one can expect that full maturation of voice processing would also occur quite late. Yet, it is unclear whether emotional voice processing is adult-like at the end of childhood or whether maturation still occurs during adolescence: more research is needed with adolescents to provide a complete picture of the development of emotion perception in the auditory modality (see meta-analysis in21). It is indeed increasingly recognized that important socio-emotional development occurs during adolescence36 in relation to changes in perceptual abilities37 and brain organization38. In particular, brain regions involved in voice and prosody processing, in the superior temporal and in the prefrontal cortex39,40,41, undergo structural and functional changes well into adolescence38,42,43.

Moreover, research so far has focused on how children process prosody in speech, which is confounded by the development of linguistic abilities. The processing of semantic content could have different interference effects at different ages, further complicating the interpretation of developmental findings44,45. Only a handful of studies have used auditory stimuli other than sentences to circumvent this issue; they have yielded partly discrepant results, and only one has considered adolescents, and then not beyond age 1346. Matsumoto and Kishimoto47 asked 50 children aged 4 to 9 years to select which amongst four emotions was expressed by a speaker naming individual letters or numbers with intonations depicting happiness, surprise, sadness or anger. They observed a substantial increase in correct recognition with age, with only minor differences across emotion categories. More recently, Allgood and Heaton48 used affect bursts, that is, short non-verbal vocal expressions such as laughter, weeping or screams49,50. They tested 228 children aged between 5 and 10 years and observed that 5-year-olds were significantly worse than 10-year-olds when categorizing these stimuli as happy, sad or fearful. In neither of these studies did the authors indicate whether performance differed from that of adults. However, as 10-year-olds in both studies produced only 80% correct responses, one might expect that there is still improvement until adulthood. In contrast, Sauter et al.29, using the same material in addition to six other emotions (Anger, Disgust, Surprise, Contentment, Relief, Achievement), reported no significant age effect in a sample of 48 children of similar age (5–10 years), except for the identification of surprise, again with no comparison with adults’ data. It should be noted, though, that even their younger participants performed at a surprisingly high level (overall 78% correct responses compared to 83% for the older children), leaving little room for improvement. In the same participants, however, recognition and categorization of emotion from inflected spoken words improved linearly for most emotions and was poorer than recognition from affect bursts in all age groups. Chronaki et al.27 used non-word interjections pronounced with three different emotions (happiness, anger and sadness) and manipulated with acoustic morphing to convey different emotional intensities. They tested 98 children sorted into three groups (3–5, 6–9 and 10–11 years old). They observed an improvement in the ability to discriminate these emotions between 3 and 11 years of age, with 11-year-olds exhibiting significantly poorer performance than an adult control group (73% accuracy vs 83% for the adults). In a subsequent study, they used meaningless sentences in different languages in a similar design and observed that early adolescents (age 11–13, n = 32) were not better than children (age 8–10, n = 25) at discriminating emotions, while they performed more poorly than adults (age 19–35, n = 22)46. Amongst these studies, Allgood and Heaton48 and Chronaki et al.27 observed a gender effect, consisting of better performance by females, albeit with apparently no interaction with age.

In summary, research on the development of vocal emotion perception using non-linguistic material is limited and provides fragmentary or inconsistent results regarding (i) the developmental trajectory, (ii) whether adult-level performance is reached by the end of childhood or whether this ability still changes during adolescence, a critical period for social adjustment, and (iii) whether there is any significant gender difference.

Regarding the latter question, it must be noted that the gender effect in another domain, namely recognition of facial expressions, is small in children20,51, and might thus go undetected in individual smaller-scale studies. Moreover, in adults, females are faster and better at categorizing emotions from faces10,52 but also from voices49,53. The question of sex differences in the development of emotion perception in the auditory domain is thus worth further investigation.

Another factor that makes comparing studies difficult is the heterogeneity of testing procedures51. For example, Sauter and colleagues29 presented the audio stimuli over loudspeakers and asked participants to select the corresponding emotion by choosing, on a printed sheet, a label illustrated with a photo depicting the facial expression. Responses were logged by the experimenter. Matsumoto and Kishimoto47 used the same procedure. Allgood and Heaton48 tested participants in groups in the classroom, playing the sounds on loudspeakers and asking children to circle, on a scoring sheet, the cartoon face corresponding to the emotion they thought was displayed. Lastly, Chronaki et al.27,46 used a fully computerized method, presenting the stimuli over headphones and asking participants to select amongst keyboard keys with emotional word labels printed on them (with the help of the experimenter for younger children). In the literature on facial emotion categorization, the oldest studies consist mostly of paper-and-pencil tests while more recent studies mostly use computerized tasks. Only one account, to our knowledge, shows that the testing procedure did not impact developmental findings7. The question remains open with regard to affective vocalization processing.

The goal of the present study was to characterize the maturation of basic emotion identification from non-verbal vocalizations between childhood and adulthood. More specifically, we aimed to confirm improvement in this ability during childhood and to extend the investigation into adolescence. We asked 225 participants aged 5–17 and 30 adults to recognize four discrete emotions (anger, happiness, sadness and fear) from short affect bursts in a forced-choice task. We expected to reproduce the findings of Matsumoto and Kishimoto47, Chronaki et al.27 and Allgood and Heaton48 showing that recognition of emotion from non-linguistic vocalizations improves throughout childhood. Further, we hypothesized that this improvement continues during adolescence. This would be in accordance with functionalist views of adolescent development putting forward an adaptive refinement of social skills during this period of life54 and with the accruing data showing protracted maturation of the brain circuits involved in the perception of social and emotional signals rev. in36. In addition, in line with what has been described for faces3,8 and for prosody in children27,29,34,45, we expected different trajectories for the different emotions: happiness is likely to be recognized with adult-level accuracy at a younger age and sadness at a later age. Lastly, using a large enough sample, we wanted to test whether the female advantage reported by Allgood and Heaton48 could be reproduced.

Results

Descriptive statistics

The average performance in the sample of children and adolescents was well above chance level (25%), with an average of 81.7% correct responses (median 85%, standard deviation 11.8%; skewness = −1.87, kurtosis = 9.73). The adults displayed an average score of 86.7% (median 87.5%, standard deviation 9.3%; skewness = −0.42, kurtosis = 1.93). The difference between the two groups was statistically significant (Welch t(253) = 2.67, CI of difference = [1.33, 8.67], p = 0.011).
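For readers who wish to reproduce this kind of group comparison, the sketch below shows how a Welch two-sample t-test could be run in R. It is an illustration only, not the authors' code, and it assumes a hypothetical per-participant data frame `subj` with a `group` column ("child_adol" or "adult") and a `pct_correct` column.

```r
# Minimal sketch (not the authors' code). Assumes a hypothetical data frame
# `subj` with one row per participant, a `group` column ("child_adol"/"adult")
# and a `pct_correct` column (percentage of correct responses).
kids   <- subj$pct_correct[subj$group == "child_adol"]
adults <- subj$pct_correct[subj$group == "adult"]

# t.test() applies the Welch correction for unequal variances by default
t.test(adults, kids)
```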

Effect of testing procedure

In order to pool all participants into a single analysis, we first aimed to verify that there was no effect of testing procedure nor any interaction with other factors. We first estimated a mixed model in the children-adolescents sample, including the continuous factor age and the categorical factors sex (male/female) and testing procedure (computer/paper), with a random intercept for subjects. There was no effect of testing procedure (Z-Wald = 0.2334, p = 0.81) nor any interaction with the other factors (with age: Z = −0.179, p = 0.85; with sex: Z = −0.19, p = 0.85). Removing this factor from the model did not alter the fit (Chi-square(4) = 0.75, p = 0.9447). In addition, when we modeled separately the data from the sample tested on computer and the sample tested on paper, with age and sex as factors, we observed similar parameter estimates in the two analyses and in the full model. Thus, the conclusions of our study hold whether we consider the full sample or only subsamples of participants tested with one or the other method. Therefore, for simplicity, we now present analyses of the full sample without the factor “testing procedure”.
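The model comparison described above could be implemented along the following lines with lme4. This is a hedged sketch under assumed column names (`correct`, `age_c`, `sex`, `procedure`, `subject` in a trial-level data frame `dat`), not the authors' exact code.

```r
library(lme4)

# Full model: testing procedure plus its interactions with age and sex,
# random intercept per subject (column names are assumptions)
m_full <- glmer(correct ~ age_c * procedure + sex * procedure + (1 | subject),
                data = dat, family = binomial)

# Reduced model without the testing-procedure terms
m_red <- glmer(correct ~ age_c + sex + (1 | subject),
               data = dat, family = binomial)

summary(m_full)       # Wald Z statistics for procedure and its interactions
anova(m_red, m_full)  # likelihood-ratio chi-square: does procedure change the fit?
```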

Effect of age and sex

The effect of age was highly significant (Z = 5.52, p = 3.4 × 10−8). Adding a quadratic term (age2) improved the model fit (Chi-square(1) = 5.14, p < 0.023), indicating a non-linear developmental trajectory with faster improvement in childhood, followed by slower changes and a plateau in late adolescence. A model with a cubic term did not converge. Based on the values predicted by the model at different ages (converted from logit to percent correct responses), the adults’ mean level of performance is reached between 14 and 15 years of age.
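A sketch of how the quadratic age term and the predicted performance across the age range could be examined is given below; it again relies on assumed column names (`age`, `age_c`, `sex`, `correct`, `subject`) rather than the authors' actual script.

```r
library(lme4)

# Compare linear and quadratic age models (sketch with assumed column names)
m_lin  <- glmer(correct ~ age_c + sex + (1 | subject),
                data = dat, family = binomial)
m_quad <- glmer(correct ~ age_c + I(age_c^2) + sex + (1 | subject),
                data = dat, family = binomial)
anova(m_lin, m_quad)   # chi-square test for the quadratic term

# Population-level predicted percent correct across the age range
# (random effects ignored via re.form = NA)
newd <- expand.grid(age = seq(5, 17, by = 0.5), sex = levels(dat$sex))
newd$age_c <- newd$age - mean(dat$age)
newd$p_correct <- 100 * predict(m_quad, newdata = newd,
                                type = "response", re.form = NA)
```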

The effect of sex was also significant in the full model (Z = −2.017, p = 0.043). This was explained by females having significantly higher scores than males (83.2 ± 6.2% vs 80.3 ± 6.7%; post-hoc t(223) = 3.27, CI = [1.1, 4.5], p = 0.0012). There was no interaction between sex and age (comparison of models with and without the interaction term: Chi-square(1) = 0.021, p = 0.88), indicating similar age-related trends in boys and girls. Examining the model predictions separately for boys and girls showed that the adults’ level of performance (mean for female adults: 88.6 ± 9.9% correct; mean for male adults: 85 ± 8.7%) is reached between 14 and 15 years of age in both sexes (Fig. 1).

Figure 1

Proportion of correct responses as a function of age for boys and girls. Each dot represents the performance of one participant. The graph is centered on the mean age of the developmental sample (11.8 years). The lines represent the predicted performance as a function of age for males and females separately, from the model Response ~ age + age2 + 1|Subject (plotted with the function sjp.glmer; Lüdecke D (2017). sjPlot: Data Visualization for Statistics in Social Science. R package version 2.4.0, https://CRAN.R-project.org/package=sjPlot). Shaded areas represent confidence intervals. Boxplots represent the median and 75th percentile of the adults’ data; the black lines represent the mean performance of adults.

Effect of emotion

Including the factor Emotion in the mixed-model logistic regression showed a highly significant effect of emotion (Z = 12.3, p < 10−16), in addition to the effects of age (Z = 6.6, p < 10−4), age-squared (Z = −2.36, p = 0.018) and sex (Z = −2.025, p = 0.043). There was no interaction between emotion and sex (Z = 0.014, p = 0.98). The interaction between emotion and the linear effect of age was not significant (comparison of models with and without the interaction term: Chi-square(1) = 2.41, p = 0.12). The interaction between emotion and the quadratic term was significant (Chi-square(1) = 6.73, p = 0.00094). To explore this interaction, we fitted the mixed-model logistic regression for each emotion separately (Fig. 2 and Table 1). This revealed significant age-related effects for all emotions, with the quadratic term being significant only for sadness.
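The per-emotion follow-up described above could be sketched as below, testing the age-by-emotion interaction and then fitting the same (simplified) age model within each emotion; the column `emotion` and the data frame `dat` are assumptions, not the authors' code.

```r
library(lme4)

# Interaction between emotion and the (quadratic) age trajectory (simplified sketch)
m_int   <- glmer(correct ~ (age_c + I(age_c^2)) * emotion + sex + (1 | subject),
                 data = dat, family = binomial)
m_noint <- glmer(correct ~ age_c + I(age_c^2) + emotion + sex + (1 | subject),
                 data = dat, family = binomial)
anova(m_noint, m_int)   # likelihood-ratio test for the age x emotion terms

# Follow-up: fit the age model for each emotion separately
per_emotion <- lapply(split(dat, dat$emotion), function(d)
  glmer(correct ~ age_c + I(age_c^2) + (1 | subject),
        data = d, family = binomial))
lapply(per_emotion, summary)
```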

Table 1 Results from the mixed model logistic regression analyses for each emotion separately.
Figure 2

Logistic regression fitted for the four emotions separately. From left to right: Anger, Happiness, Fear and Sadness.

Across the whole age range, happiness was the emotion recognized with the highest accuracy, followed by sadness and fear at around the same level, while anger was the least well recognized (Fig. 3 and Table 3). Improvement within the age span was smallest for happiness and greatest for anger and fear. To characterize this effect, we compared the four emotions pairwise in five age groups (children, early adolescents, mid-adolescents, late adolescents and adults; see Methods). The results are displayed in Table 2 and Fig. 3. Anger differed significantly from all the other emotions in all age groups except in adults, where it did not differ from fear. Happiness was significantly better recognized than all other emotions in early and mid-adolescents, but the difference between happiness and fear was not significant in adults, and the difference between happiness and sadness was not significant in adults and children. Performance for fear and sadness stimuli did not differ in the youngest age groups, but in late adolescents fear was better recognized than sadness, and the reverse was true in adults.
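As an illustration of the pairwise comparisons reported in Table 2, the sketch below runs Holm-corrected paired t-tests between emotions within one age group; it assumes a hypothetical per-subject accuracy table `acc` (columns `subject`, `group`, `emotion`, `pct_correct`) and is not the authors' code.

```r
# Pairwise comparisons between emotions within one age group (sketch).
# `acc` is an assumed per-subject table with columns subject, group,
# emotion and pct_correct.
g <- subset(acc, group == "late_adolescents")
g <- g[order(g$emotion, g$subject), ]   # align paired observations across emotions

pairwise.t.test(g$pct_correct, g$emotion,
                paired = TRUE, p.adjust.method = "holm")
```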

Figure 3

Proportion of correct responses (mean and SEM) for each emotion, shown separately for the five age groups (children: 5–8 years old; early adolescents: 9–11 years old; mid-adolescents: 12–14 years old; late adolescents: 15–17 years old; and adults).

Table 2 Comparisons of performance for pairs of emotions in the five age groups.

Next, we examined, for the incorrect trials, how each emotion was miscategorized as each of the others. Figure 4 and Table 3 show the confusion matrices for the four developmental age groups and the adults. We observed relatively similar patterns of confusion across development. Anger was miscategorized most often as fear by children aged 5–8, and as sadness by adolescents. Fearful vocalizations were also sometimes miscategorized as angry by younger children as well as adults, while this confusion was less apparent in adolescents. Instead, adolescents showed a tendency to categorize sad vocalizations as angry.
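The confusion matrices of Fig. 4 and Table 3 amount to cross-tabulating the portrayed and chosen emotions within each age group; a minimal sketch (with assumed columns `portrayed`, `response` and `group` in the trial-level data frame `dat`) is shown below.

```r
# Confusion matrix for one age group (sketch; column names are assumptions)
trials <- subset(dat, group == "children")
conf <- table(portrayed = trials$portrayed, chosen = trials$response)

conf                                           # raw counts, as in Fig. 4
round(100 * prop.table(conf, margin = 1), 1)   # row percentages, as in Table 3
```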

Figure 4

Confusion matrices for the five age groups. Each cell indicates the number of times each emotion was chosen by participants (columns) as a function of the actual portrayed emotion (rows). A: Anger; H: Happiness; F: Fear; S: Sadness.

Table 3 Percentage of responses in each of the four emotion categories for each portrayed emotion. The diagonal represents the percentage of correct recognition.

Discussion

We report for the first time the developmental trajectory of emotion perception from the voice during childhood and adolescence. In a sample of 225 participants spanning an age range from 5 to 17 years, we show that the ability to identify emotion from short affect bursts, in a forced-choice task, improves slowly but significantly with age, reaching the adults’ level around the age of 15. The developmental curve is better characterized by a non-linear function, with faster improvement during childhood followed by slower changes and a plateau during adolescence. Across ages, female participants recognized emotions slightly more accurately than male participants, with no interaction between the age and sex effects. Whether participants were tested on individual computers or in small groups with responses given on paper affected neither performance nor the observed developmental trajectory. We discuss these findings and their implications in relation to the existing literature on the maturation of non-verbal signal perception.

First, our findings significantly expand the literature on affect burst recognition during childhood. They corroborate, with a different population and material, the results published by Matsumoto and Kishimoto47, Chronaki et al.27 and Allgood and Heaton48, who reported an improvement in vocal affect recognition between 5 and 10 years of age. In contrast, Sauter et al.29 did not observe any developmental differences in this age range, except for the emotion surprise, when participants had to categorize short vocalizations such as screams or laughter. In this latter study, which had a much smaller sample size, the young participants displayed very high accuracy, leaving little room for improvement. It may be that the stimuli employed appeared exaggerated and were therefore recognized better by younger children; the same participants had more difficulty identifying emotion from inflected single-word (digit) speech and, in this case, a difference between younger and older participants was apparent. Together, these findings establish that maturation of affective voice processing takes place during childhood, although further investigation would be needed to characterize the acoustical parameters that facilitate emotion recognition by young children. It must be noted that the stimuli used in our study, taken from the Montreal Affective Voices database, have been validated for their ecological value, their reliability and for yielding intensity judgments similar to those obtained with emotional stimuli from standard databases of facial and bodily expressions55.

In addition, we show here that the improvement in vocal affect recognition continues beyond childhood into adolescence, and we are able to model this trajectory. Such protracted development is unlikely to be linked to age-related changes in the processing of the low-level acoustic cues that differ across emotions. Indeed, the ability to process frequency variations, or pitch, which differentiate these emotions49,56, has been reported to be adult-like in children aged about 5 years57. Nonetheless, late development has also been observed in other aspects of voice processing: Mann et al.58 reported that the ability to recognize familiar voices and to form new memories of voices improved until 14 years of age. Some studies investigating sentence prosody recognition also suggest an improvement during adolescence: pooling data from several studies using one of the most standardized measures of receptive nonverbal communication34 and its version using children’s voices as stimuli (DANVA2-CP) points towards a linear improvement in affective prosody identification between 4 and 19 years (Table 2 in59). This is in agreement with a recent study showing that 13–15-year-olds were outperformed by adults in the recognition of affective sentence prosody, even more so in the more difficult condition of decoding emotion from sentences spoken by children60. It is also consistent with a study using sentences made of non-words, i.e. mimicking the phonological and morphosyntactic properties of language but without distinguishable semantic information, showing that 11–13-year-olds performed much worse than adults at recognizing emotions from stimuli derived from their native language (English) as well as from foreign languages46. In this study, however, the 11–13-year-olds were not better than the 8–10-year-olds, suggesting no development during late childhood. The small sample size may have prevented the observation of small differences in performance between 10 and 13 years of age; indeed, our data show that the maturation curve is less steep than in early childhood. In addition, it is possible that being exposed to pseudo-sentences from different languages adds another level of complexity, which might affect pre- and early adolescents more, as they may automatically search for meaning and thereby be distracted from the task. Indeed, it has been shown that when other contextual information is present, young children, but also 13-year-olds, tend to rely less on prosody and more on semantic information than adults to infer the speaker’s emotion32. This might explain differences between studies, but also reflect the fact that vocal emotion discrimination might still be difficult at the beginning of adolescence. The choice of stimulus material and confounds related to different linguistic materials might also explain why some individual studies using sentences as stimuli have reported no change in prosody processing beyond late childhood4,5,28.

Our data using non-linguistic vocalizations clearly show that vocal affect categorization progresses throughout childhood and well into adolescence. This developmental timeline is similar to the one reported by many studies of affective facial4,7,8,9 or bodily61 expression recognition. Such changes in social perception abilities are in line with the social challenges and refinement that occur during this period of life: as social relationships become more complex, individuals need to detect and categorize subtle social cues more accurately. Improvement in reading social signals also coincides in time with structural and functional changes in brain areas involved in processing emotional signals, such as the amygdala62 or the ventral prefrontal cortex63,64,65,66, as well as in posterior cortical areas involved in social cognition, including voice processing38,42,43. Additional studies are needed to relate regional changes in brain activity and structure to behavioral changes in emotion processing. Nonetheless, the protracted development of social abilities, at the cerebral and behavioural levels, is in line with evolutionary theories proposing that the adaptive value of adolescence, as a distinct developmental stage, is to enable the maturation of refined social perception and social skills67,68.

The observed female advantage for emotion categorization echoes numerous studies in adults and children focusing on different channels of emotion communication, including faces7,52,53,69,70,71, body movements72,73, voice and prosody49,74,75,76. Although robust in meta-analyses20,21, this effect is small and not always apparent in individual studies. Its expression and amplitude may depend upon factors intrinsic to the stimuli or contexts77. In particular, Sauter et al.29 and Chronaki et al.27, who also used non-linguistic affective vocalisations, did not report any statistically significant sex difference in affective voice processing in children, in contrast to our results and those of Allgood and Heaton48. The larger interindividual variability in children, compared to adolescents and adults, may also have masked small differences. In addition, Chronaki et al.27 used only three emotions, happiness, sadness and anger; in our dataset, the sex effect was larger for fear recognition in the youngest participants, which contributed to the overall statistically significant effect. Interestingly, the larger gender difference for fear recognition, as compared to other emotions, mirrors what has been reported in developmental studies of facial affect recognition10. Crucially, we observed no interaction between the sex and age effects: that is, the difference between boys and girls did not change with age. This developmental stability might reflect some evolved differences between males and females present, at least partly, very early in ontogeny. This is in line with earlier meta-analyses of facial or multimodal affective expression recognition53 and thus compatible with integrative models suggesting that scaffolding and/or socialisation factors contribute to maintaining gender differences in non-verbal signal processing that appear during infancy20. Moreover, the observation of the same developmental trajectory in boys and girls indicates that the improvement in recognizing emotions from vocalizations is independent of pubertal and hormonal changes, which occur earlier in girls.

Not all basic emotions were identified with equal accuracy. In fact, age-related changes, although significant for all emotions, varied with emotion. Across age groups, happiness was the easiest emotion to recognize. This mirrors what has been described in adults49 and children27,47. It is also consistent with happiness being the most easily recognized emotion from facial expressions by adults and children alike6,10,78. This high score for happiness might be explained, in part, by the fact that it was the only positive emotion and thus could potentially be categorized using valence information alone. The high accuracy of sadness recognition contrasts with previous reports of vocal affect recognition showing that young children had difficulty recognizing sadness27,47 and reporting slow development in the decoding of this emotion. This discrepancy could be related to the nature of the stimuli: in our study, sadness stimuli consisted of relatively salient cries lasting about 2 seconds, which were also recognized with relatively high accuracy by adults49. In contrast, the stimuli in Chronaki’s study27 were shorter (700 ms), which may have made them more ambiguous and difficult to decipher. In accordance with this hypothesis, developmental studies of affective prosody decoding from sentences report high accuracy for sadness in children, pre-adolescents29,46 and adolescents60.

The two emotions that showed the greatest improvement with age were anger and fear, two negative-valence, threat-related emotions. Anger also proved to be the most difficult emotion to recognize, with the oldest adolescents still having lower scores than adults, often miscategorizing angry expressions as sad. This is consistent with some developmental studies of emotion recognition from vocal47 or facial expressions10,78. However, this finding contrasts with other studies in adults or younger children showing superior recognition for angry voices as compared to other emotions such as sadness or fear5,27,29,49. From a behavioural point of view, one could argue, in theory, that a reduced sensitivity to threat signals could promote social exploration and the expansion of interpersonal relationships and thus be beneficial in late childhood and adolescence. The poor performance of children in fear recognition and the steeper development for this emotion are striking. They are in line with a few developmental studies on the processing of affective prosody from sentences in children and adolescents5,46,60. In addition, meta-analyses indicate that the association between psychopathy and poor fear processing from prosody is stronger in children and adolescents compared to adults18, yet these analyses are based on very few studies and lack comparison data. Our results could serve as normative data to assess vocal emotion recognition from non-linguistic material in pathological populations and eventually identify vulnerability periods and/or prognostic markers, for example for psychopathy18 or schizophrenia79. Likewise, the present study could serve as a starting point to investigate how perceptual (e.g. hearing threshold, or sensitivity to individual acoustic cues), personal or behavioural (e.g. IQ, social anxiety or attention) characteristics affect vocal emotion recognition across development.

Lastly, we observed the same range of performance as well as the same developmental trajectory in the two groups of participants, namely those tested with sounds presented through individual headphones and responses given on a keyboard versus those tested with sounds presented over loudspeakers and responses given by circling labels on a pre-formatted paper sheet. Firstly, this indicates that children and adolescents were not influenced by the mode of sound presentation and response while expressing their judgments. Secondly, it shows that data from developmental studies conducted with different procedures, computerized or not, can be directly compared. The same conclusion was reached in a large-scale (n > 1500) study of facial expression recognition, where strikingly similar developmental trajectories were observed, for all emotions investigated, in sub-samples tested in school on a paper version, in the lab on a computerized version, or online7. The same conclusion was also apparent in a meta-analysis of developmental studies of facial affect recognition21. Thus, the choice of computerized or other conventional laboratory methods should not matter in the design of new developmental studies, thereby facilitating the collection of larger datasets.

In summary, our results demonstrate, for the first time, that the ability to identify basic emotions from non-linguistic, elementary affect bursts continues to mature from childhood until mid-adolescence. This is akin to what has been described for the recognition of emotion through other channels, such as faces and bodies. Insight into this protracted development has implications for characterizing adolescents’ interpersonal relationships and for shedding light on the development of more complex social competences, such as learning from others or social feedback processing80. Charting the normal developmental path of vocal emotion perception also has critical implications for enhancing our understanding of neurodevelopmental disorders, such as schizophrenia, psychopathy or autism, for which abnormal voice processing has been proposed as an underlying factor of social anomalies18,81.

Methods

Participants and procedure

Participants were recruited from several after-school clubs, a private school and a college in the greater Glasgow area (Scotland). Permission was obtained from the relevant authorities. Parents as well as children were informed of the purpose of the study and signed a consent form. The study was approved by the University of Glasgow College of Science and Engineering Ethics Committee and all experimenters had appropriate disclosure from the Scottish government to work with children. All research was performed in accordance with relevant guidelines from the Declaration of Helsinki, recommendations from the Economic and Social Research Council of the United Kingdom and local regulations, in particular with regard to data anonymization and storage.

We collected complete datasets from 225 participants (104 males) aged 5–17 (mean 11.87; sd 3.34). Based on the existing literature on vocal and facial emotion recognition, this sample is adequate to detect linear age-related changes as well as gender effects48,82 with small to medium effect sizes. One hundred and thirty-three participants were tested in small groups (three to ten participants), with the sounds presented through loudspeakers and participants reporting their responses on a preformatted sheet of paper with one line per trial. They had to circle or tick, as they preferred, the box corresponding to their response. The other 92 participants were tested on individual laptops, with the sounds presented through headphones and responses made through keyboard presses. In both cases, responses were made by selecting one of four boxes, labeled with the emotions happy, fearful, angry or sad and displayed as emoticons on a horizontal line. All participants were familiar with the matching between the emoticons (i.e. facial expression cartoons) and the individual emotions. Four different spatial orders were used and counterbalanced across participants to avoid response biases due to the location of the response boxes. All participants completed five practice trials before the experiment. For the youngest participants (<10 years old), the experimenters took extra care to ensure that the task was well understood and asked them to report a situation in which they would experience each of the emotions, to confirm that they knew the meaning of each label. The experimenter stayed close to the children in all cases, monitoring that they had no difficulty selecting the labels (with pencil or keypress) and that they moved smoothly from one trial to the next.

As a control group, we tested 30 adults (age 20–47, 16 males). They were recruited through our local subjects’ pool and tested on individual laptops.

Stimuli and design

Stimuli were taken from the Montreal Affective Voices database (MAV49). They consist of nonverbal affect bursts uttered by five male and five female speakers, selected out of 22 actors to yield maximum recognition agreement amongst young adult listeners. We thus had ten exemplars of each emotion (anger, fear, happiness and sadness), and each participant was randomly presented with five stimuli for each emotion. In a four-alternative forced-choice task, they were asked to select the emotion they thought the speaker was expressing.

Analysis

We assessed the effect of age on the proportion of correct labeling choices with mixed-model logistic regression83,84. We used the lme4 package (version 1.0–5) in R85. We entered each response as a binary variable (i.e. correct or incorrect) into the regression model with the random effect “subject” and the fixed effects “age” (centered around the sample mean), “sex”, and “testing procedure” (computer or paper), as well as the interaction terms. We tested both linear and non-linear effects of age by adding polynomial terms to the regression model. The significance of main effects and interactions is reported in terms of the Wald statistic. Fixed effects were further evaluated using chi-square tests on the log-likelihood values to compare models with and without the effect of interest. From the model yielding the best fit, we computed the predicted percent correct responses at each age by inverting the logit link using the “fitted” function in R.
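The paragraph above fully specifies the model; a minimal sketch of the corresponding lme4 call is given below. Column names (`correct`, `age_c`, `sex`, `procedure`, `subject` in a trial-level data frame `dat`) are our assumptions for illustration, not the authors' script.

```r
library(lme4)   # the paper used version 1.0-5; any recent version should behave similarly

# Trial-level mixed-model logistic regression: binary correctness, centered age,
# sex and testing procedure as fixed effects, random intercept per subject
m <- glmer(correct ~ age_c * sex * procedure + (1 | subject),
           data = dat, family = binomial)
summary(m)   # Wald Z statistics for the fixed effects

# Predicted probability of a correct response per trial, back-transformed
# from the logit scale (the "fitted" function mentioned above)
dat$p_hat <- fitted(m)
```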

For additional description of the data, we grouped the participants into four age groups, which we also compared to the adult group: children: 5–8 years old, n = 41 (25 females); early adolescents: 9–11 years old, n = 67 (28 females); mid-adolescents: 12–14 years old, n = 54 (30 females); late adolescents: 15–17 years old, n = 63 (38 females). There was no significant difference in gender balance between the groups (Chi-square = 5.8799, df = 3, p = 0.1176). For all comparisons between groups we used the Holm-Bonferroni method to account for multiple comparisons. We also computed confusion matrices in these age groups, reflecting the patterns of misinterpretation amongst the different emotions. That is, for each portrayed emotion we computed the number of times it was misclassified as each of the other emotions.
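A sketch of how the age groups and the gender-balance check could be constructed in R is shown below; it assumes a hypothetical per-participant data frame `subj` with a numeric `age` column and a factor `sex`, and is not the authors' code.

```r
# Build the four developmental age groups (sketch; `subj` is an assumed
# per-participant data frame with columns `age` and `sex`)
subj$group <- cut(subj$age,
                  breaks = c(5, 9, 12, 15, 18),
                  labels = c("children", "early_adol", "mid_adol", "late_adol"),
                  right = FALSE, include.lowest = TRUE)

table(subj$group, subj$sex)               # group sizes by sex
chisq.test(table(subj$group, subj$sex))   # gender balance across groups
```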

The datasets generated during the current study are available from the corresponding author upon request.