Introduction

Facial expressions of emotions are indispensable communicative signals that facilitate the establishment and maintenance of social relationships in real life1. Studies have shown that human beings prioritize attention to faces over other forms of visual input2. Moreover, emotional facial expressions, when compared with neutral expressions, accelerate attentional engagement and delay attentional disengagement3. Attention to emotional facial expressions supports the understanding of others’ emotional expressions and intentions in social interaction.

During social interaction, the observation of facial expressions automatically elicits emotional arousal and spontaneous facial mimicry4,5,6, as reflected by increased zygomaticus major (ZM) activity when viewing happy facial expressions and increased corrugator supercilii (CS) activity when viewing angry facial expressions. Specifically, the observation of dynamic emotional facial expressions elicits stronger subjective arousal experiences and more evident facial mimicry, compared with observation of static facial expressions7,8. Automatic mimicry may facilitate emotion perception or recognition via an embodied mechanism9,10. It also enhances interpersonal synchronization11, which further facilitates emotion contagion12,13,14, empathy15,16, and social interaction17,18.

Most psychological studies of social interaction have used highly controlled laboratory settings with prerecorded static photographs or video clips, which differ from naturalistic social behavior in aspects such as visual fidelity, social context, and the potential for interaction with the stimulus. Experts have called for a reconsideration of the ecological validity of these stimuli and of the design of psychological studies of non-verbal and verbal interpersonal communication, which may improve the interpretability and generalizability of such findings19,20. To the best of our knowledge, only one study has compared the effects of live interaction with those of prerecorded stimuli on emotional and motor responses to facial expressions. In that study, the researchers designed a liquid crystal (LC) shutter system that switched between transparent and opaque to control the timing of participants’ exposure to models who made eye contact or averted their gaze in real time21. Their findings indicated that differences in ZM activity while viewing a static smiling versus a neutral facial expression emerged only when the participants believed that they were being watched by the model22. However, the implications of these EMG results were ambiguous: ZM responses to smiles did not differ between the conditions with and without the belief of being watched; rather, the differences were driven by attenuated ZM responses to neutral expressions. It therefore could not be concluded that facial mimicry increased when participants believed they were being watched. Furthermore, the study used static (rather than dynamic) facial expressions as stimuli, although previous studies7,8 suggest that dynamic facial expressions would be more effective for investigating the effects of live interaction on subjective emotion perception and facial mimicry.

In the present study, we tested the effect of live interaction on subjective emotion and facial mimicry using dynamic facial expressions. We designed a video camera relay system that allowed participants to view either live or prerecorded dynamic facial expressions of models (Fig. 1). Variable mimicry responses and emotion contagion across genders have been widely reported23,24,25; thus, to avoid such confounds, we recruited only female participants and models. Following a 3-min conversation with the model using the video relay system, each participant passively viewed 60 prerecorded or live videos of positive (smiling) or negative (frowning) dynamic facial expressions while ZM and CS EMG activity was recorded to estimate the magnitude of spontaneous facial mimicry. The participants then completed 16 rating trials that assessed the dimensions of valence and arousal26 (Fig. 2). Considering that arousal, social attention, and pro-social behavior are enhanced in live social interaction27,28, we expected an interaction between presentation condition (i.e., live vs. video) and emotion (i.e., positive vs. negative), such that subjective emotion perception and spontaneous facial mimicry would be more pronounced under the live conditions.

Figure 1

The experimental facilities. The participant and model each faced a prompter, which consisted of a horizontally positioned display and an obliquely positioned transparent mirror. The participant could view the visual information on the display via the reflection in the mirror, while the participant’s image was video-recorded and live-relayed by a camera hidden behind the mirror. The participant’s image was then sent to the prompter of the model (the yellow route). The image of the model captured by the hidden camera was sent to the input switcher (the blue route) as one of the visual output options. The PC running the paradigm sent commands (the red route) through a serial port to the input switcher to determine the visual output for the participant to view (the purple route). During video trials, the PC sent the instructions and prerecorded video stimulus (red) to the input switcher, and the switcher sent only the input from the PC to the participant’s prompter (purple). During live trials, the switcher first relayed the instructions from the PC to the participant (red); the PC then sent an audio signal via Bluetooth earphones to instruct the model to produce the dynamic facial expression (the green route), while sending commands (red) to the input switcher to switch to the model’s live image input (blue) and to switch back after the stimulus duration (3 s).

Figure 2

The procedure of each trial. During the passive observation trials, the participant first saw the instruction “Video” or “Real Person,” followed by 3 s of a prerecorded or live dynamic facial expression, with a fixation cross shown for the duration of the jittered inter-stimulus interval. In the subsequent rating trials, the participants saw the instruction followed by the stimulus, as in the passive observation trials, and were asked to rate valence and arousal using the keyboard in front of them.

Results

Subjective ratings

A linear mixed effects (LME) model was used to analyze the subjective valence and arousal ratings (Fig. 3). Fixed effects included emotion (positive vs. negative), presentation condition (live vs. video), and their interaction. Random effects included by-subject random intercepts and by-subject random slopes for emotion and presentation condition to account for the within-subjects design; this random-effects structure was determined by means of model comparisons.

Figure 3

Subjective ratings. For each condition, the right half shows the scattered dots of participant-wise mean values. The group mean is shown with a filled dot (video condition) or triangle (live condition) in the middle, accompanied by error bars indicating the within-subject standard error. The box on the left half shows the median and the first and third quartiles of the distribution; the upper and lower whiskers extend from the hinge to the most extreme value no further than 1.5 × IQR from the hinge. Panel A: valence. Panel B: arousal. A significant interaction was observed between emotion (positive, negative) and presentation condition (video, live). Follow-up simple effect analysis revealed higher valence and arousal ratings under the positive-live condition than under the positive-video condition, but no differences between the negative conditions.

Valence ratings showed a significant main effect of emotion, with positive conditions rated higher than negative conditions, as well as a significant interaction between emotion and presentation condition (Table 1). Planned simple effect analysis showed higher valence ratings for the positive-live than for the positive-video condition (estimate = 0.243, SE = 0.119, df = 46, t = 2.052, two-tailed p = 0.0459), but no difference between the negative-live and negative-video conditions (estimate = −0.021, SE = 0.121, df = 47.2, t = −0.174).

Table 1 Statistical summary of valence ratings.

Arousal ratings likewise showed a significant main effect of emotion, with positive conditions rated as more arousing than negative ones; a trend toward a main effect of presentation condition, with live conditions rated as more arousing than video conditions; and a significant interaction between emotion and presentation condition (Table 2). Simple effect analysis showed higher arousal ratings for the positive-live than for the positive-video condition (estimate = 0.534, SE = 0.167, df = 33.5, t = 3.203, two-tailed p = 0.003), but no difference between the negative-live and negative-video conditions (estimate = 0.046, SE = 0.167, df = 33.8, t = 0.277).

Table 2 Statistical summary of arousal ratings.

Facial EMG

The log-transformed absolute values of ZM and CS activity were used to calculate the difference between the neutral phase (0–1 s after stimulus onset) and the maximal phase (2.5–3.5 s after stimulus onset) of the dynamic facial expression, which served as the dependent variable (Fig. 4). The same LME model as that used for the ratings was applied to both ZM and CS.
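As an illustration, the dependent variable can be derived as follows. This is a minimal R sketch, not the authors’ analysis code; it assumes a hypothetical long-format data frame emg_long with columns subject, trial, muscle, time_s (seconds relative to stimulus onset), and amp_log (log-transformed rectified EMG).

```r
library(dplyr)

emg_dv <- emg_long %>%                        # one row per sample per trial
  mutate(phase = case_when(
    time_s >= 0   & time_s < 1   ~ "neutral",  # 0-1 s: neutral phase
    time_s >= 2.5 & time_s < 3.5 ~ "maximal",  # 2.5-3.5 s: maximal phase
    TRUE                         ~ NA_character_
  )) %>%
  filter(!is.na(phase)) %>%
  group_by(subject, trial, muscle, phase) %>%
  summarise(mean_amp = mean(amp_log), .groups = "drop") %>%
  tidyr::pivot_wider(names_from = phase, values_from = mean_amp) %>%
  mutate(diff = maximal - neutral)             # dependent variable for the LME
```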

Figure 4

Facial electromyography (EMG) results. Please see the legend of Fig. 3 for component descriptions of the boxjitter plots. Panel A: zygomaticus major (ZM). Panel B: corrugator supercilii (CS). The EMG results showed a significant interaction between emotion (positive, negative) and presentation condition (video, live), which consisted of significant differences between the positive-live and positive-video conditions, but not between the negative conditions.

The ZM also showed a significant main effect of emotion, with greater muscle activation under the positive conditions than under the negative conditions, as well as an interaction between emotion and presentation condition (Table 3). Simple effect analysis showed greater muscle contraction in the positive-live than in the positive-video condition (estimate = 0.0225, SE = 0.0074, df = 67.7, t = 3.024, two-tailed p = 0.0035), but no difference between the negative-live and negative-video conditions (estimate = −0.0069, SE = 0.0074, df = 68.2, t = −0.929).

Table 3 Statistical summary of zygomaticus major reactions.

The CS showed a main effect of emotion, with the CS less active under the positive conditions than under the negative conditions, as well as an interaction between emotion and presentation condition (Table 4). Simple effect analysis showed less activity under the positive-live condition than under the positive-video condition (estimate = −0.024, SE = 0.0096, df = 51.3, t = −2.467, two-tailed p = 0.017), but no difference between the negative-live and negative-video conditions (estimate = −0.002, SE = 0.0096, df = 50.6, t = −0.212). In addition, the by-subject random slope for presentation condition was clearly positively correlated with the by-subject random slope for emotion (Table 4; 95% CI 0.455–0.474).

Table 4 Statistical summary of corrugator supercilii reactions.

Live and prerecorded stimuli validation

We asked a separate group of naïve participants to rate the valence and arousal of a subset of prerecorded and live video clips, without knowing whether a given clip showed a live performance. The LME analysis and model comparison showed a significant effect of emotion on valence and arousal ratings and a significant effect of presentation condition on valence ratings (live performances were rated as more negative), but the fixed effect of the interaction was no longer statistically significant (Supplementary Tables S1, S2, Fig. S1); model comparison favored the model without the fixed effect of the interaction (valence: Χ2 = 0.0426, two-tailed p = 0.836; arousal: Χ2 = 0.8648, two-tailed p = 0.352). A Bayesian paired t-test also showed moderate evidence for the null hypothesis of no difference between the subject-wise (positive-live minus negative-live) and (positive-prerecorded minus negative-prerecorded) differences (for valence and arousal ratings: BF10 = 0.211, BF01 = 4.73; see Supplementary Table S3 for statistical details and frequentist statistics). When judging whether a clip was prerecorded or a live performance, there was moderate evidence for the null hypothesis that participants’ accuracy did not differ from chance level (BF10 = 0.214, BF01 = 4.679; see Supplementary Table S4 for statistical details and frequentist statistics).

Discussion

In both the rating and EMG data, we observed a significant main effect of emotion in a pattern that was consistent with those of earlier studies7,8. In the rating data, we observed higher valence and arousal ratings for the positive conditions. In the EMG data, we observed enhanced ZM and reduced CS activity under the positive smiling condition. The consistent pattern confirmed the validity of our stimuli showing dynamic emotional facial expressions.

More importantly, we observed significant interactions between presentation condition and emotion in both the rating and EMG data. The interactions showed a consistent pattern: positive smiling expressions were rated as more arousing and more positive under live conditions, and the ZM contracted more and the CS relaxed more under the positive-live conditions than under the positive-video conditions. Thus, the effect of dynamic facial expressions on subjective ratings and spontaneous facial mimicry was more pronounced when the stimuli were live than when they were prerecorded. The validation rating study of randomly selected prerecorded and live-performance video clips showed that this effect is unlikely to be due to the models performing more enthusiastically during live trials.

Our findings corroborate those of Hietanen et al.22, who also reported greater ZM activation when participants viewed smiling, compared to neutral, facial expressions if they believed that they were being watched. However, that study presented only static, not dynamic, facial expressions, and only smiling faces were included in the stimuli. Furthermore, their EMG results were not conclusive, because the differences mainly reflected attenuated ZM activity when participants viewed neutral facial expressions while believing that they were being watched by the model; ZM responses to smiles did not vary with respect to participants’ beliefs about being watched. Therefore, to the best of our knowledge, the present study is the first to offer evidence that live social interaction enhances facial mimicry of dynamic facial expressions relative to prerecorded video presentations.

While our results clearly demonstrate the facilitative effect of live interaction on emotional and motor responses to dynamic facial expressions, the underlying factors remain unclear. Below, we discuss possible underlying factors from the metacognitive (e.g., belief/knowledge-induced) and stimulus-driven (e.g., detection of microexpressions) perspectives. We first consider the socially facilitative properties of live interaction. Social psychological theories have long sought to explain how the presence of others changes behavior; for example, the “Audience Effect” has been thoroughly reviewed by Hamilton and Lind29. In the context of social facilitation, Zajonc used drive theory to explain how the presence of conspecifics increases individuals’ arousal and influences task performance, depending on the nature of the task30,31. Bond attributed social facilitation to the performer’s active regulation of self-representation (i.e., public image) and suggested that the loss of public esteem from poor public performance causes embarrassment and exacerbates social impairment32. Tennie et al. further elaborated on the motivation for reputation self-management: social interaction depends largely on mutual trust to function effectively; a good reputation makes an individual more trustworthy and gives them an advantage in partner choice, and the possibility of engaging with an individual of good standing provides an incentive for others to cooperate33.

In the social top-down response modulation (STORM) theory, Wang and Hamilton specifically proposed that mimicry is a strategic intervention that aims to change the social world for self-advancement. Earlier studies have indeed demonstrated that mimicry has positive consequences for social interaction. Mimicry leads to increased liking and perceived closeness toward the mimicker, facilitating pro-social behavior34,35,36; it functions as a “social glue” that helps to establish social rapport37,38. Through associative learning or evolutionary hardwiring39, human beings display more pro-social behavior, including facial mimicry, when they know that they are being watched. The role of associative learning is supported by evidence that facial mimicry is sensitive to learned social context and knowledge40,41. For example, a low-power individual is likely to respond by smiling to any expression from a higher-power person42. Frith further highlighted the role of implicit metacognition in social interaction43, which enables us to adopt a “we-mode” through which we mentalize and automatically consider the knowledge and intentions of others. In the present study, we invited only models and participants of the same gender and similar age and allowed them to socialize harmoniously before the experiment to control for the effects of social context. It will be important to investigate the effects of social context further in future studies using the present live-interaction paradigm.

Although some theories of social facilitation have emphasized the importance of metacognitive or top-down modulation, evidence that spontaneous facial mimicry results from non-conscious perception of facial expressions has also been presented5,44. In real-life social interaction, circumstances often arise in which an individual strives to suppress or conceal their genuine emotion, resulting in microexpressions that last 1/25 to 1/5 of a second45,46,47. Research on microexpressions has focused on applications in clinical diagnosis, forensic investigation, and security systems48; it is important to investigate whether, in live interaction, the detection of microexpressions could also elicit facial mimicry and emotion contagion49,50. Such evidence would support an alternative, bottom-up component of the mechanisms of emotion contagion in social interactions. In the present study, we could not clearly differentiate between metacognitive (e.g., belief/knowledge-induced) and stimulus-driven (e.g., detection of microexpressions) components of facial mimicry and emotion contagion. However, our validation study showed that, without prior knowledge of whether a video clip showed a live performance, the interaction effect was no longer present. This suggests that the effects observed in the current study were more likely driven by metacognitive components of emotion contagion. Future studies with more realistic designs may yield further insights in this regard.

We found increased ZM activity in the positive-live condition compared with the positive-video condition, but no difference in CS activity between the negative-live and negative-video conditions. Several factors may explain this asymmetry. First, it is compatible with earlier findings on social interaction showing that frowning is not mimicked in the context of anger, because anger is not an emotion that facilitates affiliation between people17,40; this further supports the interpretation in terms of mimicry and social facilitation. Second, at the beginning of the experiment, each participant had 3 min of friendly interaction with the model through the video relay system. This affiliation-building atmosphere may have reduced the likelihood that participants would mimic negative facial expressions during the experiment. Third, the negative expression (frowning) performed by the models in both the video and live conditions was not particularly intense. This was reflected in the arousal ratings, which were significantly lower for the negative than for the positive conditions. The lack of intensity in the models’ performances may also have weakened negative emotion contagion.

Finally, although our sample size was insufficient to investigate individual differences, CS responses showed a positive correlation between the by-subject random slope for presentation condition and the by-subject random slope for emotion. This indicates that participants who responded more strongly to the emotion manipulation also responded more strongly to the manipulation of presentation condition. In future studies, it will be important to investigate whether individual differences in personality traits modulate the effect of live interaction on emotion contagion.

In conclusion, by using a live-relay prompter system to present prerecorded and live images, we showed that participants’ subjective perception and spontaneous facial mimicry of dynamic facial expressions were stronger under live conditions than under video conditions. Our paradigm offers a reasonable balance between real-life social experience and a well-controlled laboratory setting. Variations on this design could be used to investigate the effects of individual differences and of social cognitive variables such as gender, power imbalance, and social context, and to differentiate between metacognitive and stimulus-driven components of emotion contagion.

Methods

Participants

Twenty-three female adults (mean age ± SD = 22.48 ± 2.27 years, range 18–27 years) were recruited in the city of Kyoto. The required sample size was determined by an a priori power analysis using G*Power software ver. 3.1.9.251, based on a previous study that had tested the effects of dynamic versus static facial expressions on subjective emotional ratings. As an approximation of the present analyses, we assumed a two-step procedure that included estimating individual average values and conducting a within-subjects analysis of variance. An effect size f of 0.25 (medium), an α level of 0.05, and power (1 − β) of 0.80 were assumed. The power analysis indicated that more than 21 participants were needed. The study was approved by the Ethics Committee of the Unit for Advanced Studies of the Human Mind, Kyoto University, and was performed in accordance with the ethical standards of the committee. Written informed consent was obtained from all participants prior to their participation in the study. All participants consented to publication, because video recording was carried out in this study. All participants were rewarded monetarily.

Material

Two female models, both 20 years of age, were employed. Each model recorded at least 20 video clips of positive and negative dynamic facial expressions. Both models provided written informed consent for their identity-revealing images, which appear in Fig. 2, to be used in scientific publications. The positive facial expression consisted of smiling, and the negative expression consisted of frowning. The models wore an apron covering their clothes when recording the video clips and during the experiments. They were asked to use hair pins to keep their hairstyles consistent between the recorded video clips and the live stimuli. The models faced the camera frontally. Each clip lasted three seconds, comprising one second of neutral expression, one second of gradual dynamic change, and one second of sustained maximal emotional facial expression. Fifteen positive and fifteen negative clips were used in the passive viewing part of the experiment, two of each condition were used for the rating practice, and four of each condition were used for the actual ratings. No clips were repeated within a part.

Facilities

The model and the participant each faced a prompter, which consisted of a mirror reflecting a TV screen and a Canon VIXIA HF R800 camera concealed behind the mirror (Fig. 1). The model was able to view the live-relayed image of the participant at all times. The input to the participant’s prompter was controlled by an Imagenics SL-41C switcher, which received serial-port signals from the Precision T3500 PC (Windows 7 Professional) running the experimental paradigm in Presentation (Neurobehavioral Systems) software, and which switched between the visual input from the PC and the model’s camera. The models wore the same apron as in the prerecorded clips and wore a Pasonomi TWS-X9 Bluetooth earphone that received sound signals from the Presentation PC instructing them to produce the dynamic facial expressions. The original resolution of the video clips and live-relayed images from the Canon VIXIA HF R800 camera was 1920 × 1080. The Presentation software output the video clips at a resolution of 1280 × 960, cropping the flanks of the images; the screen that the participant faced was also set to 1280 × 960 resolution to ensure consistency between the prerecorded video and live-relayed images. The participants were fitted with electrodes to record EMG from the ZM and CS, which was transmitted to a BrainAmp ExG MR amplifier and saved with the BrainVision Recorder software.

Paradigm and procedure

The study design involved two crossed factors: presentation condition (video vs. live) and emotion (positive vs. negative), resulting in four conditions. Each participant interacted with only one of the models. Upon arrival, the participant was briefed with respect to informed consent and signed the consent form. Then, the model attached the EMG electrodes to the participant and sat down in front of her prompter. To demonstrate to the participant that the prompter could deliver a live relay of the model’s image and to allow the participant to become familiar with the model, the participant and the model engaged in 3 min of conversation about school, work, food, and the weather through the prompter system. The first part of the experiment consisted of passive viewing. The participant fixated on a cross in the center of the screen (jittered between 2000 and 3750 ms; mean inter-trial interval 2604 ms) until the instruction screen showed either “Video” or “Real Person” for one second. During the video trials, one of the prerecorded video clips followed immediately after the instruction. During the live trials, the model heard a signal through her earphones as the instruction screen was shown, and a change in pitch signaled the model to perform the corresponding dynamic facial expression. The screen showed the fixation cross again after the facial expression had been shown. The participant first performed eight practice trials, two under each condition. Afterwards, participants performed 15 passive viewing trials under each condition, for a total of 60 trials, with a break after 32 trials. For each participant, the sequence of conditions at the trial level was pseudo-randomized to ensure that there were no more than three sequential trials of the same emotion (positive or negative) or presentation condition (prerecorded or live), as sketched below. For the prerecorded positive and negative conditions, the presentation sequence of the 16 videos of the same condition was randomized. EMG data were collected during the passive viewing trials.
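The sequencing constraint can be illustrated with the following R sketch (the experiment itself was programmed in Presentation; the function and variable names here are hypothetical): candidate orders are resampled until no emotion and no presentation condition occurs more than three times in a row.

```r
# Longest run of identical consecutive values in a vector
max_run <- function(x) max(rle(as.character(x))$lengths)

make_sequence <- function() {
  # 2 emotions x 2 presentation conditions x 15 trials each = 60 trials
  trials <- expand.grid(emotion   = c("positive", "negative"),
                        condition = c("video", "live"))
  trials <- trials[rep(seq_len(nrow(trials)), each = 15), ]
  repeat {
    candidate <- trials[sample(nrow(trials)), ]          # random permutation
    if (max_run(candidate$emotion) <= 3 &&
        max_run(candidate$condition) <= 3)
      return(candidate)                                  # constraint satisfied
  }
}

trial_order <- make_sequence()
```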

The second part involved affective grid ratings26. The following instruction was provided to the participants: “When you see the images, please rate the emotion that you feel.” The participants were then instructed to rate the valence (pleasant vs. unpleasant) and arousal (low vs. high arousal) according to their feeling on Likert scales ranging from 1 to 9 using the keyboard in front of the prompter. Each trial began in a manner similar to the passive viewing component, and the dynamic facial expressions were followed by valence and then arousal ratings (Fig. 2). Participants first performed eight practice rating trials (two trials per condition) and then 16 test rating trials. Finally, the electrodes were removed, and the participant completed the Autism-Spectrum Quotient and Interpersonal Reactivity Index questionnaires, which were not analyzed in the current study. The participant was then monetarily rewarded and dismissed.

Live stimuli validation

Both models’ live performances in the passive viewing trials were video-recorded and visually inspected to ensure that dynamic facial expressions were performed correctly. One positive trial for two participants and one negative trial for two other participants were excluded because the dynamic facial expressions were not performed correctly or were performed with obvious delay, leaving 1376 trials of EMG data for analysis.

To ensure that the observed interaction between emotion and presentation condition was not due to enhanced emotional expression during live performances, an online rating study was designed. Thirty-two prerecorded video clips (8 per emotion condition—positive or negative, per model) and 32 randomly selected clips of live performances were presented to naïve participants. The following hypothetical context was provided: “Ms. A and B attended an audition for a play. The director wanted to see them perform smiling and frowning expressions. They were recorded making the expressions when the director was not in the room, and also when they performed live in front of the director.” For each stimulus, participants were asked to rate the valence and arousal using the same instruction as in the live-interaction experiment. They were then asked to judge whether they were watching a prerecorded video or a live performance. Data were collected from 25 participants (8 females, mean age ± SD = 24.68 ± 6.57, range 18–51 years).

EMG data preprocessing

Raw EMG data were preprocessed using the EEGLAB toolbox v.2019.0 in MATLAB. The data were screened for movement artifacts; then a notch filter was applied at 60 Hz and its multiples, a high-pass filter was applied at 20 Hz, and a low-pass filter was applied at 500 Hz. For each trial, the signal was detrended and the baseline was removed. The baseline was defined as the mean value from 3 s before stimulus onset to 1 s after stimulus onset, i.e., the end of the model’s neutral expression, immediately prior to the dynamic facial expression change. The baseline value was subtracted from all data points in the trial (sampled at 5000 Hz for 8 s, from 3 s before to 5 s after stimulus onset). For each signal, the absolute value plus one was natural-log-transformed to correct for the right skewness of the raw data distribution. This transformation is commonly applied to EMG data to reduce the impact of extreme values13,52,53,54,55. The log-transformed data were used in the main text to show results in a manner comparable with previous studies. Statistical analyses of the non-transformed data are provided in the supplementary information (Supplementary Text, Tables S5 and S6, and Fig. S2); they show similar patterns, although the assumption of residual normality in the LME models was less well satisfied.
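Schematically, the per-trial baseline removal and log transformation correspond to the following R sketch (the actual preprocessing was performed with EEGLAB in MATLAB; the vector x is a hypothetical stand-in for one trial’s filtered EMG samples):

```r
# x: numeric vector of filtered EMG samples for one trial, 5000 Hz,
# spanning -3 s to +5 s relative to stimulus onset (assumed input)
t_sec <- seq(-3, 5, length.out = length(x))     # time of each sample (s)

baseline <- mean(x[t_sec >= -3 & t_sec <= 1])   # mean from -3 s to +1 s
x_bc     <- x - baseline                        # subtract baseline from each sample
x_log    <- log(abs(x_bc) + 1)                  # natural log of (|value| + 1)
```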

Statistical analyses of live interaction data

LME models, model comparisons, and influence diagnostics were performed in R (v4.0.2) using the packages lme4 1.1-23, lmerTest 3.1-2, HLMdiag 0.3.1, and emmeans 1.4.6, with the BOBYQA optimizer. Dependent variables were the valence ratings, the arousal ratings, and the EMG differences (ZM and CS) between the neutral (0–1 s after stimulus onset) and maximal (2.5–3.5 s after stimulus onset) phases of the dynamic facial expression. For each LME model, emotion and presentation condition were entered as fixed effects, and subject was included as a random factor. Model complexity in terms of fixed and random effects was increased in a stepwise fashion; each step was compared with the less complicated model using F-tests in lmerTest::anova() until the more complex model was no longer significantly preferable to the simpler model. The optimal models were described by the formula Y ~ 1 + emotion * presentation_condition + (1 + emotion + presentation_condition | subject). As fixed effects, emotion and presentation condition were entered together with their interaction term. As random effects, by-subject random intercepts and by-subject random slopes for emotion and presentation condition were included.
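For illustration, the optimal model and one step of the comparison procedure could be written as follows. This is a sketch, not the authors’ script: it assumes a data frame dat with columns rating (or the EMG difference), emotion, presentation_condition, and subject, and maximum-likelihood fitting (REML = FALSE) for the model comparison, which the text does not specify.

```r
library(lme4)
library(lmerTest)

# Optimal model reported in the text
m_full <- lmer(
  rating ~ 1 + emotion * presentation_condition +
    (1 + emotion + presentation_condition | subject),
  data = dat, REML = FALSE,
  control = lmerControl(optimizer = "bobyqa")
)

# One step of the stepwise procedure: a simpler random-effects structure
m_simpler <- lmer(
  rating ~ 1 + emotion * presentation_condition + (1 | subject),
  data = dat, REML = FALSE,
  control = lmerControl(optimizer = "bobyqa")
)
anova(m_simpler, m_full)   # retain the complex model only if it fits significantly better

anova(m_full)              # tests of the fixed effects (Satterthwaite df via lmerTest)
```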

For model criticism56, upward residual and influence analysis was performed using the HLMdiag package57. At the trial level, trials were excluded if the absolute value of the standardized residual was larger than three or if the Cook’s distance was larger than 1.5 × the interquartile range above the third quartile (Q3 + 1.5 × IQR). After the model had been re-fitted to the reduced data, Cook’s distances were checked to exclude highly influential participants. In total, the following trials were excluded: 22 (5.98%) of the valence ratings, 35 (9.51%) of the arousal ratings, 195 (14%) of the CS data (including all data from participant 14), and 267 (19%) of the ZM data (including all data from participants 8 and 14). Participant 14 had very high CS activity in the negative conditions; participants 8 and 14 both had very high ZM activity in the positive conditions.
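A rough sketch of the trial-level exclusion rules is given below. The original analysis used HLMdiag; here, for brevity, generic residual and Cook’s distance accessors are used, which is an assumption and may differ in detail from the HLMdiag implementation.

```r
std_res <- residuals(m_full, scaled = TRUE)    # standardized (scaled) residuals
cd      <- cooks.distance(m_full)              # trial-level Cook's distances

cd_cut <- quantile(cd, 0.75) + 1.5 * IQR(cd)   # Q3 + 1.5 * IQR rule
keep   <- abs(std_res) <= 3 & cd <= cd_cut

m_refit <- update(m_full, data = dat[keep, ])  # re-fit the model on the reduced data
```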

Satterthwaite’s estimation of degrees of freedom is reported. Significant interactions were followed by planned simple effect analyses comparing the live and video conditions within each emotion category. Effect sizes (the semi-partial R-squared, i.e., variance explained, of the fixed effects) were estimated using r2glmm 0.1.2 with the Kenward–Roger approach58,59,60. Boxjitter plots were created with the packages afex 0.27-2, ggplot2 3.3.0, ggpol 0.0.6, and cowplot 1.0.0. Two-tailed p values are reported throughout the text.
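The simple-effect contrasts and effect-size estimates could be obtained as follows (a sketch using the emmeans and r2glmm packages named above; object names follow the earlier sketches):

```r
library(emmeans)
library(r2glmm)

# Live vs. video within each emotion category (planned simple effects)
emm <- emmeans(m_refit, ~ presentation_condition | emotion)
pairs(emm, adjust = "none")

# Semi-partial R-squared of the fixed effects, Kenward-Roger approach
r2beta(m_refit, method = "kr", partial = TRUE)
```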

Statistical analyses of validation rating data

To determine whether the fixed effect of the interaction was present in the validation data, LME models and model diagnostics for the valence and arousal ratings were fitted using the same formula, Y ~ 1 + emotion * presentation_condition + (1 + emotion + presentation_condition | subject), as described in the section “Statistical analyses of live interaction data.” Model comparisons were then carried out against the corresponding LME models without the fixed effect of the interaction between emotion and presentation condition, using F-tests in lmerTest::anova(). A Bayesian paired-sample t-test was also performed on the subject-wise values of (positive-live minus negative-live) versus (positive-prerecorded minus negative-prerecorded) using JASP 0.13.1. For the judgment of prerecorded versus live performances, the subject-wise accuracy rate was tested against the 0.5 chance level using a Bayesian one-sample t-test. The default Cauchy prior width of 0.707 was used.
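The Bayesian tests were run in JASP; an approximately equivalent analysis in R with the BayesFactor package is sketched below. The vector names diff_live, diff_prerecorded, and accuracy are hypothetical subject-wise summaries, and the Cauchy prior width of 0.707 matches the default reported above.

```r
library(BayesFactor)

# Paired test: (positive-live - negative-live) vs. (positive-prerecorded - negative-prerecorded)
bf_paired <- ttestBF(x = diff_live, y = diff_prerecorded, paired = TRUE, rscale = 0.707)
1 / extractBF(bf_paired)$bf     # BF01, evidence in favour of the null

# One-sample test of classification accuracy against the 0.5 chance level
bf_acc <- ttestBF(x = accuracy, mu = 0.5, rscale = 0.707)
```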