Introduction

Autistic individuals across cultures tend to show reduced interest in both dynamic and static social vs. non-social stimuli using eye-tracking measures1. We recently compared three such paradigms in children with autistic spectrum disorder (ASD) compared with typical developing (TD) controls2. The paradigm showing the greatest discriminatory accuracy was a dynamic task simultaneously contrasting social stimuli (individuals dancing) with geometric patterns and was also associated with symptom severity, particularly in relation to social dysfunction2,3. Two other paradigms, which discriminated ASD from TD children, were one comparing biological with non-biological motion using simultaneous point-light images and a static task where subjects view a toy/object alone (non-social) vs. the same toy-or object being played with by a person (social)2. However, the extent to which these same tasks are sensitive to autistic traits in healthy adults and influenced by potential therapeutic interventions is not established.

Our ability to recognize individuals and their facial expressions is of key importance for social behavior and interaction and ASD is associated with marked impairments in face-emotion recognition4, as well as social anxiety, depression5,6,7 and schizophrenia8. Previous eye-tracking studies have demonstrated that facial emotion recognition relies on a triangular visual search pattern, including eye-gaze towards the eyes as well as nose and mouth regions, although there is some degree of individual specificity9. In ASD and many other psychiatric disorders, the patterns of visual scanning of faces are abnormal with the most notable difference tending to be reduced time scanning the eyes and other salient regions, such as mouth and nose in comparison with other, less informative, face regions1,10,11,12. Thus, there is considerable interest in developing therapeutic approaches for normalizing eye-gaze patterns towards both social vs. non-social stimuli and salient facial cues in the context of ASD.

The hypothalamic neuropeptide oxytocin (OXT) has been reported to promote increased eye-gaze towards social stimuli13,14 and improve face-emotion recognition in healthy humans15 and may be a promising therapeutic candidate in the context of ASD16. While no studies to date have investigated the effects of intranasal OXT on the social vs. non-social task paradigms that are sensitive to ASD, a number of eye-tracking studies have reported effects on eye-gaze towards faces in both monkeys and humans, although results have been inconsistent. In humans, an initial study demonstrated increased gaze towards the eye region, but not the nose and mouth or cheeks and forehead, of statically presented neutral expression faces17. Other studies using dynamically presented faces have reported increased eye-gaze towards neutral expression faces with a marginal trend towards effects being maintained when faces changed to a happy expression and decreased when they changed to an angry one18 and increased eye-gaze across dynamic expressions of sadness, happiness, pain or fear19. Another study also reported that OXT increased the frequency of saccades from the mouth towards the eye region in angry, happy and fearful expression faces, with saccade frequency also being associated with amygdala responses20. In rhesus monkeys, OXT increased the number of fixations on the eyes relative to those on the mouth region when viewing faces of conspecifics, irrespective of whether neutral or threat expressions were displayed21. Similarly, in marmosets OXT also increased gaze time to the eyes relative to other face regions, and this effect could be prevented using an OXT receptor antagonist22. Importantly, OXT administration has also been reported to increase viewing of the eye region during viewing of neutral expression faces in autistic individuals required to make gender or gaze-direction decisions (fixation duration)23 and also during real-time social interactions in natural settings in both healthy control and autistic individuals (number of fixations per second)24.

However, some studies have failed to find significant effects of OXT on eye-gaze25,26,27. An early study with 47 male subjects found no effect of OXT on fixation time spent on either whole face or eye or mouth regions of static emotional faces (happy, sad, angry, and fearful) with different intensities although did find that OXT increased the recognition accuracy for fearful faces25. A subsequent study also found no effect of OXT in women on viewing the eyes relative to the rest of the face in static angry, fearful, happy, and neutral expression faces despite finding increased neural responses to either fearful or happy and angry faces, although data were from only 14 subjects27. Another more recent detailed study reported that while OXT marginally increased recognition speed, but not accuracy, for different static emotional faces (happy, angry, fear, sad, disgust and surprise) it did not increase the proportion of time spent viewing the eye region during the task26.

Thus, while a number of studies have reported that intranasal OXT increases the amount or proportion of time spent gazing at the eye region of faces others have failed to find effects and therefore further replication is important. Additionally, it is unclear whether there is an influence of specific emotional expressions. Furthermore, studies on human subjects to date have only included Caucasian populations and there is increasing evidence for different patterns of gaze to faces in Asian populations, with a greater focus on the nose region in Asian subjects compared to the eye, and lesser extent mouth regions in Caucasian ones28,29,30,31.

In the current study, we have therefore used a randomized within-subject placebo-controlled paradigm to investigate the effects of intranasal OXT (24IU) on patterns of eye-gaze in five different tasks involving social vs. non-social dynamic and static stimuli and also towards salient regions of emotional faces (eyes, nose, and mouth). In view of the potential therapeutic relevance of observed effects to autism we also investigated associations between OXT-effects and trait autism and empathy measured by routinely used questionnaires, the autism-spectrum quotient (AQ32—which measures all autistic symptoms), the Social Responsiveness Scale (SRS33—which focuses specifically on social impairments), and the Interpersonal Responsivity Index (IRI34—which focusses on trait empathy). We hypothesized firstly that higher levels of autistic and empathic traits would be associated with reduced viewing of social relative to non-social stimuli, and of the eyes relative to other face regions across emotional faces, and that secondly OXT treatment would increase interest in social stimuli and the eyes.

Materials and methods

Participants

Forty healthy male adults (mean age ± sem = 20.8 ± 0.38) were recruited via university advertisement. Only male subjects were recruited due to the main focus on potential relevance to autism and because the majority of previous studies have also only used male subjects. In a within-subject design, each participant attended the experiment twice and with an interval of around 2 weeks between each treatment/session (mean ± sem = 14.86 ± 0.16 days). Subjects randomly received either OXT (24 IU OXT in water, 0.9% sodium chloride and glycerol supplied by the Sichuan Meike Pharmaceutical Company, China) or placebo (PLC containing all ingredients except for the neuropeptide and supplied by the same company) intranasal treatment and the order of receiving OXT or PLC was counterbalanced. Subject number was determined by an a priori power analysis (using G-power with 1 group, 4 measurements, α = 0.05 and a 0.5 correlation between repeated measures), which showed that for a within-subject design 36 subjects would achieve >80% power for a medium effect size both with ANOVAs (F = 0.25, partial eta squared—0.06) and post-hoc pairwise comparisons (Cohen’s d = 0.5). We therefore decided to recruit 40 subjects to compensate for potential subjects dropping out and data collection issues. Subjects were required to abstain from caffeine, alcohol, nicotine, or other psychoactive substances in the 24 h before each experiment, and in initial interviews self-reported absence of current or previous psychiatric illness, drug, or alcohol abuse. All participants had normal or corrected to normal vision. The study had full approval from the local ethics committee of the University of Electronic Science and Technology of China and procedures were in accordance with the latest revision of the declaration of Helsinki. The study was pre-registered at clinical trials.gov (Trial registration ID: NCT03293511; URL: https://clinicaltrials.gov/ct2/show/NCT03293511). In the study, subjects actually completed seven different tasks but data from the final two tasks (involving directed attention and empathy) will be reported separately. All subjects signed written informed consent and paid for participation (100RMB).

Experimental procedure

In a double-blind placebo-controlled within-subject design, subjects self-administered the nasal sprays (24 IU, three puffs per nostril, each containing four IU of OXT or PLC) following a standardized protocol35. One experimenter was responsible for assigning subjects to receive OXT or PLC first using a computer-generated randomization and was not involved in conducting the experiment. All nasal spray bottles were covered by black tape so that neither the second experimenter responsible for supervising intranasal administration and running the tasks nor the subjects could recognize their contents. At the first visit before treatment administration, participants initially completed a set of Chinese version questionnaires to evaluate mood and personality traits both as a control for possible confounders due to any treatment group differences (State Trait Anxiety Inventory—STAI36, Childhood trauma questionnaire—CTQ37,38). The Positive and Negative Affect Schedule (PANAS39) was completed three times before treatment administration (t1) and immediately before (t2) and after the tasks (t3) in order to assess possible treatment effects on mood. For the assessment of associations with experimental findings in the two groups in relation to autistic traits and empathy Chinese versions of the Autism-Spectrum Quotient (AQ32), Social Responsivity Scale (SRS33) and Interpersonal Responsivity Index (IRI34) were used.

Full task details are provided in the Supplementary Section. In brief, task 1 involved differential attention to dynamic social (dancing individuals) and non-social (geometric patterns) images modified from Pierce et al.3 and similar to our previous experiment comparing ASD and TD children2 (see Supplementary Fig. S1). Task 2 was a static face-emotion (FE) processing task where participants viewed 24 emotional faces, including happy, angry, fear, and neutral expressions from the same three males and three females (in color and with a 400 × 500 pixel resolution and black background—see Supplementary Fig. S1). The faces used were from the Taiwanese facial expression image database (TFEID) (http://bml.ym.edu.tw/tfeid/index.php). A novel task (Task 3) involved subjects viewing pairs of static facial images of human and emoticon faces (HEF) with the same neutral, happy or angry expression, and the other an emoticon (HEF see Supplementary Fig. S1). The expectation here was that individuals with higher trait autism would show a preference for the emoticon version of the emotional face. For task 4, we adapted our previous paradigm (Kou et al.)2 for static visual attention (SVA). Pairs of static images were displayed, one showing a picture of an adult with a happy face playing with a toy/object while looking outwards towards the subject (i.e., sharing their enjoyment) and the other only including the same toy/object (see Supplementary Fig. S1). For Task 5, visual preference for biological motion (BM) was investigated using point-light displaying identical sized animate (walking human or cat) and inanimate (randomly moving point-light) videos2,40 (see Supplementary Fig. S1). A flow chart of the whole-experiment procedure is provided in the supplementary (see Supplementary Fig. S2) and a CONSORT flow chart (see Supplementary Fig. S3).

Eye-tracking data collection

Eye movement data were recorded using an eye tracker (Tobii TX300, Tobii, Danderyd, Sweden), which employs infrared (IR) light‐emitting diodes and IR camera to measure corneal reflections and calculate eye‐gaze direction. All gaze data were recorded at 300 Hz sampling rate with a gaze accuracy of 0.4°. Recording and stimulus presentation were conducted using Tobii Pro Studio, E-prime 2.0 software, and E-Prime Extensions for Tobii (Psychology Software Tools, Pittsburgh, PA). All raw data with at least 80% gaze weight were analyzed using Tobii studio. The primary measure collected in the current study was total fixation duration, although additionally the number of fixations and duration of individual fixations were measured in order to help interpret, which were contributing to any observed significant changes in total gaze time. See supplementary for full details of each task.

Statistical analyses

All statistical analyses were performed using SPSS 22 (SPSS Inc., Chicago, IL, USA). In tasks 1, 3 and 4, two-way repeated ANOVAs were performed with two within-subject factors (treatment—OXT, PLC) and areas of interest (AOI -two regions), and total fixation duration as the dependent variable. For the FE task (Task 2) a three-way ANOVA was performed with treatment, AOIs (eyes, nose and mouth, other parts of the face) and face-emotion (angry, happy, fear, and neutral) as factors, and the percentage of total time spent viewing each region compared to the time viewing the screen as the dependent variable. For the eye region, we used an AOI only including the left and right eye, and not the bridge of the nose in line with observed scanning patterns by subjects and with several other eye-tracking experiments comparing Chinese and Caucasian subjects28,31 (AOIs and typical viewing patterns are shown in Supplementary Fig S4). We used percentages for the relative amount of time spent viewing each region compared to the whole screen as the dependent variable similar to previous studies investigating effects of OXT. In task 5, the difference in total fixation duration for the social stimulus (walking human) minus the total fixation duration to the scrambled sequence was calculated and subtracted from that for the non-social stimulus (i.e., cat–scrambled sequence). The overall difference score for PLC was then compared with that for OXT using paired t-tests. Where significant main or interaction effects were observed in the different tasks for total fixation duration, additional analyses were performed with the total number of fixations (fixation count), or percentages of fixations on difference face regions (Task 2), and average individual fixation duration (fixation duration) as dependent variables to determine which were contributing to the observed changes in overall fixation duration. The results of these secondary analyses are given in the supplementary and show that for the majority of tasks the effects of OXT on total or percentage fixation durations were due to altered fixation counts. Bonferroni correction was applied to all post-hoc tests. Associations between total or percentage fixation durations and autistic and empathy traits (AQ, SRS, and IRI scores) were performed using Pearson correlation and permutation analysis (bootstrap = 10,000 using the method described by Baguely41 and Wilcox42 for non-independent samples) used to assess possible treatment group differences. To correct for multiple comparisons (three questionnaires), the threshold p-value was adjusted to p < 0.0167 for the correlation analyses (e.g., p = 0.05/3 = 0.0167).

Results

Demographic and questionnaire data and effects of treatment on mood and overall gaze

We performed independent t-tests to show that there were no significant differences in terms of mood and personality traits between subjects with the two different treatment orders (see Supplementary Table. S1). A two-way repeated ANOVA with 2 treatment × 3 time points for PANAS scores revealed no significant main or interaction treatment effects on positive or negative mood (all ps > 0.665).

For all five tasks, there were no significant difference in overall gaze time towards the screen between the treatment groups or tasks (on average, subjects were viewing the screen 97–98% of the time—see Supplementary Table S2). There was also no significant difference in time spent viewing the actual stimuli across treatments (Task 1: t (1,39) = 0.771, p = 0.446; Task 2: t(1,35) = 1.758, p = 0.088; Task 3: t(1,35) = 1.227, p = 0.228; Task 4: t(1,35) = 1.183, p = 0.245; Task 5: t(1,32) = –0.542, p = 0.592). Thus, OXT did not affect the total time spent viewing the screen or the stimuli in each task.

Effects of oxytocin in the dynamic social attention (DSA) task (Task 1)

A two-way repeated ANOVA with treatment and type (social vs. non-social AOIs) as within-subject factors and total fixation duration as the dependent variable revealed no main effect of treatment (F(1, 39) = 0.594, p = 0.446, partial ƞ2 = 0.015), but a main effect of type (F(1, 39) = 39.258, p < 0.001, partial ƞ2 = 0.502) and a treatment x type interaction (F(1, 39) = 8.665, p = 0.005, partial ƞ2 = 0.182). A post-hoc Bonferonni-corrected analysis revealed that OXT treatment significantly increased the time spent looking at the social (person dancing) (t(1, 39) = 2.923, p = 0.006, Cohen’s d = 0.470) and decreased that for the non-social (dynamic geometric) stimuli (t(1, 39) = –2.801, p = 0.008, Cohen’s d = 0.442) (see Fig. 1a).

Fig. 1: The effects of oxytocin (OXT) on eye-gaze towards dynamic social and geometric stimuli in the dynamic social attention (DSA) task—Task 1.
figure 1

a The effects OXT on mean total fixation duration for each type of stimulus in subjects following either OXT or placebo (PLC) treatment. Error bars represent SEM. **p < 0.01, *p < 0.05, OXT vs. PLC and Social vs. Geometric. b, c Correlation (Pearson) between scores on two autistic traits questionnaires and total fixation duration. AQ autism-spectrum quotient, SRS Social Responsiveness Scale.

Relationship between autistic traits and eye-gaze in Task 1

For both OXT and PLC treatments AQ and SRS scores were negatively correlated with total fixation duration for the dynamic social stimuli (person dancing) (AQ: OXT: r = – 0.471, p = 0.007; PLC: r = –0.346, p = 0.029 but does not pass correction; SRS: OXT: r = – 0.432, p = 0.005; PLC: r = –0.244, p = 0.129), although the treatment difference was not significant (AQ: p = 0.585; SRS: p = 0.179). They were also positively correlated with total fixation duration for the dynamic non-social stimuli (geometric patterns) (AQ: OXT: r = 0.461, p = 0.003; PLC: r = 0.397, p = 0.011; SRS: OXT: r = 0.461, p = 0.003; PLC: r = 0.254, p = 0.113), although again the treatment difference was not significant (AQ: p = 0.666; SRS: p = 0.184) (see Fig. 1b, c). There were no significant correlations observed between total fixation duration and IRI scores (all rs < 0.158). Details of all correlations are given in Supplementary Table S3.

Effects of oxytocin in the face-emotion processing (FE) task (Task 2)

Four subjects were excluded from analysis of the FE task due to technical problems with data collection, resulting in n = 36 subjects. A three-way ANOVA analysis was performed with treatment (OXT vs. PLC), face region (eyes, nose, mouth, other parts of the face—see Fig. 2) and face-emotion (angry, fear, happy, neutral) as factors. There was no main effect of face-emotion (F(3, 105) = 0.597, p = 0.592) but a significant main effect of face region (F(1, 35) = 22.287, p < 0.001), due to the mouth region being viewed less than the other three regions (ps < 0.001), and an emotion x region interaction (F(3, 105) = 10.095, p < 0.001, partial ƞ2 = 0.224). Post-hoc Bonferroni corrected tests showed that the eyes were viewed less for happy than angry and neutral faces (ps < 0.012), the nose less for angry than fearful or happy faces (ps < 0.032). The mouth was viewed more for happy faces than all other emotions (ps < 0.001) and for fear more than angry and neutral faces (ps < 0.017), and the rest of the face was viewed less for happy than for angry and fearful faces (ps < 0.020. There was no significant main effect of treatment (F(1, 35) = 0.099, p = 0.755) or treatment x emotion (F(3, 105) = 1.763, p = 0.182) or treatment x region (F(1, 35) = 1.868, p = 0.157) interactions, but there was a significant treatment x region x emotion interaction (F(3, 105) = 2.154, p = 0.046, partial ƞ2 = 0.058). Post-hoc Bonferonni-corrected tests revealed that OXT increased the percentage of total fixation duration for the eyes of fearful faces (t(1, 35) = 3.567, p = 0.001, Cohen’s d = 0.601) and decreased it for the nose region (t(1, 35) = –2.487, p = 0.042, Cohen’s d = 0.415), but not for the mouth (p = 0.130) or other parts of the face (p = 0.799) (see Fig. 2 and Supplementary Fig S4 for examples of heat maps). Analysis of the proportion of fixation counts displayed produced similar findings (see supplementary).

Fig. 2: The effects of oxytocin (OXT) treatment on eye-gaze towards emotional faces - Task 2 (FE).
figure 2

Percentage of time viewing the eyes (a), nose (b), mouth (c) regions, and the rest of face (d) during the static face-emotion (angry, fear, happy, and neutral expressions) processing task. Mean percentage total fixation durations on different regions are relative to time spent viewing the screen during the presentation of faces. Error bars represent SEM. ***p<0.001, **p<0.01, *p<0.05, OXT vs. PLC.

Relationship between autistic traits and eye-gaze in face-emotion task (Task 2)

Pearson correlation analyses were used to determine if autistic (AQ and SRS) or empathic (IRI) traits were significantly associated with the proportion of time viewing the different face regions across emotions or for specific emotions. For autistic traits, there was a significant positive correlation under OXT, but not PLC, for AQ scores for the nose region of happy expression faces (OXT—r = 0.420, p = 0.011; PLC—r = 0.037, p = 0. 829), while for SRS scores were positively correlated with viewing the nose region of neutral faces (OXT—r = 0.414, p = 0.012; PLC—r = –0.181, p = 0.290). In all cases, the correlation difference between OXT and PLC was significant (all ps < 0.006). Thus, higher autistic traits tended to be associated with a greater percentage of time viewing the nose region in happy and neutral expression faces under OXT. For trait empathy (IRI scores), there was a positive correlation with the percentage total fixation duration for the eyes of happy expression faces (OXT—r = 0.490, p = 0.002; PLC—r = 0.117, p = 0.495) following OXT but not PLC administration. There was a similar association for neutral (OXT—r = 0.364, p = 0.029; PLC—r = 0.051, p = 0.770) and angry faces (OXT—r = 0.331, p = 0.049; PLC—r = 0.038, p = 0.827), which did not pass correction. For the eyes of happy expression faces the correlation difference between OXT and PLC for IRI scores was significant (p = 0.007) and marginal for neutral (p = 0.05), and angry (p = 0.070) ones. Thus, OXT tended to increase the percentage of total fixation duration for the eyes of happy and to a lesser extent neutral and angry expression faces in individuals with higher empathy scores.

Effects of oxytocin on gaze towards human and emoticon faces (HEF—Task 3)

Four subjects were excluded from analysis of the HEF task due to technical problems with data collection resulting in n = 36 subjects. Two-way ANOVA analysis revealed a significant effects of stimulus category (F(1, 35) = 106.178, p < 0.001, partial ƞ2 = 0.752) but not treatment (F(1, 35) = 2.295, p = 0.139, partial ƞ2 = 0.062) on total fixation duration due to subjects spending a longer time looking at human faces across treatment conditions. There was also a significant x social category (F(1, 35) = 4.10, p = 0.05, partial ƞ2 = 0.105) interaction. Post-hoc Bonferonni-corrected tests revealed that OXT significantly increased total fixation duration on the human face compared to PLC treatment (t(1, 35) = 2.235, p = 0.032, Cohen’s d = 0.379) (see Fig. 3a).

Fig. 3: The effects of oxytocin (OXT) treatment on eye-gaze directed towards emotional human or emoticon faces—Task 3.
figure 3

a Mean total fixation durations for each type of stimulus in subjects following either OXT or placebo (PLC) treatment. Error bars represent SEM. **p < 0.01, *p < 0.05, OXT vs. PLC and Human faces vs. Emoticons. b, c Correlation (Pearson) between scores on autistic traits questionnaires and total fixation duration. AQ autism-spectrum quotient, SRS Social Responsiveness Scale.

Relationship between autistic traits and eye-gaze in human and emoticon face task (Task 3)

The SRS score was significantly negatively associated with total fixation duration on the human face under PLC, but not OXT (OXT: r = –0.123, p = 0.474; PLC: r = –0.402, p = 0.015), but the treatment difference was not significant (p = 0.1). There were no significant correlations with AQ or IRI (all ps > 0.058, see Supplementary Table S3 and Fig. 3b, c).

Effects of oxytocin in the static visual attention (SVA) task (Task 4)

Four subjects were excluded from analysis of this SVA task due to technical problems with data collection resulting in n = 36 subjects. A two-way ANOVA with treatment (OXT vs. PLC) and stimulus category (social—human with play object vs. non-social—play object alone) as factors investigated the effect of OXT on total fixation duration. There was no significant main effect of treatment (F(1, 35) = 1.399, p = 0.245, partial ƞ2 = 0.038) but a main effect of category (F(1, 35) = 129.102, p < 0.001, partial ƞ2 = 0.787) due to subjects showing a greater total fixation duration on the social category (person playing with toy/object) across treatments. There was also a significant treatment x category interaction (F(1, 35) = 8.870, p = 0.005, partial ƞ2 = 0.202) and post-hoc Bonferonni-corrected tests revealed that OXT increased the total fixation duration on the social stimulus relative to PLC (t(1, 35) = 2.951, p = 0.006, Cohen’s d = 0.499) (see Fig. 4a).

Fig. 4: The effects of oxytocin (OXT) treatment on eye-gaze directed towards human with toy/object or toy/object during the static visual attention task—Task 4.
figure 4

a The effects OXT on mean total fixation duration for each type of stimulus in subjects following either OXT or placebo (PLC) treatment. Error bars represent SEM. **p < 0.01, *p < 0.05, OXT vs. PLC or human with toy/object or toy/object alone. b, c Correlation between scores on two autistic traits questionnaires and total fixation duration. AQ autism-spectrum quotient, SRS Social Responsiveness Scale.

Relationship between autistic traits and eye-gaze in the static visual attention task (Task 4)

Pearson correlation analyses revealed that the SRS score was negatively associated with total fixation duration on the social stimulus (human with toy/object) in the PLC but not the OXT condition (PLC: r = –0.407, p = 0.014; OXT: r = –0.163, p = 0.341) although the difference between the correlations was not significant (p = 0.1). No significant correlations with AQ or IRI were observed (all ps > 0.151) (see Supplementary Table S3 and Fig. 4b, c).

Effects of oxytocin on gaze towards social vs. non-social biological motion (BM) (Task 5)

Four subjects were excluded from analysis of this BM task due to technical problems with data collection and three subjects due to presentation problems in their second session. Thus, data from n = 33 subjects were analyzed. Overall subjects showed greater total fixation duration to the scrambled stimuli compared to the cat or human ones (see Fig. 5a). To determine treatment effects on social vs. non-social biological motion we used difference score for total fixation duration as the dependent variable (dnonsocial = cat – scrambled Cat vs. dsocial = human – scrambled human). A paired sample t-test revealed that OXT significantly decreased the difference between the total fixation duration on social vs. non-social biological motion (t(1, 32) = 2.232, p = 0.033, Cohen’s d = 0.394) (see Fig. 5b). Analysis of fixation counts and individual fixation durations did not reveal which contributed to OXT-effects on total fixation duration in the BM task (see Supplementary Results).

Fig. 5: The effects of oxytocin (OXT) treatment on eye-gaze directed towards social (human walking) or non-social (cat walking) stimuli during the Biological motion task.
figure 5

a The effects OXT on total fixation duration for each type of stimulus in subjects following either OXT or placebo (PLC) treatment. Error bars represent SEM. **p < 0.01, *p < 0.05 for cat vs. scrambled or human vs. scrambled. b The effects of OXT on the difference in total fixation duration for human minus scrambled (social) and cat minus scrambled (non-social). Error bars represent SEM. **p < 0.01, *p < 0.05, OXT vs. PLC. c Correlation (Pearson) between scores on the autistic spectrum quotient (AQ) and differences in total fixation duration.

Relationship between autistic traits and eye towards social vs. non-social biological motion (Task 5)

Pearson correlation analyses revealed no significant correlations between questionnaires and total fixation durations during the biological motion task after correction for multiple comparisons (all ps > 0.020) (see Supplementary Table S3 and Fig. 5c).

Discussion

The current study employed an eye-tracking approach in healthy adult male Chinese subjects to determine whether intranasal OXT increased gaze towards either dynamic or static social vs. non-social stimuli in four different tasks sensitive to autistic traits, and additionally towards the eye region of emotional faces. Our results demonstrate that OXT treatment increased the amount of time spent viewing social stimuli across all four different tasks. Viewing time in all four tasks showed negative associations with autistic traits (either AQ or SRS scores) in both PLC and OXT treatment conditions, suggesting that OXT was promoting increased interest in social stimuli equivalently across trait autism scores. For face emotions, OXT only increased the percentage of time viewing the eyes and decreased that for the nose for fearful expression faces. Thus, the effects of OXT on increasing time spent viewing the eyes appears to be highly emotion specific in this context. Viewing time for the eyes was also positively associated with trait empathy (IRI) for angry, happy, and neutral expression faces following OXT administration.

The dynamic social vs. geometric pattern paradigm (Task 1) has been shown to be one of the most robust for identifying children with ASD, as well as the severity of their social symptoms in both Caucasian3,43 and Chinese populations2,44. In the current study, we have confirmed this negative association between interest in the social stimuli and trait autism (AQ and SRS scores) in healthy adult subjects. For the other three tasks (tasks 3, 4, and 5) total fixation duration on the social stimuli were also significantly negatively associated with either AQ or SRS scores, supporting their utility as being sensitive to autistic symptoms. There was, however, no association within subjects between autism scores and the differences in total fixation duration between PLC and OXT treatments, suggesting that overall effects of OXT were similar across the range of autistic symptom magnitudes.

In the biological motion task, subjects tended to pay more attention towards the scrambled light point displays rather than either those showing a human or a cat, which is the opposite of the pattern usually reported in young children2,45,46. Possibly this reflects an age effect with greater interest exhibited by adults trying to interpret the scrambled images and, therefore, paying more attention to them. However, in the healthy adults we did find a negative correlation with AQ scores when we used difference scores between the total fixation duration for human/cat vs. scrambled stimuli, whereas in ASD children there were no associations with symptom severity.

Importantly, in the context of the primary objective of the current study, intranasal OXT significantly increased the total amount of time spent viewing the social stimuli in all four paradigms (both dynamic and static) with its effect appearing to be similar across subjects independent of their trait autism scores. A secondary analysis of fixation counts and individual fixation durations indicated that the effects of OXT result primarily from an increased number of fixations towards the social stimuli, although for the human vs. emoticon faces and biological motion tasks, changes in both numbers of fixations and individual fixation durations may have contributed.

For face expression stimuli, OXT only increased the proportion of time spent looking at the eyes of static fearful expression faces and correspondingly decreased that for viewing the nose region. This contrasts with some previous studies using static presentations reporting that OXT either increased the proportion of time viewing the eyes of neutral expression faces17 or did not influence it at all for neutral or emotional faces in males25,26 or females27, although the latter study included very few subjects. These discrepant findings may be due to several factors. Unlike most previous studies, we only measured viewing time for the eyes themselves and did not additionally include parts of the face between or above them. This reflected the typical gaze patterns of our subjects where the main focus was on the eyes themselves rather than regions between or above them. Indeed, the average proportion of time viewing the eyes in our study was approximately half of that reported in another study26, although this may reflect cultural differences since our Chinese subjects viewed the eyes and nose for similar amounts of time, whereas Caucasian subjects view the eyes more than all other regions28,29,30,31,47. Asian subjects appear to use a nose-centric scanning pattern during face recognition in contrast to Caucasians who use an eye-centric28,29,30,31. Another relevant contributory factor could be that the subjects in the current study viewed faces passively, whereas the previous study on male Caucasians reporting no effects of OXT used an active task paradigm where subjects were required to identify each different face-emotion26. Another study reporting no effect of OXT on eye-gaze also used task rather than passive viewing conditions where participants were required to recognize different intensity emotional faces as quickly possible25. Previous findings using dynamic presentations of face emotions on gaze towards the eyes have used a variety of different approaches but produced somewhat inconsistent results. One study found OXT significantly increased the time spent viewing the eye region19, while another only found a marginal effect of OXT on happy faces18, and one reported no effect of OXT at all25. For static faces, one study has reported OXT increased time spent viewing the eyes of neutral expression faces17, and another that it increased gaze shift upwards to the eye region across different emotions and related to brain amygdala activity20. However, two other studies have found no effect of OXT on time spent viewing the eyes26,27. Thus, findings have generally tended to be inconsistent, and at this stage it is still unclear whether OXT may have different effects on statically compared to dynamically presented face stimuli or during task-related compared to passive viewing conditions.

Our finding that OXT primarily influenced gaze towards the eyes and away from the nose of fearful faces resonates with a number of previous observations on its behavioral and neural effects. A meta-analysis of the effects of OXT on face-emotion recognition reported effects for fearful, angry, and happy faces with the effect size being much larger for fearful faces (0.591)15, and increased fear face recognition has also been reported in schizophrenia patients48. The eye region is of particular importance for the recognition of fearful faces and the amygdala appears to be a key region for controlling this49. The amygdala is one of the primary neural targets for functional effects of OXT and a number of studies have reported that OXT alters amygdala responses to both fearful faces across cultures27,50,51,52 and subliminal presentation of the eyes of fearful faces50. Another study has reported that the frequency of shifts in gaze (saccades) upwards from the mouth to the eyes of fearful faces is increased following OXT, and that amygdala responses to emotional faces are associated with such upward gaze shifts20. Interestingly, accuracy in recognizing face identity, gender and emotion is associated with such upward shifts of gaze from lower regions of the face towards the eyes53, Other studies have also reported that OXT can increase attentional orientation54 and self-reported empathy in response to fearful faces19. However, it should be noted that in the latter study self-reported empathy was negatively associated with time spent looking at the eyes19. One study reporting positive effects of OXT on the recognition accuracy failed to find any association with proportion of time spent viewing the eyes25 while another reporting marginal effects of OXT on reducing recognition time across face emotions also failed to find any association with proportion of time spent viewing the eye region26. Thus, the relationship between the effects of OXT on viewing the eyes and recognition of, and behavioral responses to, face emotions remains unclear.

While intranasal OXT consistently decreases amygdala responses to threatening faces in males it has the opposite effect in females51,55,56, and an early study reported that increased amygdala responses to fear in a small number of female subjects were not influenced by gaze towards the eyes27. Similarly, another study has found that increased amygdala responses to emotional scenes are not associated with patterns of eye-gaze in women51. Thus, while OXT may increase interest in threatening faces/scenes in males it may not do so in females. Future larger scale studies are therefore needed to investigate the possibility of sex differences in effects of OXT on patterns of eye-gaze towards emotional faces.

We did not find evidence for a significant association between trait autism scores and time spent viewing the eyes, even though individuals with ASD do tend to show this10,57. A previous study has reported a similar lack of association between trait autism and time spent viewing the eyes in healthy subjects58. However, a meta-analysis of the degree of impaired recognition of emotional faces in ASD has shown that fear faces have the largest effect size59. There was also a positive correlation between autistic traits and viewing the nose region in both happy and neutral expression faces under OXT, indicating that it may influence Chinese subjects with higher autistic traits to look more at the nose region of positive valence faces. Following OXT but not PLC administration we did observe a positive association with empathy (IRI) scores and the proportion of time spent viewing the eyes of happy expression faces and also to a lesser extent neutral and angry ones. Thus, in healthy subjects, time spent viewing the eyes may be more strongly associated with empathic rather than autistic traits following OXT administration.

Overall, our findings from the four social vs. non-social tasks are in line with the hypothesis that OXT promotes greater attention generally towards social stimuli. However, the findings from the face-emotion task showing that it biases attention towards the eyes relative to the nose of fearful faces, support the social salience hypothesis, which proposes that OXT functions primarily to enhance attention towards social cues that are of particular importance for an individual, with detection of threat being highly relevant for survival60.

There are several limitations in the current study. Firstly, in line with other studies investigating eye-gaze effects of OXT, only male subjects19,20,52,61,62 were included given the focus on relevance to autism. As discussed above, it is possible that there are sex differences in the effects of OXT on both eye-gaze and amygdala responses in the context of fear faces and a number of studies have reported other sex-dependent effects of OXT (see refs. 13,48,60,63). Future studies will need to recruit both male and female subjects to address this. Secondly, we used a fixed rather than randomized order of task presentation, although importantly we found significant effects of OXT across all five tasks.

In summary, our findings show for the first time that intranasal OXT enhances visual preference for dynamic and social vs. non-social stimuli in four different tasks sensitive to autistic traits. Additionally, they demonstrate that OXT promotes gaze towards the eye region and away from the nose region of static fearful, but not other face expressions. Overall, our findings therefore provide further support for the potential of OXT in therapeutic interventions for autism through shifting attention more towards social stimuli. While the effects of OXT on increased gaze towards the eyes were only significant for fearful expressions, recognition of fearful faces is the most impaired in ASD59. This might still suggest OXT could be beneficial therapeutically by helping to improve interpretation of social threat cues, although some caution may be needed since ASD is often co-morbid with anxiety disorders64. Further studies on clinical populations are required to better assess if OXT more generally promotes gaze towards the eyes and improves emotion recognition.