Group Accuracy in the Social and Non-Social Condition
Accuracy was determined by comparing the participant’s guess with the choice made by the original, eye-tracked participant. The proportion of correctly identified designs was determined for each participant by calculating how many designs they correctly identified out of the total number of trials in each condition. Shapiro-Wilk tests of normality revealed that the data was not normally distributed (ps < .05). However, previous research has indicated that ANOVAs are consistently robust to violations of normality both within and across groups (Blanca et al. 2017). Therefore, an ANOVA was judged to be the most appropriate test for this analysis. A 2 × 2 mixed model ANOVA, with a within-subject factor of condition (social/non-social) and a between-subjects factor of group (autistic/NT) on the proportion of correctly chosen designs revealed a main effect of condition (F(1,54) = 10.096, p =.002, ηρ2 =.158), as the proportion of correct responses was greater in the social condition (M = .68, SD = .13) compared to the non-social condition (M = .61, SD = .12 ). There was also a main effect of group (F(1,54) = 4.896, p = .031, ηρ2 = .083), as the proportion of correct responses was greater for the NT group (M = .68, SD = .05) compared to the autistic group (M = .62, SD = .12). The accuracy across all conditions (approximately 60–70%) is similar to that observed previously in NT participants by Foulsham and Lock (2015) and indicates that participants can efficiently interpret the abstract moving dot in most trials. There was no condition x group interaction (F(1,54) = .014, p = .906, ηρ2 < .001), demonstrating that there were no significant differences between the groups in how they responded to the manipulation of the perception of the stimulus as social or non-social. Paired samples t tests confirmed that both the autistic (t(28) = − 2.265, p = .031, d = .391) and NT (t(26) = -2.226, p = .035, d = .684) group were significantly more accurate at predicting which design was chosen in the social, compared to the non-social, condition (Fig. 2).
Ease of Completion
At the end of each part of the study participants were asked to rate on a Likert scale ‘How easy did you find it to guess which of the patterns was selected?’ The scale ranged from 1—‘Very Difficult’ to 7—‘Very Easy’. Planned comparisons investigated whether the NT and autistic groups differed in the ease with which they reported completing the task in the social or non-social conditions. Shapiro-Wilk tests of normality revealed that the data was not normally distributed (p < .05); therefore Mann-Whitney U tests were used. The results revealed that in the social condition autistic participants (Med = 2.00) reported finding it significantly more difficult to identify the chosen design then the NT participants (Med = 4.00; U =− 2.604, p = .009, d = .73). In contrast, in the non-social condition participants did not display this effect and there was no difference in reported difficulty between the autistic (Med = 3.00) and NT groups (Med = 3.00; U = − 1.263, p = .207, d = .34). Therefore, despite the fact that the only change made to the stimuli was the way in which they were described (eye movements vs computer program), the groups differed in their estimates of difficulty (Fig. 3). The NT participants reported finding the second block easier, whereas autistic participants reported finding the social condition more difficult (when, in fact, they were also better at accurately predicting which design had been selected in the eye-movement condition than in the computer program condition).
In order to test whether individual perceptions of difficulty were related to how well each person performed on the task, Spearman’s Rho correlations were conducted between the self-rating of task difficulty and actual task performance. The results revealed that ease of completion did not correlate with task accuracy in the social condition for either the NT (r = .024, p = .906) or autistic (r = .154, p = .425) group. Likewise, there was also no correlation between ease of completion and task accuracy in the non-social condition for either the NT (r = − .146, p = .467) or autistic (r = .194, p = .314) group. This suggests that individual differences in perceived task difficulty did not reflect the NT or autistic groups’ accuracy in either the social or non-social condition, further demonstrating that the lack of confidence of the autistic participants in their own performance on the social task was not warranted.
Social Responsiveness Scale Scores
To investigate whether task accuracy was related to the level of self-reported social difficulties, Pearson’s correlations were used to assess the relationship between task performance and SRS-2 t-scores. In order to assess the difference in performance between the first and second part of the study, a difference score was calculated by subtracting the accuracy scores for the non-social condition from the accuracy score for the social condition. SRS-2 t-scores were not significantly correlated with the difference score for either the neurotypical participants (r = .318, p = .106) or the autistic participants (r = .054, p = .782). Therefore, sensitivity to the social agency of the cue was not related to the level of social impairment shown by either autistic or neurotypical participants.
Strategy Use in the Social Agency Attribution and Non-Social Agency Attribution Condition
We aimed to test whether the autistic and NT groups used different strategies in order to identify the chosen design. Upon completion of each condition participants were asked to identify the strategies they used in order to select their chosen image. Participants were presented with a pre-determined list of potential strategies and asked to identify as many strategies as applied. The proportion of respondents who reported using each strategy in each condition is shown below (Table 2).
Table 2 Percentage of participants in the autistic and NT groups who used each strategy to identify the chosen design in the social and non-social condition. Text in brackets indicates alternate phrasing used in the social part of the study. Visual inspection of the strategy-use percentages indicated that each group used similar strategies, with the three most commonly used strategies in both the autistic and NT group being ‘where the dot moved/where the person looked’; ‘how long or how much the dot selected a pattern/they looked’; and ‘where the dot moved last/where they looked last’. This indicates that both participant groups interpreted the movements of the stimulus cue using comparable strategies, and that these strategies remained comparable across the groups in both the social and non-social conditions.
Order Effects
Next, we aimed to determine that the improvement between the first and second part of the study was not explained by order effects arising from participants having already completed the same task in the non-social condition prior to the social condition. We therefore compared performance between the first half of the trials and the second half of the trials for each part of the study. A Shapiro-Wilk test of normality confirmed that the data was not normally distributed (p < .05). Wilcoxon Signed Rank tests revealed that there was there was no significant difference in accuracy between the first and second half of the trials in either the non-social or social conditions for the autistic and neurotypical groups respectively (p > .05). For both groups overall, there was no significant difference in accuracy in the non-social condition between the first half of the trials (Med = 0.67) and the second half of the trials (Med = 0.67; Z = -.02, p = .982, d = .01). There was also no significant difference in accuracy in the social condition between the first half of the trials (Med = 0.67) and the second half of the trials (Med = 0.67; Z = .28, p = .778, d = .08).
This analysis suggests that the significant improvement in accuracy between the non-social and social conditions is not explained by order effects. However, to confirm that the effect found in this main experiment was due to the manipulation of the cue, we conducted a control experiment which investigated whether participants who only believed a cue to represent the eye movements of another participant were significantly more accurate at the task than a separate group of participants who only believed the cue to represent a computer program.