Age of avatar modulates the altercentric bias in a visual perspective-taking task: ERP and behavioral evidence

Ferguson, Heather J.; Brunsdon, Victoria E. A.; Bradford, Elisabeth E. F.

doi:10.3758/s13415-018-0641-1

Open access
Published: 21 September 2018

Volume 18, pages 1298–1319, (2018)

Cognitive, Affective, & Behavioral Neuroscience

Heather J. Ferguson ORCID: orcid.org/0000-0002-1575-4820¹,
Victoria E. A. Brunsdon¹ &
Elisabeth E. F. Bradford¹

Abstract

Despite being able to rapidly and accurately infer their own and other people’s visual perspectives, healthy adults experience difficulty ignoring the irrelevant perspective when the two perspectives are in conflict; they experience egocentric and altercentric interference. We examine for the first time how the age of an observed person (adult vs. child avatar) influences adults’ visual perspective-taking, particularly the degree to which they experience interference from their own or the other person’s perspective. Participants completed the avatar visual perspective-taking task, in which they verified the number of discs in a visual scene according to either their own or an on-screen avatar’s perspective (Experiments 1 and 2) or only from their own perspective (Experiment 3), where the two perspectives could be consistent or in conflict. Age of avatar was manipulated between (Experiment 1) or within (Experiments 2 and 3) participants, and interference was assessed using behavioral (Experiments 1–3) and ERP (Experiment 1) measures. Results revealed that altercentric interference was reduced or eliminated when a child avatar was present, suggesting that adults do not automatically compute a child avatar’s perspective. We attribute this pattern to either enhanced visual processing for own-age others or an inference of reduced mental awareness in younger children. The findings argue against a purely attentional basis for the altercentric effect, and instead support an account where both mentalising and directional processes modulate automatic visual perspective-taking, and perspective-taking effects are strongly influenced by experimental context.

Visual perspective-taking involves an assessment of what or how another person sees a visual stimulus, independent of what or how we see that same stimulus ourselves. These processes are therefore central to Theory of Mind (ToM), and the ability to ascribe mental states (e.g., knowledge, beliefs, intentions, etc.) to the self and others. In recent years, researchers have become increasingly interested in the individual differences that predict an observer’s ability to take another person’s perspective. This busy field of research has identified numerous characteristics that modulate success on a variety of ToM tasks, including the observer’s age (e.g., Phillips et al., 2011), working memory, and inhibitory control skills (e.g., Bradford, Jentzsch, & Gomez, 2015; Brown-Schmidt, 2009; Cane, Ferguson, & Apperly, 2017; German & Hehman, 2006; Lin et al., 2010), attentional processes (Rubio-Fernández & Geurts, 2016), social skills (Brunyé et al., 2012; Ferguson et al., 2015; Kessler & Wang, 2012; Nielsen et al., 2015), mood (Converse et al., 2008), and cultural background (Wu & Keysar, 2007). In contrast, very little research has considered how characteristics of the observed person might influence perspective-taking success. The current study addresses this issue by examining how the age of an observed person (adult vs. child avatar) influences adults’ visual perspective-taking, particularly the degree to which they experience interference from their own (i.e., egocentric) or the other person’s (i.e., altercentric) perspective when responding from the “other” or “self” perspective, respectively.

A popular paradigm that has been used to examine visual perspective-taking is the “avatar” task, in which participants have to verify the number of discs in a visual scene according to either their own or a central on-screen avatar’s perspective. Crucially, in some trials the two perspectives are inconsistent (i.e., each sees a different number of discs), while in others they are consistent. Samson, Apperly, Braithwaite, Andrews, and Bodley Scott (2010) found that healthy adults can rapidly and accurately compute other people’s visual perspectives, or respond according to their own broader viewpoint (which may include objects that are hidden from the avatar’s view). Nevertheless, participants’ responses were slower and less accurate for trials in which judging what the avatar could see required them to inhibit their own visual perspective, and when judging what they could see required them to inhibit the avatar’s visual perspective. Thus, participants experienced difficulty ignoring the irrelevant perspective (i.e., either what they saw or what the avatar saw) when the two perspectives differed; performance on the task was influenced by both egocentric and altercentric tendencies.

While this pattern has been replicated numerous times (e.g., Catmur et al., 2016; Conway et al., 2017; Ferguson, Apperly, & Cane, 2017; Nielsen et al., 2015; Qureshi, Apperly, & Samson, 2010; Santiesteban et al., 2014), there has been much debate in the literature regarding whether the altercentric effect genuinely reflects interference from the avatar’s perspective (i.e., automatic mentalising), or whether it is driven by domain-general attentional cues based on directional features of the avatar (i.e., sub-mentalising; Heyes, 2014; Santiesteban et al., 2014). To test these alternatives, researchers have compared effects when the central avatar is replaced by a non-social (directional) cue (e.g., an arrow, lamp, or wall; Samson et al., 2010, Experiment 3; Nielsen et al., 2015; Santiesteban et al., 2014; Schurz et al., 2015) or when the avatar’s view of the stimulus is restricted (e.g., by opaque goggles/barrier, or an “invisibility” telescope; Furlanetto et al., 2016; Cole et al., 2016; Conway et al., 2017). Results are inconsistent across these studies, with some showing that altercentric interference is attenuated when a non-social (i.e., inanimate) agent is present or when they have a restricted view of the stimulus, therefore supporting a mentalizing account, but others revealing comparable inconsistency effects for inanimate and restricted view designs, thus supporting the dominant role of attentional processes.

The current study uses the avatar visual perspective-taking task to test whether the age of the observed person (adult vs. child avatar) influences adults’ visual perspective-taking performance. Therefore, while we do not directly aim to test mentalizing versus directional accounts of automatic perspective-taking, the results clearly have a bearing on this debate. Specifically, a purely attentional account would predict no difference between child and adult avatars since directional features (i.e., forehead, eyes, nose, etc.) are equated between avatars. In contrast, if we find that avatar age modulates altercentric interference this would suggest that participants have inferred different mental states for child and adult avatars, and therefore would support the role of mentalizing in this task.

Our age manipulation links to neuroimaging research that has revealed overlapping neural activation between self and other mentalizing when the person is considered to be similar to the self, but not when the person is different from the self (Davis et al., 1996; Mahajan & Wynn, 2012; Mitchell et al., 2006; Pfeifer et al., 2009). This pattern suggests that participants refer to their own perspective to understand how a similar person might be seeing, feeling, or thinking, and fits with a spontaneous perspective-taking mechanism that is especially pronounced when one feels socially connected to the other person (Smith & Mackie, 2016). In line with this, studies that have examined how similarity between the self and other influences mental state inferences report greater egocentric interference when people are taking the perspective of an ingroup member compared to an outgroup member (e.g., Simpson & Todd, 2017; Savitsky et al., 2011; Todd et al., 2011). In particular, Simpson and Todd (2017) adapted the avatar task described above by manipulating the group membership of the avatar, such that university affiliations and personality traits distinguished in-group from out-group members. Results revealed increased egocentric interference with in-group than out-group avatars, but no influence of avatar group membership on altercentric interference (though shared group membership did facilitate “other” perspective-taking on consistent trials). In the current study our choice to examine effects of the avatar’s age was based on research that has demonstrated an own-age bias, reflecting enhanced performance in a range of social perception tasks when the other person is in the same age category as the perceiver (e.g., Bailey et al., 2014; Melinder et al., 2010; Rhodes & Anastasi, 2012; Slessor et al., 2010; Slessor et al., 2014).

We complement the standard behavioral data collected in this paradigm by recording event-related brain potentials (ERPs) to examine the effects of perspective-taking and age of avatar in real-time. To date only one study has applied this technique to the avatar visual perspective-taking paradigm (McCleery et al., 2011); however, a growing number of studies have used ERPs to examine other aspects of ToM. Many of these studies have examined the brain’s response as participants answer explicit belief questions (e.g., “where does X think the Y is?”, e.g., Liu et al., 2004, 2009; Sabbagh & Taylor, 2000; Wang et al., 2008; Zhang et al., 2009), or passively observe pictorial sequences of events depicting beliefs and desires (e.g., Geangu et al., 2013; Kühn-Popp et al., 2013; Meinhardt et al., 2012), and have consistently demonstrated a positive-going late frontal slow wave (LFSW, ~300 ms onwards) when people are required to reason about others’ (false) beliefs versus reality. Though there is general agreement that differences on the LFSW reflect the key processes that distinguish mental states from reality (Liu et al., 2004; Sabbagh & Taylor, 2000), the exact mechanisms that underlie this component remain controversial due to the variety of paradigms and component definitions (i.e., time course or topography) that have been used in existing studies. Thus, deflections of the LFSW have been attributed to the experience of conflicting self/other perspectives (Jiang et al., 2016), the need to inhibit the self-perspective when inferring others’ beliefs (Zhang et al., 2009), and shifting between external stimuli and internal mental representations (Meinhardt et al., 2011).

More recently, researchers have reported effects of self-other processing on another ERP component, the P300, in a variety of social cognitive paradigms (see Knyazev, 2013 for a review). These studies typically manipulate the consistency of self-reference with auditory, visual, or sensory experiences (e.g., own name/face pairings, Cygan, Tacikowski, Ostaszewski, Chojnicka, & Nowicka, 2014; observed/intended actions, Deschrijver, Wiersema, & Brass, 2017; self/other touch, Deschrijver, Wiersema, & Brass, 2016). This work has consistently shown modulation of the P300 when processing self-relevant information, suggesting that this component indexes the distinction between self and other perspectives. However, in contrast to non-social oddball-type effects (e.g., Picton, 1992; Polich, 2007), these self-referenced effects reveal larger P300 amplitudes for self-compatible conditions compared to self-incompatible conditions. It has been suggested that this pattern reflects the increased need to resist interference when self and other perspectives are inconsistent, meaning that fewer resources are available to generate the P300 (Deschrijver et al., 2017). Thus, similar to the LFSW, self-referenced modulations of the P300 are likely to reflect both the social process of distinguishing self and other perspectives, and the recruitment of higher-order cognitive processes to evaluate self-related stimuli, and support increased allocation of attention and conflict resolution (Conde et al., 2015; Tacikowski & Nowicka, 2010). We note that the existing literature does not provide a clear distinction between the social and cognitive contributions to LFSW and P300 components, and indeed some researchers have reflected on whether the two components might reflect common processes given the overlapping time windows and scalp distributions (Jiang, Li, Li, Wang, Cao, & Li, 2016).

One study has directly explored the neural basis of visual perspective-taking by recording ERPs and estimating the neural sources while participants completed an auditory-visual version of the avatar visual perspective-taking task (McCleery et al., 2011; e.g., “she sees N” - [image]). Results revealed that perspective and consistency modulated numerous ERP components, including the amplitude of the P200 (larger amplitude over occipital midline electrodes for self-inconsistent trials than any other trial types), and the latency and amplitude of a middle latency component (referred to as TP450; longer peak latencies for other- than self-perspective trials, particularly other-inconsistent, and larger peak amplitudes for consistent compared to inconsistent trials). Consistency also modulated the LFSW between 600–800 ms (consistent > inconsistent). The authors suggest that modulations of the P200 component reflect strategic allocation of visual attention, since the self-inconsistent condition is the only trial type that requires attention to be divided between both walls (i.e., in front and behind the avatar). Crucially, the latency of deflections of the TP450 were attributed to the processing costs of calculating the avatar’s perspective (which are highest in the other-inconsistent condition), with source analyses linking TP450 effects to the temporal parietal cortex (Saxe & Kanwisher, 2003). These TP450 effects, showing influences of perspective and consistency, are therefore compatible with the interference effects seen on the P300 in the self-other tasks described above. Finally, the consistency effect on the LFSW amplitude is interpreted as reflecting the recruitment of executive functions to manage conflicting perspectives (localized to the right frontal cortex), and is therefore consistent with the ERP studies of belief processing, described above (e.g., Jiang et al., 2016; Zhang et al., 2009).

In this paper, we present three experiments that systematically examine whether and how age of avatar influences adults’ visual perspective-taking. In our first experiment, we recorded ERPs and behavioral responses while participants completed a version of the avatar visual perspective-taking task, with age of avatar manipulated between two groups (adult vs. child). In line with previous studies we predicted that this task would elicit both egocentric and altercentric interference effects, reflected in reduced accuracy and increased reaction times when the two perspectives were in conflict (i.e., a main effect of consistency). Replicating previous work, we also expected this consistency effect on reaction times to be larger when cued to take the avatar’s perspective than when cued to take the self perspective (i.e., a perspective × consistency interaction), reflecting the heightened need to inhibit irrelevant perspectives, with greater interference from the egocentric perspective than the altercentric perspective. Given the converging effects seen across previous ERP investigations of ToM processing, our ERP analyses focused on three key components: P200 (associated with perceptual processing), P300, and LFSW (reflecting self-other distinctions and the management of self-other conflicts). Thus, if deflections of the P200 reflect relatively low-level strategic allocation of visual attention, we expected to replicate McCleery et al.’s pattern of maximal amplitude for self-inconsistent trials (due to divided attention on these trials). 
More importantly, we expected the higher-level processes of distinguishing self/other perspectives and inhibiting the alternative perspective to be reflected in reduced P300 and LFSW amplitudes for inconsistent trials, with a larger consistency effect on these components for other- than self-perspective trials since the self (egocentric) perspective causes greater interference (and thus fewer available cognitive resources) than altercentric intrusions (as in Deschrijver et al., 2017). McCleery et al. used source localization to make a clear distinction between the mechanisms underlying their mid-latency TP450 component (representing social self-other conflict processes) and the LFSW waveform (executive processes to manage conflict). However, due to paradigm and component differences in the current studies, and based on the existing literature that implicates both ToM and executive processes, we do not tie our predictions on the P300 and LFSW components to distinct social/cognitive mechanisms.

Crucially, if the age of avatar manipulation activates distinct mental state processing mechanisms for similar and dissimilar others then we would expect to see modulations of these perspective-taking effects according to the age of avatar, via an own-age bias (i.e., an avatar × consistency interaction, or an avatar × perspective × consistency interaction). Such modulations should be limited to response times, P300 amplitude, and LFSW amplitude since these measures have been shown to directly reflect high-level self-other processing and conflict (note that we do not expect age of avatar to influence low-level attention allocation, as measured by P200 amplitude). We expected this effect to reflect reduced processing of the other perspective when a child (i.e., dissimilar) versus adult (i.e., similar) avatar was present, and for it to be manifest in a reduced or absent altercentric interference effect (similar to previous studies that have manipulated avatar animacy/view, e.g., Furlanetto et al., 2016; Schurz et al., 2015), and/or a larger egocentric interference effect (similar to the ingroup effect seen in Simpson & Todd, 2017). In contrast, a purely directional account that does not activate spontaneous mentalizing would not predict any differences in processing between adult and child avatars, since directional features are matched between avatars.

Experiment 1

Method

Participants

A total of 38 English-speaking Caucasian students from the University of Kent took part in the study. Four of these participants were excluded due to poor accuracy on the task (<50%) or poor quality of EEG data (resulting in a trial loss of > 40%). Thus, the final sample included 34 participants (24 female; 29 right-handed; M_age = 20.5 years), split equally between the adult and child avatar groups, and matched between groups on gender and age. This sample size was determined a priori to match the sample size used in McCleery et al.’s (2011) ERP avatar visual perspective-taking task (N = 17) in each of our avatar age groups.

All participants completed the Empathy Quotient (Baron-Cohen & Wheelwright, 2004), a 40-item self-report questionnaire that assesses empathy and social aptitude. In addition, all participants completed the Simon task (Simon & Wolf, 1963), consisting of 80 trials (40 consistent/40 inconsistent), and an inhibitory control score was calculated by subtracting reaction times for correct responses on consistent trials from inconsistent trials. The two avatar-age groups were therefore statistically matched on participants’ gender (12 females and five males in each group), age (adult M = 21.5; child M = 19.5; t = 1.57), empathy quotient score (adult M = 40.8; child M = 37.9; t = .88), and inhibitory control score (adult M = 21.4; child M = 19.0; t = .88).
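The inhibitory control score described above can be sketched as follows. This is a minimal illustration only: the trial-record fields (`condition`, `correct`, `rt_ms`) and the sample values are hypothetical, and the use of condition means is an assumption, since the text does not state how reaction times were aggregated.

```python
from statistics import mean

def simon_inhibition_score(trials):
    """Mean correct RT on inconsistent trials minus mean correct RT on
    consistent trials, in ms (larger values = more interference)."""
    def mean_correct_rt(condition):
        rts = [t["rt_ms"] for t in trials
               if t["condition"] == condition and t["correct"]]
        return mean(rts)
    return mean_correct_rt("inconsistent") - mean_correct_rt("consistent")

# Hypothetical trial records, invented for illustration (not study data).
example_trials = [
    {"condition": "consistent", "correct": True, "rt_ms": 400},
    {"condition": "consistent", "correct": True, "rt_ms": 420},
    {"condition": "consistent", "correct": False, "rt_ms": 900},  # error: excluded
    {"condition": "inconsistent", "correct": True, "rt_ms": 430},
    {"condition": "inconsistent", "correct": True, "rt_ms": 450},
]
score = simon_inhibition_score(example_trials)
```

With these invented records the incorrect trial is ignored and the score is the difference between the two condition means.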

Materials

Participants took part in a visual perspective-taking task (adapted from Samson et al., 2010) while EEG activity was continuously recorded. The visual stimuli included a 3D lateral view of a room, where the ceiling, floor, left, right and back walls were visible. Red discs were displayed on one or two of the left/right walls. The number and position of discs changed on each trial. In addition, a realistic human avatar was standing in the center of the room, facing either the left or right wall. The avatar’s gender always matched the participant’s gender, but half the participants saw an adult-like avatar, and the other half saw a child-like avatar.^{Footnote 1} On half the trials, the avatar’s orientation meant that s/he saw the same number of discs as the participant (consistent condition), and on the other half, the avatar’s orientation meant that s/he could not see some of the discs that were visible to the participant (since they were placed on the wall behind the avatar; inconsistent condition). See Fig. 1 for examples of these visual stimuli, and the Open Science Framework for the full set of materials (https://osf.io/bqw4h).

To ensure that the directional features were matched between child and adult avatars, the stimuli were pre-tested using a Posner paradigm (Posner, 1980). Sixteen participants (M_age = 24.8 years) completed a total of 96 trials in a within-subjects design that crossed avatar (child vs. adult) and gaze-cue validity (valid vs. invalid), thus 24 trials in each condition. Trials began with a central fixation cross in the empty 3D room (700 ms), followed by the central avatar facing left or right (i.e., the gaze cue; 300 ms), and finally a single red disc appeared on the left or right wall (replicating the position of discs used in the main task) until a response was made. Correct response times were analyzed using a within-subjects 2 × 2 ANOVA, crossing avatar (adult vs. child) and gaze-cue validity (valid vs. invalid). Results revealed a significant main effect of gaze-cue validity (valid = 311 ms vs. invalid = 336 ms, F(1, 15) = 141.3, p < .001, η_p² = .9), but no main effect of avatar (F = .01, p = .91) and no interaction between the two variables (F = .51, p = .49).

Procedure

Participants were informed about the EEG procedure and experimental task. Their task was to verify the number of discs that were visible either according to their own perspective (self-perspective condition), or according to the avatar’s perspective (other-perspective condition). Trials were either matching or mismatching. On matching trials, the cue digit corresponded to the number of discs that could be seen from the cue perspective for the target image. On mismatching trials, the cue digit did not correctly correspond to the number of discs that could be seen from the cue perspective. After electrode application they were seated in a booth where they read the materials from a computer screen. The experiment was controlled using E-Prime software.

Each trial began with a fixation cross in the center of the screen for 750 ms. Following a blank screen inter-stimulus interval (ISI) of 150 ms, 250 ms, or 350 ms,^{Footnote 2} the word “YOU” or “SHE/HE” was presented for 750 ms. This informed participants whether to respond to the current trial according to their own or the avatar’s perspective. Following a second blank screen ISI, a digit between 0 and 3 was shown in the center of the screen for 750 ms. This indicated the number of discs the participant needed to verify, according to the given perspective. Finally, the target image of the room, avatar, and discs (650 × 480 pixels) appeared centrally on-screen. Participants were instructed to judge whether the number of discs in the target image matched the preceding digit according to the cued perspective or not, using keys “z” and “m” (key associations were counterbalanced across participants). Participants were asked to respond as quickly and accurately as possible. The screen advanced to the next trial once a keyboard response had been detected or for a maximum of 2000 ms (see Fig. 2).
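The trial sequence described above can be laid out as a simple event list. The sketch below is illustrative and is not the E-Prime script used in the study; the function names are hypothetical, and it assumes that the second ISI is drawn from the same three values as the first, which the text does not state explicitly.

```python
import random

FIXATION_MS = 750
CUE_MS = 750
DIGIT_MS = 750
ISI_CHOICES_MS = (150, 250, 350)
RESPONSE_DEADLINE_MS = 2000  # image stays on until response or this limit

def build_trial_timeline(perspective_cue, digit, rng=random):
    """Return the ordered (event, duration in ms) pairs for one trial."""
    isi_1 = rng.choice(ISI_CHOICES_MS)
    isi_2 = rng.choice(ISI_CHOICES_MS)  # assumption: same jitter set as isi_1
    return [
        ("fixation", FIXATION_MS),
        ("blank", isi_1),
        ("cue:" + perspective_cue, CUE_MS),
        ("blank", isi_2),
        ("digit:" + str(digit), DIGIT_MS),
        ("target_image", RESPONSE_DEADLINE_MS),  # upper bound on duration
    ]

def max_duration_ms(timeline):
    """Longest possible trial duration (no response before the deadline)."""
    return sum(duration for _, duration in timeline)

timeline = build_trial_timeline("YOU", 2, random.Random(0))
```

Under these assumptions a trial lasts at most between 4,550 ms and 4,950 ms, depending on the two jittered intervals.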

Participants completed a practice block of 26 trials, followed by the main task, which consisted of 12 blocks, each with 52 trials. In total there were 288 matching trials, 288 mismatching trials, and 48 “filler” trials (where no discs were displayed on either wall so that the disc number 0 was sometimes correct for self-perspective trials). Participants were asked to respond according to their own perspective on half the trials, and to respond according to the avatar’s perspective on the other half. Of these, half were consistent trials, where the avatar and participant saw the same number of discs on the wall, and half were inconsistent trials, where the avatar’s and participant’s views were different. Trials were presented in a pseudorandom order mixing self and other perspectives, such that no more than four consecutive trials tapped the same perspective, and no more than three consecutive trials tapped the same perspective-consistency condition. No complete stimulus repetitions (i.e., same perspective cue and image) were included. The full experiment lasted for about 80 min.
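The trial counts and ordering constraints above lend themselves to a quick consistency check. The following is a hedged sketch rather than the procedure actually used to build the trial lists: the function names are hypothetical, trials are represented as invented (perspective, consistency) tuples, and how filler trials entered the run-length constraints is not specified in the text, so they are ignored here.

```python
from itertools import groupby

N_BLOCKS, TRIALS_PER_BLOCK = 12, 52
N_MATCHING, N_MISMATCHING, N_FILLER = 288, 288, 48
TOTAL_TRIALS = N_BLOCKS * TRIALS_PER_BLOCK  # 12 x 52 = 624

def longest_run(labels):
    """Length of the longest run of identical consecutive labels."""
    return max((len(list(group)) for _, group in groupby(labels)), default=0)

def order_is_valid(trials, max_perspective_run=4, max_condition_run=3):
    """trials: list of (perspective, consistency) tuples in presentation order.
    Checks the two run-length limits stated in the text."""
    perspectives = [perspective for perspective, _ in trials]
    return (longest_run(perspectives) <= max_perspective_run
            and longest_run(trials) <= max_condition_run)

# Hypothetical orders for illustration.
ok_order = [("self", "consistent"), ("self", "inconsistent"),
            ("self", "consistent"), ("self", "inconsistent"),
            ("other", "consistent")]
five_self = ok_order[:4] + [("self", "consistent")]
four_same_condition = [("other", "consistent")] * 4
```

A generator could repeatedly shuffle a block and keep the first order for which `order_is_valid` returns true, although the text does not say how the pseudorandom lists were produced.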

In sum, three independent variables were manipulated in a 2 (Consistency: consistent vs. inconsistent) × 2 (Perspective: self vs. other) × 2 (Avatar: adult vs. child) mixed design, with Consistency and Perspective being within-subjects and Avatar being between-subjects. Effects were analyzed at the target image, on accuracy of responses, response time, and the ERP components as detailed below.

Electrophysiological measures

EEG activity was recorded continuously using a Brain Vision Quickamp amplifier system with a 62-channel ActiCap, over midline electrodes Fz, Cz, CPz, Pz, POz, and Oz, over the left hemisphere from electrodes Fp1, AF3, AF7, F1, F3, F5, F7, FC1, FC3, FC5, FC7, C1, C3, C5, T7, CP1, CP3, CP5, TP7, A1, P1, P3, P5, P7, PO3, PO7, PO9, O1, and from the homologue electrodes over the right hemisphere. EEG data were referenced online to electrode FCz, and grounded to electrode AFz. EEG and EOG recordings were sampled at a rate of 500 Hz. Electrode impedances were kept at < 25 kΩ.

Brain Vision Analyzer 2 software was used to prepare the data prior to analysis. First, noisy or faulty electrodes were interpolated from surrounding channels (a maximum of three channels), then all channels were re-referenced offline to an average reference (excluding eye channels and mastoids) and the EEG signal was band-pass filtered (0.3–40 Hz, 12 dB/oct). Data containing blinks and horizontal eye movements were corrected using semi-automatic ocular Independent Components Analysis (ICA) correction (which removed an average of three components per participant), then the data were segmented into epochs of 1,100 ms time-locked to picture onset (−100 to 1,000 ms). Any trial where the participant made an incorrect picture judgment was eliminated from further ERP analysis, then each trial was individually inspected to identify and discard trials with non-ocular artifacts (drifts, channel blockings, EEG activity exceeding ±75 μV), using a semi-automatic artifact rejection algorithm. Together, these procedures resulted in an average trial loss of 11.3% per participant, and an average of 64 accepted segments per condition/participant. A 2 (Perspective) × 2 (Consistency) × 2 (Avatar) ANOVA testing trial loss across conditions revealed no difference between avatar conditions (p = .71) or perspective (p = .15) or any interactions (all ps > .1), but significantly fewer accepted segments per participant for inconsistent trials than consistent trials (61 vs. 67; F(1, 32) = 65.23, p < .001, η_p² = .67), due to differences in accuracy in these conditions (see behavioral results below). Finally, the signal at each electrode site was aligned to a 100-ms baseline, then averaged separately for each experimental condition (Fig. 3).
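The last steps of this pipeline (amplitude-based rejection, baseline alignment, and averaging) can be illustrated for a single channel. This is a simplified sketch and not the Brain Vision Analyzer procedure: the real rejection was semi-automatic and also covered drifts and channel blockings, the function names are hypothetical, and the epochs below are invented values. The only figures taken from the text are the 500 Hz sampling rate, the 1,100-ms epoch, the 100-ms baseline, and the 75 μV limit.

```python
SAMPLING_RATE_HZ = 500
EPOCH_LENGTH_MS = 1100      # -100 ms to 1,000 ms around picture onset
BASELINE_MS = 100
REJECT_THRESHOLD_UV = 75.0

def ms_to_samples(duration_ms, rate_hz=SAMPLING_RATE_HZ):
    """Convert a duration in ms to a number of samples."""
    return int(round(duration_ms * rate_hz / 1000))

def is_artifact(epoch, threshold_uv=REJECT_THRESHOLD_UV):
    """Flag an epoch if any sample exceeds the absolute amplitude limit."""
    return any(abs(value) > threshold_uv for value in epoch)

def baseline_correct(epoch, n_baseline):
    """Subtract the mean of the pre-stimulus samples from every sample."""
    baseline = sum(epoch[:n_baseline]) / n_baseline
    return [value - baseline for value in epoch]

def average_erp(epochs):
    """Reject artifact epochs, baseline-correct the rest, then average.
    Returns (averaged waveform, number of accepted epochs)."""
    n_baseline = ms_to_samples(BASELINE_MS)
    kept = [baseline_correct(e, n_baseline) for e in epochs if not is_artifact(e)]
    waveform = [sum(samples) / len(kept) for samples in zip(*kept)]
    return waveform, len(kept)

# Invented single-channel epochs (in microvolts), for illustration only.
n_samples = ms_to_samples(EPOCH_LENGTH_MS)          # 550 samples per epoch
clean = [2.0] * 50 + [5.0] * 500                    # flat baseline, then a step
noisy = [0.0] * (n_samples - 1) + [120.0]           # exceeds the 75 uV limit
erp, n_accepted = average_erp([clean, clean, noisy])
```

At 500 Hz each epoch holds 550 samples, of which the first 50 form the baseline window.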

ERP data analysis

Three ERP components were identified for analysis, based on previous research that has examined perspective and consistency effects in a visual perspective-taking task (McCleery et al., 2011), and ERP studies of self-referential processing (e.g., Cygan et al., 2014; Deschrijver et al., 2016, 2017). Thus, our analyses focused on the peak amplitude of the P200 (a positive-going component, peaking between 200–260 ms over central occipital electrode sites, associated with perceptual processing, see Fig. 4), the peak latency and amplitude of the P300 (a positive-going component peaking between 250–400 ms over central parietal electrode sites, reflecting self-other distinctions, see Fig. 5), and the mean amplitude over a late frontal slow-wave (LFSW, between 400–700 ms over the left and right lateral frontal cortex, reflecting management of self-other conflicts, see Fig. 6). We note that our P300 component is consistent with research in the field of self-referential processing and is comparable to the TP450 component seen in McCleery et al., and attribute the slightly different topography and peak latency to the fact that stimuli in McCleery et al. were presented in a multi-modal auditory-visual format (e.g., “she sees N” - [image]), whereas all stimuli in the current study were presented in a visual sequence (as is typical in this paradigm, e.g., Samson et al., 2010; Santiesteban et al., 2014). In addition, we conducted exploratory analyses on the P100 amplitude (an early positive-going component peaking between 80–120 ms over central occipital electrode sites), since visual inspection of the ERP waveforms suggested a group difference on this component (see Figs. 4 and 7). 
The P100 is a sensory response to visual stimuli, and is sensitive to stimulus parameters, such as size and luminance, thus we tested for between-groups differences here to quantify early differences in the waveform due to physical differences between adult and child avatar stimuli, which may contaminate subsequent ERP effects.

The electrodes used to measure each component were as follows: left frontal: AF7, F7, F5; right frontal: AF8, F6, F8; central parietal: CP1, CP2, CPz, Pz, P1, P2; central occipital: POz, PO3, PO4, Oz, O1, O2. Peak amplitudes (P100, P200, and P300), latencies to peak amplitudes (P300), and mean amplitudes (LFSW) were identified using the time intervals defined above using Brain Vision Analyzer’s automatic peak detection algorithm and measured for each individual electrode in the relevant conglomerate, then averaged within the relevant region for each participant and condition. For the statistical analysis of amplitude (and latency) data over central occipital and central parietal components (P100, P200, and P300), ANOVAs with variables Perspective (self vs. other), Consistency (consistent vs. inconsistent), and Avatar (adult vs. child) were conducted. The LFSW was analyzed as the mean amplitude over lateral frontal sites using an ANOVA with variables Hemisphere (left vs. right), Perspective (self vs. other), Consistency (consistent vs. inconsistent), and Avatar (adult vs. child).
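The component measures defined above (peak amplitude, peak latency, and mean amplitude within a time window) can be sketched for one averaged waveform. This is an illustration under stated assumptions, not Brain Vision Analyzer's peak detection algorithm: it simply takes the most positive sample in the window, the function names are hypothetical, and the waveform is invented. The time windows are the ones given in the text (P300: 250–400 ms; LFSW: 400–700 ms).

```python
SAMPLING_RATE_HZ = 500
EPOCH_START_MS = -100  # epochs run from -100 ms to 1,000 ms

def ms_to_index(time_ms):
    """Sample index of a post-onset time within the epoch."""
    return int(round((time_ms - EPOCH_START_MS) * SAMPLING_RATE_HZ / 1000))

def peak_in_window(waveform, start_ms, end_ms):
    """Return (amplitude, latency in ms) of the most positive sample
    between start_ms and end_ms, inclusive."""
    lo, hi = ms_to_index(start_ms), ms_to_index(end_ms)
    idx = max(range(lo, hi + 1), key=lambda i: waveform[i])
    latency_ms = EPOCH_START_MS + idx * 1000 / SAMPLING_RATE_HZ
    return waveform[idx], latency_ms

def mean_in_window(waveform, start_ms, end_ms):
    """Mean amplitude between start_ms and end_ms, inclusive."""
    lo, hi = ms_to_index(start_ms), ms_to_index(end_ms)
    segment = waveform[lo:hi + 1]
    return sum(segment) / len(segment)

def region_average(per_electrode_values):
    """Average a measure across the electrodes of one region."""
    return sum(per_electrode_values) / len(per_electrode_values)

# Invented waveform: a single 6 uV peak at 300 ms, then a 2 uV plateau.
waveform = [0.0] * 550
waveform[ms_to_index(300)] = 6.0
for i in range(ms_to_index(400), ms_to_index(700) + 1):
    waveform[i] = 2.0

p300_amplitude, p300_latency = peak_in_window(waveform, 250, 400)
lfsw_mean = mean_in_window(waveform, 400, 700)
```

In the study each measure was taken per electrode and then averaged within the region, which corresponds to calling `region_average` on the per-electrode results.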

Results

Accuracy and response times for matching trials were analyzed using separate 2 × 2 × 2 analyses of variance (ANOVA), with Perspective (Self vs. Other) and Consistency (Consistent vs. Inconsistent) as within-subjects variables, and Avatar (Adult vs. Child) as the between-subjects variable. Note that due to space constraints, only significant or marginal (p ≤ .06) effects are presented in the text throughout this manuscript. Full statistical effects for each experiment and measure are summarised in the Appendix, and full data for each experiment and measure are available on the Open Science Framework (https://osf.io/bqw4h/?view_only=e275ad0e97dc42b7b6dcf17e089df06d). In line with standard procedures, behavioral analyses did not exclude trials based on ERP preprocessing. Incorrect picture verification responses and trials where the participant did not respond to the image in the given 2,000 ms were excluded from the response-time analysis (5.5%), which was measured from the onset of the picture. Resulting mean response accuracy and response times for each condition are shown in Fig. 3.

Response accuracy

The ANOVA revealed a significant main effect of Perspective (F(1, 32) = 4.20, p = .049, ηp² = .12), reflecting higher accuracy when participants responded according to their own perspective (M = 93.2%) compared to the avatar’s perspective (M = 91.8%). In addition, a significant main effect of Consistency (F(1, 32) = 72.19, p < .001, ηp² = .69) showed that accuracy was higher when participants shared the same visual perspective with the avatar (M = 96.5%), compared to when the two perspectives were inconsistent (M = 88.5%). Neither Avatar nor the interactions were significant (all Fs < 2.25, ps > .11).
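The partial eta squared values reported in this section can be cross-checked against the accompanying F statistics using the standard identity: partial eta squared = (F × df_effect) / (F × df_effect + df_error). The short sketch below applies it to the two accuracy effects reported above; the function name is ours and purely illustrative.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Recover partial eta squared from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Main effect of Perspective on accuracy: F(1, 32) = 4.20
perspective_effect = round(partial_eta_squared(4.20, 1, 32), 2)
# Main effect of Consistency on accuracy: F(1, 32) = 72.19
consistency_effect = round(partial_eta_squared(72.19, 1, 32), 2)

print(perspective_effect)  # 0.12
print(consistency_effect)  # 0.69
```

Both values agree with the effect sizes given in the text.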

Response times

The ANOVA showed a significant main effect of Consistency (F(1, 32) = 93.62, p < .001, ηp² = .75), with responses being slower when perspectives were inconsistent (M = 673 ms) compared to when perspectives were consistent (M = 612 ms). In addition, Perspective interacted significantly with Consistency (F(1, 32) = 16.65, p < .001, ηp² = .34). Bonferroni corrected post hoc tests revealed that the Consistency effect was larger when taking the Other perspective (t(33) = 11.56, p < .001; inconsistent minus consistent = 77 ms), compared to when taking the Self perspective (t(33) = 5.47, p < .001; inconsistent minus consistent = 44 ms). These results replicate previous studies and show that participants experienced both egocentric and altercentric interference, though intrusions from one’s own knowledge were significantly larger (paired-samples t-test comparing consistency effect in each perspective condition: t(33) = 4.04, p < .001). Neither the main effects of Perspective and Avatar nor the other interactions were significant (all Fs < 1.72, ps > .2).
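The comparison of interference effects described above rests on difference scores: for each participant, the consistency effect (inconsistent minus consistent response time) is computed separately for Other and Self trials, and the two sets of effects are compared with a paired-samples t-test. The sketch below shows that computation under the assumption of four hypothetical participants; the data and function name are illustrative and are not taken from the study.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(x, y):
    """Return (t statistic, degrees of freedom) for a paired-samples t-test."""
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    t_value = mean(diffs) / (stdev(diffs) / sqrt(n))
    return t_value, n - 1


# Hypothetical per-participant consistency effects in ms (inconsistent minus consistent).
egocentric_effect = [80, 70, 90, 60]    # measured on Other-perspective trials
altercentric_effect = [40, 50, 45, 35]  # measured on Self-perspective trials

t_value, df = paired_t(egocentric_effect, altercentric_effect)
print(round(t_value, 2), df)
```

With the 34 participants analysed here, the same test has 33 degrees of freedom, matching the t(33) reported in the text.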

ERP effects

P100

The analysis of P100 amplitude revealed a significant effect of Avatar (F(1, 32) = 4.73, p = .037, ηp² = .13), with the child avatar eliciting a larger amplitude (M = 7.57 μV) than the adult avatar (M = 5.21 μV). Since the P100 is known to reflect low-level perceptual analysis, we attribute this avatar effect to physical differences between the child and adult stimuli (e.g., luminance, spatial frequency; Linkenkaer-Hansen et al., 1998). There were no other significant main effects or interactions (all Fs < 2.82, ps > .1).

P200

The ANOVA on P200 amplitude revealed a significant main effect of Perspective (F(1, 32) = 9.99, p < .003, ηp² = .24), reflecting a larger P200 amplitude on self (M = 9.36 μV) than other trials (M = 8.85 μV), and a significant main effect of Consistency (F(1, 32) = 27.66, p < .001, ηp² = .46), reflecting a larger P200 amplitude on consistent (M = 9.47 μV) compared to inconsistent trials (M = 8.75 μV). In addition, a significant interaction between Perspective and Consistency (F(1, 32) = 7.83, p < .01, ηp² = .20) was found. Bonferroni corrected post hoc tests revealed that the Consistency effect was only significant when participants were cued to take the other perspective (t(33) = 5.66, p < .001), and not when cued to take their own perspective (t(33) = 1.71, p = .098). This pattern suggests a robust egocentric interference effect, but a weaker or absent altercentric interference effect. The effect of Avatar and the remaining interactions were not significant (all Fs < 3.07, ps > .09).

P300

The ANOVA on latencies revealed a significant main effect of Perspective (F(1, 32) = 25.26, p < .001, ηp² = .44), with longer peak latencies in other (M = 355 ms) than self (M = 341 ms) trials. There was no significant main effect of Consistency or Avatar, nor any significant interactions (all Fs < 2.57, ps > .119).

Analysis of P300 amplitude revealed a significant main effect of Consistency (F(1, 32) = 104.63, p < .001, ηp² = .77), with consistent trials (M = 4.89 μV) eliciting a larger amplitude than inconsistent trials (M = 3.65 μV). Interestingly, a significant interaction between Perspective and Avatar (F(1, 32) = 4.80, p = .036, ηp² = .13) was found, subsumed under a significant three-way interaction between Perspective, Consistency, and Avatar (F(1, 32) = 5.41, p = .026, ηp² = .15). Follow-up analyses examined effects for adult and child avatars separately. The adult avatar condition showed only a significant consistency effect (F(1, 16) = 39.50, p < .001, ηp² = .71), with consistent trials (M = 4.68 μV) eliciting a larger P300 amplitude than inconsistent trials (M = 3.50 μV). In contrast, the child avatar condition revealed significant main effects of Perspective (F(1, 16) = 21.66, p < .001, ηp² = .58; self M = 4.87 μV vs. other M = 4.04 μV), and Consistency (F(1, 16) = 72.10, p < .001, ηp² = .82; consistent M = 5.10 μV vs. inconsistent M = 3.81 μV). Moreover, the Perspective × Consistency interaction was significant (F(1, 16) = 19.43, p < .001, ηp² = .55). Bonferroni corrected post hoc tests revealed a significant consistency effect when participants were taking the other perspective (t(16) = 7.73, p < .001; consistent = 5.14 μV vs. inconsistent = 2.93 μV), but not when taking the self perspective (t(16) = 1.57, p = .136; consistent = 5.04 μV vs. inconsistent = 4.69 μV).

LFSW

The ANOVA revealed a significant main effect of Hemisphere (F(1, 32) = 5.02, p = .03, ηp² = .14), with a larger, more negative-going amplitude over the left hemisphere (M = -2.80 μV) than the right hemisphere (M = -2.38 μV). There was also a significant main effect of Consistency (F(1, 32) = 8.90, p = .005, ηp² = .22; consistent < inconsistent), and a main effect of Perspective (F(1, 32) = 6.28, p = .02, ηp² = .16; other < self). Similar to the P300 component, the three-way interaction between Perspective, Consistency, and Avatar was significant (F(1, 32) = 7.45, p = .01, ηp² = .19). Further analyses examined effects for adult and child avatars separately, and showed that the Perspective × Consistency interaction was only significant in the child avatar condition (F(1, 16) = 5.15, p < .05, ηp² = .24), and not in the adult avatar condition (F(1, 16) = 2.43, p = .14). Bonferroni corrected post hoc tests in the child avatar group revealed a significant consistency effect when participants were cued to take the avatar’s perspective (t(16) = 2.81, p = .01; consistent = -3.12 μV vs. inconsistent = -2.26 μV), but not when they were cued to use the self perspective (t(16) = .77, p = .45; consistent = -2.68 μV vs. inconsistent = -2.53 μV).

To further investigate whether the condition effects observed on the P300 and LFSW components can be differentiated, we ran an exploratory ANOVA that crossed Component (P300 vs. LFSW) × Site (Anterior vs. Posterior^{Footnote 3}) × Perspective (Self vs. Other) × Consistency (Consistent vs. Inconsistent) × Avatar (Adult vs. Child). This analysis showed a significant interaction between Component and Site (F(1, 32) = 26.63, p < .001, ηp² = .45), reflecting a significantly larger positivity over posterior sites for the P300 compared to the LFSW component. More importantly, this effect was subsumed under three-way interactions that revealed statistically different topographic distributions of condition effects between the two components. A significant Component × Site × Consistency interaction (F(1, 32) = 43.42, p < .001, ηp² = .58) showed that the consistency effect was significantly larger on the P300 component than the LFSW component over posterior (t(33) = 11.74, p < .001) and anterior sites (t(33) = 4.93, p < .001), though this difference was greater over posterior sites. Additionally, a significant Component × Site × Perspective interaction (F(1, 32) = 48.67, p < .001, ηp² = .60) revealed different effects of Perspective between P300 and LFSW components over posterior (t(33) = 6.63, p < .001) and anterior sites (t(33) = -4.93, p < .001). These findings provide some tentative evidence to suggest that the two components, emerging in consecutive but non-overlapping time windows, may reflect distinct stages of processing.

In summary, Experiment 1 replicated previous research in showing that both egocentric and altercentric biases interfered with visual perspective-taking, though the altercentric effect was smaller than the egocentric effect. Crucially, our ERP data revealed the first evidence that age of avatar modulates these effects; effects consistent with egocentric and altercentric intrusions were evident on P300 and LFSW amplitudes for adult avatars (i.e., increased amplitudes on consistent vs. inconsistent trials for both other and self perspectives), but altercentric effects on these components were attenuated with a child avatar (i.e., increased amplitudes on consistent vs. inconsistent trials only for the other perspective). These findings provide initial evidence that participants inferred different mental states for child and adult avatars, possibly due to an own-age bias, which facilitated spontaneous perspective-taking for a similar age other, but weakened perspective-taking for a dissimilar age other.

Nevertheless, age of avatar did not modulate behavioural responses, as the hypothesized Perspective × Consistency × Avatar interaction was not significant on the reaction time measure. When reflecting on why such effects did not emerge on behavioural measures, it is important to note that these results were obtained when age of avatar was manipulated between groups, when participants were tested on a high number of trials, and when they were instructed to respond according to both self and other perspectives. Although we reduced influences from individual differences on participants’ responses by matching the adult and child avatar groups across numerous key measures (i.e., gender, age, empathy, inhibitory control), it is possible that other unexpected differences existed between the two groups. In addition, testing both self and other perspectives within the same experiment made the difference between self and avatar perspectives salient, and made computing the avatar’s perspective on a given trial task-relevant. This design makes it difficult to conclude that modulations of altercentric interference (i.e., on the self trials) reflect genuine influences on automatic perspective-taking; they might instead reflect simple carry-over effects from having to compute the avatar’s perspective on ‘other’ perspective trials. Indeed, whether participants were asked to verify the number of discs according to both their own and the avatar’s perspective, or whether judgments were limited to their own perspective only, has been identified as a key methodological difference between previous studies that do or do not show mentalising effects (see Cole et al., 2016; Conway et al., 2017), since automaticity can only be certain when the other perspective is task-irrelevant. This observation is supported by a recent eye-tracking study showing that altercentric interference is greatest when participants have to switch between their own and the avatar’s perspective across consecutive trials (Ferguson et al., 2017), and a computerized false-belief task showing that switching perspectives from self-to-other is more costly than from other-to-self (Bradford et al., 2015). Finally, Experiment 1 tested a high number of trials (essential for ERP analysis, see Luck, 2014) as in McCleery et al. (2011), which is substantially higher than is typically used in behavioural studies (e.g., Samson et al., 2010, N = 208), and thus may have led to fatigue in our participants. This possibility was tested in a post-hoc analysis on reaction time data, including only the first half of experimental trials, which replicated the finding that age of avatar did not modulate the Perspective × Consistency interaction (F = .64, p = .43). As such, fatigue is less likely to account for reaction time insensitivity to the predicted avatar-dependent modulation of the perspective effect.

In Experiments 2 and 3, we employed a purely behavioural design (no ERPs), which allowed us to test a larger sample of participants in a within-subjects design, in line with previous research (e.g., Cole et al., 2016; Conway et al., 2017). While employing such a within-subjects design alongside ERPs would be ideal to fully understand the observed ERP effects, the necessary increase in trial numbers makes this option unviable (i.e., this design would require 1,248 trials in total to match the number of trials per condition (N = 72) in Experiment 1). By not recording ERPs, we were able to substantially reduce the number of trials (Expt. 1 = 624 vs. Expt. 2 = 312 vs. Expt. 3 = 208 trials). Thus, in Experiment 2 we tested the effects of age of avatar in a fully crossed within-subjects design that tapped both self and other perspectives to examine whether age of avatar effects would be evident on behavioural responses when effects from individual differences (resulting from Experiment 1’s mixed design) were eliminated. We expected to replicate the egocentric and altercentric effects on accuracy and reaction time measures. More important for the current research, if the effects of avatar seen on P300 and LFSW amplitudes genuinely reflect distinct self-other biases for adult and child observers, then we expected to observe this Avatar × Consistency × Perspective interaction on the behavioural responses in Experiment 2. Specifically, we predicted that egocentric interference would disrupt reaction times for both adult and child avatars, but that altercentric interference would only be observed for an adult, and not a child, avatar.

In Experiment 3 we further examined whether age of avatar influences ‘pure’ altercentric intrusion effects by testing self-perspective trials in isolation. Thus, if the altercentric effect truly reflects age-biased differences in spontaneous other perspective-taking, then we expected to see reduced altercentric interference for child versus adult avatars when participants were never prompted to take the avatar’s perspective. In contrast, if these effects purely reflect carry-over effects from explicit, non-automatic mentalizing on other perspective trials, then we would expect the age-modulation of the altercentric effect to disappear when the self perspective was assessed in isolation.