Introduction

Moral evaluation, a core content in moral cognition research, is defined as judging the moral acceptability or appropriateness of another’s behavior in a moral dilemma (Garrigan et al., 2016; Van Bavel et al., 2015). Generally, people evaluate the moral acceptability of one’s behavior according to the types of behavior (Anderson et al., 2020). Previous studies have classified such behavior into two types, i.e., prescriptive and proscriptive behavior (Janoff-Bulman et al., 2009; Noval & Stahl, 2017; Zhan et al., 2020). According to this classification, prescriptive morality indicates that people have established a positive desire and activated the motivation to do something good, whereas proscriptive morality indicates that people should overcome a negative desire and restrain a motivation to do something bad (Janoff-Bulman et al., 2009; Noval & Stahl, 2017). Therefore, if a person engages in prescriptive behavior, such as altruistic giving or helping, she/he would be evaluated as morally acceptable, because this target person’s altruistic behavior promotes the interests of others at a cost to oneself (Cushman, 2015; Hu et al., 2017a, b; Yuan et al., 2021). Conversely, proscriptive behavior, such as selfish behavior, is motivated by a desire to benefit oneself, which may cause harm to others’ interests (this is especially true in contexts where the interests of others are aligned with one’s own), and this type of behavior is thought to be morally unacceptable (Kilduff et al., 2016; Lu et al., 2017; Schein & Gray, 2018). If a person inhibits the motivation to conduct a proscriptive behavior toward others, she/he would be evaluated as morally acceptable (Janoff-Bulman et al., 2009).

Recent studies have applied relationship-regulation theory to moral evaluation research (Anderson et al., 2020; McManus et al., 2020) and have proposed that how people evaluate the morality of actions depends not only on types of actions (prescriptive/altruistic or proscriptive/selfish), but also on the social-relational context in which those actions occur (Earp et al., 2021). In the context where agents should select one recipient to receive the consequence of altruistic behavior in a moral dilemma, the agent who conducts altruistic behavior toward a socially close (vs. socially distant) other individual is evaluated as being more morally acceptable (Law et al., 2022; McManus et al., 2020). However, in the context in which agents should choose one recipient to receive the consequence of selfish behavior in a moral dilemma, the agent who can inhibit or restrain selfish behavior toward a distant stranger (instead of a socially close other person) is evaluated as being more morally acceptable (McManus et al., 2020; Zhan et al., 2020). These studies have demonstrated that in terms of prescriptive (altruistic) behavior, people recognize a positive obligation to help socially close others more than socially distant others (Chalik & Dunham, 2020). However, in the context in which people are required to conduct a proscriptive behavior, the obligations for socially close individuals are reprioritized in moral evaluations and people’s “socially close individual-favoritism” is perceived as inappropriate (Earp et al., 2021; McManus et al., 2020). Individuals recognize a positive obligation not to engage in proscriptive (selfish) behavior toward socially distant others more than toward socially close others. In short, whether the agent directs prescriptive (altruistic) or proscriptive (selfish) actions at a socially distant instead of a socially close other person, he/she would be evaluated as less morally acceptable. Based on these aforementioned studies, two questions needed to be further clarified. First, most studies investigated the effect of interpersonal relationships on moral evaluations from a third-party (i.e., uninvolved observer) perspective (Law et al., 2022; McManus et al., 2020). In these studies, participants were required to read vignettes describing hypothetical scenarios and then evaluate the moral (un)acceptability of the protagonists in the vignettes. To the best of our knowledge, little evidence has directly discussed the effect of the interpersonal relationship on moral evaluations from a second-party perspective (i.e., how participants evaluate the actor’s action when participants act as direct beneficiaries or victims of the consequences of the prescriptive or proscriptive action) (Xie et al., 2022). Unlike the third-party perspective, from a second-party perspective, people could involve the scenarios and experience the consequences of actions for real. In this way, the findings from the second-party perspective are convincing to substantiate prior studies. Second, most of the studies regarding the relationship between interpersonal relationships and moral evaluations employed self-report measurements (i.e., participants were asked to subjectively report the degree of moral acceptability of targets). However, individuals’ subjective ratings that offered explicit responses may be a controlled process, which cannot completely reflect their inner thoughts (Boudreau et al., 2009; Perry et al., 2013). For example, studies have shown that there might be a gap between explicit self-report and implicit (e.g., neural) responses because of potential interferences, such as social desirability responses bias (Cikara & Fiske, 2013; Kaneko et al., 2019; Randall & Fernandes, 1991; Zhang et al., 2021). The development of neuroscience has offered insights into the neural activities (i.e., implicit responses) underlying moral evaluations beyond observable behavior (Gui et al., 2016; Van Bavel et al., 2015). Therefore, we will assess neurophysiological processes during moral evaluations to also clarify the impact of interpersonal relationships on implicit responses that cannot be captured with self-report.

Accordingly, to address these two questions appropriately, we employed the “Social judgment in the modified version of Dictator Game (DG)” task (Park et al., 2020). In a classic DG, allocators have to decide how much, if any, of an endowment to give to another individual (Camerer, 2011; Charness & Haruvy, 2002). If the allocators gave money to recipients (especially those who started with less money than the allocator), they would be evaluated as being altruistic (Li et al., 2019). The major difference between the classic DG and modified DG is that in the modified DG, allocators could decide to take money from the other person. If the allocator took money from recipients for personal gain, they would be considered as selfish (Park et al., 2020). Corresponding to the first question, participants could play the role of recipient in the modified DG, and thus participants could evaluate the allocator’s behavior from the recipient’s perspective (or second-party perspective) (Wu et al., 2011; Xie et al., 2022). Corresponding to the second question, to capture neurophysiological measures of interpersonal relationships on moral evaluations, we employed the event-related potential (ERP) technique because ERP can measure neural activities with millisecond resolution and offer insights into the implicit responses, especially the neural dynamics of moral evaluations beyond observable behavior (Gui et al., 2016; Van Bavel et al., 2015). The use of ERP would be helpful in avoiding the potential influence of social desirability responses bias (Pletti et al., 2019; Zhang et al., 2021). For example, participants’ explicit negative evaluations of friends are socially undesirable and might be avoided in the self-report. In the current study, we designed two experiments. In Experiment 1, in aspects of morality, we used a self-report measurement to assess how participants evaluated when their friends (vs. strangers) conducted altruistic and selfish behavior toward participants themselves (vs. someone else). In Experiment 2, we aimed to replicate Experiment 1’s findings and further explored the neural underpinnings of interpersonal relationships on moral evaluations via ERPs.

Experiment 1

The goal of Experiment 1 was to investigate the effect of interpersonal relationships on moral evaluations from the second-party perspective via self-report. Specifically, as mentioned above, the altruistic (prescriptive) and proscriptive (selfish) behaviors were operationalized via the phrases “took money from” and “gave money to” the recipient in the modified DG. According to the definition of moral evaluation (Garrigan et al., 2016), we created a moral dilemma for allocators, in which allocators were forced to select one recipient to receive the consequence of given prescriptive or proscriptive behavior. Here, the participant (her-/himself) and the other person (stranger) were manipulated as the candidate recipients in DG. That is, allocators had to make a choice between the participant and a stranger to be the recipient of behavior’s consequence. Using this manipulation, participants could evaluate the allocator’s behavior from a second-party perspective. Importantly, to achieve the interpersonal relationship manipulation, the target person of the moral evaluation (i.e., the allocator in modified DG) was either the participant’s friend or a stranger. As the dependent variable of the described manipulations, we assessed participants’ moral (un)acceptability ratings in response to altruistic and selfish behavior conducted by the friend or a stranger, toward the participant oneself or another stranger.

As mentioned above, from a third-party perspective, whether the agent directs prescriptive (altruistic) or proscriptive (selfish) actions at a socially distant instead of socially close other, people would evaluate the behavior as less morally acceptable (Law et al., 2022; McManus et al., 2020). From a second-party perspective, people involved similar scenarios and experienced the consequences of actions for real. Therefore, we predicted that this effect of interpersonal relationships on moral evaluations would be similar or even more pronounced from a second-party perspective.

Participants

This study was approved by the Hunan Normal University ethics committee and was conducted in accordance with the Declaration of Helsinki (see Declaration of ethics section). We conducted a priori power analysis for a 2 (allocator) × 2 (allocator’s behavior) × 2 (recipient) within-subject design using G*Power 3.1.9 (F tests, analysis of variance [ANOVA]: repeated-measures, within factor, and the “power = 0.95,” “effect size f = 0.25,” “α = 0.05”). According to the result of this power analysis, 23 participants, at least, could ensure 95% statistical power in the case of small to medium effect sizes (Faul et al., 2007; Mayr et al., 2007; Vazire, 2015; Zhang et al., 2020). We used the same approach in the following experiment to determine our sample size. To reach the minimum sample size criteria (23), we recruited 41 participants (16 men, ages ranging from 19-27 years) from Hunan Normal University. All participants provided written, informed consent before the experiment and were paid at least Chinese Yuan (CNY) 30 for their participation after completing the task.

Cover story and task

The modified version of “Social judgment in DG” (Park et al., 2020) was adopted. As per Fig. 1, the participants (Player A) were informed that they would complete a modified DG with three other people, one of whom was their friend (Player B), and the other two were strangers (Players C and D who did not know each other) to their corresponding friends and themselves. In this task, each participant (Player A) and one stranger (Player D) occupied the role of candidate “recipient.” Each participant’s friend (Player B) and another stranger (Player C) were always assigned to the role of “allocator” (note that all roles were decided by drawing lots ostensibly, although the participant was always drawn to be assigned as the candidate recipient and his/her corresponding friend was always drawn to be assigned as the allocator via pre-programming). “Allocators” (i.e., Players B and C) were endowed with CNY 100 as their starting money, whereas “recipients” (i.e., Players A and D) were endowed with CNY 30Footnote 1. In each round, only one allocator (Players B or C) would be presented with a given behavior (give CNY 10 to or take CNY 10 from one recipient) that was randomly predetermined by a computer. Then, this allocator was required to select one recipient (forced two-option: Player A or D, i.e., moral dilemma for allocators) to receive the given behavior’s consequences in this round.

Fig. 1
figure 1

Interpersonal relationship between each player (including participants and corresponding friends, as well as two strangers) in the cover story. Full line indicates the interpersonal relationship between two people who were friends. Dotted line indicates the interpersonal relationship between two people who were strangers. Participants (Player A) and one stranger (Player D) were assigned to the role of candidate “Recipient” with a starting money of CNY 30, and the participant’s corresponding friend (Player B) and one other stranger (Player C) were assigned to the role of “Allocator” with a starting money of CNY 100. Unknown to participants, Players C and D were two undergraduates serving as confederates

Unbeknownst to the allocators, after observing the allocator’s behavior (gave money to or took money from the participant oneself or another stranger), similar to Rom and Conway (2018)’s measurement questions, participants were required to rate how morally (un)acceptable they had perceived the respective behavior with two different framing items: 1) “How morally acceptable do you evaluate this allocator’s behavior?” and 2) “How morally unacceptable do you evaluate this allocator’s behavior?” The participants rated these items on a 7-point scale (from “1 = not at all” to “7 = very much”). The first item was reverse-coded. These two items were averaged to determine the degree of moral (un)acceptability (r = 0.92). A higher rating score indicated that the allocator’s behavior was evaluated as being less morally acceptable (or, more morally unacceptable). Therefore, for participants, this evaluation task resulted in a 2 (allocator: friend vs. stranger) × 2 (allocator’s behavior: took money vs. gave money) × 2 (recipient: oneself vs. stranger) within-participant design.

Procedure

Based on an interpersonal relationship manipulation from our group (Li et al., 2020a, b), participants were required to bring a same-gender friend with them to participate in the experiment. The participants and friends had to have known each other for more than two years. Two same-gender strangers (who were played by research assistants employed by the laboratory; care was taken that participants and corresponding friends did not know them and were unfamiliar with their names) also took part in the experiment. Before the experiment, participants were informed that the roles of allocators and recipients were randomly determined by drawing lots, although the participants were always selected as the candidate recipients. Each participant also was informed that they and the other three individuals were assigned to separate rooms to complete the experiment together onlineFootnote 2.

After having been escorted to a separate room, to ensure that each participant was sufficiently close to their friend and distant from the other two strangers, they completed the Chinese version of the “Inclusion of Other in the Self” (IOS) scale (Tan et al., 2015), which is a pictorial measure of interpersonal closenessFootnote 3 developed by Aron et al. (1992). It served to assess the closeness between the self and target individuals (i.e., friends and two strangers: one stranger was assigned as the allocator, and the other was assigned as the recipient) on a 7-point pictorial scale from “1= very distant” to “7 = very close.” The assessment results confirmed that participants perceived that their interpersonal relationships were significantly closer with “friends” than with the two “strangers” of the current experiment. See Table S1 in supplementary materials for the participants’ perceived interpersonal relationships assessment.

In the formal experiment, each participant received the instruction and was required to repeatedly complete the moral evaluations task (rating how morally [un]acceptable the allocator’s behavior was) in a series of trials. As shown in Fig. 2, at the beginning of each trial, a fixation cross was presented against a black background for 500 ms. Thereafter, the participants viewed the allocator (friends or strangers)’s name (lasted for 2,000 ms) for this given trial. Then, the participants observed that this allocator gave CNY 10 to or took CNY 10 from oneself or a stranger (note that to reduce the cognitive burden of processing the stranger’s [i.e., Player D’s] name, we used the term “someone else” instead). This occurred for 2,000 ms. This was followed by two items assessing the degree of moral (un)acceptability successively (the presented sequence of two items was random across participants) without a time limit. After responding, an inter-trial interval (ITI) was shown for 1,000 ms.

Fig. 2
figure 2

Task sequence in a single trial in Experiment 1

To enhance the experiment’s validity, prior to the experiment, each participant was told that in addition to their base compensation for participation, some trials in this experiment would be randomly selected, and the allocators and recipients would gain or lose the amount of money based on the allocators’ behavior in these selected trials. In reality, there were no real allocators and the allocators’ actions were pre-programmed and randomly presented. The software package E-prime 3.0 (Psychological Software Tools, Pittsburgh, PA) was used for stimulus presentation and data collection. The entire task consisted of 320 trials, divided into eight blocks. The break between the blocks was determined by the participant’s self-pace, and the entire experiment took approximately 32-35 minutes.

After the experiment, we asked all participants about the cover story’s credibility using the question: “Do you believe that you have played the game with real people?” None of them expressed any suspicions. Finally, we told the participants that their friend’s actions were pre-programmed to ensure that this task did not influence their friendship quality.

Statistical analysis

Statistical analysis was performed using IBM SPSS Statistics (version 20.0; IBM Corp., Armonk, NY). Descriptive statistics are reported as the mean (M) and standard deviation (SD). A repeated-measures ANOVA 2 (allocator) × 2 (allocator’s behavior:) × 2 (recipient) was used to analyze the data. The statistical significance was set at p < 0.05. Partial eta-squared (ƞp2) was reported as a measure of the effect size of the ANOVA. This is where 0.05 represents a small effect, 0.1 represents a medium effect, and 0.2 represents a large effect (Cohen, 1973; Kirk, 1996). Post-hoc tests were corrected for multiple testing using the Bonferroni correction.

Results

A higher rating score indicated that the allocator’s behavior was evaluated as being less morally acceptable. For the moral (un)acceptability ratings, the main effect of recipient was significant, F (1,40) = 44.383, p < 0.001, ƞp2 = 0.526. This suggested that choosing the unknown other person as recipient was evaluated as being less morally acceptable (M = 4.344, SD = 1.061) than choosing the participant oneself (M = 3.381, SD = 1.020). The main effect of allocator’s behavior was also significant, F (1,40) = 125.401, p < 0.001, ƞp2 = 0.758, suggesting that participants evaluated “took money” (M = 4.572, SD = 1.00) as being less acceptable than “gave money” (M = 3.154, SD = 1.073). There was no significant main effect of allocator, F (1,40) = 3.627, p = 0.064, ƞp2 = 0.083. In addition to the main effects, significant interactions of allocator × recipient [F (1,40) = 31.981, p < 0.001, ƞp2 = 0.444], allocator × behavior [F (1,40) = 28.617, p < 0.001, ƞp2 = 0.417], and recipient × behavior [F (1,40) = 24.357, p < 0.001, ƞp2 = 0.378] were observed (see the simple effects analysis for two-way interactions in the supplementary materials).

Importantly, there was a significant interaction between allocator × recipient × behavior, F (1,40) = 18.947, p < 0.001, ƞp2 = 0.321. To explore this three-way interaction, we conducted a separate analysis for the “allocator_friend” and “allocator_stranger” conditions. The analysis for the “allocator_friend” condition yielded a significant interaction of recipient × behavior, F (1,40) = 34.931, p < 0.001, ƞp2 = 0.466. Therefore, when friends gave money to a recipient, the participants evaluated it as being less morally acceptable if the recipient was a stranger (Mrecipient_stranger = 4.010 vs. Mrecipient_oneself = 3.152), F (1,40) = 9.380, p = 0.004, ƞp2 = 0.190. When friends decided to take money from a recipient, participants evaluated it as being less morally acceptable when the recipient was a stranger (Mrecipient_stranger = 5.711 vs. Mrecipient_oneself = 3.045), F (1,40) = 62.81, p < 0.001, ƞp2 = 0.61. In contrast, in the case of the “allocator_stranger” condition, the interaction of recipient × behavior was not significant, F (1,40) = 2.128, p = 0.152, ƞp2 = 0.051 (Fig. 3a for the results).

Fig. 3
figure 3

Subjective ratings of moral unacceptability in Experiment 1 (a) and Experiment 2 (b). Note: Error bars indicate standard errors of the mean. *p < 0.05; **p < 0.01; ***p < 0.001

Summary

In accordance with our hypothesis that participants’ moral (un)acceptability ratings for altruistic (prescriptive) and selfish (proscriptive) behavior would be more sensitive to the identities of recipients in cases where the allocator was a friend, our results showed that the friend’s altruistic (prescriptive) and selfish (proscriptive) behavior toward someone else was evaluated as being less morally acceptable than when the friend showed altruistic and selfish behavior toward the participants themselves. However, in the condition where strangers were assigned as the allocator, participants’ moral (un)acceptability ratings for altruistic (prescriptive) and selfish (proscriptive) behaviors were not influenced the recipient identity (someone else vs. participant oneself). In Experiment 1, we have shown how interpersonal relationships affect moral evaluations from the second-party perspective by assessing the subjective ratings. However, little is known about how interpersonal relationships modulate neural responses of moral evaluations. Therefore, we explored the effect of interpersonal relationships on moral evaluations by means of ERP in Experiment 2.

Experiment 2

In Experiment 2, we aimed to replicate the behavioral findings of Experiment 1 and further extend them through their neurophysiological underpinnings. Specifically, ERPs were measured when participants observed the choice behavior of the allocators. Prior studies have demonstrated that several ERP components are reflective of outcome evaluation such as medial frontal negativity (MFN) and P3 (Donchin & Coles, 1988; Gehring & Willoughby, 2002; Miltner et al., 1997; Polich, 2007).

In particular, the MFN component is a negative frontocentral ERP deflection, which peaks at approximately 250-350 ms after the outcome’s onset. Its amplitude variation is assumed to reflect early and semi-automatic stimulus evaluation processes (Chen et al., 2018; Hajcak et al., 2006; Wu et al., 2011). The MFN amplitudes reflect the binary evaluation of positive versus negative outcomes (Hauser et al., 2014; Simons, 2010), such that the MFN is reported to be more negative in response to negative (unfavorable/unacceptable) than positive (favorable/acceptable) outcomes (Gu et al., 2011; Hajcak et al., 2006; Pornpattananangkul et al., 2017). Recently, studies have shown that the MFN is an indicator reflecting moral valence evaluations (Scheuble et al., 2021; Scheuble & Beauducel, 2020). For example, Scheuble et al. (2021) have found that the agent’s behavior of negative moral valence (i.e., conflict with moral norms) elicited a more negative MFN.

Following the MFN, the P3 component has been measured to index outcome evaluations at a later and more elaborate processing stage (Gray et al., 2004; Polich, 2007; Wu & Zhou, 2009). It is a positive ERP component that peaks at parietal electrodes along the midline between 350-600 ms after outcome onset. The P3 component is associated with the allocation of attentional resources and outcomes’ motivational salience (Donchin & Coles, 1988; Gray et al., 2004; Kleih et al., 2010; Polich, 2007; Wu & Zhou, 2009; Zhou et al., 2010). However, previous findings about outcome evaluations in P3 are partly inconsistent. For example, several studies have demonstrated that the P3 is larger for positive than for negative outcome (Hajcak et al., 2006; Yang et al., 2018). This suggests that positive outcome signals greater psychological significance and demand for more motivational or attentional resources. Conversely, other studies have shown a larger P3 for negative rather than positive outcomes (Olofsson et al., 2008). These studies suggest that a negative outcome receives preferential access to attentional resources. Moreover, some studies have claimed that the P3 encodes only the outcome magnitude (i.e., large vs. small), but not the outcome valence (i.e., positive vs. negative) (Leng & Zhou, 2010; Sato et al., 2005).

Based on the results in Experiment 1, we again expected to observe an effect of the interpersonal relationship on how the allocators’ choice will be morally evaluated. This effect of interpersonal relationship might also be reflected in neural responses to outcome evaluations. Regarding ERP results predictions, at the early and semiautomatic processing stage (mirrored in the MFN component), we expected that, compared with the “allocator_stranger” condition, in the case of the “allocator_friend” condition, whether following the altruistic or selfish behavior toward someone else (vs. participant oneself), it would evoke a more negative MFN component since a more negative MFN component is associated with a less morally acceptable experience (Scheuble & Beauducel, 2020). However, we had no directional hypothesis for the later responding stage reflected by P3 amplitude variation, because previous research has not reached a consensus about whether the P3 component is sensitive to outcome valence or not.

Participants

As mentioned in Experiment 1, the minimum sample size (n = 23) was calculated by G*Power 3.1.9. To reach this minimum sample size criteria, we recruited 36 participants (19 men, age range 18-25 years). All the participants were right-handed and had normal or corrected-to-normal vision. They all reported no history of traumatic brain injury, brain surgery, mental or neurological diseases, or color blindness. Each EEG participant was paid a remuneration of CNY 50 after completing the experiment. This experiment was conducted in accordance with the Declaration of Helsinki, and written, informed consent was obtained from all participants. The protocol was approved by the Institutional Review Board of Hunan Normal University, Department of Psychology. Notably, the data of two participants were excluded from analysis, as they had more than 30% rejected trials due to EEG artifacts. See the rejection criterion in past studies (Shafir et al., 2018; Sternberg et al., 2021). Finally, the remaining 34 participants’ data (18 men, age range 18-25 years) were included in the final behavioral and ERP analysis.

Procedure

The cover story and procedure of Experiment 2 was almost identical to that of Experiment 1. However, to adapt the task paradigm of Experiment 1 to be used in the ERP experiment, we slightly changed the experimental timeline, number of trials, and outcome presentation from Experiment 1. These changes were implemented to avoid potential confounding factors, such as the different complexity of Chinese characters,Footnote 4 which might cause EEG artifacts in Experiment 2, according to ERP studies investigating outcome evaluations (Hu et al., 2017a, b; Leng & Zhou, 2010; Zhang et al., 2021).

The IOS scale ratings showed that participants perceived interpersonal relationships as being closer with their “friends” than with the two “strangers.” Please see Table S2 for the assessment of participants’ interpersonal relationships. During the EEG data collection, participants were comfortably seated in an electromagnetic shielded room, approximately 75 cm in front of a computer monitor (19-inch LED screen, refresh rate: 60 Hz; resolution: 1,440 × 900 pixels). As shown in Fig. 4, at each trial’s beginning, a fixation cross was presented on a black background for 200 ms. Thereafter, the friend’s or stranger’s name was presented above the fixation cross for 1,000 ms to indicate who would be the “allocator” in this given trial. After a variable interval of 500-1,000 ms, the allocator’s behavior and corresponding recipients was displayed for 1,000 ms: specifically, two squares (1.9° × 1.9° of visual angle) with red and blue borders (red-bordered square represented the participant’s outcome, and blue-bordered square represented the stranger’s outcome) appeared on the left and right sides of the fixation point against the black background. If the allocator gave CNY 10 to the participant in this trial, “+10” would be presented in the red-bordered square in this trial, but if the allocator took CNY 10 from the participant, “−10” would be presented in the red-bordered square in this trial. Alternatively, if the allocator gave CNY 10 to someone else (i.e., another stranger), “+10” would be presented in the blue-bordered square, but if the allocator took CNY 10 from someone else, “−10” would be presented in the blue-bordered square. The two squares with blue and red borders’ positions were counterbalanced across the trials. Thereafter, a blank screen was displayed for 1,000 ms, followed by rating one item regarding moral acceptability (“How morally unacceptable do you evaluate this allocator’s behavior?”) on a 7-point scale (from “1 = not at all” to “7 = very much”), with their index finger (no time limit). Note that one item was removed due to high correlation between two items in Experiment 1 (Barrett et al., 2004; Hawes et al., 2013). The allocator’s behavior that was evaluated as being less morally acceptable would be rated as a higher score. After the rating, a blank screen lasted for 1,000 ms as ITI.

Fig. 4
figure 4

Task sequence in a single trial in Experiment 2 (this task procedure was adapted from the procedure in Experiment 1 to have a more reliable ERP measure). Note: ERPs were locked to the outcome presentation (marked with a red square)

The whole task consisted of 480 trials which were divided into eight blocks. Each outcome consisted of 60 trials. The participants were allowed to take a self-paced break between each block. Unlike Experiment 1, before the formal experiment, participants were given 16 practical trials with two trials per condition. They were thereafter asked whether they understood the experimental procedure. This especially focused on the implications of the different square border colors. The entire experiment took approximately 45-50 minutes.

EEG analysis

Please see Section 2.2 in the supplementary materials for details about EEG recording and analysis. After artifact rejection, no significant differences in artifact-free trials were observed across the experimental conditions (all Fs < 1, all ps > 0.05; Table S3). Mean MFN amplitudes were extracted between 280 and 340 ms, and mean P3 amplitudes were extracted between 350 and 500 ms based on visual inspection of the grand-averaged ERPs and based on prior studies (Hu et al., 2017a, b; Li, Liu, et al., 2020a; Luck & Gaspelin, 2017; Yang et al., 2018). In particular, regarding the MFN measurement, Williams et al. (2020) recently suggested that the mean amplitudes measure is the most appropriate method for feedback-related component quantification. According to the topographical distribution (Figs. 5b and 6b) and representative electrodes commonly used in previous studies (Liu et al., 2021; Severo et al., 2020; Wang et al., 2022), the MFN was calculated as the average of Fz and FCz electrode sites, and the P3 was calculated as the average of CPz and Pz electrode sites. The mean amplitude values were extracted and averaged for all the selected electrode sites per the ERP component. The statistical analysis method was identical to that used in Experiment 1.

Fig. 5.
figure 5

a Grand-average ERP waveforms at the Fz and FCz electrode sites. Grey bars highlight the time window of the MFN (280-340 ms). b Differences in MFN (MFN effect) voltage topographies between the conditions indicating “gave money to/took money from “someone else” and “gave money to/took money from participants” in case of “allocator_friend” trials and “allocator_stranger” conditions, separately. c MFN mean amplitudes (in μV) in each condition. Note: Error bars indicate standard errors of the mean. *p < 0.05; **p < 0.01; ***p < 0.001

Fig. 6
figure 6

a Grand-average ERP waveforms at the CPz and Pz electrode sites. Grey bars highlight the time window of the P3 (350-500 ms). b Topographies voltage distribution of P3 in each condition. c Bar graphs showing P3 mean amplitudes (in μV) in each condition. Note: Error bars indicate standard errors of the mean. *p < 0.05; **p < 0.01; ***p < 0.001

Neural-behavioral correlation

To explore the relationship between subjective ratings and neural activities, Pearson’s correlation (r, two-sided) analysis was calculated between subjective ratings and ERP components according to past literatures (Hu & Mai, 2021; Santesso et al., 2005).

Results

Subjective ratings of moral acceptability

Replicating the results in Experiment 1, a higher rating score indicated that the allocator’s behavior was evaluated as being less morally acceptable. The analysis of the moral acceptability degree ratings showed a significant main effect of recipient, F (1,33) = 52.097, p < 0.001, ƞp2 = 0.612, suggesting that compared with choosing the participant oneself as the recipients (M = 3.404, SD = 0.953), choosing someone else as the recipient was evaluated as being less acceptable (M = 4.218, SD = 1.053). The main effect of behavior was also observed, F (1,33) = 65.985, p < 0.001, ƞp2 = 0.667. This indicated that relative to “gave money” (M = 3.276, SD = 0.862), the participants evaluated “took money” (M = 4.346, SD = 1.144) as being less acceptable. The main effect of allocator was not observed, F (1,33) = 0.109, p = 0.744, ƞp2 = 0.003. Moreover, interactions of allocator × recipient [F (1,33) = 48.898, p < 0.001, ƞp2 = 0.597], and allocator × behavior [F (1,33) = 9.051, p = 0.005, ƞp2 = 0.215] were observed. The interaction of recipient × behavior approached significance, F (1,33) = 3.929, p = 0.056, ƞp2 = 0.106 (see the simple effects analysis for two-way interactions in the supplementary materials).

The three-way interaction of allocator × recipient × behavior was significant, F (1,33) = 5.553, p = 0.025, ƞp2 = 0.114. The follow-up analysis showed that the interaction of recipient × behavior in the “allocator_friend” condition was significant, F (1,33) = 7.122, p = 0.012, ƞp2 = 0.178, suggesting that when friends gave money to someone else instead of the participant oneself, the participants rated this as being less morally acceptable (Mrecipient_stranger = 4.027 vs. Mrecipient_oneself = 2.884), F (1,33) = 28.03, p < 0.001, ƞp2 = 0.459. When friends took money from someone else instead of the participant oneself, the participants experienced this as being less morally acceptable (Mrecipient_stranger = 5.221 vs. Mrecipient_oneself = 3.031), F (1,33) = 46.54, p < 0.001, ƞp2 = 0.585. In contrast, the interaction of recipient × behavior in the “allocator _stranger” condition was not significant, F (1,33) = 0.032, p = 0.860, ƞp2 = 0.001 (Fig. 3b). Thus, Experiment 2 showed the same pattern of behavioral results as Experiment 1.

ERP results

MFN (280-340 ms)

As demonstrated in Fig. 5, the main effect of the recipient was significant, F (1,33) = 15.137, p < 0.001, ƞp2 = 0.314. This suggests that the MFN component was more negative when the recipient was a stranger than the participant oneself (Mrecipient_stranger = 0.532 μV vs. Mrecipient_oneself = 1.507 μV). The main effect of behavior also was significant, F (1,33) = 7.572, p = 0.010, ƞp2 = 0.187, suggesting that “took money” evoked a more negative MFN amplitudes than “gave money” (Mselfish behavior = 0.617 μV vs. Maltruistic behavior = 1.421 μV). There was no main effect of allocator, F (1, 33) = 0.635, p = 0.431, ηp2 = 0.019. In addition, a significant interaction of allocator × recipient was observed [F (1,33) = 7.633, p = 0.009, ƞp2 = 0.188]. However, the allocator × behavior [F (1,33) = 0.718, p = 0.403, ƞp2 = 0.021], and recipient × behavior [F (1,33) = 0.991, p = 0.327, ƞp2 = 0.029] interactions were not significant (see the simple effects analysis for two-way interactions in the supplementary materials).

Critically, there was a three-way interaction of the allocator × recipient × behavior, F (1, 33) = 4.227, p = 0.048, ηp2 = 0.114. Separate analyses of MFN amplitudes for the “allocator_friend” and “allocator_stranger” conditions were conducted to explore this interaction: The interaction of the recipient × behavior in the “allocator_friend” condition was significant, F (1,33) = 5.853, p = 0.021, ƞp2 = 0.151. When friends decided to give money to someone else instead of the participants, the MFN was more negative (Mrecipient_stranger = 0.777 μV vs. Mrecipient_oneself = 1.733 μV), F (1,33) = 5.17, p = 0.030, ƞp2 = 0.135. When friends took money from someone else instead of the participants, the MFN was more negative (Mrecipient_stranger = −0.472 vs. Mrecipient_oneself = 1.680 μV), F (1,33) = 29.12, p < 0.001, ƞp2 = 0.469. Conversely, no significant interaction of the recipient × behavior in the “allocator_stranger” condition emerged, F (1,33) = 0.201, p = 0.657, ƞp2 = 0.006. This was reminiscent of the pattern of subjective ratings for moral unacceptability.

Albeit both friends’ altruistic and selfish behavior was sensitive to identities of recipients, the effect sizes were largely different. In order to check whether behavior types are differently weighed in different interpersonal relationship contexts, we further analyzed the MFN effect (i.e., subtracting the MFN induced by “recipient_oneself” condition from that induced by “recipient_stranger” condition) following the allocator’s altruistic and selfish behavior. A repeated-measures ANOVA 2 (allocator: friend vs. stranger) × 2 (allocator’s behavior: took vs. gave money) was created to analyze the MFN effect. The results demonstrated that the main effect of the allocator was significant, F (1,33) = 7.631, p = 0.009, ƞp2 = 0.188, demonstrating that the MFN effect in the condition of “allocator_friend” (M = −1.554 μV, SD = 2.385) was stronger than that in the condition of “allocator_stranger” (M = −0.395 μV, SD = 2.687). The main behavior was not significant, F (1,33) = 0.997, p = 0.325, ƞp2 = 0.029. Importantly, we found an interaction of allocator × behavior, F (1,33) = 4.215, p = 0.048, ƞp2 = 0.113. In the case of “allocator _friend” trials, the MFN effect for selfish behavior (M = −2.152 μV, SD = 2.324) was stronger than that for altruistic behavior (M = −0.957 μV, SD = 2.453), F (1,33) = 5.861, p = 0.021, ƞp2 = 0.124. However, in the case of “allocator_stranger” trials, the MFN effect following the selfish and altruistic behavior was not significant, F (1,33) = 0.204, p = 0.659, ƞp2 = 0.005.

P3 (350-500 ms)

As demonstrated in Fig. 6, the following significant main effects were observed: allocator [F (1,33) = 5.328, p = 0.027, ƞp2 = 0.139, suggesting that the P3 evoked by friends (M = 7.774 μV, SD = 3.373) was larger than that evoked by strangers (M = 7.321 μV, SD = 3.509)], recipient [F (1,33) = 37.606, p < 0.001, ƞp2 = 0.533, demonstrating that the P3 elicited by the participants (M = 8.100 μV, SD = 3.418) was larger than that evoked by someone else (M = 6.996 μV, SD = 3.461)], and allocator‘s behavior [F (1,33) = 21.569, p < 0.001, ƞp2 = 0.395, demonstrating that the P3 elicited by altruistic behavior (M = 8.079 μV, SD = 3.434) was larger than that evoked by selfish behavior (M = 7.017 μV, SD = 3.446)].

Moreover, the interactions of allocator × recipient [F (1,33) = 5.180, p = 0.029, ƞp2 = 0.136], and recipient × behavior [F (1,33) = 10.104, p = 0.003, ƞp2 = 0.234] were significant. First, a follow-up simple effects analysis was conducted to explore the interaction of allocator × recipient. This showed that when participants were the recipients, the P3 was sensitive to the identity of the allocator, F (1,33) = 8.910, p = 0.005, ƞp2 = 0.255; whereas when strangers were the recipients, the P3 was insensitive to the identity of the allocator, F (1,33) = 0.140, p = 0.713, ƞp2 = 0.002. Second, we explored the interaction of recipient × behavior and found that when the recipients were the participants, P3 was sensitive to the behavior, F (1,33) = 31.220, p < 0.001, ƞp2 = 0.345; whereas when the recipients were someone else, P3 was insensitive to the behavior, F (1,33) = 1.280, p = 0.267. However, the interactions of allocator × behavior [F (1,33) = 2.538, p = 0.121, ƞp2 = 0.071] and allocator × recipient × behavior [F (1,33) = 0.194, p = 0.663, ƞp2 = 0.006] were not significant.

Correlation analysis between MFN amplitudes and subjective ratings

Reminiscent of the similar interactions pattern between MFN amplitudes patterns and subjective ratings, a Pearson’s correlation analysis was conducted. This served to further explore the association between the ∆MFN to recipients (i.e., MFN evoked by recipients_stranger minus that which was evoked by recipients_oneself) and ∆subjective ratings to the recipients (subjective ratings for recipients_stranger minus ratings for recipients_oneself). This analysis was conducted in the “allocator_friend” and “allocator_stranger” conditions, respectively. The results revealed that in the condition where friends were assigned as allocators, the correlation was significant (r = −0.443, p = 0.009, 95% confidence interval [CI] [−0.656, −0.149]; Fig. 7). However, no correlation was observed in the “allocator_stranger” condition (r = 0.059, p = 0.741). In addition, the Steiger’s z test (Steiger, 1980) conducted to compare correlations within a sample indicated that the “allocator_friend” condition’s correlation was significantly different from the “allocator_stranger” condition’s correlation (z = −2.21, p < 0.05).

Fig. 7
figure 7

Scatter plots and regression lines of the ∆MFN responses to recipients (i.e., MFN mean values evoked by someone else minus those evoked by participants) as a function of the ∆moral unacceptability ratings of recipients (i.e., subjective ratings involving someone else minus subjective ratings involving participants) in the “allocator_friend” condition

Summary and discussion

First, the moral acceptability ratings in Experiment 2 replicated the results in Experiment 1. Second, the ERP results showed that interpersonal relationships modulated moral evaluations, as mirrored by MFN amplitude variation. Whether following friends’ altruistic or selfish behavior, the MFN elicited by behavior directed at someone else instead of participants was more negative. A more negative MFN amplitude is associated with an unfavorable or unacceptable outcome experience (Gangl et al., 2017; Hu & Mai, 2021; Li et al., 2021). Hence, from the neural perspective, friends’ proscriptive and prescriptive behavior directed to someone else (instead of participants) were processed as being more unacceptable and this effect occurred at the early processing stage. However, this effect did not emerge in the condition where strangers were assigned as the allocator. Furthermore, relative to friends’ altruistic behavior, the MFN effect was more negative following friends’ selfish behavior. It suggested that as compared with friends’ altruistic behavior, the participants were more sensitive to friends’ selfish behavior toward strangers. However, regarding the P3 component, we did not observe an effect of interpersonal relationships on the interaction allocators’ behavior × recipients, suggesting that the effect of interpersonal relationship on moral evaluations did not reflect on the P3 component.

Discussion

This study examined how people evaluated their friends’ (vs. strangers’) behavior in terms of prescriptive and proscriptive morality by providing moral (un)acceptability ratings (Experiment 1 and 2) and ERP (Experiment 2) evidence. Across two experiments, the moral acceptability ratings indicated that in the condition where participants’ friends were assigned as the allocators, the allocators’ altruistic and selfish behavior toward someone else were evaluated as being less morally acceptable than when the respective behavior was directed toward the participants themselves. By contrast, in the condition where strangers were allocators, the moral (un)acceptability rating for allocators’ behavior was not affected by the identities of recipients. In Experiment 2, the ERP results revealed that the interpersonal relationship influenced moral evaluations, as was mainly reflected in MFN amplitude variation.

Concerning the subjective ratings of moral acceptability, both Experiments 1 and 2 supported our predictions. From the recipient’s perspective, this finding was in accordance with previous research. These previous studies have indicated that an agent who conducts altruistic (prescriptive) and selfish (proscriptive) actions directed at a socially distant (vs. a socially close) individual one is evaluated as being less morally acceptable (Law et al., 2022; McManus et al., 2020).

With regard to the ERP results, we detailed the discussions at the early (i.e., mirrored by MFN) and later (i.e., mirrored by P3) outcome evaluation processing stages: at the earlier and semi-automatic processing stage, which is represented by the MFN (Gangl et al., 2017; Hu & Mai, 2021), the interaction between the allocators’ behavior × recipients was moderated by the allocator. Consistent with our prediction about the MFN, we found that relative to the condition where the allocator was a stranger, in the condition where the allocator was a friend, following altruistic and selfish behavior, the MFN was sensitive to recipients. To be more specific, when the friend’s altruistic (prescriptive) or selfish (proscriptive) behavior was directed toward someone else (vs. participant oneself), it elicited a more negative-going MFN. Combined with the correlation analysis, this finding repeatedly reflects that friend’s altruistic and selfish behavior toward someone else are less morally acceptable for participants. Interestingly, participants exhibited a stronger MFN effect following their friends’ selfish behavior relative to friends’ altruistic behavior. This suggested that compared with friends’ altruistic behavior directed at strangers, participants experience friends’ selfish behavior directed at strangers as being less morally acceptable. Although this finding was not predicted, it could be explained by previous evidence. Proscriptive (selfish) actions are heavily weighted in moral evaluations of others (Anderson et al., 2020; Van Bavel et al., 2015), and individuals think a morally acceptable individual should, at a minimum, not cause harm to an unknown person (Anderson et al., 2020; Zhan et al., 2020). People tend to think their friends to be morally good (Law et al., 2022; Moisuc & Brauer, 2019; Parker & Seal, 1996). Naturally, participants evaluated it as being less acceptable when friends did harm to others, especially to strangers. This could explain why, as compared with the friends’ altruistic behavior, a more pronounced MFN effect was generated following friends’ selfish behavior toward strangers. Moreover, it is worth noting that the current MFN finding is supported by previous findings. These findings also suggest that moral evaluations occurs at the earlier processing stage captured by ERPs (Decety & Cacioppo, 2012).

At the later and controlled responding stage represented by P3, we did not find a modulation of the allocator in the interaction of allocators’ behavior × recipients. The absent effect of interpersonal relationship on P3 amplitudes is supported by past evidence reporting that P3 amplitudes were not sensitive to interpersonal relationship manipulations during outcome evaluation (Li et al., 2021;Wu et al., 2011 ; Wu & Zhou, 2009). These studies have proposed as a possible explanation that the information regarding this may have already been coded by the preceding MFN. Therefore, the neural system does not recode it on the P3 (Wu et al., 2011; Wu & Zhou, 2009).

Taken together, from a second-party perspective, our findings support the relationship-regulation theory in moral psychology research which proposes that the moral evaluations about actions depends not only on the action types (i.e., altruistic/prescriptive or selfish/proscriptive) but also on the social-relational contexts in which these occur (Earp et al., 2021). The moral evaluations of behavior are contingent on the interpersonal relationship between the actors and receivers of behaviors.

Limitations and future directions

Some limitations should be considered when interpreting the results. First, the power analysis for the sample size was calculated for the within-subjects effects but not for the correlations. Hence, the correlation between MFN components and moral acceptability ratings may be underpowered. Second, the allocator and their behavior in this study were randomly decided, which were different from the everyday experiences and might affect the ecological validity. The future directions are as follows: first, we could further examine how participants respond if the other two strangers (i.e., Players C and D in this study) are manipulated as a pair of friends. Second, we only assigned the candidate recipients as participants or strangers. However, previous studies have suggested that people evaluate a friend more negatively when their friends behave more altruistically toward another friend (Barakzai & Shaw, 2018; Krems et al., 2021). In future studies, we can further explore the effects of other recipients’ identities.

Conclusions

The goal of this study was to examine the effect of interpersonal relationships on moral evaluations by providing moral acceptability ratings and electrophysiological evidence. The interpersonal relationship affected both the subjective ratings and MFN amplitude variation in moral evaluations. In summary, from the second-party perspective, individuals evaluated it as being less morally acceptable when their friends select the other unknown person (instead of the individual oneself) as the recipients of proscriptive and prescriptive behavior in a moral dilemma. This effect of interpersonal relationships on moral evaluations occurred at an earlier and semiautomatic processing stage in the brains.