1 Introduction

The Inclusion of Other in the Self (IOS) scale is one of the most popular measures of interpersonal closeness. It asks respondents to pick one of seven pairs of increasingly overlapping circles to indicate how close they feel toward another (Fig. 1a). Since its creation by Aron et al. (1992), the IOS scale has been repeatedly validated (Gächter et al., 2015) and widely adopted, with over 5000 citations on Google Scholar (see Aron et al., 2013; Branand et al., 2019, for reviews). In recent years it has spread to economics, for example to explain charitable donations (Goette & Tripodi, 2021), dictator game allocations (Robson, 2021), or team production (Gächter et al., 2023).

Fig. 1
figure 1

Four different IOS scales

We offer a continuous version of the IOS scale (Fig. 1b) that allows a finer measurement of interpersonal closeness. In the Continuous IOS scale, respondents click the left circle (representing self) and drag-and-drop it on the right circle (representing the other) at the point of overlap that best represents their relationship with the other. Additionally, it allows a greater degree of overlap than the original IOS scale, and would thus be able to represent a higher perceived closeness.

Our Continuous IOS scale has the design features set out by Aron et al. (1992) in their seminal paper. We find, however, that most existing implementations of the standard IOS scale do not. We thus offer, in addition to the Continuous IOS scale, a re-implementation of the standard IOS scale that does (Fig. 1c). As a middle-ground, we also offer the Step-Choice IOS scale (Fig. 1d) which displays only one pair of circles with arrows on the left and right. Respondents are instructed to find the pair of circles which best represents their connection with the other by clicking on the arrows to move back and forth between the different pairs. The Step-Choice IOS scale thus retains the discreteness of the standard IOS scale but allows the use of more pairs of circles.

In an online experiment, we validate the Continuous IOS scale and the Step-Choice IOS scale against our implementation of the standard IOS scale. We find that the standard IOS scale elicits higher IOS scores than the Continuous IOS scale. This mainly comes from subjects who select a minimal overlap on both scales. Subjects must select the second pair of circles in the standard IOS scale to report minimal overlap, while they can report minimal overlap by choosing an overlap very close to zero in the Continuous IOS scale. Our results thus suggest that subjects might be reluctant to report the lowest IOS score on the standard IOS scale. The Continuous IOS scale addresses this by offering subjects the opportunity to report a low level of overlap without having to select an overlap as large as the one that corresponds to the second pair of circles in the standard IOS scale.

Our IOS scales can be implemented in any web-based experimental software, such as Qualtrics, oTree (Chen et al., 2016), or LIONESS (Giamattei et al., 2020). They are available as ios.js at https://github.com/geoffreycastillo/ios-js with full documentation. We also offer detailed instructions for Qualtrics and an example app for oTree.

Our scales offer various options. For all scales, the size of the circles can be customised. For the Step-Choice and standard IOS scales, users can select the number of circles. We also offer unbalanced versions of these scales with twice as many pairs of overlapping circles in the first half of the scale. This option would be useful, as we will see below, in cases where a majority of IOS scores are concentrated at the lower end of the scale.

Users can also customise the labels of the circles—‘You’ and ‘Other’ in Fig. 1. By doing so, our IOS scales can be directly translated into the Inclusion of Ingroup in the Self (IIS) scale (Tropp & Wright, 2001). The IIS scale is an extension of the IOS scale to groups obtained by replacing the ‘Other’ label attached to the right circle by the ingroup. The IIS allows one to measure the distance between an individual and a group or even between two groups.

In the next section following our discussion of the related literature, we describe how we construct our IOS scales to have the Aron et al. (1992) features. In Sect. 3 we present the experiment we designed to validate our Continuous and Step-Choice IOS scales. Section 4 presents the results and Sect. 5 concludes.


Related literature. Other continuous implementations of the IOS scale exist. However, they use circles with a fixed diameter and thus do not have the features highlighted by Aron et al. (1992). Further, they do not validate their continuous versions of the IOS scale in an experiment. For example, Le et al. (2007) offer a version that no longer works on modern browsers because the technology they relied on—Java applets—is deprecated. Kamphorst et al. (2017) ported this version to JavaScript, the technology we also rely on. More recently, Kinley and van Vugt (2023) offer a version for jsPsych (de Leeuw et al., 2023).

Further, Baader et al. (2024) propose expanding the original IOS scale to 11 circles—thus their extension is named \(\hbox {IOS}_{11}\). They show that it outperforms the original, seven-circle IOS scale. Their extension introduces more circles in the first and in the final third of the scale. We have added an option to ios.js inspired by \(\hbox {IOS}_{11}\) that accomplishes this in the context of our scales.

As we hinted above, the IOS scale is becoming a popular tool in economics and has been used to show how social closeness impacts economic outcomes. For example, Goette and Tripodi (2021) find that social proximity as measured by the IOS scale drives social conformity, which influences how much money people give to an NGO. Hofmann et al. (2021) show that the presence of close others, as measured by the IOS scale, increases voluntary payments in a Pay-What-You-Want context. Dimant (2024) studies political polarisation and distinguishes between ingroup love and outgroup hate thanks in part to the IOS scale. Gächter et al. (2023) show that group cohesion, measured with an oneness scale that includes the IOS scale, increases team production as captured by a weak-link coordination game. We ourselves have already used the Continuous IOS scale and ios.js in a series of large online experiments to study social discounting (Beranek & Castillo, 2024).

Further, in a lab-in-the-field experiment in Uganda, Robson (2021) uses the IOS scale to show that social connectedness enters the utility function and affects decisions in a modified dictator game. Castillo (2021) uses the IOS scale to show how preferences defined over social distance depend on the task used to elicit them—a translation of the classical preference reversal phenomenon to the social domain. In both Robson (2021) and Castillo (2021), about half of the respondents report the same IOS scores for the two recipients they face. The finer measurement brought by the Continuous IOS scale would help in those cases where the different others evaluated by respondents are too similar to be discriminated by the original IOS scale.

2 Constructing the IOS scale

When creating the figures for the IOS scale, the two design features highlighted by Aron et al. (1992) were that: “(a) the total area of each figure is constant (thus as the overlap of the circles increases, so does the diameter), and (b) the degree of overlap progresses linearly, creating a seven-step, interval-level scale”. Feature (a) means that circle diameters increase as the distance between the circles decreases to keep constant the total area of self plus other. This feature ensures that no sense of self or other is lost when overlap increases, which aligns with Aron and Aron’s (1986) conception of relationships. Feature (b) ensures there are no jumps between any two pairs of circles, a necessary condition to generate a valid, linear, one-to-seven measure. Otherwise, some increases in overlap would be larger than others which might give rise to threshold effects.

Table 1 Total area and change in the overlap of IOS scales

We show on Table 1, however, that the original figures in Aron et al. (1992) do not have these features. That is the case for most, if not all, of the literature that followed which used either directly the Aron et al. (1992) figures, their own figures, or a series of overlapping circles with non-increasing diameters.

Finding a series of overlapping circles that have both of these features is a non-trivial problem without a closed-form solution. We rely on simulations (see Appendix A for details) to find, for every proportion of overlap, a corresponding circle diameter. These calculations allow us to create an IOS scale that has the features highlighted by Aron et al. (1992). We use these calculations to construct our version of the standard IOS scale as well as our new Step-Choice and Continuous IOS scales, which we test in the following experiment.

3 Experimental design

In our experiment, we test whether respondents choose the same overlap regardless of the version of our IOS scales they use. More specifically, within-subjects respondents indicate how close they feel toward another using our standard IOS scale and either our Continuous or our Step-Choice IOS scales. The order of the IOS scales is randomised between-subjects. We add a filler task between the IOS scales: we ask subjects to solve 10 mathematics problems involving the addition and subtraction of three-digit numbers under time pressure.

As highlighted by Kinley and van Vugt (2023), a difference between the standard IOS scale and IOS scales that show only one pair of circles at a time—such as our Continuous and Step-Choice IOS scales—is that in the latter subjects are presented with an initial “default” degree of overlap as opposed to seeing all possible degrees of overlap. For this reason, we require subjects to manipulate the IOS scale from no overlap to full overlap before they can proceed to the task itself (see Figs. 11 and 13 of the instructions in Appendix B). This also ensures our JavaScript code works as intended on subjects’ browsers; if it did not, subjects could not proceed beyond this intentional point of failure.

In contrast to most of the literature, we use real people as the target of the IOS scales as opposed to hypothetical ‘others’. To do so, we first surveyed members of the US general public on MTurk forming a diverse pool of potential targets. We asked them a number of questions, mostly standard demographic questions as well as some questions about their opinions on various social issues (see Supplementary material for a list of questions used). Thereafter, we invited the subjects for the experiment reported here and asked them the same questions. We paired them with a target drawn at random from the pool of the originally surveyed participants.

Fig. 2
figure 2

Card display used in the experiment

We present the target—the ‘other’ participant in the IOS scale—using the card display shown in Fig. 2. The card suit is assigned randomly to the target and used to refer to them throughout the experiment. To make sure subjects engage with their assigned target, we ask them to write the first things that come to their mind when they read the card in at least 25 characters.

The advantage of this procedure is that we can assess whether the IOS score decreases with the demographic dissimilarity between subjects and their target. We can also assess how subjects’ characteristics influence the IOS scores they report.

The experiment took place in November 2020. A session lasted about 13 min and the average payment was $1.43. The payment was composed of a fixed $0.50 participation fee and $0.10 for each mathematics problem correctly solved. We analyse below the choices of 644 participants: 328 who evaluate their target using the standard and the Continuous IOS scales and 316 who use the standard and the Step-Choice IOS scales.Footnote 1 Full instructions for the tasks used in the experiment can be found in Appendix B.

4 Results

4.1 Comparing the Continuous and Step-Choice IOS scales to the standard IOS scale

We consider our new IOS scales—Step-Choice IOS scale, Continuous IOS scale—to be validated against our implementation of the standard IOS scale if there is no difference between the scores reported by subjects when they use the new IOS scale or the standard IOS scale. The scores generated by the standard IOS scale and the Step-Choice IOS scale are discrete variables ranging from one (representing no overlap) to seven (representing substantial overlap). In contrast, the score generated by the Continuous IOS scale is a continuous variable ranging between zero and one. To allow comparison, we first convert the overlap given by the Continuous IOS scale into a one-to-seven measure.Footnote 2 We then subtract the score obtained with the new IOS scale from the score obtained with the standard IOS scale. A positive difference means that subjects reported a higher IOS score when using the standard IOS scale.

Fig. 3
figure 3

Average difference between the IOS scores reported with the standard IOS scale and the Continuous or the Step-Choice IOS scale (with bootstrapped 95% confidence intervals)

In Fig. 3, we plot the average of this difference. We start by comparing the Continuous IOS scale to the standard IOS scale in the top part of the Figure. We see that the standard IOS scale leads to higher IOS scores than the Continuous IOS scale: the difference is small but positive and significant (Wilcoxon matched-pairs signed-ranks test, \(z = 4.943\) and \(p < 0.001\)).Footnote 3 Therefore, we reject the null hypothesis that subjects report the same IOS score on both scales. Hence our first result:

Result 1

The standard IOS scale elicits higher scores than the Continuous IOS scale.

The bottom part of Fig. 3 compares the standard IOS scale to the Step-Choice IOS scale. As can be seen, there is no difference between these two scales: the difference is not statistically different from 0 (Wilcoxon matched-pairs signed-ranks test, \(z = -0.810\) and \(p = 0.4135\)).Footnote 4 Therefore, we cannot reject the null hypothesis that subjects report the same IOS score on both scales. Hence our second result:

Result 2

We do not detect a difference between the scores elicited from the standard IOS scale and the Step-Choice IOS scale.

Fig. 4
figure 4

Relation between the IOS scores reported with the standard IOS scale and those reported with the Continuous IOS scale

4.2 Why does the standard IOS scale result in higher IOS scores than the Continuous IOS scale?

To better understand why the standard IOS scale yields higher IOS scores compared to the Continuous IOS scale, Fig. 4 displays the IOS score reported with the Continuous IOS scale as a function of the IOS score reported with the standard IOS scale. Each dot represents a subject. There are two notables features. First, Continuous IOS scores are dispersed around their corresponding standard IOS scores. Second, when considering a specific standard IOS score, the Continuous IOS scores tend to skew towards lower values.Footnote 5

The dispersion can be explained by random errors resulting from asking subjects to report the IOS score twice in a row. Continuous IOS scores are mostly the same as the corresponding standard IOS scores or in the adjacent categories. For example, 25 of the 66 subjects (37.88%) who report a standard IOS score of 3 report the exact same score on the Continuous IOS scale. An additional 36 subjects report scores in the categories adjacent, with 25 subjects (37.88%) reporting a score of 2, and 11 subjects (16.67%), a score of 4. Only 5 subjects (7.58%) report scores beyond the adjacent categories. (See Appendix C for the detailed frequency tables.)Footnote 6

The skew of the Continuous IOS scores towards lower values is the reason the average difference in Fig. 3—standard IOS score minus Continuous IOS score—is positive. It is particularly apparent in Fig. 4 by focusing on subjects who report a Continuous IOS score of 1: many of these subjects report a standard IOS score of 2. In fact, when we remove these 54 subjects, the average difference between standard and Continuous IOS scale is no longer significant (Wilcoxon matched-pairs signed-ranks test, \(z = 1.018\) and \(p = 0.3095\)).

This pattern could be explained by the reluctance of subjects to report the lowest IOS score with the standard IOS scale. In the standard IOS scale, subjects who want to report some connection with their match, at least more connection than no overlap would imply, are forced to pick the second pair of circles. On the other hand, the Continuous IOS scale allows them to report at least some overlap; while at the same time, this overlap falls short of an overlap corresponding to an IOS score of 2. As would be predicted by this explanation, we observe some bunching below the threshold that corresponds to an IOS score of 2.

The fact that subjects avoid reporting an IOS score of 1 with the standard IOS scale could also be a manifestation of the compromise effect (see, for example, Beauchamp et al., 2020, in the context of risk preference elicitation). According to the compromise effect, people want to avoid extremes and prefer to choose options that are more in the middle. In the standard IOS scale, avoiding the low extreme means reporting an IOS score of 2. In the Continuous IOS scale, avoiding the low extreme means reporting a small degree of overlap that might not correspond to an IOS score of 2.

To mitigate this issue in the standard IOS scale, we propose adding intermediate pairs of overlapping circles in-between the first half of the circles to create an unbalanced, standard or Step-Choice IOS scale. This addition would give subjects more opportunity to report small levels of overlap. For example, the unbalanced version of the standard IOS scale now has 10 pairs of overlapping circles: the original 7 with extra pairs of overlapping circles between 1 and 2, between 2 and 3, and between 3 and 4. We have implemented this option in ios.js.

4.3 The effect of demographic dissimilarity and of demographic characteristics on IOS scores

We conclude with a number of exploratory, non-pre-registered analyses. We focus on the standard IOS scale since we have the most observations for this scale. We first look at how demographic dissimilarity between a subject and their target influence the IOS score they report. For us, demographic dissimilarity measures the proportion of discordant responses given to the questions displayed on the card shown in Fig. 2. A dissimilarity of 0 refers to a subject who answers exactly as their target to those questions, and a dissimilarity of 1, to a subject who answers as differently as possible.

Model (1) in Table 2 reports the result of the corresponding ordered logistic regression. We find that, as dissimilarity increases, interpersonal closeness as indicated by the IOS score decreases. Therefore, the IOS scale captures something tangible and measurable.

Then, in models (2) and (3) in Table 2, we look at how a subject’s own demographic characteristics influence the IOS score they report. We find that those who do part-time work, who have only completed 12th grade without a degree or less, or who support a political party different from the main parties, report lower IOS scores. This finding suggests that one needs a minimal level of stability to start feeling connected to others. Further, people who think others are trustworthy and people who belong to a labour union report higher IOS scores. The results are unchanged if we leave dissimilarity out (in model 2) or include it (in model 3).Footnote 7

Some associations are more difficult to interpret: for example, people who approve of sex before marriage report lower IOS scores. Others should be interpreted with care: for example, people who report their race as Chinese or Korean report lower IOS scores. However, there are not many people in our sample who reported their race as Chinese or Korean and, consequently, they are less likely to be matched with someone who reported the same race.

Table 2 How dissimilarity and respondents’ characteristics explain the reported IOS score, ordered logistic regression

5 Conclusion

In summary, we offer a new implementation of the standard IOS scale that has the features highlighted by Aron et al. (1992). We also offer a new version of the IOS scale—the Continuous IOS scale—that allows for the selection of any degree of overlap. As an intermediary, we propose the Step-Choice IOS scale. We validate our new IOS scales in an experiment, where we find that the standard IOS scale results in higher IOS scores than the Continuous IOS scale. We interpret this difference as a reluctance of some subjects to select the lowest IOS score corresponding to no overlap in the standard IOS scale. We also find that IOS scores decrease with demographic dissimilarity between subject and target.