1 Introduction

Demographic and cultural diversity is on the rise in many organizations. Labor forces diversify due to immigration; cultural minorities as well as women increasingly move upwards in occupational status; economic globalization gives rise to multi-national organizations; and pressures for more interdisciplinary work, especially in R&D and scientific research, increase disciplinary diversity in research teams (Meyer et al. 2014). Diversity can be an important asset for the performance of teams, especially when it comes to group tasks that require the combination of diverse sets of knowledge, skills, and experiences (Ellemers and Rink 2016).

Yet, diversity also has been characterized as a “double-edged sword” (Milliken and Martins 1996) “reducing social cohesion and increasing relationship conflict on one hand, and enhancing creativity and innovation on the other” (Carter and Phillips 2017, p. 1). Stereotypes and negative attitudes towards demographic or cultural outgroups have been found to fuel relational conflicts in a diverse team (Bowers et al. 2000; van Dijk et al. 2017; Van Knippenberg and Schippers 2007; Milliken and Martins 1996; Pelled 1996; Shemla et al. 2016; Stewart 2006; Webber and Donahue 2001; Williams and O’Reilly 1998). In addition, homophily, the well-documented tendency of people to preferentially link with similar others in informal networks (McPherson et al. 2001), can lead to between-group segregation of interpersonal relations in teams (Lau and Murnighan 1998; Reagans 2011), hampering the sharing and integration of the diverse pieces of knowledge and skills needed to master complex group tasks.

The notion that diversity may be a double-edged sword points to a complex process in which the effects on team-performance hinge on the interplay of multiple competing dynamics and contextual conditions. To unravel the complex social dynamics shaping consensus, cohesion or disagreement in organizations, researchers have employed the analytical power of computational modelling (Anzola et al. 2017; Harrison and Carroll 2002; Rouchier et al. 2014; Secchi and Gullekson 2016; Wang et al. 2017). A series of studies (Flache and Mäs 2008a, b; Fu and Zhang 2016; Grow and Flache 2011; Liu et al. 2015; Mäs et al. 2013; Mäs and Flache 2013; Pinasco et al. 2017; La Rocca et al. 2014) has focused in particular on formalizing and refining the theory of “demographic faultlines” (Lau and Murnighan 1998, 2005), highlighting the potential dangers of diversity for social cohesion that can arise from a `strong demographic Faultline’. “Group faultlines increase in strength as more attributes are highly correlated, reducing the number and increasing the homogeneity of resulting subgroups” (Lau and Murnighan 1998, p. 328). The core argument is that a strong faultline creates prominent subgroup distinctions, which may give rise to a ‘group-split’. In the wake of Lau and Murnighan’s seminal contribution, a range of empirical studies identified moderating conditions for this effect of faultline strength (Carter and Phillips 2017, p. 5; Leslie 2017). However, there is disagreement with regard to the theoretical assumptions explaining faultline effects, in that existing formal models are based on competing theoretical assumptions and generate opposing predictions about the conditions under which faultlines matter.

Our paper contributes to the literature by deepening the analysis of existing computational models of strong faultlines in teams (Feliciani et al. 2017; Fu and Zhang 2016; Mäs et al. 2013; Mäs and Flache 2013). In these models, authors translated the informal theory proposed by Lau and Murnighan into a computational model, drawing closely on their central psychological assumptions of (i) persuasive-argument communication and (ii) homophily. First, Lau and Murnighan assumed that individuals exert influence on each other’s opinion by communicating persuasive arguments that are in favor of or opposed to a given position on an issue (Myers and Lamm 1976; Vinokur and Burnstein 1978). In an interdisciplinary team of social scientists and computational modelers studying diverse organizations, for example, computer scientists may try to persuade their colleagues to develop a formal model by making arguments for the analytical precision of a computational theory, while social scientists may express counterarguments pointing to the danger of oversimplifying a theory through formalization. This argument communication can entail reinforcing influence, as actors with more similar opinions are likely to reinforce each other’s prevailing opinion tendency. As a consequence, a bi-polarized opinion division arises aligned with the demographic (i.e. disciplinary) team division.

The persuasive-argument model of faultline dynamics has implications that are intriguing for researchers of diversity in teams. First, it identifies new conditions under which faultlines have the effects predicted by informal theorizing (Mäs et al. 2013). Second, the persuasive-argument model offers a formal theoretical alternative compared to an earlier approach (Flache and Mäs 2008a, b; Grow and Flache 2011). Instead of persuasive argument communication, this previous work modelled the effects of faultlines assuming negative or, “repulsive” influence—that is, the tendency to increase one’s opinion difference from the opinion of the outgroup (Flache et al. 2017). This assumption has recently been challenged in experimental research (von Hohenberg et al. 2017; Mäs and Flache 2013; Takács et al. 2016), which has raised modelers’ interest in alternative theoretical accounts of faultline dynamics in teams.

While existing modeling work adopted the assumptions of persuasive-argument communication and homophily from Lau and Murnighan’s informal theory, these two assumptions can be interpreted and formally implemented in various ways. Accordingly, in this paper we ask the theoretical questions: do the central predictions of these models depend on the exact formal elaborations of the micro-processes of (1) argument communication and (2) homophily? And if so, how? In Sect. 2, we review the existing modeling literature and identify two dimensions of variation between existing modeling approaches. In Sect. 3, we formalize the competing modeling approaches and discuss possible implications for model dynamics. Section 4 presents results from computational experiments testing how the different model versions affect bi-polarization between subgroups. Possible implications for future research on diverse teams are discussed in the concluding section.

2 Existing models of persuasive argument communication under a strong faultline

2.1 Reinforcing influence and homophily: dynamics of group split

The mechanism generating bi-polarization in models of persuasive argument communication can more generally be described as “reinforcing influence”. Under reinforcing influence, communicating individuals who hold a similar opinion reinforce each other’s opinion and jointly become more extreme in their views. Different formalizations of micro-processes have been proposed that can entail reinforcing influence such as “biased assimilation” (Dandekar, Goel, and Lee 2013) or social learning from approval of opinions by relevant peers (Banisch 2010; Banisch and Olbrich 2017; Mäs and Flache 2013). In the study of faultline dynamics in teams, modelers (Flache and Mäs 2008a, b; Fu and Zhang 2016; Grow and Flache 2011; Liu et al. 2015; Mäs et al. 2013; Mäs and Flache 2013; Pinasco et al. 2017; La Rocca et al. 2014) have closely followed persuasive-argument theory (Myers 1978; Vinokur and Burnstein 1978), according to which individuals base their opinion about an issue on the relevant arguments they possess about that issue, and influence each other when they communicate these arguments. On the one hand, argument communication between two interacting individuals reduces opinion disagreement between them, as the communication of arguments increases the similarity between the sets of arguments on which they build their opinion. On the other hand, when individuals with similar opinions communicate arguments, they likely expose each other to new arguments supporting their current opinion. In this case, reinforcing influence shifts the opinions of both actors towards more extreme views, a process that can aggregate to ‘extreme consensus’, the emergence of consensus on an extreme opinion in a group. Social psychological studies of extreme consensus in what has been called “group polarization” (Myers 1982) have provided consistent empirical evidence in support of the persuasive argument theory as explanation for this outcome (for a review see Isenberg 1986).

Extreme consensus is fundamentally different from a group split that divides a team with a strong faultline into subgroups with strong mutual disagreement. However, following the informal reasoning of Lau and Murnighan (1998, 2005), the computational models discussed above showed how persuasive argument theory can provide an explanation also for group splits, if reinforcing influence is accompanied by homophily, the tendency of individuals to interact more likely with more similar individuals (McPherson et al. 2001). Homophily fosters influence between individuals who are demographically similar or hold similar views; and it discourages influence between dissimilar individuals. Homophily fosters group splits especially when a strong faultine is accompanied by initial “congruency” (Mäs and Flache 2013), a tendency of actors with the same fixed attributes, like gender or ethnicity, to hold similar opinions even prior to persuasive communication (Phillips 2003; Phillips et al. 2004). In this case, homophily decreases chances that individuals are exposed to arguments that contradict their prevalent conviction. Instead, in a team with a strong faultline and initial congruency individuals are mainly exposed to arguments that support their views and reinforce their opinions. As this happens simultaneously on both ends of the opinion spectrum, a divide can grow in groups with a strong demographic faultline. Homophily makes interactions between members of the different emergent opinion groups increasingly unlikely, inducing even more reinforcing influence of like-minded others. To the extreme, this generates perfect opinion bi-polarization, an outcome in which a group falls apart into subgroups with maximal disagreement between and maximal agreement within subgroups (Duclos et al. 2004; Flache and Macy 2011; Flache and Mäs 2008b), eventually resulting in a group-split aligned with the demographic faultline.

Reinforcing influence and homophily have been shown to be conducive to bi-polarization in a team with a strong faultline. However, both the process of reinforcing influence via argument communication as well as the exact way how homophily moderates social influence allow for several “degrees of freedom” in their theoretical conceptualization and formal implementation. In the following we discuss these degrees of freedom and their relation to the broader literature on modelling social influence processes.

2.2 Difference in the conceptualization of the argument-communication: explicit vs implicit variants

A controversial methodological debate in the agent-based modelling literature addresses the question how cognitively realistic agents should be. Some scholars call for “open[ing] the ‘black box’ of individual cognition” (Conte and Giardini 2016), arguing that modelers should identify and explicitly model the psychological mechanisms underlying social-influence processes. Others defend a more parsimonious, abstract definition of how to model behavioral micro-processes, arguing that cognitive realism can be progressively added to a minimalistic simple model until sufficient realism is met (Lindenberg 1992).

Reflecting these competing approaches, existing models of argument communication differ in the sophistication of the formal representation of arguments. On the one hand, sophisticated models represent arguments explicitly, assuming that an actor’s opinion is a function of the arguments she considers relevant and that arguments are being shared with communication partners. Actors who are exposed to an argument they did not consider before adjust their opinions according to the argument. On the other hand, there are more parsimonious models that model arguments implicitly. That is, these models do not represent arguments but make assumptions about how communication would have adjusted their opinions had they communicated arguments.

2.3 Difference in the conceptualization of homophily: likelihood or effectiveness of interaction

An important conception of homophily in the sociological literature is that people more likely interact and communicate with similar others (Lazarsfeld and Merton 1954; McPherson et al. 2001; Wimmer and Lewis 2010). This form of homophily may be caused by the preference that “likes attract” (Byrne 1971) and thus reflect the outcomes of a choice people make among available interaction partners, but it can also result from structural patterns of social interaction that systematically sort similar people into similar “foci” (Feld 1982) where they meet and interact, like schools, neighborhoods, or workplaces. In both cases, the reason that more similar people influence each other more is that they interact more frequently than less similar people do. The view that similarity increases likelihood of interaction resonates in how homophily is conceptualized in large number of computational models of social influence (Axelrod 1997; Baldassarri and Bearman 2007; Chen et al. 2013; Dandekar et al. 2013; Mark 2003). In all of these models, actors select an interaction partner from a set of available agents such that more similar agents are selected with a higher probability.

An alternative conceptualization of homophily builds on the notion that similar people influence each other more effectively, because individuals are more open to arguments communicated by similar others. People can make more sense of input from sources with whom they have more in common. Furthermore, people might trust similar others more that dissimilar people would (Mark 1998). This view on homophily is likewise reflected in formal models of social influence where, for example, the similarity between two agents is expressed in terms of a weight that scales how much the opinion of a source of influence is taken into account for opinion changes of the target of influence (Deffuant et al. 2000; Duggins 2017; Flache and Macy 2011; Flache and Mäs 2008b; Hegselmann and Krause 2002; Kitts 2006; Kurahashi-Nakamura et al. 2016; Mäs et al. 2010).

While both competing conceptualizations of homophily have been adopted in a variety of formal models of social influence processes, it remains unclear how exactly variation between them affect the outcomes of formal models of faultline dynamics.

3 The model

In this section we introduce a generic formal and computational model which embeds different combinations of the degrees of freedom discussed above. Specifically, two of these combinations represent earlier formal models of faultline dynamics that will be systematically compared for the first time in the present study. One combination is to model reinforcing influence with explicit communication of arguments, and implement homophily via the likelihood of selection of interaction partners, hereafter referred to as X–S model. The X–S model has been adopted in earlier work for example by (Mäs et al. 2013; Mäs and Bischofberger 2015; Mäs and Flache 2013). Another combination, differing in both dimensions from the X–S model, is that reinforcing influence builds on implicitly represented arguments and homophily affects the effectiveness of arguments communicated, but not the choice of interaction partners, hereafter called the I–E model (adopted by Feliciani et al. (2017) in earlier work). Finally, we introduce with our framework a new model of persuasive argument influence differing from both the X–S and the I–E model in one of the two degrees of freedom. This is the model in which the communication of arguments is modeled implicitly like in I–E, but homophily is implemented via selection of interaction partners like in X–S, called I–SFootnote 1.

Box 1 provides the pseudo-code of the ABM; the code itself can be found in a public GitHub repository (https://github.com/thomasfeliciani/persuasive_argument_model_NetLogo). Table 1 (in Sect. 4.1) contains an overview of the variables presented.

Box 1
figure a

ABM pseudo-code

3.1 General modeling framework

We assume a population of N agents in all three models. Typically, we assume N = 10, approximating the size of teams in many organizations, but we will also explore effects of a bigger population size. Agents have two main attributes: their group identity, and their opinion on an issue. A maximally strong faultline is implemented in terms of a dichotomous group identity gi ∈ {− 1, + 1} that is a fixed attribute randomly assigned to every agent i at the outsetFootnote 2. We assume that the two groups have equal size: exactly half of the population belongs to group − 1, and the other half to group + 1.

The opinion of an agent i at time point t is denoted oi,t, and is a continuous variable in the range [− 1, + 1]. Opinions are an aggregation of the positive (‘pro’) and negative (‘con’) arguments an agent considers relevant. More precisely, pro arguments are in favor of oi,t= + 1 and con argument support the opinion oi,t= − 1. At the outset of the simulation, agents’ opinion is initialized as follows. Agents hold S arguments (S is a model parameter, set to 4 by default). When arguments are represented implicitly, the initial arguments are only used to induce an initial opinion, otherwise they are explicitly assigned to agents’ memories. The model further contains a parameter w for the degree of congruency between the demographic attribute and the opinion. Specifically, for each of the S memory “slots”, agents with gi= 1 will receive a positive argument with probability wFootnote 3, and a negative argument otherwise. Conversely, the other group (gi= − 1) will receive a negative argument with probability w, and a positive argument otherwiseFootnote 4. Finally, opinions are calculated as the number of positive arguments over S, scaled to range from − 1 and + 1. Formally, if we define Pi,t as the number of pro arguments held by agent i, then:

$$o_{i,t} = 2 \cdot \frac{{P_{i,t} }}{S} - 1$$
(1)

This implies that parameter S also defines how much an argument can impact the opinion of an agent. For S = 4, for example, an agent can only know 4 arguments, and every argument accounts for one quarter of the agent’s opinion.

A congruency w = 0.5 yields an opinion distribution without any correlation between group and opinion; for 0.5 < w ≤ 1, higher values of w produce stronger correlation between group and initial opinion.

After the initialization, time elapses in discrete steps. For each time step t, all agents are selected for initiating an interaction. The sequence in which they are selected is randomly shuffled at the beginning of each time step. When agent i is selected for this, we first select an interaction partner j, and then simulate the interaction between j and i.

How interaction partners are selected (i.e. the implementation of homophily), and how interactions take place (the argument-communication mechanics) are the two main differences between the X–S and the I–E model versions, which we describe in the following two sections.

3.2 Difference in the formalization of the argument-communication

3.2.1 Explicit argument-communication

Under explicit argument communication, we assume that the pool of available arguments consists of 10 pro and 10 con arguments. To mirror the limits in human capability to retain and process information (Cowan 2001; Miller 1956), the X–S assumes that, at any point in time, agents can only memorize a subset of S arguments. Furthermore, the memory vector allows to implement the recency (or salience) of an argument: the first argument of the vector is the most recent—the last, the least. The underlying assumption is that recently acquired arguments linger in agents’ memory for longerFootnote 5 (Mäs et al. 2013).

During interaction, agent j communicates one of the arguments she considers salient to i. An argument from j’s memory is picked at random and then becomes the first and thus most recent argument in i’s memory. Due to the limited memory size S, agent i also drops the most dated argumentFootnote 6. If the argument communicated by j is already present in i’s memory (read: if i already considers it), then it shifts from the current location in i’s memory to the first position. A known argument, if encountered again, thus becomes more recent and remains salient longer. The interaction event is then terminated by updating the opinion of agent i as defined in Eq. 1.

3.2.2 Implicit argument-communication

Under implicit argument-communication, the interaction is simplified by mimicking only the opinion change induced by argument communication, without actually representing the arguments. To achieve this, the probabilities are calculated that an explicit argument communication would result in each of the possible outcomes of shifting the opinion upwards, downwards or not at all on the opinion scale. First, we determine the likelihood that agent j would communicate a pro argument according to Eq. 2. With one minus this probability, j would communicate a con argument to her communication partner i.

$${\text{Probability of}}\,j\,\,{\text{communicating a pro argument}} = \frac{1}{2} \cdot \left( {o_{j,t} + 1} \right)$$
(2)

Next, it is determined how likely i drops a pro or con argument at the end of the interaction. Like in Eq. 2, the probability that i drops a pro argument is:

$${\text{Probability of}}\,i\,\,{\text{dropping a pro argument}} = \frac{1}{2} \cdot \left( {o_{i,t} + 1} \right)$$
(3)

Based on these two probabilities, the algorithm of implicit argument-communication conducts a random experiment that selects a combination of the two events “j communicates a pro or a con argument” and “i drops a pro or con argument” as outcome of the interaction. Next, we compute the resulting shift of i’s opinion as follows. The magnitude of the opinion adjustment ai,t is a function of S, as S determines how much a new argument can affect an agent’s opinion:

$$a_{{i,t}} = \left\{ {\begin{array}{ll} {{2 \mathord{\left/{\vphantom {2 S}} \right. \kern-\nulldelimiterspace} S},} & {{\text{if j picks a pro and i drops a con argument}}} \\ { - {2 \mathord{\left/ {\vphantom {2 S}} \right. \kern-\nulldelimiterspace} S},} & {{\text{if j picks a con and i drops a pro argument}}} \\ {0,} & {{\text{if j pics and i drops the same kind of argument}}} \\ \end{array} } \right.$$
(4)

3.2.3 Implications

Explicit and implicit versions of the argument-communication seem equally plausible modeling approaches to model the underlying theory, and we want to test whether or for which conditions they yield consistent results. We know of a crucial difference between the two versions: the implicit argument-communication cannot reproduce all of the opinion outcomes generated by the explicit communication version. Specifically, in the model with explicit argument-communication, a population of agents might develop consensus over a moderate opinion, where all agents have the same number of pro and con arguments. If all agents have the exact same set of arguments, there are no arguments that can be communicated that would change an agent’s opinion: this means that the agents would be locked in consensus.

With implicit argument-communication, in contrast, a consensus on a moderate opinion is not an equilibrium. Since arguments are not explicitly represented, two agents can always influence each other as long as they have not agreed on the same extreme opinion. That is, Eqs. 2 and 3 always yield a positive probability for an outcome in which the argument communicated by j has the opposite sign than the argument dropped by i, unless oi= oj= ± 1—i.e. when the dyad is in the equilibrium state of consensus over an extreme opinion. The other possible equilibrium under implicit argument-communication is that two agents are maximally dissimilar and thus do no longer interact. If a strong faultline aligns with a group-split in opinions, this situation can arise for all pairs of agents in a team. Two agents then either fully agree and are maximally similar, or they are maximally dissimilar and maximally disagree. In this situation, implicit argument-communication can settle into the outcome of stable bi-polarization.

In sum, extreme consensus and perfect bi-polarization are the only stable equilibria in the implicit version, whereas the explicit version can generate moderate consensus as a third possibility. Based on this consideration, we expect that the implicit version of argument-communication is more likely to generate extreme opinion outcomes than the explicit version, all other things being equal.

3.3 Difference in the formalization of homophily

3.3.1 Homophily as likelihood of interaction

This implementation of homophily mirrors the implementation of the X–S model version. When i is selected for an interaction, her potential interaction partners are the remaining team members. The likelihood that the interaction takes place between i and j is a function of their similarity modeled as a combination of similarity in group identity and opinion similarity. Formally, the similarity between i and j at time point t is:

$$sim_{ij,t} = 1 - \frac{{\left( {\left| {g_{i} - g_{j} } \right| + h_{o} \cdot \left| {o_{i,t} - o_{j,t} } \right|} \right)}}{{2 + 2 \cdot h_{o} }}$$
(5)

The similarity simij,t can vary between 0 (no similarity) to 1 (perfect similarity). Parameter ho defines the relative weight of group identity and opinion on the similarity between two agents. For ho= 1, group similarity and opinion similarity have the same impact on the overall similarity between i and j. For ho> 1, opinion dissimilarity weighs more than group similarity, while group similarity is more important if 0 < ho< 1.

Throughout this paper we explore two different values for this parameter, ho∈{0.3,3}. The lower value, ho= 0.3, represents the assumption that demographic differences weigh much more for defining similarity than opinion differences do. This is similar to the X–S model version, with three different demographic attributes and one opinion attribute, where all attributes have the same weight. By contrast, in the I–E (baseline condition) it is assumed that opinion difference weighs 3 times more than group identity for the similarity between agents. This scenario is, thus, replicated with ho= 3 in our study. Finally, the probability that j is selected as interaction partner is computed for all the network neighbors of i, as a function of the similarity of a particular network partner relative to all other network partners, and is defined as:

$$P_{ij,t} = \frac{{\left( {sim_{ij,t} } \right)^{{h_{s} }} }}{{\mathop \sum \nolimits_{j = 1}^{N - 1} \left( {sim_{ij,t} } \right)^{{h_{s} }} }}$$
(6)

In every interaction of an agent i, exactly one of the potential interaction partners j is selected with the probability given by Eq. 6. The strength of homophily is represented by parameter hs (not to be confused with the parameter ho): higher values of hs make the relative similarity between i and j have a bigger effect on the probability that they interact.

3.3.2 Homophily as effectiveness of influence

In this approach (reproducing the setup in I–E), the chances of interaction are independent of the similarity between potential interaction partners. When i is selected to carry out an interaction, an interaction partner j is randomly picked from her teammates, ignoring Eq. 6. Next, the similarity between i and j is calculated according to Eq. 5. The similarity between i and j, however, impacts the magnitude of the opinion change that such an interaction can bring about. More precisely, the similarity simij,t moderates the effect that the communicated argument a has on the opinion of agent i as formalized in Eq. 7.

$$o_{i,t + 1} = o_{i,t} + a_{i,t} \cdot \left( {sim_{ij,t} } \right)^{{h_{p} }}$$
(7)

Mirroring the formalization of the effect of similarity on the probability of interaction (Eq. 6), we include a parameter hp that scales the impact of the relative similarity between i and j on the effectiveness of the interaction (hp > 0). The larger the value of hp, the stronger the effect that similarity has on the impact that a communicated argument has on the opinion of its recipient.

Finally, a truncating function ensures that the updated opinion oi,t+1 stays within the limits of the opinion scale [− 1, + 1]. That is, whenever an agent’s opinion is outside the range of the opinion scale after argument communication, the opinion is set to the value of the closest pole of the scaleFootnote 7.

3.3.3 Implications

The two notions of homophily are similar in that they both imply that actors who hold similar opinions influence each other more than those who are more dissimilar—this is the core of the reinforcing influence that can drive a group towards bi-polarization or extreme consensus. However, it remains unclear how the model differences exactly affect the chances that the persuasive argument model generates extreme opinion outcomes.

A possible clue lies in the number of interactions that are needed for an agent to develop an extreme opinion, depending on the two versions of homophily. When homophily affects the likelihood of interaction, every interaction can modify the agent’s opinion by a fixed amount (i.e. ± 2/S, see Eq. 4). Following previous studies (i.e. both X–S and I–E), here we assume S = 4. This means that agents are always respectively at most two interactions away from potentially developing an extreme opinion.

By contrast, when homophily affects the effectiveness of the influence, the opinion change is weighted by the similarity between interaction partners. Here, agents are always (not at most but) at least two interactions away from potentially becoming extremists. Influence will have relatively little effect especially in the early steps of a cascade of mutually reinforcing influence, when two interacting agents are still relatively dissimilar. This means that conceptualizing homophily as influence effectiveness might make it more difficult for agents to reach an extreme opinion. If agents need more interactions to develop extreme opinions, we can expect two things: first, that extreme consensus or bi-polarization are less likely to emerge within a given time frame; second, that when they do emerge, they do so after a higher number of interaction events compared to the other conceptualization of homophily. In other words, we expect that—all other things being equal—a team with a strong demographic faultline is less likely to develop a group-split or group-polarization within a given time frame if similarity affects the effectiveness of interaction rather than the likelihood of it.

4 Simulation experiments

4.1 Experiment design

In our experiments, we compared the X–S, the I–E and the I–S model. To have a defined point of comparison, we assume a baseline parameter configuration for all three models that represents the scenario we are most interested in. This is a team of a size plausible for real organizations, with a maximally strong demographic faultline. Moreover, in this team group identity is important as a source of similarity, and homophily as well as initial congruency are sufficiently strong so that the emergence of a group-split is possible, but not trivial. The parameter space we explored is tailored to accurately replicate the setup used in earlier work with the X–S and I–E model. Accordingly, we define a baseline scenario with N = 10, strength of homophily hs = hp= 4, impact of opinion differences on similarity ho = 3, congruency w = 0.8 and agents’ memory capacity S = 4. To assess the robustness of the effects of implementation of reinforcing influence and homophily, these effects will also be explored and reported for some alternative parameter settings. Table 1 provides an overview of the parameter space that was explored in this study.

Table 1 Overview of variables and parameters

For every condition inspected in our simulation experiment, we conducted 100 independent simulation runs using NetLogo (Wilensky 1999), where each run was initialized with a different random seed. If not reported otherwise, simulations are run at least 104 interaction events per agent, unless the model converged to equilibrium beforeFootnote 8.

Model outcomes at the end of a simulation were measured in two different ways. First, we tested whether an outcome fell into one of the categories of moderate consensus, extreme consensus or bi-polarization. Second, as we are interested in the degree to which the model generates a group-split, we also measured between-group polarization, defined as the absolute value of the distance between the average opinions within group − 1 and group + 1, respectively.

The classification of model outcomes into one of the three categories is most meaningful if model dynamics have converged, i.e. reached a state in which no further change of the distribution of opinions is theoretically possible. This was not feasible in all of the conditions we inspected: Exceptions will be discussed in more detail below (see also endnote vi).

Criteria for model convergence and classification of outcomes needed to be tailored to the different model types. If the communication of arguments is modelled implicitly, convergence occurs if and only if dissimilarity between every pair of agents is either maximal (sim = 0) or when they agree on the same extreme opinion (± 1). In the former case, influence is impossible because the probability or effectiveness of it is zero. In the latter case, influence cannot alter their opinion because no arguments with a different valence can be adopted. If argument communication is modelled explicitly, convergence occurs when no agent can receive an argument that would change her opinion. Technically, this can happen in two cases. First, all team members hold exactly the same set of arguments and are thus in perfect consensus (extreme or moderate)Footnote 9. Second, in all pairs of agents, they either have exactly the same set of arguments and thus the same opinion, or their similarity is zero, making interaction impossible or influence ineffective.

The conditions for convergence are only met by the three qualitatively different outcomes of moderate consensus, extreme consensus or maximal between-group bi-polarization (equivalent to a group-split). Moderate consensus occurs when all agents hold the same opinion and this opinion is neither − 1 nor + 1. This can only be a stable state when argument communication is explicitly modelled. If all agents hold the same vector of arguments containing both pro and con arguments, then no agent has an extreme opinion and no argument can be circulated that would change an agent’s opinion. Extreme consensus is possible when all agents agree on the same opinion coinciding with one of the poles of the opinion spectrum [− 1, + 1]. When the communication of arguments is explicit, this does not require that agents agree on the same set of arguments, but only that they all possess only arguments of the same valence. In this case no agent can receive an argument with a different valence from an interaction partner and further change is precluded. Similarly, if the argument communication is implicit, no agent has a positive probability of giving a positive (or negative) opinion push to their interaction partner. Either way, when extreme consensus emerges no further influence is possible: extreme consensus is thus a converged outcome. Maximal between-group bi-polarization, finally, occurs when both demographic groups have internally reached extreme consensus on the opposite poles of the opinion spectrum. When this occurs, the absolute difference between the average opinions of the two groups equals 2, since 2 is the span of the opinion scale. It is worth noting that the team could bi-polarize also along other lines of division than group identity, but only if the opinion divide overlaps with the group divide neither outgroup agents nor ingroup agents can further influence a focal agent.

4.2 Results 1: Effects of explicit or implicit argument communication

We expect that the implicit version of argument communication is more likely to generate extreme opinion outcomes than the explicit version, all other things being equal. To test this expectation, we compared the two versions of the models that conceptualize homophily as the likelihood of interaction but differ in assuming implicit vs. explicit argument-communication. Figure 1 shows results of 100 independent realizations of running these two models under the baseline condition. In addition, we conducted a robustness test with a ceteris-paribus replication with N = 100. All simulation runs reached convergence within the limit of 10,000 interaction events per agent so that all outcomes could be classified in the three categories of moderate consensus, extreme consensus and between-group bipolarization. Figure 1 shows how implicit vs. explicit argument communication affected the share of runs ending in each of the three categories.

Fig. 1
figure 1

Effect of implicit vs. explicit argument communication for the baseline scenario and a ceteris-paribus replication with N = 100

To begin with, the results show that both model versions can generate a group-split with bi-polarization. Beyond that, Fig. 1 reveals three main findings. First, as anticipated, we did not observe moderate consensus in any run with implicit argument-communication: moderate consensus emerges an outcome only with the explicit version of the argument-communication.

Second, as a consequence of the above and in line with our expectations, we found that the implicit argument-communication generated more extreme opinion outcomes (extreme consensus, between-group bi-polarization) than the explicit version. In the baseline condition (N = 10), the implicit argument-communication only generated extreme consensus (in roughly 40% of the runs) and between-group bi-polarization (~ 60%). Conversely, in the baseline condition the explicit argument-communication produced moderate consensus in most of the simulation runs, while the rest of the runs were roughly evenly divided between extreme consensus and between-group bi-polarization. The absence of moderate consensus as model equilibrium in the implicit version might explain the higher absolute number of runs that converged to bi-polarization (compared to the explicit version).

There might be another reason why the implicit argument-communication generates more between-group bi-polarization. In both the implicit and explicit versions of the argument-communication, an agent does not update her opinion when she receives an argument of the same valence (pro or con) as the one she forgets. However, a central difference between implicit and explicit argument-communication is that the explicit argument-communication carries some additional probability that the communication of an argument does not change the recipient’s opinion: this is what we call ‘argument redundancy’, and happens when the communicated argument is already known to the receiving agent. By contrast, the implicit argument-communication model has no argument redundancy, as it does not track which arguments are considered. As a consequence, the implicit argument-communication generates more opinion changes than explicit argument-communication. This small difference affects the reinforcement process that is responsible for the emergence of bi-polarization. Consider, for instance, an agent who holds 3 pro and 1 con argument. According to the homophily principle, this agent will most likely be exposed to another pro argument, which according to the implicit argument-communication model will likely intensify her positive opinion. Under the explicit argument-communication regime, this is also the most likely outcome, but it is less likely than under implicit argument-communication, as there is also a positive chance that the agent will receive a pro-argument she already considers. This means that, under the explicit argument-communication regime, the self-reinforcing process of homophily and argument-communication is weaker, which makes bi-polarization a less probable and consensus a more probable outcome of influence dynamics.

Third, Fig. 1 reveals that while the model with explicit argument-communication is able to generate moderate consensus, dynamics did not lead the bigger populations into this equilibrium. With N = 100, moderate consensus emerged only rarely, leading us to the conclusion that this difference between explicit and implicit argument communication affects the long-term outcomes of the dynamics mainly in small teams. However, it should also be noted that teams with 100 members rarely occur, if at all, in real organizations.

This effect of group size in the model with explicit argument communication can be derived from earlier work on the X–S. These studies have shown that moderate consensus is harder to reach in bigger population, because coordination on a single argument vector can take very long in big populations. Even when most agents hold moderate opinions, it is possible that the population will at some moment develop a small bias towards one of the poles of the opinion scale. Due to homophily, agents leaning towards one of the poles will most likely be exposed to further arguments that intensify their opinions. In a population with little opinion variation, agents that moved towards the pole can pull others with them, sparking a collective extremization of opinions, similar to the empirically observed opinion shifts in the experiments of the polarization paradigm from social psychology (Myers and Lamm 1976). In bigger populations, such a scenario is more likely, because it takes these populations longer to reach a consensus on moderate opinions, giving them more time to at some moment develop a small opinion bias that is subsequently intensified.

The effect of the team size in Fig. 1 is the fourth main finding: bigger teams are more likely to experience between-group bi-polarization than smaller teams, under both argument-communication regimes. For the explicit argument-communication, this trend could be the consequence of the previous finding: as simulations with bigger teams were less likely to converge to moderate consensus, there was a higher relative proportion of runs that converged to the other possible outcomes, extreme consensus and between-group bi-polarization. This explanation does not hold for the implicit version of the argument communication, where moderate consensus never emerges as simulation outcome, but still simulation runs were more likely to converge to between-group bi-polarization in big teams than in small teams. This result is both unexpected and puzzling. We acknowledge the need for further research to understand this effect.

To assess the robustness of the four main results of this first experiment, we conducted a ceteris-paribus replication of the experiment shown in Fig. 1 with a higher impact of opinion disagreement on similarity (ho = 3). Figure 2 shows the results.

Fig. 2
figure 2

Effect of implicit vs. explicit argument communication for high impact of opinion d opinion disagreement on similarity (ho = 3). All other parameters are taken from the baseline condition

Figure 2 shows that the four main findings described for Fig. 1 could be replicated. Also under ho = 3, moderate consensus occurs only with explicit argument communication, and extreme opinion outcomes are thus more likely with implicit argument-communication. Concerning the team size, we again find that high N suppresses moderate consensus, and makes between-group bi-polarization more likely. In a further robustness test, we also repeated the experiment of Figs. 1 and 2 with lower and higher initial congruency (w = 0.5 and w = 0.9). Again, the four main findings could be replicated.

4.3 Results 2: Competing notions of homophily

Both extreme consensus and bi-polarization are expected to occur less likely within a given time frame when homophily affects the effectiveness of an interaction rather than the likelihood that the interaction occurs. In addition, when these outcomes emerge, they should do so after a higher number of interaction events compared to a model in which homophily affects the likelihood of interaction.

To compare the two competing notions of homophily, we used the model with implicit argument communication, combining it with the two different versions of homophily from X–S and I–E. Figure 3 depicts results of this variation for the baseline condition. As a further test, we also varied homophily strength (hs and hp). Earlier work on the X–S has shown that homophily strength increases bi-polarization in a model with explicit argument communication (Mäs and Flache 2013). We wanted to know whether this result extends to both versions of the model with implicit argument communication. In the “Appendix”, we provide in addition a comparison of the effect of homophily strength across all model versions.

Fig. 3
figure 3

Effects of the conceptualization of homophily on model predictions after 10,000 opinion updates per agent. Baseline condition, 100 independent realizations per condition

Moderate opinion consensus is not an equilibrium candidate of the two models with implicit argument communication. Therefore, we quantify outcome differences in terms of between-group bi-polarization, that is, the absolute difference between the average opinions of the two demographic groups. Figure 3 reports the share of runs with an extreme consensus (between-group polarization = 0), and the share of runs characterized by a perfect split between the two subgroups after 10,000 interaction events per agent (between-group bi-polarization = 2). Since not all runs reached a state of equilibrium, Fig. 3 further shows the share of runs that had not reached equilibrium but ended instead with an opinion distribution very close to either extreme consensus or maximal between-group polarization. Finally, the figure informs about the share of runs that were not close to one of the equilibria even after 10,000 interaction events per agent. The size of the bubbles in Fig. 3 corresponds to the share of runs with the respective opinion distribution. In addition, the labels in the center of the bubble indicate the exact number of runs observed.

Figure 3 shows that both conceptualizations of homophily are able to explain the emergence of opinion consensus and a split into two opposing groups. Furthermore, both models generate more group splits when homophily is stronger (see hs and hp). The central difference between the two models is that the dynamics of the model with homophily conceptualized as influence effectiveness tend to rest longer in intermediate states and opinion distributions that are close to one of the equilibria of the dynamics. This results in less extreme outcomes after 10,000 interaction events, consistently with our expectation.

As a further test, we conducted a ceteris-paribus replication of this experiment with a high value for the impact of opinion disagreement on similarity (ho = 3). Figure 4 reports the results and shows that the qualitative effects of the conceptualization of homophily are robust against this variation the parameter ho.

Fig. 4
figure 4

Effects of the conceptualization of homophily on model predictions after 10,000 opinion updates per agent. Baseline condition except for ho = 3, 100 independent realizations per condition

Figures 3 and 4 and additional results provided in the “Appendix”, demonstrate that under the model that combines implicit argument communication and homophily via influence effectiveness, and for most values of homophily strength (except for ho= 3 and hs= hp= 1, see Fig. 4), a considerably smaller share of runs converges to perfect consensus or maximal bi-polarization than in the other models. This is shown by the fact that all runs display either minimal or maximal between-group bi-polarization in the left-hand panels of Figs. 3 and 4, whereas in most cases the right-hand panels show runs with non-extreme values. The reason is that unlike the other two models, this model (with homophily via influence effectiveness) generates dynamics that can last very long before an equilibrium is reached. For illustration, Fig. 5 shows the trajectories of the average opinions in the two subgroups in an ideal–typical simulation run. The size of the gray area at a given simulation event, thus, shows the degree of between-group bi-polarization. The dynamics are very rich. For instance, after every agent’s opinion had been updated 5140 times, in one of the two groups, all members had adopted an opinion of 1. The average opinion in the other group was close to the same pole (avg = .99155) but there was still one agent who was not maximally extreme (oi,t= .95775). While the population was very close to reaching an extreme consensus, Eq. 3 implies that it is very unlikely that the opinion of this agent shifted onto the pole. This shift required that the agent dropped a con argument, an event that is very unlikely when an agent holds an opinion so close to the opinion pole of + 1. Instead, dynamics moved the system away from the consensus equilibrium when the agent at some moment communicated a con argument to one of the other agents.

Fig. 5
figure 5

Evolution of the average opinion in the two groups in an ideal–typical simulation run of the model with homophily conceptualized as interaction effectiveness

Only the combination of homophily via influence effectiveness and implicit argument communication makes it possible that outcomes can occur in which the opinion distribution moves so close to one of the extreme outcomes that it is almost impossible to reach eventually. The reason is that the weighing of the impact of an argument with similarity can result in extremely small steps of opinion change, especially with high homophily strength. This property of the model also makes the opposite equilibrium (bi-polarization) very difficult to reach, as is also illustrated by the ideal–typical run in Fig. 5. After 7560 opinion updates per agent, the opinion averages in the two groups were − 98348 and .98001 respectively. In other words, the population was highly bi-polarized. Also in this setting, however, it was relatively unlikely that the population reached the state of maximal bi-polarization. To eventually reach the pole towards which a subgroup tended, all of its members would have needed to drop their last remaining opposite argument and replace it with one leading them all the way towards the extreme. However, as Eq. 3 implies, this was very unlikely to happen because in this situation agents held on average only about 1% arguments in favor of the opposite end of the opinion spectrum.

Reaching a perfectly bi-polarized opinion distribution is particularly time consuming when homophily (hp) is strong. The problem is that an agent who has adopted a maximally extreme opinion will likely be exposed to an argument challenging her opinion when interacting with an agent holding a very different opinion. The resulting opinion shift away from the opinion pole, however, can be extremely small when hp adopts high values. The updated opinion value will therefore be very close to the opinion pole, making it extremely unlikely that the agent drops the counter argument and returns to a maximally extreme view.

Figure 6 shows results that test the expectation that simulations take on average more interaction events before a convergence state is reached, when homophily was implemented via the effectiveness of influence rather than via likelihood of interaction. For Fig. 6, we conducted simulation experiments for the baseline condition with a time limit of 5 × 105 interaction events per agent. We found that all runs but one reached a stable rest point. Runs of the model with homophily modeled as influence effectiveness lasted particularly long when homophily was strong. The opposite effect, however, was found when homophily was implemented as increased likelihood of interaction. Here, homophily strength decreased the duration of the dynamics, an effect that is strong but visually diluted by the logarithmic scale of the y-axis in Fig. 6.

Fig. 6
figure 6

Boxplots showing the effect of the conceptualization of homophily on the logarithm of the duration of the dynamics measured in simulation events (100 simulation runs per treatment, S = 4, S0= 3, w = .8, ho= .3; N = 10, only models with implicit argument-communication)

To further assess the robustness of results for the effects of the implementation of homophily on between-group bi-polarization, additional tests were conducted, varying initial congruency (w), and the number of arguments in agents’ memory (S) for a specific scenario in which a group split was possible, but not easy to obtain within 10,000 interaction events per agent. The reported finding turned out to be robust. In the models with implicit argument communication, extreme outcomes were less likely to emerge within a given time frame for the model with effectiveness-homophily. Further results of these tests are reported in the “Appendix”.

5 Conclusion and discussion

Strong demographic faultlines have been identified as a possible reason why diversity can hamper a team’s cohesion and performance. While empirical research has pointed to a number of moderating conditions for the effects of demographic faultlines, computational modellers have recently begun to address the task of understanding these effects with models of the complex and interdependent dynamics of social relations and social influence in teams. We focused here on one agent-based modelling approach, the model of persuasive argument communication proposed by Mäs et al. (2013), Mäs and Bischofberger (2015) and Mäs and Flache (2013), which closely builds on the fundamental processes of reinforcing influence and homophily central in Lau and Murnighan’s (1998) original theory of faultlines.

The model of persuasive argument communication points to intriguing theoretical hypotheses about the dynamics of group splits in teams. First, it highlights a number of conditions that are required before a demographic faultline can really induce a group-split, including sufficiently strong initial congruency of opinions and demographics, and sufficiently strong homophily. Second, it allows explaining group-split dynamics without making the empirically debated assumption of repulsive forces in social influence, used by earlier formal accounts of group-split dynamics. In addition, Mäs et al. (2013) showed how the addition of “criss-crossing” actors connecting demographically separated subgroups could prevent group-split dynamics despite a strong faultline in a team. As such, the model of persuasive argument communication highlights theoretical directions research could take to test possible strategies organizations could employ to preclude between-group polarization in teams with a strong faultline. This potential practical use of the model of persuasive argument communication makes it highly important to carefully assess the robustness of its main theoretical predictions against alternative theoretically plausible specifications of the micro-processes of reinforcing influence and homophily that are at the heart of the model.

A comparison with alternative modelling approaches in formal models of social influence in general (cf. Flache et al. 2017) and more recent implementations of persuasive argument communication in particular (Feliciani et al. 2017) reveals two important distinctions in both processes. In modelling reinforcing influence, arguments can be explicitly represented in a model or be implicit, inferred from the opinions agents adopt. In modelling homophily, similarity can be assumed to affect the likelihood or the effectiveness of an interaction in which arguments are communicated. We developed a modelling framework that allowed us to separately compare the effects of both distinctions on model dynamics across a new ‘hybrid’ model and two earlier implementations proposed in the literature.

We found that all three model versions could generate the outcome of group-split in a team with a strong faultline, but there are also differences in the conditions and the dynamics of between-group bi-polarization across the models. We observed that implicit argument communication generated more between-groups bi-polarization and extreme consensus than the explicit argument-communication. This difference is explained by the fact that between-group bi-polarization and extreme consensus are the only outcome equilibria in the implicit argument-communication regime, whereas the explicit version produces a third possible outcome, moderate consensus. We found that this difference was limited to teams of small size, however: in large teams, the emergence of moderate consensus is highly unlikely with the explicit version, too. Additionally, we found that the team’s size has an interesting and unanticipated effect on the implicit argument-communication: we observed that, under most parameter configurations, between-group bi-polarization is more likely to emerge in big teams than in small ones.

Also the conceptualization of homophily affected team dynamics in our theoretical studies. When homophily affected the likelihood of interaction rather than its effectiveness, this resulted in more extreme opinion outcomes. Simulations more likely converged on bi-polarization or extreme consensus within a given time frame, and group-splits were more likely to occur.

In a nutshell, the present research generally supports the robustness of the persuasive argument model as a tool to theoretically disentangle the dynamics and conditions of group-split in teams with a strong demographic faultline. At the same time, our study has pointed to a number of potential limitations that highlight directions for future research and underline important lessons for computational models of team dynamics in a more general sense. Our study showed in particular how plausible and seemingly small modifications in the formalization of micro-processes of social influence can have profound consequences for model dynamics and theoretical predictions that require careful inspection. For example, the model with implicit argument communication was found to not generate the outcome of moderate consensus that is possible with explicit argument communication, also not in the small-sized populations that are relevant for modelling teams in organizations. A further example of a profound and intricate effect of a seemingly small change were the erratic opinion dynamics that was induced by the combination of homophily via effectiveness of interaction, and implicit argument communication. In this model, opinion differences between stable and unstable outcomes could become so small over time, that careful inspection of the robustness of its implications against the problem of floating point inaccuracy (Galán et al. 2009; Polhill et al. 2005) is needed in future studies employing similar combinations of assumptionsFootnote 10.

Notwithstanding its limitations, our research has highlighted potentially highly relevant insights, not only for research on organizations facing diversity in teams with strong faultlines. The important role of the conceptualization of homophily that we identified resonates with a wider debate on the forms of homophily. In real life, opinion formation processes by argument communication can happen in situations where homophily can play out in either one of the two ways, or both at the same time. In an online social network, for example, homophily can typically take both forms. To begin with, homophily can influence who a user might decide to engage in a discussion with, whom to send a ‘friendship request’, or whom to ‘follow’. In this sense, homophily affects the likelihood of interaction. At the same time, the opinion of close friends and relatives might count more than the opinion of complete strangers. Here, homophily is affecting the effectiveness of the interaction. In some other settings, there are structural constraints that exclude the individual from the decision of whom to interact with. This is the case, for example, when all discussants are asked to present their point of view in turns, so that everybody is exposed to everybody else in the same manner. Such interaction structure rules out the possibility for homophily to affect the chances of interaction, while it can still affect the effectiveness of the interaction: one can still choose whether to care, or to pay attention to whomever is speaking, based on the opinion that is being expressed, or the group identity of who is voicing it.

Our theoretical results also suggest an intriguing interpretation of empirical findings of Strandberg et al. (2017) about conditions for polarization in a group discussion where the discussion is either moderated or not. In a moderated discussion, a facilitator guides the discussion by ensuring inclusion of discussants and equality of the discussion. In terms of our implementations of homophily in persuasive argument communication, the moderation of a discussion limits homophily via likelihood of interaction, as participants cannot choose whom to interact with based on their preferences. Therefore, we can say that a moderated discussion favors homophily via effectiveness of interaction rather than as likelihood of interaction. People may put more weight on arguments obtained from others they agree with, but they cannot escape exposure to argument also from actors with dissimilar views. The main finding of this empirical study was that discussion between like-minded individuals is significantly less likely to produce group polarization when the discussion was moderated, compared to a free discussion. This finding resonates with the predictions of the model of persuasive argument communication: with homophily via effectiveness of interaction, the model predicts that extreme consensus and bi-polarization are less likely to emerge.

These considerations highlight interesting directions for future theoretical and empirical work. On the one hand, theoretical work could explore whether other processes of opinion formation are sensitive/to the way how homophily is conceptualized and implemented. On the other hand, empirical work could test these effects in real world settings with experiments by manipulating—for example—the link between similarity of opinions and likelihood of interaction. First studies (see Mäs and Flache 2013) along this line have been conducted. We believe our theoretical work here has provided new insights that could inspire future empirical tests.