1 Introduction

Social interactions play a crucial role in the evolution of culture for a minority. Inter-group contact and frequent exchanges can lead to a process of cultural assimilation, which promotes social trust and helps solve coordination problems, thereby fostering the provision of public goods (Alesina et al. 1999; Alesina and La Ferrara 2005), the transmission of human capital and the implementation of efficient policies (Guiso et al. 2016). Moreover, culturally assimilated minorities are less likely to form socially segregated enclaves, thereby reducing conflict. Cultural assimilation and social segregation are also key determinants of the economic and political consequences of immigration: if immigrants live in enclaves, hold on to their values, and strive to transmit them to their children, anti-immigrant feelings might arise (Dustmann and Preston 2001; Facchini and Mayda 2009), strengthening the support for nationalist parties and anti-immigration policies (Russo 2021; Tabellini 2020).

To study the joint determinants of cultural assimilation and social segregation, I propose a calibrated epidemiological model of cultural evolution in a country where a fraction of the population is a minority, and where the majority is culturally heterogeneous. The key feature of the model is a disutility from social matchings between individuals with different opinions, either as a result of social conformism (Akerlof 1997; Bernheim 1994; Cialdini and Goldstein 2004), or because cultural differences might hinder trade (Lazear 1999), for instance making it more costly to sign contracts. The accumulation of disutilities, in turn, induces agents, both in the majority and in the minority, to change their opinions and to avoid social interactions with the individuals from whom they are culturally distant. The costs of changing opinions are idiosyncratic and issue-specific, so the model implies heterogeneous assimilation patterns, both over individuals and over traits: not all minority agents assimilate, and some cultural traits tend to be persistent. The choice over social interactions, in turn, results in the endogenous formation of social networks by homophily (Currarini et al. 2009; MCPherson et al. 2001), potentially leading to social marginalization, with further implications for the assimilation process. The endogenous social networks, together with the model calibration and with the assumption of culturally heterogeneous majorities, are new with respect to the literature on cultural transmission (Bisin and Verdier 2000, 2001; Giavazzi et al. 2019).

I propose two measures to quantify the extent of cultural assimilation in the model. The first, which I call \(\sigma \)-assimilation, is designed to compare individuals at a given point in time, and it is a measure of the average cultural distance between a minority agent and the majority. Since opinions are binary in the model, I use the Hamming distance between individual vectors of opinions to measure cultural distances, and I define an individual in the minority as more \(\sigma \)-assimilated if her average cultural distance from the majority is smaller. The second measure, which I call \(\Delta \)-assimilation, is instead designed to track the evolution of assimilation over time, and it is simply computed as the first difference of the \(\sigma \)-assimilation measure over a pre-specified time interval. According to this second measure, there is \(\Delta \)-assimilation for an individual in the minority if her cultural distance from the majority shrinks.

The model cannot be solved analytically, so I discuss a numerical solution for a calibration targeted to the European countries, for which I have joint information on values and opinions from the European Social Survey (ESS), and on demographics from the EUROSTAT census. The calibration is targeted at first-generation immigrants, for which I take the country-specific shares from census data. Using information on the immigrant’s nationality, I can further construct a measure of initial cultural distances between immigrants and natives to feed in the model. The rest of the model parameters are then calibrated to match: the distribution of the frequency of social interactions for the immigrants and the distribution of the importance of tradition for immigrants, as they emerge from the ESS answers; the share of socially marginalized individuals, with few social interactions, also from the ESS.

I show that the model is able to capture several features of the data that it was not calibrated to match. Most importantly, the model is able to replicate some stylized facts, or correlation evidence, that emerges from the data. The exercise consists in computing an empirical measure of \(\sigma \)-assimilation of immigrants using the ESS, regressing it on some covariates and then showing that regression of the \(\sigma \)-assimilation measures computed on an artificial cross-section of simulated data from the model delivers the same results. In particular, I find that assimilation is weaker (higher cultural distance between the majority and the minority) in pluralistic societies characterized by a high variability of opinions within the majority. Moreover, assimilation is stronger for more socially active immigrants, but weaker in the case of immigrants whose origin country is culturally more distant, on average, from their host country. Finally, assimilation is not influenced by the immigrant share of the population. More than that, the model is able to replicate: the within-country variability of \(\sigma \)-assimilation; the cross-country variability of \(\Delta \)-assimilation; the positive correlation between cultural assimilation and individual well-being that emerges from the data.

Once I establish the model validity in matching the empirical evidence, I use it to perform comparative statics within an environment where assimilation patterns are heterogeneous across individuals and across cultural traits. First, pluralism, that is the initial cultural heterogeneity within the majority, reduces assimilation and reduces the average size of the social networks, leading to the formation of small cultural enclaves. Second, minorities who come from countries that are culturally more distant assimilate less. Third, the immigrant share does not affect assimilation. The model also predicts more cultural assimilation in countries with high social density defined as the frequency of social interactions, although the effect is small. Social segregation, in turn, increases with social density, with the minority share, and with the initial average cultural distance between the minority and the majority. These results are robust across different model parametrizations and several model extensions.

The model, however, cannot be used to perform a welfare analysis on the potential social desirability of cultural assimilation. The main reason is that it abstracts from the trade-off associated with cultural diversity discussed by Ashraf and Galor (2013): diversity hinders the transmission of human capital and increases the transaction costs related to within-country interactions, but it also fosters creativity and, therefore, technological change. Moreover, assimilation is not necessarily desirable because the very idea of cultural assimilation abstracts from a judgment about the kind of culture that prevails in the long run. For instance, there can be assimilation of a minority to a culture of violence and discrimination, with potentially severe social and economic consequences.

The rest of the paper is organized as follows: Section 2 briefly reviews the related literature. Section 3 describes the model and the calibration procedure. Section 4 explains the construction of cultural assimilation measure, both in the model and in the data. Section 5 summarizes the tests of the model fit to the data. Section 6 illustrates the model comparative statics. Section 7 discusses several robustness checks and model extensions. Section 8 concludes. The appendix discusses additional results from the empirical analysis. An additional on-line Appendix discusses additional results and further model extensions and provides details on the construction of the vectors of opinions that define culture.

A small note before proceeding: in the paper, I use the word assimilation, instead of integration, to be consistent with the previous literature but without any particular political or moral judgment on the process of assimilation.

2 Related literature

This paper is related to the economic, sociological, and anthropological literature on cultural evolution. The main contribution in the construction of a calibrated epidemiological model with endogenous social networks and with a culturally heterogeneous majority that is consistent with the empirical evidence on assimilation in Europe.

I build on the seminal paper by Lazear (1999) (and on its dynamic extension by Konya 2005), borrowing the assumption of a positive gain from cultural assimilation stemming from lower transaction costs. Bisin and Verdier (2000, 2001) model cultural evolution as a result of the interaction between the vertical transmission within the family and the horizontal transmission outside the family, resulting in persistent cultural traits because of homogamous marriages. A similar framework appears in Panebianco (2014), who focuses on inter-ethnic attitudes. Giavazzi et al. (2019) merge the Lazear (1999) and the Bisin and Verdier (2000, 2001) models, to study the speed of cultural evolution in the US, while Algan et al. (2022) use a similar model to explain the prevalence of Arabic names in France. In a related contribution, Adda et al. (2019) show that granting legal status to immigrants reduces intermarriages. The main innovation with respect to these contributions consists in proposing a calibrated model with endogenous network formation that, consistently with the data, delivers both heterogeneity in assimilation and social segregation.

Darity et al. (2006) and Bazzi et al. (2019) build evolutionary models of identity formation based on random matchings, showing that polarization hampers the emergence of a common culture. Kuran and Sandholm (2008) propose instead an evolutionary model where cultural evolution depends on parents’ socialization, coordination gains, and self-persuasion, all of which lead to culture hybridization. My model shares many features with those approaches, but it extends the analysis to the endogenous formation of social networks, making it also close to the literature on networkFootnote 1 formation (Currarini et al. 2009, among others), on the importance of networks for immigrants’ assimilation (Verdier and Zenou 2017). More broadly, my analysis is related to the economic literature on identity formation (Akerlof and Kranton 2000; Benabou and Tirole 2011; Shayo 2020 and on the recent literature on the evolution of narratives (Shiller 2017).

Grosjean (2011) proposes a gravity model of cultural integration, showing that cultural change can be slow. My model implies instead relatively fast assimilation, in line with the evidence in Manning and Roy (2010) and Cameron et al. (2015). Abramitzky et al. (2020) show instead that the speed of assimilation, in the US, did not change much over time. In a related contribution, Fouka et al. (2022) show that the inflow of a relatively more distant minority can ease the assimilation and reduce the segregation of the less distant minorities. Giuliano and Nunn (2021) propose and test a different theory of cultural evolution based on environmental similarity, showing that culture persists in case of climatic stability. I abstract from these features. Sato and Zenou (2020) propose a model of cultural integration that focuses on residential patterns within cities, a feature from which I also abstract, even if my model could be extended to residential, rather than only social, segregation.

With respect to the measurement of cultural assimilation, Meng and Gregory (2005) and Bisin et al. (2008), among others, quantify assimilation using intermarriages; Constant et al. (2009) propose a measure of ethnic identity based on language and self-identification; Arai et al. (2009) and Abramitzky et al. (2020) measure assimilation with the frequency of ethnic names; Aleksynska (2011) uses civic participation; Manning and Roy (2010) use a self-assessed measure of assimilation. With respect to these works, I measure cultural assimilation by leveraging a large amount of information on several cultural traits as it emerges from ESS survey questions, and I distinguish between a static and a dynamic measure of assimilation. My measures of assimilation are related to the social antagonism framework developed by Desmet et al. (2017), but, differently from them, I propose a specific metric for cultural assimilation at the individual level.

Several empirical studies highlight the persistence of culture, also among minorities and among immigrants, and my model is consistent with this evidence. Examples include Giuliano (2007) for living arrangements, Fernandez and Fogli (2005) for fertility and labor market participation, Guiso et al. (2006) and Algan and Cahuc (2010) for trust, Becker et al. (2014) for trust and (low levels of) corruption, Botticini and Eckstein (2005) for education preferences, Voigtlander and Voth (2012) for antisemitism, and Luttmer and Singhal (2011) for preferences for redistribution. My model is consistent with this persistence.

3 The model

The scope of this section is to proposes a simple theoretical framework to think about assimilation in culturally heterogeneous societies. I first describe the model setup (Section 3.1), and then the details of the calibration procedure (Section 3.2).

3.1 Model setup

A fictional country is inhabited by N citizensFootnote 2 indexed by \(i \in \{1,2,\dots N\}\). A fixed fraction \(\lambda < 0.5\) of the population belongs to a cultural minority (identifier MN), while a fixed fraction \((1-\lambda )\) to a majority (identifier MJ). The citizens match socially, every period, with a total of \(z_{i}=\lfloor \gamma _{i} N \rfloor \) individuals to discuss and exchange ideas, where \(\gamma _{i}\) is an individual measure of social participation. Social participation depends on a variety of factors, including the degree of urbanization, the amount of social capital, the average size of schools and firms, and the use of social networks. Although the total number of individuals met each period by agent i is fixed, I assume that the identity of those individuals changes. I denote with \(J_{it}\) the set of individual identifiers of the agents that match with agent i at time t, with cardinality \(\vert J_{it} \vert = z_{i}\). Without this feature, the model will deliver less heterogeneity in assimilation with respect to the observed one.

There is a total of M issues to potentially discuss in social matchings, and the opinions are binary. I denote with \(q^{m}_{i t} \in \{0,1\}\) the opinion of agent i at time t over the issue \(m \in \{1,2,\dots M\}\), with the convention that \(q^{m}_{i t}=1\) in case of “agreement” and \(q^{m}_{i t}=0\) in case of “disagreement.” For instance, an agent might agree that becoming rich is a primary goal in life or that praying every day is important. The assumption of binary opinions simplifies the analysis, but it is restrictive. In the on-line Appendix, I discuss a model extension to multiple opinions, which delivers similar results. I assume that the issues differ in their salience, which I define as a probability \(\theta ^{m}\) to discuss them in a social matching. I denote with \(Q_{it}=\{q^{1}_{it}, q^{2}_{it}, \dots , q^{M}_{it} \}\) the full set of opinions of agent i, which is the model equivalent of culture. Both assumptions are designed to match the wide heterogeneity in assimilation over different issues observed in the data.

The agents’ utility function is very stylizedFootnote 3: they simply derive a disutility when they discuss an issue with an agent who holds a different opinion. There are two possible interpretations of the disutility. The first entails the notion of social conformism (Akerlof 1997; Bernheim 1994; Cialdini and Goldstein 2004), with individuals that prefer to think and behave as the majority and that are negatively affected in case they realize that they do not. The second is related to the possibility of “missed trade” (Lazear 1999), in the sense that it is more difficult, or more costly, to sign contracts with commercial counterparts that come from different cultural backgrounds. For simplicity, I normalizeFootnote 4 the per-matching disutility to one (wlg). I define the disutility of agent i over the issue m, in a given period t, as:

$$\begin{aligned} h^{m}_{it}= \frac{\sum _{j \in J_{it}} \Theta _{ijt}^{m} \, \mathbbm {1}_{[q^{m}_{it} \ne q^{m}_{jt} ]} }{\sum _{j \in J_{it}} \Theta _{ijt}^{m}} \end{aligned}$$
(1)

where the \(\Theta _{ijt}^{m}\) are \(z_{i}\) independent draws from a Bernoulli distribution with probability \(\theta ^{m}\). The agents are characterized by a rule-of-thumb reaction to the accumulation of disutility, which works along two dimensions: changing opinions and avoiding social interactions. In both cases, the model features tipping points. To capture the empirically observed variability in assimilation, both over individuals and over issues, I assume that the agents have an idiosyncratic switching cost, which I model as an individual and issue-specific, total disutility threshold \(\hat{H}^{m}_{i}\), above which an agent switches (\(h^{m}_{it} > \hat{H}^{m}_{i}\)). Agents with high thresholds typically hold on to their opinions even in case of frequent disagreements; they are closely tied to their heritage, perhaps because of their upbringing. Viceversa, the agents with low thresholds are the conformist. These thresholds will be calibrated leveraging the survey information on the importance that different individuals attribute to the tradition. Since the opinions are binary, a switch simply entails changing from disagreement to agreement and viceversa. In this baseline model, I assume, for simplicity, that past disutilities are fully discounted, as agents were memoryless: all their decisions are based on what happens in the current period. In Section 7.1, I discuss a model extension to memory, which delivers similar results.

The second reaction to disagreement is a process of social exclusion: if a matching with a given agent results in a high disutility, then all future interactions with the same agent are avoided. The implication is that the identity of the agents in social matchings is the result of previous interactions or, alternatively, there is an endogenous social network formation mechanism based on homophily (MCPherson et al. 2001). The total disutility from a matching between agents i and j, at time t, conditional on their matching, is equal to:

$$\begin{aligned} \hat{f}_{it}^{j} = \sum _{m=1}^{M} \hat{\Theta }^{m}_{t} \vert q^{m}_{i t}-q^{m}_{j t} \vert \qquad j \in J_{it} \end{aligned}$$
(2)

where \(\hat{\Theta }^{m}_{t}\) are random draws from a Bernoulli distribution with parameter \(\theta ^{m}\). When this disutility is above a threshold, the agent j is excluded (“flagged”) from the social network, meaning that there will never be new matchings with her from period \(t+1\) onward. These flags are the result of spot interactions that lead to strong disagreements, which are akin to ideological fights. It is therefore possible to define a binary vector of “flags,” for each agent i and time t, \(F_{it}=\{f_{it}^{j}\}_{j \ne i}\), whose \(N-1\) elements are equal to 1 if the total disutility \(\hat{f}_{it}^{j}\) from a matching between i and j were above a pre-specified threshold \(\zeta _{i}\) in a previous interaction before time t:

$$\begin{aligned} f_{it}^{j} = \mathbbm {1}_{[ \hat{f}_{i \bar{t}}^{j} \, \ge \, \zeta _{i} ] } \qquad \text {for some} \quad \bar{t} < t \end{aligned}$$
(3)

For consistency, I assume that \(f_{it}^{j}=f_{jt}^{i} \quad \forall \, i,j\) (undirected links on the social network) at any point in time: if an agent i avoids any social contact with j, than there should be no social matching between i and j. This endogenous network formation is an essential element of the model, which allows to capture the co-evolution of culture and social interactions. Moreover, it allows the model to match a specific feature of the data, that is the heterogeneity in social participation, both within and between countries, and the extent of socially marginalized individuals. I assume that the identity of the agents in social matchings is chosen randomly, given \(\gamma _{i}\), but only among the non-excluded (non-flagged) individuals. For further consistency, I assume that the process of social exclusion is at work both for minority and majority agents. Furthermore, in this benchmark model, the flags are irreversible (see Section 7.1 for robustness), and they are chosen regardless of the eventual changes of opinions that happened in the period (see Section 7.1 for robustness). One possible outcome of this selection algorithm of the social network is the exclusion of all individuals, i.e., \(f_{it}^{j} = 1 \, \forall \, j\) (“zero” degree, see Jackson 2010). I define this condition as social marginalization (Constant et al. 2009), and I use this information to calibrate the model.

One crucial feature of the model is that it delivers heterogeneous assimilation patterns, that is a full distribution of \(\sigma \)-assimilation over minority agents. In other words, not all agents from the minority assimilate even if there is convergence, on average, for the minority as a group. Moreover, assimilation is also heterogeneous by traits, and some of them are persistent even for otherwise culturally assimilated minority agents. A second distinctive feature of the model is that assimilation is also the result of intragroup contact among minority agents.

The model, because of its complexity, cannot be solved analytically, and it is not possible to formally show the existence and uniqueness of an equilibrium. The complexity, in turn, is a consequence of the many heterogeneities, and of the joint decisions over opinions and social networks, both of which are included to make the model able to match the empirical evidence. To obtain a numerical solution, I propose a calibration to a cross-section of European countries and then show that the model is able to replicate several empirical features that it is not calibrated to match. Once a numerical solution is available, I will show that such a solution is stable across simulation rounds, proving that there is no concern about potential multiple equilibria. The details of the calibration procedure are outlined in the next Section 3.2. The uninterested reader can directly jump to Section 4 that discusses how to measure cultural assimilation.

3.2 Parameters, calibration, and solution

Paramters

I simulate countries composed by a fixed measure of \(N=1000\) individuals, therefore abstracting from cross-sectional population differences. I consider a total of \(M=50\) issues to potentially discuss, with salience equal to \(0.02 \cdot m\), meaning that the issue \(m=1\) is discussed in 2% of the social matchings only, while the issue \(m=M\), in all matchings (the results are robust for different values of M as long as this is not too small). For the maximum social densityFootnote 5D, I arbitrarily normalize it to 5%, to have a reasonable (i.e., not too big) number of social contacts, but the results turned out to be robust for different values of D (see Section 7.1 for the cases of 2.5% and 10%), thereby proving that this arbitrary normalization is not problematic.

I set the minority share \(\lambda \) according to the first-generation immigrants’ share from the 2011 EUROSTAT census. The initial opinions of the minority are randomly drawn from a Bernoulli distribution with parameter \(\alpha \), while the initial opinions of the majority from a Bernoulli distribution with parameter \(\beta \). To set \(0 \le \beta \le 0.5\), I use the average varianceFootnote 6 of the natives’ answers to the ESS questions, which is a large survey designed to elicit differences in cultural traits between individuals (see infra for further details). The difference between \(\alpha \) and \(\beta \) is the model equivalent of the initial average cultural distance between the majority and the minority. Following the literature on the vertical transmission of culture (Desmet et al. 2017 and Spolaore and Wacziarg 2009, among others), I use micro-satellite genetic distance to set it. For each country, I compute the weighted average genetic distanceFootnote 7 between the immigrants and the natives using the immigrants’ shares by nationality from the census as weights. Given the country value of \(\beta \), I use these distancesFootnote 8 to set \(\alpha \).

Calibration

The rest of the parameters are calibrated to match: the empirical distribution of social interactions, the empirical distribution of the individually assessed importance of tradition, and the extent of social marginalization.

To calibrate the individual level of social participation, or social density, \(\gamma _{i}\), which determines the number of matchings, I look at answers to the ESS question that asks about the frequency of social interactions (answers are on a scale from 1 to 7, where 1 corresponds to “Never” and 7 to “Everyday”; summary statistics in the on-line Appendix). I assume that \(\gamma =D \cdot \Gamma \), where \(\Gamma \) is distributed according to a Beta and where D is a scaling factor, and I set the parameters of the Beta distribution to match the empirical distribution of answers in the country.Footnote 9

The thresholds \(\hat{H}^{m}_{i}\) are instead calibrated to match the empirical distribution of the answers to the ESS question on the importance “[...] to follow traditions and customs,” again consistently with the empirical analysis (summary statistics in the on-line Appendix). First, I fit a Beta distribution to the discrete answers using the same procedure as for social density, to then draw randomly, from the fitted distribution, individual average thresholds for each \(i \in MN \). After that, I further draw m issue-specific thresholds \(\hat{H}^{m}_{i}\), for each individual, from a Uniform distribution with a mean equal to the draw from the Beta. In case \(H^{a,b}_{i}<=0.5\), the draw is from \(U[0; 2 H^{a,b}_{i}]\); in case of \(H^{a,b}_{i}>0.5\), from \(U[2 H^{a,b}_{i}-1; 1]\).

Finally, the threshold disutility for exclusion from social contacts is calibrated to match social marginalization as it emerges from the ESS. In particular, I look at the question that asks how often the respondent “[...] Takes part in social activities as compared to others of the same age” (summary statistics in the on-line Appendix), and take, as a target, the share of respondents that answersFootnote 10: “Much less than most.” I assume that there are two different thresholds for agents within or outside the social group: \(\zeta ^{in}\) for members of the same group (minority agents to exclude minority agents and majority agents to exclude majority agents) and \(\zeta ^{out}\) for members of the other group (minority agents to exclude majority agents and majority agents to exclude minority agents). Then, I set \(\zeta ^{in}\) and \(\zeta ^{out}\) so that, in the model, the fraction of minority agents with few social contacts at the end of the simulation:

$$\begin{aligned} SI_{T}=\frac{1}{\lambda N} \sum _{i=1}^{\lambda N} \mathbbm {1}_{\big [ \frac{1}{N-1}\sum _{j \ne i} (1-f_{i T}^{j}) \, \le \, K \big ]} \end{aligned}$$
(4)

matches the empirical share of socially marginalized respondents among the minority.Footnote 11 Setting \(K=0\) results in a very small percentage of marginalized minority agents in the simulation. To fit the data, I arbitrarily set \(K=0.05\), but I extensively test the results for robustness (see Section 7.1).

Solution

The benchmark simulation entails 100 rounds of T=100 periods, and I consider a window of 10 time periods, at the end of the simulation, to check for the solution stability, which is the numerical equivalent of showing the equilibrium existence. The coefficient of variation of the \(\sigma \)-assimilation measure over this time window is equal, on average (over countries and over simulation rounds), to 0.48%, while the coefficient of variation of the degree distribution of social networks to 3%. Both values are strong indicators of numerical stability (and that 80 periods are indeed enough for the benchmark model solution). In practice, the model simulations tend to reach, quite rapidly, a point where the configuration of assimilation and segregation is stable.

Since I cannot prove formally that the equilibrium is unique, I also checked numerically for differences over multiple simulation rounds (at T=100). The average (over countries) coefficient of variation turned out to be equal to 3% and, even more strikingly, the percentage difference between the bigger and the smaller values of \(\sigma \)-assimilation obtained over multiple simulations is, on average (over countries) equal to 10%. For what concerns the networks, the coefficient of variation of the country-average network degree is equal, on average (over countries), to 4.4%, and the average difference between the smaller and bigger values over different simulations is equal to 15%. Overall, the numerical differences between simulations are quite small and just driven by computational noise. In other words, there is no ground to be concerned with the possibility of equilibrium multiplicity.

4 Measuring cultural assimilation

In this section, I describe how to measure cultural assimilation, both in the model and in the data, respectively at a given point in time (Section 4.1) and over time (Section 4.2).

4.1 \(\sigma \)-Assimilation

In the model, I compute the average difference of opinions between the majority and the minority, that is an average cultural distance, to evaluate the extent of assimilation of a minority at a given point in time, with smaller differences indicating more assimilation. Since the opinions are binary vectors, and since the number of issues is the same for all agents, I use the Hamming distance, equal to the relative number of vector positions with different entries. The average cultural distance of a minority agent i from the majority, at time t, is equal to:

$$\begin{aligned} S^{k}_{it}= \frac{1}{(1-\lambda )N} \, \sum _{j \in MJ} \,\frac{1}{M} \, \vert \vert Q^{k}_{it}-Q^{k}_{jt} \vert \vert _{1} \end{aligned}$$
(5)

where k indicates the country and where \(\vert \vert Q^{k}_{it}-Q^{k}_{jt} \vert \vert _{1}\) is the \(L^{1}\) distance (or taxicab metric) between the vectors \(Q^{k}_{it}\) and \(Q^{k}_{jt}\), equal, for binary vectors, to the number of different elements \(\sum _{m} \vert q^{m}_{it} -q^{m}_{jt} \vert \) for \(j \in MJ\) and \(\forall i\). I call this measure \(\sigma \)-assimilation.

For the empirical implementation, I focus on European Social Survey (ESS) to elicit opinions on a wide array of subjects spanning from religiosity, social trust, and politics to the role of women and to the importance of human values such as tolerance and respect, although the measure can be in principle computed starting from any survey. Mimicking the computations in the model, I compute the cultural distance between the minority agent i and the majority agent j at time t as the Hamming distance between their vectors of answers to the ESS questions:

$$\begin{aligned} \bar{S}^{k}_{i t}= \frac{1}{\vert MJ \vert } \, \sum _{j \in MJ} \, \frac{1}{\bar{M}_{ij}} \, \Bigg [ \sum _{m=1}^{\bar{M}_{ij}} \mathbbm {1}_{ [ a^{m,k}_{it} \ne a^{m,k}_{jt} ]} \Bigg ] \end{aligned}$$
(6)

where \(\bar{M}_{ij} \le M\) is the number of questions answered by the (ij) pair in country k, with i in the minority (\(i \in MN\)) and j in the majority (\(j \in MJ\)), M is the total number of survey questions, \(\vert MJ \vert \) is the number of survey respondents in the majority (cardinality of the set MJ) and \(A^{k}_{it}=\{ a^{1,k}_{it}, a^{2,k}_{it} \dots a^{m,k}_{it}, \dots a^{M,k}_{it} \}\) is the full set of answers. Averaging these distances over all agents in the majority, I then obtain the average cultural distance between the minority agent i and the majority. The attractive empirical feature of the hamming distance is its capability to evaluate differences over cultural traits whose empirical measurement are not on ordinal scales, which is often the case for survey questions, as well as the possibility to aggregate over several traits measured differently.

Operationally, I focus on 28 countriesFootnote 12 and on the last 5 ESS wavesFootnote 13 (ESS5-2010; ESS6-2012; ESS7-2014; ESS8-2016; ESS9-2018) because of the joint availability of EUROSTAT data that I use both in the empirical analysis and to calibrate the model. I only excluded the questions that ask for subjective assessments related to feelings (feeling of happiness etc.), and questions about personal experiences (victim of robberies, etc.), that are not related to what people think, as well as the variables that I use as covariates to explain assimilation. The list of questions that I used is available in the on-line Appendix.

I focus on a single, well-defined minority, namely immigrants, and I assume that the immigrants respondents to the ESS are representative of the population of immigrants, thereby trusting the sampling design of the ESS sample. To check if this is a good assumption, I compared the ESS sample composition for immigrants according to gender, age, and education with the EUROSTAT 2011 census data. The results are summarized in Table 1, both for the entire sample and for the countries with the biggest share of ESS immigrant respondents, and they show that the ESS sample is indeed close to representative. The only exception is for primary and tertiary education, but, in this case, the ESS sample is not representative of nearly all countries, meaning that there are no systematic relationships between being non-representative and the other features of the data that I exploit for the empirical analysis. As for the majority, I consider all ESS survey respondents who are not immigrants, whose parents were born in the country (second-generation immigrants are excluded) and who did not define themselves as part of a minority.

One potential problem is that more integrated immigrants could be more likely to respond to ESS questions or to be included in the survey in the first place, so it is indeed possible for my empirical measure of assimilation to be an upper-bound for the observed assimilation, and potentially with a lower observed variance (missing observations of non-integrated immigrants in the the tail of the distribution). However, since the data show that the sample is representative of the population of immigrants as it emerges from the census data, the bias in my measure of assimilation should be, if anything, low.

Table 1 Sample statistics

4.2 \(\Delta \)-assimilation

A second approach to evaluate the extent of assimilation is to look at the changes in average cultural distance between the majority and the minority over time. In the model, the change in average cultural distance from period t to period \(t+q\), for an individual i in country k, is equal to:

$$\begin{aligned} I^{k}_{i q}= \bigg [ \frac{S^{k}_{i t+q}-S^{k}_{i t}}{S^{k}_{i t}} \bigg ] \end{aligned}$$
(7)

with lower values corresponding to more assimilation, basically a steeper reduction of the average cultural distance from the majority. I call this second measure \(\Delta \)-assimilation. Indeed, the model delivers \(\Delta \)-assimilation: the cultural distance between the minority and the majority shrinks over time (negative \(\Delta \)-assimilation measure), consistently with the empirical evidence of a stronger assimilation for immigrants that spent more time in the country (see infra).

The empirical counterpart of this measure is problematic to compute because there is no panel component in the ESS, meaning that it is impossible to track cultural assimilation over time for single individuals. To overcome this issue, I considered an aggregate empirical measure obtained comparing immigrants by years or cohort of immigration. The idea is that, in case of \(\Delta \)-assimilation, those who spent more time in the country, everything else equal, should be less culturally distant from the natives. Formally, I compute the empirical measure of \(\Delta \)-assimilation in country k, after s periods, as:

$$\begin{aligned} \bar{I}^{k}_{t s}=\frac{1}{J-s} \sum _{j=1}^{J-s} \bigg [ \frac{\bar{S}^{j+s, k}_{t} - \bar{S}^{j, k}_{t}}{\bar{S}^{j, k}_{t}} \bigg ] \end{aligned}$$
(8)

where \(\bar{S}^{j, k}_{t}\) is the average empirical cultural distance between natives and immigrants that spent exactly j years in the country (J maximum), s is the time span of the comparison and t is the survey wave. I implement the computation with the \(9^{th}\) ESS wave, administered in 2018, 7 years after the period used for the model calibration, which entails having model periods of slightly less than 1 month. Since there are relatively few immigrants respondents to the ESS, I consider 5 years cohorts: \(\{ j\in [0,4]\), \(j \in [5,9]\), \(j \in [10,14]\), \(\dots \), \(j \ge 40 \}\). This implies comparing immigrants that spent between 5 and 9 years in the country to immigrants that spent less than 5 years, then comparing immigrants that spent between 10 and 14 years to immigrants that spent between 5 and 9 years, etc. For robustness, I also considered cohorts of 10 years, obtaining similar results.

The empirical computations of \(\Delta \)-assimilation measures on cohorts are meaningful if the individual characteristics of the immigrants in each cohort are similar, and the data confirm this, at least for the attributes recorded in the ESS (gender, marital status, number of children, years of education, hours worked, income). The only systematic difference across cohorts that I find in the data is that the immigrants that spent more years in the country are older, but this is the by-product of the cohort construction.

5 Testing the model

I propose several tests to assess the model performance at matching the empirical evidence regarding features that it was not calibrated to match. First, I analyze the model ability to replicate several stylized facts on assimilation as they emerge from the data (Section 5.1). Second, I analyze the model predictions with respect to the dynamic of assimilation (Section 5.2). Third, I test the model performance at matching the within-country variability of sigma assimilation (Section 5.3). Fourth, I show that the model is able to replicate the empirically observed positive relationship between cultural assimilation and individual well-being (Section 5.4). Unfortunately I cannot test the model performance at matching empirical data on social networks, because of the absence of detailed, and comparable, cross-country information. The only relevant available information is the fraction of socially marginalized individuals in the majority, but I use it to calibrate the model.

5.1 Stylized facts on \(\sigma \)-assimilation

I first establish some empirical stylized facts on \(\sigma \)-assimilation and then simulate artificial cross-sections in the model to show that it is able to replicate them. The scope of the empirical analysis is only to highlight the factors that are correlated with the cultural assimilation of the immigrants, and none of the results must be interpreted causally. Notice also that the empirical analysis highlighted additional empirical results, discussed in appendix, on which however I will not focus in the rest of the paper. Given the individual empirical measures of \(\sigma \)-assimilation \(\bar{S}^{k}_{i t}\), I run regressions of the following form:

$$\begin{aligned} \bar{S}^{k}_{i t} = \eta _{k} + \delta _{t} + \Upsilon X_{ikt} + \Gamma Y_{kt} +\varepsilon _{ikt} \end{aligned}$$
(9)

where \(\eta _{k}\) are country fixed effects, \(\delta _{t}\) are survey wave (time) fixed effects, \(X_{ikt}\) are individual-level covariates, and \(Y_{kt}\) are country-level covariates. Since there is no panel component in the ESS, I cannot add individual fixed effects to the regression. In line with the model set-up, I consider two determinants of assimilation with country-level variability, the immigrants’ share from EUROSTAT and a measure of cultural pluralism or heterogeneityFootnote 14 of the majority computed as the country-average cultural distance between each majority agent and the rest of the majority. For what concerns the determinants of assimilation with individual variability, I consider, in line with the model: the self-assessed frequency of social interaction, as it emerges from the ESS question: “How often do you socially meet with friends, relatives or colleagues”; the self-assessed importance to “[...] follow tradition and customs,” again from the ESS; the microsatellite genetic distance (Cavalli-Sforza et al. 1994; Pemberton et al. 2013; Spolaore and Wacziarg 2009, 2018) between the immigrant’s origin country and her host country, which I use as a proxy for the initial cultural distanceFootnote 15 (Desmet et al. 2011, 2017; Spolaore and Wacziarg 2015); three additional controls for cultural similarity, namely geodesic distance between the origin and the host country capital cities, linguistic proximityFootnote 16 (from the Automated Similarity Judgment Program - ASJP) and religious distanceFootnote 17 Mecham et al. 2006 between the origin and the host country of each immigrant; a dummy equal to 1 in case the immigrant has citizenship; the number of years that the immigrant spent in the country; the age at immigration; a dummy equal to one for EU born immigrants; gender, education, employment status and religious denomination.

Table 2 Covariates cultural assimilation

The regression results, or stylized facts on the \(\sigma \)-assimilation of immigrants, are summarized in Table 2. The first evidence, which is new to the literature and which basically motivates the analysis, is that assimilation is weaker (bigger cultural distance between immigrants and natives) in case of pluralistic societies characterized by a higher variability of opinions within the majority. Second, more socially active immigrants are more assimilated (lower cultural distance from the majority), which is consistent with the analysis by Verdier and Zenou (2017) and Patacchini and Zenou (2012). Third, assimilation is stronger for immigrants that spent more time in the country, suggesting that there is cultural assimilation over time, in line with the evidence of Giavazzi et al. (2019) for the US and Manning and Roy (2010) for the UK, among others. Fourth, immigrants with a higher initial cultural distance are less assimilated, which is again in line with the result of a differential speed of assimilation by country of origin in Giavazzi et al. (2019). Fifth, assimilation does not depend on the immigrants’ share in the country.Footnote 18 Sixth, cultural assimilation is a convex function of the importance assigned to tradition, which extends the result by Giavazzi et al. (2019) who find more assimilation in case of traits parents care more about. These regression results are robust across specifications, that is for alternative sets of controls. Moreover, the results are robust in case of inclusion of the square of the immigrants’ share, to account for possible non-linear effects in line with the analysis in Lazear (1999), and the coefficient on the square of the immigrant share turned out to be not statistically different from zero.

To the test if the model is consistent with the empirical correlations, I simulated an artificial cross-section of individual \(\sigma \)-assimilation, feeding the model with random choices of the main parameters that reflect the empirical determinants of assimilation (within their empirical ranges), to then regress \(\sigma \)-assimilation on those parameters. The results are summarized in Table 3 for a simulation of 200 artificial countries,Footnote 19 and closely resemble the empirical results. Consistently with the empirical analysis, the largest effect is for pluralism, while the effect of social participation is only minor. The convex relationship between assimilation and the importance of tradition is a consequence of the cultural evolution process. In particular, minority agents who are very closely tied to their traditions have a high disutility threshold and rarely change opinion, resulting in less \(\sigma \)-assimilation. Minority agents with very small thresholds, instead, change opinion very often, and the resulting lack of stability prevents them from assimilating (Table 4).

Importantly, notice that these results can also be used to motivate the construction of the model. In particular, the relationship between assimilation and pluralism can emerge only in a model with individual heterogeneities (more on that in Section 5.3). Similarly, the endogenous choice of networks is what delivers the relationship between social participation and assimilation.

Table 3 Regression on simulated individual data

5.2 Assimilation over time

The second test of the model entails its performance at capturing the dynamics of cultural assimilation. The exercise consists in feeding the model with the observed cross-section of parameters, computing \(\Delta \)-assimilation measures, and comparing them to the empirical counterparts. The results are summarized in columns (1) and (2) of Table 5. To smooth out the computational noise, I simulated 50 cross-sections and then took averages over simulation runs. The correlation between the simulated and the empirical measure of \(\Delta \)-assimilation is equal to 0.612 when using 5 years cohorts, and 0.538 when using 10 years cohorts. Overall, the model captures reasonably well the evolution of cultural assimilation. This result is important because it also address one of the potential model drawbacks, which is the absence of selective return migration: non-assimilated immigrants are more likely to leave the country, meaning that the model is at risk of under-estimating assimilation over time, but the results show that this is not the case. One potential explanation is that return migration is not common in the dataset that I have, perhaps because of the relatively short time span.

Table 4 Assimilation and well-being
Table 5 Evaluating the model performance

5.3 Individual variability of assimilation

The third test of the model concerns its ability to match the observed within-country variability of \(\sigma \)-assimilation. This is particularly important because it helps motivating one of the main modelling choices, namely the individual heterogeneities over opinion switching costs.

I simply check if the simulated and the empirical distributions of cultural distance between the immigrants and the natives are statistically different, running Kolmogorov-Smirnov tests of equal distributions in each country. The results are reported in column (3) of Table 5. Averaging over simulations, the null hypothesis of equal distributions cannot be rejected, at the 1% confidence level, in 61% of the cases.Footnote 20 Figure 1 plots the simulated and the empirical distribution of \(\sigma \)-assimilation for 9 selected countries, where the simulated distribution were re-scaled to the mean of the observed one. The countries are simply selected on the basis of their prominence, size and political importance. The graphs for the other countries in the sample are in on-line Appendix.

Overall, the model implies a slightly higher variability of assimilation with respect to the data, but it matches the distributions particularly well for many countries such as Germany and Ireland. The model however does a particularly poor job at matching the observed variability of \(\sigma \)-assimilation for Italy. To gain more insights on the potential determinants of the model performance along this dimension, I regressed the cross-section of euclidean differences between the simulated and the empirical distributions of cultural distance on the parameters used for the simulation. The result is that the minority share, the social density, the initial cultural distance between the majority and the minority and cultural pluralism in the majority are not statistically significant. Thus the differences between the model and the data are not systematically related to most of the mechanisms that drive assimilation in the model. The only significant variable in the regression (at the 5% level) is the shape parameter b of the Beta distributions of the thresholds \(\hat{H}^{m}_{i}\), with higher values associated to smaller distances. Since b is positively related to both the mean and the variance of the thresholds, this means that the model works best in case of sufficiently high and volatile thresholds. This results is not surprising, given that, with small thresholds, opinion changes become too frequent, which translates into an excessive variability of \(\sigma \)-assimilation with respect to the data. In case of Italy, the problem is exactly the small threshold and the resulting instability.

Fig. 1
figure 1

Simulated and empirical distributions of cultural distance. Notes: Simulated (solid line) and empirical (dashed line) distributions of cultural distance for 9 selected countries. Simulation for immigrants. Empirical measures for immigrants who have spent at least 10 years in the country. Simulated distributions re-scaled to the mean of the empirical ones

5.4 Assimilation and well-being

The last test of the model performance entails the relationship between cultural assimilation and individually assessed well-being. As empirical measure of well-being, I use the ESS questions that asks about the feeling of happiness; in the model, I simply look at the accumlated disutility. The results are summarized in Table 4 and show that, both in the model and in the data, assimilation and well-being are positively correlated. In greater detail, column (1) of Table 4 reports the results of the regression of the ESS question on happiness on the empirical measure of \(\sigma \)-assimilation with the same empirical specification of Eq. 9. Column 2 reports instead the results of the regression of a simulated model indicator of individual well-being, equal to 1 minus the average disutility over all issues at the end of the simulation, on the individual measures of \(\sigma \)-assimilation and on the parameters used for the simulations, in the spirit of the exercise summarized in Section 5.1. In both cases, the regression coefficient on \(\sigma \)-assimilation is negative and strongly significant: the bigger the average cultural distance from the majority, the lower the well-being.

6 Comparative statics

In this section, I discuss the comparative statics, starting from a benchmark calibration with the cross-sectional median values of the main model parameters. I look at the response of 4 outcomes: the distributions of \(\sigma \)-assimilation over minority agents at the end of the simulation (kernel estimate), the median and the interquartile range of the distribution of \(\Delta \)-assimilation (in percentage terms, with a linear smoothing), the degree distribution of the social networks at the end of the simulation (normalized number of social contacts, kernel estimate) and the average segregation of minority agents, measured as the ratio of social contacts within the minority to total social contacts.

Pluralism

The first comparative static is for pluralism, that is for the variance of opinions within the majority equal to \(\beta (1-\beta )\), which is central to the analysis. The results are in Fig. 2. Since the majority agents are allowed to change opinion, the comparative statics must be intended with respect to the initial level of pluralism.

More (initial) pluralism in the majority impairs both \(\sigma \) and \(\Delta \) assimilation. To get the basic intuition, consider a single minority agent that disagrees about a given issue in a society in which a share \(\beta > 0.5\) of the population agrees (smaller values of \(\beta \) do not lead to any action), and in which majority agents are so attached to their opinions that they never change them. The bigger is the value of \(\beta \), the bigger is the probability, for a minority agent, to accumulate disutilities from purely random social interactions and, therefore, the more likely is the agent to switch, therefore reducing his cultural distance from the majority. But the bigger is \(\beta \) the smaller is the level of pluralism in this country, which explains the result. So the lower levels of assimilation in heterogeneous societies, in the model, does not stem from the endogenous network formation channel and from the endogenous opinion changes in the majority.

Fig. 2
figure 2

Comparative statics: pluralism. Notes: \(\beta (1-\beta )\) is the variance of opinions in the majority (see text). Upper-left panel: kernel density estimates of the simulated distributions of cultural distance between the minority agents and the majority at the end of the simulation. Upper-right panel: median and interquartile range of the distribution of the change in cultural distance (percentage terms, linear smoothing). Lower-left panel: kernel density estimates of the simulated degree distribution of the social networks (normalized number of social contacts) at the end of the simulation (all agents). Lower-right panel: average segregation for minority agents (ratio of social contacts within the minority to total social contacts). Model calibration based on the median empirical ranges (see text) except for \(\beta \)

In a model with endogenous network formation, there is an additional important effect of pluralism, again going through the increased volatility of the cultural identity of the majority agents in social matchings. Since it is easier, in case of high pluralism, to match with culturally distant agents, flags become more frequent, and social networks become, on average, smaller and more segregated, thereby reducing social contacts between the minority and the majority and, therefore, further impairing assimilation. The fact that culturally heterogeneous societies become more segregated is consistent with previous literature on social networks (see Currarini et al. 2010, among others).

Overall, both \(\Delta \) and \(\sigma \) assimilation are more difficult in pluralistic societies, and small enclaves emerge. As an empirical example, minorities in Italy are more \(\sigma \)-assimilated with respect to France, despite the similar cultural distance and social density, because of the cultural homogeneity of the majority. Consistently with the empirical analysis, the magnitude of the effect of pluralism on assimilation is also quantitatively large. Importantly, these comparative statics will be the same even in case of different levels of the initial cultural distance between the majority and the minority (unless of course this is very small, but that would make the problem uninteresting), in case of different levels of social participation, and in case of different minority shares. In case of minorities that come from culturally more distant countries, the effect on segregation is actually bigger, as expected. Moreover, even assuming that only minority agents are allowed to change opinions will deliver the same results.

Minority share

Figure 3 shows the comparative statics with respect to the minority share \(\lambda \). This is useful to understand how will assimilation patterns change either in case of new immigration or in case of a higher population growth rate for the minority. Bigger minorities, for fixed social participation, imply a higher probability of social interactions with minority agents, even in a framework of purely random matchings. In case of homogeneous minorities, sufficiently different from the majority, this would imply less assimilation. But in case of heterogeneous minorities, more intragroup contacts do not necessarily lead to less assimilation, since the interactions between culturally different minority agents can also trigger opinion changes. As for network formation, bigger minority shares lead to the emergence of social networks that are more segregated, and, on average, smaller especially in case of lower heterogeneity within the minority, as a consequence of less frequent flags, thereby making intra-group contacts within the minority more frequent. Less interactions of minority agents with the majority in these more segregated networks, in turn, impair assimilation. The net effect on assimilation of these contrasting forces, for empirically plausible levels of \(\lambda \) and \(\alpha \), as shown in Fig. 3, is actually very small, which is also confirmed by the evidence discussed in Section 5.1. The effect on assimilation of bigger minority shares will be a reduction only in case of very big (\(\lambda > 30\%\)) minorities, although, in this case, the very notion of a minority is challenged. The comparative static is consistent with the empirical evidence. For instance, Switzerland and Norway are both characterized by assimilated minorities (small average empirical \(\sigma \)-assimilation), although the immigrants share in Switzerland is twice as big as the one in Norway. Moreover, Poland, Hungary and Czech Republic are characterized by non-assimilated minorities even if the immigrants share is quite low.

Fig. 3
figure 3

Comparative statics: minority share. Notes: \(\lambda \) is the minority share. Upper-left panel: kernel density estimates of the simulated distributions of cultural distance between the minority agents and the majority at the end of the simulation. Upper-right panel: median and interquartile range of the distribution of the change in cultural distance (percentage terms, linear smoothing). Lower-left panel: kernel density estimates of the simulated degree distribution of the social networks (normalized number of social contacts) at the end of the simulation (all agents). Lower-right panel: average segregation for minority agents (ratio of social contacts within the minority to total social contacts). Model calibration based on the median empirical ranges (see text) except for the minority share

Fig. 4
figure 4

Comparative statics: initial average cultural distance. Notes: \(\alpha -\beta \) is the average cultural distance between the majority and the minority at the beginning of the simulation (see text). Upper-left panel: kernel density estimates of the simulated distributions of cultural distance between the minority agents and the majority at the end of the simulation. Upper-right panel: median and interquartile range of the distribution of the change in cultural distance (percentage terms, linear smoothing). Lower-left panel: kernel density estimates of the simulated degree distribution of the social networks (normalized number of social contacts) at the end of the simulation (all agents). Lower-right panel: average segregation for minority agents (ratio of social contacts within the minority to total social contacts). Model calibration based on the median empirical ranges (see text) except for cultural distance

Initial average cultural distance

Figure 4 shows the comparative statics with respect to the initial average cultural distance of the minority \(\alpha -\beta \). The bigger is the average initial cultural distance, the bigger is the probability of social matchings with majority agents with different opinions, meaning that opinions will change more often but also that flags become more frequent. The more frequent flags, in turn, determine the emergence of segregated social networks that, for minorities that are more different from the majority, impair assimilation. Moreover, there will also be more volatility of assimilation as a result of increasing randomness in the identity of the individuals in matchings determined by the bigger initial cultural distance. Overall, there will be a bigger share of non-assimilated minority agents, and less overall \(\sigma \)-assimilation, with smaller and more segregated social networks. As an empirical example, among the (potentially many) reasons why big minorities in Switzerland are more \(\sigma \)-assimilated than small minorities in Hungary, there is the smaller average initial cultural distance between the majority and the minority in Switzerland. Shutting down the endogenous network formation channel determines smaller effects on \(\sigma \)-assimilation, although the main results of less \(\sigma \)-assimilated minorities in case of bigger initial levels of \(\alpha -\beta \) remains. The comparative statics is also similar in case of bigger minorities.

Social density

Figure 5 shows the comparative statics with respect to the maximum level of social density D. Along this exercise, the average, country-level, social participation increases, but its within-country variability remains fixed. In the absence of endogenous network formation, different levels of maximum social density will not impact on assimilation, as this will only increase proportionally the number of social matchings with culturally different and culturally similar individuals. In case of endogenous network formation, a higher social density implies more opinion changes and more flags because of the more frequent matchings with culturally distant individuals. The net effect on assimilation is a small increase, because the distribution of social contacts shifts rapidly to the left, impairing assimilation, and limiting the positive effect on assimilation of increases opinion changes. Overall, socially dense societies display more \(\sigma \) assimilation, smaller social networks and stronger social segregation, although the magnitude of the effect on assimilation is small, consistently with the empirical evidence. In case of bigger or culturally closer minorities, the overall effect of social density on assimilation is even slightly smaller.

Fig. 5
figure 5

Comparative statics: social density. Notes: dens is the normalized maximum level of social density (see text). Upper-left panel: kernel density estimates of the simulated distributions of cultural distance between the minority agents and the majority at the end of the simulation. Upper-right panel: median and interquartile range of the distribution of the change in cultural distance (percentage terms, linear smoothing). Lower-left panel: kernel density estimates of the simulated degree distribution of the social networks (normalized number of social contacts) at the end of the simulation (all agents). Lower-right panel: average segregation for minority agents (ratio of social contacts within the minority to total social contacts). Model calibration based on the median empirical ranges (see text) except for the maximum social density. D

7 Robustness and extensions

In this section, I analyze the robustness of the results to alternative parameter values (Section 7.1), and discuss two model extensions: to families (Section 7.2) and to polarized majorities (Section 7.3). In all cases, the analysis of the model delivers similar results. Further robustness exercises can be found in the on-line Appendix.

7.1 Robustness

The baseline calibration features an arbitrary normalization of the maximum possible social density D to 5% (each agent engages at most in \(0.05 \cdot N\) social matchings per period). I simulated the model with two alternative maximum density values, respectively 10% and 2.5%, and the simulation results turned out to be very similar to the benchmark (see Tables 2 and 5).

As a second robustness test, I considered a model extension to encompass memory and discounting of past issue-specific disutilities. In particular, I defined the total accumulated disutility \(H^{m}_{it}\), for each agent i and issue m, as a weighted average of the actual disutility \(h^{m}_{it}\) and of the discounted past disutilities \(H^{m}_{it-1}\):

$$\begin{aligned} H^{m}_{it}=\phi _{i} (1-\mu ) H^{m}_{i t-1} + (1-\phi _{i}) h^{m}_{it} \end{aligned}$$
(10)

where \(\mu \) is the discount rate of past disutility, and where \(\phi _{i}\) and \(1-\phi _{i}\) are the idiosyncratic weights of, respectively, past and current disutilities. Then, I assumed that changes of opinions occur when the total disutility \(H^{m}_{it}\) is above the threshold \(\hat{H}^{m}_{it}\). Within this new specification, the baseline model is a particular case with \(\mu =0\) (full memory). With discounting (shorter memories, \(\mu >0\)), given the thresholds \(\hat{H}^{m}_{i}\), there are less frequent changes of opinions, resulting in a slower convergence. The model performance at matching the empirical evidence, however, remains unchanged. Tables 2 and 5 show some examples for \(\mu =0.5\) and \(\mu =0.9\) and weights to past disutilities \(\phi _{i}\) randomly drawnFootnote 21 from a uniform distribution between 0 and 1.

As a third test, I considered three alternative values of the thresholds number of social contacts below which the agents are defined as socially marginalized, respectively 10% of N, 1% of N, and below 1.5 standard deviationsFootnote 22 from the mean of the country-level distribution of social contacts. The model performance is in line with the benchmark (see Tables 2 and 5), and the comparative statics unchanged. Note that assuming discounting of the individual disutilities which are relevant for flagging individuals, much as in the previous robustness test for the issue-specific disutilities, would result in higher thresholds to exclude individuals, without much changes in the equilibrium result.

Flags, in the model, are irreversible, which means that flagged agents cannot be re-instated in the social network in case the agent changes opinions as part of the assimilation process. To test for robustness, I considered an alternative model with reversible flags, either at a fixed exogenous rate, or with a per-period probability equal to the proportion of opinions changed in the period as follows: \(\frac{1}{M} \, \sum _{m=1}^{M} \, \mathbbm {1}_{\{q^{m}_{i t} \ne q^{m}_{i t-1} \} }\). In both cases, the results turned out to be in the line with the benchmark, even in case of a relatively high exogenous reversal probability of 0.5. The only difference, in this alternative model, is that the composition of the social networks tends to be more stable over time.

One potential drawback of the analysis is that the model is calibrated at an aggregate level, ignoring potential differences across territories. I tried calibrating the model to a cross-section of NUTS 2 regions within the same countries considered in the analysis. The problem with this alternative, which is the main reason why I did not choose it as benchmark, is that there are not enough immigrants respondent to the ESS in each region to meaningfully calibrate the model and to reliably compute the empirical assimilation measures, whose cross-sectional variability might simply reflect differences in individual characteristics. Moreover, the classification into regions also changed in the period that I analyze, making also spatial comparisons complicated.

A potential problem with the computation of the empirical \(\sigma \)-assimilation measure is that, if the number of questions answered by each pair \(\bar{M}_{ij}\) is too volatile, the resulting assimilation measures are not comparable. Focusing on a common subset of questions, answered by all agents, is not a feasible solution because it will severely limit the scope of the analysis, that hinges on the complexity of the definition of culture. For the empirical application on ESS data, this is not an issue, since the number of missing answers is rather small. To ensure comparability even in case of high volatility of the number of answered questions, I introduced a small correction, basically assuming a disagreement rate, over the non-answered questions, equal to its country-level average. In greater detail, the average corrected measure of \(\sigma \)-assimilation for the minority agent i in country k and time t, is computed as follows:

$$\begin{aligned} \hat{S}^{k}_{i t}= \frac{1}{\vert MJ \vert } \, \sum _{j \in MJ} \, \frac{1}{M} \, \Bigg [ \sum _{m=1}^{\bar{M}_{ij}} \mathbbm {1}_{ [ a^{m,k}_{it} \ne a^{m,k}_{jt} ]} \, + (M-\bar{M}_{ij}) \, \Pi ^{k} \Bigg ] \end{aligned}$$
(11)

where \(\Pi ^{k}\) is the average distance over all minorty-majority pairs in country k:

$$\begin{aligned} \Pi ^{k}=\frac{\sum _{i \in MN} \, \sum _{j \in MJ} \, \Big [ \sum _{m=1}^{\bar{M}_{ij}} \mathbbm {1}_{ [ a^{m,k}_{it} \ne a^{m,k}_{jt} ]} \Big ] }{\sum _{i \in MN} \, \sum _{j \in MJ} \, \bar{M}_{ij}} \end{aligned}$$
(12)

This correction is equivalent to weighting the individual hamming distances between the vectors of opinions of agents i and j proportionately to the number of answered questions. The correlation between \(\hat{S}^{k}_{i t}\) and \(\bar{S}^{k}_{i t}\) in the cross-section of individual data turned out around 0.99, as a consequence of the relatively small number of missing answers. The conclusion is that the discrepancies in the number of answered questions is not a concern.

7.2 Extension I: families

The baseline model with fully random matchings within the social network ignores the role of the relationships within the family. The motivation is that my model focuses on immigrants and, in particular, on their relationship with the majority in the host country, while family relationships are more important, perhaps crucial, drivers of cultural assimilation for second-generation immigrants (Bisin and Verdier 2000, 2001, among others). To test the model robustness, I extended it to account for family relationships, in reduced form, assuming that a fixed fraction of the social matchings takes place with the same subset of individuals drawn from the own social group, and that those individuals cannot be excluded from the social network regardless of the accumulated disutility.

More formally, for each agent i from the minority (majority), \(\xi _{i}\) percent of the per-period social matchings entails a subset \(\Xi _{i}\) of \(\xi _{i} \gamma N\) minority (majority) agents,Footnote 23 with \(f_{it}^{j}=1 \,\forall j \in \Xi _{i}\) at each point in time. I randomly draw the \(\xi _{i}\) from a uniform distribution between 0 and 1, where 0 is intended to model agents without any family relationship and 1 agents whose social contacts are only within the family. Importantly, I assumed that the disutility assigned to disagreement among people within the family is equal to the disutility assigned to disagreement with people outside the family, thereby ignoring the potentially higher weight given to family interactions, especially by younger individuals.Footnote 24

There is less overall cultural assimilation in this alternative model: the median cross-country \(\sigma \)-assimilation is 0.466 (std 0.075) as compared to a benchmark of 0.449 (std 0.071), while the median cross-country \(\Delta \)-assimilation is \(-\)8.51 (std 5.51), versus \(-\)12.79 (std 9.24) of the baseline model without families. Putting it differently, cultural traits tend to be, on average, more persistent. The model performance at matching the stylized facts of \(\sigma \)-assimilation is however intact (see Table 2), and the model can still match the cross-country variability of \(\Delta \)-assimilation (see Table 5). Family relationships, together with a lower level of \(\sigma \)-assimilation, also determine a smaller variability of \(\sigma \)-assimilation, mostly because there are fewer well-assimilated minority agents. As a result, the model performance at matching the within-country variability of \(\sigma \)-assimilation improves. As for the comparative statics, they are in line with the benchmark.

The conclusion from this exercise is that, if family relationship are important, the benchmark calibrated model will slightly underestimates the persistence of culture and it will overestimate the variability of assimilation.

7.3 Extension II: polarized majorities

The baseline model features one majority group defined by an average agreement rate \(\beta \). In this ection, I extend the model to a setting with a polarized majority composed by two groups. In particular, I assume that the average agreement rate among a fraction \(\eta \) of the majority is equal to \(\beta _{\eta }\), while it is \(\beta _{1-\eta }\) among the remaining fraction \(1-\eta \). To preserve the structure of the calibration, I set \(\eta \beta _{\eta }+(1-\eta ) \beta _{1-\eta }=\beta \), so that the country-level average agreement rate of the majority is the same as the benchmark. To have culturally diverse groups in the majority, I set \(\beta _{1-\eta }=2 \beta _{\eta }\). To keep the interpretation of the \(1-\lambda \) share of the population as the majority, I need to have both \((1-\lambda )\eta > \lambda \) and \((1-\lambda )(1-\eta ) > \lambda \), which means that the model can be solved only for minority shares \(\lambda \) smaller than 1/3. I consider a baseline value of \(\eta =0.5\), corresponding to majority groups of equal size.

In this alternative model, the average agreement rate among the minority tends to be, over time, in between the average opinions of the two majority groups, a result which is akin to the convergence to a “Neutral” culture in fragmented societies highlighted by Lazear (1999). (see the on-line Appendix for an illustration). The model performance (see Tables 2 and 5) and the comparative statics are close to the benchmark. The results obtained with \(\eta =0.25\) and \(\eta =0.75\) are, in all respect, similar. In conclusion, the results from the analysis are robust to the possibility of polarization in the majority.

8 Discussion and conclusion

Summarizing the results from the analysis, the model implies: (1) Less assimilation in case of pluralistic (culturally heterogeneous) majorities. (2) Less assimilation for minorities coming from culturally distant countries. (3) More assimilation in ase of higher social density. (4) No effect of the minority share on cultural assimilation. (5) More average segregation for culturally distant and big minorities, for high social density and for pluralistic majorities. (6) Smaller social networks in pluralistic and socially denser societies.

The model has several implications when it is interpreted in terms of immigration (i.e., with a minority made up by immigrants). First, it helps understanding why immigration is more accepted in some areas or historical period rather than others. For instance, assuming that social media increase social density, in the sense of increasing the frequency of social interactions and exchanges of ideas, than the model predicts less assimilation and, therefore, less acceptance of immigration when the use of socials is widespread. This can explain the strong anti-immigration responses in Europe following the refugees crisis (2014–2017). Second, the model can be used for policy predictions. For instance, it implies that restrictive immigration policies that reduce the immigration quotas will not foster cultural assimilation. Similarly, all policies whose goal is to increase the fertility rate among the natives, whenever population growth is higher among the immigrants, will not influence assimilation.

The question, however, is if assimilation is a legitimate policy objective to pursue and, as already stressed from the introduction, the model is ill equipped to perform this analysis. The main reason is the absence of one of the positive effects of cultural diversity, highlighted, among others, by Ashraf and Galor (2013), namely its possibility to foster creativity and technological change. Moreover, assimilation to a culture that does not promote, say, creativity, and patience, will not be beneficial for long-run growth even if, per se, assimilation will determine lower transaction costs between individuals.

The methodology that I propose to assess cultural assimilation can also be used to quantify assimilation of behavior, for instance with respect to financial decisions or fertility choices, using appropriate surveys to calibrate the model and to compute the empirical assimilation measures. It can be also used to study assimilation with respect to specific subgroups of the population defined by socio-demographics characteristics, as well as for second-generation immigrants and for other, self-assessed, minorities, The problem is that such an analysis requires larger surveys than the ESS, with a sufficiently big number of minority agents to meaningfully compute the empirical measures, together with abundant outside information to properly discipline the model. In this respect, the aggregate, country-level analysis of assimilation that I proposed is limited in scope: assimilation at the country level might hinder the presence of non-assimilated cultural enclaves among specific groups or areas.

The analysis can be also extended to residential segregation, either as an alternative to social segregation or as an additional individual choice. In the former case, minority agents would react to the accumulated disutility either changing opinions or changing neighborhood, choosing the one where the probability to match with similar individuals is higher. In the latter case, the agents would change neighborhood in case of high disutility regardless of the small social networks. The problem with both extension is that the calibration requires survey data at a very fine geographical level in order to capture, say, the nuances of neighborhood choice within a city. The clear direction for future research is to attempt a more disaggregated analysis, perhaps developing an ad-hoc survey.

The epidemiological model can also be used to study the emergence of fads or, in general, the diffusion of ideas among specific groups. For instance, it can rationalize the selective prevalence of a xenophobic attitude conditional on the use of social media, on the population density and on the identity of the immigrants in the city or neighborhood. Alternatively, it can be used to explain technology adoption. These are all potential avenues for future research.

One drawback of the analysis is that I do not propose an empirical validation of the model results with respect to social networks. The main reason is that such an analysis requires comparable cross-country data on social network formation and evolution over time, ideally in a context of increasing immigration, which I do not have. A further direction for future research is assessing the model performance at explaining the observed social networks.