1 Introduction

Significant geographic disparities in adult mortality by place of residence are ubiquitous and well entrenched in high-income populations (Boyle, 2004; Wilmoth et al., 2011). These disparities may reflect the impact of a combination of factors characteristic of places of residence, including income (Chetty et al., 2016), exposures to hazards (air and water pollution (Dwyer-Lindgren et al., 2016; Murray et al., 2006; Rogerson & Han, 2002), public infrastructure (Ezzati et al., 2008), health and medical care service (Finkelstein et al., 2021), income inequality (Wilkinson & Pickett, 2008), and socioeconomic mobility (Venkataramani et al., 2020). In most populations studied so far, regional disparities are a persistent feature as is the ranking of mortality levels that places exhibit. Data from the USA, for example, indicate that the magnitude of disparities across states is as large as 8 years of life expectancy at birth (Ezzati et al., 2008; Murray et al., 2006; Wilmoth et al., 2011). Much larger gaps are found at lower levels of aggregation (counties) as they span values between 0 and 20 years (Kindig & Cheng, 2013; Kulkarni et al., 2011).

In Spain, the country we study in this paper, as well as in a handful of other high-income countries studied in previous research (Boyle, 2004; Wilmoth et al., 2011), regional mortality disparities are less sharp than in the USA but still quite large. As of 2003, the gap in life expectancy at birth across Spain’s provinces was of the order 5.1 years, whereas in Autonomous Communities (the largest aggregate units) the gap was about 2.6 years (INE, 2022).Footnote 1 As in the USA, these disparities are attributed to the influence of contextual characteristics of places of residence. Indeed, with a few exceptions (Fletcher et al., 2023; Xu et al., 2020), the determinants of disparities identified by standard research, are always contextual or aggregated individual attributes, such as income and education, of places of residence. The impact of composition of the population by migrants’ origin and their health characteristics are given short shrift. This practice is generalized since most deaths and population data only identify individuals’ place of residence at the time the information is collected. As a consequence, the only accounting possible is one in which observed disparities are imputed to differences in contextual (or aggregated individual) traits of places of residence. This is a risky default explanation as some disparities may be the outcome of regional heterogeneity in the composition of individuals by life history exposures, all of which are concealed from view in this type of data.

To overcome some of these shortcomings, we use a unique data set from Spain which avails us with information on mortality by both place of birth and place of residence. As we argue below, and as shown by recent research, complementing information on mortality by place of residence with information on mortality by place of birth, can significantly enrich the accounting of regional mortality disparities (Fletcher et al., 2023; Xu et al., 2020).

The paper has two goals. First, we estimate life table functions by place of birth and place of residence and illustrate the extent to which these differ from each other. The second goal is to address the problem posed by health selection and, simultaneously, verify hypotheses from the Developmental Origins of Adult Disease (DOHaD) paradigm.Footnote 2 We propose a model that decomposes life expectancy into factors associated with place of birth, residence, and selection effects. We also propose a simple expression to estimate the magnitude of ‘heritability’ of place of origin conditions, e.g., the degree to which migrants’ mortality in places of residence resembles mortality experienced in places of birth. This heritability measure is an indicator of the extent to which exposures during early life in places of birth partially manifest, as the DOHaD paradigm would predict, in older adult ages mortality in places of destination.

2 Background

Under conditions of very low or no residential mobility, some of the many candidate contextual traits in places of residence may be sufficient to account for the whole of mortality disparities across places. But doing so in populations with significant residential mobility could be misleading.Footnote 3 Thus, for example, in the USA about 30% of 2010 adults resided in a state they were not born in. Spain is also a country with important flows of internal migrants as 28% of the population aged between 50 and 82 in 2003 reside in Autonomous Communities (largest aggregation units) they were not born in. Similar conditions characterize Canada, Germany and France (Wilmoth et al., 2011). In all these cases, observed mortality disparities may include a non-negligible component associated with conditions experienced by migrants early in life that might be reflected in individuals’ place of birth.

2.1 New Accounting of Regional Mortality Disparities

Ignoring the foregoing shortcomings of standard mortality statistics can be consequential for two reasons. First, in most cases, the population distribution by place of residence is shaped by past migration flows. Second, there is abundant empirical research demonstrating that individuals’ adult health and mortality risks are influenced by their life course exposures and, in particular, their experiences during early childhood (Ben-Shlomo et al., 2014; Gluckman, 2006; Gluckman & Hanson, 2004; Haas, 2008; Hayward & Gorman, 2004; Kuh & Shlomo, 2004). An improved analytic strategy is to complement information on mortality by place of residence with information on mortality by place of birth. The latter is an indicator, albeit imperfect, of experiences during an important stage of individuals’ life history, namely, soon after birth and during early childhood.

2.2 Healthy Migrants, Salmon Bias and the Enduring Legacy of Early Exposures

Two factors account for the interplay between residential mobility and observed regional mortality. The first is reverse causality, e.g., residential mobility is partially dependent on health status. Its relevance is a function of how health selection shapes the composition of immigrants in regions of destination. This is known as “healthy migrant effect”, one whereby migrant populations tend to have, at least soon after migration, more favorable health and mortality profiles than large swaths of the population at destination (Abraído-Lanza et al., 1999; Palloni & Arias, 2004; Palloni & Morenoff, 2001; Turra & Elo, 2008). One explanation for this pattern is that, except for some refugee populations, immigrants are a draw from subgroups at origin with better health status at the time of migration. Conversely, the so-called salmon bias (Abraído-Lanza et al., 1999; Dunlavy et al., 2022; Palloni & Arias, 2004; Palloni & Morenoff, 2001; Turra & Elo, 2008) refers to a phenomenon whereby immigrants return to the place of origin as a consequence of ill-health. The existence of a healthy migrant effect will produce two results: It will improve the average (and perhaps increase the variance) of health status and mortality risks at destination and, almost surely, will diminish those at origin. Different places harbor migrant populations from different origins, of different size, and are, therefore, characterized by variable health selection. We should then expect that at least a fraction of observed regional disparities by place of residence, e.g., those embedded in traditional measures of mortality, could be rooted in the directionality and size of migration flows. On the other hand, the impact of the “salmon effect” induces reverse health selection and should also have an impact on observed regional disparities. Its magnitude will depend on the prevalence of the selection mechanism and the places most compromised by it.

The second factor associated with residential mobility that could account for regional mortality disparities is the strength of the association between early conditions, adult health and mortality, and migration flows (Galobardes et al., 2004, 2006; Hamad et al., 2016; Hayward & Gorman, 2004; Palloni et al., 2009). This association could either reinforce or dilute observed mortality disparities by place of residence. Most migrants are accompanied by kin and carry with them material possessions. They also carry personal profiles, including their genetic make-up, habits, preferences, and learned behaviors (Deryugina & Molitor, 2020). Importantly, they are carriers of latent risks associated with exposures during early life whose effects are largely manifested post-migration, after attaining some critical adult ages.

In the last 20 years or so, a growing body of research under the so-called Developmental Origins of Health and Disease (DOHaD), has produced empirical evidence supporting a conjecture that was originally posited by Barker (1990, 1998) and later reformulated and extended by Gluckman, Hanson and colleagues (Bateson & Gluckman, 2012; Bateson et al., 2014; Gluckman, 2006; Gluckman & Hanson, 2004; Langley-Evans, 2004). According to it, individuals may be exposed in utero or during infancy and early childhood, to adverse conditions, including nutritional constraints, stressful contexts, or toxic environments. Despite these adverse conditions, they may survive to attain adult ages as a result of plastic adaptations of the embryonic/fetal growth and development plans. However, survival to adulthood may come at a steep price, in the form of inheritance of enhanced risks of chronic illnesses, including obesity, metabolic syndrome, Type 2 Diabetes (T2D), hypertension, coronary artery disease (CAD), stroke, and cognitive decline. These are precisely the conditions that account for the bulk of mortality in modern human populations and, therefore, should also explain the bulk of regional mortality differentials. The onset of these chronic conditions usually occurs after attaining critical ages and we will refer to their impacts as “adult delayed effects”. If places’ mortality levels and prevalence of adverse conditions are correlated and, in addition, the DOHaD’ conjecture is correct, individuals born in places with higher mortality will also experience comparatively longer bouts and higher levels of adverse early conditions that manifest as adult delayed effects.Footnote 4

The conditions that account for a place’s mortality levels are rarely changed over short intervals of time and, instead, are experienced by multiple birth cohorts. Thus, in most cases, the levels of adult mortality in a place at time t reflect conditions that influenced mortality levels some time before, t − k, when adults born in the place were children. Since adult and child mortality are correlated, adult mortality disparities by place of residence at time t will also reflect child mortality disparities at time t − k. Because children born in places with higher mortality are also more likely to experience adverse early childhood conditions, it follows that adult migrants born in places with high mortality at time t − k are more likely to experience adult delayed effects in places of destination than migrants to the same place but born in places with lower mortality. Thus, if delayed effects are at all important, places that receive migrants who are disproportionately drawn from higher mortality places, will experience higher adult mortality, irrespective of conditions at destination. By the same token, migrants to a destination who are born in high mortality places should have higher adult mortality than migrants to the same destination but born in low mortality places.

The linkage between early exposures and adult health conjectured by DOHaD leads to the expectation that adult delayed effects will not be immediately offset by the healthy migrant effect experienced at destination. Delayed effects will only worsen adult migrants’ health and be reflected in a place of destination’s mortality level some time after migrants attain critical ages, e.g., past 40 or 50. Thus, if delayed effects exist, the age and origin composition of migrants arriving to a place could in part account for its adult mortality level, offsetting the healthy migrant effect. By the same token, when the salmon bias is significant, adult migrants who return to their place of origin may do so as a reaction to the manifestation of adult delayed effects. Consequently, the magnitude of the attenuation of the healthy migrant effect by delayed effects will be inversely related to the magnitude of the return migration flow induced by the salmon bias.

In sum, health selection, early experiences, and contemporaneous factors in places of residence are entangled and their influences hopelessly confounded by standard measures of mortality disparities by place of residence. These actually reflect the operation of multiple determinants, including those associated with a place’s health conditions, the composition of immigrants by age and place of birth (healthy migrant and DOHaD type of effects), the composition of the place’s outmigrants by age and health status (healthy migrant and DOHaD effects), and the composition by age and place of destination of returning migrants (salmon bias and DOHaD effects).

3 Methods

3.1 Mortality Data

The mortality microdata used in this study were retrieved from vital statistics records maintained by the Spanish National Institute of Statistics (INE) and cover the period between January 1, 2003 and December 31, 2019. All-cause mortalityFootnote 5 daily counts were aggregated by sex, single year age groups, and geographic location of birth and residence. We focus on the adult population aged 50 and over in 2003 and exclude the foreign born or those with missing information on place of birth. To preserve confidentiality, individuals whose place of residence was a municipality with less than 10,000 inhabitants are assigned the province of residence. Therefore our analyses can only be conducted at the provincial (or at a higher aggregation) level. In this paper we consider provincial information aggregated into 17 Autonomous Communities (AC) (regional NUTS-2 divisions).Footnote 6 To facilitate presentation of results, some of these communities were merged together based on geographic proximity and demographic similarities. In all, we consider 11 regions. Figure S1 identifies the geographic location of the 11 AC and Table S1 displays their full names, acronyms, and the relation between the original 17 NUTS-2 divisions and the final 11 regions we consider in the paper. In addition, Table S1 also displays life expectancy at age 50 for males and females and estimated GDP and unemployment rate as of 2003, the initial year of the mortality data we use in the paper. The poorest AC’s, and also the ones exhibiting some of the lowest life expectancy, are Andalusia (AN) located in Southern Spain, Extremadura, Castilla-La Mancha and Murcia (ECMM), located in Spain’s central region, Asturias and Cantabria (ASC) in the North, and Canary Island, the most isolated Spanish region. In contrast, Madrid (MD) and Spain’s northernmost AC’s (PVNR) near the Spain-France border, are the richest and experience the highest life expectancy. Castilla and Leon (CL) and Galicia (GA) are outliers in the sense that, despite being among the poorest AC’s, they experience some of the lowest mortality levels.

3.2 Population Data

Population data are publicly available as a series of microdata files for the period 2003–2020. They were retrieved from the so-called Continuous Register Statistics (Padrón Continuo). Individuals’ registration in these registers is compulsory by law for all Spanish residents and coverage is virtually complete. As all administrative registers, there might be lags in the registration of residential mobility. In addition, in the case of Spain, it also under reports movements of international migrants within the Spanish territory but this will not affect the analysis we carry out in this paper. Since many of these registration lags involve residential movements within the same municipality, especially in large urban areas, they cannot affect completeness of population counts nor do they influence our estimates.Footnote 7

3.3 Building Life Tables from Administrative Data

The main difficulty we face is that individual population records are not linked to individual death records. As a consequence, we cannot follow individuals residential histories and construct cohort life tables by place of birth, place of residence, and duration of residence, as one would with longitudinal data. However, as we show below, it is possible to construct pooled, pseudo-cohort life tables, by place of birth and place of residence.

The Lexis diagram in Fig. 1 is a representation of the data we employ. The youngest cohort corresponds to those aged 50 on January 1st, 2003 and the oldest to those aged 82 in the same year. Each of these cohorts, as well as those between them, spans a total of 17 years and contributes to population exposure and mortality counts for age groups contained between the age interval (x, x + 1) at the outset (2003) to (x + 16, x + 17) at the end of the period under observation (2019).Footnote 8 Importantly, the Lexis diagram shows that most cohorts contribute to mortality experiences to the same age group as do other cohorts but during different time periods (see below). Our aim is to construct a pooled life table for each place of birth and residence combinations that synthesizes the mortality experience during the entire period under observation. We will thus ignore period (or cohort) effects and focus only the pooled estimates.Footnote 9

Fig. 1
figure 1

Lexis diagram showing the cohort built for this study

Consider a region of birth B and a region of residence R (B and R could be the same). For every place of birth \(B\), and place of residence \(R\), we build life tables starting at age 50. As described in Supplementary Materials, we adjust the number of deaths (and thus the corresponding conditional probabilities of dying) to avoid potential biases due to migration inflows and outflows during year t in a given place. For every year, age and sex, we compute the probability of dying \({q}_{R,B}^{\mathrm{^{\prime}}}\left(x,s\right)\) with the corrected number of deaths:

$$\begin{array}{c}{q}_{R,B}^{\mathrm{^{\prime}}}\left(x,s,t\right)=\frac{{{D}{\prime}}_{R,B}\left(x,s,t\right)}{{P{\prime}}_{R,B}\left(x,s,t\right)}\end{array}$$
(1)

For most cells in the 11 × 11 matrix of places of birth and residence and for a given one-year age interval, there is more than one \({q}_{R,B}^{\mathrm{^{\prime}}}\left(x,s,t\right)\) because different birth cohorts contribute to them at different times. To combine these into a pooled estimate of the corresponding conditional probability, weigh each \({q}_{R,B}^{\mathrm{^{\prime}}}\left(x,s,t\right)\)’s by the fractional contribution to the total population exposed associated with each of the cohorts that contributes to the conditional probabilities, \({P}_{R,B}\left(x,s,t\right)/\sum_{t}{P}_{R,B}\left(x,s,t\right)\) thus yielding a pooled estimate:

$$\begin{array}{c}{q}_{R,B}\left(x,s\right)=\frac{\sum_{t}{q{\prime}}_{R,B}\left(x,s,t\right)*{P}_{R,B}\left(x,s,t\right)}{\sum_{t}{P}_{R,B}\left(x,s,t\right)}\end{array}$$
(2)

Out of a total 11,858 \(R, B, x, s\) combinations (49 ages, 2 sexes, and 121 place of birth and residence combinations), there are 56 (0.5 percent) cases for which \({P}_{R,B}\left(x,s,t\right)=0\), mostly at the last age of the life table. In those cases, we fit a Gompertz curve and obtain an estimate of \({q}_{R,B}\left(x,s\right)\) at the missing ages. From those death probabilities from (2), we can obtain the corresponding mortality rates:

$$\begin{array}{c}{m}_{R,B}\left(x,s\right)=\frac{2\cdot {q}_{R,B}\left(x,s\right)}{2-{q}_{R,B}\left(x,s\right)}\end{array}$$
(3)

Thus, we will have a total of 242 (pooled) life tables for two sexes and 121 places of birth and residence combinations or, equivalently, 242 observed values of life expectancies at age 50, e50, the main dependent variable in our analysis. In addition, and for graphical purposes, we computed standardized mortality rates by place of birth and residence: for those, we employed the mortality rates from (3). They were thus computed as:

$$\begin{array}{*{20}c} {{\text{SMR}}_{{R,B}} = \sum\limits_{{\forall x}} {m_{{R,B}} } \left( {x,M} \right) \times \pi \left( {x,M} \right) + \sum\limits_{{\forall x}} {m_{{R,B}} } \left( {x,F} \right) \times \pi \left( {x,F} \right)} \\ \end{array}$$
(4)

where \({m}_{R,B}\left(x,M\right)\) and \({m}_{R,B}\left(x,F\right)\) are the male and female mortality rates for those residing in R who were born in B (from Eq. (3)) and \(\pi (x,s)\) is the proportion of individuals of age x and sex s in a standard population.Footnote 10

3.4 Model for Life Expectancy at Age 50

We classify individuals according to their region of birth, i, and residence, j. When i = j we will refer to them as “stayers” and “leavers” otherwise. We propose a simple model for the observed life expectancy at age 50 in Spain as a function of combinations of places of birth and residence:

$$\begin{array}{c}{e}_{50}={\alpha }_{0}+\sum_{i=1}^{r}\left({p}_{ii}\times {\beta }_{si}+\left({p}_{bi}-{p}_{ii}\right)\times {\beta }_{li}\right)+\sum_{j=1}^{r}{p}_{rj}\times {\gamma }_{j}+\varepsilon \end{array}$$
(5)

where \({\alpha }_{0}\) is a constant, \({p}_{ij}\) is the probability that an individual is born in i and resides in j, \({p}_{bi}\) is the probability of being born in i, \({p}_{rj}\) is the probability of residing in j; \({\beta }_{si}\) is a coefficient that reflects the effects of place of birth on mortality of stayers in i, \({\beta }_{li}\) is a coefficient for the effects of place of birth on mortality of leavers from i, \({\gamma }_{j}\) is the effect on mortality place of residence j irrespective of place of birth and, finally, \(\varepsilon\) is an idiosyncratic error. The term \(\left({p}_{bi}-{p}_{ii}\right)\) equals the proportion of individuals who are born in i and reside elsewhere. Under some conditions spelled below, the difference \({({\beta }_{li}-\beta }_{si})\), is a measure, albeit imperfect, of health selection of migrants from i. A graphic rendition of expression (5) is in Fig. 2.Footnote 11

Fig. 2
figure 2

Diagram of the decomposition of life expectancy. The life expectancy \({e}_{i}\) (\({e}_{50(i)}\)) of those born in i is represented by the gray circle. The black circle (\({\alpha }_{0}+{\beta }_{s}\)) shows the baseline life expectancy plus the effect of place of birth for stayers. Movers or leavers experience a selection equal to (\({\beta }_{l}-{\beta }_{s}\)). The dotted triangles and squares are the life expectancy of stayers and movers, respectively, with no environmental effects. Environmental effects (\({\gamma }_{j}\)) are shown for each region of residence j. The weighted mean of all the life expectancies \({e}_{ij}\) on the right is equivalent to \({e}_{i}\), represented by the gray circle. (Color figure online)

Proper interpretation of parameters in model (3) requires some assumptions. First, because \({\beta }_{li}\) and \({\beta }_{si}\) depend only on place of birth, i, and not on place of residence, j, their difference can be interpreted as a measure of health selection of leavers from i (relative to stayers), averaged out across places of destination. Second, note that the estimate of the parameter reflecting effects of a place of residence, \({\gamma }_{j}\), does not depend on i and it is assumed to be the same for all migrants arriving in j, irrespective of their place of birth. Underlying expression (5), there is a simple model for the life expectancy of individuals who were born in place i and migrated to place j, namely,

$$\begin{array}{c}{e}_{50(i,j)}={\alpha }_{0}+{\beta }_{si}\times {S}_{i}+{\beta }_{li}\times {L}_{i}+{\gamma }_{j}\times {R}_{j}\end{array}$$
(6)

where \({e}_{50(i,j)}\) is the life expectancy at age 50 of those born in i and reside in j, \({S}_{i}\) is a binary 0/1 dummy variable set to 1 for stayers born in i, \({L}_{i}\) is a dummy variable set to 1 for leavers born in i; \({R}_{j}\) is a dummy variable equal to 1 when the place of residence is j. The coefficients \({\alpha }_{0}\), \({\beta }_{si}\), \({\beta }_{li}\) and \({\gamma }_{j}\) have the same interpretation as in (3). We seek to identify 3r + 1 parameters, namely, a constant, \({\alpha }_{0}\), parameters \({\beta }_{si}\) and \({\beta }_{li}\) for r regions of birth and parameter \(\gamma\) for r regions of residence. We have a total of \(r\times r\) equations for values of life expectancy of all possible pairs of indexes for places of birth and residence, i and j. The parameters of (6) were estimated using a simple linear model.Footnote 12

3.5 Heritability of Life Expectancy

The parameter estimates from model 6, can be used to compute a quantity of great interest, namely, the “heritability” of mortality conditions associated with places of birth. Our previous discussion suggests that migrants carry with them risks associated with conditions experienced during early childhood in the place of birth that can manifest as adult delayed effects, irrespective of region of destination. We propose an estimator of heritability that includes components of total variance of life expectancy across regions associated with place of residence and place of birth. The first component is a function of place of birth and we refer to it as the “inherited” component, I:

$$\begin{array}{c}I=\sum_{i=1}^{r}\left({p}_{ii}\times {\beta }_{si}+\left({p}_{bi}-{p}_{ii}\right)\times {\beta }_{li}\right)\end{array}$$
(7)

In this expression, the contribution of stayers and leavers is different as the latter is modified by health selection (if any). The second component is a function of place of residence and we refer to it as the “environment” component, E:

$$\begin{array}{c}E=\sum_{i=1}^{r}{p}_{rj}\times {\gamma }_{j}\end{array}$$
(8)

Following the terminology in population genetics, we will define heritability of health conditions, \({h}^{2}\), as the fraction of the total variance in life expectancy at age 50 (the “phenotype”) explained by place of birth (“additive allelic variance”) and, correspondingly, the effect of environment is the fraction of the total variance in life expectancy at age 50 explained by place of residence (“environmental effects”).

$$\begin{array}{c}{h}^{2}=\frac{\mathrm{Var}\left(I\right)}{\mathrm{Var}\left(E\right)+\mathrm{Var}\left(I\right)}\end{array}$$
(9)

with \({h}^{2}\in [\mathrm{0,1}]\).

Using Eqs. (7), (8) and (9), we can express \({h}^{2}\) as

$$\begin{array}{c}{h}^{2}=\frac{{\sum }_{i\in \left[1,r\right]}\left[{p}_{ii}\times \left(1-{p}_{ii}\right)\times {\beta }_{si}^{2}+{\overline{p}}_{i}\times \left(1-{\overline{p}}_{i}\right)\times {\beta }_{li}^{2}\right]}{{\sum }_{i\in \left[1,r\right]}\left[{p}_{ii}\times \left(1-{p}_{ii}\right)\times {\beta }_{si}^{2}+{\overline{p}}_{i}\times \left(1-{\overline{p}}_{i}\right)\times {\beta }_{li}^{2}+{p}_{ri}\times \left(1-{p}_{ri}\right)\times {\gamma }_{i}^{2}\right]}\end{array}$$
(10)

where

$$\begin{array}{c}{\overline{p}}_{i}={p}_{bi}-{p}_{ii}\end{array}$$
(11)

Higher values of \({h}^{2}\) correspond to higher influence of region of birth conditions (i.e., the effects of early conditions that manifest as adult delayed conditions are strongly inherited). Conversely, smaller values of \({h}^{2}\) will obtain when mortality heterogeneity is mostly associated with characteristics of places where individuals reside at the time of death.Footnote 13

4 Results

Table 1 displays basic counts of population and deaths by region of birth. The cohorts include 12.375 million individuals, 4.758 million of which (38.4 %) died within the period of observation.

Table 1 Descriptive statistics of the cohort

Tables 2 and 3 display matrices of life expectancy at age 50 by region of birth and residence. Consider first life expectancy by place of birth. Among males, it ranges between 27.8 (born in Catalonia and living in Extremadura-Castilla-La Mancha-Murcia) and 32.7 years (born in Castile-and-Leon and living in Madrid). Among females, the range is between 33.8 (born in Valencia-Balearic Islands and living in Andalusia) and 37.6 years (born in Castile-and-Leon and living in Madrid). The highest life expectancy by region of birth is for those born in Castile-and-Leon (31.5 years for males and 37.0 for females) whereas the lowest are among those born in Andalusia for males (29.3 years) and Canary Islands for females (34.6 years). Regions that follow closely the advantageous performance of Castile-and-Leon are Aragon, Galicia or Madrid. Instead, populations born in Canary Islands and Valencia-Balearic Islands are close to the region with the poorest performance.

Table 2 Life expectancy of males at age 50
Table 3 Life expectancy of females at age 50

Turning now to life expectancy by place of residence, we observe that the highest life expectancy is among those residing in Madrid (31.3 years for males and 36.7 for females), whereas those living in Andalusia have the lowest (29.1 years for males and 34.3 for females). Note that life expectancy among those living in Basque Country-Navarre-La Rioja or Castile-and-Leon is high, regardless of their place of birth, whereas for those living in Valencia-Balearic Islands are almost always among the lowest, regardless of place of birth.

The range of variation of estimates of life expectancy by place of birth and by place of residence are similar to each other. In addition, life expectancy by place of birth and by place of residence are correlated (\({R}^{2}\,\)=0.67, Spearman = 0.75 for men and \({R}^{2}\,\)=0.86, Spearman = 0.90 for women). The association is tight because a significant fraction of the population resides in the place of birth (about 72% of Spain’s population). However, and specially for males, the mapping of one on the other is far from perfect.

A final observation is that, as expected by the healthy migrant conjecture, life expectancy of stayers is lower than that of leavers for all regions of birth, except Madrid and Catalonia (and, among females only, in Basque Country-Navarre-La Rioja).

Figure 3Footnote 14 shows the influence of migrants on the observed mortality rate in places of destination or residence. Madrid stands out as the place where immigrants born in all other regions have lower mortality rates than individuals born there. This is a case in which migrants to Madrid increase the region’s life expectancy. Though to a different degree, the same applies to regions Asturias-Cantabria, Andalusia, Galicia, Canary Islands, Valencia-Balearic Islands and Basque Country-Navarre-La Rioja. Conversely, immigrants to Castile-and-Leon and Aragon experience higher mortality rates than those born in these regions and contribute to lower their life expectancy. The same occurs in Catalonia and Extremadura-Castilla-La Mancha-Murcia, though, in these cases, the phenomenon is mostly due to the high fraction of individuals born in Andalusia among migrants. Note that, with the exception of Madrid and Catalonia, mortality is higher among stayers in a place than among leavers out of the place (Fig. 4). Andalusia is an extreme case, where stayers have the highest mortality rate among all regions of residence.

Fig. 3
figure 3

Composition of annual SMR (‰) in three regions of residence by origins. Blue circles are annual SMR of residents in Catalonia (left), Basque Country-Navarre-La Rioja (center) and Madrid (right). Red circles are SMRs of those born in a region and residing in Catalonia, Basque Country-Navarre-La Rioja or Madrid, respectively. The fraction of residents in a region that were born in another is displayed when it exceeds 5%. (Color figure online)

Fig. 4
figure 4

Composition of annual SMRs (‰) in three regions of birth by destinations. Red circles are SMRs of natives of Castile-and-Leon (left), Catalonia (center) and Andalusia (right). Blue circles SMRs of those residing in a certain region and born in Castile-and-Leon, Catalonia or Andalusia, respectively. The fraction of residents in a region that were born in another is displayed when it exceeds 5%. (Color figure online)

The lowest variance of mortality rates by place of birth corresponds to residents in Castile-and-Leon, Galicia and Madrid, which stand out among the regions with lower mortality by residence. In general, places where residents have lower mortality levels also have lower variance of mortality rates across all origins of residents (\({R}^{2}\,\) = 0.18, Spearman = 0.41). This suggest the operation of a few mechanisms: either there is a strong convergence of mortality of migrants, e.g., a powerful influence of environments at destination, or tight similarity of mortality levels among migrants due to place of birth influences or, finally, similar magnitude and nature of health selection.

Finally, the lowest variability of mortality across places of residence is found among those born in Castile-and-Leon or Madrid, whereas the highest is among those born in Canary Islands, Catalonia or Valencia-Balearic Islands. Overall, mortality rates by place of birth are positively correlated with the variance of mortality rates across places of residence (\({R}^{2}\,\) = 0.33, Spearman = 0.74). This could be due to the fact that those born in places with lower mortality tend to be less affected by levels of mortality in the place of destination.

4.1 Estimates of Parameters for Life Expectancy

The linear model in Eq. 6 fits the data well with \({R}^{2}\,\)= 0.72 for males and \({R}^{2}\,\) = 0.77 for females. The baseline life expectancy at age 50 (constant plus \({\beta }_{si}\)) corresponds to the contribution of place of birth to stayers (columns 1 for females and 4 for males). It hovers around a mean of \({\alpha }_{0}\,\) =3 0.0 years for males and \({\alpha }_{0}\,\)=35.7 years for females, a six year difference (Table 4).The highest contribution by place of birth is for individuals born in Aragon, Extremadura-Castilla-La Mancha-Murcia and Castile-and-Leon among males, and Castile-and-Leon and Galicia among females. Differences between regions are about 2.1 years for men (Aragon and Madrid) and 2.6 years for women (Castile-and-Leon and Canary Islands).

Table 4 Decomposition of life expectancy

Columns 2 for females and 5 for males display the contribution of leavers’ selection effects (\({\beta }_{li}\hspace{0.17em}-\hspace{0.17em}{\beta }_{si}\)). These values are mostly positive and the largest selection effect, about 1.4 years for females and 1.6 for males, are experienced by those who leave Canary Islands. Conversely, outmigrants from some regions, mainly Andalusia and Catalonia, appear to be negatively selected compared to stayers. Overall, the size of these selection effects is larger among males than among female, as are their variances (0.54 for men and 0.21 for females).

Finally, estimates of place of residence’s effects \({\gamma }_{j}\), varies substantially across regions of residence. The largest values are for Madrid, where the residential advantage is equivalent to 1.4 years for males and 0.8 years for females. In contrast, some regions have a negative impact as in Andalusia where there are reductions of 1.0 years 1.2 years for males and females, respectively.

4.2 Income, Migration and Life Expectancy

The association between GDP per capita of regions and estimates of residence effects is tight (\({R}^{2}\) = 0.5), confirming that environments of a place matters. This suggests that those residing in Madrid, Castile-and-Leon, Basque Country-Navarre-La Rioja or Catalonia have higher life expectancy, partly because they have higher incomes or enjoy health benefits of improved infrastructure. Canary Islands stands out as an exception, as it is a below- average-income region with high life expectancy among those who migrate to it. Thus, immigrants to Canary Islands have higher life expectancy (over 30.6 years for men and over 36.4 years for women) than do Canary Islands’ stayers (29.3 years for men and 34.6 for women). This might be a result of the fact that migrants to Canary Islands are a highly selected population (relative to stayers in Canary Islands), much less affected by adverse conditions than Canary Islands’ stayers. It is likely that a high fraction of migrants to Canary Islands are skilled workers and professionals who take jobs that natives cannot. Also, it is known that Canary Islands is a favorite destination place for mainland’s high income groups who settle in Canary Islands right before and after retirement. In both cases, the health status of migrants to Canary Islands should be superior to those who are born in the region.

4.3 Heritability of Life Expectancy

The heritability coefficient of life expectancy in Spain is \({h}^{2}\,\) = 0.42 for males and \({h}^{2}\,\)= 0.43 for females. This means that an important fraction of the variability of life expectancy at age 50 is explained by conditions that individuals take with them when they migrate to another region. We argue that these conditions reflect experiences in places of birth that have delayed adult effects. This estimate of heritability is a lower bound, as it does not reflect the impact of salmon bias, that is, the effects of place of birth on individuals who returned to them because of poor health possibly induced by adult delayed effects.

5 Discussion

A rigorous accounting of factors that might explain regional mortality disparities requires, at the very least, to distinguish between effects associated with place of birth and place of residence. Not only does this information help us discern more precisely the impacts of healthy migrant and salmon bias effects, but it also offers a viable strategy to assess the magnitude of adult delayed effects. Admittedly, considering only place of birth and residence in 2003 is not an optimal strategy to assess health selection or test DOHaD hypotheses. However, except in populations with only weak regional mobility, mortality estimates by place of birth and residence reveal much more than standard measures of mortality.

Our results lead to a number of inferences. First, as showed in the case of the USA (Fletcher et al., 2023; Xu et al., 2020), separating the contribution of places of birth and places of residence, identifies important heterogeneity in mortality disparities. The differentials in life expectancy at age 50 by region of residence (2.4 years for women, 2.2 years for men) are of the same order of magnitude as those observed by region of birth (2.4 years for women, 2.2 years for men). The size of these differences is not trivial. In fact, in Spain at least, they are comparable to mortality differentials by education: the difference between those with primary education and those with college education or more are of the order of 1.5 years for females and 2.5 for males (Requena, 2017).

Second, we find some support for healthy migrant effects. With the exception of those who move to places with the highest levels of mortality (Andalusia, Canary Islands, Asturias-Cantabria), leavers from all regions tend to experience lower mortality than counterparts who stay in their places of birth. There are two extreme illustration: (i) Madrid stands out as the destination for leavers with the highest life expectancy. Indeed, out-migrants to Madrid originating in 8 out of 10 regions experience highest life expectancy as Madrid residents. This could be a result of better conditions in Madrid, the outcome of particularly strong positive selection or both; (ii) conversely, out-migrants to Andalusia originating in 8 out of 10 regions experience the lowest life expectancy among all possible destinations. This could be due to strong negative health selection, poor environmental conditions in Andalusia or both. One could argue that the net differences between life expectancy of leavers and stayers reflect both the impact of places of destinations’ environment and the effects of selection (negative or positive). However, the fact that, with the exception of Andalusia, Canary Islands and Asturias-Cantabria, the bulk of differences always favors leavers over stayers, suggest that selection might be playing the dominant role.

Third, we find strong evidence suggesting the importance of place of birth as an explanatory factor of regional mortality disparities. First, out-migrants from places with low stayers’ mortality experience the lowest variance in mortality levels and, in addition, these are very similar to mortality levels in their places of birth, irrespective of destination. This strongly suggest that out-migrants inherit conditions associated within their place of birth or, equivalently, are less influenced by a place of destination’s environment. This finding is confirmed by the high value of Spain’s coefficient of heritability which indicates that between 42 and 43% of the total variance of life expectancy at age 50 is explained by place of birth.

Fourth, migrants can shape observed mortality in places of destination. Thus, observed regional mortality levels in Spain are associated with the composition of migrants by places of origin. For instance, immigrants to Catalonia increase its mortality level. This is largely explained by the fact that a high proportion of these migrants (47%, or 24% of all Catalonia residents) originate in Andalusia, the place with the least favorable mortality regime. If, instead, migrants originated in Castile-and-Leon, the annual standardized mortality rate would drop from 21.5 to 20.5 ‰. Migrants increase mortality levels in Castile-and-Leon and Aragon as well, but reduce it in all other places.

Finally, decomposition of life expectancy enables us to partially disentangle effects of health selection from residence effects. The value of the estimates is positive, as expected when there are healthy migrant effects. Health selection is highest among those born in Canary Islands, a geographically isolated region where successful outmigration could be more tightly constrained by health conditions, regardless of destination. Lastly, we find that, as expected, the strength of effects of place of residence is correlated with economic conditions of regions.

There are a number of shortcomings that limit the reach of our inferences. First and foremost, we ignore the role of leavers’ timing of migration and duration of residence in places of birth and destination. Thus, we cannot distinguish those who migrated right after birth from those who migrated just before retirement nor can we take into account repeated migration events. By the same token, because we are not working with migration histories, we cannot separate stayers who never moved from stayers who experience one or multiple migration events. Ultimately, place of birth is a highly imperfect indicator of strategic lifetime exposures, albeit one that is better than none.

Second, our estimates of healthy migrant effects are coarse because they are computed assuming that place of destination is unrelated to levels of health selection among immigrants to it. It is possible that health selection is more elevated among migrants to the most attractive regions, those where pressure on new arrivals exerted by labor markets competition are greater. Similar limitations apply to inferences about salmon bias.

Finally, two important issues we intend to pursue in future research. First, we can decompose differences in life expectancy by cause of death and age groups. Doing so offers a different source of empirical evidence to test the DOHaD conjecture. Second, rather than using Autonomous Communities, we could define lower level of aggregation and use a well-defined classification scheme to group provinces. Although we may lose some power in doing so, we will be able to perform analyses with higher granularity, including an assessment of variability of the heritability coefficient by place of origin.

6 Conclusions

Following other research on the subject (Fletcher et al., 2023; Xu et al., 2020), the main goal of this paper is to argue that the study of regional mortality is enriched if one uses statistics computed by place of birth and place of residence. We showed that doing so leads to somewhat different assessments about the nature of regional disparities. In particular, disparities by place of birth are not the same as those by place of residence. This should matter considerably in populations that experience significant residential mobility. With help from some simplifying assumptions, we illustrate that these type of data leads to important insights about the nature of health selection of migrants to a place. Simultaneously, it is possible to compute estimates of effects of a place of residence that are more precise, e.g., less contaminated by the influence of migrants’ mortality on place of residence’s mortality. We propose an index of heritability of health-related conditions prevailing in places of birth among those who leave. The index reflects the importance of early exposures and their manifestation as delayed effects. This index could be an important source of new empirical evidence to test hypotheses abut adult delayed effects, one that is not available in standard studies of regional mortality disparities.

Finally, our findings for Spain are consistent with and comparable to those for the USA. This suggests the intriguing possibility that perhaps they reflect phenomena that are not unique or pathological but quite general in modern populations. If future research does indeed reveal that the regularities uncovered in the USA and Spain apply more generally, the findings will be an unexpected source of empirical evidence for DOHaD’s conjectures.