To analyse spatial patterns in premature mortality and the extent to which they are explained by spatial patterns in socioeconomic factors, we employ a spatial pattern recognition analysis. In a first step of this analysis, we estimate the extent to which systematic spatial patterns are detectable in premature mortality. Specifically, we regress premature mortality on five simple spatial measures: latitude, that is, the degree to which a district/city is located further north in Germany;Footnote 4a dummy for East Germany; the clustering of premature mortality among geographically contiguous districts/cities; the clustering among geographically proximate districts/cities (defined as the inverse of Euclidean distance among districts); and urbanity, defined as population density of a district/city. Regressing premature mortality on these five spatial measures provides us with an estimate of the strength of the spatial patterns.
In the second step, we regress premature mortality on its socioeconomic determinants. This second step feeds into the final and most important step in which we use the predictions of the regression model from the second step to compute the residuals, which represent variation in premature mortality not explained by its socioeconomic determinants. We regress these residuals on the same five spatial measures. By comparing the coefficients from the first step of the spatial pattern recognition, which represent the strength of spatial patterns in premature mortality, with the coefficients from the third step of the spatial pattern recognition, which represent the remaining strength of spatial patterns in residual premature mortality that is unaccounted for by socioeconomic factors, we obtain an estimate of how much the spatial structure in premature mortality declines by accounting for spatial structure in its socioeconomic determinants.
As mentioned above, we define premature mortality as dying before the age of 75 and we use two measures, namely the premature mortality rate defined as the proportion of the population at the district/city level dying before the age of 75 and the total sum of potential years of life lost relative to the age of 75 for those dying prematurely. The bivariate correlation between the two measures is 0.9. For the first measure, we compute a standardised propensity of premature death in local authorities based on life tables we computed for 402 German districts and cities for the year 2014.Footnote 5 These life tables allow us to compare the survival rate of an artificial cohort of 100,000 individuals in each district/city based on observed, that is, actual age-dependent probabilities of dying. Consider Germany as a whole: Starting with 100,000 individuals, 99,666 newborns reach their first birthday, 99,571 survive to experience their 10th birthday, 99,414 their 20th birthday, 96,925 reach their 50th birthday, and 88,247 individuals reach 65. At this point, mortality rates start to increase considerably. Only 74,392 reach the age of 75 and 42,404 individuals experience their 85th birthday. At this age, the information in the life tables ends. The premature mortality rate has a range from 18,423 to 34,307 with a mean of 25,604 and standard deviation of 2881.
The years of potential life lost measure is calculated as the sum of the number of standardised deaths at each cohort times the years that individuals died prematurely. For example in a given district/city the years of potential life lost would be the number of standardised deaths that occurred before the age of one multiplied by the 74 years that these children did not live (die prematurely) plus the standardised number of deaths between the ages of 1 and 5 multiplied by the 72 years they lost on average, plus the deaths between the ages of 5 and 10 multiplied by 67 years, and so on until the cohort that died between 70 and 75, which is multiplied by 2. This measure of premature mortality ranges from 199,814 to 502,682 with a mean of 321,634 and a standard deviation of 49,003.
We use five categories of socioeconomic variables: income and poverty; education; sectoral composition of the economy; socioeconomic status; and the depth and structure of employment.Footnote 6 Specifically, we include mean household income and median workforce income at the district/city level as well as the average pension benefit of pensioners as proxies for disposable income. Per capita expenditures on social welfare and unemployment benefits as well as the total unemployment rate and the unemployment rate of women function as proxies for poverty. We further include information on the highest level of educational qualification of the population and the workforce,Footnote 7 on the share of employment by economic sector and on the socioeconomic status composition of the workforce. Finally, we include the share of men and share of women in employment, the share of foreigners among the workforce as well as the ratio of part-time employment as measures of the depth and structure of employment. Since we use categories for educational achievement, categories of the sectoral composition of the economy and categories of socioeconomic status rather than continuous measures of these socioeconomic factors (e.g. years of schooling), we explicitly do not assume that the effect of, say, education on premature mortality is linear in the number of years of schooling, which would be highly implausible. Allowing for further non-linear effects by including second degree polynomial terms of our explanatory variables results in only a small increase in goodness-of-fit with the data and leaves our substantive findings unchanged (results not reported here).
Given that the socioeconomic factors included in our estimation model are not mutually independent of each other and some might represent the causal mechanism by which others exert their effect, we do not evaluate the point estimates of individual variables or their statistical significance. All we are interested in here is the combined explanatory power that socioeconomic factors jointly exert on premature mortality.