1 Introduction

Standard economic theory as well as empirical observations suggest that education enhances productivity and thus the earnings individuals can expect. This, however, depends at the same time on the state of health the individuals of working age have achieved. Poor health depresses productivity and earnings, good health improves both.

This matters on a societal as on an individual level. Poor health affects the financial resources available to the household and it reduces the contribution the population of working age can make in terms of gross value added to the economy at large.

The aggregate impact of poor health, or for that matter of good health, has been analysed in the context of comparative studies (Weil 2007) and national accounting analyses (Atkinson 2005). The individual impact has been looked at in empirical studies on the relationship between the incidence of a morbidity (or an accident) and the resulting change in labour market behaviour: participation, wages, earnings (Garcia-Gomez et al. 2013; Halla and Zweimüller 2013).

Such studies use an unforeseen event of medical relevance as the “cause” for explaining observable changes in individual labour market results (compared to a control group). Once you move from the incidence of a morbidity to its prevalence as the unit of observation, a “causality” issue arises: An unfavourable socio-economic position could be the “cause” of an accumulating individual burden of morbidity; the so called “gradient” of the prevalence of morbidities (Cutler et al. 2011).

Our paper addresses the gradient issue in two ways: First, we control for the level of education of the individual; education as the major determinant of socio-economic status in adult life. Since we restrict our analysis to age groups for whom the level of education is already fixed, it is (as a proxy of socio-economic status) exogenous to our analysis. Second, we have run our equations additionally in a way that would reflect “reversed causation”.

This is documented in the appendix. The “reversed causation” specification is rather poor and has little explanatory power in a descriptive sense.

There are several ways to conceptualize and measure the “state of health” of an individual. The household production perspective looks at “health” as a stock, in which individuals invest and which depreciates as a function of age (Bolin 2011, originating from Grossman 1972).

Studies using survey data base their concepts of health on the self-assessment of the person asked; further validation of this assessment can be gained from survey items on health related behaviour. (Jäckie and Himmler 2010; Ben Halima and Rococo 2014)

Our analysis takes a different approach. It looks at the frequency and intensity of healthcare interventions by general practitioners, specialists, hospitals; it makes use of leaves of absence and prescriptions. This is done for a population of permanent residents of Lower Austria. Each individual is assigned an index value for the interaction with the health care system. This index for the interaction with the health care system is used to augment the standard wage/earnings function. The advantages and limitations of this index will be discussed in Sect. 3 below.

The paper proceeds from a description of the data, to the construction of the Health Care Interaction index, to the estimation methods chosen, and to a discussion of the empirical results (Table 1).

Table 1 Baseline characteristics of the study population

2 Data and population

The original source of the data are the Austrian health insurance agencies which reimburse providers for health services rendered and drugs prescribed. Health insurance is mandatory and financed via contributions based on payrolls and earnings. Access to public health care involves at most only minimal fees.

The health insurance agencies centrally collect medical as well as employment records. These can be matched on the individual level. Access to such matched micro data is restricted by stringent privacy laws. Permission is only granted to research teams that pass tests on their capacity to ensure data privacy and security. Members of the DEXHELPP network (co-financed by the Austrian Research Promotion Agency) have gained access and spent years on transforming social security records into data suitable for analysis.

A project specific module has been generated from the general research database. It covers all individuals registered with the health insurance agency for Lower Austria in 2016; they have taken continuously residence in Lower Austria from 2006 to 2016; they had been born between 1951 and 1981. There are health care data available for the period 2008–2011; employment data cover the whole period from 2006 to 2016. The population is restricted to individuals earning not more than 100,000-euro and not less than 5000-euro annually between 2006 and 2016. This reduces the total population from 237,119 to 185,761 individuals.

The health care data contain information on visits to general practitioners and specialists; this includes drugs if such have been prescribed; admissions to hospitals, the duration of stay and the underlying diagnoses, leaves of absence.

The employment data inform on the start and termination of employment relationships and the corresponding earnings of the individuals (up to a threshold). Spells of unemployment are recorded as well.

The levels of education are compulsory education only (C), apprenticeship/trade school level (A), university entrance level and beyond (U).

Whereas many variables derived from this data set are more or less self-explaining (gender, age, level of education attainment, days of employment, earnings, etc.), does the index of the interaction with the health care system warrant some discussion.

3 Intensity of interaction with the health care system and the burden of morbidity

The state of health and its implications are a matter of perspective. The individuals affected take a different view than the doctors they visit, or the employers at which they work. All of them interact within an institutional framework that shapes their options for response.

Employers perceive the state of health of an employee through events that signal the level of productivity (and its variation over time) on the jobs considered suitable for the worker (already on the wage roll or to be hired). These signals might point to a below than average job specific output. Each of the signals will be a reason for employers not to hire the person; or to offer a wage rate below the standard average; or in case of an employee not to promote the person. Each of these effects have been quantified in the literature (Markussen 2012; Pelkowski and Berger 2004; Chadi and Goerke 2018) .

With regard to the spells of leave of absence as a signal, several papers have offered evidence for institutional settings similar to those of the Austrian data we use (Markussen 2012; Andersen 2010). The paper by (Ahammer 2018) even uses data of the same sources as we do albeit for Upper Austria rather than for Lower Austria.

These papers interpret the leave of absence (as signal to employers) as the result of an interaction between the individual and the general practitioner who prescribes the number of days of the leave of absence. When this interaction yields a longer duration of sick leave, the patients face adverse employment outcomes.

This leads to the doctors’ views. They have their own metric regarding the severity of health intervention based on the type of their diagnosis. Health conditions of a severe kind warrant admission to a hospital, or long term recurrent therapies and prescriptions (in case of chronic diseases). Moreover, severe health conditions often lead to comorbidities which raise the level of medical intervention in quantitative terms. How these interactions evolve depends partly on the doctors and partly on the patients.

Patients derive their view on the state of their health from both, the signals they perceive themselves (physical limitations, loss of mental well being, pain) and from the assessments given to them by doctors. Epidemiological evidence suggests that at any given type of diagnosis, individuals differ in their propensity to initiate an interaction or to continue an interaction with the health care system.

These considerations point towards potential biases in taking the intensity of interaction with the health care system at face value when deriving an index of the burden of morbidity. The strength of this approach, however, is that it has interfaces with the views taken by each of the three actors (workers, doctors, employers).

The interaction with the health care system and the burden of morbidity are measured by the number of events (“interactions”) occurring during the period 2008 to 2011. Five separate indexes derived from different events of health care utilization are defined. Health care utilization is represented by hospitalization (index \(I_1\)), consultation with general practitioners (index \(I_2\)) as well as specialists (index \(I_3\)), filling prescriptions (index \(I_4\)) and taking sick leaves (index \(I_5\)).

The kth event of type j of person i is denoted \(q_{i,j,k}\). In case of consultation of general practitioners as well as specialists and filling prescriptions \(q_{i,j,k}\) is equal to one. In case of hospital stays and leaves of absence, the event is associated with a certain duration, which serves as a weight. A hospital stay lasting four days corresponds to four single-day hospital stays. That is, a four-day hospital stay is counted like four events. To calculate a score index, a simple procedure is followed: Events in which a person is involvedduring the years 2008 and 2011 are summed up, divided by the number of days of four years and multiplied by 1000 to standardize

$$\begin{aligned} I_{i,j}=\frac{1000}{1440}\sum _{k=1}^{n_{i,j}}q_{i,j,k} \end{aligned}$$
(1)

where \(n_{i,j}\) is the number of events of type j occurring to person i. The resulting Health Care Interaction Index (HCI-Index) is limited between 0 and 1000. Hospitalization, consultation of general practitioners, consultation of specialists and taking sick leaves are naturally restricted, such that the number of events of one type cannot exceed the number of days in four years. The maximum number of filled prescriptions is restricted to the same number. Finally, an average index for each person representing the five subindexes is calculated

$$\begin{aligned} \bar{I}_i=\frac{1}{5}\sum _{j=1}^5I_{i,j} \end{aligned}$$
(2)

The distribution of the HCI-values exhibits a familiar pattern: it is higher for women than for men; it increases with age; it decreases with the level of education. The figures match the expectations about the “social gradient” in the prevalence of morbidity and health service utilization (Burkert et al. 2012; Cutler et al. 2011) (Tables 2, 3 and 4).

Table 2 Total population: distribution of HCI by 10 year age groups and education in 2016
Table 3 Females: distribution of HCI by 10 year age groups and education in 2016
Table 4 Males: distribution of HCI by 10 year age groups and education in 2016

The value of the HCI-Index can be used to partition the population by the first, second and third (rounded) quartile of the HCI-Index; i. e. by the HCI-values of 5, 10, 20. The corresponding populations are denoted by A (up to 4.9 HCI-points), B (between 5 and 9.9 HCI-points), C (between 10 and 19.9 HCI-points), D (starting from 20 HCI-points) (Table 5).

In population A, it is the “specialist” sub-index which contributes most to the average HCI-Index value, followed by the sub-indices for “general practitioners” (GP), for the “prescribed days of sickness leave”, and for “filled prescriptions”. In population D the following ranking of sub-indices emerges: “prescribed days of sickness leave” lead by far, followed by “filled prescriptions”, “general practitioners”, and “specialists”.

Table 5 Mean value of sub-indices for partitions of HCI

The “duration of stays at hospital” lags far behind in each of the populations; its relative increase from A to D is, however, the strongest among all sub-indices.

4 Augmented earnings function

The standard human capital approach makes earnings dependent on the level of education attained (\(Ed_i\)) and the lengths of work experience approximated by age (\(A_i\)). When gender specific preferences for work life balances are taken into account (or discriminating behaviour by employers) then sex (\(S_i\)) is added to the number of explanatory variables.

$$\begin{aligned} \log (E_i)=\beta _0+\beta _1S_i+\beta _2A_i+\beta _3A_i^2+\beta _4Ed_{i,A}+\beta _5Ed_{i,U}+u_i \end{aligned}$$
(3)

\(E_i\) denotes the earnings of individual i, \(S_i\) is equal to one for men and zero for women, \(A_i\) denotes age minus 35 years,Footnote 1 and \(Ed_{i,A}\) is equal to one for education on the apprenticeship/trade school level, and \(Ed_{i,U}\) for the university entrance level (and further education).

The basic idea of the paper is to enhance Eq. 3 with the burden of morbidity described by the HCI-Index as an explanatory variable.

$$\begin{aligned} \log (E_i)=\beta _0+\beta _1S_i+\beta _2A_i+\beta _3A_i^2+\beta _4Ed_{i,A}+\beta _5Ed_{i,U}+\gamma I_i+u_i \end{aligned}$$
(4)

\(I_i\) denotes the Health Care Interaction Index (HCI-Index). It might be useful to comment on several features of Eq. 4. The variables S, A and Ed are fixed for the population which starts with 25 years of age in 2006 and 35 years of age in 2016 respectively.

Annual \(E_{i}\) is determined by wages and number of days employed during a year. These are days of an ongoing employment relationship between the individual and the employer. Austrian labour law protects employees against the termination of the employment contract in case of a morbidity and a more or less prolonged leave of absence. At some stage, the employer gets compensated by social health agencies.

Thus there is no direct relationship between the days of employment and the days of sickness leave (which enter as a variable in determining the HCI-Index). The HCI-variable might, however, be correlated with respect to age (as a proxy for work experience and seniority) if there are work related health hazards which cumulatively lead to morbidity.

Using log earnings is common in estimating returns to education. As the distribution of earnings is typically skewed to the right, taking logs may help to ensure or at least to approximate the normality assumption made in linear regression analysis. Then, given an increase in the HCI-Index by one, \(\gamma \times 100\) smaller than zero represents the percentage decrease in earnings. Using earnings in euro, a negative \(\gamma\) corresponds to the loss in euro due to a poorer level of health. As the earnings distribution looks quite symmetric between the first and the third quartile, equations based on earnings in euro are estimated as well.

5 Results

Tables 6, 7, 8 and 9 exhibit the results of estimating the standard and the augmented earnings function. Both are estimated in log earnings (standard approach) and in (untransformed) units of euro. The untransformed version allows to present the results in a format that lends itself more easily to comparisons with earnings figures.

The signs and the sizes of the parameters of the earnings functions are as expected: There is a sizeable earnings gap for women, which might be confounded by the approximation of work experience by age. Education beyond the compulsory level increases earnings substantially, in particular if it goes beyond the apprenticeship/trade school level. Work experience (as captured by age and age squared) adds to earnings up to the age of about 50 years, when the effect levels out.

The parameters stay more or less the same, when the health care interaction variable (HCI) is added. The sign and the size of the parameter are such that once the burden of morbidity (i.e. intensity of interaction with the health care system) reaches a certain level a markedly depressing effect on earnings sets in.

The equations are estimated for earnings in 2012 (which is at the end of the period for which HCI is calculated) and for earnings in 2016. The size of the impact is for both years hardly different, as a comparison of Tables 6, 7, 8 and 9, respectively, show. This suggests that the medium term effects of the burden of morbidity are not smaller than the short term effects.

Table 10 exhibits the distribution of the burden of morbidity by levels of the HCI-Index. About 47.0% of population have a HCI-value less than 10; and 53.0% of more than 10. The loss in annual earnings at the HCI = 10 level amounted to 827-euro in 2012. About 24.7% of the population is burdened with HCI-value larger than 20 depressing annual earnings even further (by at least 1655-euro) (Table 11).

Table 6 Earnings functions 2012 (earnings in euro)
Table 7 Earnings functions 2012, log(earnings)
Table 8 Earnings functions 2016 (earnings in euro)
Table 9 Earnings functions 2016, log(earnings)
Table 10 Estimated loss of earnings in euro per year in 2012
Table 11 Estimated loss of earnings in euro per year in 2016

6 Discussion

The results draw attention to the fact that a large share of the population of working age is affected by morbidities which warrant interaction with the health care system but are not so severe that they depress earnings; either in the short or the medium term. It seems that health conditions have to reach a certain level that employers interpret them as a signal of persistent lower-than-average job specific productivity.

This is in particular true for the days of leave of absence. Employers tolerate a certain number of annual days of leave of absence before they take notice and start interpreting them as a lower-than-average productivity signal.

Our results suggest that this “threshold” is on average at a HCI score of 1.5 among employers in the Austrian context. The median number of days of leave of absence is 16.3 days per year at that stage.

Further research might want to establish whether the parameter of the functional relationship between the burden of morbidity and earnings are robust with respect to the methodology applied. A plausible next step would be to look at the effects resulting from making use of a control group approach (rather than adding a variable to a standard earnings equation). Proceeding along such lines would yield the option to look more closely into the heterogeneity of effects among sub-populations.

Moreover, it might be of interest to go beyond earnings when analysing the labour market outcome of the prevalence of a morbidity; in particular to decompose “earnings” into hours worked and hourly wages. A special topic is of course the risk of dropping out of active labour market participation at all.

7 Conclusions

Health matters in working life. The nexus shows up in comparing annual earnings of employees burdened by different degrees of morbidity. This is demonstrated in this paper by using a standard wage/earnings equation enhanced by a variable representing the degree of morbidity. This variable is derived from the frequency and type of interaction of individuals with the health care system. It is measured by an index of health care interaction (HCI-Index).

The use of the HCI-Index demonstrates how the impact of health conditions sets in slowly. It is negligible for close to half (47%) of the population of employees (though they are involved in interactions with the health care system). It is relevant in quantitative terms for about 28% of the population and is sizeable for close to a quarter (25%) of the population.

The level of education attained plays a dual role in our analysis. It is directly linked to earnings within the wage equation framework. It is indirectly related to earnings through the HCI-Index as a stratification of the population by levels of education shows. Employees with lower levels of education have significantly higher scores on the HCI-Index.

This might be related to the occupational hazards of the jobs open to workers with compulsory education only. It might be as well that education is a resource in itself when individuals have to cope with contingencies of a health condition. These topics merit further results.

Even though the results are rather robust, further refinements in constructing the HCI-Index and in analysing the impact (in particular with respect to a control group approach) are warranted.