1 Introduction

Rational expectations models (e.g., rational addiction models) are grounded on the assumption that individuals accurately form their survival expectations (Yaari, 1965). Consistently, some research has documented that subjective survival expectations are on average consistent with life table probabilities (Hurd & McGarry, 1995; Hurd & McGarry, 2002; Palloni & Novak, 2016). However, more recent studies show that aggregate life table realizations do not account for individual-specific heterogeneity (Gan et al., 2004).Footnote 1 The availability of end-of-life data makes it possible to precisely compare individual level objective and subjective survival expectation evidence.Footnote 2 After examining subjective longevity expectations over an individual’s life cycle, some studies find evidence of an underestimation (overestimation) of subjective survival at younger (older) age (Elder, 2007; Hurd et al., 2009), suggesting a life course explanation for a bias in survival expectations formation. However, do such biases in expectation formation have an effect on behaviour? Does the specific information domain matter in the formation of biased expectations? Does the domain specific bias reveal a consistent effect across different behaviours?

This paper examines the formation of individual-level biased expectations across two specific domains, namely one’s own longevity and the (predicted) weather. More specifically, we measure biased survival expectations (BSE) by studying individual self-reports in both main and in end-of-life exit interviews in Europe, and we then compare their actual survival realizations (and predicted survival for those alive) with their expectations. Similarly, we estimate individual-level ‘biased meteorological expectations’ (BME), that is, how individuals’ weather predictions compare to weather realisations.

Next, upon documenting evidence of biased expectations, we then examine the impact of such biased expectations on health and financial behaviours. More specifically, we test whether private information drives such effects by exploiting the individual level variation in an individual’s family longevity history (parental age at death) to identify the causal effect of biased expectations on several health (e.g., preventive actions) and financial behaviours (e.g., saving for retirement). The underlying intuition is that, the overestimation of one’s subjective survival can be explained by an individual’s private information about their objective survival probability. This information can arise either from knowledge of either technological or medical reasons influencing a persons survival, including individual-specific genetic information (Martin et al, 2007).Footnote 3 We study different mechanisms including whether the effect of individual biased expectations varies depending on the extent of individual control over different life domains (e.g., higher control over ones health than over the weather).

We add to the literature in several ways. First, we take advantage of unique data from the end-of-life questionnaire of the Survey of Health Ageing and Retirement in Europe (SHARE) and its retrospective wave (SHARELIFE), which allows us to precisely estimate individual level survival expectations and compare them to the respondent’s actual observed survival.Footnote 4 Second, we study BSE using longitudinal data from several European countries, which exhibit a large cross-cultural variation in behavioural reference points. These heterogeneity can influence the formation of expectations compared to similar studies using United States data. Third, unlike previous studies, we are able to identify the bias is expectation formation in two domains, including one's own survival (or health), and the expected weather (or methodological expectations). Finally, we extend the previous literatureFootnote 5 by providing causal estimates of the effect of biased expectations on specific financial and health behaviours.Footnote 6 Hence, we contribute to the still growing literature on the formation of biased expectations and specifically, on the role of private information (Kim et al., 2017), as a driver of biased expectations.

The next section describes in more detail how this paper relates to the existing literature. Section three describes the data, and the empirical strategy is reported in section four. We report the main results in section five. Section six reports a battery of robustness check, and section seven documents evidence of other potential pathways. A final section concludes.

2 Related literature

Biased Survival Expectations (BSE). Subjective survival expectations are central to long-term decision-making regarding saving and consumption (Levhari & Mirman, 1977; Hamermesh, 1985; Hurd, 1989), retirement and employment (Hurd et al., 2004; Cocco & Gomes, 2011), as well as risky health behaviours (Khwaja et al., 2007). However, to date, most of the evidence of bias in expectation formation mainly comes from comparing cross-sectional expectation data and survival life tables. According to Post and Hanewald (2013), the dispersion of subjective survival expectations is more similar to the dispersion of survival life tables. However, life table models are affected by selection, individual heterogeneity, potential cohort effects and individual non-response of those with a higher mortality risk (Bissonnette et al., 2017). In contrast, evidence like the one presented in this study is retrived from exit interviews and is less prone to selection biases. That is, it can precisely identify survival at the individual level.

Cognitive Biases and Optimism. According to the literature, it is unclear whether BSE are the result of individual specific cognitive biases (e.g., optimism bias), or other alternative explanations such as individual differences in access to private information regarding their survival. All of the latter explain why people differ in their ability to judge their own mortality risk (Lichtenstein et al., 1978). Younger respondents tend, on average, to underestimate their objective survival chances, and the opposite is true for older people (Groneck et al., 2017). Similarity, gender plays a role, though according to Steffen (2009) both men and women underestimate their survival, and Teppa and Lafourcade (2013) estimate that women exhibit a systematic lower subjective survival probability relative to actuarial survival probabilities. Viscusi and Hakes (2003) elicit individual subjective survival probabilities and document that they capture a generalised assessment of an individual's health status. Other research has found that age, gender and socio-economic status all have a systematic influence on individual subjective survival expectations (Hamermesh, 1985; Hurd & McGarry, 1995; Khwaja et al., 2007). Consistently with the influence of cognitive biases, Mirowsky (1999) argues that surviving at older ages may give rise to to a feeling of optimism. We will return to this point later in Sect. 6.

Finally, in estimating survival probabilities, there are other important reporting biases to account for such as the rounding of the data and, the presence of focal points (e.g., 0.5) as discussed in Bruine de Bruin et al. (2002). A nontrivial issue is how to estimate survival expectations by age from expectation questions eliciting the probability of survival of an additional target age, which is dealt with in the literature by examining a subjective hazard function under some distributional assumptions (see Bissonnette et al., 2017).

Behaviour and BSE. Although some studies document the presence of discrepancies between perceived and realized survival, there is limited, or no consensus on the effect of BSE on actual behaviour. Groneck et al., (2017) find no evidence that biased survival expectations affect financial behaviours. In contrast, other studies identify an effect of survival expectations on retirement (O’Donnell et al., 2008; van der Klaauw & Wolpin, 2008), demand for annuities (Schulze & Post, 2010; Teppa & Lafourcade, 2013), portfolio allocation (Kézdi & Willis, 2011), education (Arcidiacono et al., 2012), migration (McKenzie et al., 2013), savings (Bloom et al., 2006) and smoking behaviour (Balia, 2011). However, the majority of such studies do not take into account the endogeneity of survival expectation formation. This paper exploits accounts for such endogeneity by drawing on an instrumental variable strategy as described below.

3 The data

We use data from SHARE (the Survey of Health, Ageing and Retirement in Europe)Footnote 7, the European equivalent of the Health and Retirement Survey in the United States. Our variables of interest can be found in the “Main Questionnaire” of waves 1 (2004–05), 2 (2006–07), 3 (2009), 4 (2011), 5 (2011) and 6 (2015) and the “End of life” module for waves 2 (2006–07), 3 (2009), 4 (2011–13), 5 (2013) and 6 (2015). We also link the data with the retrospective SHARELIFE records, especially in our robustness checks. Our final samples are obtained by following several steps (see Table B1 from Appendix B). Using the initial samples for waves 1 to 6 (N = 260,244), we have initially selected those countries which are available for all waves (i.e., Austria, Belgium Denmark, France, Germany, Italy, Spain, Sweden, and Switzerland; N = 160,388). Second, we have then selected observations with non-missing values to calibrate the sampling weightsFootnote 8 (N = 156,320). Third, we have merged each consecutive waves of SHARE. Whilst for survivors two consecutive “Main Questionnaires” of SHARE are used, for the deceased we have merged the “Main Questionnaire” of one wave with the “End-of-life Questionnaire”Footnote 9 of the next wave (N = 118,025). We identify 29,376 non-follow-up respondents, that is, respondents who have only participated in one wave. Next, we compare the characteristics of the non-follow-up respondents with those of the survivors and deceased subsamples. Finally, we distinguish those respondents who have answered the expectations module from those who have not.Footnote 10 Retention ratesFootnote 11 increase remarkably over time in all countries resulting in very high overall panel stability after several waves (Table B2 from Appendix B).

Subjective survival. Subjective survival. The "Main Questionnaire" includes an "Expectations" module that begins with a warm-up question that asks, "What do you think the chances are that it will be sunny tomorrow?" This warm up question should help respondents feel at ease with the numerical scale used in the whole set of questions in this module. To ease the question understanding, these questions are accompanied by cards, with a numerical probability sequence ranging from 0, meaning absolutely no chance, to a value of 100, meaning absolute certainty. Furthermore, respondents are asked to state their subjective survival probability (SSP) on a scale from 0 to 100 using the following question: “What are the chances that you will live to be age [T] or more?”, being T a target age that takes the values {75, 80, 85, 90,…,120} depending on the age of the respondent. Respondents aged below 65 at the time of the interview are presented a target of 75, while those aged between 65 and 69 have a target of 80, and so on (T = 85 for 70–74 years, T = 90 for 75–79 years, T = 95 for 80–84 years, T = 100 for 85–89 years, T = 105 for 95–100, T = 110 for 100–104 years and T = 120 for 105 years and older).

Biased Survival Expectations (BSE). Figure B1 shows the evolution of SSP across the six waves. We find no significant differences and a modest reduction in survival probabilities as people age. Given the significant reduction in sample size, we compare the density function of the survival subjective probability of follow-up respondents and non-follow up respondents. Similarly, to test the validity of SHARE information, Schulz and Doblhammer (2011) compared age-specific death rates for all countries available in wave 1 with age-specific death rates from the Human Mortality Database. They found that empirical death rates mostly ranged within the confidence interval up to age 65, but among individuals older than 65, they were predominantly below the lower bound of the confidence interval.Footnote 12 Hence, older respondents institutionalized shortly before death are coded as attrition cases, rather than deceased.

Fig. 1
figure 1

Trends and gaps of subjective survival and objective survival probability by age of the respondent. Note: These figures display objective survival expectations (OSP) and subjective survival expectations (SSP) for the subsample of survivors and deceased. The distance between OSP and SSP is what we have defined as “biased survival expectations· in Eq. (1)Subjective survival probabilities (SSP) corresponds to the answers to the question of SHARE questionnaire “What are the chances that you will live to be age [T] or more?”. The target T takes the values {75, 80, 85, 90,…,120} depending on the age of the respondent. Respondents aged under 65 at the time of the interview are presented a target of 75, while those aged between 65 and 69 have a target of 80, and so on (T = 85 for 70-74 years, T = 90 for 75-79 years, T = 95 for 80-84 years, T = 100 for 85-89 years, T = 105 for 95-100, T = 110 for 100-104 years and T = 120 for 105 years and older)

Kernel density functions are displayed in Figure B2 (Appendix B) and results of the test of equality of distributions are reported in Table B3. When examining all waves pooled or paired, we identify comparable density functions, which leads us to reject the null hypothesis of equality of distributions.Footnote 13 Table B4 displays the mean survival subjective probabilities by country and target (T) using the final sample (N = 69,900). We identify large cross-country heterogeneity in survival expectations across European countries. We corroborate such evidence by performing F tests comparing the highest and lowest means across countries. More specifically, Danish respondents reveal the largest survival expectations for targets 75, 80 and 85 (when compared to other average country respondents), whereas Italians have the highest survival expectations for the oldest cohorts.Footnote 14 In contrast, while Belgians exhibit the lowest expectations for the youngest cohort, Germans reveal the lowest expectations for targets 80, 85, and 90, and Swedish seniors are the most pessimistic of all the countries studied.Footnote 15

4 Empirical strategy

4.1 Main specification

Using the Weibull specification (commented in Sect. 4.1; see Appendix C for the explanation of the duration model), we first estimate the predicted survival probabilities for each individual, which we denote as “objective survival probabilities” (\({OSP}_{ict}\)). Accordingly, we define our biased survival expectations indicator as the difference between subjective and objective survival expectations as follows:

$${{BSE}_{ict}=SSP}_{ict}-{OSP}_{ict}$$
(1)

Next, we measure the effect of biased survival expectations (\({BSE}_{ict}\)) for an individual i living in country c at time (wave) t; on several behaviours’ (\({D}_{ict}\)) in health and financial domains (including health behaviors, savings, financial investments, and risk taking; see Table A3), as follows:

$${D}_{ict}={\alpha }_{0}+{\alpha }_{1}{BSE}_{ict}+{\alpha }_{2}{X}_{ict}+{T}_{t}+{C}_{c}+{\epsilon }_{ict}$$
(2)

where \({D}_{itc}\) refers to the behaviour indicator under consideration for an individual i residing in country c at time (wave) t; \({X}_{itc}\) refers to a vector of explanatory variables,Footnote 16\({T}_{t}\) and \({C}_{c}\) denote wave and country fixed effect respectively, and \({\epsilon }_{i}\) refers to the error term. We include three groups of explanatory variables: (i) socio-demographic characteristics, (ii) economic controls and (iii) health-related controls. Additionally, we include a definition of each is control variable in Table A2.

4.2 Endogeneity of biased survival expectations on behaviour

Given the potential endogeneity of biased survivaled expectations in explaining health and financial behaviours, we follow an instrumental variable approach (to correct mainly for reverse causality and omitted variable bias). In the first stage, we estimate the following equation

$${BSE}_{ict}={\beta }_{0}+{\beta }_{1}{MAD}_{ict}+{\beta }_{2}{{FAD}_{ict}+{\beta }_{3}{ML}_{ict}+{\beta }_{4}{FL}_{ict}+{\beta }_{5}X}_{ict}+{T}_{t}+{C}_{c}+{\varepsilon }_{ict}$$
(3)

where \({MAD}_{itc}\) and \({FAD}_{itc}\) depict mother’s and father’s age at death, respectively, \({ML}_{ict}\) and \({FL}_{ict}\) are binary variables that take the value 1 if the mother (father) is alive at the time of the survey binary, and 0 otherwise. A valid instrument should be uncorrelated with the error term of (2) but should correlate with \({BSE}_{ict}\). Hurd and McGarry (2002) and Dormont et al. (2014) have shown that death of a parent is associated with a reduction in subjective survival expectations. Additionally, genetic factors or parental ill health, can be regarded as a form of private information that is available to individuals in the formation of expectations of their own and their descendants’ future health status. Therefore, the validity of our instruments relies on the fact that parents' living status affects children’s behaviours only through their effect on biased survival expectations. More specifically, we have selected four instruments: father’s age at death, mother’s age at death, whether the father and/or the mother are alive. Here, we exploit the link between parents and childrens' objective longevity.Footnote 17

In measuring the effect of parent’s age at death we have distinguished between individuals whose parents have already passed away at the time of the survey, from those who are still alive. In the former case, we have used the reported age of death. In the latter, we draw on the predicted age of parental death using multiple imputation (Rubin, 1987). A detailed explanation of the imputation process is reported on Appendix B. The density function and descriptive statistics of deceased parents (reported age of decease in SHARE) and living parents (imputed age of decease) are shown on Figure B3 and Table B7Footnote 18. Sections 6.3 and 6.4 reports robustness checks testing for the validity of our instruments. Finally, the predicted values of \({BSE}_{ict}\)(\({\widehat{BSE}}_{ict}\)) are used instead of the original values of \({BSE}_{ict}\) in Eq. (1).

$${IND}_{ict}={\gamma }_{0}+{\gamma }_{1}{\widehat{BSE}}_{ict}+{\gamma }_{2}{X}_{ict}+{T}_{t}+{C}_{c}+{\omega }_{ict}$$
(4)

For each indicator (\({IND}_{ict}\)) we have estimated four different models, gradually adding new explanatory variables (\({X}_{ict}\)). That is, different specifications vary depending on the inclusion of socio-demographic controlsFootnote 19, country fixed effects and wave fixed effects. Next, we consider a series of health controls, including the number of days spent in hospital in the previous year (due to surgery or medical tests), the number of days stayed at hospital during last year (mental health), the number of days at other institutions during last year and the number of visits to a general practitioner during the previous yearFootnote 20. For each indicator and each model specification we provide three test statistics for the IV estimation (see Tables B13 to B16)Footnote 21.

4.3 Biased meteorological expectations

To identify whether individual biased expectations vary by domain, we have constructed an indicator that compares the subjective probability (prediction) that the following day after the interview is going to be a sunny day with the actual ('objective') meteorological realisation. Meteorological data is retrived from the Gaisma websiteFootnote 22 which provides sunrise and sunset information as well as dusk times for thousands of locations all over the world. For each month and region, we have computed average daylight minutes (\({DMD}_{m,r}\)) as the difference between sunrise and sunset times (see Table B9). The World Meteorological OrganizationFootnote 23 provides the mean number of sunshine hours per month, year, and region. First, we have defined the number of sunshine minutes per month \(m\) of year \(y\) in region \(r\) (\({SMM}_{m,y,r}\)), where region r is the equivalent of NUTS-2 of SHARE. Second, we have computed the average sunshine minutes per day in month \(m\) of year \(y\) in region \(r\) (\({SMD}_{m,y,r}\)) as the ratio between sunshine minutes per month and days per month for each year (to consider leap years) (see Table B10).

$${SMD}_{m,y,r}=\frac{{SMM}_{m,y,r}}{{Days\ per\ month}_{y}}$$
(5)

The meteorological probability of a sunny day in month \(m\) of year \(y\) in region \(r\) (\({MetSD}_{m,y,r}\)) is defined as the ratio between sunlight minutes per day (\({SMD}_{m,y,r}\)) and daylight minutes per day (\({DMD}_{m,r}\)) (see Table B11):

$${MetSD}_{m,y,r}=\frac{{SMD}_{m,y,r}}{{DMD}_{m,r}}*100$$
(6)

\({MetSD}_{m,y,r}\) is regressed over country and regional fixed effects from which we obtain the predicted probability of a sunny day (\({\widehat{MetSD}}_{m,y,r}\)).

The expectations’ module of SHARE include the following question: “what do you think the chances are that it will be sunny tomorrow? For example, '90' would mean a 90 per cent chance of sunny weather. You can say any number from 0 to 100”. We use the answer to this question to define reported probability of a sunny day by an individual \(i\) interviewed in month \(m\) of year \(y\) in region \(r\) (\({Rep}_{i,m,y,r}\))Footnote 24.

“Meteorological expectations” of individual \(i\) interviewed in month \(m\) of year \(y\) in region \(r\)\({BME}_{i,m,y,r}\) is the difference between reported probability of a sunny day and meteorological predicted probability of a sunny day:

$${BME}_{i,m,y,r}={RepSD}_{i,m,y,r}-{\widehat{MetSD}}_{m,y,r}$$
(7)

Table B12 compares both indicators of biased expectations. Our estimates suggest that about 56% of individuals exhibit both positively biased survival and meteorological expectations (e.g., BSE > 0 and BME > 0) and about 8% exhibit negatively biased expectations for both domains. The share of respondents exhibiting positively biased expectations on both domains is 9.2 pp is higher among men though biased expectations differ by age. Similarly, we estimate that biased expectations are 3.5 pp higher for survivors, compared to deceased individualsFootnote 25. Finally, we estimate OLS regressions for the same set of indicators used in Eq. (4), but we now use the expectations based on the meteorological domain (\({BME}_{i,m,y,r})\):

$${IND}_{ict}={\delta }_{0}+{\delta }_{1}{ME}_{i,m,y,r}+{\delta }_{2}{X}_{ict}+{T}_{t}+{C}_{c}+{\varsigma }_{ict}$$
(8)

where \({X}_{ict}\) is the same set of control variables used in (4) and we proceed in a similar fashion, adding them gradually; \({T}_{t}\) and \({C}_{c}\) are fixed and country fixed effects and \({\varsigma }_{ict}\) is an error term.

5 Results

5.1 Biased subjective survival

We begin by estimating BSE among survivors and the individual characteristics predicted to influence BSE. Table B8 reports estimates of a discrete-time hazard model, and identifies a long list of unhealthy lifestyles that correlate with objective survival across waves. Estimates indicate that the hazard rate (1.429) rises over time but such decline tails off. At each survival time point examined, men tend to exhibit a higher hazard ratio (+ 88.4%) than women. Compared to individuals who have never smoked, we find that the hazard rate is 52.4% higher among current smokersFootnote 26 and 36.6% higher for past smokers. Consistently, we estimate a higher hazard rate among respondents that fail to perform any moderate or vigorous physical activity, and similar estimates are retrieved for those who drink every day or almost every day (23.4%). Being diagnosed with cancerFootnote 27 and Alzheimer’s increases mortality risk by 156.6% and 58.5%, respectively, and we document that similarly, those who have suffered a stroke exhibit a 54.3% higher hazard ratio. Individuals feeling depressed exhibit an 18.1% higher hazard ratio which confirms previous evidence regarding the relationship between depression and mortality (Mykletun et al., 2009). Finally, we document a negative association between the survival hazard rate and educational attainmentFootnote 28.

Figure 1 displays the average estimates of both subjective survival (SSP) and objective survival probabilities (OSP) across different individual age groups. Consistently with the hypothesis of biased survival expectations, we find evidence of biased expectations, both among survivors and the deceased. However, such bias widens over time among the deceased whilst it remains relatively stable among survivors.

Table 1 examines age differences in BSE between deceased and survivors. For each age cohort, the column SSP indicates the average subjective survival probability of reaching target T (between 0 and 100) and the column OSP displays the average objective survival probability (between 0 and 100) estimated from a duration model with the Weibull specificationFootnote 29. For instance, we find that BSE is not significantly different across the age cohorts 65–69 and 70–74. The last two columns in Table 1 display the percentage of individuals revealing both a positive and negative bias. More specifically, we define an individual as exhibiting positive BSE if SSP is higher than the predicted OSP, whereas we classify the reverse as negatively biased. The percentage of individuals with biased survival expectations among survivors is stable, and it only peaks at age 95. In contrast, among the deceased, biased survival expectations peak at age 80. One explanation for the overestimation of survival expectations among the older cohort of individuals is the detrimental effect of ageing on cognitive abilitiesFootnote 30 (Elder 2007). Other explanations include the effect of an age increasing optimism, alongside the presence of other cognitive biases (e,g., overconfidence bias).

Table 1 Comparison of subjective survival probability and objective survival probability by age at baseline

Next, when we estimate BSE by gender (in Table C2 in the appendix). Estimates suggest that consistently with prior research, men are more likely to exhibit biased survival expectations than women for both deceased and survivorsFootnote 31, though the bias is higher in the deceased subsample. That is, although women tend to reveal more negatively biased expectations, deceased women exhibit more positively biased expectations than surviving womenFootnote 32. Similarly, when we examine differences across populations and countries (Table C3)Footnote 33, we find that two thirds of the population exhibit positively biased expectations and one third negatively biased expectationsFootnote 34.

5.2 Explaining biased survival expectations

Following the proposed instrumental variable approach, Table D1 displays the covariates predicting BSE which make up the first stage instrumental variable regressionFootnote 35. Consistently, the four instruments considered are statistically significant and exhibit the expected signs. The effect sizes suggest that the effect of age at death and living status is higher for fathers than for mothers. On average, each year of maternal (paternal) age at death raises BSE by 0.0012 points (0.0017 points). Individuals with a living mother (or father) exhibit higher BSE and more specifically 0.0114 points (0.0248 points) higher BSE. Men are more likely to exhibit biased expectations than women (12.42 points), and each year of life increases BSE by 7.62 pointsFootnote 36. Consistently, those who are coded as having stayed in hospital are more likely to show negatively biased expectations. In contrast, stays at other institutions due to convalescence or rehabilitation are associated with only a slight increase in negatively biased expectations.

Time invariant fixed effects reduce BSE by 30.2 points between waves 2 and 4, which corresponds to the period 2007–2011. Furthermore, estimates suggest that the influence of education is U-shaped, revealing that individuals with no completed education, those with primary education and those with post-secondary and tertiary education exhibit more positively biased expectations than those with lower secondary education. Consistently with previous research (Hurd & McGarry, 2002), we find that income plays a larger influence than wealth, and that high-income people exhibit higher BSE. Each 1,000 PPP units of adjusted income (wealth) increase BSE by 66.4 pp (1.2 pp).

Figure B4 displays the predicted estimates of biased subjective longevity by respondent’s age, gender and parents’ living status. When both parents are alive, BSE increases with respondent’s age (e.g., BSE for men (women) aged 70–74 years old is eight (seven) times higher than BSE for men (women) aged 50–54 years old). In contrast, when both parents are deceased, we find a considerably smaller effect among individuals over 75. Comparing BSE by respondent’s gender, we find that biased survival is higher for men regardless of parents’ living status except for the first age cohort (50–54 years). When both parents are deceased, BSE for men gives rise to an inverted U-shape peaking at 75–79 years, whereas BSE for women remains relatively stable for the three last age cohorts.

5.3 Effects of biased survival expectations on health behaviours

Next, we turn to examine the effect of BSE on health behaviours. Table 2 reports the estimated effect of BSE on a number of health behaviours using instrumental variables. Table B13 in the appendix reports the effect of a one standard deviation increase in BSE and BME. IV estimates suggest that possitively biased survival expectations reduce the probability of unhealthy lifestyles such as smoking, alcohol intake and sedentary lifestyles. This evidence is consistent with the hypothesis that individuals who expect to live longer adjust their behaviours accordingly. Estimates in Table 2 suggest that one standard deviation increase in BSE reduces (increases) the likelihood of smoking (never having smoked) by 13 pp (24.9 pp), which is an average decrease (increase) of 48% (51%)Footnote 37. Similarly, a one standard deviation increase in BSE reduces (increases) the probability of daily alcohol consumption (not having drunk alcohol during the last 3 months or less than once per month) in 19.9 pp (28.8 pp)Footnote 38. These results are consistent with previous evidence that finds that binge and heavy binge drinkers overestimate both their driving ability and their tolerance to alcohol (Sloan et al., 2013). We estimate that a one standard deviation increase in BSE, reduces sedentary lifestyles, raising the probability of moderate or vigorous physical activity in 24.5 pp.

Table 2 Effect of one standard deviation increase (or decrease) of biased survival expectations (BSE) and biased meteorological expectations (BME) over health related behaviors

We find that both positive and negative biased subjective expectations (BSE) exert comparable (and opposite) effects on smoking behaviour. However, we identify heterogeneous effects on other behaviours. In contrast, when we examine the effects of meteorological expectations (ME), we observe a similar sign and consistent effect, but its magnitude, except for “sedentary lifestyles”, is almost negligible.

5.4 Effects on anthropometric health outcomes

Next, we turn to examining anthropometric and health outcomes (Table 3)Footnote 39. Table 3 suggest that an increase in BSE increases the likelihood of a normal weight and decreases the likelihood of obesity. A one standard deviation increases in BSE increases (decreases) the probability of being normal weight (obese) by 17.9 pp (27.8 pp), which implies an increase (decrease) by 47.5% (46.7%) with respect to the average probability of being normal weight (obese). These results confirm previous hypothesis that individuals with BSE generally exhibit better physical health.

5.5 Effects on cognitive abilities

Next, we turn to cognitive abilities. Individuals with positively biased survival expectations can recall more words. One standard deviation increase in an individual’s BSE increases the number of words recalled. Furthermore, we find that BSE increases an individual's maximum grip strength test scores. A one standard deviation increase in an individual’s BSE leads to an increase in the maximum grip strength test by 1.47 kg, or an average increase in 4.3% with respect to the average score. Consistently, negatively BSE deliver a consistent effect with an opposite sign. However, when we look at effect sizes, we find that positively biased expectations (BSE > 0) produce a larger effect than negatively biased expectations (BSE < 0). In contrast, the effect of BME, albeit consistent, is more modest.

Table 3 Effect of one standard deviation increase (or decrease) of biased survival expectations (BSE) and biased meteorological expectations (BME) over health and anthropometric outcomes

5.6 Effects on financial behavioursFootnote 40

We then estimate the effect of biased survival expectations (BSE) on a number of financial behaviours, and more specifically on the probability of owning financial assets, including cash and increased household wealth. We estimate that one standard deviation change in BSE increases the probability of ownership of bonds by 2 pp, mutual funds by 3.8 pp and individual retirement accounts by 5 ppFootnote 41. Consistently, it decreases the probability of having a bank account by 17.4 pp, investing in stocks by 2.3 pp, having a mortgage by 5.4 pp and having other debts by 5.3 ppFootnote 42. Negatively biased survival expectations (BSE < 0) explain some financial investments in an inverse mannerFootnote 43. These results point to an intense deaccumulation of assets, and paradoxically, a simultaneous higher concern for the well-being of the family (purchase of life insurance).

Compared to health behaviours, biased meteorological expectations exert a more modest and divergent effect on a few financial behaviours. If heterogeneity in BME is mostly due to private information, one might expect the absolute difference to correlate with behaviors or lifestyle. For instance, people who work outside the household, or follow the news may get more accurate weather estimates than the authors’ benchmark. As expected, we find that negative and positive ME seem to exhibit opposite effects in a number of behaviours. Indeed, we find that a one standard deviation change in positively (negatively) meteorological biased expectations (BME) increases (decrease) investments in bonds (life insurance) by 13.5% (22.5%) with respect to its average probabilities. It also increases the probability of having a mortgage by 36.92%. In comparison, the effect of one standard deviation increases in negatively biased meteorological expectations (BME < 0) is more modest (10.5% and 14.5%, respectively) (see Table 4).

Table 4 Effect of one standard deviation increase (or decrease) of biased survival expectations (BSE) and biased meteorological expectations (BME) over some financial behaviours

5.7 Effects on financial risk-taking attitudesFootnote 44

Finally, Table 5 reports the effect of BSE on above average financial risk taking attitudes. We document that a standard deviation increase in negatively biased survival (BSE < 0) increases the likelihood of prudent financial behaviours. Finally, on average we find that the effect of positively biased survival expectations (BSE > 0) is twice that of negatively biased expectations (BSE < 0). Furthemore, we document that the effects for BME are in line with those of BSE but again, they appear to be significantly smaller.

Table 5 Effect of one standard deviation increase (or decrease) of biased survival expectations (BSE) and biased meteorological expectations (BME) over the extent of financial risk that the respondent is willing to take when making investments

6 Robustness checks

6.1 Focal responses

One potential explanation of our results is the presence of focal responses. Focal responses refer to expectations clustered around certain ‘focal points’ of the distribution (Hurd & McGarry, 2002). From the histograms depictions (Figure D1), it becomes clear that this is indeed the case in our study. For example, 17.2% of the deceased reported a null probability, 20.8% of the survivors and 18.9% of the deceased sample reported a survival probability of 50%, and 17% of the survivors reported a probability of 100%. Some authors suggest that these responses may are suggestive of the individual struggle with answering probabilistic questions (Bruine de Bruine et al., 2002), while others emphasize their valuable information (Manski & Molinari, 2010). Bruine de Bruin et al. (2002) coined the term “epistemic uncertainty” to describe the behaviour of respondents who reported a 50 per cent probability, and argue that in this case, the response would be treated as a "don't know".

Other research claims that the likelihood of providing focal points is positively associated with lower cognitive performance. Perry (2005) found that individuals reporting focal responses of zero and one were on average less educated, held fewer assets and had lower income than the rest of the sampleFootnote 45. Such evidence is validated in the estimates in Table D3 reporting cognitive and economic related correlates of focal and non-focal respondentsFootnote 46.

As a robustness check, we have re-estimated the model excluding focal responses. Table B1 displays the characteristics of the sample for survivors and deceased after excluding focal responses (0, 50 and 100)Footnote 47. The estimated coefficients and hazard ratios for the preferred Weibull specification are reported in Table B8 (3rd column). Our estimates suggest that the magnitude and significance is very similar to the original sample. Figure D2 compares the density function for the “biased survival” indicator including and excluding focal points. Again, differences between both density functions are almost negligible, which suggests that the inclusion of focal responses are consistent with our previous estimates (as has been also observed by Kleinjans & van Soest, 2014).

6.2 Respondents’ understanding of probabilities

A prerequisite for the consistent interpretation of subjective survival expectations is that respondents can answer probabilistic questions (Manski, 2004). Hence, as an additional robustness check, we have excluded those individuals who provided an erroneous response to the following probabilistic question: “If the chance of getting a disease is 10 per cent, how many people out of 1000 (one thousand) would be expected to get the disease?Footnote 48. Again, compared to the baseline coefficients and hazard ratios, estimates resulting from the re-estimated model are of similar magnitude. Figure D3 shows the kernel density function of the “biased survival” indicator for both the total sample and the reduced sample after applying the exclusion criteria. Importantly, the shape of both density functions does not suggest evidence of bias.

6.3 Use of (private) medical information from SHARELIFE

Another concern  refers to the imputed parental age at death when parents are alive at the time of the survey, given that parental age at death is not time varying. Hence, we use retrospective data from parental vital status available in SHARE Wave 3, known as SHARELIFEFootnote 49. We can test for the effects of private information with regards to one’s own health consistently with previous studies (Viscusi & Hakes, 2003). Accordingly, we have re-estimated our models using two additional instruments: blood pressure being checked during the previous year, and having his/her blood testedFootnote 50, as both questions proxy some form of private health information. Consistently, we find that the magnitude and significance of the estimated coefficients compares to that of the entire sampleFootnote 51.

Table D1 (4th column) reports the results of the IV estimates for BSE, and we find that the values of the corresponding instruments compare to those using the entire sample. Having blood pressure checked in the last year reduces BSE in 11.6 and having blood tested in the last year reduces BSE by 8.1 pp. However, Table D4 reveals that even after such adjustment, the effect of the BSE confirms our previous conclusions.

Figure D4 compares the observed BSE for the total sample, with the predicted BSE for the subsample of respondents who also completed SHARELIFE using all instruments (parents' age of decease, blood pressure and blood check) and the predicted BSE using only two instruments (blood pressure and blood check; 5th column of Table D1). Although the differences are not large, we find that when all the instruments are used, the predicted BSE is closer to the observed BSE.

6.4 Instrument validity

In interpreting the effect of BSE as the difference between objective and subjective survival expectations, one potential explanation is that private information (unknown to the econometrician) influences expectations above and beyond other more general biases such as optimism. To examine these concerns more closely, Figure D5 compares the observed BSE, the predicted BSE (according to estimations of Table D1) and the predicted BSE taking the average for each survey respondent. Consistently with the hypothesis that BSE reflect partially an effect of unknown private information, we observe that, for both survivors and deceased samples, the predicted BSE is smaller but still significant and exhibit a large effects size, in line with previous estimates.

To check the reliability of our instruments, we re-estimate the first-stage regression using lagged instruments. Although the sample size is considerably reduced, Table D5 shows that the instrumental variable estimatesFootnote 52 are relevant, significant, and robust. Similarly, to enhance the causal claims of our estimates, we have then examined the two bound weak instrument method proposed by Conley et al. (2012)Footnote 53. Figure E1 displays the estimates using “father’s age of death” as the instrument (similar results have been obtained for the other instruments, available upon request). On the left side we show the graphs for the Local-to-Zero approach. The solid line depicts the 2SLS father’s age of death effect estimate for the respective indicator (for simplicity, we only show the results for one indicator of each group). The two dashed lines depict upper and lower limits of the respective test scores. Overall, the results confirm that even with substantial deviation from the exclusion restriction, the instrument still has a considerable effect over the outcome variableFootnote 54.

Finally, Figure E1 also displays the 95% confidence bounds of the instrumental variable’s coefficient using father’s age of death. Taking as reference the zero-line (red-line), once the confidence bounds include the zero-value, the 2SLS are no longer significant at the 95% level of significance. In contrast, if the upper limit crosses the zero-line at a high value of \(\updelta\), then the 2SLS estimates are robust to possible violations of the exclusion restriction assumption. Similar figures have been obtained for the instrument “mother’s age of death” (figures available upon request).

6.5 Inclusion of parental characteristics in the discrete hazard model

As a final check, we have re-estimated the discrete hazard model including as explanatory variables the age at death of the parents and the fact of having a living parent. Table C1 shows that, once again, the Weibull model is the preferred model. Using this specification, Table E2 displays the estimated coefficients and hazard ratios for the initial sample, excluding focal responses, those who answered probabilistic questions erroneously and the subsample that answered SHARELIFE. Although the variable measuring the presence of a living mother and father shows now non-significant effects, age at death reveals the expected negative effect on the probability of death.

7 Mechanisms

The Nature of Private information. To examine whether BSE convey the effect of respondent's private information, or measurement error, we estimate the effect of lagged BSE alongside the death of a brother, sister, or child as other form of private information in addition to one’s parents. The results (Table E1 in the appendix) show that the effect of lagged BSE is significant at 1% and such significance is preserved when the explanatory variables are progressively included, and when the death of the brother, sister or child is added. However, the death of a sibling is not significant at 5% level, nor is the death of a child. Hence, we conclude that the nature of private information matters. More specifically, only parental longevity plays a significant role in expectation formation.

Finally, we examine the effect of two additional pathways, one is the effect of BSE on social contacts, as well as the individual sense of control of one’s life, so called locus of control.

Social contacts. Individuals with BSE who as a result engage in more social contacts are less likely to exhibit sedentary lifestyles. We estimate Eqs. (4) and (8) using as the dependent variable a binary indicator that takes the value 1 if the respondent has participated in the last month in voluntary or charity work, attended a sports or social club, taken part in political community activities, as well as event organised by religious organization, attended educational or training courses, and 0 otherwise (see Table A3 for definitions). Table B17 reports the estimated coefficients and Table B18 reports the effect of a one standard increase of BSE and ME. We find that one standard deviation change in BSE increases the probability of social contacts by 6.08 pp (or 38.7% with respect to the mean probability). Similarly, we find that one standard deviation change in BME increase social contacts by 5.8 pp (37% with respect to the mean probability).

Locus of control. Another potential driver of our results can be explained from changes in an individual’s sense of control of their life as they age, which we refer to as locus of control (see Rotter 1954). We estimate Eqs. (4) and (8) using as dependent variables four binary indicators that take the value 1 if respondent answers that they feel that things are out of control: often, sometimes, rarely, and never. Table B17 reports the estimated coefficients and Table B18 reports the effect of a one standard increase of BSE and BME.

Consistently, we find that BSE is associated with a reduction in the probability of losing control. A Positive BSE increases the probability of people feeling “things are out of control: often” and increases the probability of feeling “things are out of control. In contrast, negative BSE gives rise to a large increase in the probability of often feeling overwhelmed, whilst the effects of BME is not as robust as those of BSE. These results confirm that BSE may give rise to active coping strategies and the feeling of control of their life (Lopes & Cunha, 2008).

8 Conclusion

We study the formation of biased expectations of both one's own survival and the weather, and we examine the extent to which they affect health and financial behaviours. That is, drawing from individuals level data we have estimated the (positively and negatively) bias in survival expectations (BSE), computed as the difference between actual and subjective survival using individual level longitudinal evidence of individuals in a list of European countries. Similarly, we estimate a measure of biased meteorological expectations (BME) comparing the difference between individual level weather predictions and the actual weather realisations. Next, we draw on an instrumental variable analysis that takes advantage of rich data on parental survival to estimate the effect of BSE on health and financial behaviour, which we compare to the effect of BME.

We document evidence of a bias between subjective and objective survival expectations across domains (survival and the weather), and we estimate that such domain specific bias heterogeneously influences health and financial behaviours. We further find a symmetric effect of positively and negatively biased survival expectations on behavioursFootnote 55. BSE predicts some health and financial behaviours such as smoking, sedentary lifestyles, purchasing life insurance and saving for old age. However, BME does not influence health behaviours in the same magnitude. One standard deviation change in BSE is associated with an increase in the probability of healthier behaviors (such as never having smoked by 51%, not having used alchol in the last 3 months by 94.8%). Similarly, one standard deviation change in BSE increases the probability of taking above average financial risks, or the probability of expecting to earn above average returns (83%), investing to a larger extent in fixed income securities, such as corporate bonds, mutual funds (mostly in bonds), and individual retirement accounts.

In contrast, BME increases the probability of owning bonds (13.49%), holding mortgage (36.9%), but decreases the probability of purchasing life insurance (-22.5%). These results are consistent with evidence that biased expectations are domain specific. A clear policy implication emerges if more positive biased expectations can be learned. The latter might result from access to private information, as well as the influence of some cognitive biases such as optimism though our estimates cannot fully distinguish between the twoFootnote 56.

Our estimates are consistent with Wang and Sloan (2018) who draw attention to the negative effects arising from the underestimation of the risks associated with non-adherence to a prescribed treatment. In their analysis of patients diagnosed with diabetes, they note that many patients were not conscientious in their adherence to medical recommendations. Similarly, Picone et al. (2004) document that women with biased survival expectations are more likely to perform breast self-exams and to request Pap smears and mammograms consistently with our estimates suggesting positive effects of BSE in increasing the probability of a  "healthy" lifestyle, (i.e., smoking, drinking, physical exercise). Hence, we conclude that BSE vary across domains and exert economically relevant effects on health and financial behaviours.