Introduction

The Health and Consumer Protection Directorate-General [12] have argued in their guidelines for “Quality Assurance in Breast Cancer Screening and Diagnosis” that almost 10 in every 100 individuals (9 %Footnote 1) from 1985 to 2000 can be saved thanks to screening policies particularly targeting specific sites: breast, cervical and colorectal cancers.Footnote 2 However, we do not fully understand how tumor detection through screening and treatment impacts patients’ daily lives. A review of the literature points out that authors mainly focus on coping strategies of patients who have (recently) been diagnosed with cancer. They do not, or only to a limited extent, consider direct measures of subjective psychosocial well-being (see Sect. “Background”). Moreover, previous studies dealing with coping strategies or the financial implications of tumor detection have various methodological shortcomings [30]. These methodological shortcomings include (Cook and Campbell [11]): sample selection issues; limited sample size; descriptive research or estimating correlations; omitted variables bias; and external validity. Research into tumor detection and treatment and its effect on individual outcomes often relies on administrative or survey data from hospitals or clinics. It also relies on matching estimation techniques that can identify an appropriate counterfactual outcome for every individual case based on a set of observable characteristics.

This paper contributes to the previous literature in at least three ways. First, we use a national representative panel with information both after and before tumor detection. In particular, studies using the follow-up of various health and labor-related outcomes before tumor detection are, to our best knowledge, rare in the economics literature, as one cannot anticipate tumor detection. Only two papers, Moran et al. [21] and Heinesen and Kolodziejczyk [16], use an alternative control group made up of currently healthy individuals who developed cancer 5 years later. But this paper benefits from data from a national representative panel. It consists of individuals who were selected from the population register by Statistics Netherlands, and combines rich health survey data with administrative data on work and income on a year-to-year basis [28]. We argue for using panel data estimation techniques instead of matching or correlational analysis, and will use both techniques in this paper in order to support our arguments. Panel data techniques allow us to construct a ‘control group’ from the data by using individuals in the year before diagnosis who have by construction no prior knowledge of their disease status in the upcoming year. Second, we include a varied set of measures of subjective well-being available in the panel, namely: self-reported health, self-assessed happiness, anxiety, depression, and hindrance at work. With respect to financial well-being, we look at the ‘traditional’ variables like labor force participation and (log of) wages. And third, the data allow us to distinguish similar outcomes by gender. Previous literature indicated that, owing to the small sample sizes, splitting up the results by gender was difficult (e.g. [21]).

Background

Subjective psychosocial well-being

We searched for articles meeting a set of inclusion criteria, explained below, that do not necessarily cover the whole range of the literature on the relationship between cancer and psychosocial well-being, as this was considered beyond the scope of this article. It was of primary importance that we position our problem statement of the effects of tumor detection on subjective psychosocial well-being in the literature, as well as to search for articles using a systematic approach. As such, we started our literature review with the following keywords in PubMed and Google Scholar: “anxiety”; “cancer”; “diagnosis”; “happiness”; “hospitalization”; “life satisfaction”; “psychosocial”; “well-being”; “quality of life”; and “tumor” or “tumour”. We also searched for articles in, for instance, Science Direct. However, most of these articles deal with the effects of medical or treatment implications of being diagnosed and treated for cancer instead of measures of subjective well-being. Therefore, we decided to start mainly from the PubMed and Google Scholar search results. The second inclusion criterion for the literature search is publication between the years 2000–2014. Third, we only retained articles that had been published in peer-reviewed journals, and that were written in the English or Dutch language. Furthermore, the articles should especially deal with cancer or tumor detection, and its impact on subjective psychosocial well-being, preferably in the short-run. Doing so, we retained only seven articles that met all four inclusion criteria (Table 1). Of these 7 articles, 6 dealt with breast cancer (BrC).

Table 1 Tumor detection and treatment and its relationship with psychosocial well-being

Most studies focus on “coping strategies”, for example how cancer patients cope with their illness in daily life. A survey study at one hospital in Thailand presents several coping strategies [31]. Among these strategies, seeking social support had been indicated the most. Sammarco [27] discusses the positive correlation between (perceived) social support and quality of life of younger breast cancer (BrC) patients in the US. Other evidence from the US (Chicago) from patient surveys indicates that social support is highest close to tumor diagnosis, but then it significantly decreases over time [3]. Kotkamp-Mothes et al. [20] add that, in some cases, patients refuse social support, particularly the elderly, if it is an impediment to their autonomy.

There is one recent qualitative study from Raque-Bogdan [22] that deals with the coping of young BrC survivors. She indicates that this group significantly differs from the elderly, as they are in need of reintegration into the labor force after treatment. From the interviews, the author synthesized that these young women increasingly seek purpose in life, sometimes in their private life, but most often in their work. However, Raque-Bogdan [22] also indicates that this is not always possible due to difficulties with changes in work and financial insecurity. Another study [2] confirms this purpose-seeking behavior as a coping strategy, and explores the role of religion. The authors point to the meaningfulness of religion in reducing the risk of depression. Reddick et al. [23] conducted a clinical trial in the US, also to explore the role of several coping strategies in depression as well as anxiety and fatigue. They also argue that ‘better’ coping strategies go hand-in-hand with increased psychosocial well-being.

Based on Cook and Campbell [11], we indicate that the evidence and the methodological quality of the traced research studies in Table 1 are rather poor: the sample size is low, and most studies violate external validity by reporting on data from one hospital (e.g. [31]) or two hospitals (e.g. [3]). Making inferences about the internal validity is difficult, as the surveys used in the studies are not available. Furthermore, there is only one randomized, controlled, clinical trial [23].

Work and income

Starting from the literature review of Steiner et al. [30], we undertook a different review strategy than described in subsection “Subjective psychosocial well-being”. In fact, we only added four articles that were more recently published in peer-reviewed journals to the review of Steiner and co-authors. These articles have been collected following the same approach and inclusion criteria as previously discussed.

Before turning to the results summarized in Table 2 we first describe an earlier US study of Bradley et al. [6]. This study focuses on breast cancer as a single disease as well as its impact on employment decisions and earnings up to 3 years after the initial diagnosis. The authors find that women who survive up to 3 years after initial diagnosis, are less likely to be in the labor force (about 50 % compared to over 60 %). However, those who are still in the labor force work more and earn higher wages than their peers without breast cancer. They discuss that the way health insurance coverage is provided in the US largely drives their results. Moran et al. [21] point to the relatively old ages (mean age of 56) of individuals in the sample used in Bradley et al. [6]. They add to the US literature by focusing on the effects of surviving cancer on the labor market outcomes of individuals aged 28–54. Their case group is compared with a control group constructed from the Panel Study of Income Dynamics. Moran et al. [21] indicate an overall reduction in the employment rate and hours of work of comparable magnitude to those estimates found for older workers in Short et al. [29]. This overall reduction in the employment rate of cancer survivors may be owing to early retirement (e.g. [9]). Moran et al. [21] additionally find that, in the long-run, individuals who relapse have worse labor market outcomes. Workers with relapses may benefit more from employment support services and workplace accommodation, as indicated by Short et al. [29].

Table 2 Tumor detection and treatment and its relationship with work and income

In contrast to the literature review of subsection “Subjective psychosocial well-being”, Table 2 presents studies with better methodological quality, especially when the authors combine hospital data with administrative and/or survey data (e.g. [29]). However, several issues remain unresolved. First, respondents self-select themselves into the data, as they are only included in case of illness and treatment at the hospital (e.g. [24]). As a result of this data collection method, there is no information available about past events without the respondents having prior knowledge about their health status. Information obtained at the time of the first medical consultation may be severely biased by having this knowledge. A second issue arising in most case–control studies is the absence of a good control group. Often, researchers rely on matching techniques that can identify an appropriate counterfactual outcome for every individual case based on a set of observable characteristics (e.g. [9, 21]). The set of control variables often only captures information on limited background characteristics (e.g. Heinesen and Kolodziejczyk [16]). But even with good control variables, matching analysis does not appropriately deal with the influence of genetic susceptibility, particularly in a study on why individuals develop cancer. As such, finding a good control group is almost an impossible mission (see also [30]).

Empirical strategy

The best control group one can think of is the case individual his- or herself, but in a “healthy” state. In fact, tumor detection can be considered a sudden shock, which seriously can affect the individual’s well-being in the year of detection, especially compared with the year before [6, 7]. Two important assumptions underlie this empirical strategy. First, observed and unobserved background characteristics may not determine the level and change of well-being (i.e. endogeneity). We deal with this econometric challenge by estimating an individual and time fixed effects model. This estimation model controls for information intrinsic to the individual and time constants. And second, individuals may not anticipate tumor detection in the year before the actual diagnosis is made. Bradley et al. [6, 7] already argued that tumor detection can be treated as a random event. We check for this assumption in Sect. “Robustness of the results”.

Modeling well-being

We model the well-being function \(W_{\text{it}}^{*}\) as (Groot and van den Brink [14]):

$$W_{\text{it}}^{*} \sim \left( {{\text{HS}}_{\text{it}} ;\;Y_{{i\left( {t - 1} \right)}} ;\;X_{\text{jit}} } \right),$$
(1)

where W it denotes individual i’s well-being at time t; HS it the health status; Y i(t−1) income in the year before diagnosis; and X jit a vector of \(j \in \{ 1,2, \ldots ,J\}\) background characteristics at time t.

In the case of tumor detection, HSit depends on:

$${\text{HS}}_{\text{it}} \sim \left( {D_{\text{it}} ;N_{\text{it}} ;H_{\text{it}} } \right),$$
(2)

where D it denotes detection with 1 ‘tumor detected’, and 0 otherwise; \(N_{\text{it}} \in \{ 0,\;1\}\) the nature of the tumor with 1 ‘benign’, and 0 ‘malignant’; and H it information on hospitalization.

As in Groot and Maassen van den Brink [14], it is assumed that well-being is a linear function of lagged income (Y it−1), health status (D it; N it; H it), and other background characteristics (X jit). We then may estimate:

$$W_{it}^{*} = \gamma_{0} + \gamma_{1} \left( {D\left( 1 \right)_{it}^{*} \times H\left( 0 \right)_{\text{it}} } \right) + \gamma_{2} \left( {D\left( 0 \right)_{\text{it}} \times H\left( 1 \right)_{\text{it}} } \right) + \gamma_{3} \left( {D\left( 1 \right)_{\text{it}} \times H\left( 1 \right)_{\text{it}} } \right) + \gamma_{4} N_{\text{it}} + \gamma_{5} \;{\text{Age}}_{\text{it}} + \gamma^{6} Y_{{{\text{it}} - 1}} + \mathop \sum \nolimits \delta_{j} X_{\text{jit}} + \varepsilon_{\text{it}} ,$$
(3)

by using individual fixed effects models.

Depending on the nature and stage of the tumor, we expect large heterogeneity of the treatment effect. For instance, tumors that are considered benign do not always have to be removed, and tumors that are diagnosed as malignant do not always imply severe medical therapy. Therefore, three interaction terms are included in Eq. (3): (1) tumor detection without hospitalization \(D\left( 1 \right) \times H(0)\); (2) only hospitalization \(D\left( 0 \right) \times H\left( 1 \right)\); and (3) tumor detection including hospitalization \(D\left( 1 \right) \times H\left( 1 \right)\). Hence, the reference category is \(D\left( 0 \right) \times H\left( 0 \right)\), i.e. no tumor detection and no hospitalization.Footnote 3 Age is also included in Eq. (3), so as to control for the negative relationship between age(ing) and well-being. Note that, by estimating Eq. (3), time invariant X jit is dropped because of obvious reasons of multicollinearity.

Groot and Maassen van den Brink [13] argue that measures of self-reported life satisfaction (e.g. happiness) are affected by preference drift. Or else, in case individuals experience a life-changing event, such as tumor detection, they likely mirror their personal/income situation with those of others in the same situation. We account for preference drift in two ways. First, our results with respect to psychosocial well-being are controlled for income in the year before diagnosis Y it−1 , in particular, we cluster individuals by using dummies of the lagged values of the net personal income categories. Second, both models on well-being, work and income include the variable ‘nature of the tumor’.

Modeling earnings

We model the earnings function \(Y_{\rm {it}}^{*}\) as [6]:

$$Y_{\text{it}}^{*} \sim \left( {{\text{HS}}_{\text{it}} ;S_{\text{it}} ;{\text{HI}}_{\text{it}} ;X_{\text{jit}} } \right),$$

where S it denotes the duration of surviving a tumor diagnosis to date; HIit information on health insurance; and X jit individual, family, neighborhood characteristics and life-style factors. As we focus on the short-run, S it is equal to 1 year for all individuals.

To estimate the earnings function, we replace \(W_{\text{it}}^{*}\) by \(Y_{\text{it}}^{*}\), and include age as a continuous variable. Bradley et al. [6] suggest additionally controlling for an age dummy, instead of its squared term, so as to account for the age at which individuals retire. We include an age dummy indicating 65+, as this was the retirement age in the Netherlands before 2013.

Contrary to Bradley et al. [6], we do not include information on health insurance into the convenient regression. Inhabitants of the Netherlands can change their coverage only once a year on January 1. Consequently, HIit should drop out of estimation owing to multicollinearity. If not, this would directly falsify our assumption of no anticipation (Sect. “Robustness of the results”).

Data

We use data of the LISS (Longitudinal Internet Studies for the Social Sciences) panel administered by CentERdata (Tilburg University, the Netherlands). The LISS panel is a representative sample of Dutch individuals who participate in monthly Internet surveys. The panel is based on a true probability sample of households drawn from the population register. Households that could not otherwise participate are provided with a computer and internet connection. A longitudinal survey is fielded in the panel every year, covering a large variety of domains including work, education, income, housing, time use, political views, values and personality. More information about the LISS panel can be found at: http://www.lissdata.nl [28].

The LISS data on health include 7000–9000 households each year. There have been six waves in total over the period 2007–2012. In the first wave of the year 2007, 8478 households (100 %) were asked to fill in the questionnaires on health and well-being. CentER data and Statistics Netherlands select households to participate in the study. As such, households cannot self-select themselves into the survey. The response rate was 78.9 % (6625 households) in 2007. Overall, there was a high response rate in 2008 (72.0 % of 8280 households), in 2009 (66.7 % of 9170 households), in 2010 (77.6 % of 7364 households), in 2011 (77.6 % of 6533 households), and in 2012 (85.4 % of 6769 households). As such, the data include 44,741 observations. After elementary data cleaning of the variables further used and discussed in this paper, we still kept 33,870 observations in the sample. Most of these observations are the head of the household (53.9 %), wedded partner (29.1 %), or unwedded partner (5.5 %). There is some information available about a child living at home (10.6 %), or any other kind of family member or housemate (about 1.0 %).

Each year in the month of November, respondents were asked to answer the following question: “Has a physician told you this last year that you suffer from: (1) cancer or malignant tumor, including leukemia or lymphoma, but excluding less serious forms of skin cancer; and/or (2) skin tumor, polyps, angioma”. Those respondents, who answered “yes”, constitute the case group, a total sample of (N = 1102) observations. Table 3 presents the total number of individuals from which we know the disease status in year t and in year t − 1. Whereas the mean observed time for those respondents with a tumor diagnosis is 2.7 years (SD 1.5, minimum 1 and maximum 6), we only lose respondents who are only observed once in the data, in total 252 observations. The final sample of case–control individuals then consists of (N = 850) observations, of which exactly 425 are unique individuals before and after tumor detection (see Table 3).Footnote 4 Tables 10 and 11 in the Appendix summarize the rich data we have on background characteristics.

Table 3 Control group—follow up

In our data, there are about 85 individuals each year who are diagnosed with a tumor, with 54 % having a benign tumor (and therefore 45 % having a malignant tumor). This corresponds to an average cancer incidence rate of about 677 per 100,000 individuals each year 2008–2012. For reasons of comparability, the official Dutch statistics of the cancer registry (cijfersoverkanker.nl) are presented in Table 4. The cancer registry indicates an average cancer incidence rate of 584 per 100,000 individuals each year over the same period. These numbers are highly comparable with those of LISS. The Kolmogorov–Smirnov equality-of-distributions test looks at the distribution of cancer incidence observed in the LISS sample and compares it with the distribution of cancer incidence from the official Dutch statistics. The results from this test confirm the comparability of both samples (D = 0.6000; P value = 0.329).

Table 4 Comparability of Statistics Netherlands and Cancer Registry and the LISS panel data

Table 5 summarizes the outcome variables including comparability statistics. The descriptive statistics of the control variables including the individual background characteristics of the respondents and several determinants of life-style are available in the “Appendix”.

Table 5 Outcomes, treatment variables and comparability statistics (full sample, N = 850)

This paper uses direct measures of subjective psychosocial well-being, such as: self-reported health, happiness, anxiety, and hindrance at work. The choice for this set of variables in order to measure subjective well-being is based on a long-standing tradition of (health) economists (for an elaborated discussion on this, see Kahnenman and Krueger [17]). Briefly, it is argued that direct measures of subjective well-being, such as self-reported happiness, are particularly useful in the measurement of social welfare. For example, Gruber and Mullainathan [15] use self-reported happiness as an outcome measure in their study on the effects of cigarette taxes on subjective psychosocial well-being. Kling et al. [19] evaluated the effects of a program that offered housing vouchers to high-poverty families living in the US on self-reported physical and mental health. These health states were measured by survey questions directly asking about feelings of being calm and peaceful, anxiety, and depression. Kahnenman and Krueger ([17], p. 4) further discuss that individuals’ perceptions of their experiences are most accurately measured when reported close to the actual experience. As the survey asks the respondent exactly the same question in two subsequent years in the month of November, we argue that our measures of well-being will reflect the way people feel about experiences that are still fresh in their minds (see also [6]). Further, note that we prefer direct measures of subjective well-being above composite measures such as quality of life, or life satisfaction, for two reasons. First, measures like quality of life are often composed from many different underlying questions aiming at grasping the full concept. These questions often differ from survey to survey, and, therefore, the measure is subject to problems with internal and often also external validity and reliability. Second, quality of life and life satisfaction can be approximated by direct measures (e.g. like self-reported happiness or hindrance at work) that do not suffer, or only in a limited way, from these issues (Cook and Campbell [11]). Even though reliability of direct measures can be subject to context, mood, and duration neglect, “the idiosyncratic effects of recent, irrelevant events are likely to average out in representative population samples (Kahnenman and Krueger [17], p. 7)”.

First, we discuss the variable self-reported health that has been derived from the question: “How would you describe your health, generally speaking?” Respondents could give their answer on a five-point Likert scale ranging from poor (answer = 1) to excellent (answer = 5). We observe a significant overall decline in self-reported health status before and after diagnosis.

Next, the respondents were asked about their feelings and emotions over the past month by answering the question: “The following questions are about how you felt over the past month. For every question, please choose the answer that best describes how you felt during this past month. This past month…”. This question was asked for: happiness “I felt very happy“, and anxiety “I felt very anxious”. Respondents could answer on a five-point Likert scale ranging from never (answer = 1) to continuously (answer = 6). From Table 5, we observe an overall decline in happiness and an overall increase in anxiety in the year of tumor detection.

Third, we also consider a question related to work: “To what extent did your physical health or emotional problems hinder your work over the past month, for instance in your job, the housekeeping, or in school?”. Respondents could answer on a five-point Likert scale ranging from not at all (answer = 1) to very much (answer = 5). Individuals increasingly find that their physical health or emotional problems hinder work, the housekeeping or school, illustrated with a change from 0 to 1 in disease status.

Fourth, we provide information on personal net monthly income (in logarithm) and household net monthly income (in logarithm) for the full sample. We do not observe any significant difference before and after tumor detection.

The variables with respect to nature of the tumor ‘benign’, and hospitalization ‘hospital’, are also summarized in Table 5. First, out of all of the tumors diagnosed, roughly half are benign. Of course, in the control group no (0.00 %) tumors are diagnosed. Second, respondents were asked the following question: “Did you spend any time in hospital or a clinic over the past 12 months?” In total 12.0 % (N = 51) of the respondents said “yes” if (D = 0), and 43.1 % (N = 183) if (D = 1). The respondents were also asked how many days they had to stay in the hospital throughout the year. The control group reports less than 1 day, while the case group reports 2.5 days. In addition to this information, respondents could indicate whether or not they were hospitalized for an operation over the past year. In the control group, only 6.6 % said “yes”, while it was 33.4 % in the case group.

In order to check for overlap between the variables selected for the analyses, we have estimated a Pearson correlation matrix (available from the authors upon request). As expected, we observe a significant correlation between the variables. Highest correlation is found between the variables measuring self-reported health and hindrance at work and unable to work and hindrance at work. However, all of the correlations range from low to medium so that we conclude that each of the chosen variables still has a lot of variance unexplained, and, as such, add to the analyses.

Main results

Table 6 summarizes the results of four models by gender with respect to: self-reported health (model 1); happiness (model 2); anxiety (model 3); and the extent to which the individual’s health problems hinder work, housekeeping or study (model 4). All of the outcome variables have been standardized, so that the results can be expressed as effect sizes (ES). The individual and time fixed models have been estimated using the ordinary least squares estimator. Robust standard errors are reported between brackets.

Table 6 Impact of tumor detection and treatment on subjective psychosocial well-being (estimation output of the fixed effects model)

First, consider the interaction effect between D(0) × H(1). Note that D(0) × H(0) is the reference category. Across the models, we do not find any significant effects of cancer treatment on self-reported health, happiness, and hindrance at work. The results, illustrated in model 3, indicate that, compared to the control group, hospitalization significantly increases anxiety among men with (ES = +0.8478).

Next, consider the interaction effect between D(1) × H(0). Tumor detection significantly decreases self-reported health among women (ES = −0.7862) and among men (ES = −0.5650). Consequently, tumor diagnosis has a moderate to large impact on self-reported health status. Note that, not controlling for the nature of the tumor, ‘benignʼ, we find a small positive effect size of +0.2826 in the model on happiness. As such, women who have been diagnosed with a benign tumor are considerably happier in the year of diagnosis compared to the control group (for a discussion, see also [1].

Third, consider the interaction effect between tumor detection and hospitalization [D(1) × H(1)]. Overall, for men, we observe a moderate to large impact of tumor detection and treatment on health (ES = −0.6727), happiness (ES = −0.4524), anxiety (ES = +0.7967), and hindrance at work (ES = +0.6266). For women, we only observe a significant effect on health status (ES = −0.7442) and hindrance at work (ES = 1.0251).

We conclude that, particularly among men, tumor detection and treatment have a strong and significant negative impact on emotions and feelings of happiness and anxiety. However, compared with the control group, the impact of tumor detection on hindrance at work is higher for women than for men.

Table 7 summarizes the results for work and income by gender. Four models have been estimated, namely: at least 1 day per month unable to go to work, perform housekeeping or study (1 = at least 1 day) (model 1); labor status (has no paid labor = 1) (model 2); log of net personal income (model 3); and log of net monthly household income (model 4).

Table 7 Impact of tumor detection and treatment on work and income (estimation output of the fixed effects model)

Again, we first consider the interaction effect D(0) × H(1), and observe that, for women, hospitalization does not significantly impact labor force participation or income. However, it does impact the ability to work for men (−17.16 % points).

Next, the effect of tumor detection without hospitalization is considered [D(1) × H(0)]. Compared with the control group, the results indicate that women are less able to work (−24.28 % points), or are increasingly likely to drop out of paid labor (+12.12 % points). The models do not report significant estimates for men.

For women, the same picture is sketched when hospitalization follows tumor detection [D(1) × H(1)], namely: women are less able to work (−41.90 % points) and are increasingly likely to drop out of the labor force (+10.61 % points). The results also indicate a significant negative impact on menʼs ability to work (−15.69 % points).

To conclude, overall, no significant impact of tumor detection and treatment [i.e. D(1) × H(1)] is found on personal or household income. Only for women without a partner, we estimated a significant decrease in household income by about −17.33 %.

Robustness of the results

Can matching analysis work?

The case individuals used for matching analysis are the same individuals as those of Table 3. Each year between the years 2007–2012, we have a large set of control individuals. Case individuals are then matched to “healthy” individuals based on a rich set of background characteristics X jit (Tables 10 and 11 in Appendix) and by year. We deal with propensity score matching by using the convenient nearest neighbor matching (NNM) estimator without replacement and with random sorting of the data. The probit model is used for obtaining the propensity scores. After applying the propensity score matching, the ordinary least squares estimator is again used on the matched sample for model estimation. The standard errors are clustered at the individual level. The results of the NNM estimation models are summarized in Tables 8 and 9.

Table 8 Impact of tumor detection and treatment on subjective psychosocial well-being (estimation output of the nearest neighbor matching model)
Table 9 Impact of tumor detection and treatment on work and income (estimation output of the nearest neighbor matching model)

In general, the estimates of the matching models with respect to psychosocial well-being are larger in magnitude in Table 8 than in Table 6. It seems that matching analyses overestimate the ‘trueʼ effects of tumor diagnosis. The overall picture of the matching models with respect to work and income is somewhat blurred. The estimates of Table 9, compared to those of Table 7, are underestimated for women, but overestimated for men. These findings could merely reflect the lack of comparability between the control group and the treatment group. Therefore, we check the assumption of common support (see Fig. 1). The assumption implies strong overlap in the covariate distribution of the case and control group, so that for each case individual a comparable control individual can be allocated [25, 26]. Having a rich set of individual and life-style determinants (Tables 10 and 11 in Appendix), Fig. 1 plots the overlap in covariate distributions by probability of treatment assignment. We observe that these covariate distributions do not overlap properly owing to the small number of control individuals available for high propensity scores. Consequently, case individuals rely on a small set of control individuals at the middle and upper tail of the propensity score values. In line with the work of Black and Smith [5], we argue that the support condition holds only weakly. A potential solution offered by Black and Smith ([5], p. 110 and p. 118) is then to only estimate the impacts of tumor detection for the “thick support” region in order to reduce selection (into the treatment) bias. In this region we observe a substantial number of observations in both the case and control group. Using the specification of Black and Smith [5], in our case, the thick support region is defined as those propensity score values below \(\hat{P}(X) < 0.20\). Indeed, the majority of case individuals with relatively high propensity scores are then dropped from the analysis. Other matching estimators that weigh the control individuals in the matching process (e.g. the Epanechnikov kernel-matching estimator) do not significantly alter the estimates nor do they offer a solution to fit common support. We argue in line with Black and Smith [5] that it is not an option to allocate higher weights to control individuals who are less similar to case individuals (i.e. above \(\hat{P}\left( X \right) > 0.20\)) in order to construct appropriate counterfactual outcomes. These results, including balancing tests, are available from the authors upon request. We conclude that our findings do not support matching analysis.

Fig. 1
figure 1

Check for the assumption of the common support by using matching analyses

Table 10 Information on individual characteristics (full sample, N = 850)
Table 11 Information on life-style (full sample, N = 850)

Can anticipation bias the results?

It often takes several years before cancer is (or can be) detected. As such, individuals can already anticipate their diagnosis in the years before detection by, for example, changing their health insurance coverage or adapting their life-style. It is expected that the ‘trueʼ impact of tumor diagnosis is then underestimated. We test for anticipation by looking at short-run changes in health insurance coverage and life-style.

Health insurance coverage

In the Netherlands, health insurance is provided by privately owned health insurance companies, and basic coverage is compulsory for every individual. Besides this compulsory coverage, individuals may choose supplementary insurances. These insurances cover services that are deemed not essential medical care such as physiotherapy, dental care for adults or transport to the hospital. Individuals can only change their level of coverage once a year on January 1. Thus, it is impossible to change the premium rate or add supplementary insurances during the year. As such, in case of tumor detection and treatment in year t, individuals depend on the coverage they attained per January 1 of year t. If anticipation is likely, individuals should change their health insurance in year t compared to year t − 1. However, we do not find any differences between year t and year t − 1 with respect to changes in health insurance coverage (mean difference −0.0084; T value = −0.23); or changes from having no supplementary deductible to having one on top of the compulsory deductible (mean difference −0.0401; T value = −1.16).

Life-style

Life-style changes can also indicate anticipation (in year t − 1) or adaptation to the new situation (in year t). Using the rich set of life-style variables (information available to the authors upon request), we can easily check for significant changes in life-style between the control group and the treatment group. Again, we compute the mean difference between the case–control individuals. Most surprisingly, we do not find any short-run impact of tumor detection on changes in life-style. This includes no changes in: smoking (mean difference 0.0188; T value = 0.72); BMI (mean difference 0.1679; T value = 0.51); eating vegetables (mean difference −0.0001; T value = −0.01); eating fruit (mean difference −0.0154; T value = 1.48); eating whole-wheat (mean difference −0.0083; T value = −1.00); eating fish (mean difference 0.0044; T value = 0.21); or eating meat (mean difference 0.0058; T value = 0.56).

Can other life changing events bias the results?

It is possible that other life changing events capture the effects estimated in the interaction effects. However, this is highly unlikely. First, these other events should cause a serious shock, and for more than one individual at the same time (i.e. it should drive the estimated average). This would indicate that tumor detection and treatment are not driving the sign and significance of the estimates, whereas other life-changing events are highly correlated with tumor diagnosis. To the best of our knowledge, we cannot recall such an event from the literature. And second, these other events cannot be a trend that covariates, for example, with age, as trends are captured in time fixed effects modeling, and age is controlled for.

Can measurement error explain the results?

The literature indicates several possible types of measurement error. First, questionnaires are liable to recall bias, defined as the inability for respondents to correctly recall a past event at the time of the questionnaire. Bradley et al. [6] argue that (malignant) tumor diagnosis is a serious event that individuals do not forget. Second, questionnaires are also subject to issues with respect to misreporting. For example, an individual may tend to overstate or understate his/her ‘trueʼ feelings and emotions (see also [4, 8]. We deal with these issues of misreporting in the panel data model by controlling for reporting behavior intrinsic to the individual. To conclude, bias owing to misreporting issues on income is not an issue in LISS, as the panel is constructed by using population registries.

Conclusion

The identification of causal effects in cancer research is often hampered by data constraints. This paper explored the beneficial features of combining national representative health surveys with administrative data on income and work. Owing to the data, we could treat tumor detection as a random event, a discontinuity, in order to compare a case–control group before and after tumor detection and treatment by using fixed effects models.

The results indicate that the negative effects of tumor detection and treatment on self-assessed health, happiness, and anxiety are highest for men. We also observe significant effects on self-reported health for women, but not on happiness or anxiety. Preference drift [13], can partially explain these results, as women who have been diagnosed with a benign tumor are happier in the year of tumor detection than the year before. It may also be that women, irrespective of their income position, adapt their feelings better to the new situation than men (i.e. women have better coping strategies), or that their emotions are driven by the same underlying mechanism of tumor formation (e.g. hormones). Previous literature on coping strategies explicitly accounting for gender differences is scarce. However, for women with a history of breast cancer, other studies found that the elderly like to hold on to their autonomy [20], that young women show purpose-seeking behavior, especially in their work [22], and that religion plays an important role in coping [2]. The latter theory on tumor formation is a medical question that we did not explore in this paper. Both theories are subject to further research.

Tumor detection and treatment affects work. The largest effects are found when hospitalization follows tumor diagnosis. In contrast to men, and in line with Bradley et al. [6], women drop out of the labor force more frequently in response to tumor detection. However, this does not affect their personal or household income. This short-run financial security is most likely driven by the Dutch compulsory health insurance coverage. Nonetheless, in the long-run employment decisions can also be affected by the way chronic health conditions are covered. Individuals may receive a 10 % premium discount in collective health insurance arrangements. Changing employers or dropping out of the labor force can lower or increase this premium rate. Carlsen et al. [9] already indicated an increased risk of quitting the labor force and taking early retirement pension or social security benefits up to 8 years after the initial diagnosis is made. Finally, Moran et al. [21] put our positive short-run outcomes in perspective by arguing that the long-run outcomes are worse for men who relapse. This is an important topic for further research.

Chronic diseases such as cancer may not only threaten survivors’ well-being and labor market participation; it may also thoroughly challenge the public health care system as well as individuals’ out-of-pocket expenses. Yabroff et al. [32, 33] estimated the costs associated with cancer care in the US. The authors indicate highest total care costs of about €3.18 billion in the first year of treatment for elderly patients (aged 65 and older) diagnosed with colorectal cancer, while the average 5-year net costs ranged from $20,000 for breast cancer care to more than $40,000 for brain (nervous system) cancer care. Like most studies on the costs of cancer care in the US, the authors did not capture the costs due to non-labor market participation owing to physical and/or psychosocial dysfunction for older and for younger cancer patients still in the labor force. They also did not include those cancer patients who are not eligible for health insurance, but, instead, they only focused on beneficiaries of the Medicare program. From our results we argue that health insurance coverage drives individuals’ financial well-being and these public costs and out-of-pocket expenses should not be neglected in further research.

To conclude, we find evidence against applying matching estimation techniques, but in favor of panel data models. First, matching analysis violates the assumption on common support, while tumor detection can be treated as a random event [6]. Second, comparing the aforementioned panel data estimates with those of the matching analyses, we find that matching overall overestimates the ‘true’ effects of tumor detection and treatment on subjective well-being. The effects of matching models are generally larger in magnitude. With respect to the estimates of the models on financial well-being, the picture is somewhat blurred. Compared with the panel data results, the estimates of the matching models are underestimated for women, but overestimated for men.

Notwithstanding the huge advantages of using panel data, there are also limitations to be considered. First, in our data, we cannot distinguish between recurrent and first ever cancer cases. We do not see this as a cause for concern for the relatively younger cases in our sample, as the official Dutch cancer registry indicates very low (negligible) incidence rates among this age group. Therefore, cancer incidence among young individuals is likely to be first ever cases. For the older age group, we acknowledge potential heterogeneous effects of recurrence of the tumor compared with first ever cases, in particular on psychosocial well-being. However, we also argue that the set of individuals with recurrent cases in our sample is low, which is supported by the high comparability between LISS panel and the official Dutch statistics on cancer incidence. Anyway, work and income should not be affected by this among the older age group, as also indicated by the insignificant results.

Second, we have no information on the staging or type of the cancer when it gets detected. Further policy debate could explore how nationally representative longitudinal questionnaires on well-being, work, and income can be enriched with health-related hospital data (i.e. the creation of a dataset which is “the best of both worlds”).