Introduction

The field of positive psychology studies factors that contribute to a happy human life and explain why and when humans flourish (Kahneman et al., 1999). This research is highly relevant as happiness, which is sometimes also referred to as well-being or life satisfaction, is one of the most relevant goals that humans aspire to. However, the opposite of this positive position, unhappiness, deserves attention as well. While being unhappy cannot be avoided, and all human beings experience negative emotional states from time to time, this can become a serious problem when these episodes are long, resulting in persistent unhappiness (Sapolsky, 2007). Associated factors, such as mental health problems, states of depression, or even suicidal thoughts, are a large burden on the individual but also a challenge for society as a whole (Pincus & Pettit, 2001). To function well, a modern society requires engaged and active citizens that contribute their work and energy to various parts and aspects of society, for example, in the workforce; through relationships with relatives, friends, and other people; while participating in the democratic process; or by being interested in overcoming problems and challenges, such as economic downturns or climate change. Unhappy people, and especially clinically depressed ones, can lose their interest in these aspects and life altogether, which can influence a society negatively if too many individuals are affected by such mental states, not even counting costs for medical treatment (Thomas & Morris, 2003). Obtaining a better understanding of which factors are related to unhappiness and that predict or even cause it can be considered a significant goal in happiness studies, for this is the first step in combatting and reducing individual states of unhappiness.

The present study adds to this research field and takes a longitudinal perspective to understand which influences are associated with being unhappy more often or going through persistent unhappiness. While many studies only have information about a single point in time, longitudinal studies survey respondents regularly to paint a more complete picture of what their lives look like. This approach is highly relevant as it is able to investigate processes instead of states, easing the understanding of what is happening, sometimes even allowing for the deduction of causal inferences. Making robust and validated statements about causality in the social sciences is, however, a major challenge, and overstatements happen regularly when caution would be the wiser choice instead (Hernán, 2018). Given the scope and data available, the present study attempts to answer a more modest research question: What are relevant factors and variables that are correlated with cumulative unhappiness? In this context, cumulative refers to being unhappy for longer periods of time or experiencing states of unhappiness more often, even if not continuously. By drawing information from a large German longitudinal dataset, including a rich set of explanatory variables, relevant conclusions can be drawn for the adult population.

State of research

In general, unhappiness has attracted much less research attention than its counterpart (Haybron, 2008). However, it must be made clear from the beginning that research findings on happiness cannot be easily generalized and applied to unhappiness by assuming that the opposite of the conclusion holds true. To give an intuitive justification for this, nonlinear effects are highly relevant, meaning that saturation or ceiling effects can occur, and most trends in the social sciences are not linear by nature. A classic example is income: Giving a fixed amount of money to a poor person will increase happiness much more than giving the same amount to a rich person (Vendrik & Woltjer, 2007). Understanding unhappiness is clearly not the same as understanding happiness and, therefore, deserves a separate focus of attention. From a policy perspective, this can be understood through the following question: What must be done to increase the happiness of a certain group of individuals with average happiness (average in comparison to the entire country) to an above-average level? However, if the question is what must be done to increase the happiness of a group of individuals with below-average happiness to an average level, the answers (and policies) might differ vastly.

Numerous studies have been conducted to investigate correlates, predictors, or causes of unhappiness. The themes that are apparently most relevant are age (Bittmann, 2021; Blanchflower, 2020; Blanchflower & Oswald, 2009), income (Becchetti & Santoro, 2007; Saunders, 1996), income inequality (Becchetti et al., 2014, 2022; Oshio & Urakawa, 2014; Shields & Wailoo, 2002), being unemployed (Clark, 2006; Clark & Oswald, 1994; Gerlach & Stephan, 1996; Longhi et al., 2018; Winkelmann & Winkelmann, 1998), social status (Bhuiyan, 2018; Chen et al., 2021; Haller & Hadler, 2006), degree of urbanization (Okulicz-Kozaryn, 2022; Okulicz-Kozaryn & Mazelis, 2018), mental (Layard et al., 2013) and physical health (Graham et al., 2011), being overweight (Erickson et al., 2000; Sato, 2021), family and social relations (Haller & Hadler, 2006; Webb et al., 2017), marital status (Coombs, 1991), and personality traits (Costa & McCrae, 1980; Hayes & Joseph, 2003; Spangler & Palrecha, 2004).

However, studies that systematically compare various factors and attempt to rank them by their relative importance are much rarer. Often, only regression coefficients are reported with their standard errors, which are not adequate for estimating their relative influence. This is the case as often unstandardized regression coefficients are reported, and measurement and scales of variables can differ greatly. Another obstacle is that these studies often include multiple countries and attempt to test the influence of macro variables that only vary between countries (but not within a country), such as GDP or social inequality. To cite a few examples (where some sort of happiness is the dependent variable), one study using 17 European countries (N > 30,000) reported that the main model explained 22.5% of the total variance (adjusted R2), and the most influential explanatory variable was the satisfaction with the household income, according to the reported t-statistic (Doherty & Kelly, 2010). Another study with 34 countries (N > 54,000) computed an R2 value of 26.5%, and according to their standardized regression coefficients, the financial situation and the subjective health were the two single most relevant predictors (Haller & Hadler, 2006). A third study using a huge international dataset from 1975 to 1992 (N > 270,000) was able to explain only about 8% of happiness (Tella et al., 2003). Interestingly, despite using many macro indicators and some individual ones, in this study health was not regarded at all. It is striking that despite the differing samples and also the varying number of explanatory variables, the conclusions were rather similar, just as the explained variances. However, while the total number of cases was high, these analyses looked at cross-sectional data and focused more on country differences than on individual differences, which warrants criticism. Another study with a rather special population (elite athletes) concluded that psychological factors such as self-esteem or stress have the most power to explain various forms of happiness (Denny & Steiner, 2009).

In summary, while some forms of happiness have been of great interest in the past, there are some research gaps. First, regarding the outcome, only happiness has been considered and not the opposite, meaning unhappiness. This might seem like a trivial difference, yet I argue that looking specifically at unhappiness is also relevant because it is not certain that opposite conclusions of previous results always hold true, due to nonlinear and ceiling (or bottom) effects. Second, some studies have focused on a few key aspects yet did not attempt to compare many influences. If they did, they often did not rank these influences properly according to modern statistical approaches and instead relied on rather crude comparisons of regression coefficients or t-statistics. Third, they usually did not take the longitudinal perspective into account, which is a major shortcoming. While being unhappy at some points in time is normal, prolonged or recurring episodes are much more problematic and deserve more research attention. By putting cumulative unhappiness into focus and utilizing appropriate statistical approaches, the following empirical analyses go beyond what has been investigated so far.

Empirical analyses

Data and sample

The following analyses are based on the adult cohort of the National Educational Panel Study (NEPS), known as starting cohort 6 (SC6).Footnote 1 The NEPS is one of the most ambitious research projects in Germany for understanding in greater detail the role of education for the entire course of life (NEPS Network, 2022). NEPS SC6 was originally established with a pilot study in 2007/2008 and had its first regular survey wave in 2009/10. Until 2020/21, a total of 12 regular survey waves were conducted because respondents were interviewed annually. The NEPS provides a rich source of relevant information since all major areas of life are included in the survey program, such as (further) education, sociodemographic background, psychological measurements, satisfaction, friends, and even cognitive performance tests. Due to the long-running nature of the panel without recent refreshment samples, the SC6 population has aged continuously and is no longer perfectly representative of the overall German population.

For the following empirical analyses, the sample was slightly restricted to ensure a high internal validity. First, only NEPS surveys 4 to 12 were included (conducted from 2011/12 to 2019/20). The earlier waves were excluded because in wave 4 a major refreshment sample was drawn, meaning that for a large share of the resulting sample, no prior information on many key variables was available (as no retrospective information was gathered on variables like life satisfaction). Also, NEPS wave 13 was excluded, as this survey was conducted after the onset of the COVID-19 pandemic, which must be seen as having been highly impactful for life satisfaction in general. To avoid disturbances due to these very special circumstances, this wave was not included. The second restriction is that only individuals that participated in at least six out of the nine conducted waves were retained in the sample. This restriction was imposed in order to avoid the imputation of information for participants with only little data available, which could create even more bias or variance. Other missing information (item-nonresponse) was imputed, which is explained in more detail below. The final sample size after enforcing the restrictions was 8,646.

Strategy of analysis

As described, the NEPS provides panel data for a rich set of variables over a long period of time, which is highly beneficial for the intended analyses. Yet it must be made transparent that even this high-quality survey comes with limitations. While making statements about causality can be considered as the holy grail of empirical social research, I believe that it is not feasible to make these statements with the data available and the posed research question. An example can help to explain these limitations: If the research question is intended to estimate the causal impact of marriage on life satisfaction, this can probably be achieved quite well with the NEPS data as the “treatment” (marriage) can be considered a single event in time. After considering anticipation effects, some analyses, such as fixed-effect regression models, are probably able to approximate causality quite well. However, this requires that enough time is observed before and after the event and that no unobserved confounders influence the process. As the presented research question outlines, the goal of the current study is to investigate the influence of many variables and test them against each other. Some are not “events” but long-lasting states, where it is rather unclear when they have even formed. Personality traits, as measured using the Big Five inventory, are a good example. However, not only psychological but also sociological measurements are affected, such as social networks or social status, which develop over long periods of time. Also, participants were rather old at the time of their first survey (older than 25 years in 2011/12), meaning that a long and relevant time of their life has not been observed in detail. A second major problem is that variables were measured at quite different time points in the NEPS study, which means that using them all together in the same analysis and differentiating causal effects is highly problematic. Therefore, all of the following analyses should be viewed more as correlations or predictors and less as causal influences. The findings can still be highly relevant, especially for guiding further studies that focus on the causal impact of a few key variables and processes.

Next, the concrete analytical strategy is outlined. As it is the goal of the study to measure how cumulative unhappiness can be explained, the panel nature of the study is of the greatest relevance. It would be ideal to know precisely how long an individual is in a state of unhappiness, measured in days or weeks. However, this is highly difficult to measure in most quantitative surveys, and only one measurement of overall life satisfaction is available per individual and year in the SC6 data. The method that was chosen for the present study was to count the number of survey waves in which an individual reported below-average life satisfaction, which is considered a state of unhappiness. To do so, the average satisfaction (arithmetic mean) of the sample was computed for each survey wave, along with the standard deviation. Any individual that reported a life satisfaction that was at least one standard deviation below the mean value of the survey wave was counted as being in a state of unhappiness at that point in time. Afterwards, these values were summed for each participant, which provided the total number of waves of reporting below-average life satisfaction. The outcome variable is hence a count variable with values between 0 and 9. To compute how this dependent variable is related to all independent variables, a Poisson regression model was selected (For testing statistical robustness, analyses were repeated using a negative binomial regression model.). For each independent variable, two results were computed: first, the bivariate result, meaning that no other variables were included in the model to report the “raw” effect. Afterwards, one large multivariate model was estimated, meaning that the effect of the independent variables should be seen as “under control” of all other variables in the model. Doing this makes it possible to account for the interrelation of explanatory variables.

Finally, a dominance analysis was conducted. This well-established approach is useful for estimating how much additional variance a variable (or set of variables) explains in the dependent variable (Azen & Budescu, 2003; Budescu, 1993). The naive approach of conducting a series of nested regression models and adding explanatory variables in a stepwise fashion is not appropriate as soon as explanatory variables are correlated, which is usually the case. By conducting a large number of regression models where all potential combinations of explanatory variables are tested exhaustively, the additional explanatory power can be quantified. While this approach is computationally intensive, it is well suited to estimating the overall impact a variable has on the outcome.

All analyses were computed in Stata 16.1; the dominance analyses were computed using the package domin (Luchman, 2015). To account for item-nonresponse, data were imputed using multiple imputation by chained equations (MICE). Information on the amount of imputed data is provided in Table 1 (see chapter 4). A total of 30 imputed datasets were generated and their quality assessed by established standards (e.g., no generation of impossible values, statistical convergence). For most variables, predictive mean matching was selected as an imputation algorithm. Average marginal effects for the regression models were computed using the package mimrgns (Klein, 2014).

Table 1 Descriptive statistics

Operationalization

As the number of explanatory variables is high, they have been grouped together thematically for a more convenient overview. Note that many variables that are continuous by nature were recoded into categories to account for potential nonlinear effects. Another peculiar aspect is the handling of variables that are potentially time varying. The final analysis is cross-sectional by nature so that all of these variables have to be reduced to one single state. Depending on the scaling of the variable and how often it was measured, distinct solutions have been chosen. If the variable is nominal, such as region of residence, the mode over all survey waves has been selected. Others, which can fluctuate more often, such as income, were accounted for by taking the median over all waves to give the robust average. If some variables were measured only twice with larger gaps, the earlier response was counted to reduce the potential influence of reverse causality.

Based on previous research findings, key areas of interest were selected to explain and predict cumulative unhappiness (the independent variables). The first of these groups are sociodemographic-background variables, such as gender, age, or place of residence, and social origin, such as income, social status, or education level. As the state of research outlined above has shown, these factors are well-established correlates of unhappiness. While low incomes restrict what a person can buy and access in a society, low levels of social status are also associated with low happiness (Anderson et al., 2012). Education can provide the means to deal with unhappiness and find solutions to overcome problems and obstacles. The third group is unemployment, which has been identified as one of the most relevant factors for unhappiness, as not being able to provide for oneself and one’s own family is problematic; furthermore, unemployment is associated with a loss of purpose in society. The fourth group is social capital, which refers to the extent to which social networks are available to deal mentally and materially with problems and states of unhappiness. The final groups are personality traits, which are mostly stable and can be seen as the mental dispositions of a person, and health measures, which were one of the strongest predictors in the previously cited references on unhappiness.

Cumulative unhappiness

In each survey wave, respondents were presented with the following: “I would like to begin by asking you a few questions about how satisfied you are with various aspects of your life. Please answer using a scale of 0 to 10. ‘0’ means that you are totally and utterly dissatisfied; ‘10’ means that you are entirely satisfied. […] How satisfied are you currently with your life in general?” For the analysis of the given research question and overall life satisfaction, the NEPS recommends using this single variable and not generating a compound score, even if sub-dimensions of happiness were also measured. This scale has 11 distinct levels in total. As was already described above, the average happiness and standard deviation was computed for each survey wave. If a respondent reported a response that was at least one standard deviation below the wave-mean, she was counted as being unhappy in that wave. The total number of times a respondent was recorded at below-average happiness was used to measure his or her cumulative unhappiness. It should also be highlighted that cumulative unhappiness, as used in this study, is not the same as average unhappiness, which could be operationalized as averaging the values over all waves. By looking at each wave separately and comparing each value to the mean-value, more information can be extracted out of the data.

Sociodemographic background variables

The gender of the respondents was measured as binary, either male or female. The age was computed in 2015 (wave 8) and grouped as follows: 29 to 39 years, 40 to 50 years, 51 to 61 years, 62 to 71 years. This also shows that all respondents were between 29 and 71 years old at that point in time. If the person was born abroad, or at least one of their parents was born abroad, this was counted as having a migration background, otherwise not. If the person lived predominantly in East Germany, this was counted as living in the East, otherwise not.

Social origin

The precise measurement of social origin is of the greatest interest as this construct can involve aspects such as education, the financial situation of the household, or the social status in society. First, the highest educational qualification was measured with five levels on the CASMIN scale: no degree or lower degree (Hauptschulabschluss) / intermediate degree (Realschulabschluss) / higher education eligibility (Abitur) / university of applied science degree (UAS, Fachhochschulabschluss) / any university degree. Second, the total post-tax monthly household income was taken and the median formed over all available survey waves. Then, the equivalent household income was computed by accounting for the number of other adults and children living in the same household. The categories are €0 to €1,000 / €1,001 to €1,500 / €1,501 to €2,000 / €2,001 to €3,000 / more than €3,001. Third, the social status was measured using the International Socio-Economic Index of Occupational Status (ISEI), which is based on the occupation held. Since occupations can change over time, the NEPS provides retrospective information for all jobs. I considered the time between 2008 and 2018 and generated the average ISEI for all occupations held in this time span, weighted by the duration of a specific occupation. If no occupation or ISEI was available for this time (which can be possible for partners who stay at home), the ISEI of the working partner was used, if available. If no information could be found, the data was imputed. The ISEI, which can go from 16 to 90, is also categorized as follows: 16 to 25 / 25 to 45 / 45 to 65 / 65 to 90.Footnote 2 Fourth and finally, the highest ISEI of the father was also included, which was measured with the same categories. When taken together, these variables give a good impression of various sub-dimensions of social origin.

Unemployment

The cumulative time unemployed was measured between 2008 and 2018. To do so, all reported episodes of unemployment within this time span were added together. If the unemployment episode started and ended within the same month, 15 days were assumed since no daily information was available. The resulting variable was grouped into the following categories, measured in months: 0 / 1 to 12 months / 12 to 24 months / 24 to 48 months / more than 48 months.

Social capital

The social capital of an individual was measured with multiple variables. First, the marital status was taken from wave 4 of the NEPS (2011/12): being single / being married or in a civil union / being divorced / being widowed. Clearly, this measurement is not precise as marital status can change over time, yet it is not feasible to account for all variations in a cross-sectional analysis. An early point in the survey (wave 4) was chosen to reduce the problem of reverse causality. Second, the total number of individuals in the household was measured as the maximum number of individuals ever living in the household over the time surveyed. The categories are 1 / 2 / 3 to 4 / 5 or more. Third, the number of children was measured as the maximum number of children below 18 ever living in the household with the following categories: 0 / 1 to 2 / 3 or more. Fourth, the social capital was measured with the “position generator” (Schulz et al., 2017). This instrument measures whether the respondent knows at least one person with a specific occupation, for example, a lawyer, a teacher, a translator, and so on. The more individuals with specific occupations the respondent knows, the higher his or her social capital. As each occupation can be associated with an ISEI value, the average total social capital can be measured that way. This variable is mean-centered and standardized (z-standardization). Higher values indicate an overall higher social capital.

Personality measurements

The established Big Five inventory was surveyed in wave 5 of the NEPS (2012/13) and contains the dimensions extraversion, agreeableness, conscientiousness, neuroticism, and openness. The constructs were measured using the German 10-item short version (Rammstedt & John, 2007) in wave 5 of the survey.

Health measurements

First, self-rated health was measured in each wave of the NEPS. To take the average health, the median over all available survey waves was formed. The wording is as follows: “I now have a brief question about your health. How would you generally describe your state of health?” The responses were measured with five levels from 1 (very poor) to 5 (very good). This item is useful in order to have information on subjectively perceived health. Second, for a more objective measurement, physical and mental health were measured using the 12-item Short Form Survey, which uses 12 items to record overall physical and mental health by asking questions such as, “During the past week, have you had any of the following problems with your work or other regular daily activities as a result of your physical health?” (Andersen et al., 2007). These items were surveyed in wave 4. Third, personal self-worth was measured following the conceptualization of Rosenberg in wave 6 (2013/14) (von Collani & Herzberg, 2003). It ranges from 14 to 50, where higher values stand for higher self-worth. Fifth, the body mass index (BMI) was measured by taking the median weight and height over all available survey waves. The result was classified according to the WHO standard from 2008 as follows: Up to 25 (Normal weight or underweightFootnote 3) / 25 to 30 (Overweight) / 30 to 35 (Obesity I) / more than 35 (Obesity II/III). Sixth and finally, how often the respondent consumed alcohol was measured (“How often do you consume alcoholic drinks, e.g., a glass of wine, beer, a mixed drink, spirits or liqueur?”); this was measured in wave 4 (or wave 7, if no information was available from wave 4) with the following scale: never / once per month / 2–4 × per month / 2–3 × per week / more than 4 × per week.

Results

Descriptive findings

The overall average satisfaction was highly stable and always between 7.5 and 7.7 (with standard deviations between 1.4 and 1.6) for all survey waves. This means that aggregated satisfaction is rather constant over time, which of course does not mean that no individual satisfaction is constant as well. The total number of survey waves with below-average satisfaction is visualized in Fig. 1.

Fig. 1
figure 1

Total number of waves with below-average happiness per respondent. Source: NEPS SC6, imputed data (M = 30). Higher numbers indicate a larger cumulative unhappiness

As it turns out, the majority (about 53%) of all respondents never reported any survey with below-average values, yet a large minority of about 47% did. Slightly more than 11% of all respondents reported below-average happiness for at least five survey waves, which is more than half of the entire survey time of up to nine waves. Descriptive results for all variables are reported in Table 1.

Regression analysis

In this section, the results of the Poisson regression models are reported. First, bivariate associations are reported, followed by the multivariate results (see Table 2). For a more convenient interpretation, average marginal effects (AMEs) are reported. As an example, consideration can be given to the effect of gender (Since reverse causality can be excluded for this variable, referring to an effect is probably adequate). The AME for women is 0.231 and statistically highly significant in the bivariate model. This means that women report, on average, 0.23 more waves with below-average happiness than men and are, apparently, unhappy more often since the value is larger than zero. However, this association between gender and unhappiness vanishes as soon as other variables are considered in the same model, as the computed AME changes to -0.048 in the multivariate model. This result is no longer statistically significant, and even the sign changes. This means that as soon as all other variables are considered, gender is no longer associated with unhappiness, and this association from the bivariate model is, therefore, spurious. While the bivariate results can be relevant for prediction or for samples where little information is known or only few explanatory variables are available, the multivariate results are better able to understand what is actually relevant. Some variables are associated with a higher number of reported surveys with below-average happiness, such as being older or drinking more alcohol. For others, this association is negative, such as for having a higher education, earning more money, or reporting better health. These people reported being unhappy less often, on average.

Table 2 Average marginal effects for bi- and multivariate associations with cumulative unhappiness

Dominance analysis

While a regression analysis is helpful to understand how variables are related to the outcome (positively or negatively) and if these associations are statistically significant, they are hardly able to determine the relative importance for prediction and explanation. Even if some variables are statistically highly significant, this does not automatically mean that much of the overall variance in the outcome can be explained. This is further complicated due to the fact that explanatory variables are often measured on vastly different scales and are usually correlated with each other. For example, in the results presented above, both the household income and the number of adults in the household have statistically significant relationships with the outcome, yet it is not feasible to state which of the two has more influence based on the AMEs alone. Answering such a question is, however, highly relevant in order to understand which factors are most strongly associated with unhappiness, which can be useful to guide further research or even design interventions. To solve this problem, dominance analysis can be used as it quantifies the additional explanatory power of a variable or a set of variables under control of all other variables in the model. Since the number of independent variables is rather large, it is not feasible to compute this additional part for each one, since the number of required regression models grows exponentially and is, hence, computationally not viable. To overcome this obstacle, explanatory variables are included as thematically grouped sets. Results are presented in Table 2. As the regressions are Poisson models, pseudo-R2 values are reported.

All explanatory variables together explain almost 26% of the overall variance of the dependent variable. Given that human life is highly complex and contingent, this can be regarded as a rather satisfactory result, which is also close to what previous studies have been reporting. When this total variance is decomposed, it becomes clear that health factors are, by far, the most relevant influence with a share of almost 15%. In second place are social status factors, which account for about 4%. Social capital factors come in third with about 3%, and personality traits are fourth, with about 2%. What is obvious from this analysis is that health factors are, with a large margin, the most relevant factors correlated with cumulative unhappiness. Since the results outline that health factors are, by far, the most relevant correlate of unhappiness, these variables have been decomposed in an additional analysis to see how their share (14.81%) is divided. It turns out that self-rated health has the biggest impact (4.4%), followed by the mental health score (3.3%) and self-worth (2.6%). The influence of the other factors is below 1% each. It’s important to note that this extra computation is also under control of all other variables as included before, yet the total share of all health variables does not add up to 14.81 as the two analyses are not directly comparable. However, it is still useful to gauge approximately which health variable is the most relevant.

Discussion

As the results outline, there are some factors that are rather strongly related to cumulative unhappiness. Of greatest relevance is the concept of statistical control that has been applied in the multivariate analyses, as this makes it possible to come to the “core” of the associations and avoid incorrect conclusions due to spurious correlations. By comparing bi- and multivariate coefficients, it is obvious that some factors that initially show significant correlations completely lose them as soon as other variables are also included. This is crucial for approximating causal influences. While this has not been done in the present analyses, it can be helpful for future studies that focus on distinct aspects or key variables. When discussing correlates, especially health factors must be highlighted, which explain the most variation in the outcome. This is in line with previous research findings that come to the same conclusion. However, for the first time, various influences have been ranked systematically. The findings underline that health is, with a wide margin, the single most important influence, as the health variables taken together explain more than three times the variance that all influences of social status, including the financial situation of the household, do. While these results are to be seen as more correlational than causal, the current analyses do not allow for seeing them as a causal factor. The question remains whether this is a causal explanation or whether reverse causality might be present, meaning that individuals with lower happiness become unhealthier. In any case, as the results underline, the nexus of unhappiness and health deserves further attention to disentangle potential causal mechanisms. This is a potential next step for further research projects. By having correlates of unhappiness ranked by their relative importance, the present study helps researchers to pick some areas that deserve special attention in order to explain the genesis of cumulative unhappiness.

To conclude the discussion, the limitations of the analyses must be made transparent. First, only a single country, Germany, is available, which restricts the external validity of the findings. While I believe that the conclusions likely also hold true for comparable (western and industrialized) countries, they might be different for other cultures with a different standard of living. This restriction also means that no macro indicators could be examined. It should also be made transparent that due to the nature of the used panel data, the data available are not perfectly representative of the overall German (adult) population due to panel effects, such as selective dropout over time. Second, as only about one fourth of the total variance of cumulative unhappiness can be explained, the majority of the influences remain underexposed. However, as previous studies also reported similar values, this means that either happiness is a rather volatile or difficult-to-explain construct or that main explanatory factors were not yet identified. Third, many constructs used in the study are subject to measurement error. Clearly, measuring “happiness” or “unhappiness” is a difficult task, and experts might not agree on which items or scales are ideal. The measurement in the NEPS uses a rather widespread approach also used in other large surveys, such as the World Values Survey or the European Social Survey. Especially when subjective variables are measured, measurement error is likely always present and might add some bias to all results, which is, of course, to some extent, unavoidable. Items and scales have been presented and references given for any reader that is interested in reviewing the validity and reliability of the results.

Conclusion

Identifying which factors are correlated with unhappiness is a first step in the process of understanding causal influences. However, instead of a current state of unhappiness, longer lasting episodes or reporting being unhappy more often over time were studied to gain robustness and look for processes that are temporally stable. By using an established German panel study, a large number of potential influences were identified and ranked by their relative importance. It is clear that health is, by far, the single most relevant predictor of cumulative unhappiness, followed by social status and social capital. Overall, the findings are in line with previous findings, yet they present more qualitative data to estimate the strength of associations in more detail. Based on these findings, further research projects might want to establish causal mechanisms and relationships with unhappiness.