1 Introduction and literature

Many individuals distinguish between money obtained from different sources, violating the assumption of fungibility of money.Footnote 1 This has important implications for various domains, for example, for the design and welfare effects of public policies. The observation that unearned income is treated differently than earned income is referred to as the house money effect. The common rationale is that windfall money evokes perceived property rights less strongly than earned money (see also Hoffman and Spitzer, 1985). As a result, in dictator games, subjects show less generosity when allocating earned income, both in the laboratory (Cherry, 2001; Cherry et al., 2002; Cherry & Shogren, 2008; Reinstein & Riener, 2012; Oxoby & Spraggon, 2008) and in the field (Carlsson et al., 2013).Footnote 2 Furthermore, Cárdenas et al. (2014) show that windfall money is spent more riskily. Houser and Xiao (2015) identify a house money effect in trust games. Transfers by investors and trustees are lower if they have to decide over earned money.

By contrast, in public goods games, corresponding evidence remains mixed, rather indicating no effect (Spraggon & Oxoby, 2009). Most studies find that contributions in public goods games are independent of endowment origin (Clark, 2002; Cherry et al., 2005; Antinyan et al., 2015; Bailey et al., 2022). Others indicate that contributions indeed depend on expended effort. Keeping subjects uninformed about the heterogeneity regarding the sources of endowment, Muehlbacher and Kirchler (2009) find that individuals who have to exert more effort contribute less. Conversely, Harrison (2007) re-analyses the data of Clark (2002) and reaches a different conclusion. He finds that individuals have a higher propensity to free-ride when playing with windfall money. On the intensive margin, there is no house money effect.

However, to date, the discussion has ignored a major factor that may be crucial for explaining the house money effect. The complexity of economic decisions requires analytical reasoning. Since humans vary in their cognitive abilities (Frederick, 2005), these reasoning skills are fundamental determinants of heterogeneous responses to economic problems. Dohmen et al. (2010) show that cognitive ability correlates with risk and time preferences. Furthermore, subjects appear to be less self-serving in dictator games when being under cognitive load (Schulz et al., 2014). By contrast, Chen et al. (2013) conclude that cognitive ability is positively correlated with generosity in dictator games. Recent research also suggests an effect of cognitive processes on co-operation. Interestingly, there exist both studies that find a positive (Clark, 1998; Lohse, 2016) and a negative link (Kanazawa & Fontaine, 2013; Nielsen et al., 2014) between cognitive skills and co-operation.

Regarding the house money effect, cognitive abilities appear to be equally relevant. Many researchers suggest that people apply strategies like choice bracketing and mental accounting to simplify economic decisions (Thaler, 1999; Read et al., 1999). Read et al. (1999, p. 187) argue that cognitive limitations are a key determinant for individuals to bracket narrowly, thereby facilitating decision-making. Empirically, cognitive ability decreases the use of heuristics (Frederick, 2005). Correspondingly, when spending income, individuals with low cognitive abilities are likely to be less capable to abstract from the source of income and to use the source for mental accounting. By contrast, subjects with higher cognitive capacity can be expected to be less reliant on applying heuristic simplification methods. Instead, they are more likely to keep track of the entire available budget when spending income. Thus, if the differentiation of income sources is the result of individuals having to simplify decision-making, this suggests that a subject’s cognitive capacity may also be associated with the extent to which she exhibits the house money effect. An experiment by Abeler and Marklein (2017) supports this prediction. They find that individuals with lower cognitive skills have a higher propensity to violate the assumption of the fungibility of money. Similarly, Antonides et al. (2011) find that higher education is negatively associated with mental budgeting.

In this paper, I examine whether individuals differ in the degree to which they exhibit the house money effect in a public goods game. I build on two documented facts: (1) unearned income generally appears to be donated or shared more easily (e.g., Cherry, 2001), and (2) violations of fungibility negatively correlate with cognitive skills (e.g., Abeler and Marklein, 2017).

On the basis of these observations, I hypothesise that in a public goods game, contributions by individuals with low cognitive skills should be smaller the higher the share of earned income. By contrast, I expect the origin of income to have less or no effect on contributions by individuals with high cognitive skills.

The results demonstrate that subjects’ contributions depend on an interplay of cognitive abilities and endowment origin. While a house money effect exists for subjects with lower cognitive ability, there is no such effect for those with high cognitive ability. The former contribute more when income was allocated to them and less when income was obtained by effort. Contrarily, the latter contribute the same amount independent of income type. This finding may enable policy-makers to encourage subsidised households to act more co-operatively and spend their money more efficiently (e.g., purchase goods with positive household externalities rather than purely private goods).

I proceed as follows. In Sect. 2, I describe the experimental design. Section 3 presents results as well as further robustness checks. I conclude and discuss potential implications of the results in Sect. 4.

2 Experimental design

The experimentFootnote 3 consisted of four parts: (1) a real-effort task, (2) a three-person one-shot linear public goods game, (3) the cognitive reflection test (CRT) developed by Frederick (2005), and (4) a questionnaire on demographic information. It was conducted in four sessions with first-year business administration students using paper and pencil at the Technical University of Munich, Germany.Footnote 4 After undergraduate Economics tutorials, students were invited to remain seated to participate in the experiment where they could win a monetary reward. Subsequently, participating students were distanced from each other in a large classroom to prevent contact during the experiment.

Although there is evidence that business and economics students are different from the rest of the population (Meier & Frey, 2004; Kirchgässner, 2005; McCannon & Peterson, 2015; Bauman & Rose, 2011), the sample is well suited for the analysis. First, respective studies find level differences in social preferences between students of different subjects. Since the main focus of this study is to detect an interaction effect, this would only affect the analysis if the overall level of contributions was too low or too high to find an interaction. Second, students within one discipline are less heterogeneous. Hence, an interaction effect in this sample can be considered a lower bound of the interaction. Cognitive ability can be assumed to vary substantially more in the entire population. As a consequence, the house money effect is also more prevalent and the interaction between cognitive ability and the house money effect (i.e., the interaction effect) more strongly pronounced when considering more heterogeneous populations.

2.1 Real-effort task

At the beginning of the experiment, the participants were randomly divided into three treatment groups.Footnote 5 Each group had to colour in a different number of circles out of a total of 150.Footnote 6 This task was used to simulate a cognitively non-demanding real-effort task. Hereby, depending on the treatment, a fraction of the 150 circles had already been filled in on behalf of the participants, sparing them a part of the effort. More precisely, the subjects had to colour in either 15 circles with 135 circles already being filled in (Low Effort), 75 circles with the other half being filled in (Medium EffortFootnote 7), or 135 circles with only 15 circles being filled in (High Effort). Hence, subjects in the Low Effort condition, for instance, only had to provide the effort of filling in 10% of the total 150 circles to earn the endowment. All participants were informed about these three different treatments.

In total, 161 students participated in the experiment. Seven participants did not finish the task and were excluded from the experiment. This resulted in 65, 38, and 51 subjects in the Low, Medium, and High Effort treatment, respectively.Footnote 8 For having coloured in all circles appropriately, all subjects received the same endowment of 100 tokens (10 tokens = 0.60 Euro). As the participants had to colour in different numbers of circles, this induced different proportions of earned and allocated income (i.e., either 10, 50, or 90% of the total income was earned by effort).

2.2 Public goods game

Following the task, individuals could decide on which proportion of their endowment to invest in a three-person one-shot linear public good with a marginal per capita return of 0.5. Therefore, the pay-off function \(\pi _i\) of player i with \(i\in \{1,2,3\}\) who contributes \(\theta _i\in \{0;100\}\) is given by

$$\begin{aligned} \pi _i&= (100-\theta _i)+0.5\cdot \sum \limits _{j=1}^3 \theta _j \end{aligned}$$
(1)

with \(\theta _j\in \{0;100\}\) being the contribution of player \(j\in \{1,2,3\}\). Importantly, subjects did not know the required effort levels of the other two players in their group. However, they knew that all combinations of the three treatments were possible.Footnote 9

2.3 Cognitive reflection test

Subsequently, the students had to perform the CRT to elicit cognitive ability. It contains three computationally easy questions that all have an intuitive, yet incorrect answer, and one correct answer that requires deliberation.Footnote 10 Despite the test’s brevity, it significantly correlates with results from more sophisticated tests such as the Wonderlic Personnel Test or the Wechsler Matrix Test (Frederick, 2005; Toplak et al., 2011, 2014). It is also popular and frequently used in economics experiments, (e.g., Haita-Falah, 2017), including public goods experiments (Nielsen et al., 2014; Lohse, 2016). The test is particularly suitable in a setting with two different types of endowment. It aims at separating types that exercise deliberation before answering the questions from those who follow their first intuition or cannot solve the problem. Participants are making use of an endowment from two different sources, so it might be reasonable at first glance to use these sources differently as well. However, further cognitive reflection should make individuals realise that the two income types are perfect substitutes. For that reason, in the analyses, I do not further distinguish between the type of incorrect answers.

It is feasible that both the real-effort task and the deliberation when choosing the contribution to the public good impact the performance in the CRT. Johnson et al. (2014) show that CRT scores decrease with cognitive load. However, a real-effort task was chosen that is cognitively non-demanding. Moreover, participants did not have to memorise anything after making the decision in the public goods game, the standard procedure to induce cognitive load. Finally, the research question dictates the real-effort task to precede the decision in the public goods game, and the CRT, since it can be considered a real-effort task, to follow it.

Finally, participants had to complete a questionnaire on demographic information.

2.4 Payment

After all experimental sessions had been completed, a total of 18 participantsFootnote 11 were randomly chosen using a random number generator and assigned into groups of three to receive the resulting pay-off. Nevertheless, the participants remained completely anonymous. The composition of groups was only announced using participation numbers. Also, final payments were subsequently done in private while ensuring that subjects could not see which participants were drawn or formed a group. Both was communicated at the beginning of each session. I also informed the participants prior to the experiment that the chances of being drawn were at least ten percent. Following Dohmen et al. (2010, p.1245) this ensures incentive compatibility although ultimately not everyone is being paid. On average, the selected students earned 7.68 Euro.

3 Results

On average, subjects contribute around 51% of their initial endowment. In the CRT, 40% answered all three questions correctly, followed by 32% with two, 16% with one, and 12% with zero correct answers.Footnote 12

3.1 Contributions by endowment origin

In line with the existing literature, contributions in the public goods game do not differ significantly across treatments, i.e., effort levels. Of their equally high endowment, subjects contribute 55% in the Low and 48% in the High Effort treatment (\(p=.210\), independent t testFootnote 13; \(p=.230\), Mann–Whitney U test, \(n=116\); Fig. 1).

Fig. 1
figure 1

Mean contributions by treatment group (sample size in parentheses)

3.2 Contributions by cognitive ability

For the analysis of contributions by cognitive ability, I divide the sample into individuals with high (40% of the sample) and low cognitive skills (60%). High CRT (Low CRT) individuals answered all three (two or less) questions correctly. Across treatments, contributions of individuals with a high CRT score are not significantly different to those of their low CRT counterparts, both contributing around 51% (\(p=.885\), t test with unequal variances; \(p =.826\), Mann–Whitney U test, \(n=154\); Fig. 2). This suggests that subjects with low and high cognitive ability do not differ in how much they contribute to a public good when the dominant strategy is to give nothing.

Fig. 2
figure 2

Mean contributions by CRT scores (sample size in parentheses)

3.3 Contributions by endowment origin and cognitive ability

A breakdown of contributions by treatment and CRT results confirms the hypothesis derived in Sect. 1. As depicted in Fig. 3, I find that subjects’ contributions depend on the interaction of their cognitive abilities and their endowment source. On the one hand, individuals with a low CRT score contribute 62.9% in the Low Effort treatment and 43.9% in the High Effort treatment. An independent t test shows that this difference is significant (\(p=.008\), t test; \(p=.016\), Mann–Whitney U test, \(n=63\)) and demonstrates that a low CRT score is associated with behaviour exhibiting the house money effect. On the other hand, contributions by individuals with a high CRT score do not differ significantly by endowment source, 48% in the Low versus 55% in the High Effort treatment (\(p=.496\), t test; \(p=.578\), Mann–Whitney U test, \(n=53\)). This suggests that subjects with higher cognitive ability are less likely to exhibit the house money effect.

The endowment source determines whether subjects with a low CRT score appear to be more or less co-operative than subjects with a high CRT score. In the Low Effort treatment, contributions by subjects with low CRT scores are 14.8 percentage points higher than contributions by subjects with high scores, 62.9 versus 48.1% (\(p=.044\), t test; \(p=.053\), Mann–Whitney U test, \(n=65\)). Thus, subjects with low CRT scores behave relatively more co-operatively when they had to exert less effort to obtain their endowment. Contrarily, in the High Effort treatment, their contributions are 10.8 percentage points lower than those of subjects with high scores (43.9 versus 54.7%). However, this difference is not statistically significant, possibly due to a too small sample size (\(p=.250\), t test; \(p=.295\), Mann–Whitney U test, \(n=51\)).Footnote 14 Hence, while being more co-operative than subjects with high CRT scores in the Low Effort treatment, subjects with lower CRT scores behave similarly to the high CRT scorers and possibly less co-operative when using earned income. An independent t test shows that this interaction of treatment and CRT scores is statistically significant (\(p=.029\)).

Fig. 3
figure 3

Mean contributions by CRT score and treatment group. Error bars indicate 95% confidence intervals (sample size in parentheses)

3.4 Multivariate analysis

The interaction effect is also robust to controlling for gender, age, and session effects. Similar to Harrison (2007), I resort to the Hurdle model (2) that accommodates the extensive and intensive margin of decisions in public goods games. As depicted by Fig. 4, a large fraction of subjects (11.0%) decides to contribute zero to the public good.

Fig. 4
figure 4

Histogram of contributions

The Hurdle model therefore has two components. First, it estimates whether a subject contributes anything at all (3). Second, it fits a linear outcome model (4). It can be characterised by the following set of relations (see Botelho et al., 2009, for a discussion of the Hurdle model)

$$\begin{aligned} y_{i}&= s_{i} h_{i}^{*}, \end{aligned}$$
(2)
$$\begin{aligned} \text { with }s_{i}&= \left\{ \begin{array}{l} 1 \quad \text {if }\textbf{x}_i\gamma + \epsilon _i > 0\\ 0 \quad \text {otherwise,} \end{array} \right. \end{aligned}$$
(3)
$$\begin{aligned} \text {and }h_i^*&= \textbf{x}_i \beta + \nu _i, \end{aligned}$$
(4)

where \(y_i\) is the estimated contribution. \(s_i\) is the selection variable which is 1 if a subject is estimated to contribute a positive amount and 0 otherwise. \(h_i^*\), the latent variable, is a subject’s expected contribution, conditional on the contribution being positive. \(\textbf{x}_i\) is the vector of explanatory variables. I include the same variables in both the selection and the outcome model. Thus, \(\gamma\) and \(\beta\) are the corresponding vectors of coefficients, and \(\epsilon _i\) and \(\nu _i\) are error terms.

Table 1 reports the estimation results using the dummy for CRT score. While none of the control variables has a significant effect on the contribution to the public good, the required effort to earn the original endowment significantly reduces contributions by subjects with low cognitive skills by \(-21.5\), given they decide to contribute. Due to the interaction effect, \(\text {High Effort} \times \text {High CRT}\) of 30.6, this effect vanishes for subjects with high CRT scores. Surprisingly, none of the variables has a significant effect on the extensive margin. I am grouping the intensive and extensive margin in Appendix B, obtaining a qualitatively identical result.

Table 1 Maximum-likelihood estimates of the Hurdle model of contributions using the binary treatment variable (High Effort) and the binary CRT variable High CRT

Computing the predicted values based on the coefficients of both stages of the Hurdle model confirms the interaction effect and the differences within treatments observed in the non-parametric analysis (see Table 2). Individuals with low CRT scores contribute 22.1 percentage points less in the High than in the Low Effort treatment (combined marginal effect of the variable High Effort for Low CRT subjects, \(p =.006\)). For subjects with high CRT scores, the difference is positive (they contribute more), but not significant (7.9, combined marginal effect of the variable High Effort for High CRT subjects, \(p =.443\)). The difference between individuals with low and high CRT scores is 16.8 (combined marginal effect of the variable High CRT, \(p =.023\)) in the Low Effort treatment and \(-\)13.1 (combined marginal effect of the variable High CRT, \(p =.144\)) in the High Effort treatment.

Table 2 Contributions as estimated by the Hurdle model

To conclude, subjects with low CRT scores exhibit the house money effect, giving more in the Low than in the High Effort condition. By contrast, contributions by subjects with high CRT scores are statistically indistinguishable. Moreover, subjects with low CRT scores give more than their high CRT counterparts in the Low Effort treatment, but less in the High Effort treatment. As these differences partly cancel out, it seems that cognitive ability does not affect co-operative behaviour when pooling treatments.Footnote 15

3.5 Discussion

The results raise the question whether studies that find no significant effect of effort in public goods games but do not account for cognitive ability should at least find suggestive evidence for a house money effect.

In fact, Antinyan et al. (2015) obtain a negative but insignificant effect of effort on contributions in their No Punishment treatment as well as in their Punishment treatment without controlling for an interaction of time and effort. Further, effort reduces the proportion of endowment contributed to the public good in Cherry et al. (2005). Finally, Clark (2002) finds suggestive but insignificant evidence for a house money effect in the initial round of a repeated public goods game. Similarly, with an experimental design that addresses issues with the design of Clark (2002), Bailey et al. (2022) find no significant house money effect, but slightly higher contributions by subjects who received the endowment on the day of the experiment (their house money treatment) versus 3 weeks prior to the experiment (which subjects mostly viewed to be their own money by the time the public goods game was played).

Hence, the interaction between cognitive ability and the house money effect in my experiment, combined with previous studies showing similar but insignificant effects in the same direction, suggests that their insignificance results may be the consequence of homogenous subject pools (university students in all mentioned studies). A house money effect might be more prevalent in a cross-section of society. Also, my experiment was not designed to investigate the persistence of a house money effect. However, the revealed interplay between cognitive skills and endowment origin asks for further experiments investigating the house money effect over time across distinct cognitive types.

Despite the evidence of a house money effect among subjects with lower cognitive skills and the broad existence of a house money effect in dictator games (Cherry, 2001; Cherry et al., 2002; Cherry & Shogren, 2008; Reinstein & Riener, 2012; Oxoby & Spraggon, 2008; Carlsson et al., 2013), it remains unclear whether the origin of the endowment indeed affects social preferences. Alternatively, unearned income or income that required little effort might just be more loosely handled without any pro-social motive. Identifying the exact psychological mechanism should be the purpose of future research.

4 Conclusion

Experimental studies have frequently used public goods games to shed light on the dynamics behind co-operative behaviour (see Zelmer, 2003, for an overview on public goods games). This is the first study to show that contributions to a public good depend on an interplay of income origin and cognitive skills. This result has implications for the interpretation of past and future experiments in which subjects are either given or earn their endowments. For instance, studies that find a relationship between factors such as cognitive ability and generosity or willingness to co-operateFootnote 16 may only hold true for the specific experimental design used, for example, if income is given by the experimenter, but the relationship may vanish or even be reversed if the design is changed, for example, if income has to be earned. Therefore, the generalisability of these findings should be considered with caution, and the specific experimental conditions under which the relationship was observed should be taken into account. In the experiment, subjects with a high CRT score behave more co-operatively in the High Effort treatment but less so in the Low Effort treatment. Thus, the external validity of respective results might particularly be questioned, considering populations that are more cognitively heterogeneous than students. In economics, the participant pool often consists solely of university students who are likely to be rather homogenous (Frederick, 2005) and to have above average cognitive abilities. Nevertheless, I am able to identify an interaction effect even within this group.

Furthermore, contributions in a public goods game can inter alia be regarded as subjects’ preferences for redistribution. Thus, the results are relevant for studying redistributive taxation and tax compliance (e.g., Bühren and Kundt, 2014). My results provide evidence that people with low cognitive skills may exhibit different spending patterns for earned income than for unearned income, which could have implications for the design of public policies that aim to support low-income individuals or reduce inequality. Assuming some degree of correlation between cognitive skills and income, the fact that non-earned income is spent more co-operatively can be considered as an argument in favour of existing and additional income subsidies, allowing these families to exploit positive externalities within families.

This study also shows that people with low cognitive skills are less willing to give away or invest earned income, suggesting that they may perceive it as more valuable for their own consumption. This could have implications for compensation practices and the allocation of performance-based and flat pay. Employers may need to consider how they structure pay and benefits packages to ensure that employees with different cognitive skills feel valued and motivated.

Finally, team production can also be compared to a public goods game (Alchian & Demsetz, 1972). Contributions by one team member benefit all other team members. For this reason, team production is equally exposed to a free-rider problem. Hence, a variation in how agents receive the resources they can use as production inputs (e.g., information) can alter team member’s propensity to co-operate.