To disentangle the magnitude effect into two channels, we perform parametric estimations both at the aggregate level and at the individual level. We then test if the preference parameters change with the size of the total budget.
Aggregate-level estimation
Estimation strategy
In our main specification, we assume a time separable utility function with power atemporal utility functions as in Eq. (1). We set \(\omega\) (background consumption) equal to the average response to the question about one’s typical daily expenditure, €7.89, as Andreoni and Sprenger (2012) did in two of their specifications.Footnote 15
Given the intertemporal utility function, solving the optimization problem yields the tangency condition
$$\frac{{z_{t} + \omega }}{{z_{t + \tau } + \omega }} = \left\{ {\begin{array}{*{20}c} {\left( {\beta \delta^{\tau } R} \right)^{{\frac{1}{\alpha - 1}}} ,} & {{\text{if }}t = 0} \\ {\left( {\delta^{\tau } R} \right)^{{\frac{1}{\alpha - 1}}} ,} & {{\text{if }}t > 0} \\ \end{array} } \right..$$
Taking logs gives a linear equation
$$\ln \left( {\frac{{z_{t} + \omega }}{{z_{t + \tau } + \omega }}} \right) = \left( {\frac{\ln \beta }{{\alpha - 1}}} \right) \cdot 1_{t = 0} + \left( {\frac{{\ln \delta^{\tau } }}{\alpha - 1}} \right) + \left( {\frac{1}{\alpha - 1}} \right) \cdot \ln R$$
where \(1_{t = 0}\) is the indicator for the Present Group.
The parameters to be estimated are the present bias parameter, \(\beta\), the discount factor, \(\delta\), and the power curvature parameter, \(\alpha\). The present bias parameter is identified by the differences in allocation between the Present Group and the Delayed Group. If there is a present bias, subjects in the Present Group will allocate more tokens to the sooner date than those in the Delayed Group. The discount factor is identified by one’s average choice across different experimental interest rates. A more patient subject will allocate more tokens to the later date in all decisions. The curvature parameter is identified by the dispersion of one’s choices across interest rates. Those who consider rewards highly substitutable over time are likely to make corner choices in all decisions, while those with lower elasticity of intertemporal substitution will make choices closer to equal splits.
Following the practice in previous studies (Andreoni & Sprenger, 2012, and Augenblick et al., 2015), we assume a normally distributed error term additive to the log-consumption ratio and consider censoring, to yield the two-limit Tobit model:
$$\begin{aligned} l_{i,j,k}^{*} & \equiv \ln \left( {\frac{{z_{t;i,j,k}^{*} + \omega }}{{z_{t + \tau ;i,j,k}^{*} + \omega }}} \right) \\ & = \left( {\frac{\ln \beta }{{\alpha - 1}}} \right) \cdot 1_{t = 0;i} + \left( {\frac{{\ln \delta^{\tau } }}{\alpha - 1}} \right) + \left( {\frac{1}{\alpha - 1}} \right)\ln R_{j} + \varepsilon_{i,j,k} , \varepsilon_{i,j,k} \;\sim \;N\left( {0,\sigma_{k} } \right) \\ \end{aligned}$$
$$l_{i,j,k} = \left\{ {\begin{array}{*{20}l} {\ln \frac{\omega }{{m_{k} + \omega }},} \hfill & {{\text{if}}\quad l_{i,j,k}^{*} \le \ln \frac{\omega }{{m_{k} + \omega }}} \hfill \\ {l_{i,j,k}^{*} ,} \hfill & {{\text{if}}\quad \ln \frac{\omega }{{m_{k} + \omega }} < l_{i,j,k}^{*} < \ln \frac{{\frac{{m_{k} }}{{R_{j} }} + \omega }}{\omega }} \hfill \\ {\ln \frac{{\frac{{m_{k} }}{{R_{j} }} + \omega }}{\omega },} \hfill & {{\text{if}}\quad l_{i,j,k}^{*} \ge \ln \frac{{\frac{{m_{k} }}{{R_{j} }} + \omega }}{\omega }} \hfill \\ \end{array} } \right.$$
where \(i = 1, \ldots ,203\) denotes Subject \(i\), \(j = 1, \ldots ,7\) denotes Interest rate \(j\), and k=1,...,5 denotes Total budget k. The error term is allowed to vary across total budgets since giving a larger number of tokens might induce a larger noise, which might be a competing explanation of a larger sensitivity to the interest rate.
The Tobit model can predict corner choices with a natural interpretation. When the background consumption \(\omega\) is positive, the marginal utility at a zero reward is finite. If the implied interest rate is much higher than the discount rate, the model predicts a latent choice with a negative budget on the sooner date. The individual would be willing to give up part of her background wealth on the sooner date to earn a larger amount on the later date. But in the experiment, she is not allowed to do that. This is naturally captured by the Tobit model by censoring at the later corner. The opposite case occurs when the implied interest rate is much lower than the discount rate. Then the individual would be willing to borrow money from the experimenter if she could, but her choice is censored at the sooner corner.Footnote 16
The model is estimated by the quasi-maximum-likelihood method: when performing the estimation, the error term, \(\varepsilon\), is assumed to be i.i.d., while in computing the standard errors, the error term is assumed to be independent across subjects, but might be correlated within-subject. Estimates of the parameters can be recovered and standard errors can be inferred by the delta method.
Since we are interested in the magnitude effect, we also perform the estimation with interaction terms of the parameters and the budget dummies. Thus, tests can be performed on the differences between the parameters for different total budgets.
To see why the measure of the intertemporal substitutability is robust against misspecification, notice that \(\alpha\) is identified from the sensitivity of the log-ratio of the consumption (i.e., \(\ln \left( {\frac{{z_{t;i,j,k}^{*} + \omega }}{{z_{t + \tau ;i,j,k}^{*} + \omega }}} \right)\)) to the logarithm of the gross interest rate (i.e., \(\ln R_{j}\)). This is approximately the sensitivity of the percentage change in the ratio of consumption to the percentage change in the relative price, which is exactly the definition of the elasticity of intertemporal substitution. Therefore, even if the utility function is misspecified, \(\alpha\) is still a measure of intertemporal substitutability.
In Online Appendix D, we assume a more flexible specification, in which the utility function has one additional free parameter. There the background consumption, \(\omega\), is also a parameter to be estimated. In this way, we address the concern that the average self-reported background consumption may not match the true background consumption integrated with the experimental rewards in decision making, or the elasticity of intertemporal substitution of the utility function may not be constant (i.e., the power utility function with a fixed consumption shift is misspecified). The results are basically the same.
Results
Table 4 reports the magnitude-invariant estimates and the magnitude-specific estimates of the parameters, respectively. A salient feature is that none of the estimates of \(\beta\) is significantly different from 1, implying no evidence of present bias, which is consistent with our finding in the model-free analysis. The annual discount rate for all the budgets together is 52.7%, which is in the range found by previous studies. The curvature parameters are always significantly smaller than 1, implying that the subjects on average consider the monetary rewards received on different dates imperfectly substitutable, which is also consistent with other studies (e.g. Andreoni & Sprenger, 2012; Andreoni et al., 2015; Augenblick et al., 2015; Cheung, 2020).
Table 4 Discounting and curvature parameter estimates in the aggregate-level estimation Most importantly, both the discount factor and the curvature parameter are increasing in the stake. To judge if these magnitude effects are significant: Table 5 presents Wald tests over the differences of parameters between total budgets. We find significant magnitude effects both on the discount factor, \(\delta\), and on the exponent parameter, \(\alpha\), which is a positive transformation of the elasticity of intertemporal substitution. The discount factor is increasing in the total budget, meaning that the decision weights on later rewards shift upward when the total budget increases. The elasticity of intertemporal substitution is increasing in the total budget, meaning that the rewards on the two dates are more substitutable to the subjects when a larger total budget is provided. This results in choices closer to the two corners (to which corner depends on whether \(\delta R > 1\)). Thereby, we verify Hypothesis 3 and Hypothesis 4.
Table 5 Estimates of parameter differences between total budgets in the aggregate-level estimation To get an idea about the size of the magnitude effects, we compare the discount rates and the predicted monetary discount rates, respectively, between stakes. The continuous annual discount rate at stakes of €20, €40, €60, €80, and €160 are 0.696, 0.519, 0.384, 0.370, and 0.237, respectively. Thus, the discount rate is 1.3 to 1.6 times larger when the stake is halved. This is larger than the effects found by Andersen et al. (2013). Their discount rate at a stake of 1,500 Danish kroner is only 1.0 to 1.1 times larger than that at a stake of 3,000 Danish kroner. As we mentioned earlier, the difference is consistent with the pattern that the magnitude effect becomes smaller at higher stakes. To incorporate the magnitude effect on the utility curvature, we predict the monetary discount rates at the lowest and the highest stakes. Take the monetary discount rate at the stake of €20 as an example, it is the continuous annual discount rate implied by an indifference relation between a later reward of €20 and an equally good sooner reward, assuming linear utility. We find the monetary discount rate at €20 is 1.058 and that at €160 is 0.257. The former is 4.1 times larger than the latter. Therefore, the two channels of the magnitude effect between the stakes of €20 and €160 are both large. Halevy (2015), for instance, finds that the monetary discount rates measured with a budget of $10 are 1.4 to 1.9 times larger than those with a budget of $100.
To illustrate the relative importance of the two channels of the magnitude effect, we use the estimates above to predict choices in the 35 questions for both the Present Group and the Delayed Group. Table 6 presents the marginal effects of allowing one parameter to vary with the total budget: in each row, we allow only one parameter, either \(\delta\) or \(\alpha\), to vary with the total budget of the decisions (as indicated by the column title), but fix the other two parameters at the value estimated from the budget of €20. Each number in a cell is the total change (in units of \(\frac{{N}_{k}}{100}\), the percentage of the total budget) in the seven decisions with the corresponding total budget. The results show that the marginal effect of allowing \(\alpha\) to vary with the total budget is at least as large as the marginal effect of allowing \(\delta\) to vary. This suggests that the magnitude effect on the elasticity of intertemporal substitution is at least as important as the magnitude effect on the discount rate.
Table 6 Marginal effects of allowing a parameter to vary with total budgets in the aggregate-level estimation Individual-level estimation
The aggregate-level estimation provides evidence of positive magnitude effects on the discount factor and intertemporal substitutability. One may wonder whether these results are purely a compositional effect: a bias resulting from forcing all subjects to have the same preferences and the same distribution of noise. To deal with this concern, we also perform individual-level estimation and tests.
Estimation and testing procedure
We keep all the assumptions that underlie Eq. (1) except for \(\beta\) since it is not identified in individual-level estimations. We estimate the discount factor (\(\delta\)) and the intertemporal substitutability (\(\alpha\)) for each combination of subject and stake, and then test if the two parameters are increasing in the magnitude within-subject.
One important difference from the aggregate-level estimation is that under-identification occurs when a subject made no or only one interior choice at one stake. There are 627 out of 1015 (62%) subject-stake combinations for which this is the case. We thereby adopt a conservative way to test the magnitude effect. First, we obtain point estimates of \(\delta\) and \(\alpha\) if possible by estimating the Tobit model specified in Sect. 5.1. Whenever there is an under-identification problem, we remove the error term from the model and then infer the intervals of \(\delta\) and \(\alpha\) that can generate the observations. Second, we perform a one-sided sign test on the two parameters, respectively, with the null hypothesis that they do not change with the magnitude. The sign test is flexible in that it does not impose assumptions on the distributions of idiosyncratic shocks to the parameters. For a comparison between a point estimate and an interval estimate, we recognize a difference only if the point is not in the interior of the interval. For a comparison between two interval estimates, we recognize a difference if the two intervals do not overlap.Footnote 17
We also categorize the subjects based on how their preference parameters change with the stake. For each parameter, a subject presents values for five stakes, allowing for 10 pairwise comparisons across stakes. We use the following two criteria for our categorization: first, we define subjects who present no magnitude effect as those who have “unchanged” results in all the 10 comparisons across stakes, subjects who present positive magnitude effects as those who have no “decrease” and at least one “increase”, and subjects who present negative magnitude effects as those who have no “increase” and at least one “decrease”, and call the rest as subjects who present a mixed magnitude effect. Arguably this criterion is too stringent in that it does not allow for any error. Our second criterion allows for error: we define subjects who present positive magnitude effects as those who have more “increases” than “decreases” and subjects who present negative magnitude effects as those who have more “decreases” than “increases”.
Results
Table 7 shows the results of the tests at the individual level. We reject the null hypotheses of no magnitude effect on the two parameters, in favor of positive magnitude effects. This shows that the two channels of the magnitude effect on intertemporal choices are robust against individual heterogeneity.
Table 7 Sign tests on preference parameters between total budgets Table 8 shows the results of subject categorization. Under both criteria, subjects who present positive magnitude effects are much more frequent than those who present negative magnitude effects. This is consistent with our main finding that positive magnitude effects exist.Footnote 18
Table 8 Categorization of subjects