1 Introduction

Decision-making processes frequently involve updating information. A decision maker (henceforth DM) may change her intertemporal preferences over time as she updates uncertain information in the process, which may lead to apparently time-inconsistent behaviors. For example, abundant anecdotal evidence shows that people tend to reverse their original retirement plans when approaching the planned retirement age. They update uncertain survival information over time. Those in good health who expect to live longer are apt to continue working to accumulate more retirement pension in the future, while those in poor health are apt to stop working to receive less but earlier retirement pension.

Time-inconsistent behavior is also termed dynamic preference reversal,Footnote 1 which is well documented in academic research: the DM prefers a larger delayed reward to a smaller immediate reward but reverses her preferred choice in the future as a future time draws nearer.Footnote 2 Under the assumption of time invariance, dynamic preference reversal is equivalent to static preference reversal (Frederick et al. 2002): the phenomenon whereby a DM prefers a smaller immediate reward over a larger delayed one but reverses her preferences when these two alternatives are equally shifted to the future.Footnote 3 In short, preference reversals resulting from a change in decision timeFootnote 4 are termed “dynamic”, and a DM exhibiting dynamic preference reversal has time-inconsistent preferences. In contrast, reversals from an equal time shift from two future alternatives at the same decision time are termed “static”, and a DM exhibiting statistic preference reversal has non-stationary preferences. Numerous studies implicitly make the assumption of time invariance without distinguishing static from dynamic preference reversal, and they simply refer to these terms together as preference reversals or time-inconsistent choices. Furthermore, with the assumption of time invariance, non-stationary (stationary) preferences are equivalent to time-inconsistent (time-consistent) preferences.Footnote 5 However, in general, non-stationarity (stationarity) is not always equivalent to time inconsistency (time consistency). For example, Read (2004) demonstrates that a DM shows dynamic preference reversals but no static preference reversals, which means that the DM has stationary but time-inconsistent preferences.

In fact, the stationarity, time consistency and time invariance of preferences are similar yet distinctively different concepts. Stationarity means that at any fixed decision time, a preference over two future rewards depends only on the time distance and reward difference between these two alternatives. For example, Keren and Roelofsma (1995) show that when making decisions at the current time, 82% of their subjects prefer to receive $110 after 30 weeks rather than $100 after 26 weeks, and 37% of their subjects prefer to receive $110 after 4 weeks rather than $100 immediately. Those subjects are said to have stationary preferences because their preferences at a fixed decision time over two pairs of future alternatives with the same distance and reward difference do not change. However, 45% of their subjects prefer $100 immediately rather than $110 after 4 weeks. These subjects are said to have non-stationary preferences because their preferences at a fixed decision time over two pairs of future alternatives are affected by the difference between the reward times and the decision times. Time consistency means that a preference between two fixed future rewards remains the same at any decision time as long as the time and reward difference between these two alternatives remains the same. Take as an example the subjects who currently prefer to receive $110 after 30 weeks rather than $100 after 26 weeks. As time goes by, say, in 26 weeks, if they still prefer $110 in 4 weeks to $100 immediately, they are said to have time-consistent preferences. In everyday life, however, we see many examples of time inconsistency: people often deviate from their original savings plans for retirement by consuming more than initially planned. Time invariance means that a preference for any future two rewards with the same time distance and reward difference does not change as long as the time distances between the decision time and rewards remain the same. For example, for an experiment conducted on Jan 1, 2018, a subject prefers to receive $110 after 30 weeks rather than $100 after 26 weeks. When this experiment is repeated on Jan 1, 2019, the subject’s preference is considered time invariant if her decision does not change.

Using the axiomatic method, Halevy (2015) is the first to show that the presence of any two of the three properties (time invariance, time consistency and stationarity) implies the presence of the third. This theoretical result means that under the assumption of time invariance, the DM’s intertemporal preferences are time consistent if and only if these preferences are stationary. The concept of stationarity is premised on a fixed decision time; i.e., the time that the DM has to decide which alternative is more preferable does not change. In contrast, by its very definition, time consistency involves a shift in decision time. In other words, time consistency characterizes whether the DM’s preferences over future alternatives remain unchanged (time consistent) or not (time inconsistent) as the decision time goes by while the future delivery time of the two rewards is fixed. Naturally, time invariance provides a crucial link that connects these two distinct notions. The functional form of a time-invariant preference is not dependent on calendar time. In other words, time-invariant preferences over two future rewards depend only on the time distance and reward difference between two future alternatives and the time distance between the decision time and the reward time, regardless of whether the decision time changes. Consequently, stationarity and time consistency are equivalent. Likewise, non-stationarity is equivalent to time inconsistency when a preference is time invariant. Therefore, under the assumption of time invariance, dynamic preference reversal is equivalent to static preference reversal.

Economists generally believe that the present bias, which characterizes the DM’s non-stationary preferences, accounts for static preference reversal. Recent economic literature focuses on time-invariant hyperbolic discounting as a form of the present bias to explain both static and dynamic preference reversals.Footnote 6 Since hyperbolic discounting explains preference reversals and fits experimental data well while exponential discounting does not, preference reversals, especially the existence of dynamic ones, are regarded as evidence of hyperbolic discounting (Frederick et al. 2002).Footnote 7

Halevy’s (2015) experiments show that at least half of subjects are time consistent. However, only three-quarters of them exhibit stationary choice behavior, while approximately half of subjects with time-inconsistent choices have stationary preferences. These results imply that both time-varying and non-stationary behavior contribute to time-inconsistent behaviors. In fact, a person may be time inconsistent because of non-stationary preferences, time-varying preferences or both. This notion indicates that there may exist other explanations for time inconsistency besides the present bias, which describes non-stationarity. Halevy (2015) does not investigate the underlying causes of time-varying and non-stationary preferences. Motivated by Halevy (2015), we delve into a theoretical analysis of potential underlying causes of dynamic preference reversal.Footnote 8 We show that the process of updating information, which renders the DM’s preferences time varying and non-stationary, ultimately leads to dynamic preference reversal.Footnote 9 Consequently, the process of updating information may be another cause of dynamic preference reversal. To some extent, it is reasonable to assume time invariance for inter-temporal preferences, particularly in problems with a short time horizon. With the assumption of time invariance, non-stationarity is equivalent to time inconsistency. Therefore, the present bias explains both dynamic and static preference reversals. However, in general, the assumption of time invariance is untenable, as shown in this study.

In our analysis, we first characterize the three inter-related notions of time invariance, stationarity and time consistency by applying the concept of the marginal rate of substitution, which can be defined at any decision time occurring at or before the delivery of future rewards. We use the marginal rate of substitution to identify discounting behavior. The flexibility in choosing the decision time enables us to analyze how the process of updating information affects the DM’s preferences in our framework. We establish a sufficient condition for dynamic preference reversal characterized by the marginal rate of substitution. Next, following the frameworks developed by Phelps and Pollak (1968), Halevy (2005) and Ekeland and Pirvu (2008), we incorporate mortality risk and an altruistic factor into discount modeling.Footnote 10 We show that the resulting discount function is time varying and non-stationary and is similar to, yet distinct from, a hyperbolic discount function. The DM characterized by this discount function engages in dynamic preference reversal. More specifically, our results show that the process of updating information may be one of the underlying causes of dynamic preference reversal.

Our study makes several contributions to the literature. We follow the frameworks developed by Sozou (1998), Azfar (1999) and Halevy (2005) in modeling discounting, but we apply a different approach to explain dynamic preference reversal. We consider mortality risks and utility from bequests in our discounting model. Hence, we obtain results that are markedly different from those of Sozou (1998) and Halevy (2005). Those studies assume no utility derived from bequests and consequently predict no dynamic preference reversal behavior.Footnote 11 In our model, the discount value increases as the DM updates her survival information over time, which has a stronger effect on an earlier reward than on a later reward. As a result, the marginal rate of substitution decreases with the passage of decision time, leading to dynamic preference reversal behavior.Footnote 12 Second, we present a non-hyperbolic and time-varying discount function that explains dynamic preference reversal. Hence, it is premature to interpret the existence of dynamic preference reversal behavior as definitive evidence of hyperbolic discounting.Footnote 13 Finally, our analysis, in addition to that of Dasgupta and Maskin (2005), provides a theoretical underpinning for the possibility that the process of updating information may account for time-varying preferences, which lead to dynamic preference reversal behavior. Halevy (2015) does not investigate the underlying causes of time-varying and non-stationary preferences, and our results offer a theoretical explanation for his experimental results.

2 Preference reversals and discounting

It was well established by Halevy (2015) that the presence of any two of the three properties of time invariance, time consistency and stationarity implies the existence of the third. This result means that under the assumption of time invariance, the DM’s intertemporal preferences are time inconsistent if and only if the preferences are non-stationary.

Preference reversals have been widely studied in economics research. Under the assumption of time invariance, the DM exhibits static preference reversal if and only if she shows dynamic preference reversal (Frederick et al. 2002). Therefore, present-biased preferences that explain static preference reversal can also explain dynamic preference reversal under the assumption of time invariance.Footnote 14 Figure 1 illustrates the relationship among present-biased preferences, dynamic preference reversal, static preference reversal and time invariance.

Fig. 1
figure 1

The relationship between static and dynamic preference reversal

Present-biased preferences are a model of hyperbolic discounting, which is a natural way to represent how the DM responds to time delays (O’Donoghue and Rabin 2015).Footnote 15 The most popular models are the quasi-hyperbolic discounting (Phelps and Pollak 1968) and hyperbolic discounting functions proposed by Harvey (1986), Mazur (1987) and Loewenstein and Prelec (1992). All of these implicitly assume time invariance and explain both static and dynamic preference reversals from the perspective of discounting. The existence of dynamic preference reversal is regarded as evidence of hyperbolic discounting.Footnote 16 Figure 2 illustrates the relationship among hyperbolic discounting, present-biased preferences, time invariance, dynamic preference reversal and static preference reversal.

Fig. 2
figure 2

The relationship between static, dynamic preference reversal and hyperbolic discounting

The experimental results presented in Halevy (2015) show that the DM has time-varying preferences and that time-inconsistent behavior can occur as a result of non-stationary preferences, time-varying preferences or both. Therefore, in addition to the present bias, there may exist other causes that account for dynamic preference reversals. We will show that the process of information updating is an underlying cause of dynamic preference reversal behavior.

The decision time plays a very important role in revealing the processes of information updating as an underlying cause of dynamic preference reversal behavior. We next define preferences and incorporate the decision time into preferences to show that the DM’s preferences may change as the decision time changes.

Let [0, X] denote the set of outcomes, and let [0, T] denote the set of time at which an outcome occurs, where X > 0 and T > 0. A typical element of [0, T] × [0, X] is (tʹ + Δ, x), which represents the receipt of outcome x at time tʹ + Δ. Let \( { \succsim }_{t} \) represent the DM’s preferences over temporal payments at time t. We also use \( \{ { \succsim }_{t} \}_{t = 0}^{\infty } \) to denote the dynamic preference structure. The DM’s preferences \( { \succsim }_{t} \) over timed outcomes (t′ + Δ, x) can be represented by discounted utility DUt:

$$ {\text{DU}}_{t} \left( {t^{\prime } + \Delta ,x} \right) = D\left( {t,t^{\prime } + \Delta } \right)u\left( x \right). $$
(1)

That is, \( \left( t^{\prime } + \Delta_{1} ,x\right){ \succsim }_{t} \left( {t^{\prime } + \Delta_{2} ,y} \right)\) if and only if DUt(t + Δ1x) ≥ DUt(t + Δ2y). DUt is the discounted utility at decision time t, and u(x) is the utility of payment x. D(tt′ + Δ) is the discount function that denotes the value of one unit of utility at time t to be received at time t′ + Δ.Footnote 17

The discounted utility at t over the timed outcome (t′, x) depends on the discount function D(t, ) and on the utility of payment u(x). In the case of dynamic preference reversals, which result from a shift in the decision time, i.e., \( (t^{\prime } ,x) \prec_{t} (t^{\prime } + \tau , y) \) and \( (t^{\prime } ,x) \succ_{s} (t^{\prime } + \tau , y) \), where \( t < s < t^{\prime } , \tau > 0 \) and x < y; timed outcomes x and y are fixed at fixed times tʹ and \( t^{\prime } + \tau \); and the ratio between the discounted utility at t over timed outcomes (tʹ, x) and \( (t^{\prime } + \tau , y) \), \( \frac{{D(t,t^{\prime } + \tau )}}{{D(t,t^{\prime } )}}\frac{u(y)}{u(x)} \), depends only on the discount factors D(tt′) and D(tt′ + τ) as the decision time t changes. The discount factors D(tt′) and D(tt′ + τ) for the two timed rewards x and y increase as the decision time t changes. However, the discount factor for the earlier reward is not proportional to the later reward. Therefore, the marginal rate of substitution, \( \frac{{D(t,t^{\prime } + \tau )}}{{D(t,t^{\prime } )}} \), changes with the decision time, which may lead to dynamic preference reversal. Following Halevy (2005) and Frederick et al. (2002), we refer to this marginal rate of substitution as time discounting or as the value at tʹ of receiving a unit utility at t′ + τ from the decision time of t. This can be used to determine whether the corresponding discounting behavior is hyperbolic.

We next use the marginal rate of substitution to show that the process of updating information may be one underlying cause of dynamic preference reversal and to determine whether the corresponding discounting behavior is hyperbolic. We define the marginal rate of substitution at tʹ + Δ1 for utility at tʹ + Δ2 from the decision time t as follows:

$$ {\text{MRS}}_{t} \left( {t^{\prime } + \Delta_{1} ,t^{\prime } + \Delta_{2} } \right) = \frac{{D(t,t^{\prime } + \Delta_{2} )}}{{D(t,t^{\prime } + \Delta_{1} ))}},\quad \Delta_{2} > \Delta_{1} . $$
(2)

Following the definitions of time invariance, time consistency and stationarity provided in Halevy (2015) and the preferences represented in Eq. (1), we characterize time invariance, time consistency and stationarity as the marginal rate of substitution as follows:

Proposition 1

For every \( \Delta_{1} , \Delta_{2} > 0 \) and \( t, t^{\prime } > 0, \) preference \( \{ { \succsim }_{t} \}_{t = 0}^{\infty } \)

  1. (1)

    is time invariant if and only if the marginal rate of substitution of any future two rewards with the same time distance and reward difference does not change as long as the time distance between the decision time and rewards remains the same, i.e,

    $$ {\text{MRS}}_{t} \left( {t + \Delta_{1} ,t + \Delta_{2} } \right) = {\text{MRS}}_{{t^{\prime } }} \left( {t^{\prime } + \Delta_{1} ,t^{\prime } + \Delta_{2} } \right); $$
    (3)
  2. (2)

    is stationary if and only if at any fixed decision time, the marginal rate of substitution over two future rewards depends only on the time distance and reward difference between these two alternatives, i.e,

    $$ {\text{MRS}}_{t} \left( {t + \Delta_{1} ,t + \Delta_{2} } \right) = {\text{MRS}}_{t} \left( {t^{\prime } + \Delta_{1} ,t^{\prime } + \Delta_{2} } \right); $$
    (4)
  3. (3)

    is time consistent if and only if the marginal rate of substitution over two fixed future rewards remains the same at any decision time as long as the time distance and reward difference between these two alternatives remain the same, i.e,

    $$ {\text{MRS}}_{t} \left( {t + \Delta_{1} ,t + \Delta_{2} } \right) = {\text{MRS}}_{{t^{\prime } }} \left( {t + \Delta_{1} ,t + \Delta_{2} } \right). $$
    (5)

Proof

See Appendix 1.□

From Proposition 1, stationarity precludes static preference reversal, and time consistency precludes dynamic preference reversal. Furthermore, it is evident that under the assumption of time invariance, the DM exhibits static preference reversal if and only if she engages in dynamic preference reversal.Footnote 18 Although a variety of hypotheses have been proposed for the origins of these behaviors,Footnote 19 most economists agree that static preference reversal is typically attributed to present bias.Footnote 20 Hyperbolic discounting is usually defined as (for a fixed decision time) an increase in the marginal rate of substitution with two alternatives shifting equally into the future (see Read 2001; Read and Roelofsma 2003; Scholten and Read 2006; Noor (2009); Kinari et al. 2009; Halevy 2005), i.e., \(\frac{{d{\text{MRS}}_{t} \left( {s + \Delta_{1} ,s + \Delta_{2} } \right)}}{ds} > 0 \).Footnote 21 Time-invariant hyperbolic discount functions can explain both static and dynamic preference reversals.

The assumption of time invariance is untenable in real life because of the presence of a variety of risks.Footnote 22 Such risks may be due to an uncertain lifetime that may prevent the DM from realizing contracted utility. When an uncertain lifetime is considered in the decision-making process, the DM’s discounting behavior is dependent on the calendar time because of the DM’s non-constant hazard rate of death. Hence, static preference reversal may not be equivalent to dynamic preference reversal, and there may exist other causes to account for dynamic preference reversal. We establish a sufficient condition for dynamic preference reversal as follows:

Proposition 2

The DM exhibits dynamic preference reversal if the marginal rate of substitution MRSt(tʹ + Δ1tʹ + Δ2) decreases as the decision time t increases.

Proof

See Appendix 2.□

Proposition 2 shows that if the marginal rate of substitution decreases with the decision time, the DM will exhibit dynamic preference reversal. In such cases, the DM’s preferences may be time varying or non-stationary or both.Footnote 23 Notably, a decreasing marginal rate of substitution with decision time is only a sufficient condition for dynamic preference reversal. Other conditions may also explain dynamic preference reversal or other time-inconsistent choices.Footnote 24

In the next section, we investigate possible underlying causes of dynamic preference reversal by constructing a discounting model incorporating mortality risk, bequest motivations and survival information updating. We show that the discount function derived from this realistic yet mathematically tractable model is time varying, non-stationary and non-hyperbolic and accounts for dynamic preference reversal well.

3 Non-hyperbolic discounting and dynamic preference reversal

In this section, we construct a non-hyperbolic discount function that is time varying with the process of updating information but that explains dynamic preference reversal behavior well. We derive this new discount function from an analysis of a fundamental problem of lifetime consumption with mortality risk. As argued in Yaari (1965) and recently in Sozou (1998), Dasgupta and Maskin (2005) and Halevy (2005), discounting originates from the possible inability of the DM to enjoy future payoffs. We follow Yaari’s (1965) approach by considering mortality risk in our framework. The discount function thus derived encompasses the key elements of pure time preferences of exponential form (eρt), an “altruistic factor” (β), and survival probability (the Weilbull distribution, \( {\text{e}}^{{ - \lambda t^{\gamma } }} \)). Our approach is analogous to that of Halevy (2005) in that the primary source of discounting is the mortality risk of the DM, which makes it possible that the DM will not enjoy the fruits of future income streams. Through the process, the DM constantly updates her survival probability, causing her preferences to change over time and resulting in her engagement in dynamic preference reversal behavior.Footnote 25 Our conclusion, together with those of Dasgupta and Maskin (2005), provides a theoretical explanation for how the process of updating uncertain information may be another source of dynamic preference reversal, consistent with the experimental results of Halevy (2005).Footnote 26

Consider an individual who faces mortality risks when making intertemporal decisions on consumption and savings. The probability that she will be alive at time t is given by the survival probability function:

$$ \varOmega \left( t \right) = {e}^{{ - \lambda t^{\gamma } }} , $$
(6)

where λ > 0 and γ > 0. Life expectancy is the reciprocal of λ. Parameter γ is the monotonicity of the hazard rate of death. Cases 0 < γ < 1, γ = 1 and γ > 1 correspond to the burn-in, adult and wear-out periods, respectively.

The probability that the DM will be alive at time s conditional on being alive at time t is:

$$ \varOmega_{t} \left( s \right) = \frac{{\varOmega (s)}}{\varOmega (t)} = {\text{e}}^{{ - \lambda \left( {s^{\gamma } - t^{\gamma } } \right)}} . $$
(7)

The hazard rate of death is:

$$ \pi \left( s \right) = \frac{{ - \varOmega^{\prime } (s)}}{\varOmega (s)} = \lambda \gamma s^{\gamma - 1} . $$
(8)

The uncertain lifetime prevents the DM from realizing the entire stream of future contracted utility u(c(s)), which may be inherited by her heirs after her death. Unlike in previous studies (Yaari 1965; Sozou 1998; Halevy 2005) that assume no utility from bequests, we maintain that both the DM’s own consumption in her lifetime and her bequests to heirs have utility for her.Footnote 27 Moreover, utility derived from one’s own consumption and bequests differs due to, for example, inheritance tax or self-interest priorities. Following Phelps and Pollak (1968), we use the “altruistic” factor β, where 0 < β < 1,Footnote 28 to differentiate the utility derived from the DM’s own consumption from that derived from bequests to her heirs. The higher β is, the more the DM values the expected utility enjoyed by her heirs, thus facilitating a more “altruistic” attitude. We focus on the case of an altruistic DM with a non-constant hazard rate of death, i.e., γ > 1, which leads the DM to have time-varying and non-stationary preferences.Footnote 29

We analyze the present value (at time t) of the expected utility u(c(s)) of future consumption at s. At time s, if she is alive, the expected present value of future utility u(c(s)) at t is:

$$ \varOmega_{t} \left( s \right){\text{e}}^{{ - \rho \left( {s - t} \right)}} u\left( {c\left( s \right)} \right). $$
(9)

If she dies at time s1, s1 ∊ (ts), the utility of her future income stream after her death will be inherited by her heirs. Assuming that the pure time preferences of her heirs remain the same, the value of utility at s1 is \( {\text{e}}^{{ - \rho \left( {s - s_{1} } \right)}} u\left( {c\left( s \right)} \right) \), and the present value at t of future utility u(c(s)) is \( \beta {\text{e}}^{{ - \rho \left( {s_{1} - t} \right)}} {\text{e}}^{{ - \rho \left( {s - s_{1} } \right)}} = \beta {\text{e}}^{{ - \rho \left( {s - t} \right)}} \). Therefore, the present value at t of expected future utility u(c(s)) is:

$$ \left( {1 - \varOmega_{t} \left( s \right)} \right)\beta {\text{e}}^{{ - \rho \left( {s - t} \right)}} u\left( {c\left( s \right)} \right), $$
(10)

where β is the “altruistic” factor.

Given these two possibilities, the present value (at time t) of expected future contracted utility u(c(s)) at t is:

$$ \left[ {\varOmega_{t} \left( s \right){\text{e}}^{{ - \rho \left( {s - t} \right)}} + (1 - \varOmega_{t} \left( s \right))\beta {\text{e}}^{{ - \rho \left( {s - t} \right)}} } \right]u\left( {c\left( s \right)} \right). $$
(11)

Hence, the discount function for s to t is:

$$ D\left( {t,s} \right) = \left( {1 - \beta } \right)\overbrace {{{\text{e}}^{{ - \rho \left( {s - t} \right)}} {\text{e}}^{{ - \lambda \left( {s^{\gamma } - t^{\gamma } } \right)}} }}^{{{\text{short}} - {\text{run }}\,{\text{factor}}}} + \beta \overbrace {{{\text{e}}^{{ - \rho \left( {s - t} \right)}} }}^{{{\text{long}} - {\text{run }}\,{\text{factor}}}}. $$
(12)

D(ts) can be regarded as a convex combination of a short-run exponential discount factor \( {\text{e}}^{{ - \rho \left( {s - t} \right)}} {\text{e}}^{{ - \lambda \left( {s^{\gamma } - t^{\gamma } } \right)}} \) with weight (1 − β) and a long-run exponential discount factor eρ(st) with weight β.Footnote 30

The discount function [Eq. (12)] satisfies the sufficient condition of dynamic preference reversal established in Proposition 2 (see the proof shown in Appendix 3). In other words, the discounting behavior characterized by Eq. (12) exhibits dynamic preference reversal over time. A close inspection of Eq. (12) shows the underlying cause of this phenomenon. The DM updates her survival probabilities over time. This updating process has two marginal effects on the expected future utility Et[u(c(s))] that will emerge at time s: (a) it increases the probability of future utility Et[u(c(s))] consumed by oneself over time because as t approaches s, the conditional survival probability Ωt(s) increases; (b) it reduces the probability of the future utility consumed by one’s heirs. Effect (a) is proportional for Et[u(c(s1))] and Et[u(c(s2))], s2 > s1.Footnote 31 However, effect (b), which is negative, is larger for Et[u(c(s2))] than for Et[u(c(s1))]. These combined effects cause the discount value to increase with the passage of decision time and are larger for rewards received earlier than for rewards received later. Hence, the marginal rate of substitution at s1 for utility at s2 decreases with the passage of decision time. This consideration leads the DM to choose the smaller, earlier reward in the near future when she is allowed to re-optimize. Our analysis is analogous to that of Dasgupta and Maskin (2005). In their model, the DM updates information received earlier with the passage of time, and the corresponding updating information process results in dynamic preference reversal. In our model, the DM updates her survival information based on the mortality risk, which reflects a real-life situation and has a concrete economic interpretation.

To use a concrete example, consider a person at the age of 70 who is comparing two monetary rewards to be received when she turns 81 and 82 years. Suppose that the later reward is large enough for her to prefer it over the earlier reward at the age of 70. After ten years, she advances to the age of 80 and reassesses her chances of being alive at 81 and 82. She realizes that her chances of enjoying these rewards by herself are more likely now than when she was 70. Furthermore, the increase in the conditional survival probability of reaching 81 is greater than that of reaching 82.Footnote 32 On the other hand, the likelihood that she will bestow the rewards on her heirs is simultaneously reduced. As the utility of bestowing is only a fraction (β) of that of her own consumption, the increase in utility from her own consumption has a dominant effect. Consequently, at the age of 80, the utility of her enjoying the earlier reward may well increase sufficiently due to the increase in the updated conditional survival probability of reaching the age of 81 relative to that of reaching the age of 82, so she exhibits preference reversal and prefers the earlier reward to the later one.

We illustrate the MRS’s decreasing behavior by plotting the marginal rate of substitution MRSt(ss + 1). Without a loss of generality, we assume parameters β = 0.5, ρ = 0, λ = 0.04, γ = 1.2 and s = 4.Footnote 33 Figure 3 shows that MRSt(s, s + 1) decreases as the decision time t approaches the earlier alternative. From Propositions 1 and 2, the DM has a time-inconsistent preference and will show dynamic preference reversal.

Fig. 3
figure 3

The MRS of time-inconsistent behavior

Researchers use hyperbolic discounting—a functional form of discounting that accommodates present bias to characterize non-stationary preferences—to explain preference reversals. We next determine whether the discount function of Eq. (12) is hyperbolic. From the definition of hyperbolic discounting used in Read (2001), Read and Roelofsma (2003), Scholten and Read (2006), Noor (2009), Kinari et al. (2009) and Halevy (2005), a discount function is hyperbolic if its marginal rate of substitution increases as the two alternatives shift equally into the future when the decision time is fixed.Footnote 34 In our model, with a fixed decision time, the marginal rate of substitution first decreases and then increases, implying that the discount function of Eq. (12) is non-hyperbolic (see Appendix 4).

For a fixed decision time, the marginal rate of substitution first decreases and then increases in our discounting model as the two alternatives shift into the future. As in our previous analysis, this result is determined by the two marginal effects on the expected future utility. As the two alternatives shift equally into the future, the first effect decreases the probability of future utility E[u(c(s))] being consumed by oneself because of reduced remaining lifetime as one ages; the second effect increases the probability of future utility being consumed by one’s heirs.Footnote 35 The first effect, which negatively affects the expected future utility, is greater for the later alternative than for the earlier one due to the increasing hazard rate of death that causes the marginal rate of substitution to decrease; by contrast, the second effect is greater for the earlier alternative than for the later one, causing the marginal rate of substitution to increase. When the two timed alternatives will emerge in the near future, the probability that the utility is inherited and consumed by heirs is very small. Consequently, the first effect dominates the second, which leads the marginal rate of substitution to decline when shifting the two alternatives into the future. By contrast, when the two timed alternatives are set in the far distant future, the probability of utility being consumed by the DM is very low because of an individual’s limited lifetime. As a result, the second effect dominates the first, causing the marginal rate of substitution to increase. Our analysis shows that the DM’s non-constant hazard rate of death contributes to the formation of non-stationary preferences and that the DM’s non-stationary preferences are characterized by non-hyperbolic discounting behavior.

We illustrate the non-hyperbolic discounting behavior of Eq. (12) by plotting the marginal rate of substitution MRSt(s, s + 1). As in Fig. 3, we adopt the

parameters β = 0.5, ρ = 0, λ = 0.04, and γ = 1.2 and assume that the decision time is present, i.e., t = 0. Figure 4 shows that MRS0(ss + 1) first decreases and then increases as the two timed alternatives at s and s + 1 shift equally into the future. This pattern of the marginal rate of substitution means that the corresponding discounting behavior is non-hyperbolic.

Fig. 4
figure 4

The MRS of non-stationary behavior

From the above analysis, we make the following proposition:

Proposition 3

If the DM cares about her bequest and distinguishes between it and her own consumption, then discounting behavior is non-hyperbolic, and the process of updating survival information accounts for dynamic preference reversal behavior.

As shown above, the process of updating survival information accounts for dynamic preference reversal behavior. We next show that the marginal rate of substitution for our discount function [Eq. (12)] decreases with calendar time t (see the proof shown in Appendix 5), which implies that the DM’s preferences are time varying. As the DM approaches the end of her life with a non-constant hazard rate of death, she updates her survival information and recognizes changing survival probabilities. As the utility derived from a bequest is a fraction β of that derived from her own consumption, the discount function is time varying as she ages.

We illustrate time-varying behavior by plotting a marginal rate of substitution MRSt(t + Δ1t + Δ2). In the graph, we use the parameters β = 0.5, ρ = 0, λ = 0.04, γ = 1.2, Δ1 = 1 and Δ2 = 2. Figure 5 shows that MRSt(t + Δ1t + Δ2) decreases as calendar time t increases.

Fig. 5
figure 5

The MRS of time-varying behavior

Our analysis demonstrates that the DM’s preferences derived from our discounting model are non-stationary, time varying and time inconsistent. The DM’s non-constant hazard rate of death leads to the non-stationarity of her preferences. We also show that the preferences from our model are non-hyperbolic. The process of updating survival information is the underlying driver that renders them time varying and time inconsistent. Therefore, survival information updating may be another underlying driver of dynamic preference reversal behavior. Our results and those of Dasgupta and Maskin (2005) serve as a theoretical foundation that explains the experimental results of Halevy (2005).Footnote 36

4 Conclusion

In this study, we analyze the underlying causes of dynamic preference reversal. We first characterize the three inter-related notions of time invariance, stationarity and time consistency from the marginal rate of substitution and establish sufficient conditions that lead to dynamic preference reversal. We consider mortality risk and an altruistic factor in our discounting model and present a discount function that is non-stationary, time varying and non-hyperbolic. The DM’s preference is non-stationary because of the non-constant hazard rate of death. Moreover, the process of updating survival information renders discounting behavior time varying. The DM becomes more inclined to receive the smaller but earlier reward with the passage of the decision time. Our analysis shows that the process of updating uncertain information may account for the occurrence of dynamic preference reversal.