1 Introduction

1.1 The Ramsey conjecture and endogenous time preference

The standard consumption-saving model in which households maximize their discounted lifetime utilities, when put under the natural assumption that they differ in their time preferences, has an uneasy long-run implication—given a time-constant interest rate, households are generically divided into two “classes”, one such that their consumptions and wealths converge to zero or subsistence level, the other such that their consumptions and wealths diverge to infinity; moreover, in the full dynamic general equilibrium environment in which the interest rate lowers over time according to diminishing returns to capital, all but the most patient household economically “perish”, and only the most patient one can “survive”, actually regardless of earnings and initial distribution of capitals. Such “division of society into two classes” has been conjectured by Ramsey in his classic work Ramsey (1928), and later confirmed by Becker (1980) and Bewley (1982) in the discrete-time setting and Mitra and Sorger (2013) in the continuous-time setting.

Look at the discounted utility form, written in the continuous-time setting which the current paper adopts:

$$\begin{aligned} \int _{0}^{\infty } v(c(t)) \mathrm{e}^{-\beta t}\mathrm{d}t, \end{aligned}$$

where \(\beta\) denotes the discount rate. When the long-run pure interest rate r is given as fixed, we have to have either:

$$\begin{aligned} \beta >r \end{aligned}$$

or

$$\begin{aligned} \beta <r, \end{aligned}$$

generically. In the first case, the household saves less, or even borrows, and as this accumulates exponentially over time its consumption/wealth go to zero. In the second case, the household saves more, and as this accumulates exponentially over time its consumption/wealth go to infinity.

When the long-run interest rate is flexibly adjusted to diminishing return to capital over time, it is expected to satisfy the condition:

$$\begin{aligned} \beta = r, \end{aligned}$$

but this can never be met when households have different \(\beta\)s. Thus, it has been shown that all but one with the smallest \(\beta\) are pushed away toward zero consumption/wealth or subsistence level in the long run.Footnote 1

There are two kinds of uneasiness here. One is that such long-run state does not look normatively right. There is nothing wrong in terms of the classical concept of welfare applied to “long-lived” households’ preferences over their consumption paths, since such equilibrium path is Pareto-efficient and even “fair” in the sense of envy-freeness if they have equal earnings and initial capital holding. According to this, it is simply that impatient households consume more in earlier periods. It is still problematic in two senses, though. First, since allocations take place over time, welfare criteria applied just to ex-ante evaluation of planned life-courses may be normatively insufficient, and it is ethically a different question whether we should accept resulting ex-post inequalities. Second, when the “long-lived” households are interpreted as families, such that nobody is responsible for in which family and in which generation and with what nature of time preference he or she is born, this extreme long-run inequality is definitely problematic.

Second is that such extreme long-run inequality looks simply unrealistic. Although the precise answer depends on how long the “long-run” is, our crude sense tells us that plural and different households are there, while there exist unignorable and perhaps severe inequalities among them due to other reasons. This motivates various positive models to describe and explain that plural and different households can economically survive.

Because the extreme long-run inequality result depends critically on the assumption of discounted utility preference, which is additively separable and stationary and seen as a knife-edge case even within the class of “rational (in the sense of dynamic consistency)” preferences, many preference models have been proposed to allow for the roles of non-separability and/or non-stationarity, and it has been investigated whether the extreme long-run inequality result is mildered or still persists.

To illustrate, consider that the discount rate at each moment takes the form \(\beta (x)\), where x can be either a decision variable which can be chosen at each moment, such as consumption, or a state variable such as physical capital, or accumulated habit. The idea is that in the long run, the steady-state value of x for each household, denoted \(x^{*}\), can be flexibly adjusted, so that the interior steady-state condition:

$$\begin{aligned} \beta (x^{*})=r \end{aligned}$$

is met, and this can hold for different \(\beta (\cdot )\) as the steady-state value is flexibly adjusted. This allows us to shift our question from existence of interior steady state, which generically fails, to its stability.

There are various existing models of such endogenous time preference. Uzawa (1968) proposes a preference model in which \(\beta\) depends on the current consumption, and it is axiomatized by Epstein (1983) in the discrete-time setting. Its further generalization is provided by Epstein (1987b), which allows non-linear way of discounting continuation lifetime utility, and is seen as a continuous-time counterpart of the classic discrete-time model of recursive utility by Koopmans (1960). Based on those models of recursive utility, a number of studies provide characterizations of the stability condition for economies with heterogenous households, such as Epstein and Hynes (1983), Lucas and Stokey (1984), Epstein (1987a, b), Benhabib et al. (1988).Footnote 2

These stability results assume that discount rate at each moment is increasing in present consumption. On the other hand, Das (2003, 2007), Hirose and Ikeda (2008) consider that discount rate is decreasing in present consumption. This is consistent with the empirically natural observation that the rich (poor) are more (im)patient, which was originally suggested by Fisher (1930). They show that steady state may be unstable, or that there are multiple steady states.

Note that the above-noted class is stationary, while weakening additive separability, and there preference over-consumption streams being held at each time is independent of histories of past consumptions, meaning that whatever the history is the consumer’s discounting between the present period and the next period depends only on the present consumption. This is somewhat odd as an explanation of the “Fisherian” story, because it says that regardless of wealth accumulation, a sudden change in the present consumption immediately changes the degree of patience, and also that such effect lasts only for one period. We view that patience should be modeled rather as certain kind of capital.

Shi and Epstein (1993) incorporate habit formation into time-preference determination, in which \(\beta\) depends on the level of consumption habit and is increasing in it, meaning that higher habit leads to more impatience. There, the habit rises as the household consumes more than the current habit level. This works as a stabilizer, as an over-consumption leads to higher habit, leading to less patience, leading to more current consumption but less in the future, leading to lower habit and more patience, and so on.

In the context of growth problem, Schumacher (2009) and Strulik (2012) consider that discount rate at each moment depends on physical capital amount, which goes along the line that rich are more patient, and in general work in the direction to destabilize. They show that there are multiple steady states, so that the lower one is interpreted as a poverty trap.

1.2 Investment in time preference

This paper studies a model of endogenous time-preference formation, following Becker and Mulligan (1997), in which households can pay for costly investment to establish and maintain patience, where the degree of patience is now understood to be a capital. Following Doepke and Zilibotti (2005, 2008), we call such capital patience capital. A prominent example of the form of investment in patience capital is education, within or outside family.Footnote 3

We view that investment in patience capital is a natural channel of endogeneity in time-preference determination, and it tells causal effect more directly than the above-noted elements. It captures better the reverse causality that the rich (poor) are more (im)patient, because they are rich (poor), not just the single direction that the rich (poor) are more (im)patient, because they are (im)patient.

Effect of education on time-preference determination is documented by experimental studies such as Alan and Ertac (2018) and Perez-Arce (2017). Also, numbers of studies document that time preference is related to socioeconomic status which naturally require inputting wealth for acquisition and maintenance, or to wealth itself (see, for example, Lawrance 1991; Barsky 1997; Tanaka et al. 2010; Dohmen 2015). Although the precise causal effects are yet unclear, it will be fair to say that acquiring and maintaining patience are at least costly.

Patience capital may be accumulated over time, while it can depreciate, as well. In the intergenerational interpretation, this means that patience capital is inherited to descendants. Although acquired characteristics are not inherited biologically, in the literature of sociology, numbers of studies have documented that “cultural capital” is reproduced across generations within a family or a class, through cultural inheritance (see, for example, Bourdieu 1973; DiMaggio 1982).

After the works by Doepke and Zilibotti (2005, 2008), there is a growing attention to investment in patience capital in the context of endogenous formation of “spirit of capitalism” and its effect on economic development (see, for example, Kawagishi 2014; Haruyama and Park 2017). To our knowledge, though, the implication of investment in patience capital to long-run distribution of consumption and wealth in relation to the rather classic problem as explained above is not known. Does investment in time preference lead to survival and coexistence of heterogenous types, or opposite?

Denote the amount of patience capital by \(\pi\), then the endogenous discount rate is given by \(\beta (\pi )\), which is decreasing in \(\pi\), meaning that higher patience leads to less discounting of the future. Unlike physical capital, such patience capital is not transferrable across households after acquired. Evolution of patience capital \(\pi\) at each moment is determined by the amount of investment-like consumption. Precisely, it is given by:

$$\begin{aligned} {\dot{\pi }}=-\delta \pi + g(z), \end{aligned}$$

where z denotes the amount of investment-like capital, g is the production function, and \(\delta\) denotes the depreciation rate. Investing z requires material cost, and hence, it is in a direct material trade-off with pure consumption. The preference model is seen as a multi-period extension of the model by Becker and Mulligan (1997), where they assume determination of time preference occurs just once in the initial period. It parallels with the discrete-time dynastic model by Doepke and Zilibotti (2005, 2008), while there is a conceptual difference (or trade-off, perhaps) that they consider that each generation can invest on the next generation’s patience level for the latter’s lifetime, but the degree of altruism toward succeeding generation remains constant across all generations, and we consider that both evolve endogenously, but they are described altogether by one variable \(\pi\).

First, we will consider linear technology, where interest rate r is time-constant, to isolate the problem of whether investment in time preference has a stabilizing effect for each household or not. In contrary to the previous literature on endogenous time preference, we show that the interior steady-state condition:

$$\begin{aligned} \beta (\pi ^{*})=r \end{aligned}$$

is unstable in the following sense: per one household, there is a one-dimensional curve lying in the space of the two-dimensional space of state variables \((\pi ,k)\), where k denotes the amount of physical capital, which pass through the interior steady state \((\pi ^{*},k^{*})\) as the only possible stable path leading to it, such that convergence to the steady state occurs only when the vector of initial values \((\pi (0),k(0))\) falls exactly on the curve.

Thus, we again see a division of society into two “classes”, where the above-described curve plays the role of border. Households with initial vectors falling in the upper side of the border are to be in the “upper class”, invest more in patience capital, which leads themselves to save more, and hence, the consumption level grows in the long-run. Households with initial vectors falling in the lower side of the border are to be in the “lower class”, opt out from investing in patience capital, leading to a decay of patience level, which leads themselves to save less, and hence, they perish in the long-run. In this sense, the extreme long-run inequality as obtained in the basic discounted utility model is shown to be rather robust and self-confirming. We also show a possibility that there is an expanding swing between the two classes.

We then show that this result extends to economies with technologies with diminishing returns to capital, as far as the diminishment is sufficiently small. This shows that the instability result is not a mere knife-edge case.

2 The household model

Consider an infinite horizon continuous-time setting. There are two consumption goods: one is for pure consumption and the other is for “investment-consumption.” Denote the amount of pure consumption at time t by c(t), and that of investment-consumption by z(t). Denote a path of such pair by \((c,z)=(c(t),z(t))_{t\ge 0}\), which is the object of preference.

We consider that a household has preference represented in the form:

$$\begin{aligned} U(c,z)= & {} \int _{0}^{\infty } v(c(t)) \mathrm{e}^{-\int _{0}^{t}\beta (\pi (\tau )) \mathrm{d}\tau } \mathrm{d}t\\ {\dot{\pi }} (t)= & {} -\delta \pi (t) +g(z(t)) \\ \pi (0)= & {} \text {given}, \end{aligned}$$

where \(\beta\) is decreasing in \(\pi\). The preference is indirectly represented through variable \(\pi\), which is interpreted as “patience capital”. Moment by moment, it is accumulated through having the investment-consumption z(t), while it depreciates with rate \(\delta \ge 0\).

In a market, a household has to pay for purchasing the investment-consumption good. Thus, there is a trade-off between enjoying pure consumption and establishing patience through having the investment-consumption good. Also there is a trade-off between saving and establishing patience, in addition to the standard trade-off between pure consumption and saving.

We make the following regularity assumptions on v, \(\beta\), and g.

Assumption 1

\(v: {\mathbb {R}}_{+}\rightarrow [0,{\overline{v}})\) is twice-continuously differentiable on \({\mathbb {R}}_{++}\) and satisfies \(v^{\prime }>0\) and \(v^{\prime \prime }<0\). Also, it satisfies \(\lim _{c \rightarrow 0}v^{\prime }(c) =\infty\) and \(\lim _{c \rightarrow \infty }v^{\prime }(c) =0\).

In contrast to the case of standard discounted utility, the assumption that the period utility function is positive-valued has a behavioral content, since if it is negative, the household may prefer to become more impatient. We assume v is bounded from above to guarantee that the lifetime utility function is well defined.

Assumption 2

\(\beta : {\mathbb {R}}_{++}\rightarrow ({\underline{\beta }},\infty )\) is twice-continuously differentiable on \({\mathbb {R}}_{++}\), and satisfies \(\beta ^{\prime }<0\) and \(\beta ^{\prime \prime }>0\). Also, it holds \(\lim _{\pi \rightarrow 0}\beta (\pi ) =\infty\) and \(\lim _{\pi \rightarrow \infty }\beta (\pi ) ={\underline{\beta }}\).

We assume that \(\beta\) is bounded away from below by \({\underline{\beta }}>0\). There are two reasons. One is that it is natural, in the sense that even after indefinite accumulation of patience capital, there is a fundamental minimal degree of discounting. Second, the minimal degree of discounting guarantees that the lifetime utility is well defined, because, otherwise, the household’s lifetime utility may blow up by means of making oneself indefinitely more patient.

Finally, we impose the following assumption on production of patience capital, which is quite natural.

Assumption 3

\(g: {\mathbb {R}}_{+} \rightarrow {\mathbb {R}}_{+}\) is twice-continuously differentiable on \({\mathbb {R}}_{++}\), and satisfies \(g^{\prime }>0\), \(g^{\prime \prime }<0\), and \(g(0) =0\). Also, it satisfies \(\lim _{z \rightarrow 0}g^{\prime }(z) =\infty\) and \(\lim _{z \rightarrow \infty }g^{\prime }(z) =0\).

3 Dynamic competitive equilibrium and its welfare property

At each time, an output good is produced from an input good and labor. Let F(KL) denote the aggregate production function, where K denotes the amount of capital input and L denotes the labor input, and it is assumed to be three-times differentiable over \({\mathbb {R}}^{2}_{++}\), weakly concave, and exhibits constant returns to scale. To suppress notation, we assume that depreciation of physical capital is already taken into account in the specification of F.

Each household has 1 unit of labor time which can be used as production input, and for simplicity, we assume that the households have no preference for leisure.

The produced output can be allocated either as the pure consumption good or the investment-consumption good or capital holding, as we implicitly assume that there is a linear technology which converts between them, and without loss of generality, we assume that the rate of conversion is 1.

Let \(I=\{1,\ldots ,n\}\) be the set of households. Let \(c_{i}(t)\) denote the amount of pure consumption, \(z_{i}(t)\) denote the amount of investment-consumption, and \(k_{i}(t)\) denote the amount of capital being held for household \(i\in I\) and time t, while \(k_{i}(0)\) is given. Let r(t) denote the return rate of capital and w(t) denote wage at time t. Here, an household allocation is denoted by \((c_{i},z_{i},k_{i})=(c_{i}(t),z_{i}(t),k_{i}(t))_{t\ge 0}\) and a social one is denote by \((c,z,k)=(c_{i},z_{i},k_{i})_{i\in I}\).

Definition 1

An allocation (czk) is a dynamic competitive equilibrium if there exists a path \((r,w)=(r(t),w(t))_{t\ge 0}\), such that \((c_{i},z_{i},k_{i})\) maximizes:

$$\begin{aligned} U_{i}(c_{i},z_{i})= & {} \int _{0}^{\infty } v_{i}(c_{i}(t)) \mathrm{e}^{-\int _{0}^{t}\beta _{i}(\pi _{i}(\tau )) \mathrm{d}\tau } \mathrm{d}t \end{aligned}$$

under the constraints:

$$\begin{aligned} {\dot{\pi }}_{i}(t)= & {} -\delta _{i} \pi _{i}(t) +g_{i}(z_{i}(t)) \\ {\dot{k}}_{i}(t)= & {} r(t)k_{i}(t) +w(t)-c_{i}(t)-z_{i}(t) \\ c_{i}(t), z_{i}(t)\ge & {} 0 \\ \pi _{i}(0), k_{i}(0)= & {} \text {given} \end{aligned}$$

for every \(i\in I\), so that the No-Ponzi condition:

$$\begin{aligned} \lim _{t\rightarrow \infty }\frac{k_{i}(t)}{e^{\int _{0}^{t} r(\tau ) d\tau }} \ge 0 \end{aligned}$$

is met, and \(\left( \sum _{i\in I}k_{i}(t),n\right)\) solves the profit-maximization problem:

$$\begin{aligned} \max _{K,L}F(K,L)-r(t)K-w(t)L \end{aligned}$$

and the market-clearing condition:

$$\begin{aligned} \sum _{i\in I} c_{i}(t)+ \sum _{i\in I} z_{i}(t)+ \sum _{i\in I} {\dot{k}}_{i}(t)= & {} F\left( \sum _{i\in I}k_{i}(t),n\right) \end{aligned}$$

is met for all t.

We assume that there is no borrowing constraint or that it is not binding in finite time range, as we are focusing on the question whether the economy can converge to an interior long-run steady state.

Under the No-Ponzi condition, we can consolidate the series of sequential budget constraints into one, as:

$$\begin{aligned} \int _{0}^{\infty } \frac{c_{i}(t)+z_{i}(t)}{\mathrm{e}^{\int _{0}^{t}r(\tau ) \mathrm{d}\tau }} \mathrm{d}t\le & {} \int _{0}^{\infty } \frac{w(t)}{\mathrm{e}^{\int _{0}^{t}r(\tau ) \mathrm{d}\tau }} \mathrm{d}t +r(0)k_{i}(0). \end{aligned}$$

Hence, the current definition of equilibrium falls in the Arrow–Debreu–McKenzie framework.

Proposition 1

If \((c_{i},z_{i},k_{i})_{i\in I}\) is a dynamic competitive equilibrium allocation, then \((c_{i},z_{i})_{i\in I}\) is an Arrow–Debreu–McKenzie equilibrium allocation.

As far as the investment-like consumption is understood as still falling in the category of “consumption”, there is nothing wrong with the classic concept of welfare, since any equilibrium consumption path is Pareto-efficient. Moreover, when each household is responsible for the initial level of patience capital and the way of reproducing it is “a matter of taste”, the equilibrium path is even “fair” in the sense of absence of envy among households when they are endowed with equal initial amounts of physical capital.

Say that a feasible allocation \((c,z)=(c_{i},z_{i})_{i\in I}\) is ex-ante Pareto-efficient if there is no feasible allocation \(({\widetilde{c}},{\widetilde{z}})=({\widetilde{c}}_{i},{\widetilde{z}}_{i})_{i\in I}\), such that \(U_{i}({\widetilde{c}}_{i},{\widetilde{z}}_{i}) \ge U_{i}(c_{i},z_{i})\) for all \(i\in I\) and \(U_{i}({\widetilde{c}}_{i},{\widetilde{z}}_{i}) > U_{i}(c_{i},z_{i})\) for at least one \(i\in I\).

Say that an allocation \((c,z)=(c_{i},z_{i})_{i\in I}\)ex-ante envy-free if \(U_{i}(c_{i},z_{i}) \ge U_{i}(c_{j},z_{j})\) for all \(i,j \in I\). We put the word “ex-ante” to emphasize that it is about ex-ante evaluation of planned life-courses. Note that here, every i is taken to be responsible for its preference, including the initial amount of patience capital \(\pi _{i}(0)\).

Proposition 2

Arrow–Debreu–McKenzie equilibrium allocation (cz) is Pareto-efficient. It is also ex-ante envy-free if \(k_{i}(0)=k_{j}(0)\) for all \(i,j\in I\).

Despite of Pareto efficiency and fairness from the ex-ante viewpoint, we show in the next section that the long-run ex-post consequence can be an extreme inequality.

4 Analysis

4.1 Equilibrium dynamics and steady state

Let

$$\begin{aligned} \alpha _{i}(t) = e^{-\int _{0}^{t}\beta _{i}(\pi _{i}(\tau )) \mathrm{d}\tau } \end{aligned}$$

which follows the differential equation:

$$\begin{aligned} {\dot{\alpha }}_{i}(t)=-\beta _{i}(\pi _{i}(t)) \alpha _{i}(t). \end{aligned}$$

To simply notation, hereafter, we omit t as far as no confusion arises. Then, the maximization problem for a generic price-taking household is formulated as:

$$\begin{aligned} \max \int _{0}^{\infty } \alpha _{i}v_{i}(c_{i}) \mathrm{d}t \end{aligned}$$

subject to

$$\begin{aligned} {\dot{k}}_{i}= & {} rk_{i}+w-c_{i}-z_{i} \\ {\dot{\pi }}_{i}= & {} -\delta _{i} \pi _{i} + g_{i}(z_{i})\\ {\dot{\alpha }}_{i}= & {} -\beta _{i}(\pi _{i}) \alpha _{i}\\ c_{i}, z_{i}\ge & {} 0 \\ k_{i}(0)= & {} \text {given} \\ \pi _{i}(0)= & {} \text {given} \\ \alpha _{i}(0)= & {} 1. \end{aligned}$$

Set up the Hamiltonian, assuming an interior path, as:

$$\begin{aligned} H_{i}=\alpha _{i} v_{i}(c_{i}) + \lambda _{i} \left[ rk_{i}+w-c_{i}-z_{i} \right] + \mu _{i} [-\delta _{i} \pi _{i} + g_{i}(z_{i})]+\nu _{i}[-\beta _{i}(\pi _{i}) \alpha _{i}]. \end{aligned}$$

Then, the individually optimal path is characterized by:

$$\begin{aligned} \alpha _{i} v^{\prime }_{i}(c_{i}) -\lambda _{i}= & {} 0 \\ -\lambda _{i}+\mu _{i} g^{\prime }_{i}(z_{i})= & {} 0 \\ {\dot{\lambda }}_{i}= & {} - \lambda _{i} r \\ {\dot{\mu }}_{i}= & {} \delta _{i} \mu _{i}+\beta ^{\prime }_{i}(\pi _{i}) \nu _{i} \alpha _{i} \\ {\dot{\nu }}_{i}= & {} -v_{i}(c_{i}) +\beta _{i}(\pi _{i}) \nu _{i} \\ {\dot{k}}_{i}= & {} rk_{i}+w-c_{i}-z_{i} \\ {\dot{\pi }}_{i}= & {} -\delta _{i} \pi _{i} + g_{i}(z_{i})\\ {\dot{\alpha }}_{i}= & {} -\beta _{i}(\pi _{i}) \alpha _{i} \end{aligned}$$

plus three transversality conditions:

$$\begin{aligned} \lim _{t\rightarrow \infty } \lambda _{i}(t)k_{i}(t)\le & {} 0 \\ \lim _{t\rightarrow \infty } \mu _{i}(t)\pi _{i}(t)\le & {} 0 \\ \lim _{t\rightarrow \infty } \nu _{i}(t)\alpha _{i}(t)\le & {} 0. \end{aligned}$$

Note that together with the No-Ponzi condition. the first transversality condition reduces to:

$$\begin{aligned} \lim _{t\rightarrow \infty }\frac{k_{i}(t)}{\mathrm{e}^{\int _{0}^{t} r(\tau ) \mathrm{d}\tau }} =0. \end{aligned}$$

Also, because of positivity of \(\pi _{i}\), together with the first-order condition, the second transversality condition reduces to:

$$\begin{aligned} \lim _{t\rightarrow \infty }\frac{\pi _{i}(t)}{g_{i}^{\prime }(z_{i}(t)) \mathrm{e}^{\int _{0}^{t} r(\tau ) \mathrm{d}\tau }} =0, \end{aligned}$$

and it is relevant only when \(\pi _{i}\) diverges to positive infinity. Note also that the third transversality condition is met by any path, under our assumption that utility is bounded and the discount rate is bounded away from zero.

We eliminate \(\lambda _{i}\), \(\mu _{i}\), and \(\alpha _{i}\), while \(\nu _{i}\) cannot be, and obtain the dynamics:

$$\begin{aligned} {\dot{c}}_{i}= & {} \frac{(\beta _{i}(\pi _{i})-r)v^{\prime }_{i}(c_{i})}{v^{\prime \prime }_{i}(c_{i})}\\ {\dot{z}}_{i}= & {} -\frac{g^{\prime }_{i}(z_{i})}{g^{\prime \prime }_{i}(z_{i})} \left[ r+\delta _{i}+\frac{g^{\prime }_{i}(z_{i})\beta ^{\prime }_{i}(\pi _{i})\nu _{i}}{v^{\prime }_{i}(c_{i})}\right] \\ {\dot{\nu }}_{i}= & {} -v_{i}(c_{i}) +\beta _{i}(\pi _{i}) \nu _{i} \\ {\dot{\pi }}_{i}= & {} -\delta _{i} \pi _{i} + g_{i}(z_{i})\\ {\dot{k}}_{i}= & {} rk_{i}+w-c_{i}-z_{i}. \end{aligned}$$

On the other hand, from the profit-maximization condition in equilibrium, it holds:

$$\begin{aligned} r= & {} F_{1}\left( \sum _{i}k_{i},n\right) \\ w= & {} F_{2}\left( \sum _{i}k_{i},n\right) . \end{aligned}$$

By taking the time derivative of the above, we obtain the dynamics of equilibrium interest rate and wage:

$$\begin{aligned} {\dot{r}}= & {} F_{11}\left( \sum _{i}k_{i},n\right) \left[ r \sum _{i}k_{i}+nw-\sum _{i}c_{i}-\sum _{i}z_{i} \right] \\ {\dot{w}}= & {} F_{21}\left( \sum _{i}k_{i},n\right) \left[ r \sum _{i}k_{i}+nw-\sum _{i}c_{i}-\sum _{i}z_{i} \right] . \end{aligned}$$

Interior steady state \(((c^{*}_{i},z^{*}_{i}, \nu ^{*}_{i}, \pi ^{*}_{i},k^{*}_{i})_{i\in I},r^{*},w^{*})\), if exists, is determined by:

$$\begin{aligned} \beta _{i}(\pi ^{*}_{i})= & {} r^{*}\\ g_{i}(z^{*}_{i})= & {} \delta _{i} \pi ^{*}_{i}\\ \frac{v^{\prime }_{i}(c^{*}_{i})}{v_{i}(c^{*}_{i})}= & {} -\frac{g^{\prime }_{i}(z^{*}_{i}) \beta ^{\prime }_{i}(\pi ^{*}_{i}) }{r^{*}(r^{*}+\delta _{i})} \\ \nu ^{*}_{i}= & {} \frac{v_{i}(c^{*}_{i})}{r^{*}} \\ k^{*}_{i}= & {} \frac{c^{*}_{i}+z^{*}_{i}-w^{*}}{r^{*}} \\ r^{*}= & {} F_{1}\left( \sum _{i}k^{*}_{i},n \right) \\ w^{*}= & {} F_{2}\left( \sum _{i}k^{*}_{i},n \right) . \end{aligned}$$

As far as they all survive, the households end up with the same value of discount rate in steady state, which is equal to the given interest rate. Note that general existence of interior steady state is not obvious, though, because \(\beta _{i}\) has the natural lower bound \({\underline{\beta }}_{i}\) and it might be possible that the \(r^{*}\) can be above \(\max _{i}{\underline{\beta }}_{i}\) only by violating the other steady-state conditions. In the next subsections, we present a class of technologies, such that a unique steady state exists.

Then, the Jacobian matrix evaluated at the interior steady state takes the form of \((5n+2)\times (5n+2)\) matrix:

$$\begin{aligned} \left( \begin{array}{ccccc} A_{1} &{} O &{} \cdots &{} O &{} P_{1} \\ O &{} A_{2} &{} \cdots &{} O &{} P_{2} \\ \vdots &{} \vdots &{} \ddots &{} \vdots &{} \vdots \\ O &{} O &{} \cdots &{} A_{n} &{} P_{n} \\ E &{} E &{} \cdots &{} E &{} Q \end{array} \right) , \end{aligned}$$

where

$$\begin{aligned} A_{i} = \left( \begin{array}{ccccc} 0 &{} 0 &{} 0 &{} \frac{\beta ^{\prime }_{i}v^{\prime }_{i}}{v^{\prime \prime }_{i}} &{}0\\ -(r+\delta _{i}) \frac{g^{\prime }_{i}v^{\prime \prime }_{i}}{g^{\prime \prime }_{i}v^{\prime }_{i}} &{} r+\delta _{i} &{} (r+\delta _{i}) r \frac{g^{\prime }_{i}}{g^{\prime \prime }_{i}v_{i}} &{} (r+\delta _{i}) \frac{g^{\prime }_{i}\beta ^{\prime \prime }_{i}}{g^{\prime \prime }_{i}\beta ^{\prime }_{i}} &{} 0\\ -v^{\prime }_{i} &{} 0 &{} r &{} \frac{\beta ^{\prime }_{i} v_{i}}{r} &{} 0 \\ 0 &{} g^{\prime }_{i} &{} 0 &{} -\delta _{i} &{} 0\\ -1 &{} -1 &{} 0 &{} 0 &{} r \end{array} \right) , \ \ \ P_{i} = \left( \begin{array}{cc} -\frac{v^{\prime }_{i}}{v^{\prime \prime }_{i}} &{} 0\\ -\frac{g^{\prime }_{i}}{g^{\prime \prime }_{i}} &{} 0\\ 0 &{} 0 \\ 0 &{} 0 \\ k_{i} &{} 1 \end{array} \right) \end{aligned}$$

for each i and

$$\begin{aligned} E= & {} \left( \begin{array}{ccccc} -F_{11} &{} -F_{11} &{} 0 &{} 0 &{} F_{11}r \\ -F_{21} &{} -F_{21} &{} 0 &{} 0 &{} F_{21}r \end{array}\right) \\ Q= & {} \left( \begin{array}{cc} F_{11}{\sum }_{i}k_{i} &{} F_{11}n \\ F_{21}{\sum }_{i}k_{i} &{} F_{21}n \end{array}\right) . \end{aligned}$$

4.2 The case of linear technology

Here, we restrict attention to consumption and investment under linear technology, \(F(K,L)=rK+wL\), to isolate the problem of whether investment in patience capital has a stabilizing or rather destabilizing effect at the pure household level. Assume that \(r> \max _{i}{\underline{\beta }}_{i}\), and then, a unique interior steady state exists. Otherwise, there is no steady state with all households surviving, and some households automatically perish in the long run.

When technology is linear, there is no interaction between households, the aggregate outcome is simply the sum of individual ones, and it suffices to look at each household problem separately. Since \(F_{11}=F_{21}=0\) here and E and Q are zero matrices, we only need to look at the stability property of each diagonal block \(A_{i}\) separately.

To see if the system described by \(A_{i}\) is stable, note that since \(\pi _{i}\) and \(k_{i}\) are the state variables, the number of stable roots required for stability with a unique optimal path is exactly two.

Proposition 3

The number of stable roots for each \(A_{i}\) is one. Let \(\theta _{i1}\) denote the only stable root for \(A_{i}\), and \(\theta _{i2},\theta _{i3},\theta _{i4},\theta _{i5}\) denote the unstable roots, ordered in the ascending manner according to their real parts, and then, at least \(\theta _{i1},\theta _{i4},\theta _{i5}\) are real and it holds:

$$\begin{aligned} \theta _{i1}<-\delta _{i}<0< \mathrm {Re}\theta _{i2}\le \mathrm {Re}\theta _{i3}<\theta _{i4}=r<r+\delta _{i} <\theta _{i5}. \end{aligned}$$

Proof

Pick any i. Since it is clear that one eigenvalue of \(A_{i}\) is r, we can restrict attention to its submatrix:

$$\begin{aligned} \left( \begin{array}{cccc} 0 &{} 0 &{} 0 &{} \frac{\beta ^{\prime } v_{i}^{\prime }}{v_{i}^{\prime \prime }} \\ -(r+\delta _{i}) \frac{g_{i}^{\prime }v_{i}^{\prime \prime }}{g_{i}^{\prime \prime }v_{i}^{\prime }} &{} r+\delta _{i} &{} (r+\delta _{i}) r \frac{g_{i}^{\prime }}{g_{i}^{\prime \prime }v_{i}} &{} (r+\delta _{i}) \frac{g_{i}^{\prime }\beta _{i}^{\prime \prime }}{g_{i}^{\prime \prime }\beta _{i}^{\prime }} \\ -v_{i}^{\prime } &{} 0 &{} r &{} \frac{\beta _{i}^{\prime } v_{i}}{r} \\ 0 &{} g_{i}^{\prime } &{} 0 &{} -\delta _{i} \\ \end{array} \right) . \end{aligned}$$

Then, its characteristic polynomial is:

$$\begin{aligned} p_{i}(\theta _{i})= & {} (\theta _{i}+\delta _{i})\theta _{i} (\theta _{i}-r)(\theta _{i}-(r+\delta _{i})) \\&-(r+\delta _{i}) \frac{(g_{i}^{\prime })^{2}}{g_{i}^{\prime \prime }} \frac{\beta _{i}^{\prime \prime }}{\beta _{i}^{\prime }} \theta _{i} (\theta _{i}-r) \\&-(r+\delta _{i}) r \frac{(g_{i}^{\prime })^{2}}{g_{i}^{\prime \prime }} \beta _{i}^{\prime } \left( 1-\frac{(v_{i}^{\prime })^{2}}{v_{i}^{\prime \prime }v_{i}} \right) , \end{aligned}$$

which is an even function around r / 2. Note that one can verify: \(-(r+\delta _{i}) r \frac{(g_{i}^{\prime })^{2}}{g_{i}^{\prime \prime }} \beta _{i}^{\prime } \left( 1-\frac{(v_{i}^{\prime })^{2}}{v_{i}^{\prime \prime }v_{i}} \right) <0\).

Since the first line above, a fourth-order function being even around r / 2 equals to zero at \(\theta _{i}=-\delta _{i}, 0, r, r+\delta _{i}\), and the second line above, a second-order function being even around r / 2, equals to zero at \(\theta _{i}=0,r\), their sum equals to zero at \(-\delta _{i}-\omega _{i}\), 0, r, \(r+\delta _{i} +\omega _{i}\) with some \(\omega _{i}>0\).

Since the third line above is negative, \(p_{i}(\theta _{i})=0\) has at least two real roots \(-\delta _{i}-\rho _{i}\) and \(r+\delta _{i}+\rho _{i}\) with \(\rho _{i}>0\). When \(p_{i}(r/2)\ge 0\), the other two are real and they are \(r/2-\sigma _{i}\) and \(r/2+\sigma _{i}\) with \(0\le \sigma _{i}<r/2\). The case of \(p_{i}(r/2)< 0\) cannot be ruled out, because we can always make \(v^{\prime \prime }<0\) arbitrarily close to zero without changing \(v_{i}\), \(v^{\prime }_{i}\) and the other values and the steady state, so that \(-(r+\delta _{i}) r \frac{(g_{i}^{\prime })^{2}}{g_{i}^{\prime \prime }} \beta _{i}^{\prime } \left( 1-\frac{(v_{i}^{\prime })^{2}}{v_{i}^{\prime \prime }v_{i}} \right)\) is a large negative number. Then, the other two roots are complex and they are \(r/2-\sigma _{i} \sqrt{-1}\) and \(r/2+\sigma _{i} \sqrt{-1}\) with \(\sigma _{i}\) being some real number. \(\square\)

Since the number of stable roots for each household’s linearized system is one, the interior steady state is unstable, while there is a stable manifold with dimension one, a curve.

To get an idea about the dynamics, we can verify that the eigenvector for \(\theta _{i4}=r\) is \((0,0,0,0,1)^\mathrm{T}\), and the eigenvector for each \(\theta _{ik}\), \(k=1,2,3,5\), is:

$$\begin{aligned} \left( \frac{1}{\theta _{ik}} \frac{\beta _{i}^{\prime }v_{i}^{\prime }}{v_{i}^{\prime \prime }} , \frac{\theta _{ik}+\delta _{i}}{g_{i}^{\prime }}, \frac{1}{r-\theta _{ik}} \left[ \frac{1}{\theta _{ik}} \frac{\beta _{i}^{\prime }(v_{i}^{\prime })^{2}}{v_{i}^{\prime \prime }} -\frac{\beta _{i}^{\prime }v_{i}}{r}\right] , 1, \frac{1}{r-\theta _{ik}}\left[ \frac{1}{\theta _{ik}} \frac{\beta _{i}^{\prime }v_{i}^{\prime }}{v_{i}^{\prime \prime }} +\frac{\theta _{ik}+\delta _{i}}{g_{i}^{\prime }}\right] \right) ^\mathrm{T}, \end{aligned}$$

which implies that the local dynamics is given by:

$$\begin{aligned} \left( \begin{array}{c} c_{i}-c^{*}_{i}\\ z_{i}-z^{*}_{i}\\ \nu _{i}-\nu ^{*}_{i}\\ \pi _{i}-\pi ^{*}_{i}\\ k_{i}-k^{*}_{i} \end{array} \right) = s_{i4} e^{rt} \left( \begin{array}{c} 0\\ 0\\ 0\\ 0\\ 1 \end{array} \right) + \sum _{k=1,2,3,5} s_{ik} e^{\theta _{ik}t} \left( \begin{array}{c} \frac{1}{\theta _{k}} \frac{\beta _{i}^{\prime }v_{i}^{\prime }}{v_{i}^{\prime \prime }} \\ \frac{\theta _{k}+\delta }{g_{i}^{\prime }}\\ \frac{1}{r-\theta _{k}} \left[ \frac{1}{\theta _{k}} \frac{\beta _{i}^{\prime }(v_{i}^{\prime })^{2}}{v_{i}^{\prime \prime }} -\frac{\beta _{i}^{\prime }v_{i}}{r}\right] \\ 1 \\ \frac{1}{r-\theta _{k}}\left[ \frac{1}{\theta _{k}} \frac{\beta _{i}^{\prime }v_{i}^{\prime }}{v_{i}^{\prime \prime }} +\frac{\theta _{k}+\delta _{i}}{g_{i}^{\prime }}\right] \end{array} \right) , \end{aligned}$$

where the constants \(\{s_{ik}\}_{k=1,2,3,4,5}\) are determined according to the initial values. Note that the eigenvectors projected on the space of state variables \((\pi _{i},k_{i})\) are downward for \(\theta _{i1}\) and \(\theta _{i5}\), upward for \(\theta _{i2}\) and \(\theta _{i3}\) when they are real, and vertical for \(\theta _{i4}=r\). Hence, the curve of stable manifold projected on the space of state variables is downward-sloping.

Since \(\theta _{i4}=r\) and \(\theta _{i5}>r+\delta _{i}\) are inconsistent with the conjunction of No-Ponzi condition and transversality condition, however, the associated upper left/lower right directions will not play a role in the global dynamics.

Thus, when \(\theta _{i2}\) and \(\theta _{i3}\) are real, the flows projected on the space of state variables look like Fig. 1. Here, the space is partitioned by the projected curve, and if the initial \((\pi _{i}(0),k_{i}(0))\) falls in one side, the household will invest more on patience capital, which leads to higher saving in the future and more accumulation of capital, resulting in more consumption in further future, and so on, and if it falls in the other side, the household will opt out from investing on patience capital, which leads to lower saving in the future and eating up the capital, resulting in less consumption in further future, and so on, while the speed of divergence to infinity/convergence to zero must be sufficiently modest to meet the transversality conditions. Thus, we see a division of society into two “classes”, even among ex-ante identical households, depending on their initial values of physical capital and patience capital.

Fig. 1
figure 1

Flows projected on the space of state variables: the case of \(\theta _{i2},\theta _{i3}\) being real

Fig. 2
figure 2

Flows projected on the space of state variables: the case of \(\theta _{i2},\theta _{i3}\) being complex

In the literature of (analytical) Marxian economics, they make a seemingly ad hoc assumption that capitalists are interested only in accumulation and workers are totally myopic, make no saving, and always live at the subsistence level (see, for example, Okisio 1963; Morishima 1977; Roemer 1988). The instability result may serve as an explanation that such extreme division can be rather self-confirming.

Note that this saddle-point instability is about steady state projected onto the two-dimensional space of state variables. The familiar saddle-point nature which we observe for example in the standard optimal growth model is about steady state in the space of one state variable (such as physical capital) and one jump variable (such as consumption). There, the household or planner can choose the initial value of the jump variable, and the fact that a steady state being a saddle point means that it is stable with regard to a unique optimal path. Here, neither a household or a planner can choose the initial pair of physical capital and patience capital, and therefore, a steady state being a saddle-point means that it is unstable in the above-stated sense.

Still we cannot rule out the case that \(\theta _{i2}\) and \(\theta _{i3}\) are complex. Then, the unstable flows form expanding cycles spinning around the curve of stable manifold, which look like Fig. 2 when projected on the space of state variables. This is a puzzling case, since when such an unstably cyclical orbit passes above and below the curve of stable manifold (i.e., when their projections cross on the space of state variables), the household can switch between the stable path and the unstable orbit, by restarting the life with the current state variable \((\pi _{i},k_{i})\). This can be viewed as a kind of indeterminacy, as the initial value \((\pi _{i}(0),k_{i}(0))\) may not determine an unique optimal path even for a single household.

4.3 Diminishing returns to capital and capital/labor complementarity

Just to explain that the instability result as above is not a mere knife-edge case due to linear technology, we show that the above argument extends to economies with diminishing returns to capital and capital/labor complementarity, while the degrees of diminishment and complementarity are sufficiently small.

We can take for example the class of CES production functions with the form \(F(K,L)=A\left[ aK^{\rho } +(1-a) L^{\rho }\right] ^{\frac{1}{\rho }}-\eta K\), where \(\eta\) denotes the rate of depreciation, such that the corresponding production set coincides with that of the linear production function \(rK+wL\) along the ray connecting the origin and \(\left( \sum _{i}k^{*}_{i},n\right)\). By varying \(A,a,\rho ,\eta\), we can make \(F_{11}\) and \(F_{21}\) arbitrarily close to zero at \(\left( \sum _{i}k^{*}_{i},n\right)\).

Then, the characteristic polynomial for the Jacobian matrix evaluated at the steady state is given by:

$$\begin{aligned} \left| \begin{array}{ccccc} \theta I-A_{1} &{} O &{} \cdots &{} O &{} -P_{1} \\ O &{} \theta I-A_{2} &{} \cdots &{} O &{} -P_{2} \\ \vdots &{} \vdots &{} \ddots &{} \vdots &{} \vdots \\ O &{} O &{} \cdots &{} \theta I-A_{n} &{} -P_{n} \\ -E &{} -E &{} \cdots &{} -E &{} \begin{array}{cc} \theta -F_{11}{\sum }_{i}k_{i}^{*} &{} -F_{11}n \\ -F_{21}{\sum }_{i}k_{i}^{*} &{} \theta -F_{21}n \end{array} \end{array} \right| , \end{aligned}$$

where I denotes the \(5 \times 5\) identity matrix.

Consider doing cofactor expansion of the above determinant formula first along the \(5n+2\)th row and then along the \(5n+1\)th row. Because all the entries of the second row of E are constant multiples of \(F_{21}\) and all the entries of the first row of E are constant multiples of \(F_{11}\), all the terms in the expanded determinant formula other than the one obtained by going along the diagonal are constant multiples of either of \(F_{11}\), \(F_{21}\), or \(F_{11}F_{21}\). Thus, the characteristic equation takes the form:

$$\begin{aligned} \left( \theta -F_{11}\sum _{i}k_{i}^{*}\right) \left( \theta - F_{21}n\right) \prod _{i}\left| \theta I- A_{i}\right| + F_{11}G_{1}(\theta )+ F_{21}G_{2}(\theta )+F_{11}F_{21}G_{12}(\theta )=0, \end{aligned}$$

where \(G_{1}(\theta ), G_{2}(\theta ),\) and \(G_{12}(\theta )\) are 5nth order polynomials.

Now, consider that \(F_{11}\) and \(F_{21}\) are arbitrarily close to zero. Then, the collection of roots in the characteristic equation is arbitrarily close to the one for:

$$\begin{aligned} \theta ^{2}\prod _{i}\left| \theta I- A_{i}\right| =0. \end{aligned}$$

Because the term \(\prod _{i}\left| \theta I- A_{i}\right|\) is independent of \(F_{11}\) and \(F_{21}\), the signs of 5n roots being close to the 5n ones for \(\prod _{i}\left| \theta I- A_{i}\right| =0\), which are all non-zero, do not change when \(F_{11}\) and \(F_{21}\) are sufficiently small. Out of those 5n roots, from the previous result we see that the number of stable roots is n. The signs of the remaining two are ambiguous as they may switch across zero under small perturbation.

Summing up, we obtain the following claim.

Proposition 4

Pick any \(r> \max _{i}{\underline{\beta }}_{i}\) and w for which an interior steady state \((c^{*}_{i},z^{*}_{i}, \nu ^{*}_{i}, \pi ^{*}_{i},k^{*}_{i})_{i\in I}\) exists in the corresponding linear technology economy. Then, there is a range of technology F with constant returns to scale and \(F_{11} \left( \sum _{i}k^{*}_{i},n\right) <0\) and \(F_{21} \left( \sum _{i}k^{*}_{i},n\right) >0\), which results in the same interior steady state with \(F_{1}\left( \sum _{i}k^{*}_{i},n\right) =r\) and \(F_{2}\left( \sum _{i}k^{*}_{i},n\right) =w\), such that the number of stable roots in the linearized system is either n or \(n+1\) or \(n+2\), while stability requires; it is 2n.

Thus, while there is a possibility of stability and even indeterminacy when \(n=1\) or \(n=2\), the similar kind of instability shows up again when \(n \ge 3\), and its nature gets closer to the one under linear technology as n tends to be large.

5 Conclusion

We conclude by listing the remaining problems and suggesting future directions for the research.

A complete and global characterization of equilibrium path is obviously desired, especially for the case of stronger diminishing returns to capital and capital/labor complementarity, while we believe that we have spelled out the critical nature of it. This will require a more involved technical treatment of the system of differential equations obtained here.

There will be two effects of lowering interest rate over time adjusting to diminishing returns to capital, while the total effect is ambiguous. One is that as r tends to go below \(\beta\), the bar for remaining in the “upper class” tends to be higher. This will discourage households from investing in patience capital and they will opt out. The other is that lowering interest rate makes patience capital relatively cheaper compared to physical capital, and this encourages investment in the former.

A numerical approach may help to get a picture. For example, in the case or time-constant interest rate, we can obtain the Hamilton–Jacobi–Bellman equation:

$$\begin{aligned} \beta (\pi )V(\pi ,k)=\max _{c,z} \left\{ v(c)+V_{1}(\pi ,k) \left[ -\delta \pi +g(z) \right] +V_{2}(\pi ,k) \left[ rk+w-c-z\right] \right\} , \end{aligned}$$

and there are substantial numbers of studies which allow us to solve such class of functional equation.

Also, its discrete-time counterpart is solvable by the recursive method (this confirms that the household model is dynamically consistent). The Bellman equation

$$\begin{aligned} V(\pi ,k)=\max \left\{ v(c)+\frac{1}{1+\beta (\pi )}V(\pi ^{\prime },k^{\prime }) \right\} , \end{aligned}$$

where

$$\begin{aligned} \pi ^{\prime }= & {} (1-\delta ) \pi + g(z) \\ k^{\prime }= & {} (1+r)k-c-z \end{aligned}$$

is solvable by the simple contraction-mapping method.

Also, one can think of a more general form of production of patience capital, like

$$\begin{aligned} {\dot{\pi }}=h(z,\pi ), \end{aligned}$$

while it will require handling complementarity between patience capital and investment.

From the standpoint of classic concept of welfare, there is nothing wrong with having severe ex-post inequalities in the long run, as time preference and the way of reproducing it are understood as “a matter of taste”. This is even “fair” from the ex-ante viewpoint. Yet, we will need to ask whether we should accept resulting ex-post inequalities and how we should reconcile between ex-ante equity and ex-post equity.

There is a difficulty, however, that even when we accept ex-post redistribution of physical capital, the same kind of inequality may emerge again, because it is impossible to redistribute patience capital. Then, the question becomes whether we should continuously redistribute physical capital to compensate for the infeasibility of redistribution of patience capital (or cultural capital/human capital in general), but this is even more ethically demanding.

One may think of a governmental intervention to promote investment in patience capital, but this leads to inefficiency according to the classic concept of welfare, since there is no externality between households here.

Finally, note that we considered an entirely frictionless economy with perfect foresights. This leaves a question whether any market friction or bounded foresight/concistency strengthens instability or rather stabilizes long-run distribution.Footnote 4