1 Introduction

The literature concerning various aspects of altruism in economic models is pretty large. For a comprehensive survey with historical notes, the reader is referred to [9, 21, 33, 35] and their references. This paper is devoted to study some mathematical issues related to existence of equilibria in a large class of economic growth models with altruism between generations. Intergenerational altruism has usually been modeled in two ways. In the paternalistic model, the utility of the current generation depends on its own consumption and the consumptions of other generations. In other words, the generation cares about what all or some successors will consume, but it does not take into account the utilities the successors derive from the act of consumption. In the non-paternalistic model, each generation derives utility from its own consumption and the utilities of future generations.

First strong results on the existence of Markov perfect equilibria in paternalistic economic growth models with deterministic transitions were established by Bernheim and Ray [11] and Leininger [28]. They assume that each generation cares only about consumption of its immediate successor. From the mathematical point of view, their proofs are rather complicated. A simpler and more direct method was used by Balbus et al. [3]. Certain extensions of the works of [11, 28] to models with specific stochastic production functions are surveyed in [8, 9, 25]. However, more general results the reader may find in [4, 5], where the transition probability function obeys a natural weak continuity condition. Such transition probabilities are extensively used in economics, since economic dynamics are typically described by some difference equations with additive or multiplicative shocks. The most general paternalistic model was considered by Balbus et al. [5]. However, their results concern non-atomic transition probabilities. The model studied in [4] allows to cover both deterministic production functions and the stochastic transitions that satisfy a stochastic dominance assumption. The drawback of this approach lies in the compactness of the state space and in the separability of the utility function which takes into account only an immediate descendant for each generation.

In contrast to the paternalistic case, there have been very few rigorous studies of models assuming the non-paternalistic altruism. As suggested by Ray [35], “this framework appears to be of somewhat greater interest in the context of applications.” Examples include the works of [10, 29]. For instance, Ray [35] described a general model with deterministic transitions involving non-paternalistic altruism and formulated an equilibrium concept, but its existence remains an open problem. Balbus et al. [6] showed that an equilibrium in a stochastic version of Ray’s framework exists provided that the transition probabilities are non-atomic. The other group of models were considered by Barro [10] or Loury [29], who dealt with only one descendant for each generation. However, the existence of an equilibrium consisting of an indirect utility and an optimal consumption (or saving) strategy in the aforementioned models can be studied by dynamic programming methods using contraction mapping theorems. It is worth mentioning that these methods were also applied to the wide class of various decision processes with recursive utilities, see for instance [17, 18].

In this paper, we study a version of so-called mixed models with both paternalistic and non-paternalistic components. A need for studying mixed models is expressed on page 113 in [35]. An approach to mixed models (two-sided altruism) is given by Hori [23]. He considers a rather specific model with pretty strong assumptions on the utility and deterministic transition functions. An equilibrium is shown to exist in this model by the Schauder fixed-point theorem. Although the approach of [23] concerns some specific model, it is inspiring for us.

In our model, we assume that every generation considers only its immediate successor. The equilibrium problem studied in this paper is a double fixed-point problem. One fixed point is obtained for an indirect utility function via a contraction mapping with a nonlinear discount function. This is a sort of recursive utility extensively discussed in economics, see Becker and Boyd [15]. The second one (in an appropriate strategy space) corresponds to Nash equilibrium in an intergenerational game. Our basic tool is the Schauder–Tychonoff fixed-point theorem (see e.g., Dugundji and Granas [16]). We would like to emphasize that the indirect utility that we consider depends on both consumption and endowment of the generation. This approach is generalized in the model of [35]. A similar mixed model is studied in the recent paper of [2]. However, he considers risk-sensitive generations (uses different utility functions), makes much stronger assumptions on the stochastic transition function and proves existence of equilibria in the class of Borel measurable randomized strategies. Therefore, his approach is essentially different from the one applied in this paper. Our proofs are based on techniques used in [3, 4] with some necessary modifications. We prove a general existence theorem for the mixed model with unbounded utility functions by applying the weighted norm approach, which was well developed in dynamic programming (see Wessels [40]; Jaśkiewicz et al. [27]). However, instead of standard exponential discounting, we use a nonlinear discount function. As a by-product, we considerably extend our earlier result for paternalistic models given in [4]. The most important point is that no separability condition on the utility function is imposed.

The rest of the paper is organized as follows. Section 2 contains preliminaries. Section 3 derives the utility function which incorporates the paternalistic and non-paternalistic altruism and formulates an equilibrium. Section 4 presents basic assumptions and existence theorems. Examples satisfying our conditions are given in Sect. 5. Finally, the proofs are collected in Sects. 6 and 7.

2 Preliminaries

In this section, we introduce some notation and state a few auxiliary results. As usual, \(\mathbb {R}\) stands for the set of all real numbers and \(\mathbb {N}\) is the set of all positive integers. Let \(S= \mathbb {R}_+ = [0,\infty )\), \(S_+= \mathbb {R}_+{\setminus }\{0\} \) and \(A(s):=[0,s],\)\(s\in S.\)

Let X be the vector space of all continuous from the left functions \(\phi :S\rightarrow \mathbb {R}\) such that \(\phi (0)=0\) and that the restriction of \(\phi \) to any bounded interval [0, m] (\(m\in \mathbb {N}\)) has a bounded variation. We assume that X is endowed with the topology of weak convergence. Recall that a sequence \((\phi _n)\) converges weakly to \(\phi \in X\) if and only if \(\phi _n(s)\rightarrow \phi (s)\) as \(n\rightarrow \infty \) at any continuity point \(s\in S\) of \(\phi .\) Here, we point out that \(s=0\) is considered as a continuity point of \(\phi \in X\) if \(\lim _{s\rightarrow 0^+}\phi (s)=\phi (0).\) The weak convergence of \((\phi _n)\) to \(\phi \) is denoted by \(\phi _n{\mathop {\rightarrow }\limits ^{*}} \phi \).

Let F be the set of all continuous function from the left mappings \(c:S\rightarrow S\) such that the function \( y(s):=s-c(s)\) is non-decreasing and \( c(s)\in A(s)\) for all \(s\in S.\) Note that \(s\rightarrow y(s)\) is lower semicontinuous. Thus, \(c\in F \) is upper semicontinuous. Define

$$\begin{aligned} I:=\{ y\in X: y(s)=s-c(s)\ \text{ where }\ s\in S,\ c\in F \}. \end{aligned}$$

Observe that \(I\subset X.\) Moreover, \(s=0\) is the continuity point of every function in F or I.

Lemma 1

X is a locally convex topological vector space. The sets I and F are convex and sequentially compact in X.

For a more detailed discussion, consult Lemma 1 and Appendix in [7].

In the sequel, we shall use a generalized version of the contraction mapping principle due to [31], see also Theorem 5.2 in [16].

Proposition 1

Let \((Y,\rho )\) be a complete metric space and \(\delta :S\rightarrow S\) be a continuous function such that \(\delta (0)=0\) and \(\delta (r)<r\) for all \(r\in S_+.\) Assume that \(T:Y\rightarrow Y\) is a mapping such that

$$\begin{aligned} \rho (Tx,Ty)\le \delta (\rho (x,y))\ \ \text{ for } \text{ all }\ \ x, y \in Y, \end{aligned}$$

and define \(T^1x:=Tx,\)\(T^{m+1}x:= TT^{m}x,\)\(x\in Y,\)\(m\in \mathbb {N}.\) Then, T has a unique fixed point \(x^*\in Y\) and \(\lim _{n\rightarrow \infty } \rho (T^nx_0,x^*)=0\) for any \(x_0\in Y.\)

We shall also assume that \(z \rightarrow \delta (z)/z\) is non-increasing on \(S_+.\) This assumption implies that \(\delta \) is subadditive and hence

$$\begin{aligned} |\delta (z_1)-\delta (z_2)|\le \delta (|z_1-z_2|)\ \ \text{ for } \text{ all }\ \ z_1, z_2 \in S. \end{aligned}$$

Moreover, the fact that \(z\rightarrow \delta (z)/z\) is non-increasing implies that \(\delta (\kappa z)/\kappa z \le \delta (z)/z\) for any \(z\in S_+\) and \(\kappa \ge 1.\) Hence, it holds

$$\begin{aligned} \delta (\kappa z)\le \kappa \delta (z)\ \ \text{ for } \text{ all }\ \ \kappa \ge 1,\ z\in S. \end{aligned}$$
(1)

As in the dynamic programming literature (see Jaśkiewicz et al. [27]), we call \(\delta \) a discount function.

Let \(d>0\) and \(\eta :[d,\infty ) \rightarrow \mathbb {R}\) be a fixed function. Following Milgrom and Shannon [32], we say that \(\eta \) has the strict single crossing property on \([d,\infty ),\) when the following holds: if there exists some \(x\ge d\) such that \(\eta (x) \ge 0\), then for each \(x'>x\), we have \(\eta (x')> 0.\) It is worth to note that \(\eta \) need not be increasing, see Example 3 in [3].

Let \(u:S\times S \rightarrow \mathbb {R}_+\) be a function of the form \(u(a,w) =g(u_o(a,w)).\) We make the following assumptions.

  1. (U1)

    g is continuous, increasing, and \(g(0)=0.\)

  2. (U2)

    \(u_o(0,0)=0\) and \(u_o\) is continuous on \(S\times S\) and increasing in each variable.

  3. (U3)

    For any \(w_2>w_1\) in S\(l>0\) and for each \(d>0\), the function \(D_lu_o(x):= u_o(x,w_2)-u_o(x+l,w_1)\) has the strict single crossing property on \([d,+\infty ).\)

Remark 1

The form of \(u(a,w)=g(u_o(a,w))\) is very convenient for verification of our assumptions. From them, it follows that u is nonnegative. Using this representation, one can immediately see that (U1)–(U3) hold for u(aw) such as \(\ln (1+\sqrt{a} +w),\)\(\sqrt{1+\sqrt{a}+w}-1,\)\(\sqrt{\sqrt{a}+w}\) or \(\ln (1+w\sqrt{a} ). \) The function u plays the role of “aggregator” in our definitions of the utility functions in altruistic growth models. Note that the aggregator of [20] of the form \(u(a,w)= ((1-\beta )a^\sigma +\beta w^\sigma )^{\frac{1}{\sigma }}\) with \(\beta ,\)\( \sigma \in (0,1)\) also satisfies (U1)-(U3).

The study of unbounded from below utilities u requires new methods and seem to be difficult to handle in this setup.

Remark 2

Bernheim and Ray [11] in their study of an altruistic growth model did not assume that u is the composition of \(u_o\) and g. They directly imposed conditions on u: u is strictly concave in its first argument and u satisfies the increasing difference property (ID). (ID) says that for each \(w_1 >w_2\) in S, the function \(x\rightarrow u(x,w_1)-u(x,w_2)\) is non-decreasing. The function u that satisfies (ID) is called supermodular. Supermodular functions turn out to be useful in operations research and game theory, see Topkis [38, 39]. Balbus et al. [3], on the other hand, assumed that u meets the strict single crossing property. On page 517 in [3], it is also shown that if u satisfies (ID) and is strictly concave in the first argument, then u possesses the strict single crossing property. Observe that the functions \(\ln (1+\sqrt{a} +w),\)\(\sqrt{1+\sqrt{a}+w}-1,\)\(\sqrt{\sqrt{a}+w}\) do not satisfy (ID). However, they are compositions of an increasing function g with \(u_o\) satisfying (ID). Below, we give an example of \(u=g\circ u_o\) where g is increasing and \(u_o\) satisfies condition (U3) and does not have the (ID) property. This example also illustrates how (U3) can be checked directly using the definition of \(u_o.\)

Example 1

Let \(u(a,w):= (w +\ln (a+w))^\sigma \) where \(\sigma \in (0,1).\) Clearly, \(g(z)=z^\sigma \) is increasing and continuous and \(u_o(a,w)=w +\ln (a+w).\) The function \(u_o\) does not satisfy (ID), since \(\frac{\partial ^2 u_o}{\partial a \partial w} <0.\) For any \(w_2>w_1\) in S\(l>0\) we have

$$\begin{aligned} D_lu_o(x):= u_o(x,w_2)-u_o(x+l,w_1)= w_2-w_1 +\ln \left( \frac{1+w_2 + x}{1+w_1+l+x}\right) . \end{aligned}$$

If \(w_2\ge w_1+l,\) then \(D_lu_o(x)>0\) for all \(x\ge d,\) where d is an arbitrary number larger than 0. If \(w_2<w_1+l,\) then \(D_lu_o(x)\) is increasing and there is a unique point \(x_o\ge d\) such that \(D_lu_o(x_o)=0.\) Thus, \(D_lu_o\) has the strict single crossing property on \([d,+\infty ).\)

3 Markov Perfect Equilibria in Altruistic Growth Economies

Consider an infinite sequence of generations labeled by \(t\in T=\mathbb {N}.\) There is one commodity, which may be consumed or invested. Every generation lives one period and in the paternalistic case derives utility from its own consumption and consumption of its immediate descendant. In the non-paternalistic case generation, \(t\in T\) takes into account a utility for consumption of generation \(t+1.\) In this paper, we are interested in mixed model where both paternalistic and non-paternalistic components are present. Generation \(t\in T\) receives the endowment \(s_t\in S\) and chooses consumption level \(a_t\in A(s_t)=[0,s_t].\) The investment \(i_t:=s_t-a_t\) determines the endowment of its successor according to some transition probability q from S to S,  which depends on \(i_t\in A(s_t).\) Let \(\varPhi \) be the set of Borel measurable functions \(\phi : S\rightarrow S\) such that \(\phi (s)\in A(s)\) for each \(s\in S.\) A strategy or policy for generation \(t\in T\) is a Borel measurable function \(c_t:S\rightarrow S\) such that \(c_t(s_t)\in A(s_t)\) for all \(s_t\in S.\) The set of all strategies for each generation is denoted by \(\varPi .\)

Let \(v:S \rightarrow \mathbb {R}_+\) be a continuous increasing function such that \(v(0)=0.\) Assume that generation \(t\in T\) consumes \(a \in A(s_t)\) in state \(s_t=s\) and the following generation is going to use a strategy \(c_{t+1}=c'\in \varPi .\) Then, the term

$$\begin{aligned} \mathcal{E}_iv(c'):=\int _Sv(c'(s'))q(ds'|i) \end{aligned}$$

is a generation t’s evaluation of consumption policy \(c'\) of generation \(t+1\) under investment \(i=s-a.\) Let \(U(c')(s')\) denote the (Borel measurable in \(s'\)) utility for generation \(t+1\) resulting from its consumption policy \(c'\) in state \(s'\in S.\) This utility can also be evaluated by generation t under investment \(i=s-a\) by computing the expected value with respect to the probability measure \(q(\cdot |i).\) More formally, generation t can consider

$$\begin{aligned} \mathcal{E}_iU(c'):=\int _SU(c')(s')q(ds'|i). \end{aligned}$$

Assume that \(\mathcal{E}_iv(c')\) and \(\mathcal{E}_iU(c')\) are aggregated with the aid of the function \(W:\mathbb {R}_+\times \mathbb {R}_+\rightarrow \mathbb {R}_+,\) i.e., \(w= W(\mathcal{E}_iv(c'),\mathcal{E}_iU(c'))\) is calculated. Then, the aggregated utility for generation t is obtained by aggregating \(a\in A(s)\) and w by the function u discussed in Preliminaries. More precisely, the utility of generation t under investment \(i=s-a\) is defined as

$$\begin{aligned} P (a,c',U(c'))(s): = u(a,W(\mathcal{E}_{s-a}v(c'),\mathcal{E}_{s-a}U(c'))). \end{aligned}$$
(2)

Similarly as in [35] or [29], we can call U an indirect utility for generation \(t\in T.\) However, one should note that indirect utilities in their approaches are functions depending on endowments only.

If \(s=s_t,\)\(a=c_t(s_t),\)\(c'=c_{t+1}\), \(i_t=s_t-c_t(s_t),\) this utility equals

$$\begin{aligned} P (c_t(s_t),c_{t+1},U(c_{t+1}))(s_t) = u(c_t(s_t),W(\mathcal{E}_{i_t}v(c_{t+1}),\mathcal{E}_{i_t}U(c_{t+1}))). \end{aligned}$$
(3)

This clearly shows that the utility of generation t depends on its own consumption in state \(s_t\), the expectation of its own evaluation v of consumption of generation \(t+1\) (paternalistic altruism component) and the utility U of consumption of generation \(t+1\) (non-paternalistic altruism component). In the sequel, we impose additional assumptions on functions v and W and the transition probability q to cover an unbounded case.

Definition 1

A pair \((c^*,U^*)\) with \(c^*\in \varPi \) is a stationary Markov perfect equilibrium (SMPE) with the utility defined in (2), if

$$\begin{aligned} U^*(c^*)(s)= P (c^*(s),c^*,U^*(c^*))(s)= \sup _{a\in A(s)}P (a,c^*,U^*(c^*))(s)\quad \text{ for } \text{ every }\quad s\in S. \end{aligned}$$
(4)

Note that in (4), we deal with a double fixed-point problem. The strategy \(c^*\) is the best response for every generation t, if its immediate successor is going to use \(c^*\), and each generation evaluates its consumption strategy \(c^*\) using the same function \(U^*.\) Following Ray [35], one can say that it is assumed in Definition 1 that “there exist an indirect utility function and a consumption strategy (policy), both depending on current endowment, such that each generation finds it optimal to adopt that consumption strategy, provided its immediate descendant uses the same policy and exhibit the given indirect utility. Moreover, the indirect utility function generated by the generations maximization problem is also the same as that announced by its descendant.”

In the pure paternalistic case, the utility for generation t is of simpler form

$$\begin{aligned} \tilde{P} (c_t(s_t),c_{t+1} )(s_t) = u(c_t(s_t), \mathcal{E}_{i_t}v(c_{t+1})). \end{aligned}$$

The analogous form to (2) is

$$\begin{aligned} \tilde{P} (a,c' )(s): = u(a, \mathcal{E}_{s-a}v(c')), \end{aligned}$$
(5)

where \(a\in A(s)\) is a consumption of generation t, \(i=s-a\) is its investment in state \(s\in S\) and \(c'\) is a consumption strategy of generation \(t+1.\)

The definition of equilibrium is similar to that given in [11, 28, 34] or [3, 4].

Definition 2

A strategy \(c^*\in \varPi \) is a stationary Markov perfect equilibrium (SMPE) with the utility defined in (5), if

$$\begin{aligned} \tilde{P }(c^*(s),c^*)(s)=\sup _{a\in A(s)}\tilde{P} (a,c^*)(s) \quad \text{ for } \text{ every }\quad s\in S. \end{aligned}$$
(6)

Remark 3

By putting \(i=s-a\) in (2) and (5), we can rewrite Eqs. (4) and (6) in a more convenient forms for our proofs. Namely, observe that

$$\begin{aligned} P(c^*(s),c^*,U^*(c^*))(s) = \sup _{i\in A(s)} u(s-i,W(\mathcal{E}_{i}v(c^*),\mathcal{E}_{i}U^*(c'))). \end{aligned}$$

and

$$\begin{aligned} \tilde{P}(c^*(s),c^*)(s) = \sup _{i\in A(s)} u(s-i,\mathcal{E}_{i}v(c^*)). \end{aligned}$$

Remark 4

If \(W(b,r)=r\), then the utility in (3) reduces to a pure non-paternalistic case, which is of the form

$$\begin{aligned} R (c_t(s_t),U(c_{t+1}))(s_t) := u(c_t(s_t),\mathcal{E}_{i_t}U(c_{t+1})). \end{aligned}$$
(7)

This model was studied by Loury [29] with a deterministic transition function and by Ray [35], who considered countably many descendants for each generation. An equilibrium existence in the model of [35] is still an open problem. In the stochastic case with non-atomic transition, the existence of such equilibrium was established by Balbus et al. [6].

Moreover, we would like to point out that in this pure non-paternalistic case with utility (7), the fixed point \(U^*\) is independent of \(c\in F.\) This follows from the fact that the fixed point belongs to the subspace of functions considered here. The consumption strategy \(c^*\in F\) in Definition 1, on the other hand, can be found by dynamic programming technique, see Loury [29], Durán [17, 18]. Our results within this special framework can be considered as an extension of the results of [18] to models with nonlinear discount function.

In the general case, \(c^*\) from Definitions 1 and 2 is a solution to a non-cooperative game problem. This is a symmetric Nash equilibrium in a game played by generations. Further comments on this issue, the reader may find in [25].

4 Basic Assumptions and Main Results

Let \(\Pr (S)\) be the set of all probability measures on the state space S. We recall that a sequence \((\mu _n)\) of probability measures on Sconverges weakly to some \(\mu _0\in \Pr (S)\) (\(\mu _n \Rightarrow \mu _0\) in short) if, for any bounded continuous function \(h:S\rightarrow \mathbb {R},\) it holds that

$$\begin{aligned} \lim _{n\rightarrow \infty }\int _Sh(s)\mu _n(ds)= \int _Sh(s)\mu _0(ds). \end{aligned}$$

We already made three assumptions (U1)-(U3) on the aggregator u. Below, we provide additional conditions on the primitive data that will be imposed in our two main results. To include unbounded from above utilities, we shall apply a weighted norm approach inspired by the papers in dynamic programming (see Wessels [40]; Hernández-Lerma and Lasserre [22]; Jaśkiewicz and Nowak [24]) or recursive utility theory (see Boyd [14]; Durán [17, 18]).

Let \(\omega :S\rightarrow [1,\infty )\) be a continuous non-decreasing function. Further, \(\omega \) will be called a weight function. We now make some basic assumptions on the transition probability.

  1. (Q1)

    Assume that \(\lambda _j:S\rightarrow [0,1],\)\(j\in J:=\{1,\ldots ,N\},\) are continuous functions such that \(\sum _{j=1}^N \lambda _j(i)=1\) for all \(i\in S.\) In addition, suppose that there exist transition probabilities \(q_j\) from S to S\(j\in J\), such that for each \(i\in S\), we have

    $$\begin{aligned} q(\cdot |i)=\sum _{j=1}^N\lambda _j(i)q_j(\cdot |i). \end{aligned}$$
    (8)

    Moreover, for every \(j\in J,\)\(q_j(\{0\}|0)=1\), and the transition probability \(q_j(\cdot |i)\) has the Feller property, i.e., if \(i_n\rightarrow i_0\) in S as \(n\rightarrow \infty ,\) then \(q(\cdot |i_n) \Rightarrow q(\cdot |i_0).\)

  2. (Q2)

    Every transition probability \(q_j(\cdot |y)\) in (8) satisfies the stochastic dominance condition, i.e., if \(z\rightarrow Q_j(z|i):=q_j([0,z]|i)\) is the cumulative distribution function for \(q_j(\cdot |i),\) then for any \(i_1<i_2\) and \(z\in S\), we have that \(Q_j(z|i_1)\ge Q_j(z|i_2).\)

  3. (Q3)

    For every \(z\in S,\) the set \(S^z :=\{i\in S:\; q(\{z\}|i)>0\}\) is countable.

  4. (Q4)

    The function \(i\rightarrow \int _S \omega (s')q(ds'|i)\) is continuous on S.

  5. (Q5)

    We have

    $$\begin{aligned}\kappa _0 :=\sup _{s\in S}\sup _{a\in A(s)} \frac{\int _S\omega (s')q(ds'|s-a)}{\omega (s)} <\infty .\end{aligned}$$

As in preliminaries, we make the following assumption on the discount function \(\delta .\)

  1. (D)

    \(\delta :S\rightarrow S\) is continuous and non-decreasing, and \(\delta (z)<z\) for all \(z\in S_+.\) (Hence, it follows that \(\delta (0)=0.\)) Moreover, the function \(z\rightarrow \delta (z)/z\) defined on \(S_+\) is non-increasing.

We can now continue our assumptions on the utility function.

  1. (U4)

    The function \(v:S\rightarrow \mathbb {R}_+\) is increasing and continuous, and \(v(0)=0.\)

  2. (U5)

    If v is unbounded, then the function \(i\rightarrow \int _Sv(s')q(ds'|i)\) is continuous on S.

  3. (U6)

    The function \(W:\mathbb {R}_+\times \mathbb {R}_+\rightarrow \mathbb {R}_+\) is continuous and increasing in each variable, and \(W(0,0)=0.\)

  4. (U7)

    There exists a constant \(\kappa _1>0\) such that for each \(s\in S,\) we have

    $$\begin{aligned} u(s,W(\Vert v\Vert _\omega \kappa _0 \omega (s),0)\le \kappa _1\omega (s). \end{aligned}$$

    Here, \( \Vert v\Vert _\omega \) is defined as \(\sup _{x\in S} v(x)/\omega (x)\) and is assumed to be finite.

  5. (U8)

    For any \(r_1,r_2\in \mathbb {R}_+\) and every \(s\in S,\)\(a\in A(s),\)\(b\ge 0,\), we have

    $$\begin{aligned} |u(a,W(b,r_1))-u(a,W(b,r_2))|\le \delta (|r_1-r_2|). \end{aligned}$$

For any \(f:F\times S\rightarrow \mathbb {R}\) we define

$$\begin{aligned} \Vert f\Vert _\omega :=\sup _{s\in S}\sup _{c\in F}\frac{|f(c,s)|}{\omega (s)}. \end{aligned}$$

Let \(C(F\times S)\) be the Banach space of all continuous functions \(f:F\times S\rightarrow \mathbb {R}\) such that \(\Vert f\Vert _\omega <\infty .\) Further, in some cases we shall write f(c)(s) instead of f(cs) for any \(f\in C(F\times S)\).

We can now state our main results.

Theorem 1

Let assumptions (Q1)–(Q5), (U1)–(U4), (U6)–(U8) and (D) be satisfied. Assume also that if \(\kappa _0>1,\) then \(\delta (\kappa _0r) <r\) for all \(r>0.\) Then, there exists an SMPE\((c^*,U^*)\) in the sense of Definition 1 and \(c^*\in F,\)\(U^*\in C(F\times S).\)

Remark 5

It should be noted that \(U^*\) in Theorem 1 is unique. However, in a general case the uniqueness of \(c^*\) is not guaranteed. This issue for similar models is also considered in [8, 30].

In the paternalistic case, we can drop the assumptions involving the discount function \(\delta \) and weight function \(\omega .\)

Theorem 2

Under assumptions (Q1)–(Q3) and (U1)–(U5), there exists an SMPE\(c^*\) in the sense of Definition 2 and \(c^*\in F.\)

Remark 6

(a) Assumptions (Q1)–(Q5) were used to study stochastic games of resource extraction by Jaśkiewicz and Nowak [26]. (Q3) was first used in [4], and it holds for the purely deterministic transitions and as well their convex combinations. Moreover, it may occur that all sets \(S^z\) in (Q3) are empty. This means that q is non-atomic. Other possible transitions are presented in Sect. 5. Here, we only wish to point out that (Q3) is not satisfied, for instance, for q of the form

$$\begin{aligned} q(\cdot |i)=(1-i)\tilde{\delta }_{\{0\}}(\cdot )+iq_u(\cdot |i), \end{aligned}$$

where \(\tilde{\delta }_{\{0\}}(\cdot )\) is the Dirac measure at point 0 and \(q_u(\cdot |i)\) is the uniform distribution on \(S:=[0,1].\) Then, \(q(\{0\}|i)=1-i\) and \(q(\{0\}|i)>0\) for all \(i\in [0,1).\) Hence, the cardinality of \(S^0\) is continuum.

(b) Although the transition probability q has the Feller property, it need not satisfy the stochastic dominance condition, even if the expected new state \(\int _S s'q(ds'|i)\) is increasing with respect to the investment i (consult with Examples 4–6 in [26]).

(c) Assumption (D) was used in dynamic programming models studied in [27].

Remark 7

The classes of strategies F and I were used to study bequest equilibria by Bernheim and Ray [11] and Leininger [28]. Their proofs are rather complicated. A simpler method was proposed in [3, 4]. Moreover, this class of semicontinuous strategies is also useful in dynamic game models of resource extraction, see Sundaram [37], Dutta and Sundaram [19], Jaśkiewicz and Nowak [26].

Remark 8

Theorem 1 is new, even in the pure non-paternalistic case (see Remark 4). Theorem 2, on the other hand, considerably extends the work of [4] (in the risk-neutral case), where the state space S is a compact interval and thus, the utility function u is bounded. More importantly, Balbus et al. [4] assume that \(\tilde{P}(a,c)(s)= \hat{u}(a) +\int _Sv(c(s'))q(ds'|s-a),\) i.e., the utility is separable. Hence, these results generalize ones of [4] in the two aforementioned directions. We wish to emphasize that certain techniques and ideas used in [3, 4] are useful in our setup, but after a suitable adaptation.

5 Examples

In this section, we give four examples. The first example is inspired by a two-sided altruism model of [23]. The remaining three examples provide some specific functions in the altruistic growth economies for which the imposed conditions are satisfied. Examples 3 and 4 refer to the assumptions used in Theorem 1, whereas Example 5 illustrates conditions used in Theorem 2.

Example 2

Assume that each generation \(t\in T\) consists of two populations: old and young members. The young population in generation t becomes old in generation \(t+1.\) The utility functions for consumptions are: \(u_1\) for the young population and \(u_2\) for the old population. Supposing that the members of generation t cooperate, we can maximize \(u_1(a_t^{1}) + u_2(a_t^{2})\) subject to \(a_t^{1}+ a_t^{2}=a_t,\) where \(a_t\in A(s_t)\) is the total consumption of generation t in state \(s_t\in S.\) Assuming that the functions \(u_1\) and \(u_2\) are increasing and strictly concave, one can show that there exist continuous increasing functions \(\psi _1\) and \(\psi _2\) on S such that

$$\begin{aligned} \hat{u}(a_t)= \max _{a_t^{1}+a_t^{2}=a_t}(u_1(a_t^{1})+u_2(a_t^{t})) = u_1(\psi _1(a_t))+u_2(\psi _2(a_t)). \end{aligned}$$

By Lemma 1 in [23], \(\hat{u}\) is strictly concave and increasing.

Let \(c_{t+1}\) be a consumption strategy of generation \(t+1\) and \(U(c_{t+1})(s_{t+1})\) be a chosen (announced) utility function by generation \(t+1\) depending on both \(c_{t+1}\) and \(s_{t+1}.\) Then, the aggregated utility for generation t is

$$\begin{aligned}&\hat{u}(a_t) + (1-\beta )\int _S u_2(\psi _2(c_{t+1}(s_{t+1}))) q(ds_{t+1}|s_t -a_t)\\&\quad +\,\, \beta \int _SU(c_{t+1})(s_{t+1}) q(ds_{t+1}|s_t -a_t), \end{aligned}$$

where \(\beta \in (0,1).\) Thus, the utility of generation t is the sum of the utility resulting from cooperation of populations in period t and the weighted sum of the utility from consumption of old in period \(t+1\) (who were young in period t) and the expected utility U announced by generation \(t+1\) calculated for the strategy \(c_{t+1}\) and endowment \(s_{t+1}.\)

Example 3

Let the transition probability be of the following form

$$\begin{aligned} q(\cdot |i)=\lambda (i)\tilde{\delta }_{\{i\}}(\cdot )+(1-\lambda (i))q_u(\cdot |i), \end{aligned}$$

where \(\lambda :S\rightarrow (0,1)\) is a continuous function such that \(\lambda (i)\rightarrow 0\) as \(i\rightarrow 0,\) and \(\tilde{\delta }_{\{i\}}(\cdot )\) is the Dirac measure concentrated at point i and \(q_u(\cdot |i)\) is the uniform distribution on [0, 2i] with \(i\in S.\) Obviously, \(q_u(\{0\}|0)=1.\) Note that (Q1)–(Q3) immediately hold. Moreover, the set \(S^z\) in (Q3) has at most 1 element. Now we prove (Q4) and (Q5) for \(\omega (s)=1+s.\) Observe that the function

$$\begin{aligned} i\rightarrow \int _S \omega (s')q(ds'|i)=1+\lambda (i)i+(1-\lambda (i))i =1+i \end{aligned}$$

is continuous on S and additionally,

$$\begin{aligned} \sup _{s\in S}\sup _{i\in A(s)}\frac{\int _S \omega (s')q(ds'|i)}{\omega (s)}= \sup _{s\in S} \sup _{i\in A(s)}\frac{1+i}{s+1}= 1. \end{aligned}$$

Hence, we have \(\kappa _0=1.\) Furthermore, we define the function \(\delta \) satisfying (D) as

$$\begin{aligned} \delta (z)=\alpha z +(1-\alpha )\ln (1+z) \quad \text{ with } \alpha \in (0,1),\ z\in S, \end{aligned}$$

and the function in (U4) as \(v(s)=2\sqrt{s},\)\(s\in S.\) Clearly, \(\Vert v\Vert _\omega =1.\) Finally, we put \(W(b,r)=\delta (b+r),\)\((b,r)\in S\times S\) and \( u(a,w)=\ln (1+\sqrt{a}+w),\)\( (a,w)\in S\times S\) (see Remark 1). Then, we have

$$\begin{aligned} u(a,W(b,r))=\ln \left( 1+\sqrt{a}+\alpha (b+r)+(1-\alpha )\ln (1+b+r)\right) . \end{aligned}$$

Condition (U8) follows from the mean value theorem, because

$$\begin{aligned} |u(a,W(b,r_1))-u(a,W(b,r_2))|= & {} |\ln (1+\sqrt{a}+\delta (b+r_1))-\ln (1+\sqrt{a}+\delta (b+r_2))|\\ {}\le & {} |\delta (b+r_1)-\delta (b+r_2)|\le \delta (|r_1-r_2|). \end{aligned}$$

The last inequality is due to this fact

$$\begin{aligned} |\delta (b+r_1)-\delta (b+r_2)|\le & {} \alpha |r_1-r_2|+(1-\alpha )\left| \ln \left( \frac{1+b+r_1+r_2-r_2}{1+b+r_2}\right) \right| \\\le & {} \nonumber \alpha |r_1-r_2|+(1-\alpha )\ln \left( 1+\frac{|r_1-r_2|}{1+b+r_2}\right) \\ {}\le & {} \alpha |r_1-r_2|+(1-\alpha )\ln \left( 1+|r_1-r_2|\right) = \delta (|r_1-r_2|). \nonumber \end{aligned}$$
(9)

It remains to check condition (U7) and find the constant \(\kappa _1.\) For every \(s\in S\) we obtain

$$\begin{aligned} u(s,W(\Vert v\Vert _\omega \kappa _0\omega (s),0)= & {} \ln (1+\sqrt{s}+\delta (\omega (s)))\\= & {} \ln \left( 1+\sqrt{s}+\alpha \omega (s)+ (1-\alpha )\ln (1+\omega (s))\right) \\\le & {} \ln \left( 1+\sqrt{s}+ \omega (s)\right) \le \sqrt{s}+1+s\le \frac{3}{2}(s+1). \end{aligned}$$

Thus, it suffices to take \(\kappa _1=3/2.\)

Example 4

Assume that the transition function is deterministic, i.e., \(q(\cdot |i)=\tilde{\delta }_{\{i+\root 4 \of {3i}\}}(\cdot ).\) Let the function W be as in Example 3, \(\omega (s)=s+l\) for \(s\in S\) and some \(l\ge 1\) (which will be chosen below). Furthermore, suppose that

$$\begin{aligned} \delta (z)=(1-2\alpha ) z +\alpha \ln (1+z) \quad \text{ with } \alpha \in (0,1/2),\ z\in S. \end{aligned}$$

Note that \(\delta \) obeys all requirements in (D). Conditions (Q1)–(Q4) hold and (Q5) is satisfied with

$$\begin{aligned} \kappa _0=\sup _{s\in S}\frac{l+s+\root 4 \of {3s}}{l+s} = 1+\frac{3}{4\root 4 \of {l^3}}. \end{aligned}$$

This last equality follows from the fact that the function \(s \rightarrow 1+ \frac{\root 4 \of {3s}}{l+s}\) attains its maximum at \(s=l/3.\) We now observe that

$$\begin{aligned} \delta (\kappa _0z)=(1-2\alpha )\kappa _0 z +\alpha \ln (1+\kappa _0z)\le (1-\alpha )\kappa _0 z<z \end{aligned}$$

holds if \((1-\alpha )\kappa _0 <1.\) Hence, for every \(\alpha \in (0,1/2)\), we may choose \(l\ge 1\) such that \((1-\alpha )\kappa _0<1\) and consequently \(\delta (\kappa _0z)< z\) for all \(z>0.\) Let \(u(a,w)=2\sqrt{1+\sqrt{a}+w}-2, \)\((a,w)\in S\times S\) (consult with Remark 1). We have

$$\begin{aligned} u(a,W(b,r))=2\sqrt{1+\sqrt{a}+(1-2\alpha )(b+r) +\alpha \ln (1+b+r)}-2. \end{aligned}$$

Assumption (U8) holds, since

$$\begin{aligned} |u(a,W(b,r_1))-u(a,W(b,r_2))|= & {} 2\left| \sqrt{1+\sqrt{a}+\delta (b+r_1)}-\sqrt{1+\sqrt{a}+\delta (b+r_2)}\right| \\\le & {} \delta (|r_1-r_2|), \end{aligned}$$

where the last inequality can be shown in the same manner as in (9). Let us put \(v(s)=\ln (1+s),\)\(s\in S,\) and observe that

$$\begin{aligned} \Vert v\Vert _\omega =\sup _{s\in S} \frac{\ln (1+s)}{s+l}\le \sup _{s\in S}\frac{\ln (1+s)}{1+s}= \frac{ 1}{e}. \end{aligned}$$

Finally, we have to prove (U7). Choosing \(l>1\) such that \((1-\alpha )\kappa _0<1\), we can assure that \(\kappa _0<e/2.\) Then,

$$\begin{aligned} u(s,W(\Vert v\Vert _\omega \kappa _0\omega (s),0)= & {} \le u(s,W(0.5\, \omega (s),0) \\= & {} 2\left( \sqrt{1+\sqrt{s}+ (1-2\alpha ) 0.5\,\omega (s)+\alpha \ln (1+0.5\,\omega (s))} -1\right) \\\le & {} 2\left( \sqrt{1+\sqrt{s}+ (1-2\alpha )0.5\, \omega (s)+\alpha 0.5\, \omega (s)} -1\right) \\\le & {} 2\left( \root 4 \of {s}+\sqrt{0.5\,(1-\alpha ) \omega (s)} \right) \le 2\left( \root 4 \of {s}+\sqrt{0.5\, \omega (s)}\right) <3\omega (s). \end{aligned}$$

Thus, it suffices to take \(\kappa _1=3.\)

Example 5

In this example, we assume that the next state evolves according to the following recursive equation \(s_{t+1}=i_t(1+\epsilon )\zeta _t,\) where \(\epsilon >0\) is a fixed interest rate and \((\zeta _t)\) is a sequence of i.i.d. random shocks with values in \([0,\infty )\) and with a distribution \(\pi \) such that

$$\begin{aligned} \int _S\sqrt{\zeta }\,\pi (d\zeta )<\infty . \end{aligned}$$

Thus,

$$\begin{aligned} q(B|i)=\int _S 1_B(i(1+\epsilon )\zeta )\pi (d\zeta )\quad \text{ for } \text{ every } \text{ Borel } \text{ set } B\subset S,\ i\in S. \end{aligned}$$

Note that (Q1)–(Q3) are satisfied, if 0 is not an atom of \(\pi \). In the paternalistic model, we may define \(u(a,w)= (1/4 \sqrt{a} +3/4 \sqrt{w})^2\) for \((a,w)\in S\times S\) and \(v(s)=\sqrt{s}\) for \(s\in S.\) Note that the function

$$\begin{aligned}i\rightarrow \int _S v(s')q(ds'|i)=\int _S v(i(1+\epsilon ) \zeta )\pi (d\zeta )= \sqrt{i(1+\epsilon )}\int _S\sqrt{\zeta }\,\pi (d\zeta )\end{aligned}$$

is continuous. Hence, all assumptions in Theorem 2 are satisfied.

6 Basic Monotonicity Result

In this section, we provide a useful result that may have applications to various models in optimization. Let \(\xi :S\rightarrow S\) be an upper semicontinuous function. For any \(s\in S,\) define

$$\begin{aligned} A_o^\xi (s) := \mathrm{arg} \max _{i\in A(s)}u(s-i,\xi (i))\ \ \text{ and }\ \ a^\xi _o(s):= \min A_o^\xi (s). \end{aligned}$$
(10)

Under our assumptions (U1)–(U2), the function \(i\rightarrow u(s-i,\xi (i))\) is upper semicontinuous on A(s). Therefore, the set \(A_o^\xi (s)\) is non-empty and compact. Thus, \(a_o^\xi (s)\) is well defined. The following result is related to Proposition 1 in [3].

Proposition 2

If (U1)–(U3) are satisfied and \(\xi \) is an upper semicontinuous function, then

  1. (a)

    any selection of the set-valued mapping \(s\rightarrow A_o^\xi (s),\) i.e., a function \(\varphi :S\rightarrow S\) such that \(\varphi (s)\in A_o^\xi (s)\) for all \(s\in S\) is non-decreasing.

  2. (b)

    If, in addition, \(\xi \) is continuous from the left, then the function \(\phi = a_o^\xi \) is continuous from the left at any point \(s\in S.\)

Proof

Observe that since g is continuous and increasing, we conclude from (10) that

$$\begin{aligned} A_o^\xi (s) = \hat{A}_o^\xi (s):= \mathrm{arg} \max _{i\in A(s)}u_o(s-i,\xi (i)). \end{aligned}$$

Therefore, \(a^\xi _o(s):= \min A_o^\xi (s) = \min \hat{A}_o^\xi (s)\) and in the remaining part of the proof, we can work with \(u_o.\)

(a) First note that \(\varphi (0)=0 \le \varphi (s)\) for each \(s\in S.\) Assume that there exist \(s_1, s_2 \in S\) such that \(0<s_1<s_2\) and \(\varphi (s_1)>\varphi (s_2).\) Let \(S_1=[s_1,\infty )\) and \(A_1=[0,\varphi (s_1)].\) Note that \(\varphi (s_1)>0.\) Choose any \(i_1, i_2 \in A_1\) such that \(i_1>i_2\) and define

$$\begin{aligned} \eta (s) = u_o(s-i_1,\xi (i_1))- u_o(s-i_2,\xi (i_2)),\ s\in S_1. \end{aligned}$$

If \(\xi (i_1)\le \xi (i_2),\) then under our monotonicity assumptions on \(u_o\), we have \(\eta (s) <0\) for all \(s\in S_1\), and \(\eta \) has the strict single crossing property. If \(\xi (i_1)>\xi (i_2),\) then we write

$$\begin{aligned} \eta (s) = u_o(s-i_1,\xi (i_1))- u_o(s-i_1+(i_1-i_2),\xi (i_2)),\ s\in S_1. \end{aligned}$$

By (U3), this function has a strict single crossing property. Let \(\varphi '\) be any function defined on \(S_1\) such that \(\varphi '(s)\in \mathrm{arg} \max _{i\in A_1} u_o(s-i,\xi (i))\) for all \(s\in S_1\) and \(\varphi '(s_1)=\varphi (s_1)\) and \(\varphi '(s_2)=\varphi (s_2).\) By Theorem 4’ in [32], \(\varphi '\) is non-decreasing. Hence, it follows that \(\varphi (s_1) \le \varphi (s_2).\) We have come to a contradiction, which finishes the proof of part (a).

(b) Let \(s_0 >0.\) Assume that \(s_n\uparrow s_0\) as \(n\rightarrow \infty .\) Choose any \(i\in [0,s_0).\) Then, we get

$$\begin{aligned}u_o(s_n-\phi (s_n),\xi (\phi (s_n)))= \max _{i'\in A(s_n)}u_o(s_n-i',\xi (i'))\ge u_o(s_n-i,\xi (i))\end{aligned}$$

for all but finitely many \(n\in \mathbb {N}.\) Since \(\phi \) is non-decreasing, the limit \(i_0:=\lim _{n\rightarrow \infty }\phi (s_n)\) exists and \(i_0\in A(s_0).\) By assumption, the function \(\xi \) is continuous from the left. Thus, we have

$$\begin{aligned} u_o(s_0-i_0,\xi (i_0))\ge u_o(s_0-i,\xi (i)), \ \ i\in [0,s_0). \end{aligned}$$
(11)

Since \(\xi \) is continuous from the left, (11) also holds for \(i=s_0\). Hence, \(i_0\in A_o^\xi (s_0).\) Clearly, \(i_0 \ge a_o^\xi (s_0).\) If \(i_0 > a_o^\xi (s_0)=\phi (s_0),\) then there exists some \(n\in \mathbb {N}\) such that \(s_n <s_0\) and \(\phi (s_n) > \phi (s_0).\) This inequality contradicts the fact that \(\phi \) is non-decreasing. \(\square \)

The following auxiliary result is a simple modification of Lemma 3 in [4].

Lemma 2

Let \(\varphi \in I\) be any function such that \(\varphi (s)\in A_o^\xi (s)\) for all \(s\in \mathcal{D}\) and \(\mathcal D\) be a dense subset of S. If \(s \in \mathcal{D}\) is a continuity point of \(\varphi ,\) then the set \(A_o^\xi (s)\) is a singleton.

7 Proofs of Theorems 1 and 2

In this section, we assume that assumptions used in Theorems 1 and 2 are satisfied, although they are not explicitly recalled.

Lemma 3

Let \(f\in C(F\times S).\) Then, the function \((c,i)\rightarrow \int _Sf(c,s')q(ds'|i)\) is continuous on \(F\times S\).

Proof

First note that both functions f and \(f^\omega \) defined by \(f^\omega (c,s): = \omega (s)\Vert f\Vert _\omega - f(c,s)\) are continuous and nonnegative. A simple adaptation of the proof of Proposition 7.31 in [12] yields that the functions \((c,i)\rightarrow \int _S f(c,s')q(ds'|i)\) and \((c,i)\rightarrow \int _Sf^\omega (c,s')q(ds'|i)\) are lower semicontinuous. Since \(i\rightarrow \int _S\omega (s')q(ds'|i)\) is continuous, it follows that \((c,i)\rightarrow \int _Sf(c,s')q(ds'|i)\) is simultaneously lower and upper semicontinuous. \(\square \)

Lemma 4

For any \(c\in F\), the function \(i\rightarrow \int _Sv(c(s'))q(ds'|i)\) is upper semicontinuous.

Proof

Since v is continuous and increasing and \(c\in F,\) the function \(v(c(\cdot ))\) is upper semicontinuous. Hence, \(\omega (s)\Vert v\Vert _\omega - v(c(s))\) is nonnegative and lower semicontinuous. By Proposition 7.31 in [12], the function \(i\rightarrow \int _S(\omega (s')\Vert v\Vert _\omega - v(c(s')))q(ds'|i)\) is lower semicontinuous. Since \(\rightarrow \int _S\omega (s')q(ds'|i)\) is continuous, the assertion follows. \(\square \)

Lemma 5

For any \(c\in F\), the function \(i\rightarrow \int _Sv(c(s'))q(ds'|i)\) is continuous from the left.

Proof

Consider any \(q_j\) from the representation (Q1) of q. Then, recall that \(q_j\) has the Feller property and the stochastic dominance property. Using the Skorohod representation theorem for weak convergence of probability measures (see Billingsley [13]), continuity from the left of \(v(c(\cdot ))\) and the dominated convergence theorem as in the proof of Lemma 6 in [4], one can conclude that the function \(i\rightarrow \int _Sv(c(s'))q_j(ds'|i)\) is continuous from the left. Since all functions \(\lambda _j\) in the representation of q in (Q1) are continuous, it follows the assertion of our lemma. \(\square \)

Let \((h_n)\) be a sequence of Borel measurable real-valued functions on S. For each \(s\in S,\) define

$$\begin{aligned} \liminf _{n\rightarrow \infty , \ s'\rightarrow s}h_n(s') := \inf \{\liminf _{n\rightarrow \infty } h_n(s_n):\;s_n\rightarrow s\} \end{aligned}$$

and

$$\begin{aligned} \limsup _{n\rightarrow \infty , \ s'\rightarrow s}h_n(s') := \sup \{\limsup _{n\rightarrow \infty } h_n(s_n):\;s_n\rightarrow s\}. \end{aligned}$$

Lemma 6

Assume that \(\mu _m\Rightarrow \mu _0\) and \(\int _S\omega (s)\mu _n(ds) \rightarrow \int _S\omega (s)\mu _0(ds)\) as \(n\rightarrow \infty .\) Suppose that there exists \(k>0\) such that \(0\le g_n(s)\le k\omega (s) \) for all \(s\in S\) and \(n\in \mathbb {N}.\) Then, we have

$$\begin{aligned} \int _S \limsup _{n\rightarrow \infty , \ s'\rightarrow s}g_n(s')\mu _0(ds) \ge \limsup _{n\rightarrow \infty } \int _S g_n(s) \mu _n(ds). \end{aligned}$$
(12)

Proof

Note that \(h_n(s'):= k\omega (s')-g_n(s')\ge 0\) for all \(s'\in S\) and \(n\in \mathbb {N}.\) By Lemma 3.2 in [36], we have

$$\begin{aligned} \int _S \liminf _{n\rightarrow \infty , \ s'\rightarrow s}h_n(s')\mu _0(ds) \le \liminf _{n\rightarrow \infty } \int _Sh_n(s) \mu _n(ds). \end{aligned}$$
(13)

It is clear that

$$\begin{aligned} \liminf _{n\rightarrow \infty , \ s'\rightarrow s}(k\omega (s') -g_n(s'))=k\omega (s')- \limsup _{n\rightarrow \infty , \ s'\rightarrow s}g_n(s'). \end{aligned}$$

Moreover, we have

$$\begin{aligned} \liminf _{n\rightarrow \infty } \int _S h_n(s) \mu _n(ds)= \int _Sk\omega (s')\mu _0(ds')- \limsup _{n\rightarrow \infty }\int _Sg_n(s) \mu _n(ds). \end{aligned}$$

Now observe that (12) follows easily from (13). \(\square \)

Lemma 7

Assume that \(c_n{\mathop {\rightarrow }\limits ^{*}} c_0\) in F. Then, for every \(s'\in S,\) we have

$$\begin{aligned} \limsup _{n\rightarrow \infty ,\ z\rightarrow s'}v(c_n(z))\le v(c(s')). \end{aligned}$$
(14)

Proof

Let \((z_n)\) be arbitrary sequence in S converging to \(s'.\) By Lemma 4 in [4], we obtain

$$\begin{aligned} \limsup _{n\rightarrow \infty }v(c_n(z_n)) \le v(c(s')). \end{aligned}$$
(15)

Thus, (14) follows by taking supremum over all sequences \((z_n)\) converging to \(s'\) on the left-hand side of (15). \(\square \)

Let V be the closed subset of all nonnegative functions f in the Banach space \(C(F\times S) \) such that \(f(c,0)=0.\) Let \(f\in V\) be fixed. For any \(c\in F,\)\(s\in S,\) define

$$\begin{aligned} A_o(c)(s):= \mathrm{arg} \max _{i\in A(s)} u(s-i,W(\mathcal{E}_iv(c),\mathcal{E}_if(c)))\quad \mathrm{and}\quad a_o(c)(s)= \min A_o(c)(s).\nonumber \\ \end{aligned}$$
(16)

By Lemmas 3 and 4, the set \(A_o(c)(s)\) is non-empty and compact and therefore, \(a_o(c)(s)\) is well defined. Put for any \(c\in F\)

$$\begin{aligned} (Tf)(c,s):= \max _{i\in A(s)} u(s-i,W(\mathcal{E}_{i}v(c),\mathcal{E}_{i}f(c))), \ \ s\in S. \end{aligned}$$
(17)

Lemma 8

T maps V into itself.

Proof

Step 1. First note that from \(q(\{0\}|0)=1\), \(u(0,0)=0\) and \(W(0,0)=0,\) it follows that \((Tf)(c,0)=0.\) Assume that \(c_n{\mathop {\rightarrow }\limits ^{*}} c_0\) in F and \(s_n \rightarrow s_0\) in S as \(n\rightarrow \infty .\) For each \(n\in \mathbb {N}\), choose any \(i_n\in A_o(c_n)(s_n).\) Without loss of generality, it can be assumed that \(i_n\rightarrow i_0\in A(s_0)\) as \(n\rightarrow \infty .\) By Lemma 3, \(\mathcal{E}_{i_n}f(c_n) \rightarrow \mathcal{E}_{i_0}f(c_0)\) as \(n\rightarrow \infty .\) From Lemmas 6 and 7, it follows that

$$\begin{aligned} \limsup _{n\rightarrow \infty }\mathcal{E}_{i_n}v(c_n)\le \int _S \limsup _{n\rightarrow \infty ,\ s'\rightarrow s}v(c_n(s'))q(ds|i_0) \le \int _S v(c_0(s))q(ds|i_0)= \mathcal{E}_{i_0}v(c_0). \end{aligned}$$

Therefore, by our monotonicity and continuity assumptions on u and W, we infer that

$$\begin{aligned} \limsup _{n\rightarrow \infty }(Tf)(c_n,s_n) \le u(s_0-i_0,W(\mathcal{E}_{i_0}v(c_0),\mathcal{E}_{i_0}f(c_0))) \le (Tf)(c_0,s_0). \end{aligned}$$
(18)

Observe now that if \(s_0=0,\) then

$$\begin{aligned} \liminf _{n\rightarrow \infty }(Tf)(c_n,s_n) \ge (Tf)(c_0,s_0)=(Tf)(c_0,0)=0. \end{aligned}$$

Let \(\mathcal{C}(c_0)\) be the set of all continuity points of the function \(c_0.\) Clearly, \(0\in \mathcal{C}(c_0)\) and \(S{\setminus } \mathcal{C}(c_0)\) is countable. Assume that \(s_0>0\) and define \(D:=\{i\in [0,s_0): q(\mathcal{C}(c_0)|i)=1\}.\) By (Q3), D is a dense subset of \([0,s_0).\) Choose any \(i\in D.\) Then, \(i\le s_n\) for all but finitely many \(n\in \mathbb {N}.\) Under our conditions on v and \(\omega \), the sequence \((v(c_n(\cdot ))\) satisfies the assumptions of the dominated convergence theorem and it converges to \(v(c_0(\cdot ))\)\(q(\cdot |i)-a.e.\) Therefore, we have that \(\lim _{n\rightarrow \infty } \mathcal{E}_iv(c_n) = \mathcal{E}_iv(c_0).\) By Lemma 3, we also have \(\lim _{n\rightarrow \infty } \mathcal{E}_if(c_n) = \mathcal{E}_if(c_0).\) Therefore,

$$\begin{aligned} \liminf _{n\rightarrow \infty } (Tf)(c_n,s_n)\ge & {} \lim _{n\rightarrow \infty }u(s_n-i,W( \mathcal{E}_iv(c_n),\mathcal{E}_if(c_n))) \nonumber \\= & {} u(s_0-i,W(\mathcal{E}_iv(c_0),\mathcal{E}_if(c_0))). \end{aligned}$$
(19)

Assume that \(\tilde{i}\in [0,s_0){\setminus } D\) or \(\tilde{i}=s_0.\) Then, \(\tilde{i}>0\) and we can take \(i\in D\) such that \(i<\tilde{i}\) and \(i\uparrow \tilde{i}.\) Since \(c_0\) is continuous from the left, by Lemma 5, the function \(i\rightarrow \mathcal{E}_iv(c_0)\) is also continuous from the left. Thus, we infer that

$$\begin{aligned} \lim _{i\uparrow \tilde{i}}u(s_0-i,W(\mathcal{E}_iv(c_0),\mathcal{E}_if(c_0)))=u(s_0-\tilde{i},W(\mathcal{E}_{\tilde{i}}v(c_0),\mathcal{E}_{\tilde{i}}f(c_0))). \end{aligned}$$

This equality and (19) imply that

$$\begin{aligned} \liminf _{n\rightarrow \infty }(Tf)(c_n,s_n)\ge u(s_0-i,W(\mathcal{E}_iv(c_0),\mathcal{E}_if(c_0))) \end{aligned}$$

for all \(i\in A(s_0).\) Hence,

$$\begin{aligned} \liminf _{n\rightarrow \infty }(Tf)(c_n,s_n)\ge (Tf)(c_0,s_0). \end{aligned}$$
(20)

From (18) and (20), it follows that Tf is continuous at any point \((c_0,s_0)\in F\times S.\)

Step 2. By (Q5), we have \(\int _S\omega (s')q(ds'|i)\le \kappa _0\omega (s)\) for all \(s\in S,\)\(i=s-a\in A(s).\) Note that \(i\le s,\)\(v(c(s'))\le v(s')\) for all \(s'\in S\) and

$$\begin{aligned} \mathcal{E}_{i}v(c) =\int _S v(c(s'))q(ds'|i)\le \int _S\omega (s') \Vert v\Vert _\omega q(ds'|i)\le \Vert v\Vert _\omega \kappa _0 \omega (s). \end{aligned}$$
(21)

We also have

$$\begin{aligned} \mathcal{E}_{i}f(c)=\int _S f(c,s')q(ds'|i) \le \int _S \omega (s')\Vert f\Vert _\omega q(ds'|i) \le \Vert f\Vert _\omega \kappa _0\omega (s). \end{aligned}$$
(22)

From (U8) and (D), it follows that

$$\begin{aligned} u(s-i,W(\mathcal{E}_{i}v(c),\mathcal{E}_{i}f(c)))\le & {} u(s-i,W(\mathcal{E}_{i}v(c),0)) +\delta (\mathcal{E}_{i}f(c)) \nonumber \\\le & {} u(s-i,W(\mathcal{E}_{i}v(c),0)) +\mathcal{E}_{i}f(c). \end{aligned}$$
(23)

From (21)–(23), we conclude that

$$\begin{aligned} u(s-i,W(\mathcal{E}_{i}v(c),\mathcal{E}_{i}f(c)))\le u(s,W(\Vert v\Vert _\omega \kappa _0\omega (s) ,0)) + \Vert f\Vert _\omega \kappa _0\omega (s). \end{aligned}$$
(24)

By (24) and (U7), we obtain that

$$\begin{aligned} u(s-i,W(\mathcal{E}_{i}v(c),\mathcal{E}_{i}f(c))\le K\omega (s) \end{aligned}$$

for all \(s\in S,\) and for \(K=\kappa _1 + \Vert f\Vert _\omega \kappa _0.\) Thus, we have \(0\le (Tf)(c,s) \le K\omega (s)\) for all \(s\in S\) and consequently \(\Vert Tf\Vert _\omega \le K<\infty .\)\(\square \)

Lemma 9

For every \(f_1,\, f_2 \in V,\) we have

$$\begin{aligned} \Vert Tf_1-Tf_2\Vert _\omega \le \delta (\kappa _0\Vert f_1-f_2\Vert _\omega ). \end{aligned}$$
(25)

Proof

By (U8), for any \(f_1,\)\(f_2 \in V,\) we have

$$\begin{aligned} |(Tf_1)(c,s)-(Tf_2)(c,s)|\le & {} \delta \left( \sup _{i\in A(s)}\int _S|f_1(c,s')-f_2(c,s')|q(ds'|i)|\right) \\ \nonumber\le & {} \delta \left( \sup _{i\in A(s)}\int _S \omega (s')\Vert f_1-f_2\Vert _\omega q(ds'|i)\right) . \end{aligned}$$

This fact, (1) and (Q5) imply that

$$\begin{aligned} |(Tf_1)(c,s)-(Tf_2)(c,s)| \le \delta (\kappa _0\Vert f_1-f_2\Vert _\omega ) \omega (s),\ \ s\in S,\ c\in F. \end{aligned}$$

Hence, (25) follows. \(\square \)

Proof of Theorem 1

If \(\kappa _0 \le 1,\) then, by Lemma 9, T is a \(\delta \)-contraction mapping from V into itself. If \(\kappa _0>1,\) then we define \(\delta _0(r)=\delta (\kappa _0r).\) By (D) and (25), T is a \(\delta _0\)-contraction mapping. By Proposition 1, there exists a unique \(U^*\in V\) such that \(TU^*=U^*.\)

Let

$$\begin{aligned} A^*_o(c)(s):= \mathrm{arg} \max _{i\in A(s)} u(s-i, W(\mathcal{E}_iv(c),\mathcal{E}_iU^*(c))), \ \ c\in F, \ \ s\in S. \end{aligned}$$

The set \(A^*_o(c)(s)\) is non-empty and compact. Define the mapping \(a^*_o(c)(s):= \min A^*_o(c)(s),\)\(s\in S.\) By Proposition 2, \(a^*_o: F\rightarrow I.\) Hence, \(a^*_o(c)\) is continuous from the left. We show that \(a^*_o\) is continuous when both F and I are given the topology of weak convergence. Assume that \(c_n{\mathop {\rightarrow }\limits ^{*}} c_0\) in F as \(n\rightarrow \infty .\) Put \(y^*_n := a^*_o(c_n),\) Let \(y^*_o\) be any accumulation point of the sequence \((y^*_n)\) in the compact space I. We need to show that \(y^*_o=a^*_o(c_0).\) Clearly, \(y^*_o(0)=a^*_o(c_0)(0)=0.\) Let \((n')\) be a subsequence of positive integers such that \(y^*_{n'}{\mathop {\rightarrow }\limits ^{*}} y^*_0\) as \(n'\rightarrow \infty .\) Let \(\mathcal{C}(y^*_0)\) be the set of all continuity points of the function \(y^*_0.\) Since \(y^*_0\in I,\)\(S{\setminus } \mathcal{C}(y^*_0)\) is countable, the set \(\mathcal{C}(y^*_0)\) is dense in S. Choose any \(s\in \mathcal{C}(y^*_0).\) From Step 1 of the proof of Lemma 8 (with \(i_{n}=y^*_{n'}(s),\)\(i_0= y^*_0(s)\) and \(s_n=s\)), we can conclude that \(y^*_0(s)\in A^*_o(c_0)(s),\) for any \(s\in \mathcal{C}(y^*_0).\) Since \(y^*_0\) is continuous at s,  by Lemma 2, \(A^*_o(c_0)(s)\) is a singleton. Therefore, \(y^*(s)=a^*_o(c_0)(s)\) for all \(s\in \mathcal{C}(y^*_0).\) As both functions \(y^*_0\) and \(c_0\) are left continuous on S and \(\mathcal{C}(y^*_0)\) is dense in S,  we conclude that \(y^*_0(s)= a^*_o(c_0)(s)\) for all \(s\in S.\) Thus, we have shown that the mapping \(a^*_o:F\rightarrow I\) is continuous. Let \(\varPsi (c)(s):= s-a^*_o(c)(s),\)\(s\in S,\)\(c\in F.\) Obviously, \(\varPsi :F\rightarrow F\) is also a continuous mapping. Moreover, we know from Lemma 1 that F is a compact convex subset of a locally convex topological vector space. By the Schauder–Tychonoff fixed-point theorem (see Aliprantis and Border [1] or Dugundji and Granas [16]), there exists some \(c^*\in F\) such that \(\varPsi (c^*)=c^*.\) Clearly, \((c^*,U^*)\) is an SMPE. \(\square \)

Remark 9

We have shown that \(U^*\) is a fixed point of the mapping T defined in (17). By Proposition 1, \(U^*\) is unique. Moreover, if \(f_0\equiv 0\) and \(f_0\in V,\) then \(\lim _{n\rightarrow \infty }\Vert T^nf_0 - U^*\Vert _\omega =0.\)

In the remaining part of this section, we consider only assumptions of Theorem 2. By replacing the function \(\omega \) with v in Lemmas 4 and 5 and using condition (U5), one can prove that the mapping

$$\begin{aligned}i\rightarrow \int _Sv(c(s'))q(ds'|i)\end{aligned}$$

is upper semicontinuous continuous from the left. Let

$$\begin{aligned} \tilde{A}^*_o(c)(s):= \mathrm{arg} \max _{i\in A(s)}u(s-i,\mathcal{E}_iv(c))\quad \mathrm{and}\quad \tilde{a}^*_o(c)(s):= \min \tilde{A}^*_o(c)(s), \ \ s\in S,\ c\in F. \end{aligned}$$

Proof of Theorem 2

A simple adaptation of the arguments used in the proof of Theorem 1 yields that the mapping \(\tilde{a}^*_o:F\rightarrow I\) is continuous in the weak topology on F and I. Therefore, the mapping \(\tilde{\varPsi }(c)(s):= s- \tilde{a}^*_o(c)(s)\) from F into F is continuous as well. By the Schauder–Tychonoff fixed-point theorem, there exists some \(c^*\in F\) such that \(\tilde{\varPsi }(c^*)=c^*.\) Clearly, \(c^*\) is an SMPE. \(\square \)