1 Introduction

Consistency of dynamic valuations, or risk measures, addresses the fundamental question how risk-adjusted valuation depends, or should depend, on degrees of information. We refer to [1] for a survey of this topic. The two main application areas are in regulation, where values correspond to capital requirements, and nonlinear pricing.

The standard notion of strong time consistency, also called dynamic consistency, or simply time consistency, postulates that two positions with identical conditional values in every state at some future date must have the same value today. This guarantees that values can be determined backward recursively. The notions of weak time consistency that we consider allow to generalize this standard recursion in a single value per state to a (finite- or infinite-dimensional) vector recursion.

The restrictiveness of standard recursion is best visible in a regulatory context. It requires that capital requirements over several periods can be determined backward recursively. Concretely, if one agrees to use, for example, Tail-Value-at-Risk at 99.5 % per year, an example of a concave valuation, this would result in an overly conservative “\(\operatorname{TVaR}\) of \(\operatorname{TVaR}\)” outcome over two years. This indicates that a conditional requirement in a future state, no matter how well chosen, does not provide sufficient information about the conditional position in that state if it comes to determining a reasonable capital requirement today. Under weak time consistency, the accumulation of conservatism can be avoided, as shown in [12].

A common approach in nonlinear pricing is to interpret the outcomes of a concave valuation as bid prices, while ask prices derive from applying the valuation to positions with a minus sign. Often also a linear pricing operator is considered that generates intrinsic values in between bid and ask prices; see e.g. [5, 7] and Sect. 5. The point of consideration is whether a conditional position in a future state should be deemed equivalent to its conditional bid price in the sense that these two can be interchanged in a position without effect on its current bid price, as required by strong time consistency. Notice that this equivalence lets the conditional bid price also play the role of an ask price, even when they differ according to the very same conditional valuation. In order to reflect the presence of more than one type of price more fundamentally, it is a natural idea to allow a joint recursion in several prices and hence adopt weaker forms of time consistency. We refer to [13, Examples 3.8 and 3.9] for illustrations of our arguments in the context of regulation and nonlinear pricing.

We analyze two forms of weak time consistency that are still strong enough to ensure uniqueness of updates, that is, to allow at most one conditional valuation that satisfies the imposed consistency condition with respect to a given initial valuation. This means that these notions do not induce a different update than strong time consistency, but extend the set of valuations that have one.

The central notion, sequential consistency, simply requires that transitions from acceptable to unacceptable, or vice versa, should not be predictable. This is precisely the combination of the well-known concepts of (weak) acceptance and rejection consistency, (4.1a), (4.1b), which separately do not induce uniqueness of updates. Conditional consistency serves as an auxiliary, even weaker notion of time consistency. It prescribes by definition a unique update that is obtained by checking the acceptability of a position restricted to all possible events at a future date. The notions of sequential and conditional consistency have been introduced in [12] in a simple setting for coherent risk measures on a finite outcome space. We refer to [13] for further motivation of these concepts in a more general setting that includes nonconvex risk measures.

In this paper, we translate the main characterizations in [13] to concrete conditions for dual representations. We first show how dual representations of the unique conditionally consistent updates naturally arise as densities of measures defined in terms of initial measures. An extension of the construction addresses the case in which consistent updates fail to exist. We then give a characterization of sequential consistency, partly based on the well-known supermartingale condition for acceptance consistency [8, Proposition 4.10].

In the second part of the paper, starting with Sect. 5, we work under the assumption that a certain property holds, which was called the “supermartingale property” by Detlefsen and Scandolo [6] and which we refer to as consistent risk aversion. This assumption is quite intuitive both in a regulatory setting and in a pricing framework, and it greatly simplifies the analysis. In particular, the notions of conditional consistency and sequential consistency coincide for consistently risk averse dynamic valuations, and these properties can be characterized by three simple rules in terms of dual representations. Finally, we translate these rules to a description of the set of valuations with prescribed properties per time step and relate the extra flexibility compared to standard recursion to a joint recursion over a range of risk aversion levels.

In this paper, we consider families of valuations that are indexed concordantly with a given filtration. Time consistency will usually be discussed with respect to two given instants of time. In Sect. 6.1, we describe a compatibility property that makes it possible to apply the main results to time-indexed families of valuations without any difficulties.

2 Setup and notation

The setting we work in is the same as, for instance, in [8] and [9, Chap. 11], extended to incorporate continuous time. A filtered probability space \((\varOmega,\mathcal{F},(\mathcal {F}_{t})_{t\in\mathcal{T}},P)\) is assumed to be given, with \(\mathcal{T}\subset[0,T]\) a discrete or continuous time axis, \(0 \in\mathcal{T}\), \(T \in\mathcal{T}\) the finite or infinite horizon date, \(\mathcal{F}_{0} = \{\emptyset,\varOmega\}\), and \(\mathcal{F}_{T}=\mathcal{F}\). The set \(L^{\infty}:= L^{\infty}(\varOmega,\mathcal{F},P)\) is taken as the universe of all financial positions under consideration. The positions that are determinate at time \(t\) are given by \(L^{\infty}_{t} := L^{\infty}(\varOmega,\mathcal{F}_{t},P)\), and \(L^{0}_{t}(\overline{\mathbb{R}}_{+})\) denotes the set of all \(\mathcal{F}_{t}\)-measurable random variables with values in \(\mathbb{R}_{+} \cup\{\infty\}\).

All inequalities, equalities and limits applied to random variables are understood in the \(P\)-almost sure sense. The complement of an event \(F \in\mathcal{F}\) is indicated by \(F^{c}\).

We define \(\mathcal{Q}\) as the set of all probability measures on \((\varOmega, \mathcal{F} )\) that are absolutely continuous with respect to the reference measure \(P\). Following [9], the symbol \(\mathcal{Q}_{t}\) is used to denote the set of probability measures that are equivalent to \(P\) on \(\mathcal{F}_{t}\), whereas the collection of probability measures that coincide with \(P\) on \(\mathcal{F}_{t}\) is indicated by \(\mathcal{P}_{t}\). To denote the subset of \(\mathcal{Q}\) consisting of measures that are equivalent to \(P\), we use the conventional symbol \(\mathcal{M} ^{e}(P)\) instead of \(\mathcal{Q}_{T}\).

We use the following notation related to pasting probability measures into each other, similar to the usage, for example, in [6, Definition 9]. We write \(E^{Q}_{t}\) for \(E^{Q}[\cdot\,|\,\mathcal{F}_{t}]\). For a given pair \(Q'\in\mathcal{Q}\), \(Q \in\mathcal{Q}_{t}\), the probability measure \(Q'Q_{t} \in\mathcal{Q}\) is defined by the property

$$E^{Q'Q_{t}} X = E^{Q'} E^{Q}_{t} X \quad(X \in L^{\infty}). $$

Note that \(Q \in\mathcal{P}_{t}\) if and only if \(Q = PQ_{t}\). We also make use of conditional pasting with respect to \(F \in\mathcal{F}_{t}\): \(Q'Q^{F}_{t}\) is the probability measure in \(\mathcal{Q}\) defined by

$$ E^{Q'Q^{F}_{t}} X = E^{Q'} (1_{F} E^{Q}_{t} X + 1_{F^{c}} E^{Q'}_{t} X) \quad(X \in L^{\infty}). $$
(2.1)

We consider conditional valuations \(\phi_{t}: L^{\infty}\rightarrow L^{\infty}_{t}\) of the form

$$ \phi_{t}(\cdot) = \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{t}} \big(E^{Q}_{t} (\cdot) + \theta _{t}(Q)\big), $$
(2.2)

where the threshold function \(\theta_{t} : \mathcal{Q}_{t} \rightarrow L^{0}_{t}(\overline{\mathbb{R}}_{+})\) satisfies

$$ \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{t}} \theta_{t}(Q)=0. $$
(2.3)

As discussed in [9, Chap. 4], these are all mappings \(\phi _{t}:L^{\infty}\rightarrow L^{\infty}_{t}\) with the following five properties (with \(X,Y,X_{n} \in L^{\infty}\), \(C,\varLambda\in L^{\infty}_{t}\), \(0 \leq\varLambda\leq1\)): (i) normalization: \(\phi_{t}(0) = 0\), (ii) monotonicity: \(X \leq Y \Rightarrow\phi _{t}(X) \leq\phi_{t}(Y)\), (iii) \(\mathcal{F}_{t}\) -translation invariance: \(\phi_{t}(X+C) = \phi_{t}(X) + C\), (iv) \(\mathcal{F}_{t}\) -concavity:

$$\phi_{t}\big(\varLambda X+ (1-\varLambda)Y\big) \geq\varLambda\phi_{t} (X)+(1-\varLambda)\phi_{t}(Y), $$

and (v) continuity from above: \(X_{n} \searrow X \Rightarrow\phi _{t}(X_{n}) \searrow\phi_{t}(X)\).

The class of mappings from \(L^{\infty}\) to \(L^{\infty}_{t}\) that satisfy these properties will be denoted by \(\mathcal{C}_{t}\). We refer to its elements as concave valuations. A mapping \(\phi_{t} \in\mathcal{C}_{t}\) is called coherent if it also satisfies \(\mathcal{F}_{t}\)-positive homogeneity, that is, \(\phi _{t}(\varLambda X) = \varLambda\phi_{t}(X)\) for \(X \in L^{\infty}\), \(\varLambda\in L^{\infty}_{t}\), \(\varLambda\geq0\). These are precisely the elements of \(\mathcal{C}_{t}\) that can be represented by a threshold function \(\theta_{t}\) that only takes the values 0 and \(\infty\).

Mappings with properties (i)–(iii) are called (conditional) monetary valuations or, usually with opposite sign convention, monetary risk measures. Monetary valuations possess the elementary property of

$$ \textstyle\begin{array}{ll} \mathcal{F}_{t}\mbox{-}\textit{regularity}{:} & \phi _{t}(1_{F} X) = 1_{F} \phi_{t}(X) \quad(F \in\mathcal{F}_{t}), \end{array} $$
(2.4)

which is a minimal requirement for a meaningful interpretation of the mapping \(\phi_{t}\) as a normalized valuation at time \(t\).

The threshold function associated to a given conditional valuation is not determined uniquely. However, to a given \(\phi_{t} \in\mathcal{C}_{t}\), there is a unique minimal threshold function, which is given by

$$ \theta^{\min}_{t}(Q) = -\mathop{\mathrm{ess}\;\mathrm{inf}}_{X \in\mathcal{A}_{t}} E^{Q}_{t} X, $$
(2.5)

where \(\mathcal{A}_{t}\) denotes the acceptance set that is defined by

$$ \mathcal{A}_{t} = \{ X \in L^{\infty}\,|\,\phi _{t}(X) \geq0 \}. $$
(2.6)

We call a threshold function \(\theta_{t}\) regular if it satisfies

$$ 1_{F} E^{Q}_{t} = 1_{F} E^{R}_{t} \quad\Rightarrow\quad 1_{F} \theta_{t}(Q)=1_{F} \theta_{t}(R) \quad(Q,R \in\mathcal{Q}_{t}, F \in\mathcal{F}_{t}). $$
(2.7)

This property is similar to the regularity property (2.4) for risk measures. It has been called the “finite pasting property” in [8, after Lemma 3.3] and the “local property” in [11, Lemma 3.12]. Minimal threshold functions always have this property, and we sometimes impose (2.7) as a regularity condition when nonminimal thresholds are considered.

A valuation \(\phi_{t} \in\mathcal{C}_{t}\) is called sensitive if

$$X \lneqq0 \quad\Rightarrow\quad\phi_{t}(X) \lneqq0 \quad(X \in L^{\infty}) $$

and strongly sensitive, or also strictly monotone, if

$$ X \lneqq Y\quad\Rightarrow\quad\phi_{t}(X) \lneqq\phi_{t}(Y) \quad (X,Y \in L^{\infty}). $$
(2.8)

3 Conditional consistency

Conditional consistency for a pair of valuations \(\phi_{s}\), \(\phi_{t}\) can be expressed compactly as the requirement that (cf. [13])

$$ \mathcal{A}_{t} = \mathcal{A}_{s}^{t}, $$
(3.1)

where \(\mathcal{A}_{t}\) is the acceptance set of \(\phi_{t}\), see (2.6) and \(\mathcal{A}_{s}^{t}\) is the \(\mathcal{F}_{t}\) -restriction of \(\mathcal{A}_{s}\),

$$ \mathcal{A}_{s}^{t} = \{X \in L^{\infty}\,|\,\phi_{s}(1_{F} X) \geq0 \text{ for all } F \in \mathcal{F}_{t}\}. $$
(3.2)

A valuation \(\phi_{t}\) that satisfies (3.1) for a given \(\phi_{s}\) is called the conditionally consistent update of \(\phi_{s}\). This update is unique because conditional monetary valuations are completely determined by their acceptance set. For coherent risk measures \(\phi_{s}\), this update is simply obtained by conditioning probability measures; cf. [12, Theorem 7.1]. In other words, conditional consistency generalizes the notion of Bayesian updating. More precisely, assuming sensitivity of \(\phi_{s}\) in order to avoid technicalities, the conditionally consistent update in \(\mathcal{C}_{t}\) of \(\phi_{s}\) is given by (cf. [13])

$$\phi_{t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}\{ E^{Q}_{t} X \,|\,Q \in\mathcal{M}^{e}(P), \theta _{s}(Q)=0\}. $$

As observed in [14, Remark 3.1.10], outside the coherent class, it is not guaranteed that \(\mathcal{A}_{s}^{t}\) has the property

$$ 1_{F} X, 1_{F^{c}} X \in\mathcal{A}_{s}^{t} \quad \Rightarrow\quad X \in\mathcal{A}_{s}^{t} \quad(X \in L^{\infty}). $$
(3.3)

Since this is a necessary condition for \(\mathcal{F}_{t}\)-regularity (2.4) of \(\phi_{t}\) satisfying (3.1), a conditionally consistent update is not possible in \(\mathcal{C}_{t}\) (and not even in the monetary class) when that property is not satisfied.

We characterize the existence of a conditionally consistent update of a given \(\phi_{s}\) in terms of the operator \(\eta: \mathcal{Q}_{s} \times\mathcal{F}_{t} \rightarrow L^{0}_{s}(\overline{\mathbb{R}}_{+})\) defined by

$$ \eta(Q,A) = -\mathop{\mathrm{ess}\;\mathrm{inf}}\{E^{Q}_{s} (1_{A} X) \,|\,1_{A} X \in\mathcal{A}_{s}^{t}\}. $$
(3.4)

Nonnegativity of this function follows from the fact that \(0 \in \mathcal{A} _{s}^{t}\). It is also clear that \(\eta\) can only take infinite values where \(\theta_{s}(Q)\) is infinite because

$$ \eta(Q,A) \leq-\mathop{\mathrm{ess}\;\mathrm{inf}}\{E^{Q}_{s} X \,|\,X \in\mathcal{A}_{s}\} = \theta ^{\min}_{s}(Q) \leq\theta_{s}(Q). $$
(3.5)

The function \(\eta\) can be viewed as a dual representation of \(\mathcal{A} _{s}^{t}\) in the sense that

$$X \in\mathcal{A}_{s}^{t}\quad\Longleftrightarrow\quad E^{Q}_{s}(1_{A} X) + \eta(Q,A) \geq0 \quad\text{for all } A \in\mathcal{F}_{t}, Q \in\mathcal{Q}_{s}. $$

The implication from left to right is obvious from the definition of \(\mathcal{A}_{s}^{t}\), whereas the reverse implication follows from the fact that \(X \notin\mathcal{A}_{s}^{t}\) implies that \(E^{Q}_{s}(1_{A} X) + \theta_{s}(Q) \ngeq 0\) for some \(Q \in\mathcal{Q}_{s}\) and \(A \in\mathcal{F}_{t}\), together with inequality (3.5).

Proposition 3.1

The valuation \(\phi_{s} \in\mathcal{C}_{s}\) admits a conditionally consistent update \(\phi_{t}\) if and only if for all \(Q \in\mathcal{Q}_{s}\), the mapping \(\eta(Q,\cdot)\) is additive; in other words,

$$ \eta(Q,A\cup B) = \eta(Q,A) + \eta(Q,B) \quad\textit{for}\ \textit{all}\ A, B \in \mathcal{F}_{t}, A\cap B = \emptyset. $$
(3.6)

If this holds, then \(E^{Q}\eta(Q,\cdot)\) is a measure on \(\mathcal {F}_{t}\) that is absolutely continuous with respect to \(Q\), and its Radon–Nikodým derivative \(\mu_{t}(Q)\) for given \(Q \in \mathcal{Q}_{t}\) equals the minimal threshold \(\theta^{\min}_{t}(Q)\) for the update \(\phi_{t}\).

The proof is in the Appendix. We conclude this section by addressing the question how to define an update in case (3.6) does not hold and return to the main line in Sect. 4.

3.1 The refinement update

The \(\mathcal{F}_{t}\) -refinement update \(\phi_{s}^{t}\) of a given sensitive \(\mathcal{F}_{s}\)-conditional monetary valuation \(\phi_{s}\) has been introduced in [13] as the smallest \(\mathcal{F}_{t}\)-conditional monetary valuation whose acceptance set contains the set \(\mathcal{A}_{s}^{t}\) defined in (3.2). In other words, \(\phi_{s}^{t}\) is the conditional capital requirement for \(\mathcal{A}_{s}^{t}\) [6, 11]. It is defined as [13, Definition 4.3]

$$ \phi^{t}_{s}(X) = \operatorname{ess\;sup}\big\{ Y \in L_{t}^{\infty}\,\big|\, \phi_{s}\big(1_{F}(X-Y)\big) \geq0 \text{ for all } F \in\mathcal{F}_{t} \big\} , $$
(3.7)

and its acceptance set is the closure of \(\mathcal{A}_{s}^{t}\) under (3.3),

$$\begin{aligned} \mathcal{B}^{t}_{s} = \bigg\{ \sum_{i \in\mathbb{N}} 1_{A_{i}} X_{i} \bigg|& X_{i} \in\mathcal{A}_{s}^{t}, A_{i} \in\mathcal{F}_{t}, \bigcup_{i \in \mathbb{N}} A_{i} = \varOmega, \\ & A_{i} \cap A_{j} = \emptyset \text{ for } i \neq j\bigg\} \supseteq \mathcal{A}_{s}^{t}. \end{aligned}$$
(3.8)

Sensitivity of \(\phi_{s}\) ensures that the mapping \(\phi^{t}_{s}\) is indeed an \(\mathcal{F}_{t}\)-conditional monetary valuation, that is, it satisfies the first three properties of risk measures in \(\mathcal{C}_{t}\) listed in Sect. 2; cf. [13, Corollary 4.3]. Concavity is preserved as well, that is, the \(\mathcal{F}_{t}\)-refinement update of \(\phi_{s} \in\mathcal{C}_{s}\) is \(\mathcal{F}_{t}\)-concave; cf. [3] and [13, Proposition 2.2]. It follows from (3.1) that \(\phi_{s}^{t}\) must coincide with the conditionally consistent update if that update exists. Otherwise, the inclusion in (3.8) is strict (see [13, Example 4.5] for a simple example), and we know from the previous section that this occurs when the mapping \(\eta\) given by (3.4) is not additive in its second argument.

We restrict attention to \(s=0\) in this section and consider a valuation \(\phi_{0} \in\mathcal{C}_{0}\). Perhaps not surprisingly, the refinement update is closely related to the smallest measure \(\bar{\eta}\) dominating \(\eta\), which is given by

$$ \bar{\eta}(Q,A) := \sup\bigg\{ \sum_{i \in\mathbb{N}} \; \eta (Q,A_{i}) \, \bigg| \, \bigcup_{i \in\mathbb{N}} A_{i} = A,\; A_{i} \cap A_{j} = \emptyset\text{ for } i \neq j\bigg\} . $$
(3.9)

Notice that \(\bar{\eta}(Q)\) need not be finite even for measures \(Q\) for which \(\theta_{0}(Q,\cdot)\) is finite, as is illustrated by the following example.

Example 3.2

Take \(\phi_{0} = \min\{E^{P}, E^{Q}(\cdot)+\theta\}\) with \(\theta>0\). Assume that the space \((\varOmega,P,\mathcal{F}_{t})\) is atomless and that \(Q\) is equivalent to \(P\). Also assume that there exists \(X \in L^{\infty}\) such that \(E^{P}_{t} X = 0\) and \(E^{Q}_{t} X=-\theta\). Then not only \(X \in\mathcal{A}_{0}^{t}\), but also \(1_{A} X / Q[A] \in\mathcal{A}_{0}^{t}\) for all nontrivial \(A \in\mathcal{F}_{t}\). Hence, \(\eta(Q,A)=\theta\) for all such \(A\), and consequently \(\bar{\eta}(Q,F) = \infty\) for all nontrivial \(F \in\mathcal{F}_{t}\). It follows that the refinement update is given by \(E^{P}_{t}\).

We prove a representation theorem under the assumption that the update is continuous from above. The proof is in the Appendix.

Proposition 3.3

Let a sensitive concave valuation \(\phi_{0} \in\mathcal{C}_{0}\) be given, and assume that the \(\mathcal{F}_{t}\)-refinement update \(\phi_{0}^{t}\) is continuous from above. Then we have that

$$\phi_{0}^{t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{t}} \big(E^{Q}_{t} X + \mu _{t}(Q) \big), $$

where \(\mu_{t}(Q)\) is the Radon–Nikodým derivative with respect to \(Q\) of \(\bar{\eta}(Q,\cdot)\) as defined in (3.9).

4 Characterization of sequential consistency

We say that the conditional monetary valuations \(\phi_{s}\) and \(\phi_{t}\) are sequentially consistent, or that \(\phi_{t}\) is a sequentially consistent update of \(\phi_{s}\), if the following conditions hold (cf. [13]):

$$\begin{aligned} & \phi_{t}(X) \geq0 \quad\Rightarrow\quad\phi_{s}(X) \geq0 \quad(X \in L^{\infty}), \end{aligned}$$
(4.1a)
$$\begin{aligned} & \phi_{t}(X) \leq0 \quad\Rightarrow\quad\phi_{s}(X) \leq0 \quad(X \in L^{\infty}). \end{aligned}$$
(4.1b)

The term “sequential” is chosen to express that the values of a given position at a sequence of time instants should not change sign predictably. These two requirements have been called weak acceptance consistency and weak rejection consistency, respectively; we use the terms “acceptance consistency” and “rejection consistency” for brevity. They can be combined into one implication that characterizes sequential consistency (cf. [13, Lemma 3.2]), namely

$$ \phi_{t}(X)=0 \quad\Rightarrow\quad\phi_{s}(X)=0. $$
(4.2)

This can be viewed as an extension of the normalization condition \(\phi_{0}(0)=0\) that requires a zero outcome not only for the zero position, but also for every \(X \in L^{\infty}\) such that \(\phi_{t}(X)=0\). We refer to [13] for a further discussion of this concept. In particular, it is shown in this reference that sequential consistency implies conditional consistency under the assumption of strong sensitivity (2.8), so that uniqueness of updates is guaranteed. On the other hand, sequential consistency is much weaker than the standard notion of strong time consistency, which requires that

$$ \phi_{s}(X) = \phi_{s}\big(\phi_{t}(X)\big). $$
(4.3)

We further discuss this backward recursion in Sect. 7.

Acceptance consistency has been characterized by a supermartingale condition on threshold functions; see [8, Proposition 4.10]. Adapted to our setting, and with a slight generalization for risk measures that are not sensitive and thresholds that are not minimal, the result reads as follows. All proofs in this section are in the Appendix.

Lemma 4.1

Acceptance consistency (4.1a) holds for a pair \((\phi_{s},\phi_{t})\) with \(\phi_{s} \in\mathcal{C}_{s}\) and \(\phi_{t} \in\mathcal{C}_{t}\) if

$$ \theta_{s}(Q'Q_{t}) \geq E^{Q'}_{s} \theta_{t}(Q) \quad(Q'\in\mathcal {Q}_{s}, Q \in \mathcal{Q}_{t}) $$
(4.4)

and only if their minimal threshold functions satisfy this property.

We identify \(Q \in\mathcal{Q}\) with its Radon–Nikodým derivative \(z^{Q}:=\frac {dQ}{dP}\) and equip the set \(\mathcal{Q}\) with the corresponding \(L^{1}\)-topology. The corresponding \(\varepsilon\)-neighborhood of \(Q\) is denoted by

$$ B^{\varepsilon}(Q):=\{Q'\in\mathcal{Q}\,|\,\| z^{Q'}-z^{Q} \|_{1} < \varepsilon\}. $$
(4.5)

We also use the union of these sets over probability measures of the form \(RQ_{t}\) for given \(Q \in\mathcal{Q}_{t}\),

$$ B^{\varepsilon}_{t}(Q):= \bigcup_{R \in\mathcal{Q}} B^{\varepsilon}(RQ_{t}). $$
(4.6)

Theorem 4.2

Let a pair of valuations \(\phi_{s} \in\mathcal{C}_{s}\), \(\phi_{t} \in \mathcal{C}_{t}\) be given, and suppose that these valuations are represented by regular threshold functions \(\theta_{s}\) and \(\theta_{t}\), respectively. The pair \((\phi_{s},\phi_{t})\) is sequentially consistent if we have both

$$\begin{aligned} \mathop{\mathrm{ess}\;\mathrm{inf}}\{ \theta_{s} (QQ^{*}_{t}) - E^{Q}_{s} \theta_{t}(Q^{*})\,|\, Q \in \mathcal{Q}_{s}\} &\geq0 \quad(Q^{*} \in\mathcal{Q}_{t}), \end{aligned}$$
(4.7a)
$$\begin{aligned} \inf\big\{ E^{P} \big(\theta_{s}(Q) - E^{Q}_{s} \theta_{t}(Q^{*})\big)\,\big|\, Q \in B^{\varepsilon}_{t}(Q^{*}) \cap\mathcal{P}_{s} \big\} &\leq0 \quad (\varepsilon>0, Q^{*} \in\mathcal{Q}_{t} \\ &\phantom{\leq0 \quad(}\ \textit{with}\ \theta_{t}(Q^{*}) \ \textit{bounded}), \end{aligned}$$
(4.7b)

and only if these conditions hold for their minimal threshold functions.

From Theorem 4.2 we immediately obtain sufficiency of the following, simpler criterion.

Corollary 4.3

A pair of valuations \(\phi_{s} \in\mathcal{C}_{s}\), \(\phi_{t} \in \mathcal{C}_{t}\) represented by regular \(\theta_{s}\) and \(\theta_{t}\), respectively, is sequentially consistent if for all \(Q^{*} \in\mathcal{Q}_{t}\),

$$\begin{aligned} & \mathop{\mathrm{ess}\;\mathrm{inf}}\{ \theta_{s} (QQ^{*}_{t}) - E^{Q}_{s} \theta_{t}(Q^{*})\,|\,Q \in \mathcal{Q} _{s}\} = 0. \end{aligned}$$
(4.8)

Before we discuss the interpretation, let us first briefly compare the criteria in the corollary and the preceding theorem. Criterion (4.8) is obtained by extending the requirement in (4.7b) in two respects: to \(Q^{*}\) with unbounded \(\theta_{t}(Q^{*})\), and not only for \(\varepsilon>0\), but also for \(\varepsilon=0\). Both extensions are not without loss of generality, as shown by two counterexamples in Sect. A.6 of the Appendix.

For monetary valuations \(\phi\) that are coherent, so that the minimum threshold only takes the values 0 and \(\infty\), condition (4.8) amounts to the requirement that measures applied at \(t\) (i.e., \(Q^{*} \in\mathcal{Q}_{t}\) such that \(\theta _{t}(Q^{*})=0\)) can be combined into one measure of the form \(QQ^{*}_{t} \in\mathcal{Q}\) with zero threshold. This property has been called junctedness in [12]. In the noncoherent case, we can again interpret (4.8) as a junctedness condition, considering \(\theta_{t}(Q^{*})\) as an affine term added to the conditional expectation of positions. The criterion requires that for all conditional affine functionals of the form \(E^{Q^{*}}_{t}(\cdot)+\theta_{t}(Q^{*})\), there exists an initial “junct” \(Q \in\mathcal{Q}\) that approximately amounts to taking a weighted average of the outcome of this functional.

5 Risk aversion in concave valuations

In this section, we summarize some well-known properties of valuations related to risk aversion. This prepares for our definition of consistent risk aversion in the next section, which plays an important role in simplifying the characterizations obtained so far. We refer to [9, Chap. 2] for an extensive introduction to risk aversion, emphasizing its role in axiomatic frameworks for risk measures.

A valuation \(\phi_{t} \in\mathcal{C}_{t}\) is said to exhibit risk aversion at level \(t\) with respect to a measure \(Q \in\mathcal{Q}\) if \(\phi_{t}(X) \leq E^{Q}_{t} X\) for all \(X \in L^{\infty}\). In terms of the minimal representation \(\theta^{\min }_{t}\) of \(\phi_{t}\), the criterion is simply

$$ \theta^{\min}_{t}(Q)=0. $$
(5.1)

The terminology is taken from the literature on premium principles, although the direction of the inequality is reversed here due to our different sign convention. An alternative term that is sometimes used is that there is nonnegative risk loading. In the applications to premium setting, the measure \(P\) is the “physical” (real-world) measure, and the difference \(E^{P}_{t}(X)-\phi_{t}(X)\) is viewed as a risk margin. A similar interpretation may be given in a regulatory context, where \(\phi_{0}(X)\) serves to determine the amount of required capital associated to a risky position \(X\). Alternatively, one may think of \(P\) as a pricing measure and interpret \(\phi_{t}(X)\) as a bid price for the payoff \(X\); in other words, \(\phi_{t}(X)\) is the price that a trader at time \(t\) is able to get in the market for a contract that obliges the seller to deliver a contingent payoff \(X\). The corresponding ask price is \(-\phi_{t}(-X)\) [10, 5, 7]; this is the price that a trader needs to pay to obtain the contingent payoff \(X\). The inequality \(\phi_{t}(X) \leq E^{P}_{t} X\) implies \(E^{P}_{t} X \leq-\phi _{t}(-X)\), so that the “intrinsic value” \(E^{P}_{t} X\) lies between the bid price \(\phi_{t}(X)\) and the ask price \(-\phi_{t}(-X)\). The presence of a martingale measure inducing expected values that are bracketed by the bid and ask prices associated to a nonlinear valuation \(\phi_{t}\) is a well-known condition for absence of arbitrage [10, Theorem 3.2]; cf. also the discussion in [13].

It turns out to be convenient to study the risk aversion property in combination with acceptance consistency (4.1a), which guarantees that if the most strongly aggregated valuation exhibits risk aversion, then so do its updates.

Lemma 5.1

If a concave valuation \(\phi_{t} \in\mathcal{C}_{t}\) is an acceptance consistent update of a conditional valuation \(\phi_{s} \in\mathcal{C}_{s}\) that exhibits risk aversion with respect to a measure \(Q \in\mathcal {Q}_{t}\) at level \(s\), then \(\phi_{t}\) exhibits risk aversion with respect to \(Q\) at level \(t\).

Proof

From (5.1) it follows that \(\theta^{\min}_{s}(Q)=0\). The supermartingale condition (4.4) implies that the relation \(\theta^{\min}_{t}(Q)=0\) holds as well, and hence \(\phi_{t} \leq E^{Q}_{t}\). □

In particular, if we assume that there is a measure \(P'\) equivalent to the reference measure \(P\) with \(\theta^{\min}_{0}(P')=0\), like in [15], for example, it follows that not only \(\phi_{0}\), but also all its acceptance consistent updates exhibit risk aversion with respect to \(P'\). This assumption is hardly restrictive for sensitive concave valuations, as illustrated by the following lemma.

Lemma 5.2

Any coherent sensitive valuation is risk averse with respect to some \(P' \in\mathcal{M}^{e}(P)\). For a sensitive concave valuation \(\phi_{0} \in\mathcal{C}_{0}\), there exists for all \(\varepsilon>0\) a sensitive concave valuation \(\phi^{\prime}_{0} \in \mathcal{C}_{0}\) that is risk averse with respect to some \(P' \in\mathcal{M}^{e}(P)\) and satisfies \(0 \leq\phi_{0}(X)-\phi^{\prime}_{0}(X)<\varepsilon\) for all \(X \in L^{\infty}\).

Proof

Let \(\theta^{\min}_{0}\) denote the minimal representation of \(\phi_{0} \in \mathcal{C}_{0}\). From (2.3) and (A.9) it follows that there exists \(P'\in\mathcal{M}^{e}(P)\) with \(\theta^{\min }_{0}(P')<\varepsilon\). In the coherent case, then \(\theta^{\min}_{0}(P')=0\). For the general case, take as \(\phi^{\prime}_{0}\) the valuation obtained by redefining \(\theta ^{\min}_{0}(P'):=0\). Obviously, then \(0 \leq\phi_{0}(X)-\phi^{\prime}_{0}(X) < \varepsilon\) for all \(X \in L^{\infty}\). Sensitivity of \(\phi^{\prime}_{0}\) follows from \(\phi^{\prime}_{0} \leq\phi_{0}\), and by (5.1) the claims follow. □

In the sequel, we mainly concentrate on risk aversion with respect to \(P\). This is without further loss of generality because formally the only role of the reference measure is to specify nullsets, and hence \(P\) can be replaced by any \(P'\in\mathcal{M}^{e}(P)\).

6 Consistent risk aversion

The notion of risk aversion can be incorporated in time consistency in a straightforward way by imposing upper limits on \(\phi_{s}(X)\) in terms of conditional expected values not only of \(X\), but also of \(\phi_{t}(X)\). A dynamic valuation is a family \((\phi_{t})_{t\in\mathcal{T}}\) of conditional valuations \(\phi_{t}: L^{\infty}\rightarrow L^{\infty}_{t}\).

Definition 6.1

A dynamic valuation \((\phi_{t})_{t\in \mathcal{T}}\) is said to exhibit consistent risk aversion (CRA) with respect to \(P' \in\mathcal {M}^{e}(P)\) if \(\phi_{s} \leq E^{P'}_{s} \phi_{t}\) for all \(0 \leq s \leq t \leq T\).

For \(P'=P\), we may omit the phrase “with respect to \(P'\)”. The property of conditional risk aversion has been introduced in [6] under the name supermartingale property, and it is motivated there by the argument that the average of risk premiums at a given level of information should not exceed the risk premium that is required when less information is available. It should be noted that the sign convention in the cited paper is different from the one we use here.

Under the CRA condition, various notions of weak time consistency coincide. This is stated in the following proposition.

Proposition 6.2

Let a dynamic valuation \(\phi= (\phi_{t})_{t\in\mathcal{T}}\) be given with \(\phi_{t} \in\mathcal{C}_{t}\) for all \(t \in\mathcal{T}\). Under the condition that \(\phi\) satisfies the CRA property with respect to some \(P' \in\mathcal {M}^{e}(P)\), the following statements are equivalent:

  1. (i)

    \(\phi\) is acceptance consistent;

  2. (ii)

    \(\phi\) is conditionally consistent;

  3. (iii)

    \(\phi\) is sequentially consistent.

Proof

The implications from (ii) to (i) and from (iii) to (i) hold by definition, even without the CRA assumption. The CRA property directly implies rejection consistency (4.1b), so that acceptance consistency is equivalent to sequential consistency under CRA. Finally, to prove that acceptance consistency implies conditional consistency (3.1), first note that the inclusion \(\mathcal{A}_{t} \subset\mathcal{A}_{s}^{t}\) already holds without the CRA assumption since acceptance consistency means that \(\mathcal{A}_{t} \subset\mathcal{A}_{s}\), and by the regularity (2.4) of \(\phi_{t}\), this implies that \(\mathcal{A}_{t} \subset\mathcal{A}^{t}_{s}\). To derive the reverse inclusion, take \(X \in\mathcal{A}_{s}^{t}\). Then \(\phi _{s}(1_{F} X) \geq0\) for all \(F \in\mathcal{F}_{t}\), so that by the CRA property, \(E^{P'}_{s} 1_{F} \phi_{t}(X) = E^{P'}_{s} \phi_{t}(1_{F} X) \geq0\) for all \(F \in\mathcal{F}_{t}\). Taking \(F = \{ \phi_{t}(X)<0 \}\), we find that \(P'[\phi_{t}(X)<0]=0\); in other words, \(\phi_{t}(X) \geq0\), so that \(X \in\mathcal{A}_{t}\). □

Recall that conditionally consistent updates are unique by definition (see (3.1)); so the combination of acceptance consistency and CRA is sufficiently strong to rule out ambiguity of updating. Hence, we can also apply the notion of CRA to an initial valuation itself as follows.

Definition 6.3

The initial valuation \(\phi_{0} \in\mathcal{C}_{0}\) is said to exhibit CRA (with respect to a given filtration) if its conditionally consistent updates exist at all \(t \in\mathcal{T}\) and form a CRA dynamic valuation.

We state a number of conditions that may be imposed on dynamic valuations in terms of the associated threshold functions. In Theorem 6.4, these are shown to be equivalent to consistent risk aversion of the initial valuation (with respect to \(P\)). The conditions below are stated for a given dynamic risk measure \((\phi _{t})_{t\in\mathcal{T}}\) and for all \(0 \leq s \leq t \leq T\). The proof of the theorem is in the Appendix.

  • Rule 1: \(\theta_{t}(P)=0\).

  • Rule 2: \(\theta_{s}(Q'Q_{t}) \geq E^{Q'}_{s} \theta_{t}(Q) \ (Q'\in\mathcal{Q}_{s}, Q \in\mathcal{Q}_{t})\).

  • Rule 3: \(\theta_{s}(PQ_{t}) = E^{P}_{s} \theta_{t}(Q) \ (Q \in \mathcal{Q}_{t})\).

Theorem 6.4

Let a dynamic valuation \(\phi=(\phi_{t})_{t\in\mathcal{T}}\) be given with \(\phi_{t} \in\mathcal{C}_{t}\) for all \(t \in\mathcal{T}\). The following four conditions are equivalent:

  1. (1)

    \(\phi_{0}\) exhibits consistent risk aversion, and \(\phi_{t}\) is its conditionally consistent update at \(t\) for all \(t \in\mathcal{T}\).

  2. (2)

    \(\phi\) is acceptance consistent and exhibits consistent risk aversion.

  3. (3)

    \(\phi\) is representable by regular threshold functions \((\theta_{t})_{t\in\mathcal{T}}\) that satisfy Rules 13.

  4. (4)

    The minimal threshold functions of \(\phi\) satisfy Rules 13.

6.1 Additional results

Under Rule 3, the reference measure \(P\) serves as a universal junct (cf. the discussion at the end of Sect. 4), guaranteeing sequential consistency in a straightforward way. Rule 3 also shows that consistent updating induces a strong link between, on the one hand, \(\theta_{s}|_{\mathcal{P}_{t}}\), that is, the threshold functions \(\theta_{s}\) restricted to \(\mathcal{P}_{t}\), and, on the other hand, \(\theta_{t}\), whose restriction to \(\mathcal{P}_{t}\) still fully describes \(\phi_{t}\); cf. (A.3). It is clear that a given threshold function \(\theta_{t}\) at time \(t\) completely determines \(\theta_{s}(Q)\) for \(Q \in\mathcal{P}_{t}\). The converse is also true if we impose regularity of \(\theta_{t}\).

Proposition 6.5

Let \(\phi_{s}\) and \(\phi_{t}\) belong to a dynamic risk measure that satisfies the first condition of Theorem 6.4. Then \(\phi_{t}\) is a function of \(\theta^{\min}_{s}|_{\mathcal{P}_{t}}\) and vice versa. Moreover, for any representation \(\theta_{s}\) of \(\phi_{s}\), the restriction \(\theta_{s}|_{\mathcal{P}_{t}}\) determines a unique regular threshold function \(\theta_{t}\) for \(\phi_{t}\) that satisfies Rule 3.

Proof

We show that under Rule 3, there exists at most one regular threshold function \(\theta_{t}\). The second claim then follows directly, and the first claim follows from the fact that Rule 3 holds for minimal (hence regular) threshold functions.

Given a regular threshold function \(\theta_{t}\), we have \(\theta _{t}(PQ_{t}^{A}) = 1_{A} \theta_{t}(Q)\). (Recall that the notation \(PQ_{t}^{A}\) is used for conditional pasting of the measures \(P\) and \(Q\); see (2.1).) Therefore, Rule 3 is equivalent to

  • Rule 3′: \(\theta_{s}(PQ_{t}^{A}) = E^{P}_{s} 1_{A} \theta_{t}(Q) \ (Q \in\mathcal{Q}_{t}, A \in\mathcal{F}_{t})\).

Now consider two regular threshold functions \(\theta_{t}\) and \(\theta^{\prime}_{t}\) that both satisfy this rule. For given \(Q \in\mathcal{Q}_{t}\), apply Rule 3′ to \(A=\{\theta_{t}(Q) < \theta ^{\prime}_{t}(Q)\}\). It can be verified directly that this event has zero probability, and by an obvious symmetry argument, it follows that \(\theta_{t}(Q)\) and \(\theta^{\prime}_{t}(Q)\) are equal. □

The connection provided by the proposition not only reflects the uniqueness of consistent updates, which was also proved in [13] for not necessarily concave risk measures under appropriate sensitivity assumptions, but it also indicates which feature of the aggregated valuation determines the update, or why it may fail to exist. For instance, one of the consequences of Rule 3′ is that for disjoint \(A,B \in\mathcal{F}_{t}\),

$$ \theta^{\min}_{s}(PQ_{t}^{A \cup B}) = \theta^{\min}_{s}(PQ_{t}^{A})+ \theta^{\min}_{s} (PQ_{t}^{B}) $$
(6.1)

because otherwise there exists no regular time-\(t\) threshold function \(\theta_{t}\) that satisfies Rule 3′.

Remark 6.6

As shown in Proposition 6.2, the difference between conditional and sequential consistency disappears under the CRA property. In Proposition 3.1, we have characterized conditional consistency in terms of the operator \(\eta\) defined in (3.4). Under the conditions of Theorem 6.4, the operator \(\eta\) satisfies

$$ \eta(PQ_{t},A) = E^{P}_{s} 1_{A} \theta^{\min}_{t}(Q) = \theta^{\min}_{s}(PQ_{t}^{A}) \quad(Q \in \mathcal{Q}_{t}, \; A \in\mathcal{F}_{t}). $$
(6.2)

This condition determines \(\eta\) completely because \(\eta(Q'Q_{t},A) = E^{Q'}_{s} 1_{A} \theta^{\min}_{t}(Q)\) by Proposition 3.1. From this formula, the additivity of \(\eta\), which characterizes conditional consistency, is obvious. A closer inspection reveals that if we restrict Rule 3′ to events \(A\) that are in a sense “small,” the rule still guarantees conditional consistency. To be precise, let \(\mathcal {F}^{\mathrm {a}}_{t} \subset\mathcal{F}_{t}\) denote the collection of atoms of \((\varOmega,\mathcal{F}_{t},P)\), and for any given \(\delta> 0\), define \(\mathcal{F}^{\delta}_{t} = \{F \in\mathcal{F}_{t} \,|\, P[F]<\delta\}\). Choose \(\delta>0\) and consider the following relaxation of Rule 3′:

  • Rule 3″: \(\theta_{s}(PQ_{t}^{S}) = E^{P}_{s} 1_{S} \theta_{t}(Q) \ (Q \in\mathcal{Q}_{t}, S \in\mathcal{F}^{\mathrm{a}}_{t} \cup\mathcal {F}^{\delta}_{t})\).

It reflects a weaker, “local” form of consistent risk aversion. Notice that under Rules 1, 2, and 3″, relation (6.2) still holds and that conditional consistency is preserved. It also follows that conditionally consistent updates are in fact already completely determined by the restriction of \(\theta_{s}\) to \(\{PQ_{t}^{A} \,|\,A \in\mathcal {F}^{\mathrm {a}}_{t} \cup\mathcal{F}^{\delta}_{t}\}\). Rule 3″ is too weak, however, to guarantee that such updates are sequentially consistent since condition (6.1) may be violated for sets \(A\) and \(B\) such that \(A \cup B\) is not in the collection \(\mathcal{F}^{\mathrm{a}}_{t} \cup\mathcal{F}^{\delta}_{t}\).

We conclude this section with a corollary on the characterization of CRA for initial valuations; see Definition 6.3. We make use of a compatibility result, which shows that it is sufficient to verify the conditions of Theorem 6.4 for a limited set of pairs of time instants \(s,\,t\). Similar results in [13] make use of sensitivity conditions; under the CRA assumption, these conditions are not needed.

Proposition 6.7

For a dynamic valuation \((\phi_{t})_{t \in\mathcal{T}}\) with \(\phi_{t} \in\mathcal{C}_{t}\), the following statements are equivalent:

  1. (a)

    The conditions of Theorem 6.4 hold for \(s=0\) and for all \(t \in\mathcal{T}\) with \(t>0\).

  2. (b)

    The conditions of Theorem 6.4 hold for all \(s,t \in\mathcal{T}\) with \(s < t\).

In case \(\mathcal{T}=\{0,1,\dots,T\}\) with \(T\) finite, these statements are also equivalent to

  1. (c)

    The conditions of Theorem 6.4 hold for all \(s,t \in\mathcal{T}\) with \(t=s+1\).

From Theorem 6.4 and the proof of Proposition 6.5 the following result now follows straightforwardly.

Corollary 6.8

Let an initial concave valuation \(\phi_{0} \in\mathcal{C}_{0}\) be given, represented by the threshold function \(\theta_{0}\). The valuation \(\phi _{0}\) exhibits CRA if for all \(t \in\mathcal{T}\), the (unique) regular threshold function \(\theta_{t}\) that satisfies Rule 3′ with \(s=0\) exists and satisfies Rules 1 and 2 for \(s=0\). This condition is also necessary if \(\theta_{0}\) is the minimal representation of \(\phi_{0}\).

7 CRA valuations with prescribed stepwise properties

We assume a discrete, finite time axis \(\mathcal{T}= \{0,1,\dots,T\} \). Suppose that for every \(t \in \mathcal{T}' := \{0,1,\dots,T-1\}\), a single-period concave valuation \({\bar{\psi}}_{t}:L_{t+1}^{\infty}\rightarrow L_{t}^{\infty}\) is given. These single-period valuations can be composed to form a dynamic valuation \(\psi= (\psi _{t})_{t\in\mathcal{T}}\), which may be defined recursively by

$$ \psi_{T}(X) = X, \qquad\psi_{t}(X) = {\bar{\psi}}_{t}\big(\psi _{t+1}(X)\big) \quad(t = T-1,\dots,0). $$
(7.1)

The dynamic valuation \(\psi\) obtained in this way is strongly time consistent, that is, it satisfies (4.3). The construction as described is in fact a standard method of obtaining multiperiod strongly time consistent valuations. The given valuations \({\bar{\psi}}_{t}\) may correspond to one of the well-known types of static risk measures. Standard examples in the coherent class are Tail-Value-at-Risk (\(\operatorname{TVaR}\)) and its generalization to spectral risk measures, and \(\operatorname{MINVAR}\) and other variants of distortion measures, introduced in [4] in the context of bid–ask price modeling; see also Example 7.3. The prime example in the concave class is that of entropic risk measures, related to exponential utility.

If in general we write \(\phi_{s,t}\) for the restriction \(\phi _{s}|_{L_{t}^{\infty}}\) of a concave valuation \(\phi_{s}\) to \(L_{t}^{\infty}\), with \(t>s\), then the valuation \(\psi\) defined by (7.1) satisfies \(\psi_{t,t+1} = {\bar{\psi}}_{t}\) for all \(t \in\mathcal{T}'\), and it is in fact the only strongly time consistent dynamic valuation that has this property. However, there are in general many weakly time consistent valuations \(\phi\) that satisfy the same property, that is,

$$ \phi_{t,t+1} = {\bar{\psi}}_{t} \quad(t \in\mathcal{T}'). $$
(7.2)

Across a single time period, these valuations express the same level of conservatism as the given single-period valuations \({\bar{\psi }}_{t}\), but across multiple periods, they can avoid the piling up of conservatism that is inherent in the strongly consistent valuation \(\psi\). In this section, we discuss the construction of CRA valuations that match a given set of single-period valuations. In view of Definition 6.3, the matching condition (7.2) can also be interpreted as a prescription of the stepwise properties of initial valuations \(\phi_{0}\).

To simplify the analysis, we restrict attention to valuations with the additional property that for pairs \(s,t \in\mathcal{T}\) with \(s< t\), we have

$$ \phi_{s}(X) \leq\phi_{s}(E^{P}_{t} X). $$
(7.3)

This has a natural interpretation in both a regulatory and a pricing context with \(P\) respectively the real-world and a pricing measure. As shown in the next lemma, the corresponding extra rule for representations, in addition to the three for CRA, is

  • Rule 4: \(\theta_{s}(Q) \geq\theta_{s}(Q P_{t}) \ (Q \in \mathcal{Q}_{s})\).

Lemma 7.1

A valuation \(\phi_{s} \in\mathcal{C}_{s}\) satisfies (7.3) for given \(t>s\) if it has a representation \(\theta_{s}\) satisfying Rule 4 and only if its minimal representation satisfies that rule. Furthermore, \(\theta_{s,t}(Q):=\theta_{s}(QP_{t})\) defines a representation of \(\phi_{s,t}\) if \(\theta_{s}\) satisfies Rule 4.

Proof

If Rule 4 holds, then

$$ \phi_{s}(E^{P}_{t} X) = \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{s}} \big(E^{QP_{t}} X + \theta_{s}(Q)\big) \geq \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{s}} \big(E^{QP_{t}} X + \theta _{s}(QP_{t})\big) \geq\phi_{s}(X). $$

For the only if part, by (2.5) the minimal representation of \(\phi_{s}\) satisfies

$$\begin{aligned} \theta^{\min}_{s}(QP_{t}) & = -\mathop{\mathrm{ess}\;\mathrm{inf}}_{X \in\mathcal{A}_{s}} E^{Q}_{s} (E^{P}_{t} X) \\ & = -\mathop{\mathrm{ess}\;\mathrm{inf}}\{E^{Q}_{s} Z \,|\,Z = E^{P}_{t} X, X \in\mathcal{A}_{s}\} \\ & \leq -\mathop{\mathrm{ess}\;\mathrm{inf}}_{Z \in\mathcal{A}_{s}} E^{Q}_{s} (Z) = \theta^{\min}_{s}(Q). \end{aligned}$$

The last inequality is based on (7.3), implying that \(E^{P}_{t} X \in\mathcal{A}_{s}\) if \(X \in\mathcal{A}_{s}\). For the last claim, notice that because \(X \in L^{\infty}_{t}\), we can always write

$$\phi_{s,t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}_{Q \in\mathcal{Q}_{s}, R \in\mathcal{Q}_{t}} \big(E^{Q}_{s} X + \theta _{s}(QR_{t})\big). $$

By Rule 4 we can restrict the domain to \(R=P\), and the result follows. □

Combining Rule 4 with the matching condition (7.2) yields a fifth rule, namely

  • Rule 5: \(\theta_{t}(QP_{t+1}) = \xi_{t} (Q) \ (t \in \mathcal{T} ', Q \in\mathcal{Q}_{t})\)

for some regular representation \(\xi_{t}\) of the to-be-matched \({\bar {\psi}}_{t}\) with \(\xi_{t}(P)=0\). Notice that in the notation \(\xi_{t}(Q)\), we identify \(Q\) with its restriction to \(\mathcal{F}_{t+1}\). The condition \(\xi_{t}(P)=0\), which can be imposed in view of (5.1), ensures that Rule 5 for \(Q=P\) is not in conflict with Rule 1.

Corollary 7.2

A valuation \(\phi_{0} \in\mathcal{C}_{0}\) exhibits CRA, satisfies the extra requirement (7.3) and has stepwise properties prescribed by (7.2) if and only if there exist regular threshold functions \((\theta_{t})_{t \in\mathcal{T}}\), representing \(\phi_{0}\) and its conditionally consistent updates, that satisfy Rules 15.

We sketch the effect of these rules backward recursively. For the last step, Rule 5 leaves no freedom, implying \(\theta_{T-1}:= \xi_{T-1}\). Then, at \(T-2\), the following properties of \(\theta_{T-2}\) are prescribed, by resp. Rule 1, 3, and 5, for \(Q \in\mathcal{Q}_{T-2}\):

$$\begin{aligned} \theta_{T-2}(P) & =0; \\ \theta_{T-2}(PQ_{T-1} )& = E^{P}_{T-2} \theta_{T-1}(Q) = E^{P}_{T-2} \xi _{T-1}(Q); \\ \theta_{T-2}(QP_{T-1} )& = \xi_{T-2}(Q). \end{aligned}$$

Rules 2 and 4 both put lower bounds on \(\theta_{T-2}\), which can be combined into

$$ \theta_{T-2}(Q'Q_{T-1}) \geq\xi_{T-2}(Q') \vee E^{Q'}_{T-2} \xi_{T-1}(Q). $$

If we impose that \(\phi\) must also satisfy superrecursiveness,

$$ \phi_{s}(X) \geq\phi_{s}\big(\phi _{t}(X)\big), $$
(7.4)

then this would lead to another rule, stronger than Rules 2 and 4, namely

  • Rule 6: \(\theta_{s}(Q'Q_{t+1}) \geq\theta_{s}(Q' P_{t}) + E^{Q'} \theta_{t}(Q)\),

so that the lower bound derived before would increase to

$$ \theta_{T-2}(Q'Q_{T-1} )\geq\xi_{T-2}(Q') + E^{Q'}_{T-2} \xi_{T-1}(Q). $$

The pattern for the remaining steps is the same, and it follows that we can take, once without and once with Rule 6,

$$\begin{aligned} \theta_{t} (Q'Q_{t+1} ) & = \xi_{t}(Q') \vee E^{Q'}_{t} \theta_{t+1}(Q) + \hat {\theta}_{t} (Q'Q_{t+1} ), \end{aligned}$$
(7.5)
$$\begin{aligned} \theta_{t} (Q'Q_{t+1} ) & = \xi_{t}(Q') + E^{Q'}_{t} \theta_{t+1}(Q) + \hat {\theta}_{t} (Q'Q_{t+1} ), \end{aligned}$$
(7.6)

with an incremental threshold function \(\hat{\theta}_{t}\) satisfying, besides regularity,

$$ \hat{\theta}_{t} (Q'Q_{t+1}) \geq0, \quad\text{with equality holding if } Q'P_{t+1}=P \text{ or } PQ_{t+1}=P. $$

For \(\hat{\theta}_{t}\) chosen zero in (7.6), for all \(t \in \mathcal{T}'\), the outcome is \(\psi\). Choosing \(\hat{\theta}_{t}\) maximal in (7.5) or (7.6), that is, infinity where zero is not prescribed, yields the maximum CRA valuation compatible with (7.2) and (7.3). This corresponds to applying risk aversion only in one period, that is, to

$$ {\hat{\phi}}=({\hat{\phi}}_{t})_{t \in \mathcal{T}} \quad \text{with}\quad{\hat{\phi}}_{t} (X) = \mathop{\mathrm{ess}\;\mathrm{inf}}_{u \in\{t,\dots,T-1\}} E^{P}_{t} {\bar{\psi}}_{u} (E^{P}_{u+1} X). $$
(7.7)

Example 7.3

Let \({\bar{\psi}}^{\alpha}_{t}\) denote the one-step conditional valuations corresponding to \(\operatorname{MINVAR}(\alpha+1)\) with parameter \(\alpha\in \mathbb{R}_{+}\). For \(\alpha\in\mathbb{N}\), this amounts to taking the conditional expected value of the minimum of \(\alpha+1\) trials under the reference measure \(P\) (cf. [4]); in particular, \(\bar{\psi}^{0}_{t}(X) = E^{P}_{t}(X)\) for \(X \in L^{\infty}_{t+1}\). Assume that \(\alpha=n \in\mathbb{N}\) corresponds to a reasonable level of risk aversion over one period, and let \(\psi^{n}\) be the recursive valuation (7.1) with stepwise valuation \(\psi_{t,t+1} = {\bar{\psi}}^{n}_{t}\). The maximum CRA valuation satisfying (7.2) is given by (7.7), which amounts to applying \(\operatorname{MINVAR}(n+1)\) in at most one period. The minimum valuation with the same stepwise properties as \(\psi^{n}\) is, of course, \(\psi^{n}\) itself. An example in between these extremes is obtained by setting a limit on the total number of trials until the horizon date \(T\),Footnote 1

$$ \phi^{n}_{t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}\Big\{ {\bar {\psi}} ^{n_{t}}_{t}\Big( \cdots\big({\bar{\psi}}^{n_{T-1}}_{T-1}(X)\big)\cdots\Big) \,\Big|\, n_{t} + {\cdots} + n_{T-1} \leq n\Big\} . $$
(7.8)

Other examples are obtained by replacing the upper bound \(n\) by \(n(T-t)^{\gamma}\), with \(\gamma\in[0,1]\) controlling the level of risk aversion over multiple time steps.

It may be noted that for coherent valuations, as the examples just given, superrecursiveness (7.4) is equivalent to acceptance consistency (4.1a) and hence is always satisfied under CRA. Indeed, acceptance consistency is directly implied by (7.4), and conversely, (4.1a) implies that \(\phi_{s}(X-\phi_{t}(X)) \geq0\) because \(\phi_{t}(X-\phi_{t}(X))=0\), and by coherence then \(\phi_{s}(X) \geq \phi_{s}(\phi_{t}(X)) + \phi_{s}(X-\phi_{t}(X)) \geq\phi_{s}(\phi_{t}(X))\).

7.1 Set-recursive valuation

It may be illuminating to compare the recursive features of \({\hat {\phi}}\), defined by (7.7) with the standard recursive property of \(\psi\) in (7.1). A backward recursive evaluation of \({\hat{\phi}}(X)\) for given \(X \in L^{\infty}\) is quite possible if we keep track of the outcomes of a “double” value function at each time \(t \in\mathcal{T}\), consisting of not only \({\hat{\phi}} _{t}\), but also \(E^{P}_{t}\); in other words, we look at

$$ \big({\hat{\phi}}_{t}(X), E^{P}_{t} X\big) = \big({\bar{\psi}}_{t}(E^{P}_{t+1} X) \wedge E^{P}_{t} {\hat{\phi}}_{t+1}(X), E^{P}_{t} E^{P}_{t+1} X\big). $$
(7.9)

This is an example of what we call set-recursion, a generalization of the standard “singleton” recursion (4.3) in just one \(\mathcal{F}_{t}\)-measurable variable at \(t\), as obeyed by \(\psi\) in (7.1).

More generally speaking, we may consider valuations \(\phi= (\phi _{t})_{t \in\mathcal{T}}\) that are constructed by means of a recursion of the form

$$\begin{aligned} \varPhi_{t}(X) & = {\bar{\varPsi}}_{t} \big(\varPhi_{t+1}(X)\big), \end{aligned}$$
(7.10a)
$$\begin{aligned} \phi_{t}(X) & = \bar{\phi}_{t} \big(\varPhi_{t}(X)\big), \end{aligned}$$
(7.10b)

where the auxiliary quantities \(\varPhi_{t}(X)\) take values in the sets \(L_{t}^{\infty}(\varOmega,\mathcal{F};Z)\) of essentially bounded \(\mathcal{F}_{t}\)-measurable functions with values in a suitable normed vector space \(Z\). These auxiliary quantities are defined recursively by means of the mappings \({\bar{\varPsi}}_{t}: L_{t+1}^{\infty}(\varOmega,\mathcal{F};Z) \rightarrow L_{t}^{\infty}(\varOmega,\mathcal{F};Z)\), and the actual valuations at the time instants \(t\) are produced from \(\varPhi_{t}(X)\) by applying a mapping \(\bar{\phi}_{t}: L_{t}^{\infty}(\varOmega,\mathcal{F};Z) \rightarrow L_{t}^{\infty}\). The idea is that the vector space \(Z\) allows storage of multiple attributes that play a role in valuation. We now discuss this idea more concretely in terms of a parameterized family of valuations.

Definition 7.4

A parameterized family of dynamic valuations \((\phi ^{\alpha})_{\alpha\in A}\) for some index set \(A\) is called set-recursive or, more specifically, \(A\)-recursive, if the following implication holds for all \(X,Y \in L^{\infty}\):

$$\phi^{\alpha}_{t+1}(X) = \phi^{\alpha}_{t+1}(Y) \quad(\alpha\in A) \quad \Rightarrow\quad\phi^{\alpha}_{t}(X) = \phi^{\alpha}_{t}(Y) \quad(\alpha \in A). $$

In other words, \(A\)-recursiveness means that each \(\phi^{\alpha}\) can be recursively specified by (7.10a), (7.10b) with \(\varPhi_{t}(X) = (\phi ^{\alpha}_{t}(X))_{\alpha\in A}\). We take \(A \subset\mathbb{R}\), interpreted as a range of risk aversion levels. For example, if we set \(\phi^{a}:={\hat{\phi}}\) with \(a>0\) interpreted as the overall risk aversion level of \({\hat{\phi}}\) defined in (7.7), and \(\phi^{0}=(E^{P}_{t})_{t \in\mathcal{T}}\), then (7.9) shows that the pair is \(A\)-recursive for \(A=\{0,a\}\) (and \(\phi^{0}\) itself for \(A = \{ 0\}\)).

Within the context of concave valuations, it is an obvious idea to specify \(A\)-recursion in terms of concave single-step valuations and to consider, for instance,

$$ \phi^{\alpha}_{t} = \mathop{\mathrm{ess}\;\mathrm{inf}}_{\alpha' \in A} {\bar{\psi}}^{\alpha ,\alpha'}_{t} \phi ^{\alpha'}_{t+1}. $$
(7.11)

Here \({\bar{\psi}}^{\alpha,\alpha'}_{t}\) is a single-period valuation that specifies how conservative one can be over \([t,t+1]\), under overall risk aversion level \(\alpha\), in combination with applying risk aversion level \(\alpha'\) over the remaining period. We therefore impose that \({\bar{\psi}}^{\alpha,\alpha'}_{t}\) is nonincreasing in \(\alpha \) and nondecreasing in \(\alpha'\). We call \({\bar{\varPsi}}^{\alpha}_{t}:= ({\bar {\psi }}^{\alpha ,\alpha'}_{t})_{\alpha' \in A}\) the generator of \(\phi^{\alpha}\) at \(t\), in analogy to the standard recursive case, in which this operator is independent of \(\alpha'\) and coincides with \(\phi^{\alpha}_{t,t+1}\).Footnote 2

When we take \({\bar{\psi}}^{\alpha,\alpha}_{t} \leq E^{P}_{t}\) (on \(L^{\infty}_{t+1}\)) for all \(t \in\mathcal{T}'\), the dynamic valuation \(\phi^{\alpha}\) satisfies the CRA criterion of Definition 6.1. To obtain the equivalent conditions in Proposition 6.2, so that \(\phi ^{\alpha}_{0}\) is CRA (Definition 6.3), we also assume that \({\bar {\psi}} ^{\alpha ,\alpha'}=\infty\) for \(\alpha' > \alpha\); criterion (4.1a) for acceptance consistency then follows.

This translates to dual representations as follows. Let \(\theta ^{\alpha ,\alpha'}_{t}\) denote a regular representation of \({\bar{\psi }}^{\alpha,\alpha'}_{t}\). In order to satisfy the conditions of Theorem 6.4, we set \(\theta^{\alpha,\alpha'}_{t}(P)=0\) for all \(\alpha' \leq\alpha\). The matching condition (7.2) takes the form \(\theta^{\alpha ,0}_{t} = \xi_{t}^{\alpha}\) (cf. Rule 6) with \(\xi^{\alpha}_{t}\) a regular representation of the single-step valuation that has to be matched by \(\phi^{\alpha}\). The corresponding representations \(\theta^{\alpha}_{t}\) of \(\phi^{\alpha}_{t}\) defined by (7.11) are then given by

$$\begin{aligned} \theta^{\alpha}_{T-1} & = \xi_{T-1}^{\alpha}, \\ \theta^{\alpha}_{t}(Q'Q_{t+1}) & = \mathop{\mathrm{ess}\;\mathrm{inf}}_{\alpha' \in A}\big( E^{Q'}_{t} \theta_{t+1}^{\alpha'}(Q) + \theta^{\alpha,\alpha'}_{t}(Q')\big). \end{aligned}$$

We conclude by pointing out the fact that this setting gives rise to a revision of the very definition of positions. We took as a starting point the specification of a position \(X\) at some future moment \(T\), and correspondingly, we can “artificially” set \(\phi^{\alpha}_{T}(X)=X\) for all risk-aversion levels we consider. However, in many applications, \(T\) is a somewhat arbitrarily chosen horizon date of modeling, and there is no reason to treat \(T\) in a different manner than earlier time instants. So we should allow then for dependency on \(\alpha\) of \(\phi ^{\alpha}_{T}\), and hence of a position \(X\) itself, to reflect the sensitivity of \(X(\omega)\) for risk aversion after \(T\) for each \(\omega\in\varOmega\). In other words, rather than formalizing a position as \(X: \varOmega\rightarrow\mathbb{R}\), we could take \(X^{A}: \varOmega \times A \to \mathbb{R}\) as the fundamental object of valuation with \(A\) a suitable range of risk aversion levels in which \(X^{A}\) is monotone. It is clear that \(A\)-recursive valuations then become recursive in the ordinary sense and can be locally specified in terms of the newly introduced generators.

Example 7.5

The example \(\phi^{n}_{0}\) in (7.8) is an \(A\)-recursive CRA valuation in \(\mathcal{C}_{0}\) for \(A=\{0,\dots,n\}\), with generator \({\bar{\varPsi}}^{n}_{t}= ({\bar{\psi}}^{n-k}_{t})_{k \in A}\) at \(t\), since we can write

$$ \phi^{n}_{t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}\big\{ {\bar {\psi}} ^{n-k}_{t}\big(\phi^{k}_{t+1}(X)\big) \,\big|\, k=0,\dots,n\big\} \leq E^{P}_{t} \phi^{n}_{t+1}(X). $$
(7.12)

To suppress the role of the horizon date \(T\), we can use exponential weights \(\beta^{k}\) for the parameter \(n_{t+k}\) in (7.8) for some \(\beta\in[0,1]\). For \(T\) large and with parameters extended to \(\mathbb{R}_{+}\), (7.12) then transforms into

$$ \phi^{\alpha;\beta}_{t}(X) = \mathop{\mathrm{ess}\;\mathrm{inf}}\big\{ {\bar{\psi}}^{\alpha-\beta\alpha'}_{t}\big(\phi^{\alpha '}_{t+1}(X)\big) \, \big|\, \alpha' \in A \big\} , $$
(7.13)

with \(A = [0,\alpha]\). This constitutes a recursion in “extended” conditional positions \(X^{A}_{t}:=(\phi^{\alpha';\beta}_{t}(X))_{\alpha' \in A}\), specified by the generator \({\bar{\varPsi}}^{\alpha;\beta}_{t}=({\bar{\psi}}^{\alpha-\beta \alpha'}_{t})_{\alpha' \in A}\) at \(t\). Notice that the extra parameter \(\beta\) in (7.13) does not affect stepwise properties and hence can be calibrated to market prices after \(\alpha\) has been tuned to the market at a local time scale.

8 Conclusions

We have given dual characterizations of conditional and sequential consistency of concave valuations. Under the assumption of consistent risk aversion, we have characterized sequential consistency by three straightforward rules for threshold functions, and we have described the freedom still left by these rules when valuation per time step is fully prescribed. The description of set-recursive valuations in terms of generators provides a recursive structure for tuning levels of risk aversion over long and short time periods. We look upon this topic, which can be treated only to a very limited extent under strong time consistency, as an important research theme in the field of dynamic risk measures.

In particular, our analysis eventually led to a refined definition of positions, specifying their conditional value for an entire range of risk aversion levels in each state rather than for just one. For this refined specification of positions, set-recursive valuations regain the strong intuition and computational advantages of backward recursive valuation, which may facilitate the incorporation of this extra dimension in existing frameworks for nonlinear pricing and risk measurement.