1 Introduction

In many circumstances, a decision maker (DM) may evaluate an uncertain prospect not only in absolute terms but also in relative relation to some reference point. Kahneman and Tversky (1979) first introduced the notion of reference dependence, in the seminal Prospect theory, to explain experimental violations of expected utility. Within Prospect theory, deviations from the reference point are weighted by a gain–loss value function, which has the feature, referred to as loss aversion, that losses have more negative value than equal sized gains have positive value.

A different resolution for empirical deviations from expected utility proposes models of multiple priors, in particular MaxMin Expected Utility (MMEU). MMEU, axiomatized by Gilboa and Schmeidler (1989) as an explanation of the Ellsberg Paradox,Footnote 1 considers a DM who holds a family of beliefs regarding the likelihood of events. She evaluates uncertain prospects by the minimum expected utility consistent with any of her beliefs. As such, a MMEU DM displays uncertainty aversion (or, ambiguity aversion), the feature that she prefers to minimize her exposure to uncertainty.

At a purely intuitive level, there seems to be a connection between loss aversion and uncertainty aversion; both behaviors characterize some form of pessimism in comparison with a subjective expected utility (SEU) maximizer. A loss averse DM places more weight on the utility of “bad” events but leaves the probabilities undistorted, whereas an uncertainty averse DM places more weight on the probability of “bad” events but leaves the utilities undistorted. We show in this paper that this connection is more than superficial; there exists a formal connection between reference dependence and attitude toward uncertainty. In particular, we axiomatize a simple class of reference-dependent preferences, called asymmetric gain–loss (AGL) preferences, which can be equivalently represented by a MMEU functional. Within our framework, loss aversion and uncertainty aversion produce identical choice data.

1.1 AGL preferences

In addition to formalizing the connection between reference dependence and ambiguity aversion, AGL preferences provide a simple model of endogenous reference dependence. Our object of choice is a state-contingent contract, or act, which is an assignment of consumption (in utility terms) to each state of the world. AGL preferences evaluate an act according to two components: consumption utility and gain–loss utility. The expected consumption utility of an act is as in the standard SEU model, where the decision maker holds a subjective belief, \(\mu \), over the state space. Her expected consumption utility of an act \(f: S \rightarrow {{\mathbb {R}}}\) is

$$\begin{aligned} {{\mathbb {E}}}_{\mu }[f] = \sum _{s \in S}\mu (s) f(s), \end{aligned}$$
(1.1)

where S is the state space and \(s\in S\) is a generic state. The AGL DM takes this assessment of acts both as the reference point by which gains and losses are measured, and as the baseline level of utility on which gains and losses act as distortions. The main result of this paper is the behavioral characterization of asymmetric gain–loss preferences, which are preferences that can be represented by the functional

$$\begin{aligned} V(f) = \underbrace{{{\mathbb {E}}}_{\mu }[f] }_{\text {Expected Consumption Utility}} + \underbrace{\lambda \sum _{s:f(s)<{{\mathbb {E}}}_\mu [f] }\mu (s)[{{\mathbb {E}}}_\mu [f]-f(s)]}_{\text {Expected Gain/Loss Utility}}. \end{aligned}$$
(1.2)

The first term is the DM’s subjective expected utility without any reference considerations, and the second is the gain–loss utility. The gain–loss term captures the reference effects. When \(\lambda < 0\), the DM is loss averse and receives a utility penalty when the realized utility falls short of her expectation. This utility penalty is linearly scaled by \(\lambda \). In our representation results, all the elements are identified from choice behavior: \(\mu \) and \(\lambda \) are identified uniquely.Footnote 2

1.2 Reference point formation

There are two alternative views on the formation of endogenous reference points. In the first, the DM forms a reference point before making a choice, based on the set of options that she faces.Footnote 3 As such, her beliefs about her own actions will affect the reference point, leading these papers to generally require an equilibrium condition to account for the mutual relationship between reference points and choices.

In the second, the DM’s chosen action completely determines the reference point. Thus, each element of a choice set is associated with its own reference point.Footnote 4 AGL preferences fall into this second category. In particular, the AGL representation is closely related to the notion of choice acclimating personal equilibrium defined by Kőszegi and Rabin (2007), excepting that there the domain of uncertainty is objective risk. A main contribution of our paper is that, by considering the case where probabilities are subjective, we show how it is possible to simultaneously identify both the reference attitude and the beliefs of the DM.

It is worth noting that when the reference point is defined using equilibrium conditions, as in Kőzsegi and Rabin (2006), the joint identification of beliefs and reference effects is generally not possible. The feedback loop between choices and the reference point can lead to intransitivity of the revealed preference, as shown in Gul and Pesendorfer (2006).

Because of this identification problem, and the intrinsic complexity surrounding equilibrium conditions, the latter notion or reference point determination has proved more suitable for applications. Indeed, many applications use AGL preferences: Lange and Ratan (2010) explore how reference dependence can increase the optimal bid in sealed bid auctions (to be more in line with empirical evidence); Herweg et al. (2010) show that loss aversion can explain prevalence of binary incentive schemes (i.e., bonuses) in moral hazard environments; Abeler et al. (2011) show that, in an effort provision experiment, expectations-based reference dependence best explains their data; Karle and Peitz (2014) consider the competition of differentiated firms when buyers exhibit loss aversion. Each of the above-mentioned papers assumed a kinked, piecewise linear gain/loss function and assumed the reference point was the expected consumption utility—exactly the characterization given here. Our work provides the foundational restrictions for such consumer behavior, and shows that beliefs regarding uncertainty and reference effects can be jointly identified.

1.3 A simple example of AGL preferences

We employ the following numerical example to explain the intuition behind the representation, and show how asymmetric gain–loss preferences can explain different types of behavior regarding uncertainty.

Consider the environment of a seller selling a single good to a buyer who makes a take-it-or-leave-it offer. The value to the seller is \(\hat{v}\) which the buyer believes takes the values \(\{5,3,2\}\), with probability .2, .3, and .5, respectively. The buyer has an independent private value for the object given by \(v = 10\). The buyer will submit her offer, b, and the seller will accept or reject the offer. The seller will accept any offer which (weakly) exceeds her value. The buyer’s utility associated with the bid b is given by

$$\begin{aligned} U(b) = \left\{ \begin{array}{l@{\quad }l} v - b &{} \text { if } b \ge \hat{v},\\ 0 &{} \text { otherwise}. \end{array}\right. \end{aligned}$$

It is obvious that the optimal bid will always be in \(\{5,3,2\}\). If the buyer is a risk neutral expected utility maximizer, her optimal bid solves

$$\begin{aligned} \underset{b \in \{5,3,2\}}{\max } \quad (v - b)\text {Prob}(b \ge \hat{v}). \end{aligned}$$

The optimal bid is \(b^{RN} = 3\), which has an expected utility of 5.6 before the bid is placed.

Now suppose that the buyer has gain–loss preferences: in addition to the expected value she wants to avoid losses, so she subtracts any expected losses from the expected consumption utility to determine the valuation (this corresponds to the parametrization \(\lambda = -1\)). A loss for her is any outcome where her ex-post utility is worse than the expected value; therefore, outcomes in which she does not obtain the item are considered losses.

Her expected AGL utility, taking into account her gain–loss preferences, of making the bid \(b = 3\) is given by:

$$\begin{aligned} \underbrace{5.6}_{\text {Expected Cons. Utility}} - \underbrace{\left( .2\right) \left( 5.6 - 0\right) }_{\text {Expected Loss Penalty}}, \end{aligned}$$

which is equal to 4.48. The optimal AGL bid, however, is \(b^{AGL} = 5\), which provides a constant utility, and hence an AGL utility of 5.

When the buyer in this simple take-it-or-leave-it example takes into consideration expected gains and losses in addition to the standard expected utility, she is better off increasing her bid. Intuitively, she sacrifices her payoff in good outcomes (where she obtains the item) in order to decrease the chance of bad outcomes (not obtaining the item). While her payoff is smaller contingent on obtaining the good, the outcome is favorable more often and she increases her ex-ante utility.

A similar behavior could be captured by a buyer who was averse to ambiguity. To see this, assume that the buyer, instead of having AGL preferences, has MMEU preferences. Further, the buyer believes that the distribution of the seller’s valuation is contained in the following set of distributions over \(\{5,3,2\}\):

$$\begin{aligned} C = \left. {\left\{ \begin{array}{ll} {[}.75, .15, .10], [.35, .51, .14], [.40, .24, .36],\\ {[}.60, .36, .04],[.65, .09, .26], [.25, .45, .30] \end{array}\right. } \right\} . \end{aligned}$$

As such, the buyer chooses a bid according to:

$$\begin{aligned} \underset{b}{\max } \left( \underset{\nu \in C}{\min } \ (v - b)\text {Prob}_{\nu }(b \ge \hat{v}) \right) . \end{aligned}$$

It is straightforward to check that the optimal bid is \(b^{MMEU} = 5\). Therefore, a AGL bidder with \(\mu = [.2, .3,.5]\) and \(\lambda =-1\) and a MMEU bidder with multiple priors given by C will make the same optimal bid. While at first glance this connection may seem contrived, in fact, the AGL and MMEU DMs choose identically not only in this bidding game but in all decision problems—they have identical preferences.Footnote 5 We show in Sect. 3 that AGL behavior can always be equivalently described by a MMEU decision maker.

1.4 Structure of the paper

The rest of the paper is structured as follows. Section 2 provides an axiomatic characterization of the preferences, discusses the concept of the alignment of acts, which is instrumental for the endogenous determination of a reference point, and formally defines the utility representation. Section 3 explores the link between gain–loss and ambiguity attitudes. Section 4 puts forth comparative statics results. Section 5 contains a literature review. All proofs are contained in the “Appendices.”

2 Axiomatization

In this section, we formally present the choice environment and a set of axioms which prove to be necessary and sufficient for the representation presented later in this section.

Let \(S = \{s_1,s_2,\ldots ,s_n\}\) be a finite set of states of the world that represent all possible payoff-relevant contingencies for the DM; any \(E\subseteq S\) is called an event. Define \(\mathcal {E} = \mathcal {P}(S){\setminus }\{\emptyset , S\}\) as the set of all non-trivial events. Denote by \(\mathcal {F} = {{\mathbb {R}}}_{+}^S\) the set of all acts, that is, functions \(f:S\rightarrow {{\mathbb {R}}}_{+}\) (endowed with the standard Euclidean topology). We interpret the act f as providing the payoff f(s) in state \(s \in S\) and assume it is the utility received by the DM when f is chosen and s is realized.Footnote 6

Take the mixture operation on \(\mathcal {F}\) as the standard pointwise mixture, where for any \({\alpha }\in [0,1]\), \({\alpha }f+(1-{\alpha })g\in \mathcal {F}\) gives \({\alpha }f(s)+(1-{\alpha })g(s)\in {{\mathbb {R}}}_{+}\) for any \(s\in S\). Abusing notation, any \(c \in {{\mathbb {R}}}_{+}\) can be identified with the constant act \(c(s) = c\) for all \(s\in S\). Let \(\mathcal {F}_c \cong {{\mathbb {R}}}_{+}\) be the set of constant acts. Preferences on \(\mathcal {F}\) are denoted by the binary relation \(\succsim \); \(\succ \) and \(\sim \) represent, respectively, the asymmetric and symmetric components of \(\succsim \). For each \(f\in \mathcal {F}\), if there is some \(c_f \in \mathcal {F}_c\), such that \(f\sim c_f\), then call \(c_f\) the certainty equivalent of f. Before we can specify the behavioral restrictions on preference that correspond to the AGL utility representation, we need to consider some particular structures in the choice domain.

2.1 Balanced pairs of acts

A particularly important type of act to study AGL preference is given by those that provide perfect hedges against uncertainty. Hedging gets rid of uncertainty, and therefore, it also removes all possible gain–loss considerations from the act. Call a pair of acts \((f,\bar{f})\)balanced if they provide a perfect hedge and are indifferent to each other.Footnote 7 The importance of balanced acts is that eliminating subjective gain–loss considerations allows an analyst to identify beliefs from preferences.

Definition 1

Two acts f and \(\bar{f}\) are balanced if \(f\sim \bar{f}\), and for any states \(s, s'\in S\)

$$\begin{aligned} \frac{1}{2} f(s) + \frac{1}{2}\bar{f}(s) = \frac{1}{2} f(s')+\frac{1}{2}\bar{f}(s'). \end{aligned}$$

If there exists \(e_f\in \mathcal {F}_c\) such that \(e_f = \frac{1}{2}f(s)+\frac{1}{2}\bar{f}(s)\) for all \(s\in S\), we call \(e_f\) the hedge of f. \((f,\bar{f})\) is referred to as a balanced pair, and \(\bar{f}\) is a balancing act of f (and vice-versa).

When the notation \(\bar{f}\) is used, it is always in reference to the balancing act of \(f\in \mathcal {F}\). The conditions imposed on preferences below guarantee that \(c_f\) and \(e_f\) are unique and well defined for each f.

2.2 Act alignment: separating positive and negative states

Balanced acts will provide a behavioral way of separating gains and losses. We require that when the outcome in state s is considered a gain for f, the outcome on state s is considered a loss for \(\bar{f}\). This is a natural requirement given that f and \(\bar{f}\) provide a perfect hedge to the DM. Hence, \(\bar{f}\) has the exact opposite gain–loss composition of f. For an act, define positive states as those states that deliver gains, and negative states as those states that deliver losses.

Definition 2

Let \((f,\bar{f})\) be a balanced pair. Say \(s\in S\) is a positive state for f if \(f(s) \ge \bar{f}(s)\), and a negative state for f if \(\bar{f}(s) \ge f(s)\). If a state is both positive and negative (i.e., \(f(s) = \bar{f}(s)\)) say s is a neutral state for f.

Any balanced pair of acts induces a set of partitions, each of which splits the state space into two events: one event that contains only positive states for f (\(\{s\in S|f(s) \ge \bar{f}(s)\}\)) and one event that contains only negative states for f (\(\{s\in S|\bar{f}(s)\ge f(s)\}\)). We use the convention that neutral states can be labeled as either positive or negative (but not both).Footnote 8 When there are no neutral states, each act has a unique way of partitioning the states into positive and negative. These partitions associated with each act are called the alignment of the act. We use the convention that the alignment of the act is represented by the event that includes the positive states E (the complement is the negative states) rather than saying that the alignment is represented by the partition \(\{E,E^c\}\).

Definition 3

Let \((f,\bar{f})\) be a balanced pair. For any \(f\in \mathcal {F}\), say f is aligned with the event \(E\in \mathcal {E}\) if for all \(s\in E\), \(f(s)\ge \bar{f}(s)\) and for all \(s\in E^c\), \(\bar{f}(s) \ge f(s)\).

For every \(E\in \mathcal {E}\), there is a set of acts that is aligned with E.

Definition 4

Given any event \(E\subset S\), define \(\mathcal {F}^E\) be the set of acts where the positive states are contained in E, i.e.,

$$\begin{aligned} {\mathcal {F}}^E= \{f\in \mathcal {F}: \ \forall \ s\in E, f(s) \ge \bar{f}(s) \ \text {and} \ \forall s\not \in E, \ \bar{f}(s) \ge f(s) \}, \end{aligned}$$

where \(\bar{f}\) is the balancing act to f.

Note that any constant act is its own balancing act, and therefore, constant acts are aligned with all partitions of the state space. It is useful to consider acts that have only one alignment, which are called single-alignment acts. These acts are important because they are acts where small perturbations on outcomes do not change the alignment.

Definition 5

Let \((f,\bar{f})\) be a balanced pair. Then, f is single-alignment act if for no \(s\in S\), \(f(s) = \bar{f}(s)\).

If the event E represents an alignment of f, every subset of E or \(E^c\) is called a non-overlapping event. These are the events where all the states are either all positive or all negative for f, so there is no overlap between positive and negative states for f. Non-overlapping events provide a way of specifying situations where there is no tradeoff between positive and negative states, only across one type of state.

Definition 6

Given \(f\in {\mathcal {F}}\), event \(F\subset S\) is a non-overlapping event for f if every state in F is aligned in the same way.

If follows that F is non-overlapping for \(f \in {\mathcal {F}}^E\) if

$$\begin{aligned} F \subseteq E \text { or } F \subseteq E^c. \end{aligned}$$

Abusing terminology, we say that F is non-overlapping for E whenever F is non-overlapping for all \(f\in {\mathcal {F}}^E\).

With these definitions in mind, we can now specify the behavioral restrictions that are necessary and sufficient to be represented by the AGL functional, as given by Eq. (1.2).

2.3 Standard axioms

The first 3 conditions, A1–A3, are standard axioms in the literature of choice under uncertainty.Footnote 9

  • A1. (Weak Order). \(\succsim \) is complete and transitive.

  • A2. (Continuity). For all \(f\in \mathcal {F}\), the sets \(\{g\in \mathcal {F}: g\succsim f\}\) and \(\{g\in \mathcal {F}: f\succsim g\}\) are closed.

  • A3. (Strict Monotonicity). If for all s, \(f(s) \ge g(s)\) then \(f \succsim g\). If \(f(s) > g(s)\) for some s and \(f(s) \ge g(s)\) for all s, then \(f\succ g\).

2.4 New axioms: mixture conditions

The standard subjective expected utility model from Anscombe and Aumann (1963) is characterized by some version of A1–A3, plus the independence axiom. Independence requires that \(f\succsim g\) if and only if \({\alpha }f + (1-{\alpha }) h \succsim {\alpha }g + (1-{\alpha }) h\) for any \(h\in \mathcal {F}\), and any \({\alpha }\in (0,1)\).

The independence axiom does not hold for AGL preferences because it does not allow for gains and losses to be evaluated differently; convex combinations of acts can change the gain–loss composition of acts, therefore changing the assessments as well. AGL preferences relax independence, but impose three consistency requirements for mixtures of acts.

The first new axiom states that as long as the alignment of acts remains the same when mixing, then independence is preserved. If two acts have the same alignment, taking any mixture of them does not change the composition of gains and losses, so the tradeoff between gains and losses should not change.

  • A4. (Alignment Independence). If \(f,h\in {\mathcal {F}}^E\) and \(g,h \in {\mathcal {F}}^F\) for some \(E,F \in \mathcal {E}\), then \(f\succsim g\) if and only if \({\alpha }f+ (1-{\alpha })h \succsim {\alpha }g + (1-{\alpha })h\) for all \({\alpha }\in [0,1]\).

When considering the families of acts that have the same alignment, Axiom 4 implies the regular independence axiom holds, imposing an expected utility representation over such acts. Further, because constant acts are mutually aligned with all other acts, this provides a connection between the different expected utility representations.

Under the full independence axiom, the preference between an \(\alpha \) mixture of f and h or f and \(h'\) would depend only on the preference between h and \(h'\). Here, on the other hand, mixing acts may change valuations in nonlinear ways: This could happen if either (i) the mixture changes the alignment of states, or (ii) the mixture provides an opportunity to hedge by improving loss states and worsening gain states (or the opposite, if the DM is gain seeking).

The next axiom, Local Mixture Consistency, states that these are the only two reasons for the nonlinearity; Axiom A5 states that the effect of adding a small amount of noise, entirely contained within the positive or negative alignment of an act, depends only on the expected consumption utility of the noise. Because of continuity, a sufficiently small amount of noise will not change the alignment of a single-alignment act. Moreover, if the noise is added only to states that are positive or states that are negative, the mixture does not allow for the possibility of hedging (since any decreasing in loss utility in one negatively aligned state must be added to another negatively aligned state).

  • A5. (Local Mixture Consistency). For any single-alignment acts \(f \in {\mathcal {F}}^E\) and \(g\in {\mathcal {F}}^{E'}\), any event F, which is non-overlapping for bothE and \(E'\), and any \(h,h'\in \mathcal {F}\) such that \(h(s) = \bar{h}(s) = h'(s) = \bar{h}'(s)\) for all \(s\not \in F\), there exists \(\alpha ^* < 1\) such that

    $$\begin{aligned} {\alpha }f + (1-{\alpha })h\succsim {\alpha }f + (1-{\alpha })h' \ \Longleftrightarrow {\alpha }g + (1-{\alpha }) h \succsim {\alpha }g + (1-{\alpha }) h', \end{aligned}$$

    for all \({\alpha }\in ({\alpha }^*,1)\).

To see how the mechanics capture the intuition above: because h and \(h'\) are equal to their hedge—except on a subset of \(E \cap E'\)—they do not contribute any gain/loss considerations except on F. Since all of the variation of h and \(h'\) take place within F, \({\alpha }f + (1-{\alpha })h\) and \({\alpha }f + (1-{\alpha })h'\) will all be aligned the same way for any state \(s \notin F\). Of course, we might still worry that mixing f with h or \(h'\) will alter the alignment differently within F. However, because \(F \subset E\) and f was a single-alignment act, for \(s \in F\), f(s) is strictly better than the expected consumption utility of f. Hence, mixing with very little weight on h or \(h'\) will still not distort the alignment. All of the same considerations hold for g, indicating that the final preference over mixtures depends only on h and \(h'\).

The last axiom imposes a consistency condition on mixtures of acts when reversing the role of gains and losses. Intuitively, the condition requires that the effect of mixing h and f is the opposite to the effect of mixing h and \(\bar{f}\). This condition is called Antisymmetry.

  • A6. (Antisymmetry). For any acts f and g such that \(f\sim g\), for every \(h\in \mathcal {F}\), and for any \(\alpha \in (0,1)\), \( \alpha h + (1-\alpha ) f \succsim \alpha h + (1-\alpha )g\) implies

    1. (i)

      \( \alpha h + (1-\alpha ) \bar{f} \precsim \alpha h + (1-{\alpha }) \bar{g}\), and,

    2. (ii)

      \({\alpha }\bar{h} + (1-{\alpha }) f \precsim {\alpha }\bar{h} + (1-{\alpha }) g\),

    where \(\bar{f}\), \(\bar{g}\), and \(\bar{h}\) are balancing acts of f, g, and h, respectively.

Since f is indifferent to g, the DM has a strict preference between the mixtures only if h is changing the gain–loss component of utility. Consider the case where the DM is loss-biased. Then, \( \alpha h + (1-\alpha ) f \succsim \alpha h + (1-\alpha )g\) whenever f “smooths” out consumption of h more than g does—f provides a better hedge against the loss states of h. Of course, when f and g are replaced with \(\bar{f}\) and \(\bar{g}\), the losses and gains are reversed, and so, \(\bar{f}\) now exaggerates the loss states of h, breaking the indifference between f and g in the opposite direction. The same intuition applies when h is replaced with \(\bar{h}\). Notice, when h is constant, there is no room for hedging, and so, the mixtures with f and g will be indifferent (as dictated by Alignment Independence).

2.5 Representation results

This section provides the main representation results of the paper. Theorem 2.1 introduces the AGL representation as characterized by the above axioms. This section also outlines important preliminary results that highlight the role of particular axioms and elucidate the relation between the AGL representation and other decision theoretic models.

Theorem 2.1

The preference \(\succsim \) satisfies A1–A6 if and only if there exists a probability distribution \(\mu \in \Delta (S)\) such that \(\mu (s) > 0\) for all \(s \in S\), and a real number \(\lambda \in {{\mathbb {R}}}\) such that \(f \succsim g \iff V(f) \ge V(g)\) where \(V: {\mathcal {F}}\rightarrow {{\mathbb {R}}}\) is defined by

$$\begin{aligned} V(f)= {{\mathbb {E}}}_{\mu }[f] + \lambda \sum _{s:f(s)<{{\mathbb {E}}}_\mu [f] }\mu (s)[{{\mathbb {E}}}_\mu [f]-f(s)]. \end{aligned}$$
(AGL)

Moreover, (i) \(\mu \) is a unique probability distribution, (ii) \(\lambda \in {{\mathbb {R}}}\) is unique, and \(|\lambda | < \min _{s \in S} \frac{1}{1-\mu (s)}\).

The bound on \(\lambda \) is a consequence of strict monotonicity: Increasing the payoff in any state must increase the valuation of the act. The bound on \(\lambda \) ensure that the marginal increase in consumption utility outweighs any negative marginal decrease in gain/loss utility. The parameter \(\lambda \) captures the difference between the weight placed on gains and the weight placed on losses, which for the representation is unique. An important application of the representation result from Theorem 2.1 is that it provides an index for reference dependence (\(\lambda \)) that is decoupled from risk attitudes, and can be easily estimated. The fact that \(\mu \) lies in the interior \(\Delta (S)\) is also a consequence of strict monotonicity.

2.5.1 Sketch of the proof and preliminary results

The result is proven in two steps. First, Lemma 2.2 provides a SEU representation on \(\mathcal {F}^E\) established by Axioms A1–A4 (that is, excluding Local Mixture Consistency and Antisymmetry), which can be extended to aggregate preferences across families of mutually aligned acts. Then, we utilize the properties of Axioms A5 and A6 to generate the final result.

Lemma 2.2

The preference \(\succsim \) satisfies A1–A4 if and only if there exists a set of probability distributions over S, \(\{\mu _E\}_{E\in \mathcal {E}}\), such that for \(f\in {\mathcal {F}}^F\) and \(g\in {\mathcal {F}}^{F'}\),

$$\begin{aligned} f\succsim g \ \Longleftrightarrow {{\mathbb {E}}}_{\mu _{F}}[f] \ge {{\mathbb {E}}}_{\mu _{F'}}[g]. \end{aligned}$$

Moreover the set \(\{\mu _E\}_{E\in \mathcal {E}}\) is unique.

Every prior in \(\{\mu _E\}_{E\in \mathcal {E}}\) is different, and Alignment Independence does not imply any structure on the priors. To derive the main result from the representation of Lemma 2.2, Local Mixture Consistency and Antisymmetry are used to guarantee that every prior in the set \(\{\mu _E\}_{E\in \mathcal {E}}\) can be written as functions of one unique prior \(\mu \).

If we mix f with a small amount of noise (either the act h or \(h'\)), where the noisy acts exhibit variation only on F non-overlapping with E, then Local Mixture Consistency guarantees that the preference over the two mixtures depends only on the expected consumption utility of the noise. Hence, for all \(E\in \mathcal {E}\), the conditional distributions—conditioned on F non-overlapping with E—are the same for all \(\mu _E\). That is, for E and \(E'\in \mathcal {E}\), \(\mu _E(\cdot |F) = \mu _{E'}(\cdot |F)\) whenever F is non-overlapping for E and \(E'\). In fact, there is a single \(\mu \) such that \(\mu (s | F) = \mu _{E}(s | F)\), keeping the condition of F.

Since E is non-overlapping with itself, this last point implies that for any \(s,s'\in E\), \(\frac{\mu (s)}{\mu (s')} = \frac{\mu _E(s)}{\mu _E(s')}\), which holds only if for all \(s\in E\), \(\mu (s)= \gamma \mu _E(s)\), \(\gamma \in {{\mathbb {R}}}_{++}\). So on E, \(\mu _E\) is just a constant perturbation of the original prior \(\mu \). Since \(E^{c}\) is non-overlapping with E the same holds for \(E^{c}\).

Then, the distribution \(\mu _E\) can be written in the following way

$$\begin{aligned} \mu _E(s) = \left\{ \begin{array}{l@{\quad }l} \gamma _E^+ \mu (s) &{} \text {if } s\in E, \\ \gamma _E^- \mu (s) &{} \text {if } s\in E^c, \end{array}\right. \end{aligned}$$

where \(\gamma _E^+\) represents how the original prior is perturbed on the positive states (i.e., E), and \(\gamma _E^-\) represents how the original prior is modified on the negative states. Both \(\gamma _E^+\) and \(\gamma _E^-\) are positive numbers from monotonicity of \(\succsim \), where \(\gamma _E^+ \ge 1 \Leftrightarrow \gamma _E^- \le 1\) for every E because \(\mu _E\) is a probability distribution.

Antisymmetry implies a particular relationship between distributions indexed by complementary alignments, which is that for any \(E,F \in \mathcal {E}\),

$$\begin{aligned} \mu _E + \mu _{E^c} = \mu _F + \mu _{F^c}. \end{aligned}$$

Using this observation, we show that for \(s\in E\) the distributions on \(\mu _E\) and \(\mu _{E{\setminus } s}\) depend only on s. In particular, whenever \(s\in E\cap F\),

$$\begin{aligned} \mu _{E} - \mu _{E{\setminus } s} = \mu _F - \mu _{F{\setminus } s}. \end{aligned}$$

Then, using these conditions about the family \(\{\mu _E\}_{E\in \mathcal {E}}\) we show that the difference between the distortion on negative states and positive states is always constant, thus \(\gamma _E^+ - \gamma _E^- = \lambda \) for all \(E\in \mathcal {E}\). Therefore, it is possible to characterize any \(\mu _E\), as functions of \(\mu \), and this constant \(\lambda \) that captures the difference between the negative and positive distortions. That is,

$$\begin{aligned} \mu _E(s)&= \mu (s) \left( 1 + \lambda \mu (E^{c})\right)&\text { if }&s\in E,&\text { and, } \nonumber \\ \mu _E(s)&= \mu (s) \left( 1 - \lambda \mu (E)\right)&\text { if }&s\in E^c. \end{aligned}$$
(2.1)

Finally, this representation of \(\mu _E\) is used to rewrite the representation from Lemma 2.2 in terms of \(\mu \), which yields the desired result.

3 Relation to ambiguity attitude

3.1 Maxmin expected utility

According to Theorem 2.1, the DM who abides by the AGL axioms is probabilistically sophisticated but displays some reference effect. That is, she holds some unique belief, \(\mu \), regarding the state space, and evaluates each act according to this belief and her preferences for outcomes. Nonetheless, Lemma 2.2 states that the same preferences can be represented by a family of distributions, \(\{\mu _E\}_{E\in \mathcal {E}}\), each of which is a distortion of the original belief, \(\mu \). This alludes to a possible relationship between reference effects and attitudes toward uncertainty, which has classically been modeled by a DM who considers a (non-singleton) set of priors.

Definition 7

\(\succsim \) has an MMEU representation if there exists a convex set of priors \(C \subseteq \Delta (S)\) such that

$$\begin{aligned} V^{MM}(f) = \min _{\nu \in C}{{\mathbb {E}}}_{\nu }[f] \end{aligned}$$
(3.1)

represents \(\succsim \).

MMEU, axiomatized by Gilboa and Schmeidler (1989), is characterized by two key conditions: certainty independence and uncertainty aversion. Certainty independence requires that \(f\succsim g\) if and only if for all \({\alpha }\in [0,1]\), \({\alpha }f + (1-{\alpha }) c \succsim {\alpha }g + (1-{\alpha }) c\) where \(c\in \mathcal {F}_c\). Mixing two acts with a common constant act does not reverse the preference between them. Since constant acts are aligned with every \(E\in \mathcal {E}\), Alignment Independence implies certainty independence.

Uncertainty aversion requires that for all fg such that \(f\sim g\) for any \({\alpha }\in (0,1)\), \({\alpha }f + (1-{\alpha }) g \succsim f\). If uncertainty aversion is changed for uncertainty seeking preferences,Footnote 10 then the representation is a Maxmax representation, where the DM evaluates an act according to the prior that maximizes her expectations. It is clear from the representation that if \(\lambda \le 0\) then the DM is uncertainty averse, and if \(\lambda \ge 0\) then she is uncertainty seeking.

Uncertainty aversion can be characterized as a preference for hedging, as hedging reduces the exposure to uncertainty. Moreover, hedging reduces the exposure to negative states. Pushing the utility value in each state closer to the average has more effect on the negative states (because of the loss bias) and hence weakly improves the act.

The formal connection is captured by the following result, which states that asymmetric gain–loss preferences always admit a Maxmin or Maxmax representation and that the set of priors C has a specific structure that is related to the distortion of the (unique) beliefs of the DM.

Theorem 3.1

Suppose \(\succsim \) admits an AGL representation \((\mu , \lambda )\), with \(\lambda < 0\), then \(\succsim \) admits a MMEU representation. Moreover, \(C ={\text {conv}}(\{\mu _E\}_{E\in \mathcal {E}})\), as defined in Lemma 2.2.

Theorem 3.1 has several implications. First, it shows that this form of reference dependence is always tied to a particular attitude toward uncertainty.Footnote 11 So, preferences studied in this paper will always be either uncertainty averse or uncertainty seeking. Second, it gives a precise form to the belief distortion that takes place when gain–loss consideration affect a probabilistically sophisticated DM.

While every AGL representation can be faithfully captured within the MMEU framework, the converse is not true. In the AGL framework, the distorted beliefs keep the relative likelihood of states among gains and among losses unchanged, but, depending on the sign of \(\lambda \), increase or decrease the total weight given to gains (and losses) proportional to the baseline belief. This distortion is a function only of the degree of reference dependence, \(\lambda \), and the baseline prior \(\mu \). In addition, Antisymmetry implies the set of priors is symmetric with respect to all hyperplanes (in the \(|S| - 1\) dimensional simplex) which divide the state space into positive and negative states and which pass through the baseline prior. See Fig. 1; the dashed lines show such symmetries.

Intuitively, this additional symmetric structure imposed on MMEU stems from the fact the reference effects distort utility relative to a reference point. Hence, when translating the utility distortions in the AGL model into the equivalent probabilistic distortions, the symmetries around some reference point is preserved. An arbitrary convex set of priors would not necessarily admit such a baseline prior, and so, could not be translated into a model of reference effects.

Fig. 1
figure 1

The set of priors for \(S = \{s_{1},s_{2},s_{3}\}\) and \(\mu = (.5,.3,.2)\), and \(\lambda = 1\)

3.2 Concave expected utility

Alignment Independence imposes more structure than certainty independence, and therefore, AGL also shares a connection to a class of ambiguity models outside of MMEU. In particular, any loss adverse AGL preference is also a concave expected utility (cavEU) preference. CavEU is a capacity-based model, which considers all possible decompositions of an act into bets over events (where a bet of magnitude \(a_{E} \in {{\mathbb {R}}}_{++}\) on E, is an act that is constant on E and 0 off E, i.e., \(a_{E}\mathbb {1}_{E}\) where \(\mathbb {1}_{E}\) is the characteristic function on E). The preference \(\succsim \) is cavEU if it can be represented by a concave integral introduced by Lehrer (2009).

Definition 8

\(\succsim \) has a cavEU representation if there exists a capacity \(v: 2^{S} \rightarrow [0, 1]\) such that

$$\begin{aligned} V^{cav}(f) = \max \left\{ \sum _{E \in \mathcal {E}} a_{E}v(E) \big | \sum a_{E}\mathbb {1}_{E} = f, a_{E} \in {{\mathbb {R}}}_{++} \right\} \end{aligned}$$
(3.2)

represents \(\succsim \).

The concave integral returns the maximum value of all possible decompositions, when aggregated according to the capacity v. Lehrer and Teper (2015) show that \(\succsim \) is cavEU if and only if it satisfies A1–A3 plus uncertainty aversion, independence with respect to the constant act 0, and co-decomposable independence. This last requirement states, for every non-bet act f, there exist a bet \(a_{E}\) and an act \(f'\) such that (i) \(f = \alpha a_{E} + (1-\alpha )f'\) for \(\alpha \in (0,1)\), and (ii) \(\succsim \) satisfies independence over \(\{\alpha a_{E} + \beta f' | \alpha ,\beta \in {{\mathbb {R}}}_{+}\}\).

Theorem 3.2

Suppose \(\succsim \) admits an AGL representation \((\mu , \lambda )\), with \(\lambda < 0\), then \(\succsim \) admits a cavEU representation with \(v: \mathcal {P}(S) \rightarrow [0,1]\) defined by \(v: E \mapsto \min _{F \in \mathcal {E}}\mu _F(E)\).

The property that AGL preference admits cavEU representations stems from the fact that each act, \(f \in \mathcal {F}^{E}\) can always be decomposed into a bet on E and another act in \(\mathcal {F}^{E}\). Since all these acts share the same alignment, independence holds within the convex-cone generated thereby. As with the set of priors in the MMEU representation, the capacity v is characterized by the lower envelope of the distorted beliefs arising themselves from Lemma 2.2. Of course, this must be, since these functionals represent the same preferences! Note, cavEU and MMEU are not nested models; AGL preferences reside in the non-trivial intersection.Footnote 12

4 Comparative gain/loss attitudes

This section advances comparative statics results relating behavior to elements of the AGL representation. For an act f, recall the hedge, \(e_{f}\), is the constant act which provides the expected consumption utility in every state; the constant equivalent, \(c_{f}\), is the constant act which provides the expected total utility in every state, in other words taking into account gain/loss considerations.

A natural measure for the degree and direction of reference effects is the gap between \(e_f\) and \(c_f\), the hedge and the certainty equivalent. For a loss averse DM, the difference between the hedge and the constant equivalent is how much, in utility terms, she is will sacrifice to avoid having to feel a loss. In the standard SEU model, \(e_f = c_f\), so the SEU model is the baseline case for reference effects.

Definition 9

Let \(\succsim \) be a preference over \(\mathcal {F}\). Say \(\succsim \) is gain-biased if for all \(f\in \mathcal {F}\), \(c_f \succsim e_f\). Say \(\succsim \) is loss-biased if for all \(f\in \mathcal {F}\), \(e_f \succsim c_f\).

Remark 1

An AGL decision maker is gain-biased (respectively, loss-biased) if and only if she is uncertainty seeking (resp., uncertainty averse) if and only if \(\lambda \ge 0\). (resp., \(\lambda \le 0\)).

Remark 1 follows immediately from the observation that \({{\mathbb {E}}}_{\mu }[f] = e_{f}\) and examination of the representing functionals.

Problematically, however, the hedge and constant equivalent of an act depend on the DM’s beliefs, so if we want to be able to compare two DM’s degree of reference dependence we want to disentangle reference dependence from beliefs. To do this, we define \(f\vee \bar{f}\), the join of a balanced pair \((f,\bar{f})\), as the act that gives the DM the best outcome between f and \(\bar{f}\) for each \(s\in S\).

Definition 10

Given any balanced pair \((f,\bar{f})\), define the act \(f\vee \bar{f}\), the join of\((f,\bar{f})\) as

$$\begin{aligned} (f\vee \bar{f})(s) = \left\{ \begin{array}{l@{\quad }l} f(s) &{} \text {if } f(s) \ge \bar{f}(s)\\ \bar{f}(s) &{} \text {if } \bar{f}(s) > f(s) \end{array}\right. . \end{aligned}$$

From the AGL representation, gain–loss utility depends on how much the act deviates state by state from \(e_f\). \(f\vee \bar{f}\) provides the absolute value of the state by state deviations of f from \(e_f\). Thus, the hedge of the join, \(e_{f\vee \bar{f}}\), is the average deviation of f from \(e_{f}\). Then, to capture reference dependence behaviorally across DMs, we focus on acts that have the same hedge: If acts have different hedges, the reference effects can be confounded by the beliefs.

The intuition behind our comparative notion of “more loss biased” is that, holding the hedge constant, the DM prefers an act f with smaller expected losses. Conversely, a DM with gain–bias prefers acts with larger gains. Since we want to consider acts that have the same hedge, the comparative notions of “more gain-biased” and “more loss-biased” depend on a possibly different act for each DM: f for DM 1 and g for DM 2. We use the notation where \(e_f^i\) denotes the hedge of f for DM i. If, in addition, \(e^{1}_{f\vee \bar{f}} = e^{2}_{g\vee \bar{g}}\), then g (evaluated according to \(\mu _{1}\)) has the same variance as g (according to \(\mu _{2}\)). So we say DM 1 is more loss-biased than DM 2, if the exposure to the same variance, keeping the expected consumption utility the same, produces a harsher utility penalty.

Definition 11

Given two preference orders \(\succsim _1\) and \(\succsim _2\), say that \(\succsim _1\) is more loss-biased than \(\succsim _2\) (and \(\succsim _2\) is more gain-biased than \(\succsim _1\)) if for any fg with \(e_f^1 = e_g^2\) and \(e^1_{f\vee \bar{f}} = e^2_{g\vee \bar{g}}\), then for any \(c\in \mathcal {F}_c\), \(f \succsim _1 c\) implies \(g \succsim _2 c\) and \(f \succ _1 c\) implies \(g \succ _2 c\).

Theorem 4.1

Let \(\succsim _i\) admits an AGL representation given by \((\mu _i,\lambda _i)\) for \(i=1,2\). Then, \(\succsim _1\) is more loss-biased than \(\succsim _2\) if and only if \(\lambda _1 \le \lambda _2\).

When \(\mu _1 = \mu _2\), we can say more than Theorem 4.1.

Remark 2

Let \(\succsim _i\) admits an AGL representation given by \((\mu ,\lambda _i)\) for \(i=1,2\). Further assume \(\lambda _{1} \le \lambda _{2} \le 0\). Then, (i) \(f \succsim _1 c\) implies \(f\succsim _2 c\), for all acts f and constant acts c, and (ii) \(C_2 \subseteq C_1\) for the equivalent Maxmin/Maxmax representation from Theorem 3.1.

These addition equivalences stem from the fact that when DMs have the same belief, then for any \(f\in \mathcal {F}\), \(e_f^1 = e_f^2\). In such circumstances, the degree of loss bias is equivalent to the comparative notion of ambiguity aversion from Ghirardato and Marinacci (2002). Remark 2 furthers this link: Whenever \(\succsim _i\) is gain-biased or loss-biased for both DMs, the notion of loss bias is consistent with the representation of comparative ambiguity aversion derived from Gilboa and Schmeidler (1989) (that the more ambiguity averse DM should have a larger set of priors). This observation establishes a clear connection between the idea of “loss aversion” that has been prevalent since Prospect theory, and uncertainty aversion.

These comparative statics results establish an unexplored link between the absolute and comparative notions of gain or loss bias, and existing notions of uncertainty aversion which is worth further exploring. The initial motivation for studying uncertainty was due to the Ellsberg (1961) idea that DMs are not able to formulate unique probabilities over uncertain events. Many models with multiple priors have been developed to capture what is considered “Ellsbergian behavior.” Nonetheless, even if the DM is able to form a unique prior, having gain–loss considerations can appear to contaminate her prior in a way that gives rise to behavior embodied by some multiple priors model. Hence, for AGL preference a probabilistically sophisticated DM can appear to have multiple priors due to gain–loss asymmetry.

5 Related literature

This paper links reference dependence and attitudes toward ambiguity. We show the notion of choice acclimating personal equilibrium (CPE) from Kőszegi and Rabin (2007)—where the reference point is the expectation of consumption utility—provides a clean way to link these two concepts in the domain of choice under uncertainty. Bell (1985), Loomes and Sugden (1986) and Kőzsegi and Rabin (2006) also provide various models where the DM is loss averse with respect to a reference point given by her expected consumption utility of an uncertain prospect.

In many decision theory models, the status quo has been interpreted as a reference point. Giraud (2004a), Masatlioglu and Ok (2005), Sugden (2003), Sagi (2006), Rubinstein and Salant (2007), Apesteguia and Ballester (2009), Ortoleva (2010), Riella and Teper (2014) and Masatlioglu and Ok (2013) provide models of reference dependence, where the reference point is exogenously given. Along with Kőzsegi and Rabin (2006) and Kőszegi and Rabin (2007), other papers that tackle the problem of endogenous reference point determination are Giraud (2004b), Sarver (2011), Ok et al. (2014), and Werner and Zank (2017). The approach in Ok et al. (2014) investigates reference point determination problem under a very general framework, where they do not need an equilibrium condition to characterize reference dependence. Nonetheless in their framework it is impossible to identify reference points and reference effects uniquely.

In Gul (1991), outcomes of a (n objective) lottery are considered either a disappointment or an elation depending on whether they are less than or more than the certainty equivalent. The DM suffers a utility penalty when an outcome is considered disappointing. In contrast, we assume an outcome is disappointing if it is dispreferred to the hedge, rather than the certainty equivalent. Blavatskyy (2010) extends this to a domain where certainty equivalents need not exist.

Dillenberger (2010) shows that Gul’s disappointment averse preferences satisfy negative certainty independence and so admit a cautious expected utility representation a la Cerreia-Vioglio et al. (2015). The later paper also shows that cautious expected utility is, in the objective risk domain, the analogy to MMEU in the subjective uncertainty domain. The connection between AGL and MMEU is therefore the subjective counterpart to the connection between disappointment aversion and cautious EU. AGL preferences (with \(\lambda < 0\)) satisfy negative certainty independence.Footnote 13

In a similar spirit to our paper, Masatlioglu and Raymond (2016) provide an complete characterization of CPE within the domain of objective risk. They show that CPE is exactly the intersection of quadratic preferences and rank-dependent expected utility preferences.

For AGL preferences, the evaluation of acts depends on the state by state variation of the act. Although some papers have studied attitudes toward variation in the context of risk and uncertainty, none relates such attitudes to reference dependence. In the risk domain, Quiggin and Chambers (1998, 2004) measure attitudes toward risk, which depend on the expectation of the lottery and a risk index of the lottery that depends on the variation of the distribution.

From the vantage of attitudes toward ambiguity, AGL preferences are a clear special case of mean-dispersion preferences: Grant and Polak (2013) axiomatize a very general model of mean-dispersion preferences, where an act is evaluated by the representation \(V(f) = \mu -\rho (d)\), where \(\mu \) is the expected consumption utility of f with respect to a given probability, d is the vector of state-by-state utility deviations from the mean, and \(\rho (\cdot )\) is a measure of (aversion to) dispersion.

Many well-known families of preferences such as Choquet EU (Schmeidler 1989), Maxmin EU (Gilboa and Schmeidler 1989), invariant biseparable preferences (Ghirardato et al. 2004), variational preferences (Maccheroni et al. 2006), and Vector EU (Siniscalchi 2009) belong to this family of preferences. Our paper (under loss aversion) corresponds to the specification where \(\rho = \lambda {{\mathbb {E}}}(\min \{d(s),0\})\). The interest in studying this special case is twofold. First, mean-dispersion preferences are so general that it is predominantly not possible to identify the DM’s baseline prior. (Although some authors do provide various additional restrictions that facilitate identification.) The additional structure imposed in this paper precipitates not only the identification of beliefs, but also the comparative statics results presented in Sect. 4. By taking a stand on way dispersion affects utility (i.e., via linear loss aversion), we can more thoroughly relate the parameters of the representation to behavioral patterns. The second motivation is the ubiquity of AGL (or very similar) preferences in applications. As outlined in Sect. 1.2, linear loss aversion with respect to expected consumption utility has proven to be a popular way of representing reference dependence in applied work. This paper, therefore, precisely outlines the tacit assumptions made in such applications.