Price discrimination with loss averse consumers

Hahn, Jong-Hee; Kim, Jinwoo; Kim, Sang-Hyun; Lee, Jihong

doi:10.1007/s00199-017-1035-2

Price discrimination with loss averse consumers

Research Article
Open access
Published: 03 February 2017

Volume 65, pages 681–728, (2018)
Cite this article

Download PDF

You have full access to this open access article

Economic Theory Aims and scope Submit manuscript

Price discrimination with loss averse consumers

Download PDF

Jong-Hee Hahn¹,
Jinwoo Kim²,
Sang-Hyun Kim¹ &
…
Jihong Lee²

4436 Accesses
19 Citations
Explore all metrics

Abstract

This paper proposes a theory of price discrimination based on consumer loss aversion. A seller offers a menu of bundles before a consumer learns his willingness to pay, and the consumer experiences gain–loss utility with reference to his prior (rational) expectations about contingent consumption. With binary consumer types, the seller finds it optimal to abandon screening under an intermediate range of loss aversion if the low willingness-to-pay consumer is sufficiently likely. We also identify sufficient conditions under which partial or full pooling dominates screening with a continuum of types. Our predictions are consistent with several observed practices of price discrimination.

Accurately measuring willingness to pay for consumer goods: a meta-analysis of the hypothetical bias

Article Open access 07 June 2019

The effects of scarcity on consumer decision journeys

Article 20 September 2018

The pricing strategies of online grocery retailers

Article Open access 28 November 2023

1 Introduction

When facing heterogeneous buyers, price discrimination allows a seller to capture a larger portion of the total market surplus than offering a single product quality. Price discrimination is prevalent, but sellers often employ just a small number of product types, despite our casual and statistical observations that suggest significant heterogeneity among buyers’ willingness to pay. The lack of sufficient product variety has been commonly attributed to the existence of some fixed costs of launching products of different qualities (e.g., Dixit and Stiglitz 1977; Spence 1980). In many instances, however, these costs tend to be small or immaterial, thereby making it difficult to justify the observed patterns of firm strategy by resorting to such costs alone.

Motivated by these observations, this paper proposes a theory of price discrimination that incorporates a now well-established bias from rational decision making, namely consumer loss aversion (Kahneman and Tversky 1979). Specifically, we introduce Kőszegi and Rabin (2006) expectation model of reference-dependent preferences into a standard screening model á la Mussa and Rosen (1978).^{Footnote 1} In our setup, a monopolist seller offers a menu of bundles before a buyer privately observes his willingness to pay and decides whether to make a purchase. As in Kőszegi and Rabin (2006), henceforth referred to as KR, the buyer anticipates his future consumption choice for each possible contingency and experiences “gain–loss utility” with reference to his own past expectation of contingent consumption, in addition to standard “consumption/intrinsic utility.” Furthermore, the expectation must be correct; that is, it must be consistent with the buyer’s optimal consumption choice in each realization of uncertainty. This requirement of rational expectation, or personal equilibrium (PE), implies that the menu must satisfy incentive compatibility and (ex post) participation constraints that account for reference-dependent preferences and loss aversion.^{Footnote 2}

In addition to the large existing literature documenting empirical support for loss aversion in a variety of economic situations, a slew of recent studies point to the specific role played by expectations in the formation of reference points (e.g., Mas 2006; Abeler et al. 2011; Card and Dahl 2011; Crawford and Meng 2011; Ericson and Fuster 2011; Gill and Prowse 2012; Sprenger 2015). The price discrimination setting offers a natural ground to explore the expectation-based approach to reference point formation, because its essential ingredient is the uncertainty of consumer demand. We usually know products that are available before discovering the specific conditions that determine our preferences. Consider, for example, a sports fan whose willingness to pay for the sports TV package is influenced by the performance of his favorite team during pre-season. This consumer may form an expectation that he would purchase the premium package if and only if the team ends up having a promising pre-season. But, once the regular season starts, he compares the expected purchase to what he could have consumed.

We show that loss aversion indeed serves to limit the benefits of price discrimination and can even result in the optimality of a full pooling menu in a situation where buyers with standard preferences would be separated via a menu with strictly increasing quality-price schedule. Moreover, the expectation-based approach brings into play an additional determinant of optimal contractual form: It depends on an interplay between the extent of consumer loss aversion and the shape of the distribution of consumer’s willingness to pay. In particular, our results suggest that given a (sufficient) level of loss aversion, the firm is more likely to shy away from screening in markets with large population of consumers with low willingness to pay.

Our main message is most clearly conveyed in the case of binary consumer types, where the effect of loss aversion manifests itself in two ways. First, when each consumer compares the alternative of non-participation to the bundle of his choice ex post, he experiences a loss on quality and a gain in money. Thus, as the consumer becomes more loss averse, he becomes willing to pay more for a given quality, which implies that the seller can profitably increase the quality for the type whose participation constraint is binding (i.e., the low willingness-to-pay type).

Second, for the consumer who acquires an information rent (i.e., the high willingness-to-pay type), deviation to lower quality-price bundle leads to another channel of gain–loss comparisons across the two utility dimensions. In this case, however, the comparison is weighted by the ex ante likelihood of the alternative event. Given loss aversion, the deviation incentive would be greater when the low willingness-to-pay type, and thus a lower price, was anticipated with a larger probability.

The combination of the above two effects generates the following: When the likelihood of low willingness-to-pay consumer is sufficiently large and the degree of loss aversion lies in an intermediate range, the seller’s optimal strategy is to offer the same bundle to both types.^{Footnote 3} In the case of a continuum of consumer types, focusing on the case in which full separation is optimal under standard preferences, we establish conditions under which partial or even full pooling is optimal among menus with monotone quality and price.

In our model, multiple personal equilibria may arise from a menu. Our treatment above follows the standard mechanism design approach by assuming that the firm can select the PE and hence focusing on truthful self-selection. An alternative approach suggested by KR is to assume that it is the consumer who is capable of choosing his favorite PE, or the preferred personal equilibrium (PPE). We derive the optimal menu under the concept of PPE and binary consumer types, and show that a pooling menu continues to be optimal under a wide range of parameter values. To our knowledge, this is the first non-trivial analysis of PPE in a model of adverse selection to date.

Our paper contributes to the growing literature on firm behavior under boundedly rational agents (see the surveys of Ellison 2006; Spiegler 2011; Kőszegi 2014). Among this literature, monopolist’s screening problems with loss averse consumers have recently been studied independently by Herweg and Mierendorff (2013), Orhun (2009), and Carbajal and Ely (2016).

Herweg and Mierendorff (2013) consider a seller who chooses from two-part tariffs for a loss averse consumer with uncertain demand and demonstrate the optimality of flat tariff. They model the consumer also in the frame of KR, but with gain–loss arising only from the money dimension, and characterize optimal contract when the consumer can commit to ex ante participation. Our analysis differs in several aspects. First, our setup allows for both ex post and ex ante participation. Second, we consider a general class of menus under gain–loss utilities that arise from both money and quality dimensions and derive the precise channel via which consumer loss aversion generates bunching over quality as well as price. Moreover, our treatment of gain–loss utility gives rise to non-trivial PPE analysis.

In Orhun (2009) and Carbajal and Ely (2016), the seller offers a menu to a consumer who already knows his type and admits an exogenously given reference point that is type-dependent. These authors also demonstrate the possibility of optimal pooling. However, their utility models do not involve gain–loss comparisons across multiple types; moreover, the issue of optimal menus that are PE (PPE) is not explored. The main concern of Carbajal and Ely (2016) is to explain how the shape of optimal contract depends on the reference point.

Loss aversion has been fruitfully incorporated in other contexts of firm behavior. Heidhues and Kőszegi (2014), Spiegler (2012), and Rosato (2013) consider monopoly pricing with complete information. In these models, the monopolist can optimally commit to a random pricing strategy. In contrast, we explore the role of loss aversion in a model with demand uncertainty and menu contracts. The consumer’s expectations concern his future demand, not the price realization. Courty and Nasiry (2015) derive the uniformity of optimal price irrespective of product quality in a monopoly model with consumer loss aversion and random utility shocks. They do not however address the issue of screening as we do here.

Also using the KR model, Heidhues and Kőszegi (2008) explain why firms with differentiated products and heterogeneous costs may end up charging a uniform price. The competition model of differentiated products is also explored by Karle and Peitz (2013) and Zhou (2011). De Meza and Webb (2007) and Herweg et al. (2010) study the role of loss aversion in agency problems. In auctions, loss averse bidders are introduced by Lange and Ratan (2010) and Eisenhuth (2010), and Grillo (2013) considers a cheap talk game in which the receiver is loss averse.

Finally, our paper complements other approaches aimed at understanding the implications of biased consumers for monopolist behavior. Time-inconsistent preferences or self-control problems have been explored in the context of contract design by DellaVigna and Malmendier (2004), Eliaz and Spiegler (2006), Esteban et al. (2007) and Heidhues and Kőszegi (2010); Eliaz and Spiegler (2008) and Grubb (2009) investigate the role of overconfident consumers. Jeleva and Villeneuve (2004) show that pooling menu could be optimal in an insurance model with adverse selection if the consumer has imprecise belief about the underlying risk. Here, optimal pooling arises if the likelihood of low risk consumer is sufficiently large; however, this parameter also affects the corresponding insurance coverage, while in our optimal pooling menu the product quality depends on the degree of loss aversion and not on the distribution of willingness to pay.

This paper is organized as follows. Section 2 lays out a price discrimination setup with KR’s reference-dependent preferences for the binary-type case. In Sect. 3, we characterize the optimal menu in our model by adopting truthful personal equilibrium as the solution concept. The optimal menu under preferred personal equilibrium is characterized in Sect. 4. We discuss some alternative models of reference points, and their consequences, in Sect. 5. Section 6 concludes. All proofs are relegated to the “Appendix” unless mentioned otherwise. We also present the details of some omitted analyses in a Supplementary Material.

2 The setup

2.1 Price discrimination with loss averse consumers

Consider a market that consists of a monopolistic seller of some product and its buyer. Let $b=(q,t) $ denote a “bundle” in which the product of quality q is sold for the payment of t. A “menu” of bundles is referred to as $M \subseteq \mathbb {R}_+^2$. We refer to $\emptyset =(0,0)$ as the null bundle, or outside option. The seller’s profit from a bundle $b=(q,t)$ is $t-cq$, where $c>0$ is the constant marginal cost of production. There is no cost of offering a bundle.

The buyer’s willingness to pay for the product, or “type,” $\theta \in \varTheta $ is unknown at the time of menu offer from the seller but later learned privately at the time of actual consumption. Let F denote the commonly known cumulative distribution function on $\varTheta $.

Upon observing menu M, but before learning his type, the buyer forms a “reference point,” $R: \varTheta \rightarrow M \cup \{\emptyset \}$, which specifies a (deterministic) contingent plan of purchase at each possible type realization (including the possibility of opting out). Let $R(\theta ^{\prime }) = (q^r(\theta ^{\prime }), t^r(\theta ^{\prime }))$ for each $\theta ^{\prime } \in \varTheta $. Given reference point R, type-$\theta $ buyer’s ex post utility from consuming bundle $b = (q,t)$ is given by the sum of two components, “consumption/intrinsic” utility and “gain–loss” utility,” as follows^{Footnote 4}:

$$\begin{aligned} u(b \mid \theta , R) := m(b;\theta ) + \int _{\theta ^{\prime } \in \varTheta } n(b; \theta , \theta ^{\prime }, R(\theta ^{\prime })) {\text {d}}F(\theta ^{\prime }), \end{aligned}$$

(1)

where

the consumption/intrinsic utility is measured by
$$\begin{aligned} m(b;\theta ) := \theta v(q) - t \end{aligned}$$
such that $v(\cdot )$ is a (differentiable) function satisfying $v(0)=0$, $v^{\prime }(\cdot )>0$,$v^{\prime \prime }(\cdot )<0$, ${\lim }_{q\rightarrow 0}v^{\prime }(q)=\infty $ and ${\lim }_{q\rightarrow \infty }v^{\prime }(q)=0$; and
the gain–loss utility is given by
$$\begin{aligned} n(b; \theta , \theta ^{\prime }, R(\theta ^{\prime })) := \mu \left( \theta v(q) - \theta ^{\prime } v(q^r(\theta ^{\prime })) \right) + \mu \left( t^r(\theta ^{\prime })-t \right) , \end{aligned}$$
(2)
where $\mu $ is an indicator function such that, for any $k_{1},k_{2}\in \mathbb {R}_{+}$,
$$\begin{aligned} \mu (k_{1}-k_{2}):=\left\{ \begin{array}{ll} k_{1}-k_{2} &{}\quad \text {if }k_{1}\ge k_{2} \\ \lambda (k_{1}-k_{2}),\lambda >1 &{}\quad \text {if }k_{1}<k_{2}.\end{array} \right. \end{aligned}$$

The utility function in (1) adapts the model of KR to our price discrimination setting. Note that the overall gain–loss utility here is measured in expectation over the uncertainty surrounding the payoff type of the decision maker rather than the randomness of outcomes per se (for each type, the reference bundle is deterministic). Each type-$\theta $ buyer compares himself with another hypothetical type $\theta ^{\prime }$; as such, type-$\theta $ buyer experiences gain–loss from the difference between his bundle and that of each hypothetical type $\theta ^{\prime }$ in terms of final utilities. Following Tversky and Kahneman (1991) and Kőszegi and Rabin (2006), we assume that the gain–loss utility is additively separable across the two consumption dimensions, quality and monetary transfer. In Sect. 5, we formally discuss how our utility formulation differs from some alternative formulations of reference point in the price discrimination setup.

The following time line will be useful to illustrate the model and compare it with the standard screening model.^{Footnote 5}

2.2 Personal equilibrium

We now introduce the notion of personal equilibrium proposed by KR, which incorporates the idea that the reference point formed by an economic agent should be in accordance with his actual choices.

Definition 1

Given any menu M, $R: \varTheta \rightarrow M \cup \{\emptyset \}$ is a personal equilibrium (PE) if, for all $\theta \in \varTheta $,

$$\begin{aligned} u(R(\theta )|\theta ,R)\ge u(b|\theta ,R),\quad \forall b \in M \cup \{\emptyset \}. \end{aligned}$$

(3)

We say that R is a truthful personal equilibrium (TPE) if it is a PE given $M=R$.

Condition (3) requires that each bundle $R(\theta )$ in the PE be optimal for type $\theta $ with R as the reference point so that $R(\theta )$ is the bundle the buyer actually chooses if his type turns out to be $\theta $. Note that the equilibrium utility of each type must be no lower than its utility from choosing the null option since the buyer can always opt out after the realization of his type.

In the case of a TPE, the reference point itself is offered as a menu and therefore each type only needs to prefer his choice of bundle over the other type’s bundle or the null bundle. That is, R is a TPE if and only if the incentive compatibility and individual rationality requirements hold as follows: For each $\theta ,\theta ^{\prime } \in \varTheta $,

$$\begin{aligned} u(R(\theta ) |\theta ,R) \ge u(R(\theta ^{\prime }) |\theta ,R)\qquad \qquad \qquad ({\text {IC}}_{\theta }) \end{aligned}$$

$$\begin{aligned} u(R(\theta )|\theta ,R)&\ge u(\emptyset |\theta ,R).\qquad \qquad \qquad \qquad ({\text {IR}}_{\theta }) \end{aligned}$$

Since these inequalities, henceforth referred to as the (IC) and (IR) constraints, are implied by (3), the following result is immediate.

Proposition 1

Suppose R is a personal equilibrium (PE) of some menu M. Then, R is a truthful personal equilibrium (TPE).

This result is a version of revelation principle since it implies that it is without loss to focus on direct menus, i.e., menus in which every bundle is purchased in equilibrium.

The concept of personal equilibrium is not robust to the problem of multiple equilibria, however. When the seller offers a TPE menu R, the buyer might form an alternative reference point $R^{\prime }\ne R$ and play it as a PE so that the seller fails to achieve the desired outcome. Moreover, the alternative PE could give the buyer a higher ex ante expected utility than the TPE. It is possible that the TPE generates a negative ex ante utility with there being another PE in which the buyer does not buy at all.

One approach to resolve the issue of multiple PEs proposed by KR is to assume that the consumer always chooses the PE that maximizes his ex ante expected utility, or the preferred personal equilibrium (PPE). Let $\mathcal {P} (M)$ denote the set of all PEs that can arise when the seller offers a menu M; that is, R belongs to $\mathcal {P} (M)$ if $R\subseteq M \cup \{\emptyset \}$ and R satisfies condition (3). Also, given a menu R, let U(R) denote the buyer’s corresponding ex ante expected utility:

$$\begin{aligned} U (R) := \int _{\theta \in \varTheta } u(R(\theta ) \mid \theta , R) {\text {d}}F(\theta ). \end{aligned}$$

Definition 2

Given any menu M, $R: \varTheta \rightarrow M \cup \{\emptyset \}$ is a preferred personal equilibrium (PPE) if $R \in \mathcal {P} (M)$ and $U(R)\ge U(R^{\prime }) \text{ for } \text{ all } R^{\prime }\in \mathcal {P} (M)$. We say that R is a truthful preferred personal equilibrium (TPPE) if it is a PPE given $M=R$.

We characterize the seller’s profit-maximizing menu of bundles under both notions PE and PPE. In Sect. 3, the seller is assumed to be able to select his favorite PE from the menu that he offers; in Sect. 4, the buyer selects the PPE. While it may be unrealistic to assume that the seller can always manipulate the consumer’s beliefs, it also seems plausible that some consumers would respond naively to the menu on the table when he forms beliefs about his future contingent actions.

In both treatments, we restrict attention to the set of direct menus by focusing on TPE and TPPE. This is without loss for the analysis of PE menu due to Proposition 1, but a similar revelation principle for PPE menus may not hold. To see this, suppose that R is a PPE given some menu $M \ne R$. It is possible that R is not a PPE given itself—that is, R is not a TPPE—because we cannot a priori rule out the existence of some $R^{\prime } \in \mathcal {P} (R)$ that does not belong to $\mathcal {P} (M)$ and generates a higher ex ante expected utility. This failure of revelation principle poses a great challenge for complete analysis of optimal menu design since such analyses rely critically on the revelation principle, as well known from the mechanism design literature. We address this issue in more detail in Sect. 4.3.

3 Optimal TPE menu

3.1 Binary consumer types

We begin by characterizing the seller’s optimal PE menu for the case of binary consumer types. Let $\varTheta = \{\theta _L, \theta _H\}$ such that $0<\theta _{L}<\theta _H$. The probability measure on $\theta _L$ is denoted by $p\in (0,1)$. For ease of exposition, we refer to a reference point in this case simply as $R:=\{r_{L},r_{H}\}$ where $r_{i}=(q_{i}^{r},t_{i}^{r})$ for $i=H,L$.^{Footnote 6}

3.1.1 The seller’s problem

Proposition 1 implies that the set of PE menus is equivalent to the set of TPE menus and hence there is no loss in restricting ourselves to menus that are themselves TPEs. We sometimes refer to such a menu simply as a TPE menu and let $\mathcal {M}$ denote the set of all TPE menus. The seller’s problem, denoted as [P], is given by

$$\begin{aligned} \max _{\{(q_{L},t_{L}),(q_{H},t_{H})\} \in \mathcal {M}} p(t_{L}-cq_{L})+(1-p)(t_{H}-cq_{H}). \qquad \qquad \qquad [P] \end{aligned}$$

Under the reference-dependent preference framework, a broader class of menus can be supported as TPEs, compared to the standard screening model. In particular, it is possible to have the low-type buyer purchasing the higher quality-price bundle and vice versa. Given such a menu, the high type suffers a loss from deviating to mimic the low type and paying more than anticipated, and this no longer supports the usual incentive compatibility argument for the necessity of quality monotonicity of a feasible menu.

One class of menus that can be easily ruled out is one where one type of buyer receives a lower quality but pays more than the other type (including the case of a higher payment for the same quality or the same payment for a lower quality). The reason is simple: If the former type deviates to the latter’s bundle, then he will enjoy a higher gain–loss utility as well as a higher intrinsic utility.

We are therefore left with the following three classes of menus to consider.

1.
Pooling menu $q_{H}=q_{L}$ and $t_{H}=t_{L}$
2.
Screening menu $q_{H}>q_{L}$ and $t_{H}>t_{L}$
3.
Reverse-screening menu $q_{H} < q_{L}$ and $t_{H} < t_{L}$

We let $\mathcal {M}^{P}$, $\mathcal {M}^{S}$, and $\mathcal {M}^{R}$ denote the set of pooling, screening, and reverse-screening menus, respectively, that satisfy the (IC) and (IR) constraints. For the full expressions of these constraints, see Section S.1 of the Supplementary Material.

3.1.2 Symmetric information benchmark

Before the main analysis, we examine the optimal menu when the seller and buyer are symmetrically informed. This will give us an insight into how the informational asymmetry interacts with loss aversion to generate the optimality of pooling. Consider a profit-maximizing seller who is symmetrically informed of $\theta $ and thus can commit to a menu ex ante such that she imposes $(q_{i},t_{i})$ upon observing each type $\theta _{i}$ being realized. Specifically, we modify the seller’s problem [P] by dropping the (IC) constraints. Let us denote by $[P^s]$ the seller’s profit maximization problem among contracts that satisfy the (IR) constraints only.

The following result gives a necessary condition for the optimal menu with symmetric information.

Lemma 1

The solution to $[P^s]$ must be such that $\theta _{H} v(q_{H}) \ge \theta _{L} v(q_{L})$ and $t_{H} \ge t_{L}$.

Using the above Lemma and the fact that both (IR) constraints are binding, we obtain

$$\begin{aligned} t_{L} = \frac{(\lambda +1)}{2} \theta _{L} v(q_{L}) \; \text{ and } \; t_{H} =t_{L} + \frac{\theta _{H} v (q_H) -\theta _{L} v(q_{L})}{B(p,\lambda )}, \end{aligned}$$

(4)

where

$$\begin{aligned} B(p,\lambda ) :=\frac{1+(1-p) +p\lambda }{1+p +(1-p)\lambda }. \end{aligned}$$

(5)

Here, $B(p,\lambda )$ measures the relative impact of loss aversion on deviation incentives in our model, where gain–loss utilities arise stochastically in both quality and monetary dimensions. Deviating from purchasing the reference bundle to the null bundle induces a loss in quality but a gain in money. Notice that $B(p,1)=1$.

Assuming $\theta _{H} v (q_H) > \theta _{L} v(q_{L})$ at the optimum,^{Footnote 7} we can plug (4) into the objective function and take the first-order conditions to obtain

$$\begin{aligned} \frac{c}{v^{\prime }(q_{L})}&= \frac{ \left[ (\lambda +1)B(p,\lambda )-2(1-p)\right] \theta _{L}}{2p B(p,\lambda )} \end{aligned}$$

(6)

$$\begin{aligned} \frac{c}{v^{\prime }(q_{H})}&= \frac{\theta _{H}}{B(p,\lambda )}. \end{aligned}$$

(7)

Note from (6) and (7) that $q_{L} \ge q_{H}$ if and only if

$$\begin{aligned} \frac{(\lambda +1)B(p,\lambda ) -2(1-p)}{2p} \ge \frac{\theta _{H}}{\theta _{L}}, \end{aligned}$$

(8)

which holds for $\lambda $ exceeding some threshold since $(\lambda +1)B(p,\lambda )$ strictly increases in $\lambda $ without bound. Thus, with $\lambda $ high enough to satisfy (8), the symmetrically informed seller can maximize profit by endowing the low type with a higher quality but charging the high type with a larger transfer (see (4)). Note that the optimal qualities are the same across the two types only when (8) holds as equality, which is a knife-edge phenomenon. Furthermore, the same quality does not necessarily mean the same transfer.

This implies that pooling menu, which is the main focus of our analysis, does not arise when the buyers are loss averse but do not hold private information. Neither does it emerge as a consequence of asymmetric information alone, as in Mussa and Rosen (1978). The optimality of pooling is indeed a consequence of the interplay between loss aversion and asymmetric information, as we demonstrate in later sections. Intuitively, pooling will emerge as the optimal menu when the quality reversal is desirable due to loss aversion but is not feasible in the presence of asymmetric information.

3.1.3 Results

We now turn to the analysis of [P], i.e., finding an optimal menu when the seller and buyer are asymmetrically informed. A unified analysis of all possible menus is not available since different classes of menus entail different forms of gain–loss utility. Our analysis below considers each class separately to identify an optimal menu within that class, which will then lead us to characterize the overall optimal menu. Note that any pooling menu lies on the boundary of the set of feasible screening menus ($\mathcal {M}^S$) or reverse-screening menus ($\mathcal {M}^R$). The optimality of pooling will thus arise if two inequality constraints, $q_H \ge q_L$ and $q_H \le q_L$, which we impose to find an optimal menu within $\mathcal {M}^S$ and $\mathcal {M}^R$, turn out to be binding. In what follows, whenever we mention an “optimal screening (pooling or reverse-screening) menu,” it will mean optimality within the set of screening (pooling or reverse-screening) menus.

Pooling menu

We begin by characterizing the seller’s profit-maximizing choice within the class of the pooling menu. Consider a pooling menu $R =\{r= (q,t)\} \in \mathcal {M}^{P}$. Clearly, the $({\text {IR}}_{H})$ constraint is implied by the $({\text {IR}}_{L})$ constraint since, if both types choose the same bundle, type $\theta _{H}$ is better off in terms of both intrinsic and gain–loss utilities while the outside payoff is type-independent. Now, $({\text {IR}}_{L})$ can be written as

$$\begin{aligned} u(r|\theta _{L}, R )&= \theta _{L}v(q)-t-(1-p)\lambda (\theta _{H}-\theta _{L})v(q) \\&\ge u(\emptyset |\theta _{L}, R) =p[t-\lambda \theta _{L}v(q)]+(1-p)[t-\lambda \theta _{H}v(q)], \end{aligned}$$

or after rearrangement,

$$\begin{aligned} t \le \frac{(\lambda +1)}{2}\theta _{L} v(q). \end{aligned}$$

(9)

Clearly, (9) must be binding at the optimum. The following result is then immediate from the first-order condition of the seller’s profit maximization.

Proposition 2

The optimal pooling menu, $\{(q^{p}, t^{p})\}$, is such that $\theta _{L}v^{\prime }(q^{p})=\frac{2c}{\lambda +1}$.

Thus, the seller finds it optimal to sell a higher quality to a consumer with higher $\lambda $. This is because the buyer wants to avoid the loss from non-participation and, therefore, is willing to pay more for a given amount of consumption if he is more loss averse, as can be seen in (9) above.

Screening menu

Consider a screening menu $R = \{r_{L} =(q_{L},t_{L}), r_{H}=(q_{H},t_{H})\}\in \mathcal {M}^{S} $ where $q_{L}<q_{H}$ and $t_{L}<t_{H}$. As in the standard screening model, we can show that the $({\text {IC}}_{H})$ and $({\text {IR}}_{L})$ constraints are binding at the optimum while the other constraints are not. Using a similar derivation to (9), the $({\text {IR}}_{L})$ constraint can be written as

$$\begin{aligned} t_{L} \le \frac{\lambda +1}{2}\theta _{L}v(q_{L}), \end{aligned}$$

(10)

which must be binding at the optimum. Thus, for the same reason as in the optimal pooling menu above, the optimal quality for the low type increases with loss aversion. We refer to this as the participation effect of loss aversion, meaning that a greater aversion to the loss resulting from comparison with non-participation enables the seller to charge more and thus increase the quality for the low-type consumer.

Next, write the $({\text {IC}}_{H})$ constraint as

$$\begin{aligned} u(r_{H}|\theta _{H},R)&= \theta _{H}v(q_{H})-t_{H}+p[\theta _{H}v(q_{H})-\theta _{L}v(q_{L})-\lambda (t_{H}-t_{L})] \\&\ge u(r_{L}|\theta _{H},R) = \theta _{H}v(q_{L})-t_{L}+p(\theta _{H}-\theta _{L})v(q_{L}) \\&\quad +(1-p)[(t_{H}-t_{L})-\lambda \theta _{H}(v(q_{H})-v(q_{L}))], \end{aligned}$$

which can then be rewritten as

$$\begin{aligned} {[}1+(1-p)+p\lambda ] (t_{H}-t_{L})\le [1+p+(1-p)\lambda ]\theta _{H}[v(q_{H})-v(q_{L})]. \end{aligned}$$

(11)

The benefit of type $\theta _{H}$ deviating to $r_{L}$, captured by the LHS of (11), consists of reduced payment, $t_{H}-t_{L}$, and its positive impact on the gain–loss utility, $(1-p + p\lambda )(t_{H}-t_{L})$. To understand the latter, note first that the gain from paying $t_{L}$ instead of $t_{H}$ is weighted by the probability $1-p$ with which the buyer expected the payment to be $t_{H}$. At the same time, by the deviation, the high type avoids the loss equal to $\lambda (t_{H}-t_{L})$ that he would have incurred from sticking with his equilibrium choice, which is weighted by the probability p with which $\theta _{L}$ would have occurred.

The cost of deviation, captured by the RHS of (11), results from a reduced quality from $q_{H}$ to $q_{L}$ and can be explained similarly. One can then see that $B(p,\lambda ) =\frac{1+(1-p) +p\lambda }{1+p +(1-p)\lambda }$, defined previously in (5), reflects the relative (benefit–cost) impact factor of deviating to a lower quality, lower price bundle, which would result in a monetary gain but a quality loss.

When binding, (11) can be written as

$$\begin{aligned} t_{H} = t_{L} + \frac{\theta _{H} [v (q_{H}) -v(q_{L})]}{B(p,\lambda )}. \end{aligned}$$

(12)

Notice from (11) that higher $\lambda $ amplifies both the benefit and cost of deviation. If a higher $\lambda $ makes $B(p,\lambda )$ larger (smaller), then the loss aversion makes screening less (more) effective in enabling the extraction of more payment from the high type. We will refer to this as the screening effect of loss aversion, which could be favorable or adverse to the seller depending on the value of p. Also, (12) implies that, for fixed $\lambda $, the effectiveness of screening is decreasing in the likelihood of low type, i.e., $B(p,\lambda )$ is increasing in p.

Now, we describe the optimal screening menu and compare it with the optimal pooling menu.

Proposition 3

(a)
The optimal screening menu, $\{(q^s_L,t^s_L),(q^s_L,t^s_H)\}$, is such that
$$\begin{aligned} \frac{c}{v^{\prime }(q^s_{L})}&= \max \left\{ \frac{ (\lambda +1)B(p,\lambda )\theta _{L}-2(1-p)\theta _{H}}{2p B(p,\lambda )},0 \right\} \end{aligned}$$
(13)

$$\begin{aligned} \frac{c}{v^{\prime }(q^s_{H})}&= \frac{\theta _H}{B(p,\lambda )}, \end{aligned}$$
(14)
where $q^s_L$, if not equal to 0, increases in $\lambda $ and $q^s_H$ decreases (increases) in $\lambda $ if $p > \frac{1}{2}$ $(p<\frac{1}{2})$.
(b)
Any screening menu is dominated by the optimal pooling menu if and only if
$$\begin{aligned} \frac{\theta _H}{\theta _{L}} \le \left( \frac{\lambda +1}{2} \right) B(p,\lambda ), \end{aligned}$$
(15)
which in turn holds if and only if $\lambda \ge \lambda _S $ for some threshold $\lambda _S >1$ that decreases in p and increases in $\frac{\theta _H}{\theta _L}$.

In part (a) of Proposition 3, the optimal quality $q_{L}$ increasing with $\lambda $ should be expected from the participation effect. The behavior of $q_{H}$ is related to the fact that $B(p,\lambda )$ increases with $\lambda $ if and only if $p > \frac{1}{2}$: That is, a higher $\lambda $ means the adverse (favorable) screening effect if $p > \frac{1}{2}\left( p< \frac{1}{2}\right) $.

Part (b) states the condition under which pooling dominates screening. The inequality (15) holds when the participation effect, measured by $\frac{\lambda +1}{2}$ [see (10) above], is large and/or when the screening effect works against the profitability of screening as $B(p,\lambda )$ gets large. There are a couple of noteworthy observations here. First, with sufficiently large $\lambda $, the dominance of pooling over screening remains even when $p<\frac{1}{2}$ such that the screening effect works favorably for the screening seller. This is because the participation effect dominates the screening effect, namely $\frac{\lambda +1}{2}$ increases with $\lambda $ faster than $B(p,\lambda )$ decreases. Second, the threshold, $\lambda _S\left( p,\frac{\theta _{H}}{\theta _{L}}\right) $, is decreasing in p, and this implies that screening is less attractive relative to pooling when the low-type consumers are more abundant. This follows from the fact that a higher (ex ante) likelihood of $\theta _{L}$ generates a greater deviation incentive for the high type via the gain–loss utility ($\partial B(p,\lambda )/ \partial p > 0$).

Reverse-screening menu

Let us consider next a reverse-screening menu $R = \{r_{L} =(q_{L},t_{L}), r_{H}=(q_{H},t_{H})\} \in \mathcal {M}^{R} $ such that $q_{L} > q_{H}$ and $t_{L} > t_{H}$, satisfying the (IC) and (IR) constraints. The reverse-screening menu is a useful device to exploit the aforementioned participation effect by giving a higher quality to the low type. Giving a higher quality to the low type, however, may create a deviation incentive for the high type. This incentive can be curbed should the high type suffer a sufficient loss from a higher deviation price. How this loss is affected by the parameters in our model will determine when the reverse-screening menu is optimal.

We first provide a couple of necessary conditions for reverse-screening menu to be feasible or optimal.

Lemma 2

(a)
A reverse-screening menu can be a TPE only if
$$\begin{aligned} \frac{\theta _H}{\theta _L} \le \frac{\lambda +1}{2}. \end{aligned}$$
(16)
(b)
Any optimal reverse-screening menu must satisfy $\theta _H v(q_H) \ge \theta _L v(q_L)$.

Part (a) states that loss aversion must be high enough to sustain a reverse-screening menu as a TPE. According to part (b), the seller does not want to reverse the qualities to the extent that the utility from quality consumption is reversed.

We now compare reverse-screening and pooling menus.

Proposition 4

Any reverse-screening menu is dominated by the optimal pooling menu if and only if

$$\begin{aligned} \frac{\theta _{H}}{\theta _{L}} \ge \frac{1+p+(1-p)\lambda }{2}, \end{aligned}$$

(17)

which in turn holds if and only if $\lambda \le \lambda _{R} $ for some threshold $\lambda _{R}$ that increases in p and $\frac{\theta _{H}}{\theta _{L}}$.

Thus, if $\lambda $ is large enough to violate (17), reverse-screening in fact dominates pooling. This arises due to the participation effect that makes the increase in $q_L$, rather than $q_H$, more effective in extracting surplus. Since the high-type consumer derives a higher level of utility from any given contract and therefore cares less about an improvement in quality than the low-type consumer, the attractiveness of exploiting the high type’s higher marginal intrinsic utility can be outweighed by the participation effect when the consumer is significantly loss averse.

Condition (17) shows that pooling tends to dominate reverse-screening as p gets larger. The logic is similar to that behind part (b) of Proposition 3: A higher p makes it more tempting for the high type to deviate. When the realization of the low type has been anticipated to be more likely, under screening, the high type experiences a greater loss from sticking to $r_H$ that involves a higher payment while, under reverse-screening, the same consumer finds it less costly to deviate to $r_L$.

Optimal menu

We are now ready to characterize the menu that maximizes the expected profit among all TPE menus.

Theorem 1

There exists some $\hat{p} \in (0,1)$ such that $\lambda _S \le \lambda _R $ if and only if $p \ge \hat{p}$. Then, the optimal menu that solves [P] is

(a)
a pooling menu if $p \ge \hat{p}$ and $\lambda \in [\lambda _S, \lambda _R]; $
(b)
a screening menu if $\lambda < \min \{ \lambda _R,\lambda _S \}; $
(c)
a reverse-screening menu if $\lambda > \max \{ \lambda _R,\lambda _S \}; $
(d)
either screening or reverse-screening menu (but not both) if $p < \hat{p}$ and $\lambda \in [\lambda _R, \lambda _S].$

Proof

First, it is straightforward to see that

$$\begin{aligned} \lim _{p \rightarrow 0} \lambda _S&=\infty >\frac{2\theta _H}{\theta _L} -1 = \lim _{p \rightarrow 0} \lambda _R\\ \lim _{p \rightarrow 1} \lambda _S&=2\sqrt{\frac{\theta _H}{\theta _L}} -1 <\infty = \lim _{p \rightarrow 1} \lambda _R. \end{aligned}$$

Thus, by the mean value theorem and the monotonicity of $\lambda _S$ and $\lambda _R$, we can find $\hat{p} \in (0,1)$ such that $\lambda _S\ge \lambda _R$ if and only if $p \ge \hat{p} $. Then, parts (a) to (d) of the claim immediately follow from combining part (b) of Propositions 3 and 4. $\square $

Pooling is optimal if there is enough mass of low types and the consumer is sufficiently, but not too, loss averse. Otherwise, a screening or reverse-screening menu is optimal. In the latter case, there is a region of parameters, as shown in part (d), in which we have not been able to fully sort between screening and reverse-screening menus, but in most cases we expect the screening (reverse-screening) menu to be optimal if $\lambda $ is low (high).

The central message of Theorem 1 is the optimality of pooling. Another noteworthy theoretical prediction of our model is the possibility of optimal reverse-screening under sufficiently large $\lambda $. We nonetheless show below that this latter result does not hold in a model with a continuum of buyer types [Theorem 2, part (c)] or with an alternative gain–loss utility specification (Proposition 7).

The following example illustrates how the optimal menu varies with the parameter values. Here, pooling is optimal for a wide range of parameter values, while reverse-screening requires $\lambda $ to be larger than 2.^{Footnote 8}

Example 1

Suppose that $\frac{\theta _{H}}{\theta _{L}}=1.5$. Figure 1 divides the space of $(\lambda ,p)$ into four regions according to Theorem 1 and illustrates the type of optimal menu in each region.

It can be shown, though only numerically, that in the region (d), there is a threshold value of $\lambda $ for each p below (above) which the screening (reverse-screening) menu is optimal. Below dotted line, the optimal screening menu entails exclusion of the buyer with low willingness to pay [see (13) above].^{Footnote 9}

Notice in the above example that, at low values of p, loss aversion actually generates a benefit from serving also the low-type buyer who would otherwise be excluded by the profit-maximizing seller. This is due to the participation effect that enables the firm to sell a higher quality-price bundle to the low type than in the model without loss aversion.^{Footnote 10}

3.2 A continuum of consumer types

In this section, we explore the scope of our findings beyond binary consumer types by considering a continuum-type case. Section S.2 of the Supplementary Material offers a detailed analysis, including formal proofs and numerical examples of the main results.

Suppose that $\theta \in [\underline{\theta },\overline{\theta }]$ with a cdf F, which has a strictly positive and continuously differentiable pdf f. Define the “virtual value” function as

$$\begin{aligned} J(\theta ) :=\theta -\frac{1-F(\theta )}{f(\theta )}, \end{aligned}$$

and assume that it is strictly increasing. Without loss aversion, this assumption leads to full separation of types.

Let $(q,t): [\underline{\theta },\overline{\theta }] \rightarrow \mathbb {R}_{+}\times \mathbb {R}$ denote a menu offered by the seller. For simplicity, we assume that $q(\cdot )$ and $t(\cdot )$ are continuous.^{Footnote 11} We restrict attention to two classes of monotone menus: (i) both $q(\cdot )$ and $t(\cdot )$ are non-decreasing; and (ii) both $q(\theta )$ and $t(\theta )$ are non-increasing while $\theta v(\theta )$ is non-decreasing. With some abuse of terminology, we refer to the former class of menus as screening menus and the latter as reverse-screening menus.

Given a feasible TPE menu, with some abuse of notation, let $U(\theta ^{\prime };\theta )$ denote the payoff of type $\theta $ reporting $\theta ^{\prime }$ and let $U(\theta ) := U(\theta ;\theta )$. Then, the (IC) constraint can be written as

$$\begin{aligned} U(\theta )= \max _{\theta ^{\prime }\in [\underline{\theta },\overline{\theta }]}U(\theta ^{\prime };\theta ),\quad \forall \theta , \end{aligned}$$

(18)

while the (IR) constraint as

$$\begin{aligned} U (\theta ) \ge \int _{\underline{\theta }}^{\overline{\theta }} (t(s) -\lambda s v(q(s))) {\text {d}} F(s), \quad \forall \theta . \end{aligned}$$

(19)

In both screening and reverse-screening menus we consider, $\theta v(q(\theta ))$ is non-decreasing and, hence, we can define

$$\begin{aligned} \hat{\theta }(\theta ,\theta ^{\prime }) :=\sup \{ r \in [\underline{\theta },\overline{\theta }]\, |\, s v (q(s)) \le \theta v(q(\theta ^{\prime })),\,\, \forall s \le r\}. \end{aligned}$$

Note that if type $\theta $ (mis)reports to be type $\theta ^{\prime }$ and receives $q (\theta ^{\prime })$, he experiences a utility gain (loss) in quality dimension, compared to the types below (above) $\hat{\theta } (\theta ;\theta ^{\prime })$.

We can then write

$$\begin{aligned} U(\theta ^{\prime };\theta )&= \theta v(q(\theta ^{\prime })) -t(\theta ^{\prime }) + \left[ \int _{\underline{\theta }}^{\hat{\theta }(\theta ,\theta ^{ \prime })}(\theta v(q(\theta ^{\prime }))-s v(q(s))){\text {d}}F(s)\right. \nonumber \\&\quad \left. +\int _{\theta ^{\prime }}^{ \overline{\theta }}(t(s)-t(\theta ^{\prime })){\text {d}}F(s)\right] \nonumber \\&\quad -\lambda \left[ \int _{\hat{\theta }(\theta ,\theta ^{\prime })}^{\overline{\theta } }(sv(q(s))-\theta v(q(\theta ^{\prime }))){\text {d}}F(s)+\int _{\underline{\theta } }^{\theta ^{\prime }}(t(\theta ^{\prime })-t(s)){\text {d}}F(s)\right] . \end{aligned}$$

The first-order condition for incentive compatibility amounts to the following^{Footnote 12}:

$$\begin{aligned} \left. \frac{\partial }{\partial \theta ^{\prime }} U(\theta ^{\prime };\theta ) \right| _{\theta ^{\prime } =\theta }= & {} \theta \left( v (q(\theta ))\right) ^{\prime } \left[ 1+F(\theta )\right. \nonumber \\&\left. +\lambda (1-F(\theta ))\right] -t^{\prime }(\theta )\left[ 1 + (1-F(\theta ))+\lambda F(\theta )\right] = 0. \end{aligned}$$

(20)

To see the intuition behind this expression, consider the cost and benefit of type $\theta $ from slightly overstate his type. On the one hand, the intrinsic utility from quality consumption marginally increases by $\theta (v(q(\theta )))^{\prime }$. From this, the gain that type $\theta $ enjoys relative to the types below increases by $\theta (v(q(\theta )))^{\prime } F(\theta )$ while the loss, which type $\theta $ suffers relative to the types above, decreases by $\lambda \theta (v(q(\theta )))^{\prime }(1-F(\theta ))$. Thus, the overall marginal benefit in the quality dimension is proportional to $1+F(\theta )+\lambda (1-F(\theta ))$. On the other hand, due to a higher payment after the deviation, the intrinsic utility decreases by $t^{\prime }(\theta )$. From this, the gain that type $\theta $ enjoys relative to the types above decreases by $t^{\prime } (\theta )(1-F(\theta ))$ while the loss increases by $\lambda t^{\prime } (\theta )F(\theta )$. Thus, the overall marginal benefit in the money dimension is proportional to $1+ (1- F(\theta )) + \lambda F(\theta )$.

We can rewrite (20) as

$$\begin{aligned} t^{\prime }(\theta )=(v(q(\theta )))^{\prime }\frac{\theta (1+F(\theta )+ \lambda (1-F(\theta )))}{1+(1-F(\theta ))+\lambda F(\theta )}= (v(q(\theta )))^{\prime }G(\theta ,\lambda ), \end{aligned}$$

(21)

where

$$\begin{aligned} G(\theta ,\lambda ):=\frac{\theta }{H (\theta ,\lambda )} \; \text{ and } \; H (\theta ,\lambda ) := \frac{ 1+(1-F(\theta ))+\lambda F(\theta )}{1+F(\theta )+\lambda (1-F(\theta ))}. \end{aligned}$$

Note that $H(\theta ,\lambda )$ is the continuum-type counterpart of $B(p,\lambda )$ in (12). It affects the rate at which the payment increases as the consumer’s type, and thus its corresponding quality marginally increases. Without reference-dependent utility, the rate of increase is proportional to $G(\theta ,1)=\theta $; this should be adjusted using $H(\theta ,\lambda )$ in the presence of reference-dependent utility. We refer to $G (\theta ,\lambda )$ as the “gain–loss-adjusted type,” whose behavior is crucial for determining the optimal quality schedule. Note that $G(\theta ,\lambda ) > \theta $ if $\theta < F^{-1}\left( \frac{1}{2}\right) $ (and $G(\theta ,\lambda ) < \theta $ if $\theta > F^{-1}\left( \frac{1}{2}\right) $), so the gain–loss-adjusted type is leveled out. Moreover, $H(\theta ,\lambda )$ increases in $\theta $ and does so faster with higher $\lambda $, which may cause $G(\theta ,\lambda )=\frac{\theta }{H(\theta ,\lambda )} $ to decrease.

We next present our results of this section.

Theorem 2

Consider the case of a continuum of consumer types, and restrict attention to monotone menus. The optimal TPE menu has the following properties:

(a)
Suppose that (i) $\theta (1+F(\theta )+\lambda (1-F(\theta ))) $ is non-decreasing in $\theta $ and (ii) $\frac{\lambda ^2 +2\lambda -3}{2(\lambda +1)} > \frac{1}{\overline{\theta }f(\overline{\theta })}$. Then, pooling occurs around the highest type $\overline{\theta }$.
(b)
Suppose that $\underline{\theta } >0$, $\theta f(\theta ) > F(\theta ) \ \forall \theta $, and $f^{\prime }(\theta )\le 0 \ \forall \theta $. Then, there exists some $\overline{\lambda }>1$ such that, for any $\lambda > \overline{\lambda }$, pooling occurs over the entire interval $[\underline{\theta },\overline{\theta }]$.
(c)
Any reverse-screening menu is dominated by a pooling menu.

In part (a), condition (i) guarantees that a quality-transfer schedule that deters deviation to a marginal type does so to all other types and hence global incentive compatibility is implied by local consideration.^{Footnote 13} Condition (ii) is equivalent to requiring that $G_{\theta } (\overline{\theta },\lambda )<0$, i.e., the gain–loss-adjusted type decreases with the original type around the top. Without having to concern with information rent at the top, this means that the gain–loss-adjusted virtual value also decreases, leading to pooling at the top. Note that the inequality never holds if $\lambda =1$.

Part (b) gives a set of conditions sufficient for full pooling to be optimal. The first condition, $\underline{\theta } >0$, prevents the optimal menu from excluding the bottom type, as required by a full pooling menu. To understand the second condition, let us first note

$$\begin{aligned} \lim _{\lambda \rightarrow \infty } G(\theta ,\lambda ) = \theta \frac{1-F(\theta )}{ F(\theta )}. \end{aligned}$$

Thus, for sufficiently high $\lambda $, the gain–loss-adjusted type decreases going from $\underline{\theta }$ to $\overline{\theta }$ while it may not be in between. Then, the condition that $\theta f(\theta ) > F(\theta ) \ \forall \theta $ ensures that this expression monotonically decreases over the entire interval so that $G_{\theta }(\theta ,\lambda )$ is always negative for sufficiently high $\lambda $. The last condition, $f^{\prime }(\theta ) \le 0$, ensures (along with the second condition) that $G_{\theta \theta } (\theta )\le 0$ for sufficiently high $\lambda $, which means worsening of the information rent problem due to loss aversion. Note that this condition is consistent with the observation in the previous binary-type analysis that the screening effect adversely affects the profitability of a screening menu when the low type is abundant.

Part (c) shows that, in contrast to the binary-type case, the reverse-screening menu can no longer be optimal with continuously many types. Recall that we consider reverse-screening menus whose quality/transfer schedule is non-increasing. Thus, the class of menus that are dominated by pooling menu here includes any menu in which the quality/transfer schedule is strictly decreasing over some local interval of types while being constant elsewhere. To understand this result, recall that a key feature of optimal reverse-screening with binary types was the participation effect: For the low willingness-to-pay consumer, the participation constraint must be binding at the optimum and therefore the additional loss arising from non-participation allows the firm to extract a greater payment from this type by offering a higher quality product [see (10)]. With a continuum of types, this effect no longer applies. The participation constraint similarly binds for the lowest type, but the corresponding revenue impact is only marginal. On the other hand, just as in the binary case, the incentive compatibility requirement works against the profitability of reverse-screening menus.

Remark 1

Our derivation of optimal menu is based on the restriction to monotone menus. Therefore, Theorem 2 implies the following: When the conditions stipulated in part (a) or (b) are met, the optimal TPE menu involves either pooling, or else, strict violation of monotonicity (“local reverse-screening”). Neither contractual form is predicted by the standard model with increasing virtual value.

4 Optimal TPPE menu

4.1 The seller’s problem

Let us next consider a consumer who is capable of choosing the best PE from a given menu of bundles. We restrict attention to the binary consumer type case and TPPE menus, i.e., TPE menus that generate the highest ex ante utility to the consumer among all corresponding PEs.

Given any TPE menu $R = \{b_L, b_H\} \in \mathcal {M}$, let

$$\begin{aligned} C(R) := \left\{ R^{\prime } = \{ b_L^{\prime }, b_H^{\prime } \} \ne R\, | \, b_i^{\prime } = \emptyset , b_L, \text{ or } b_H \text{ for } \text{ each } i = L,H \right\} , \end{aligned}$$

that is, the set of all menus other than R that can arise from each of the two types choosing a bundle contained in R. In order for a TPE menu $R = \{b_L, b_H\}$ to be a TPPE, it must be that for every alternative consumption plan $R^{\prime }\in C(R)$, either $R^{\prime }$ fails to be a PE or the buyer’s ex ante payoff from $R^{\prime }$ does not exceed that from R. This requirement will be met if and only if R and $R^{\prime }$ satisfy at least one of the five inequalities below:

$$\begin{aligned} u(b_L^{\prime }|\theta _L, R^{\prime }) < u( \tilde{b} |\theta _L, R^{\prime })\quad \text{ for } \tilde{b} \in R {\setminus } \{b_L^{\prime }\} \qquad \qquad \qquad ({\text {FIC}}_L) \end{aligned}$$

$$\begin{aligned} u (b_L^{\prime }| \theta _L, R^{\prime }) < u (\emptyset | \theta _L,R^{\prime }) \qquad \qquad \qquad \quad \qquad \qquad \qquad \qquad ({\text {FIR}}_L) \end{aligned}$$

$$\begin{aligned} u (b^{\prime }_H | \theta _H, R^{\prime }) < u (\tilde{b} | \theta _H, R^{\prime })\quad \text{ for } \tilde{b} \in R {\setminus } \{b_H^{\prime }\} \qquad \qquad \qquad ({\text {FIC}}_H) \end{aligned}$$

$$\begin{aligned} u (b_H^{\prime }| \theta _H, R^{\prime }) < u (\emptyset | \theta _H, R^{\prime }) \qquad \qquad \qquad \quad \qquad \qquad \qquad \qquad ({\text {FIR}}_H) \end{aligned}$$

$$\begin{aligned} U(R^{\prime }) \le U(R). \qquad \qquad \qquad \quad \qquad \qquad \qquad \qquad \qquad \qquad (U) \end{aligned}$$

Fixing a consumption plan R, the first four inequalities above represent violations of the four (IC) and (IR) conditions, respectively, for an alternative plan $R^{\prime }$ to constitute itself a PE. These inequalities will be referred to as the (FIC) and (FIR) conditions. The last inequality means that the buyer’s ex ante payoff from $R^{\prime }$ does not exceed that from R. We say that $R\in \mathcal {M}$ satisfies the PPE requirement with respect to $R^{\prime }$ if at least one of the above five inequalities is satisfied.

A TPE menu $R \in \mathcal {M}$ is a TPPE if and only if it satisfies the PPE requirement with respect to $R^{\prime }$ for every $R^{\prime }= C(R)$. Let $\mathcal {M}^e$ denote the set of all such menus. Then, the seller’s corresponding optimization program is given as follows^{Footnote 14}:

$$\begin{aligned} \max _{\{(q_{L},t_{L}),(q_{H},t_{H})\} \in \mathcal {M}^e} p(t_{L}-cq_{L})+(1-p)(t_{H}-cq_{H}). \qquad \qquad \qquad [P^e] \end{aligned}$$

4.2 Results

We begin our analysis of optimal TPPE menu by exploring a necessary condition for a screening menu to be a TPPE. Suppose that the firm offers $R =\{b_L, b_H\}$ such that $b_L \ne b_H$ intended to screen the high-type consumer. The problem is that the consumer may instead form, or deviate to, an alternative consumption plan from the offered bundles. In particular, choosing a constant bundle poses a potential benefit in terms of gain–loss utilities. Our first result provides the conditions for a screening menu to satisfy the PPE requirement with respect to the pooling menus. To state the result, define

$$\begin{aligned} \alpha (p,\lambda ) := {\left\{ \begin{array}{ll} \frac{\lambda +1}{2} &{} \text{ if } p < \frac{\lambda +2}{\lambda +3}\\ \frac{1+(1-p)(\lambda -1)}{1-(1-p)(\lambda -1)} &{} \text{ if } p \ge \frac{\lambda +2}{\lambda +3} \end{array}\right. } \quad \text{ and } \quad \beta (p,\lambda ) := {\left\{ \begin{array}{ll} \frac{1-p(\lambda -1) }{1+p(\lambda -1)} &{} \text{ if } p \le \frac{1}{\lambda +3} \\ \frac{2}{\lambda +1} &{} \text{ if } p > \frac{1}{\lambda +3}. \end{array}\right. } \end{aligned}$$

(22)

Lemma 3

Fix any screening menu $R = \{b_L, b_H\}$. Then, we obtain the following:

(a)
R satisfies the PPE requirement with respect to $R^H := \{b^H\}$ if and only if
$$\begin{aligned} \frac{t_H - t_L}{v_H -v_L} \ge \theta _L \alpha (p,\lambda ) (\text{ With } \text{ the } \text{ inequality } \text{ being } \text{ strict } \text{ if } p < \frac{\lambda +2}{\lambda +3}); \end{aligned}$$
(23)
(b)
R satisfies the PPE requirement with respect to $R^L := \{b^L\} $ if and only if
$$\begin{aligned} \frac{t_H - t_L}{v_H -v_L} \le \theta _H \beta (p,\lambda ) (\text{ With } \text{ the } \text{ inequality } \text{ being } \text{ strict } \text{ if } p > \frac{1}{\lambda +3}). \end{aligned}$$
(24)

Furthermore, conditions (23) and (24) imply that R is a TPE.

Part (a) is derived from the following considerations. If $b_H$ was so expensive relative to $b_L$ as to satisfy (23), the consumer would not deviate to $R^H$ (under which he would always consume $b^H$) for one of two reasons: Either the low type prefers $b_L$ to $b_H$ so that $R^H$ cannot be a PE, or the expected transfer from $R^H$ is sufficiently higher than that from R such that $R^H$ overall yields a lower ex ante payoff than R. Part (b) and condition (24) are derived similarly by considering $R^L$. These two conditions also turn out to ensure that the screening menu R is itself a TPE, greatly facilitating our characterization below.

It follows from (23) and (24) that a screening TPPE menu exists only if the RHS of (24) is smaller than the RHS of (23), which delivers the necessary condition for the existence of a screening TPPE menu. We next show that this condition is also sufficient and holds if $\lambda $ is not too large. A reverse-screening TPPE menu can exist only if $\lambda $ is sufficiently large. In contrast, one can always find a pooling TPPE menu that yields a positive profit.

Proposition 5

Define $\overline{\lambda }_S \in (1, \infty )$ such that $\theta _L \alpha \left( p,\overline{\lambda }_S\right) =\theta _H \beta \left( p,\overline{\lambda }_S\right) $. Also, define

$$\begin{aligned} \overline{\lambda }_R := \max \left\{ \frac{2\theta _H-(1+p)\theta _L}{(1-p)\theta _L}, 1+ \frac{1}{p}\right\} > \overline{\lambda }_S. \end{aligned}$$

We obtain the following:

(a)
There exists a screening TPPE menu if and only if $\lambda < \overline{\lambda }_S$. Also, there exist $\underline{p}$ and $\overline{p}$ with $0<\underline{p}<\overline{p}<1$ such that, as p increases, $\overline{\lambda }_S$ is (continuously) decreasing for $p < \underline{p} $, constant for $p \in [\underline{p},\overline{p}]$, and increasing for $p > \overline{p}$.
(b)
There exists a reverse-screening TPPE menu only if $\lambda \ge \overline{\lambda }_R$.
(c)
There always exists a pooling TPPE menu that yields a positive profit.

An immediate implication from Proposition 5 is that only pooling menus can be sustained as TPPE if the loss aversion parameter is in the range $[\overline{\lambda }_S,\overline{\lambda }_R)$. Furthermore, part (a) shows that screening is feasible under a smallest range on $\lambda $ when p takes an intermediate value: $\overline{\lambda }_S$ is minimized when $p \in [\underline{p}, \overline{p}]$.

To gain some intuition, note first that the gain–loss utilities are generated by the difference between the actual realized type and the expectation. Therefore, they occur more often when the type distribution has a greater variance, which, in the case of binary types, is true when p is closer to a half. In contrast to the PE analysis earlier, we are now concerned with the consumer’s ex ante payoff comparisons across multiple PEs: a greater variance in the type distribution makes the contingent consumption plan less attractive ex ante.

It remains to show the shape of profit-maximizing TPPE menu. It turns out that a screening menu is optimal whenever it can be supported as a TPPE. We state our next theorem.

Theorem 3

The optimal menu that solves $[P^e]$ is

(a)
a pooling menu if $\lambda \in [\overline{\lambda }_S,\overline{\lambda }_R)$;
(b)
a screening menu if $\lambda < \overline{\lambda }_S$, and the optimal $q_L$ and $q_H$ solve
$$\begin{aligned} \frac{c}{v^{\prime } (q_L)}&= \frac{\theta _L \alpha (p,\lambda )}{p} - \frac{\theta _H (1-p) \beta (p,\lambda )}{p} \end{aligned}$$
(25)

$$\begin{aligned} \frac{c}{v^{\prime } (q_H)}&= \theta _H \beta (p,\lambda ). \end{aligned}$$
(26)

Our proof of part (b) consists of two steps. First, we take a screening menu and solve a relaxed problem by imposing the PPE requirement only for a subset of deviations, $R^L$, $R^H$, and $R^{\emptyset H} =\{\emptyset ,H\}$. As shown in Lemma 3, the deviations to $R^L$ and $R^H$ can be deterred by invoking (23) and (24). In order to deter the deviation to $R^{\emptyset H}$, the transfer for the low type, $t_L$, should not be too large since otherwise the buyer would find it better off ex ante to choose $R^{\emptyset H}$, i.e., (U) is violated. This imposes another upper bound on $t_L$ in addition to the bound imposed by $({\text {IR}}_L)$ as part of the TPE conditions. These two bounds can be written together as $t_L \le \theta _L \alpha (p,\lambda )$ (where $\alpha (p,\lambda )$ is as defined in (22)). This constraint and (24) must be binding at the optimum of the relaxed problem, which leads to the first-order conditions given in (25) and (26). The second step of the proof then shows that the optimal menu for the relaxed problem satisfies all other PPE requirements.

We have not derived a boundary beyond which reverse-screening begins to dominate pooling, which can still be optimal when $\lambda \ge \overline{\lambda }_R$.^{Footnote 15} Nonetheless, Theorem 3 demonstrates that the additional insurance motive captured by the PPE requirement favors pooling for a wide range of parameter values. We offer a numerical illustration in Fig. 2. To highlight the contrast with the TPE results earlier, we set $\frac{\theta _H}{\theta _L} = 1.5$ as in Example 1 and plot $\overline{\lambda }_S$ and $\overline{\lambda }_R$ together with $\lambda _S$ and $\lambda _R$ appearing in Fig. 1.

Remark 2

Notice the shaded region at the top left of Fig. 2 where optimal TPE menu is pooling but screening is the optimal strategy under TPPE. The introduction of PPE requirements reduces profitability of both types of menu. For instance, optimal pooling TPE menu may entail an alternative PE in which the buyer never makes a purchase.^{Footnote 16} When the likelihood of low type is large, the PPE requirements make a greater impact on pooling than on screening.

4.3 A role for redundant bundle

The analysis of PPE menus above followed the spirit of revelation principle, focusing on the direct revelation menus. The restriction to direct menus is without loss if the seller is allowed to select the truthful equilibrium, or TPE in out setup. However, the notion of PPE also seeks optimality from the agent’s perspective and hence renders the revelation principle inapplicable. In this section, we present a new possibility that an indirect menu can improve the seller’s profit upon the optimal pooling TPPE menu previously characterized. The alternative menu that we propose features two bundles, but both consumer types pool on a single bundle, with the other remaining redundant.

Suppose that the optimal TPPE menu is a pooling menu $M = \{b^*=(q^*, t^*)\}$ for which the PPE requirement against (the deviation to) the null menu $R= \{ \emptyset ,\emptyset \}$ boils down to condition (U). This condition then imposes an upper bound on the transfer as follows:

$$\begin{aligned} t^*\le \big [ p\theta _L + (1-p)\theta _H -p (1-p)(\lambda -1) (\theta _H -\theta _L) \big ]v(q^*) = \varPhi v (q^*), \end{aligned}$$

(27)

where

$$\begin{aligned} \varPhi := p \theta _L + (1-p)\theta _H - p(1-p)(\lambda -1) (\theta _H -\theta _L). \end{aligned}$$

Let us now modify M and design a new menu $M^{\prime } = \{ b=(q,t), b^{\prime }=(q^{\prime },t^{\prime })\}$, where

$q=q^*$ and $t=\varPhi v (q^*) +\epsilon $ for $\epsilon >0$;
$q^{\prime } =\delta $ and $t^{\prime } = \theta _H \frac{2}{\lambda +1} v (q^{\prime }) -\delta ^{\prime }$ for $\delta , \delta ^{\prime } >0$.

Since $ q = q^*$ and $t > t^*$, the seller’s profit is higher under $M^{\prime }$ than under M, provided that both types pooling on b constitutes a PPE.

This latter observation is indeed true under the following parametric restrictions.

Assumption 1

(i)
$ \overline{\lambda }_S< \lambda < 1+ \frac{1}{p}$;
(ii)
$\theta _H \frac{2}{\lambda +1}< \varPhi < \theta _L \frac{\lambda +1}{2}$;
(iii)
$ \frac{1}{\varPhi } > \max \Big \{ \left( \frac{p\lambda +1}{2} \right) \frac{1}{\theta _H}, \left( \frac{1-(1-p)(\lambda -1)}{1+(1-p)(\lambda -1)}\right) \frac{1}{\theta _L} \Big \}$;
(iv)
$\theta _H \frac{2}{\lambda +1} < \theta _L \frac{1+p +(1-p)\lambda }{1+(1-p) + p \lambda }$.

Proposition 6

Suppose that Assumption 1 holds. Then, there are sufficiently small values of $\epsilon , \delta ,$ and $\delta ^{\prime }$ such that in the PPE of menu $M^{\prime } = \{ b, b^{\prime }\}$, both types choose b and the corresponding expected profit exceeds that from the optimal pooling TPPE menu $M = \{ b^* \}$.

A formal proof is presented in Section S.4 of the Supplementary Material. To understand this result, note first that, by Theorem 3 (since $1+ \frac{1}{p}\le \overline{\lambda }_R$), part (i) of Assumption 1 implies that the optimal TPPE menu is a pooling menu; also, by part (ii), (U) is implied by $({\text {FIC}}_H)$ and $({\text {FIC}}_L)$ and hence (U) captures the PPE requirement against $R = \{\emptyset ,\emptyset \}$.

Next, consider our menu $M^{\prime }$. Here, pooling on b violates condition (U) given in (27) but still satisfies the PPE requirement against $ \{\emptyset , \emptyset \}$. This is because the redundant bundle $b^{\prime }$ is constructed such that the high type would deviate from the null bundle to choose $b^{\prime }$. For providing such incentives to break pooling on $\emptyset $ as a PE, we need to ensure that the screening effect of loss aversion works in favor of separation. An important content of Assumption 1 therefore requires p to be sufficiently low (recall from Sect. 3.1.3 that, for fixed $\lambda $, the effectiveness of screening is decreasing in p).

Introducing a redundant bundle however generates new constraints: First, the consumer must be incentivized not to choose $b^{\prime }$ over b, and second, the PPE requirements must be satisfied against new potential PEs involving $b^{\prime }$. In terms of the latter, since three bundles (including the null bundle) are available, we need to check for 8 possible deviations from the desired pooling PE, while each deviation must be consistent with $({\text {FIC}}_L)$, $({\text {FIC}}_H)$ or (U). Assumption 1-(iii) and -(iv) are invoked to handle these requirements.

Assumption 1 is satisfied by a non-trivial set of parameter values. For instance, in Fig. 3, we set $\theta _H/\theta _L =1.5$ and depict those parameter values in the shaded region.

We do not know the full extent of optimal contracting under general indirect menus. In the case of pooling menu, it is relatively easy to break undesired PEs by introducing a redundant bundle since pooling menus admit a relatively small number of potential deviations compared to screening or reverse-screening menus. To derive the profit-maximizing menu among all indirect menus with an arbitrary number of redundant bundles, the optimization problem involves an intractable number of constraints. From the perspective of mechanism design theory, the analysis of general optimal menu amounts to searching for the second-best mechanism over the entire mechanism space, direct or indirect, without help of the revelation principle. To our knowledge, this question is yet to be tackled by the literature.

5 Alternative reference points

In this section, we discuss some alternative approaches, and their consequences, of modeling gain–loss utilities in the price discrimination setup.

5.1 Bundles as stochastic reference point

Our approach to modeling a stochastic reference point is that each type-$\theta $ consumer compares the utility from his consumption, i.e., $\theta v(q)$, with the utility that each hypothetical type $\theta ^{\prime }$ would have derived from consuming her reference bundle, i.e., $\theta ^{\prime } v(q^r(\theta ^{\prime }))$. Thus, the gain–loss term on the intrinsic utility component for each type $\theta $ amounts to

$$\begin{aligned} \int _{\theta ^{\prime } \in \varTheta } \mu \left( \theta v(q) - \theta ^{\prime } v(q^r(\theta ^{\prime }))\right) {\text {d}}F(\theta ^{\prime }), \end{aligned}$$

(28)

where $\mu $ is the loss aversion indicator function as defined in (2).

An alternative approach is to consider comparison of just the physical outcomes. This would mean rewriting of (28) into

$$\begin{aligned} \theta \int _{\theta ^{\prime } \in \varTheta } \mu \left( v(q) - v(q^r(\theta ^{\prime }))\right) {\text {d}}F(\theta ^{\prime }), \end{aligned}$$

(29)

and (2) into

$$\begin{aligned} n(b; \theta , \theta ^{\prime }, R(\theta ^{\prime })):= & {} n(b; \theta , R(\theta ^{\prime }))\\= & {} \theta \mu \left( v(q) - v(q^r(\theta ^{\prime })) \right) + \mu \left( t^r(\theta ^{\prime })-t \right) . \end{aligned}$$

According to (29), each type-$\theta $ consumer evaluates his consumption bundle against reference bundles with his own willingness to pay, ignoring a potential comparison against other possible selves that he could have been.^{Footnote 17} To further clarify the difference from (28), suppose that the reference bundle is identical for two distinct types, i.e., $R(\theta ^{\prime }) = R(\theta ^{\prime \prime })$. In the alternative approach, the gain–loss utility is also treated identically; in contrast, we consider the case in which the gain–loss utilities would differ across the two distinct types. Our approach recognizes the fact that the same bundle could generate different consequences for different types.

Beyond the conceptual difference discussed above, the two approaches also generate different results. In particular, with (29), reverse-screening can never be incentive feasible. The properties of screening and pooling menus remain identical nonetheless. Next result characterizes the optimal TPE menu with the alternative utility model in the binary-type case. A corresponding analysis for the continuum-type case is presented in Section S.5 of the Supplementary Material.

Proposition 7

Suppose that the buyer’s gain–loss utility is as given by (29). Also, suppose that $\varTheta = \{\theta _L, \theta _H\}$. Then, the optimal menu that solves [P] is a pooling menu if and only if $\lambda \ge \lambda _S$, where $\lambda _S$ is as defined in Proposition 3.

Proof

Note first that the alternative gain–loss specification does not affect the (IR) constraints and hence the optimal pooling menu. Also, the (IC$_H$) constraint for screening is given by

$$\begin{aligned} u(r_{H}|\theta _{H},R)&= \theta _{H}v(q_{H})-t_{H}+p[\theta _{H}v(q_{H})-\theta _{H}v(q_{L})-\lambda (t_{H}-t_{L})] \\&\ge u(r_{L}|\theta _{H},R) = \theta _{H}v(q_{L})-t_{L}+p(\theta _{H}-\theta _{H})v(q_{L}) \\&\quad +(1-p)[(t_{H}-t_{L})-\lambda \theta _{H}(v(q_{H})-v(q_{L}))], \end{aligned}$$

which clearly leads to the same expression as (11). Therefore, Proposition 3 remains true.

Next, we show that reverse-screening cannot be a PE. Consider a reverse-screening menu with $t_{L}>t_{H}$ and $q_{L}>q_{H}.$ Then, $({\text {IC}}_{H})$ is written as

$$\begin{aligned}&\theta _{H}v(q_{H})-t_{H}+p\left[ -\lambda \theta _{H}(v(q_{L})-v(q_{H}))+(t_{L}-t_{H})\right] \\&\quad \ge \theta _{H}v(q_{L})-t_{L}+(1-p)\left[ \theta _{H}(v(q_{L})-v(q_{H}))-\lambda (t_{L}-t_{H})\right] , \end{aligned}$$

which simplifies to

$$\begin{aligned} (t_{L}-t_{H})\left[ 1+p+(1-p)\lambda \right] \ge \theta _{H}(v_{L}-v_{H})\left[ 1+(1-p)+p\lambda \right] . \end{aligned}$$

(30)

Analogously, $({\text {IC}}_{L})$ is written as

$$\begin{aligned}&\theta _{L}v(q_{L})-t_{L}+(1-p)\left[ \theta _{L}(v(q_{L})-v(q_{H}))-\lambda (t_{L}-t_{H})\right] \\&\quad \ge \theta _{L}v(q_{H})-t_{L}+p\left[ -\lambda \theta _{L}(v(q_{L})-v(q_{H}))+(t_{L}-t_{H})\right] , \end{aligned}$$

which simplifies to

$$\begin{aligned} (t_{L}-t_{H})\left[ 1+p+(1-p)\lambda \right] \le \theta _{L}(v_{L}-v_{H})\left[ 1+(1-p)+p\lambda \right] . \end{aligned}$$

(31)

Combining (30) and (31) yields

$$\begin{aligned} B(p,\lambda )\theta _{H}\le \frac{t_{L}-t_{H}}{v(q_{L})-v(q_{H})}\le B(p,\lambda )\theta _{L}, \end{aligned}$$

(32)

where $B(p,\lambda )=\frac{1+(1-p)+p\lambda }{1+p+(1-p)\lambda }.$ It is clear that the two inequalities in (32) cannot hold simultaneously. This completes the proof. $\square $

A key modeling choice that facilitates the KR approach in our setup is that the buyer and seller have symmetric information when the seller designs/offers menu, but the buyer later learns some additional payoff-relevant private information. As observed in Sect. 3.1.2, this incomplete information is critical to our results. After receiving new information, the buyer evaluates his consumption by not only its intrinsic utility but also by comparing it with the utility or outcome previously anticipated for every other possible contingency. In particular, the buyer’s ex post preference is affected by the average gain–loss utility with respect to the (commonly known) prior distribution.

Using the prior to evaluate gain–loss comparisons offers a convenient way of modeling expectation-based reference-dependent utility. An interesting direction of future research would however be to consider alternative approaches to incorporating gain–loss comparisons across multiple types.^{Footnote 18} Such a model would still be consistent with KR’s rational expectations framework that attempts to endogenize reference point: The buyer would form contingent consumption plan before learning his private information, and this plan would have to be optimal for each realized type under the alternative utility model.

5.2 Average bundle

An important motivation for adopting the KR model of reference-dependent preferences arose from recognizing the role of expectations. While in the KR model the reference point is stochastic and equals the distribution of expected outcomes, the models of disappointment aversion (Bell 1985; Loomes and Sugden 1986) formulate the reference point as fixed, and in particular, as the expected utility certainty equivalent of a gamble. A similar approach in our price discrimination setup would be to take the expected utility of the contingent bundles as reference point.^{Footnote 19}

Formally, with binary types and menu $\left\{ b_{L},b_{H}\right\} $, consider type-$\theta $ buyer’s gain–loss utility from bundle $b=(q,t)$ to be

$$\begin{aligned} \mu \left[ \theta v(q)-\left( p\theta _{L}v(q_{L})+(1-p)\theta _{H}v(q_{H})\right) \right] +\mu \left[ \left( pt_{L}+(1-p)t_{H}\right) -t \right] . \end{aligned}$$

(33)

In Section S.5 of the Supplementary Material, we solve for the optimal menu under this alternative specification of reference-dependent preferences. It turns out that this analysis is very close to that of optimal TPE menus in Sect. 3. Whenever a pooling menu maximizes the firm’s profit under TPE, it does so here as well.

5.3 Additive separability

Our formulation of gain–loss utilities treats quality and money dimensions in an additive separable form. This is consistent with the endowment effect observed in many empirical studies. An alternative formulation would be to apply the gain–loss utility to the total utility, $\theta v(q)-t$. It turns out that the predictions of our model under such a gain–loss specification are no different from the model with standard preferences. See Section S.5 of the Supplementary Material.

6 Conclusion

We often find sellers offering menus with just a small number of bundles. This paper demonstrates that such observations are consistent with profit-maximizing firms that face loss averse consumers. We show that, in the binary-type case, a pooling menu is the seller’s optimal menu under a range of loss aversion parameter if the low willingness-to-pay consumers are sufficiently abundant. This result arises as a consequence of the interplay between loss aversion and asymmetric information. The benefits from screening with multiple bundles become even more restricted when the consumer is capable of choosing the personal equilibrium that generates the highest ex ante payoff. We also identify conditions under which partial or even full pooling dominates screening for the seller facing a continuum of consumer types.

The optimal menus described in our analysis above have the feature that the buyer’s ex ante expected utility (including anticipated gain–loss) often falls below zero. This can be problematic for the seller if the consumer can calculate the ex ante loss and find some commitment device to stay away from the menu altogether. In our previous working paper Hahn et al. (2012), we showed that introducing an additional ex ante participation constraint to the analysis (requiring the buyer’s ex ante expected utility to be nonnegative) does not alter our central message. In fact, the loss averse consumer’s ex ante insurance motives can induce the profit-maximizing firm to offer pooling menus under a wider range of parameters.

The same conclusion also holds in an alternative model of ex ante contracting where the buyer’s participation decision is made before his type is realized. That is, the buyer, when deciding whether to accept the menu offered by the seller, is uncertain about his willingness to pay. Analyzing the optimal PPE menu in this alternative model reveals that the pooling menu is optimal for a larger set of parameters under the ex ante participation constraint than under the ex post one. Again, the buyer’s insurance motives reinforce the benefits of pooling.^{Footnote 20} These additional results, together with those reported in Sect. 5 for additively separable gain–loss utilities, demonstrate that the optimality of pooling is a general phenomenon with loss averse consumers, valid under different decision-making scenarios and time lines.

Our theory offers potential explanations for why some sellers fail to fully materialize the benefits from further price discrimination in industries that seem to have low fixed costs of adding another product variant. For example, seats in existing entertainment venues provide different views and the cost of offering multiple seating categories is essentially zero. But, the practice of price discrimination in this industry, sometimes known as “scaling the house,” displays wide variations both within and across markets as well as across time (see the survey of Courty 2000). In particular, many ticket sellers indeed choose to offer uniform pricing or very few seating categories.^{Footnote 21} In a study of another industry with potentially low fixed product costs, Crawford and Shum (2007) report that 70% of over 1000 US cable TV providers in their sample year of 1995 offered a single package of channels only and estimate substantial unrealized returns from price discrimination.^{Footnote 22}

Consistent with our prediction that price discrimination would be more likely under certain market conditions, in contrast to pop concerts, high-brow entertainment events, such as classical concerts, usually offer many seating categories (e.g., Huntington 1993); in their cross-sectional study of cable TV providers, Crawford and Shum (2007) report evidence that markets offering more cable packages tend to be “populated by households with greater tastes for cable service quality (Crawford and Shum 2007, p. 201).” Our results can also shed light on observed pricing practices in other industries. For example, buses and motels usually offer a single type of seats and rooms, and this contrasts with the standard features of trains and hotels that frequently serve upscale travelers.

While we take the uncertainty to affect willingness to pay directly, variations in willingness to pay may arise from other sources, for example, income shocks. In such a case, however, the buyer should also realize gain–loss utility in that uncertain monetary dimension. Also, our model suggests that, contrary to common observations, reverse-screening can be optimal if the consumer is significantly loss averse (at least with only few consumer types). Interestingly, Ayres (1995) and Ayres and Siegelman (1995) found a case of car dealers who offered substantially lower prices to white consumers than to nonwhite consumers. Given the high willingness to pay estimated for white buyers, these authors suggested racial bias behind the observed practice. In a recent paper, however, Bang et al. (2014) provide a rational justification of such “reverse price discrimination.” Although these accounts are concerned with third-degree price discrimination, they suggest that reverse-screening may not be a mere theoretical possibility.

Notes

Kőszegi and Rabin (2007, 2009) extend their previous model to incorporate risky and intertemporal decisions. Other models of expectation-based reference-dependent preferences are analyzed by Bell (1985), Loomes and Sugden (1986), Gul (1991), and Shalev (2000).
Our main results also hold under alternative time lines with ex ante participation.
When the buyer is very loss averse, a reverse-screening menu, where the low consumer type purchases a higher quality-price bundle than the high consumer type, can be made incentive feasible and even optimal. However, this result does not hold when there is a continuum of types. See Sect. 3.2.
To adjust the magnitude of gain–loss utility relative to consumption utility, we could introduce a parameter, say $\beta $, and multiply it to the gain–loss utility term. Here, we set $\beta $ equal to 1 for simplicity; the qualitative features of our results remain the same for any $\beta $ provided it is not too small.
Although we consider a model with ex post participation, our central message holds also under other time lines with ex ante participation. See Sect. 6 for a further discussion.
When we write a menu as an ordered pair of two bundles, the first (second) element refers to the bundle consumed by the low (high) type. When the two elements are the same and equal to r, with slight abuse of notation, we sometimes write the corresponding menu simply as $\{r\}$.
It is possible to have $\theta _{H} v (q_H) = \theta _{L} v(q_{L})$ at the optimum. We ignore this case to ease the exposition.
Estimates of loss aversion have been obtained in a variety of contexts, ranging from 1.3 to 2.7; see (Camerer 2006). However, these estimates do not translate directly to values of $\lambda $ in our setup since they are measured only in terms of money. A high level of $\lambda $ may also be unrealistic on theoretical grounds. For example, lottery decisions of an individual modeled along Kőszegi and Rabin (2007) violate first-order stochastic dominance for high $\lambda $ (e.g., Masatlioglu and Raymond 2016).
In the optimal reverse-screening menu, however, neither type is excluded. To see this, note that by definition of reverse-screening, $q_L > q_H$, and also that by Lemma 2(b), $q_L > 0$ implies $q_H > 0$.
This observation demonstrates an important distinction between our theory and an alternative explanation of coarse price discrimination based on fixed menu costs. In contrast to our predictions, when p is low, menu cost would have no bite since the seller would serve only the high-type customers even without it.
If the optimal schedule involves some jump(s), then it will manifest itself as a boundary solution of the optimization program since any such schedule can be approximated by a sequence of continuous schedules.
Note that this condition only considers local incentive compatibility. With standard preferences, global incentive compatibility is usually guaranteed by the nonnegative cross derivative of $U(\theta ^{\prime };\theta )$, but this latter property may not hold under reference dependence. Here, one can verify global incentive compatibility directly from the solution menu satisfying (20), or impose certain parametric assumptions. Also, global incentive compatibility trivially holds if the optimal schedule is constant.
This condition is easily satisfied if $\lambda $ is not too large (if $\lambda =1$, for instance, it holds irrespective of F). For given $\lambda $, the requirement is met if $\theta \ge \frac{F(\theta )}{f(\theta )}$ for all $\theta $, which, for example, holds for convex F.
The fact that the (FIC) and (FIR) conditions are strict inequalities implies that the set of TPPE menus is not closed and hence not compact. This may cause nonexistence of the optimal menu. To avoid this problem, we allow the (FIC) and (FIR) conditions to be satisfied as equality, in which case the optimum can only be attained approximately.
See Section S.3 of the Supplementary Material for a numerical example of optimal reverse-screening under TPPE.
It is straightforward to show that the optimal pooling or reverse-screening menu characterized in Theorem 1 always involves ex ante loss.
Orhun (2009) and Carbajal and Ely (2016) also assume that each type $\theta $ compares his consumption bundle with (exogenously given) reference bundle in terms of his own $\theta $. However, unlike (29), their gain–loss formulation does not involve comparisons against other possible types. Similarly to us, gain–loss formulation that compares utilities across types is adopted by Heidhues and Kőszegi (2008) and Herweg and Mierendorff (2013), among others.
For instance, one could conceive of a decision maker who considers only the maximum gain and loss (instead of the average).
De Meza and Webb (2007) apply the disappointment aversion model to an incentive provision setup.
This ex ante participation model is fully analyzed in a separate work, which can be provided upon request. Herweg and Mierendorff (2013) study an alternative ex ante participation model of price discrimination.
Leslie (2004) investigates the revenue impact of price discrimination at a single Broadway play and observes that almost 75% of the performances offered just two seating categories with the remainder offering three. In a large panel dataset on US pop music concerts analyzed by Courty and Pagliero (2012), Courty and Pagliero (2012), two categories were used by more than half of the sample and another quarter came with single price ticketing. Uniformly priced seats are usually allocated on the first-come-first-served basis, and hence, the customers can be thought of as facing a single, random seat quality.
The analysis of Crawford and Shum (2007) was based on data from 1995. While the US cable TV industry continues to be local monopolies to this date, the overall market landscape has changed substantially. First, cable TV providers now face significant competition from digital satellite providers. Second, the products offered by cabel TV providers have widened horizontally in the advent of new technologies such as Internet, recording and on-demand services. Nonetheless, the average number of purely vertically differentiated cable TV packages currently on offer are still very few.
The full expression of these constraints is given in Section S.1 of the Supplementary Material.
For the full expressions, see Section S.1 of the Supplementary Material.
A formal proof is a straightforward translation of graphical illustration and is thus omitted.
This can be seen from observing that in the case $\theta _H v_H \ge \theta _L v_L$, we have
$$\begin{aligned} \frac{\partial U(R^{\prime \prime })}{\partial \lambda } = -\frac{\theta _L v_H}{2}- p\left( 1-p\right) \left( \theta _H v_H-\theta _L v_L\right) -\frac{2 p(1-p) \theta _H \left( v_L -v_H\right) }{(1+p+\lambda -p \lambda )^2} <0. \end{aligned}$$
We note that $U(R^{\prime \prime })$ is also decreasing in the case $\theta _H v_H < \theta _L v_L$, since, in that case,
$$\begin{aligned} \frac{\partial U(R^{\prime \prime })}{\partial \lambda } = -\frac{\theta _L v_H}{2}- p\left( 1-p\right) \left( \theta _L v_L-\theta _H v_H\right) -\frac{2(1-p) p \theta _H \left( v_H-v_L\right) }{(1+p+\lambda -p \lambda )^2} <0. \end{aligned}$$
More precisely, it is true that $R^{\prime } =\{b,\emptyset \}$ must satisfy at least one of $({\text {FIC}}_H)$ and $({\text {FIC}}_L)$ conditions, that is, $u (\emptyset | \theta _H, R^{\prime } ) < u (b | \theta _H, R^{\prime }) $ and $u (b | \theta _L, R^{\prime }) < u (\emptyset | \theta _H, R^{\prime }).$
Note that the second inequality of (66) is strict for $p > \frac{\lambda +2}{\lambda +3}$. Thus, the optimal menu fails to exist for the reason mentioned in footnote 14. We avoid the nonexistence problem by allowing for the strict inequality to hold as equality.
To be precise, the transfers $t_L$ and $t_H$ in (94) and (95) result from binding the second inequality of (66) to avoid the nonexistence problem (as mentioned in Remark 14). In fact, for $p > \frac{1}{\lambda +3}$, that inequality is strict and corresponds to (62), which is the same as (100).

References

Abeler, J., Falk, A., Götte, L., Huffman, D.: Reference Points and Effort Provision. Am. Econ. Rev. 101, 470–492 (2011)
Article Google Scholar
Ayres, I.: Further Evidence of Discrimination in New Car Negotiations and Estimates of its Causes? Mich. Law Rev. 94, 109–147 (1995)
Article Google Scholar
Ayres, I., Siegelman, P.: Race and Gender Discrimination in Bargaining for a New Car? Am. Econ. Rev. 85, 304–321 (1995)
Google Scholar
Bang, S.H., Kim, J., Yoon, Y.R.: Reverse Price Discrimination with Bayesian Buyers. J. Ind. Econ. 62, 286–308 (2014)
Article Google Scholar
Bell, D.E.: Disappointment in Decision Making under Uncertainty. Oper. Res. 33, 1–27 (1985)
Article Google Scholar
Camerer, C.F.: Behavioral Economics. In: Blundell, R., Newey, W.K., Persson, T. (eds.) Advances in Economics and Econometrics: Theory and Applications, Ninth World Congress, vol. II. Cambridge University Press, New York (2006)
Google Scholar
Carbajal, J.C., Ely, J.: A Model of Price Discrimination under Loss Aversion and State-Contingent Reference Points. Theor. Econ. 11, 455–485 (2016)
Article Google Scholar
Card, D., Dahl, G.B.: Family Violence and Football: The Effect of Unexpected Emotional Cues on Violent Behavior. Q. J. Econ. 126, 103–143 (2011)
Article Google Scholar
Courty, P.: An Economic Guide to Ticket Pricing in the Entertainment Industry. Recherches Économiques de Louvain 66, 167–192 (2000)
Google Scholar
Courty, P., Nasiry, J.: Loss Aversion and the Uniform Pricing Puzzle for Vertically Differentiated Products. mimeo (2015)
Courty, P., Pagliero, M.: The Impact of Price Discrimination on Revenue: Evidence from the Concert Industry. Rev. Econ. Stat. 94, 359–369 (2012a)
Article Google Scholar
Courty, P., Pagliero, M.: The Pricing of Art and the Art of Pricing. In: Ginsburgh, V.A., Throsby, D. (eds.) Handbook of the Economics of Art and Culture, vol. II. North-Holland, Amsterdam (2012b)
Google Scholar
Crawford, G.S., Shum, M.: Monopoly Quality Choice in Cable Television. J. Law Econ. 50, 181–209 (2007)
Article Google Scholar
Crawford, V.P., Meng, J.: New York City Cab Drivers’ Labor Supply Revisited: Reference-Dependent Preferences with Rational-Expectations Targets for Hours and Income. Am. Econ. Rev. 101, 1912–1932 (2011)
Article Google Scholar
DellaVigna, S., Malmendier, U.: Contract Design and Self-Control: Theory and Evidence. Q. J. Econ. 119, 353–402 (2004)
Article Google Scholar
De Meza, D., Webb, D.C.: Incentive Design under Loss Aversion. J. Eur. Econ. Assoc. 5, 66–92 (2007)
Article Google Scholar
Dixit, A., Stiglitz, J.: Monopolistic Competition and Optimum Product Diversity. Am. Econ. Rev. 67, 297–308 (1977)
Google Scholar
Eisenhuth, R.: Auction Design with Loss Averse Bidders: The Optimality of All Pay Mechanisms. mimeo (2010)
Eliaz, K., Spiegler, R.: Contracting with Diversely Naive Agents. Rev. Econ. Stud. 73, 689–714 (2006)
Article Google Scholar
Eliaz, K., Spiegler, R.: Consumer Optimism and Price Discrimination. Theor. Econ. 3, 459–497 (2008)
Google Scholar
Ellison, G.: Bounded Rationality in Industrial Organization. In: Blundell, R., Newey, W.K., Persson, T. (eds.) Advances in Economics and Econometrics: Theory and Applications, Nineth World Congress, vol. II. Cambridge University Press, New York (2006)
Google Scholar
Ericson, K.M.M., Fuster, A.: Expectations as Endowments: Evidence on Reference-Dependent Preferences from Exchange and Valuation Experiments. Q. J. Econ. 126, 1879–1907 (2011)
Article Google Scholar
Esteban, S., Miyagawa, E., Shum, M.: Nonlinear Pricing with Self-Control Preferences. J. Econ. Theory 135, 306–338 (2007)
Article Google Scholar
Gill, D., Prowse, V.: A Structural Analysis of Disappointment Aversion in a Real Effort Competition. Am. Econ. Rev. 102, 469–503 (2012)
Article Google Scholar
Grillo, E.: Reference Dependence, Risky Projects and Credible Information Transmission. mimeo (2013)
Grubb, M.D.: Selling to Overconfident Consumers. Am. Econ. Rev. 99, 1770–1807 (2009)
Article Google Scholar
Gul, F.: A Theory of Disappointment Aversion. Econometrica 59, 667–686 (1991)
Article Google Scholar
Hahn, J.-H., Kim, J., Kim, S.-H., Lee, J.: Screening Loss Averse Consumers. mimeo (2012)
Heidhues, P., Kőszegi, B.: Competition and Price Variation when Consumers are Loss Averse. Am. Econ. Rev. 98, 1245–1268 (2008)
Article Google Scholar
Heidhues, P., Kőszegi, B.: Exploiting Naivete about Self-Control in the Credit Market. Am. Econ. Rev. 100, 2279–2303 (2010)
Article Google Scholar
Heidhues, P., Kőszegi, B.: Regular Prices and Sales. Theor. Econ. 9, 217–251 (2014)
Article Google Scholar
Herweg, F., Mierendorff, K.: Uncertain Demand, Consumer Loss Aversion, and Flat-Rate Tariffs. J. Eur. Econ. Assoc. 11, 399–432 (2013)
Article Google Scholar
Herweg, F., Múller, D., Weinschenk, P.: Binary Payment Schemes: Moral Hazard and Loss Aversion. Am. Econ. Rev. 100, 2451–2477 (2010)
Article Google Scholar
Huntington, P.: Ticket Pricing Policy and Box Office Revenue. J. Cult. Econ. 17, 71–87 (1993)
Article Google Scholar
Jeleva, M., Villeneuve, B.: Insurance Contracts with Imprecise Probabilities and Adverse Selection. Econ. Theory 23, 777–794 (2004)
Article Google Scholar
Kahneman, D.J., Tversky, A.: Prospect Theory: An Analysis of Decision under Risk. Econometrica 47, 263–291 (1979)
Article Google Scholar
Karle, H., Peitz, M.: Competition under Consumer Loss Aversion. RAND J. Econ. 45, 1–31 (2013)
Article Google Scholar
Kőszegi, B.: Behavioral Contract Theory. J. Econ. Lit. 52, 1075–1118 (2014)
Article Google Scholar
Kőszegi, B., Rabin, M.: A Model of Reference-Dependent Preferences. Q. J. Econ. 121, 1133–1165 (2006)
Google Scholar
Kőszegi, B., Rabin, M.: Reference-Dependent Risk Attitudes. Am. Econ. Rev. 97, 1047–1073 (2007)
Article Google Scholar
Kőszegi, B., Rabin, M.: Reference-Dependent Consumption Plans. Am. Econ. Rev. 99, 909–936 (2009)
Article Google Scholar
Lange, A., Ratan, A.: Multi-dimensional Reference-Dependent Preferences in Sealed-Bid Auctions: How (Most) Laboratory Experiments Differ from the Field. Games Econ. Behav. 68, 634–645 (2010)
Article Google Scholar
Leslie, P.: Price Discrimination in Broadway Theatre. RAND J. Econ. 35, 520–541 (2004)
Article Google Scholar
Loomes, G., Sugden, R.: Disappointment and Dynamic Consistency in Choice under Uncertainty. Rev. Econ. Stud. 53, 271–282 (1986)
Article Google Scholar
Mas, A.: Pay, Reference Points, and Police Performance. Q. J. Econ. 121, 783–821 (2006)
Article Google Scholar
Masatlioglu, Y., Raymond, C.: A Behavioral Analysis of Stochastic Reference Dependence. Am. Econ. Rev. 106(9):2760–2782 (2016)
Mussa, M., Rosen, S.: Monopoly and Product Quality. J. Econ. Theory 18, 301–317 (1978)
Article Google Scholar
Orhun, A.Y.: Optimal Product Line Design When Consumers Exhibit Choice Set-Dependent Preferences. Mark. Sci. 28, 868–886 (2009)
Article Google Scholar
Rosato, A.: Selling Substitute Goods to Loss-Averse Consumers: Limited Availability, Bargains and Rip-Offs. RAND. J. Econ. 47(3):709–733 (2016)
Shalev, J.: Loss Aversion Equilibrium. Int. J. Game Theory 29, 269–287 (2000)
Article Google Scholar
Spiegler, R.: Bounded Rationality and Industrial Organization. Oxford University Press, New York (2011)
Book Google Scholar
Spiegler, R.: Monopoly Pricing when Consumers are Antagonized by Unexpected Price Increases: A Cover Version of the Heidhues- Kőszegi-Rabin Model. Econ. Theory 51, 695–711 (2012)
Article Google Scholar
Spence, A.M.: Multi-Product Quantity-Dependent Prices and Profitability Constraints. Rev. Econ. Stud. 47, 821–841 (1980)
Article Google Scholar
Sprenger, C.: An Endowment Effect for Risk: Experimental Tests of Stochastic Reference Points. J. Pol. Econ. 123, 1456–1499 (2015)
Article Google Scholar
Tversky, A., Kahneman, D.: Loss Aversion in Riskless Choice: A Reference-Dependent Model. Q. J. Econ. 106, 1039–1061 (1991)
Article Google Scholar
Zhou, J.: Reference Dependence and Market Competition. J. Econ. Manag. Strat. 20, 1073–1097 (2011)
Article Google Scholar

Download references

Acknowledgements

The authors have received helpful comments from Juan Carlos Carbajal, Yeon-Koo Che, Faruk Gul, Paul Heidhues, Navin Kartik, Shaowei Ke, Fuhito Kojima, Tracy Lewis, Stephen Morris, Matthew Rabin, Tim Van Zandt as well as seminar participants at AMES 2013, Beijing GSM, CUHK, Gerzensee 2011, KEA/KAEA 2011, New South Wales, Sogang and Yonsei.

Author information

Authors and Affiliations

School of Economics, Yonsei University, Seoul, 03722, Korea
Jong-Hee Hahn & Sang-Hyun Kim
School of Economics, Seoul National University, Seoul, 08826, Korea
Jinwoo Kim & Jihong Lee

Authors

Jong-Hee Hahn
View author publications
You can also search for this author in PubMed Google Scholar
Jinwoo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sang-Hyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jihong Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jihong Lee.

Additional information

This paper was previously circulated under the title “Screening Loss Averse Consumers.” This research was supported by the National Research Foundation of Korea Grant funded by the Korean Government (NRF-2014S1A5A2A03065638) and by the Institute of Economic Research of Seoul National University.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 417 KB)

Appendices

Appendix 1: Omitted proofs from Sect. 3

In the proofs throughout the Appendix, we simplify notation by letting $v_{L}:=v(q_{L})$ and $v_H:=v (q_H)$, and refer to their derivatives as $v_L^{\prime }$ and $v_H^{\prime }$, respectively.

Proof of Lemma 1

The proof consists of two claims.

Claim

If the optimal menu satisfies $\theta _{H} v_H \ge \theta _{L} v_L$, then it must be that $t_{H} \ge t_{L}$.

Proof

Suppose to the contrary that $t_L > t_H$. Clearly, we must have both (IR) constraints binding or

$$\begin{aligned}&u (\emptyset | \theta _H,R) = u (r_L | \theta _L,R) = \theta _L v_L -t_L -(1-p)\lambda [\theta _H v_H -\theta _L v_L +t_L -t_H] \end{aligned}$$

(34)

$$\begin{aligned}&u (\emptyset | \theta _L,R) = u (r_H|\theta _H,R) = \theta _H v_H -t_H + p [\theta _H v_H -\theta _L v_L + t_L -t_H]. \end{aligned}$$

(35)

Since $u(\emptyset | \theta _H,R) = u(\emptyset | \theta _L,R) $, equating (34) and (35) yields

$$\begin{aligned} {[}1+p + (1-p)\lambda ] (t_L -t_H) = [1+p +(1-p)\lambda ] (\theta _L v_L -\theta _H v_H), \end{aligned}$$

which is a contradiction since $t_L -t_H > 0$ but $\theta _L v_L - \theta _H v_H \le 0$.

Claim

It is never optimal to offer a menu with $\theta _{L} v(q_{L}) > \theta _{H} v(q_{H})$.

Proof

Suppose that $\theta _L v(q_L) > \theta _H v(q_H)$. A similar argument to that in the proof of Claim 1 can be used to show $t_L \ge t_H$. Then, rewrite (IR) constraints as

$$\begin{aligned} t_H \le \theta _H\frac{\lambda +1}{2} v_H \ \text { and } \ t_L \le t_H + \frac{\theta _L v_L -\theta _H v_H}{B(p,\lambda )}. \end{aligned}$$

Since both constraints must clearly be binding, we can substitute these into the objective function and take the first-order conditions (FOCs) as follows:

$$\begin{aligned} \frac{c}{v_L^{\prime }} = \frac{\theta _L}{B(p,\lambda )} \ \text { and } \frac{c}{v_H^{\prime }} = \frac{ \left[ (\lambda +1)B(p,\lambda )-2p\right] \theta _{H}}{2(1-p) B(p,\lambda )}. \end{aligned}$$

This, however, yields a contradiction since

$$\begin{aligned} \frac{c}{v_H^{\prime }} - \frac{c}{v_L^{\prime }}= & {} \frac{ (\lambda +1)B(p,\lambda )\theta _H - 2 \left[ p \theta _H+(1-p) \theta _L \right] }{2(1-p) B(p,\lambda )} \\\ge & {} \frac{ \left[ (\lambda +1)B(p,\lambda ) - 2 \right] \theta _H}{2(1-p) B(p,\lambda )} \ge 0, \end{aligned}$$

where the last inequality holds since $(\lambda +1)B(p,\lambda ) \ge (\lambda +1)B(0,\lambda ) = 2, \forall \lambda ,p$.

Proof of Proposition 3

Consider maximizing profit under the (IC) and (IR) constraints and under the quality constraint, $q_{H} -q_{L} \ge 0$. We show the following: When the quality constraint is not binding, the optimal qualities must be given by (13) and (14), and the quality constraint is binding if (15) holds.

First, one can easily check that $({\text {IR}}_{H})$ is implied by $({\text {IR}}_{L})$ and $({\text {IC}}_{H})$ since

$$\begin{aligned} u(r_{H}|\theta _{H},R)\ge u (r_{L} | \theta _{H}, R) \ge u(r_{L}|\theta _{L},R)\ge u(\emptyset |\theta _{L},R)=u(\emptyset |\theta _{H},R), \end{aligned}$$

(36)

where the second inequality holds since if two types choose the same bundle, $r_{L}$, then $\theta _{H}$ is better off in terms of both intrinsic and gain–loss utilities. Next, after rearrangement, we can write $({\text {IR}}_L)$ and (IC) constraints, respectively, as

$$\begin{aligned}&t_L \le \theta _L \frac{\lambda +1}{2} v_L \end{aligned}$$

(37)

$$\begin{aligned}&\frac{\theta _{L}}{B (p,\lambda )} \underbrace{\le }_{({\text {IC}}_L)} \frac{t_{H}-t_{L}}{v_{H}-v_{L}} \underbrace{\le }_{({\text {IC}}_H)} \frac{\theta _{H}}{B (p,\lambda )}, \end{aligned}$$

(38)

where $B(p,\lambda )=\frac{1+(1-p) +p\lambda }{1+p +(1-p)\lambda }$ as defined in (5).^{Footnote 23} By the usual argument, $({\text {IR}}_{L})$ and $({\text {IC}}_{H})$ must be binding.

Using the two binding constraints, we obtain (10) and (12) for $t_{L}$ and $t_{H}$, respectively. Substituting these into the objective function, the seller’s problem becomes

$$\begin{aligned} \max _{ \{ q_L, q_H \}} \;&p (t_L - cq_L ) +(1-p)(t_H -cq_H) \\&=\frac{\lambda +1}{2} \theta _L v_L +(1-p)\frac{\theta _H (v_H -v_L) }{B(p,\lambda )} - p c q_L - (1-p) c q_H, \end{aligned}$$

subject to $q_H \ge q_L$. Ignoring the quality constraint for the moment, the FOCs with respect to $q_{L}$ and $q_{H}$ yield (13) and (14). One can then check that the RHS of (14) is no larger than that of (13) if and only if the inequality (15) holds, which means that the quality constraint is binding in such a case.

To obtain the comparative statics for $q_{L}$, $q_{H}$, and $\lambda _S$, let us first observe the following facts: (i) $B(p,\lambda )$ increases with $\lambda >1$ if and only if $p > \frac{1}{2}$; (ii) $B(p,\lambda )$ increases with p if $\lambda >1$; and (iii) $(\lambda +1)B(p,\lambda )$ increases from 1 to infinity as $\lambda $ increases starting from $\lambda =1$. The comparative statics for $q_{H}$ directly follows from (i) and the fact that $\frac{c}{v^{\prime }(\cdot )}$ is increasing. As for the comparative statics regarding $q_{L}$, rewrite the maximand in (13) as

$$\begin{aligned} \frac{ \theta _{L}- \left[ 2(1-p)\theta _{H} \right] /\left[ (\lambda +1)B(p,\lambda )\right] }{2p/(\lambda +1)}, \end{aligned}$$

whose numerator increases in $\lambda $ by (iii), while its denominator decreases. So the optimal $q_{L}$, if not 0, must increase. The existence and properties of $\lambda _S$ follow from (ii) and (iii). $\square $

Before proving Lemma 2 and Proposition 4, we write here the (IC) and (IR) constraints for the reverse-screening menu, whose forms differ depending on whether $\theta _{L}v_L \ge \theta _{H} v_H$ or $\theta _{L} v_{L} \le \theta _{H}v_H$.^{Footnote 24}

In case $\theta _{L} v_{L} \ge \theta _{H} v_H$, the constraints $({\text {IC}}_H)$, $({\text {IC}}_L)$, $({\text {IR}}_H)$, and $({\text {IR}}_L)$ are, respectively, given as

$$\begin{aligned}&\displaystyle t_{L}-t_{H} \ge \frac{[1+(1-p)+p\lambda ]\theta _{H}(v_L-v_H)- p(\lambda -1) (\theta _{H}-\theta _{L}) v_L}{1+p+(1-p)\lambda } \end{aligned}$$

(39)

$$\begin{aligned}&\displaystyle \frac{[1+(1-p)+p\lambda ]\theta _{L}(v_L-v_H)+ (1-p)(\lambda -1)(\theta _{H}-\theta _{L})v_H}{1+p+(1-p)\lambda } \ge t_{L}-t_{H} \end{aligned}$$

(40)

$$\begin{aligned}&\displaystyle \theta _{H}(\lambda +1)v_H\ge 2t_{H} \end{aligned}$$

(41)

$$\begin{aligned}&\displaystyle {[}1+(1-p)+p\lambda ]\theta _{L}v_L+(1-p)(\lambda -1)\theta _{H}v_H\nonumber \\&\displaystyle \ge [1+p+(1-p)\lambda ]t_{L}-(1-p)(\lambda -1)t_{H} \end{aligned}$$

(42)

while in case $\theta _{L} v_L \le \theta _{H} v_H$, they are given as

$$\begin{aligned}&{[}1+p+(1-p)\lambda ](t_{L}-t_{H})\ge 2\theta _{H}(v_L-v_H) \end{aligned}$$

(43)

$$\begin{aligned}&(\lambda +1)\theta _{L}(v_L-v_H)\ge [1+p+(1-p)\lambda ](t_{L}-t_{H}) \end{aligned}$$

(44)

$$\begin{aligned}&{[}1+p+(1-p)\lambda ]\theta _{H}v_H+p(\lambda -1)\theta _{L}v_L\ge 2t_{H} \end{aligned}$$

(45)

$$\begin{aligned}&(\lambda +1)\theta _{L}v_L\ge [1+p+(1-p)\lambda ]t_{L}-(1-p)(\lambda -1)t_{H}. \end{aligned}$$

(46)

Proof of Lemma 2

To prove (a), let us consider both cases of reverse-screening menu. In case $\theta _{L} v_L \le \theta _{H} v_H$, the LHS of (44) being greater than the RHS of (43) yields (16) after rearrangement. In case $\theta _{L}v_L \ge \theta _{H} v_H$, combining (40) and (39) yields

$$\begin{aligned} (\lambda +1) v_{H} - 2 v_{L} \ge 0, \end{aligned}$$

(47)

which implies $\frac{\lambda +1}{2} \ge \frac{v_L}{v_H} \ge \frac{\theta _{H}}{\theta _{L}}.$

To prove (b), consider maximizing profit under the constraints (39) to (42) and the constraint that $\theta _{L} v_L - \theta _{H} v_H \ge 0.$ It suffices to show that the last constraint must be binding. First, the same inequalities as in (36) can be used to show that (41) is implied by (39) and (42). Next, to identify the binding constraints, we depict as a shaded area in Fig. 4 below the set of $(t_L,t_H)$’s satisfying (39), (40), and (42) for any given $q_L$ and $q_H$. Clearly, $({\text {IC}}_H)$ and $({\text {IR}}_L)$, i.e., (39) and (42), must be binding.^{Footnote 25}

Combining the two binding constraints, we obtain

$$\begin{aligned} t_{H} (q_{L},q_{H})&=\frac{(\lambda +1)\theta _{H}v_H}{2}-2(\theta _{H}-\theta _{L})v_L \end{aligned}$$

(48)

$$\begin{aligned} t_{L} (q_{L},q_{H})&=\frac{[1+(1-p)+p\lambda ] \theta _{H}(v_L-v_H)-p(\lambda -1)(\theta _{H}-\theta _{L})v_L}{ 1+p+(1-p)\lambda }\nonumber \\&\quad +t_{H} (q_{L},q_{H}). \end{aligned}$$

(49)

Note that given (39) is binding, (40) is satisfied if and only if (47) is satisfied so we can replace (40) by (47). Now, using (47) and $\theta _{L} v_L -\theta _{H} v_H \ge 0$ as constraints, the Lagrangian for the maximization problem can be written as

$$\begin{aligned} \mathcal {L} (q_{L},q_{H},\mu ,\eta )&= p (t_{L} (q_{L}, q_{H}) -cq_{L}) + (1-p) (t_{H} (q_{L},q_{H}) -cq_{H}) \nonumber \\&\quad + \mu \left[ (\lambda +1)v_H -2v_L\right] + \eta ( \theta _{L} v_L -\theta _{H} v_H ). \end{aligned}$$

(50)

where $\mu $ and $\eta $ are nonnegative multipliers. Suppose that $\theta _{L} v_{L} -\theta _{H} v_H \ge 0$ is not binding so $\eta =0$. Substituting (48) and (49) into (50), the FOCs are given by

$$\begin{aligned}&\frac{c}{v^{\prime }_{L}} +\frac{2\mu }{p} =\frac{ [1+p-p^{2}+(1-p+p^{2})\lambda ]\theta _{L}-(1-p)(\lambda +1)\theta _{H}}{ p[1+p+(1-p)\lambda ]} =: \varPsi _L \end{aligned}$$

(51)

$$\begin{aligned}&\frac{c}{v^{\prime }_{H}} - \frac{\mu (\lambda +1)}{1-p} =\frac{ [1-2p+2(1+p)\lambda +\lambda ^{2}]\theta _{H}}{2[1+p+(1-p)\lambda ]} =: \varPsi _H. \end{aligned}$$

(52)

One can verify that $\varPsi _H > \varPsi _L$. Given this, and $\mu \ge 0$, (51) and (52) require $q_H >q_L$, which is a contradiction. $\square $

Proof of Proposition 4

By Lemma 2, we focus on the case $\theta _H v_H - \theta _L v_L \ge 0$. Consider maximizing profit under the constraints (43) to (46), and $\theta _H v_H - \theta _L v_L \ge 0$, also subject to $q_L \ge q_H$. We show that $q_L = q_H$ if (17) holds.

Fist, we can ignore (45) since it is implied by (43) and (46) for the same reason as in (36). To identify the binding constraints, with $q_L$ and $q_H$ fixed, we depict the set of $(t_L,t_H)$ satisfying the constraints, (43), (44), and (46), to obtain the same graph as Fig. 4. From this, it is immediate that (46) and (43) are binding, which gives us

$$\begin{aligned} t_{H} (q_L,q_H)&=\frac{(\lambda +1)\theta _{L}v_L}{2}-\theta _{H}(v_L-v_H) \\ t_{L} (q_L,q_H)&=\frac{(\lambda +1)\theta _{L}v_L}{2} - \theta _H \frac{ (1-p)(\lambda -1)}{1+p +(1-p)\lambda } (v_L - v_H). \end{aligned}$$

Ignoring the constraint $q_L \ge q_H$ for the moment, we set up the Lagrangian

$$\begin{aligned} \mathcal {L} (q_{L},q_{H},\mu ,\eta )&= p (t_{L} (q_{L}, q_{H}) -cq_{L}) + (1-p) (t_{H} (q_{L},q_{H}) -cq_{H}) \\&\quad + \mu ((\lambda +1)v_H -2v_L) + \eta (\theta _{H} v_H- \theta _{L} v_L), \end{aligned}$$

whose FOCs are

$$\begin{aligned} \frac{c}{v^{\prime }(q_{L})} +\frac{ 2 \mu +\eta \theta _L}{p}&=\frac{ [1+p+(1-p)\lambda ](\lambda +1 )\theta _{L}- 2 (1-p)(\lambda +1) \theta _{H}}{ 2p[1+p+(1-p)\lambda ]} \nonumber \\&=: \widetilde{\varPsi }_L \end{aligned}$$

(53)

$$\begin{aligned} \frac{c}{v^{\prime }(q_{H})} -\frac{\mu (\lambda +1) + \eta \theta _H}{1-p}&=\frac{ (\lambda +1)\theta _H}{1+p+(1-p)\lambda } =: \widetilde{\varPsi }_H. \end{aligned}$$

(54)

One can check that $\widetilde{\varPsi }_H \ge \widetilde{\varPsi }_L$ if (17) holds, which implies that given $\mu ,\eta \ge 0$, (53) and (54) can only be satisfied when $q_H \ge q_L$. Thus, $q_L = q_H$ if (17) holds.

Now rewrite (17) as

$$\begin{aligned} \lambda \le \frac{1}{1-p}\left( \frac{2\theta _{H}}{\theta _{L}} - (1+p)\right) =:\lambda _R. \end{aligned}$$

It is straightforward to check that $\lambda _R$ increases with both p and $\frac{\theta _H}{\theta _L}$. This completes the proof of Proposition 4. $\square $

Appendix 2: Omitted proofs from Sect. 4

Proof

(Proof of Lemma 3) Part (a) Consider any screening menu R and a deviation $R^{\prime }=R^{H}$, for which $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U) can be written as

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_{H}-v_{L}} > \theta _{L}\frac{\lambda +1}{2} \end{aligned}$$

(55)

$$\begin{aligned}&\displaystyle t_{H} > \theta _{L}\frac{\lambda +1}{2} v_H \end{aligned}$$

(56)

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_H -v_L} > \theta _{H} \frac{ 1+p+(1-p)\lambda }{2} + \frac{p(\lambda -1)}{2}\left( \frac{\theta _{L}v_{H} -\theta _{H} v_{L}}{v_H -v_L}\right) \mathbbm {1}_{\{\theta _{L}v_{H} -\theta _{H} v_{L} \ge 0\}} \end{aligned}$$

(57)

$$\begin{aligned}&\displaystyle t_{H} > \frac{1}{2}\big ( \theta _{H}\left( 1+p+(1-p)\lambda \right) +p(\lambda -1)\theta _{L}\big ) v_H \end{aligned}$$

(58)

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_{H}-v_{L}} \ge \theta _{L}\frac{1+(1-p)(\lambda -1)}{1-(1-p)(\lambda -1)} > 0. \end{aligned}$$

(59)

Note first that (57) contradicts $({\text {IC}}_H)$ in (38) since the RHS of (57) is greater than $ \frac{\theta _H}{B(p,\lambda )}$. It is straightforward to check that the RHS of (58) is greater than the RHS of (56), so (58) implies (56). Also, subtracting the $({\text {IR}}_L)$ constraint, $t_L \le \theta _L \frac{\lambda +1}{2} v_L$, from (56) and rearranging yields (55). Thus, the PPE requirement with respect to $R^H$ boils down to satisfying either (55) or (59), which gives us the condition in (23).

Part (b) Let us next consider a deviation $R^{\prime }= R^L$, for which $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U) can be written as

$$\begin{aligned}&\displaystyle \frac{t_{H}{-}t_{L}}{v_{H}{-}v_{L}} < \theta _{L}\frac{1{+}p{+} (1-p)\lambda }{\lambda +1}{-}\frac{(\lambda -1)(1-p)}{(\lambda +1)} \left( \frac{\theta _{L}v_{H}{-}\theta _{H}v_{L}}{ v_H - v_L}\right) \mathbbm {1}_{\{\theta _{L}v_{H}-\theta _{H}v_{L}\ge 0\}} \end{aligned}$$

(60)

$$\begin{aligned}&\displaystyle \frac{t_{L}}{v_L} > \theta _{L}\frac{\lambda +1}{2} \end{aligned}$$

(61)

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_{H}-v_{L}} < \theta _{H}\frac{2}{\lambda +1} \end{aligned}$$

(62)

$$\begin{aligned}&\displaystyle \frac{t_{L}}{v_L} > \frac{1}{2}\big ( \theta _{H}\left( 1+p+(1-p)\lambda \right) +p(\lambda -1)\theta _{L}\big ) \end{aligned}$$

(63)

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_{H}-v_{L}} \le \theta _{H}\frac{1-p(\lambda -1)}{1+p(\lambda -1)}. \end{aligned}$$

(64)

Note first that (60) contradicts $({\text {IC}}_L)$ in (38) since the RHS of (60) is smaller than $\frac{\theta _L}{B(p,\lambda )}$. It is straightforward to check that (61) and (63) contradict $({\text {IR}}_L)$ in (37), so they can be ignored. Thus, the PPE requirement with respect to $R^L$ boils down to satisfying either (62) or (64), which gives us the condition in (24).

The statement that $({\text {IC}}_L)$ are $({\text {IC}}_H)$ are implied by (23) and (24) can be established if we show that for any $p \in (0,1)$ and $\lambda >1$,

$$\begin{aligned} \alpha (p,\lambda ) \ge \frac{1}{B(p,\lambda )} \ge \beta (p,\lambda ). \end{aligned}$$

(65)

To do so, observe first that $ \frac{\lambda +1}{2}> \frac{1}{ B(p,\lambda )} =\frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda } > \frac{2}{\lambda +1}$. Given this, (65) will be shown if (i) $\frac{1+(1-p)(\lambda -1)}{1-(1-p)(\lambda -1)} \ge \frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda }$ when $p \ge \frac{\lambda +2}{\lambda +3} $ and (ii) $\frac{1-p(\lambda -1)}{1+p(\lambda -1)} \le \frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda }$ when $p \le \frac{1}{\lambda +3} $. To prove this, observe that if $p \ge \frac{\lambda +2}{\lambda +3}$, then

$$\begin{aligned}&\frac{1+(1-p)(\lambda -1)}{1-(1-p)(\lambda -1)} - \frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda } \\&\quad =\frac{(\lambda -1) (\lambda +2- p (\lambda +1)) }{(1-(1-p)(\lambda -1)) ( 1+(1-p)+p\lambda )}> 0, \end{aligned}$$

and also that

$$\begin{aligned} \frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda } -\frac{1- p(\lambda -1)}{1+p(\lambda -1)} = \frac{(\lambda -1) (1+p+p \lambda )}{(1+(1-p)+p\lambda ) (1+p(\lambda -1))} >0, \end{aligned}$$

as desired. $\square $

1.1 Proof of Proposition 5

Part (a) We prove the if part in the proof of Theorem 3 by constructing a screening PPE menu that is optimal among all PPE menus under the given condition. Here we only prove the only if part. To do so, note that we can combine (23) and (24) in Lemma 3 to write the following necessary condition for any screening menu R to be a PPE:

$$\begin{aligned} \theta _L \alpha (p,\lambda ) \le \frac{t_H -t_L}{v_H -v_L} \le \theta _H \beta (p,\lambda ) \end{aligned}$$

(66)

with the first inequality being strict if $p < \frac{\lambda +2}{\lambda +3}$ and the second inequality being strict if $p > \frac{1}{\lambda +3}$. This condition can hold only if the RHS of (66) is greater than the LHS, that is, $\frac{\theta _H}{\theta _L} > \frac{\alpha (p,\lambda )}{\beta (p,\lambda )}=: \gamma (p,\lambda )$. The following claim establishes several properties of $\gamma $.

Claim

$\gamma $ is a continuous function that satisfies the following properties:

(i)
$\gamma (p,\cdot )$ is strictly increasing;
(ii)
$\gamma (p,1) =1$ and $\lim _{\lambda \rightarrow \infty } \gamma (p,\lambda ) =\infty $; and
(iii)
$\gamma (\cdot , \lambda )$ is strictly increasing if $p < \frac{1}{\lambda +3}$, constant if $p \in [\frac{1}{\lambda +2},\frac{\lambda +2}{\lambda +3}]$, and strictly decreasing if $p > \frac{\lambda +2}{\lambda +3}$.

Proof

Clearly, $\gamma $ is continuous. It is also clear that $\alpha (p,\cdot )$ is strictly increasing and $\beta (p,\cdot )$ is strictly decreasing; thus, $\gamma (p,\cdot )$ is strictly increasing, proving (i). To prove (ii), note that $\alpha (p,1) = \beta (p,1) = 1$ so $\gamma (p,1) =1$. Also, for sufficiently large $\lambda $, $\alpha (p,\lambda ) = \frac{\lambda +1}{2}$ and $\beta (p, \lambda ) = \frac{2}{\lambda +1}$, so $\lim _{\lambda \rightarrow \infty } \gamma (p,\lambda ) = \lim _{\lambda \rightarrow \infty } \frac{(\lambda +1)^2}{4} = \infty . $ Part (iii) follows from combining that $\gamma (p,\lambda ) =\frac{\alpha (p,\lambda )}{\beta (p,\lambda )}$ and the fact that $\alpha (\cdot ,\lambda )$ is strictly decreasing if $p > \frac{\lambda +2}{\lambda +3}$ and constant otherwise while $\beta (\cdot ,\lambda )$ is strictly decreasing if $p < \frac{1}{\lambda +3}$ and constant otherwise. $\square $

Properties (i) and (ii) along with the continuity of $\gamma $ imply that there exists $\overline{\lambda }_S >1$ such that $\frac{\theta _H}{\theta _L} > \gamma (p,\lambda )$ if and only if $\lambda < \overline{\lambda }_S$.

Next observe that by (iii), we have $ \max _{p\in [0,1]} \gamma (p,\lambda ) = \frac{(\lambda +1)^2}{4} = \gamma (p,\lambda ) $ for any $p \in [\frac{1}{\lambda +3}, \frac{\lambda +2}{\lambda +3}]$. Letting $\underline{\lambda } = 2 \sqrt{\frac{\theta _H}{\theta _L} }-1$, we solve $ \frac{(\lambda +1)^2}{4} =\frac{\theta _H}{\theta _L}$ to obtain $\lambda = \underline{\lambda } $. Thus, letting $\underline{p} = \frac{1}{\underline{\lambda } +3} $ and $ \overline{p} = \frac{\underline{\lambda } +2}{\underline{\lambda } +3}$, we have $\overline{\lambda }_S =\underline{\lambda }$ for $p \in [\underline{p},\overline{p}]$. The fact that $\gamma (\cdot , \underline{\lambda })$ is increasing in p if $p \in (0,\frac{1}{\underline{\lambda }+3})$ and $\gamma $ is increasing in $\lambda $, means that $\overline{\lambda }_S$ is decreasing in p and greater than $\underline{\lambda }$ for $p \in (0,\frac{1}{\underline{\lambda }+3})$. Similarly, $\overline{\lambda }_S$ is decreasing in p and greater than $\underline{\lambda }$ in the range $ (\frac{\underline{\lambda }+2}{\underline{\lambda }+3},1)$. Also, the monotonicity of $\overline{\lambda }_S$ implies that $\overline{\lambda }_S$ is maximized at $p =0$ or 1. Since $\gamma (0,\lambda ) =\gamma (1,\lambda ) = \frac{\lambda +1}{2}$, solving $\frac{\lambda +1}{2}=\frac{\theta _H}{\theta _L}$ yields $ \lambda = \frac{2\theta _H}{\theta _L} -1$. Thus, if we let $\overline{\lambda } = \frac{2\theta _H}{\theta _L} -1$, then $\overline{\lambda }_S < \overline{\lambda }$ for all $p\in (0,1)$.

Part (b) Consider any reverse-screening menu satisfying $\theta _{H}v_{H}\ge \theta _{L}v_{L}$. (We will later discuss reverse-screening menu with $\theta _H v_H < \theta _L v_L$.) To facilitate the reference, we rewrite here $({\text {IC}}_H)$ and $({\text {IC}}_L)$ constraints in (43) and (44) as

$$\begin{aligned} \frac{2\theta _{H}}{1+p+(1-p)\lambda } \underbrace{\le }_{ ({\text {IC}}_H)} \frac{t_{L}-t_{H}}{v_{L}-v_{H}} \underbrace{\le }_{({\text {IC}}_L)} \frac{(\lambda +1)\theta _{L}}{1+p+(1-p)\lambda }. \end{aligned}$$

(67)

Let us first consider a deviation $ R^{\prime } =R^{L}=\{b_{L}\}$. The conditions $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U) are given as

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&> \theta _{L}\frac{\lambda +1}{2} \end{aligned}$$

(68)

$$\begin{aligned} \frac{t_{L}}{v_{L}}&> \theta _{L}\frac{\lambda +1}{2} \end{aligned}$$

(69)

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&> \theta _{H}\frac{1+p+(1-p)\lambda }{2} \end{aligned}$$

(70)

$$\begin{aligned} \frac{t_{L}}{v_{L}}&> \frac{1}{2}\left[ \theta _{H}\left( 1+p+(1-p)\lambda \right) +p(\lambda -1)\theta _{L}\right] \end{aligned}$$

(71)

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&{\left\{ \begin{array}{ll} \ge \theta _H &{} \text{ if } p(\lambda -1)<1 \\ \le \theta _H &{} \text{ if } p(\lambda -1)>1, \end{array}\right. } \end{aligned}$$

(72)

respectively. In addition, (U) always holds if $p(\lambda -1)=1$. Note first that (68) contradicts $({\text {IC}}_L)$ in (67). We can also ignore (71) because it implies (69). However, (69) contradicts $({\text {IR}}_L)$ in (46) since $(1+p+(1-p)\lambda )t_{L}-(1-p)(\lambda -1)t_{H} > 2 t_L$. Thus, the reverse-screening menu R must satisfy either (70) or (72). Since $\frac{1+p +(1-p)\lambda }{2} >1$, (70) implies (72) in case $p(\lambda -1)<1$, which means that (72) must hold if $p(\lambda -1)<1$. If $p(\lambda -1)>1$, then either (70) or (72) must hold.

Consider next a deviation $R^{\prime } =R^{H}=\{b_{H}\}$. Then, the conditions $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U) are given as

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&< \theta _{L}\frac{1+p+(1-p)\lambda }{\lambda +1} \end{aligned}$$

(73)

$$\begin{aligned} \frac{t_{H}}{v_{H}}&> \theta _{L}\frac{\lambda +1}{2} \end{aligned}$$

(74)

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&< \theta _{H}\frac{2}{\lambda +1} \end{aligned}$$

(75)

$$\begin{aligned} \frac{t_{H}}{v_{H}}&> \frac{1}{2}\left[ \theta _{H}\left( 1+p+(1-p)\lambda \right) +p(\lambda -1)\theta _{L}\right] \end{aligned}$$

(76)

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_{L}-v_{H}}&\le \theta _{L}, \end{aligned}$$

(77)

respectively. First, each of (73) and (75) implies (77) since the right-hand sides of the former inequalities are both smaller than the RHS of the latter. So, (73) and (75) can be ignored. Similarly, (76) implies (74) and can thus be ignored. In sum, either (74) or (77) must hold for the buyer not to deviate to $R^{H}$.

Claim

If either $p (\lambda -1) <1 $ or $p(\lambda -1) \ge 1$ and $p \ge \frac{\theta _L}{2 \theta _H -\theta _L}$, then (74) cannot be satisfied by any reverse-screening PPE menu.

Proof

Suppose that under the assumed conditions, there is a reverse-screening PPE menu that satisfies (74). We first obverse that (43) and (74) together imply

$$\begin{aligned} \frac{t_{L}}{v_{L}}>\frac{2\theta _{H}}{1+p+(1-p)\lambda }. \end{aligned}$$

(78)

To see it, rewrite (43) and (74) as

$$\begin{aligned} t_L -t_H&\ge \left( \frac{2\theta _H}{1+p +(1-p)\lambda }\right) (v_L -v_H)\\ t_H&> \theta _L \left( \frac{\lambda +1}{2}\right) v_H. \end{aligned}$$

Sum up the two inequalities side by side to obtain

$$\begin{aligned} t_L> & {} \frac{2\theta _H}{1+p +(1-p)\lambda } v_L + \left( \theta _L \frac{\lambda +1}{2} -\frac{2\theta _H}{1+p +(1-p)\lambda }- \right) v_H \\> & {} \frac{2\theta _H}{1+p +(1-p)\lambda } v_L, \end{aligned}$$

where the second inequality holds since $\theta _L \frac{\lambda +1}{2} \ge \theta _H > \frac{2\theta _H}{1+p +(1-p)\lambda }$.

Let us consider the PPE requirement with respect to $R^{\emptyset } :=\{\emptyset \}$, which requires to satisfy at least one of the conditions, $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U), written as follows:

$$\begin{aligned} \frac{t_{L}}{v_{L}}&< \theta _{L}\frac{2}{\lambda +1} \end{aligned}$$

(79)

$$\begin{aligned} \frac{t_{H}}{v_{H}}&< \theta _{L}\frac{2}{\lambda +1} \end{aligned}$$

(80)

$$\begin{aligned} \frac{t_{L}}{v_{L}}&< \theta _{H}\frac{2}{\lambda +1} \end{aligned}$$

(81)

$$\begin{aligned} \frac{t_{H}}{v_{H}}&< \theta _{H}\frac{2}{\lambda +1} \end{aligned}$$

(82)

$$\begin{aligned} U(R)&\ge U(R^{\emptyset })=0 \end{aligned}$$

(83)

First, (79) and (80) can be ignored since they are implied by (81) and (82). Given that $\frac{2}{\lambda +1} < \frac{2}{1+p +(1-p)\lambda }$ and $\theta _H \frac{2}{\lambda +1} < \theta _H \le \theta _L \frac{\lambda +1}{2}$, (81) contradicts (78) while (82) contradicts (74). Therefore, (83) must hold. We prove below that (83) cannot hold if either (i) $p(\lambda -1)\le 1$ or (ii) $p(\lambda -1) > 1$ and $p \ge \frac{\theta _L}{2\theta _H -\theta _L}$.

Case (i) In this case, the first inequality of (72) must hold (since it is implied by (70) as mentioned above). Define $R^{\prime } = \{(q_L, t_L^{\prime }), (q_H,t_H^{\prime })\}$ to be a reverse-screening menu with the same quantities as R, where $t_H^{\prime } =\theta _L \frac{\lambda +1}{2} v_H$ and $t_L^{\prime } = t_H^{\prime } + \theta _H (v_L -v_H) = \theta _L \frac{\lambda +1}{2} v_H + \theta _H (v_L -v_H)$. Note that due to (74) and the first inequality of (72), we have $t_H^{\prime } < t_H$, $t_L^{\prime } < t_L$, and $t_L^{\prime } - t_H^{\prime } \le t_L -t_H $. Clearly, this implies that $U (R^{\prime }) > U(R) $. We now obtain

$$\begin{aligned} U (R^{\prime }) = v_H [ \theta _H - \textstyle \theta _L \frac{1+\lambda }{2}] + v_L p (\theta _H -\theta _L)[p (\lambda -1) -\lambda ]. \end{aligned}$$

Since $\theta _H \le \frac{\lambda +1}{2}\theta _L$ and $p (\lambda -1) < 1$, the expressions in both square brackets are negative, so $U(R)< U(R^{\prime }) < 0,$ which contradicts (83).

Case (ii) Define $R^{\prime \prime } = \{(q_L, t_L^{\prime \prime }), (q_H,t_H^{\prime \prime })\}$ to be a reverse-screening menu with the same qualities as R, where $t_H^{\prime \prime } =\theta _L \frac{\lambda +1}{2} v_H$ and $t_L^{\prime \prime } = t_H^{\prime \prime } + \frac{2\theta _H}{1+p+(1-p)\lambda } (v_L -v_H) = \theta _L \frac{\lambda +1}{2} v_H + \frac{2\theta _H}{1+p+(1-p)\lambda } (v_L -v_H)$. It can be verified that due to (43) and (74), we have $t_H^{\prime \prime } < t_H$, $t_L ^{\prime \prime } < t_L$, and $t_L^{\prime \prime } - t_H ^{\prime \prime } \le t_L -t_H $. Clearly, this implies that $U (R^{\prime \prime }) > U(R) $. We now show that $U(R^{\prime \prime }) <0$ so (83) cannot be satisfied. First, it can be shown that $U (R^{\prime \prime })$ is decreasing as $\lambda $ increases.^{Footnote 26} It thus suffices to show that $U (R^{\prime \prime })< 0$ for the lowest $\lambda $, which is equal to $1+\frac{1}{p}$ given the assumption that $p (\lambda -1) \ge 1$. Setting $\lambda =1 +\frac{1}{p}$, we obtain after rearrangement

$$\begin{aligned} U(R^{\prime \prime }) = \frac{-\theta _L v_L + 2 p^2 (2 \theta _H - \theta _L) (v_H - v_L) + p \theta _L (-3 v_H + 2 v_L)}{2 p (1 + p)}. \end{aligned}$$

(84)

Let h(p) denote the numerator of this expression as a function of p. To show that $h(p) < 0$ for any $p \ge \frac{\theta _L}{2\theta _H -\theta _L}$, observe that h is a quadratic function maximized at $p =\frac{\theta _L }{2 \theta _H-\theta _L} \left( \frac{2 v_L -3 v_H}{4v_L - 4v_H}\right) < \frac{\theta _L}{2\theta _H -\theta _L}$, where the inequality holds since $\frac{2 v_L -3 v_H}{4v_L - 4v_H}<1$. Thus, for any $p \ge \frac{\theta _L}{2\theta _H -\theta _L}$, $h(p) \le h (\frac{\theta _L}{2\theta _H -\theta _L}) =\frac{2\theta _L \theta _H v_H}{-2 \theta _H + \theta _L} <0$. $\square $

When $p (\lambda -1) <1$, the above argument and Claim 1 imply that (77) must hold. But (72) and (77) contradict each other, which means there does not exist a reverse-screening PPE menu if $p (\lambda -1) <1$. Thus, the existence of reverse-screening PPE menu requires $p (\lambda -1) \ge 1$ or $\lambda \ge 1 +\frac{1}{p}$. Define:

$$\begin{aligned} \overline{\lambda }_R := \max \left\{ \frac{2\theta _{H} -(1+p)\theta _{L}}{(1-p)\theta _{L}},1+\frac{1}{p}\right\} = {\left\{ \begin{array}{ll} 1 + \frac{1}{p} &{} \text{ if } p < \frac{\theta _L}{2\theta _H -\theta _L} \\ \frac{2\theta _{H} -(1+p)\theta _{L}}{(1-p)\theta _{L}} &{} \text{ if } p \ge \frac{\theta _L}{2\theta _H -\theta _L}. \end{array}\right. } \end{aligned}$$

Given this, if $p < \frac{\theta _L}{2 \theta _H -\theta _L}$, then $\lambda \ge \overline{\lambda }_R$ is necessary for the existence of reverse-screening PPE menu. Consider now the case where $p (\lambda -1) \ge 1$ and $p \ge \frac{\theta _L}{2\theta _H -\theta _L}$. In this case, according to Claim 1, we must satisfy (77), which contradicts with $({\text {IC}}_H)$ in (67) if $\theta _L < \frac{2\theta _H}{1+p + (1-p)\lambda }$. Thus, it is necessary to have $\theta _L \ge \frac{2\theta _H}{1+p + (1-p)\lambda }$ or $\lambda \ge \frac{2\theta _{H} -(1+p)\theta _{L}}{(1-p)\theta _{L}} = \overline{\lambda }_R$ for $p \ge \frac{\theta _L}{2 \theta _H -\theta _L}$. In sum, $\lambda \ge \overline{\lambda }_R $ is necessary for the existence of reverse-screening PPE menu. It is straightforward to verify that $\overline{\lambda }_S< \overline{\lambda } = \frac{2\theta _H}{\theta _L} -1 < \frac{2 \theta _H}{\theta _L} \le \overline{\lambda }_R$.

Recall that we have so far focused on a reverse-screening menu satisfying $\theta _H v_H \ge \theta _L v_L$. The necessity proof is then completed by the result in the following lemma that the necessary condition does not get relaxed by considering a reverse-screening menu with $\theta _H v_H < \theta _L v_L$.

Lemma 4

There exists a reverse-screening PPE menu with $\theta _{H}v_H < \theta _{L} v_L$ only if $\lambda \ge \overline{\lambda }_R$.

Proof

Fix any reverse-screening menu R with $\theta _L v_L > \theta _H v_H$. We prove that (i) the inequalities (39) and (40) corresponding to $({\text {IC}}_H)$ and $({\text {IC}}_L)$ imply their counterparts in (67) for a reverse-screening menu satisfying $\theta _H v_H \ge \theta _L v_L$; (ii) for each menu $R^{\prime } =R^H, R^L,$ or $R^\emptyset $, the inequalities corresponding to (FIC), (FIR), and (U) conditions imply their counterparts for reverse-screening menu with $\theta _H v_H \ge \theta _L v_L$. Given (i) and (ii), the proof of Part (b) in Proposition 5 can be repeated to show that $\lambda \ge \overline{\lambda }_R $ is also necessary for the existence of reverse-screening PPE menu satisfying $\theta _H v_H < \theta _L v_L$.

To show (i), consider any reverse-screening menu satisfying $\theta _L v_L > \theta _H v_H$ and write (39) and (40) as

$$\begin{aligned}&\frac{2\theta _{H}}{1+p+(1-p)\lambda } +\frac{p(\lambda -1) (\theta _L v_L - \theta _H v_H)}{(1+p +(1-p)\lambda )(v_L -v_H)} \\&\qquad \le \frac{t_{L}-t_{H}}{v_{L}-v_{H}} \le \frac{(\lambda +1)\theta _{L}}{1+p+(1-p)\lambda } - \frac{(1-p)(\lambda -1) (\theta _L v_L - \theta _H v_H)}{(1+p +(1-p)\lambda )(v_L -v_H)}, \end{aligned}$$

which clearly implies (67), given that $\theta _L v_L > \theta _H v_H.$

To show (ii), consider first the reference point $R^{L} = \{ b_L\}$. It is straightforward to see that the conditions $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, and $({\text {FIR}}_H)$ are the same ss $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, and $({\text {FIR}}_H)$ in (68), (69), and (71), respectively. The condition $({\text {FIC}}_H)$ is given as

$$\begin{aligned} \frac{ t_{L}-t_{H}}{v_L -v_H} > \frac{\left[ 1+p+\lambda (1-p)\right] \theta _{H}}{2}+ \frac{p(\lambda -1)( \theta _{L}v_{L}-\theta _{H}v_{H})}{2(v_L - v_H)}, \end{aligned}$$

which implies $({\text {FIC}}_H)$ in (70). Condition (U) is given as

$$\begin{aligned} \left[ 1-p(\lambda -1)\right] (t_{L}-t_{H})&\ge \left[ 1-p(\lambda -1)\right] \theta _{H} (v_{L}-v_{H})\nonumber \\&\quad +\, 2p(\lambda -1)(\theta _{L} v_L -\theta _{H}v_{H}), \end{aligned}$$

(85)

while (U) in (72) can be rewritten as

$$\begin{aligned} \left[ 1-p(\lambda -1)\right] (t_{L}-t_{H}) \ge \left[ 1-p(\lambda -1)\right] \theta _{H}(v_{L}-v_{H}). \end{aligned}$$

(86)

It is clear that neither condition holds when $1-p(\lambda -1) <0$ while (86) always holds when $1-p(\lambda -1)=1$. Subtracting the RHS of (86) from the RHS of (85) yields

$$\begin{aligned} 2p(\lambda -1)(\theta _{L}v_{L} -\theta _{H}v_{H}) >0. \end{aligned}$$

Thus, (85) implies (86) when $1- p (\lambda -1) >0$.

Next let us consider $R^{H} =\{b_H\}$. It is straightforward to see that $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, and $({\text {FIR}}_H)$ are the same as $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, and $({\text {FIR}}_H)$ in (74), (75), and (76), respectively. $({\text {FIC}}_L)$ is given as

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_L -v_H} < \theta _{L} \frac{1+p+\lambda (1-p)}{1+\lambda } - \frac{(\lambda -1)(1-p)\left( \theta _{L}v_{L} -\theta _H v_H \right) }{(1+\lambda )(v_L -v_H)}, \end{aligned}$$

which implies $({\text {FIC}}_L)$ in (73), given that $\theta _{L}v_{L} -\theta _H v_H >0$. Condition (U) is given as

$$\begin{aligned} \frac{t_{L}-t_{H}}{v_L -v_H} \le \theta _{L} - \frac{2(1-p)(\lambda -1)( \theta _{L}v_{L} - \theta _{H}v_{H} )}{(1+(1-p)(\lambda -1))(v_L -v_H)}, \end{aligned}$$

which implies (U) in (77).

Consider $R^{\prime } = R^\emptyset $. (FIC) and (FIR) are the same as their counterparts in the case $\theta _H v_H \ge \theta _L v_L$, which are given in (79) through (82). (U) is written as

$$\begin{aligned}&p(\theta _{L}v_{L}-t_{L})+(1-p)(\theta _{H}v_{H}-t_{H})-p(1-p)(\lambda -1)(\theta _{L}v_{L}-\theta _{H}v_{H}\\&\quad + t_{H} -t_L) \ge 0 \end{aligned}$$

while its counterpart in the case $\theta _H v_H \ge \theta _L v_L$ is

$$\begin{aligned}&p(\theta _{L}v_{L}-t_{L})+(1-p)(\theta _{H}v_{H}-t_{H})-p(1-p)(\lambda -1)(\theta _{H}v_{H}-\theta _{L}v_{L}\\&\quad +\,t_{H} -t_L) \ge 0. \end{aligned}$$

The latter inequality is implied by the former, given that $\theta _H v_H - \theta _L v_L <0 $. $\square $

Part (c) Given $R =\{b = (q,t)\}$ with $q,t >0$, the $(IR_L)$ constraint is $t \le \theta _L \frac{\lambda +1}{2} v(q)$. Note that this is the only constraint needed for R to be a PE. Given R, there are three possible deviations: $R^{\prime } = \{\emptyset ,\emptyset \}$, $\{\emptyset ,b \}$, and $\{b,\emptyset \}$. First, it is straightforward to verify that R always satisfies the PPE requirement with respect to $R^{\prime }= \{b, \emptyset \}$.^{Footnote 27} Next, $({\text {FIC}}_L)$ conditions for deviations $R^{\prime } = \{\emptyset ,\emptyset \}$ and $\{\emptyset ,b\}$ are given, respectively, as

$$\begin{aligned} t_L< \theta _L \frac{2}{1+\lambda } v (q) \; \; \text{ and } \; \; t_L < \theta _L \frac{1+p +(1-p)\lambda }{1+(1-p) +p \lambda }v (q). \end{aligned}$$

Thus, for any $q>0$, if we set $t =\theta _L k^{\prime } v(q)$ for some positive constant $k^{\prime } < \min \{\frac{2}{1+\lambda }, \frac{1+p +(1-p)\lambda }{1+(1-p) + p \lambda }\}$, then $R = \{(q,t)\}$ is a pooling PPE menu. The profit from this menu is $ \theta _L k^{\prime } v(q) - cq $, which is positive for a sufficiently small q, due to the assumption that $\lim _{q \rightarrow 0} v^{\prime }(q) =\infty $.

1.2 Proof of Theorem 3

We only need to prove part (b). The proof consists of two lemmas. In the first lemma, we consider a relaxed problem by weakening the PPE requirement as follows:

$$\begin{aligned} \max _{R = \{b_L,b_H\}} \; p (t_L -cq_L) +(1-p) (t_H - cq_H) \qquad \qquad \qquad [P^{\prime }] \end{aligned}$$

subject to the constraints that R is a PE menu and also satisfies the PPE requirement with respect to $R^L = \{b_L\}$, $R^H =\{b_H\}$, and $R^{\emptyset H} :=\{ \emptyset , b_H \}$.

Lemma 5

If $\lambda < \underline{\lambda }_S$, then the optimal menu that solves $[P^{\prime }]$ must be a screening menu.

Proof

Recall from Lemma 3 that the necessary and sufficient condition for any screening menu R to satisfy the PPE requirement with respect to $R^L$ and $R^H$ is given by (66). Then, by Lemma 3, the (IC) constraints in (38) are implied by (66) and can thus be ignored.

Let us now turn to the PPE requirement with respect to $R^{\emptyset H}$, for which the conditions, $({\text {FIC}}_L)$, $({\text {FIR}}_L)$, $({\text {FIC}}_H)$, $({\text {FIR}}_H)$, and (U), are given as

$$\begin{aligned}&\displaystyle \frac{t_{L}}{v_{L}} < \theta _{L}\frac{1+p+(1-p)\lambda }{1+(1-p)+p\lambda } = \frac{\theta _L}{B (p,\lambda )} \end{aligned}$$

(87)

$$\begin{aligned}&\displaystyle \frac{t_{H}}{v_{H}} < \frac{\theta _{L}}{B (p,\lambda )} \end{aligned}$$

(88)

$$\begin{aligned}&\displaystyle \frac{t_{H}-t_{L}}{v_{H}-v_{L}} > \frac{\theta _{H}}{B (p,\lambda )} \end{aligned}$$

(89)

$$\begin{aligned}&\displaystyle \frac{t_{H}}{v_{H}} > \frac{\theta _{H}}{B (p,\lambda )} \end{aligned}$$

(90)

$$\begin{aligned}&\displaystyle \left[ 1-(1-p)(\lambda -1)\right] t_{L} \le \left[ 1+(1-p)(\lambda -1)\right] \theta _{L}v_{L}. \end{aligned}$$

(91)

First, (89) contradicts $({\text {IC}}_H)$ in (38). Second, (87) implies (91) and can thus be ignored. To see this, we can focus on the case $[1-(1-p) (\lambda -1)]> 0$, so (91) can be rewritten as $\frac{t_L}{v_L} \le \theta _L \frac{1+(1-p)(\lambda -1)}{1- (1-p)(\lambda -1)}$, which is then implied by (87) since $ \frac{1+(1-p)(\lambda -1)}{1- (1-p)(\lambda -1)} > \frac{1}{B(p,\lambda )}$ (as shown in the proof of Lemma 3). Next, (90) cannot hold. To see it, sum up (90) and $({\text {IR}}_L)$ constraint, rewritten as $ - t_L \ge - \theta _L \frac{\lambda +1}{2} v_L $, side by side to obtain

$$\begin{aligned} t_H - t_L > \theta _H \frac{1}{B(p,\lambda )} v_H - \theta _L \frac{\lambda +1}{2} v_L. \end{aligned}$$

This inequality contradicts the second inequality of (66) since the RHS of the former is greater than that of the latter:

$$\begin{aligned}&\frac{\theta _H}{B(p,\lambda )} v_H - \theta _L \frac{\lambda +1}{2} v_L - \theta _H \beta (p,\lambda ) (v_H -v_L) \\&\quad = \ \theta _H \left( \frac{1}{B(p,\lambda )}-\beta (p,\lambda ) \right) v_H - \left( \theta _L \frac{\lambda +1}{2} -\theta _H \beta (p,\lambda )\right) v_L \\&\quad \ge \ \theta _H \left( \frac{1}{B(p,\lambda )}-\beta (p,\lambda ) \right) v_L - \left( \theta _L \frac{\lambda +1}{2} -\theta _H \beta (p,\lambda )\right) v_L \\&\quad = \ \left( \frac{\theta _H}{B(p,\lambda )} - \theta _L \frac{\lambda +1}{2}\right) >0, \end{aligned}$$

where the first inequality holds since $\frac{1}{B(p,\lambda )} \ge \beta (p,\lambda ) $ (by Lemma 3) and $v_H > v_L$. To see why the second (strict) inequality holds, consider first the case $p \le \frac{\lambda +2}{\lambda +3} $ so $\alpha (p,\lambda ) = \frac{\lambda +1}{2}$. Then, we have $\frac{\theta _H }{B(p,\lambda )} - \theta _L \frac{\lambda +1}{2} \ge \theta _H \beta (p,\lambda ) - \theta _L \alpha (p,\lambda ) >0$, where the second inequality holds since $\lambda < \overline{\lambda }_S$. Consider next the case $p \ge \frac{\lambda +2}{\lambda +3} > \frac{1}{2}$. Note that $\frac{1}{B(p,\lambda )} > 1$ if $p > \frac{1}{2}$. Thus, $ \frac{\theta _H}{B(p,\lambda )} - \theta _L \frac{\lambda +1}{2}> \theta _H -\theta _L \frac{\lambda +1}{2} > 0$ since $\lambda< \overline{\lambda }_S < \overline{\lambda } = 2 \frac{\theta _H}{\theta _L} -1$ implies $\theta _H >\theta _L \frac{\lambda +1}{2}$.

In sum, either (88) or (91) must be satisfied. We show that any PPE menu satisfying (88) is suboptimal for $[P^{\prime }]$. For any such menu $\tilde{R}$ with transfers $\tilde{t}_L$ and $\tilde{t}_H$, we must have

$$\begin{aligned}&\displaystyle \tilde{t}_H < \frac{ \theta _L}{B(p,\lambda )} v_H \end{aligned}$$

(92)

$$\begin{aligned}&\displaystyle \tilde{t}_L \le \tilde{t}_H - \theta _L \alpha (p,\lambda ) (v_H -v_L) < \frac{\theta _L}{B(p,\lambda )} v_H - \theta _L \alpha (p,\lambda ) (v_H -v_L), \end{aligned}$$

(93)

where the first inequality of (93) is due to (66). But, a higher profit is achieved by another menu R with the same quantities as in $\tilde{R}$ but different transfers:

$$\begin{aligned}&\displaystyle t_L = \theta _L \alpha (p,\lambda ) v_L \end{aligned}$$

(94)

$$\begin{aligned}&\displaystyle t_H = t_L + \theta _H \beta (p,\lambda ) (v_H-v_L) =\theta _L \alpha (p,\lambda ) v_L + \theta _H \beta (p,\lambda ) (v_H-v_L)\qquad \end{aligned}$$

(95)

Note that R satisfies (66). Also, R satisfies $({\text {IR}}_L)$ and (91) since they can be written together as $t_L \le \theta _L \alpha (p,\lambda ) v_L. $ Thus, R is a PE screening menu that satisfies the PPE requirement with respect to $R^L$, $R^H$, and $R^{\emptyset H }$. Now, using (92) through (95), we obtain

$$\begin{aligned}&\displaystyle t_H - \tilde{t}_H> ( \theta _H \beta (p,\lambda ) -\theta _L \alpha (p,\lambda )) v_L + \left( \theta _H \beta (p,\lambda ) - \frac{\theta _L}{B (p,\lambda )} \right) v_H >0 \qquad \quad \end{aligned}$$

(96)

$$\begin{aligned}&\displaystyle t_L -\tilde{t}_L> \theta _L\left( \alpha (p,\lambda ) - \frac{1}{B(p,\lambda )} \right) v_H > 0, \end{aligned}$$

(97)

where the last inequalities in (96) and (97) hold since $\theta _H \beta (p,\lambda ) > \theta _L \alpha (p,\lambda ) \ge \frac{\theta _L}{B (p,\lambda )} $. Thus, we can focus on (91) to find the optimal menu for $[P^{\prime }]$.

Summing up, $[P^{\prime }]$ can be solved with (66), (91), and $({\text {IR}}_L)$. Note that the latter two constraints can be written together as

$$\begin{aligned} t_L \le \theta _L \alpha (p,\lambda ) v_L. \end{aligned}$$

(98)

Then, the standard argument can be used to establish that (98) and the second inequality of (66) must hold as equality at the optimal menu that solves $[P^{\prime }]$,^{Footnote 28} implying that the transfers satisfy (94) and (95). Plugging them into the profit and taking the FOCs with respect to $q_L$ and $q_H$ yield (25) and (26). For a screening menu to be a solution of $[P^{\prime }]$, we need the two qualities that solve (25) and (26) to satisfy $q_L < q_H $. This requires the LHS of (25) to be smaller than that of (26), which yields $\theta _H \beta (p,\lambda ) > \theta _L \alpha (p,\lambda )$ after rearrangement. This condition is equivalent to requiring $\lambda < \overline{\lambda }_S$. $\square $

We next show that the optimal menu in Lemma 5 satisfies all other PPE requirements.

Lemma 6

If $\lambda < \overline{\lambda }_S$, then the optimal menu R that solves $[P^{\prime }]$ satisfies the PPE requirement with respect to all menus in C(R) other than $R^L$, $R^H$, and $R^{\emptyset H}$.

Proof

(i) $R^{\prime } = R^\emptyset = \{\emptyset , \emptyset \}$: We consider two cases, $p \ge \frac{1}{\lambda +3}$ or $p < \frac{1}{\lambda +3}$. In the former case, R and $R^\emptyset $ with transfers as in (94) and (95) satisfy $({\text {FIC}}_H)$, $u (\emptyset | \theta _H, R^\emptyset ) < u (b_L | \theta _H, R^\emptyset )$, written as $t_L < \theta _H \frac{2}{\lambda +1} v_L = \theta _H \beta (p,\lambda ) v_L $ for $p \ge \frac{1}{\lambda +3}$. To see this, given (94), this inequality becomes $\theta _L \alpha (p,\lambda ) v_L < \theta _H \beta (p,\lambda ) v_L,$ which holds since $\lambda < \overline{\lambda }_S$.

When $p < \frac{1}{\lambda +3}$, we show that R and $R^\emptyset $ satisfy (U), $U (R) \ge U(\emptyset )=0$. Write first the ex ante utility from R as

$$\begin{aligned} U(R)=p(\theta _{L}v_{L}-t_{L})+(1-p)(\theta _{H}v_{H}-t_{H})-p(1-p)(\lambda -1)(\theta _{H}v_{H}-\theta _{L}v_{L}+ t_{H} -t_L). \end{aligned}$$

(99)

Note that with $p < \frac{1}{\lambda +3}$, (94) and (95) become

$$\begin{aligned} t_L = \theta _L \frac{\lambda +1}{2} v_L \ \text { and } \ t_H = \theta _L \frac{\lambda +1}{2} v_L + \theta _H \frac{1 - p (\lambda - 1)}{1 + p (\lambda - 1)} (v_H - v_L). \end{aligned}$$

Plug these into (99) to express (U) as

$$\begin{aligned} U (R) = \frac{1}{2} v_L \Big (2 (1-p) \theta _H \big [1-p (\lambda -1)\big ]{-}\theta _L \big [ 1+2 p^2 (-1+\lambda ){+}\lambda -2 p \lambda \big ] \Big ) \ge 0. \end{aligned}$$

With $p< \frac{1}{\lambda +3} < \frac{1}{\lambda -1}$, the expression in the first square bracket is positive and also the fact that $\theta _L \alpha (p,\lambda )= \theta _L \frac{\lambda +1}{2} < \frac{1-p(\lambda -1)}{1+p(\lambda -1)} \theta _H = \theta _H \beta (p,\lambda ) $ means $\theta _H > \frac{\theta _L \lambda (1+\lambda )}{2 (1+p-p \lambda )}$. Thus,

$$\begin{aligned} U(R)&> \frac{1}{2} v_L \Big (2 (1-p) \left( \frac{\theta _L \lambda (1+\lambda )}{2 (1+p-p \lambda )}\right) \big [1-p (\lambda -1)\big ]\\&\quad -\theta _L \big [ 1+2 p^2 (-1+\lambda )+\lambda -2 p \lambda \big ] \Big )\\&= \frac{1}{2} p \theta _L v_L (\lambda -1) (2+\lambda - p (3+\lambda )) >0, \end{aligned}$$

where the last inequality follows from $p< \frac{1}{\lambda +3} < \frac{\lambda +2}{\lambda +3}$.

(ii) $R^{\prime }= R^{\emptyset L} = \{\emptyset , b_L \}$: Consider the condition $({\text {FIC}}_H)$, $u (b_H | \theta _H, R^{\emptyset L}) > u (b_L | \theta _H, R^{\emptyset L })$, which can be written as

$$\begin{aligned} \frac{t_H-t_L}{v_H -v_L} < \theta _H \frac{2}{\lambda +1}. \end{aligned}$$

(100)

If $p > \frac{1}{\lambda +3} $, then this condition is satisfied by $t_L$ and $t_H$ as in (94) and (95) since $\frac{t_H-t_L}{v_H -v_L} = \theta _H \beta (p,\lambda ) = \theta _H \frac{2}{\lambda +1}$ for $p > \frac{1}{\lambda +3}$.^{Footnote 29} Suppose from now that $p \le \frac{1}{\lambda +3}$. We show that (U), that is, $U (R) \ge U (R^{\emptyset L})$, is satisfied. To do so, we obtain after simplification of terms

$$\begin{aligned} U(R) - U (R^{\emptyset L})&= \frac{v_L}{2} \Big (\theta _H \big [1 - 2 p (2 + p (\lambda -1) - \lambda )\big ] (\lambda +1) \\&\qquad \qquad + \theta _L \big [- \lambda -1 + p (2 + p (-1 + \lambda )^2 - (-3 + \lambda ) \lambda )\big ]\Big ). \end{aligned}$$

First, we define the expression in the first square bracket as $h(p) := 1 - 2 p (2 + p (\lambda -1) - \lambda ) $ and show that $h (p) > 0$ for $p \in [0,\frac{1}{\lambda +3}]$. Since h is a concave, quadratic function of p, it suffices to show that both h(0) and $ h (\frac{1}{\lambda +3}) $ are positive: $h (0) = 1 >0$ and $h(\frac{1}{\lambda +3}) = \frac{3\lambda (\lambda +2) -1}{(3+\lambda )^2}>0$. Given that h is positive for $p \in [0,\frac{1}{\lambda +3}]$, we have

$$\begin{aligned} U (R) - U (R^{\emptyset L})&> \frac{v_L}{2} \Big (\theta _L \big [1 - 2 p (2 + p (\lambda -1) - \lambda )\big ] (\lambda +1) \\&\qquad \qquad + \theta _L \big [- \lambda -1 + p (2 + p (-1 + \lambda )^2 - (-3 + \lambda ) \lambda )\big ]\Big ) \\&= p \theta _L (\lambda -1 ) ( \lambda +2 - p(\lambda +3)) >0 \end{aligned}$$

if $ p \le \frac{1}{\lambda +3}$, as desired.

(iii) $R^{\prime }= R^{HL} = \{b_H, b_L\}$: This menu corresponds to a reverse-screening menu where the high (low) type chooses a bundle with low (high) quality. By Lemma 2, this menu can only be a PE if $\theta _H \le \theta _L \frac{\lambda +1}{2}$, which contradicts with the assumption that $\lambda< \overline{\lambda }_S < \overline{\lambda }= 2 \frac{\theta _H}{\theta _L} -1$.

(iv) $R^{\prime } = R^{H\emptyset } =\{b_H, \emptyset \}$: Write $({\text {FIC}}_L)$ and $({\text {FIC}}_H)$—that is, $u(b_{H}|\theta _{L},R^{H\emptyset }) < u(\emptyset |\theta _{L},R^{H\emptyset }) $ and $u(\emptyset |\theta _{H},R^{H\emptyset }) > u(b_{H}|\theta _{H},R^{H\emptyset })$—as

$$\begin{aligned} \frac{t_{H}}{v_{H}} > \theta _{L}\frac{1+(1-p)+p\lambda }{1+p+(1-p)\lambda } \; \text{ and } \;\frac{t_{H}}{v_{H}} < \frac{2\theta _{H}+p(\lambda -1)\theta _{L}}{1+p+(1-p)\lambda }, \end{aligned}$$

respectively. One of these two inequalities must hold since

$$\begin{aligned} \frac{2\theta _{H}+p(\lambda -1)\theta _{L}}{1+p+(1-p)\lambda } - \theta _{L}\frac{1+(1-p)+p\lambda }{1+p+(1-p)\lambda } = \frac{2(\theta _{H}-\theta _L)}{1+p+(1-p)\lambda } >0. \end{aligned}$$

(v) $R^{\prime } =R^{L\emptyset } =\{b_L, \emptyset \}$: The analysis of this case is analogous to the case (iv) above and thus omitted. $\square $

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Hahn, JH., Kim, J., Kim, SH. et al. Price discrimination with loss averse consumers. Econ Theory 65, 681–728 (2018). https://doi.org/10.1007/s00199-017-1035-2

Download citation

Received: 22 February 2016
Accepted: 16 January 2017
Published: 03 February 2017
Issue Date: May 2018
DOI: https://doi.org/10.1007/s00199-017-1035-2

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Price discrimination with loss averse consumers

Abstract

Similar content being viewed by others

Accurately measuring willingness to pay for consumer goods: a meta-analysis of the hypothetical bias

The effects of scarcity on consumer decision journeys

The pricing strategies of online grocery retailers

1 Introduction

2 The setup

2.1 Price discrimination with loss averse consumers

2.2 Personal equilibrium

Definition 1

Proposition 1

Definition 2

3 Optimal TPE menu

3.1 Binary consumer types

3.1.1 The seller’s problem

3.1.2 Symmetric information benchmark

Lemma 1

3.1.3 Results

Proposition 2

Proposition 3

Lemma 2

Proposition 4

Theorem 1

Proof

Example 1

3.2 A continuum of consumer types

Theorem 2

Remark 1

4 Optimal TPPE menu

4.1 The seller’s problem

4.2 Results

Lemma 3

Proposition 5

Theorem 3

Remark 2

4.3 A role for redundant bundle

Assumption 1

Proposition 6

5 Alternative reference points

5.1 Bundles as stochastic reference point

Proposition 7

Proof

5.2 Average bundle

5.3 Additive separability

6 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material 1 (pdf 417 KB)

Appendices

Appendix 1: Omitted proofs from Sect. 3

Proof of Lemma 1

Claim

Proof

Claim

Proof

Proof of Proposition 3

Proof of Lemma 2

Proof of Proposition 4

Appendix 2: Omitted proofs from Sect. 4

Proof

1.1 Proof of Proposition 5

Claim

Proof

Claim

Proof

Lemma 4

Proof

1.2 Proof of Theorem 3

Lemma 5

Proof

Lemma 6

Proof

Rights and permissions