1 Introduction

There are many situations in which decision makers lack relevant information, while those that possess the information have a biased interest in the decision that must be taken. DellaVigna and Gentzkow (2010) provide numerous examples where informed parties, like sellers or lobbyists, try to influence uninformed parties, e.g., consumers or politicians. In such situations, what is the optimal strategy of the informed party, the Sender, to influence the decision-maker, the Receiver? We study how a Sender makes optimal use of two common instruments—cheap talk and money burning—to influence a Receiver.

Money burning is a common euphemism for self-imposed losses in utility, see, e.g., Austen-Smith and Banks (2000). An often cited example of money burning is brand advertising as the content of the advertisement is less important than the fact that money was spent on it.Footnote 1 However, as losses are self-imposed, money burning is also readily accessible in many other social situations, e.g., when a lobbyist goes out of his way to reach a member of parliament or when a political prisoner goes on hunger strike to show his commitment to a cause.Footnote 2

We conduct our analysis in the cheap talk model of Crawford and Sobel (1982), CS from now on, augmented with money burning. The resulting model was first studied by Austen-Smith and Banks (2000), ASB from now on. ASB show that the option to burn money expands the set of equilibria enormously and makes, e.g., full revelation a possible equilibrium outcome. We proceed further and establish which equilibrium is best, in expected terms, from the Sender’s perspective.Footnote 3 In particular, we address the following issues. Under what conditions is it optimal for the Sender to burn money to fully reveal all her private information about the state? Under what conditions is it optimal to fully reveal a particular interval of states? Can it be optimal for the Sender to rely exclusively on cheap talk, and not to use any money burning at all? How does the use of money burning relate to the degree of conflict between the Sender and the Receiver?

Our main results are as follows. Suppose we have an equilibrium in which the Sender separates states in an interval. We derive a condition such that, if satisfied, we can construct another equilibrium in which the Sender pools some of the states in that interval and that gives him a higher expected utility. Two results follow immediately. First, although money burning makes perfect separation feasible, as is shown in ASB and Kartik 2007, we show it is never optimal for the Sender. Because of the misalignment between the Sender and the Receiver, separation requires increasing amounts of money burning. Irrespective of the Sender’s utility function and the state distribution, the Sender benefits from garbling information for at least some states. The second result is that in a large class of models, including models with preferences represented by quadratic loss functions in combination with any density of the states that is log-concave, the Sender does not use money burning to separate an interval of states.Footnote 4 That is, in this class of models, adding money burning as an instrument of communication does not lead the Sender to reveal any state precisely. Communication is garbled for all states. In other words, to the extent that a lobbyist can choose her lobbying strategy, the member of parliament cannot deduce the exact state from his interaction with the lobbyist.

It then logically follows that, within that class, if money burning is used at all in the optimal equilibrium, it is used (i) to re-arrange pooling intervals relative to what they would have been in the absence of money burning, and possibly (ii) to include one or more additional pooling intervals. To better understand the logic of these two uses of money burning, we next focus on a well-known instance of a log-concave density, the uniform density, and assume a quadratic loss function, as much theory and applied work does. Recall that Crawford and Sobel (1982) show that if cheap talk is the only instrument of communication, when the degree of conflict between Sender and Receiver decreases, the quality of communication can increase. In particular, if the degree of conflict drops below specific threshold levels, the maximum number of distinct actions that the Sender can induce the Receiver to take through verbal communication goes up by one.

We show that the Sender only burns money if her degree of conflict with the Receiver is close to one of these threshold values. If it is just larger than such a threshold value, burned money is particularly cost-effective in inducing the Receiver to take an additional action, as a slight reduction in the degree of conflict would already have led to an increase in the number of actions in the absence of money burning. That is, an additional interval is injected into the equilibrium partition. If the degree of conflict is just below a threshold value, it is not optimal to use money burning to increase the number of actions that the Receiver wants to take. Instead, the Sender burns money to induce the Receiver to better align his actions with her own interests. When the degree of conflict decreases, optimal equilibria with and without money burning alternate. We conjecture that this structure also holds for other log-concave densities. We conclude by showing how a discontinuity in the Receiver’s utility function makes full revelation through money burning the Sender’s optimal strategy.

In this paper we focus on Sender-optimal equilibria. We do so for a number of reasons. It is a focal extremal point of the equilibrium set. The other focal extremal point is the Receiver-optimal equilibrium. Clearly, from the Receiver’s perspective, a perfectly separating equilibrium is optimal as he does not incur the costs of money burning. Second, in the game that we study there is a channel for communication directed from the Sender to the Receiver. If the Sender also uses this channel to announce ex ante what signaling strategy it will use, it is natural to focus on the Sender-optimal equilibrium strategy. This requires only a moderate degree of commitment, as the Sender only commits to play an equilibrium strategy, rather than possibly a non-equilibrium strategy. Moreover, as we show towards the end of the paper, the equilibrium signaling strategy that maximizes the sum of expected utilities of Sender and Receiver is qualitatively similar to the one that is Sender-optimal. Finally, by focusing on the signaling equilibrium strategy that is optimal from the informed party’s perspective we can usefully contrast it with the screening equilibrium strategy that is optimal from the uninformed party’s perspective and that has been studied in Krishna and Morgan (2008). We discuss that paper in the next section.

The rest of the paper is organized as follows. Section 2 reviews the related literature, and Sect. 3 sets up the model which is analyzed in Sects. 4 and 5. Section 6 discusses when separation can be Sender-optimal. Section 7 concludes. Appendix A contains all proofs.

2 Related literature

ASB is the seminal paper on signaling with cheap talk and money burning, and we make use of one of its important finding: the set of states that give rise to a given action is convex. The main contribution of ASB is to prove the existence of a special class of equilibria in which money burning takes place. ASB establish that thanks to money burning, one can “squeeze in separating segments at the far end of any ... equilibrium partition” of a game studied in CS in which the Sender can use only cheap talk (ASB, p. 7). It then follows that full revelation is also an equilibrium outcome. Austen-Smith and Banks (2002) study some welfare results for the uniform-quadratic case. They establish that the Sender prefers the most-informative cheap talk equilibrium over a fully separating equilibrium using money burning. We extent their results by showing that, with general densities and preferences, the Sender never prefers using money burning to fully separate, and by showing which equilibrium she prefers in the uniform quadratic case.

For the model used in ASB, Kartik (2007) shows that cheap talk can be influential, i.e., induce at least two different actions, with money burning if and only if it can be influential without money burning. He also proves the conjecture stated in ASB that if the amount of money the Sender can burn shrinks to zero, the set of equilibria with burned money shrinks to the set of pure cheap talk equilibria.

Sender-optimal strategies have also been the focus in other contexts. Recently, Rayo and Segal (2010) and Kamenica and Gentzkow (2011) derived Sender-optimal communication strategies when she can commit to truthfully reveal her private information. This contrasts with our assumption that a Sender cannot commit to a strategy. Like us, Eső and Galambos (2013) study a Sender who has access to cheap talk and a second means of communication. Whereas we study money burning as the second means, they study the possibility making information hard at a cost that is independent of the true state. In the presence of hard information, full revelation can only be an equilibrium if disclosure costs equal zero, whereas full revelation always exists in case of money burning.Footnote 5

The Sender–Receiver problem has also been approached as a screening problem. The focus has been on the uninformed principal offering a compensation contract or a delegation contract.Footnote 6 Ottaviani (2000) and Krishna and Morgan (2008) study compensation contracts, in which the principal compensates the agent for information that she provides. Ottaviani (2000) shows, for the uniform-quadratic case, that the principal can guarantee full revelation, but that he is better off with delegation. Krishna and Morgan (2008) study characteristics of the optimal contract in general settings. Their so-called imperfect commitment case is the relevant case for a comparison with our paper. In that case, the principal cannot commit to a decision as a function of the message he receives, but can commit to a monetary transfer as a function of the message. They show that, in general, the principal is better off by offering a contract that induces the agent to pool for high states than one that leads to full revelation. Although pooling gives rise to a reduction in utility for high states, this is more than compensated by the reduction in transfers that are needed to induce truth telling for lower states. In our set-up, the Sender can perfectly reveal her information about the state through money burning but does not find it in her interest to do so because pooling some states gives rise to a local gain in utility that more than compensates any possible additional money that needs to be burned in all the other states. The balance of local losses and global gains is such that in the uniform-quadratic case, the principal in Krishna and Morgan (2008) never pays for vague information, only for precise information. In the same uniform-quadratic case, the Sender in our model does not burn money to separate an interval of states. Instead, it is only cost-effective to burn money to adjust the intervals of the semi-pooling communication strategy and, possibly, to adjust their number.

3 The model

The model is that of ASB. In stage 1, the state of nature \(t\in [ {0,1} ]\) is privately revealed to the Sender. This state is distributed according to a distribution function F(t) that has a strictly positive and continuously differentiable density function \(f( t )\equiv F'( t )\). In stage 2, the Sender sends the Receiver a signal \(\sigma =( {m,b} )\), which consists of a costless message m from some continuum M, and a costly message \(b\ge 0\) (money burning). The content of the message m is unverifiable, i.e., cheap talk, and b is publicly observed and, hence, verifiable.

Let \(\sigma ( t )=( {m,b} )\) be a pure signaling strategy. Having observed ( mb ) the Receiver forms posterior beliefs about the state, denoted by the CDF \(G( {z\vert \,m,b} )\equiv \Pr [ {t\le z\vert m,b} ]\). Then, in stage 3, the Receiver takes an action a. Let \(\alpha ( {m,b} )\) be his pure strategy. Preferences of the Sender and the Receiver are given by the following utility functions \(U^{\mathrm{S}}\) and \(U^{\mathrm{R}}\), respectively:

$$\begin{aligned} U^{\mathrm{S}}\left( {a,t,m,b} \right) =u\left( {a,t,x} \right) -b,\quad \hbox { and }\quad U^{\mathrm{R}}\left( {a,t,m,b} \right) =u^{\mathrm{R}}\left( {a,t} \right) , \end{aligned}$$

where \(x>0\) is a measure of the extent to which the players’ preferences are misaligned.

Following Austen-Smith and Banks (2000), we assume that u( atx ) and \(u^{\mathrm{R}}( {a,t} )\) are three times continuously differentiable, are strictly concave and not monotone in a, and have strictly positive cross partial derivatives \(u_{at} >0\), \(u_{ax} >0\), \(u_{at}^{\mathrm{R}} >0\). Let \(a^{\mathrm{R}}( t )\) denote the action that is best from the Receiver’s perspective in state t:

$$\begin{aligned} a^{\mathrm{R}}\left( t \right) =\mathop {\hbox {argmax}}\limits _{a} \,u^{\mathrm{R}}\left( {a,t} \right) , \end{aligned}$$

and let \(a^{\mathrm{S}}( {t,x} )\) be the action that is best from the Sender’s perspective in state t:

$$\begin{aligned} a^{\mathrm{S}}\left( {t,x} \right) =\mathop {\hbox {argmax}}\limits _{a} \,u\left( {a,t,x} \right) . \end{aligned}$$

Under our assumptions, both \(a^{\mathrm{R}}( t )\) and \(a^{\mathrm{S}}( {t,x} )\) are finite and strictly increasing, and we assume, in addition, that \(a^{\mathrm{S}}( {t,x} )>a^{\mathrm{R}}( t )\), for all \(x>0\) and \(t\in [ {0,1} ]\). In order to simplify the exposition, we assume w.l.o.g. that the actions are labeled in such a way that \(a^{\mathrm{R}}( t )=t\) (this can always be done due to the monotonicity of \(a^{\mathrm{R}}( t )\)). The ex-ante expected utilities from a strategy profile \(( {\sigma ,\alpha } )\) are denoted as follows:

$$\begin{aligned}&V^{\mathrm{S}}\left( {\sigma ,\alpha } \right) \equiv \text{ E }\left[ {U^{\mathrm{S}}\left( {a,t,m,b} \right) } \right] =\text{ E }\left[ {u\left( {\alpha \left( {\sigma \left( t \right) } \right) ,t,x} \right) } \right] -\text{ E }\left[ {b\left( t \right) } \right] , \end{aligned}$$
(1)
$$\begin{aligned}&V^{\mathrm{R}}\left( {\sigma ,\alpha } \right) \equiv \text{ E }\left[ {U^{R}\left( {a,t,m,b} \right) } \right] =\text{ E }\left[ {u^{\mathrm{R}}\left( {\alpha \left( {\sigma \left( t \right) } \right) ,t} \right) } \right] . \end{aligned}$$
(2)

An equilibrium \({\Omega }=\left\langle \sigma ( t ),\alpha ( {m,b} ),G( {t{\vert }m,b} )\right\rangle \) is (i) a signaling strategy of the Sender \(\sigma ( t )\) that is optimal for every state t given the Receiver’s action strategy \(\alpha ( {m,b} )\), (ii) an action strategy \(\alpha ( {m,b} )\) that is optimal given beliefs \(G( {t{\vert }\,m,b} )\), and (iii) Receiver’s beliefs \(G( {t{\vert }\,m,b} )\) that are consistent with \(\sigma ( t )\) for signals ( mb ) on the equilibrium path. Ex-ante utilities of the Sender and the Receiver in equilibrium \({\Omega }\) are denoted by \(V^{\mathrm{S}}( {{\Omega }} )\) and \(V^{\mathrm{R}}( {{\Omega }} )\) respectively. An equilibrium signaling strategy is denoted by \(\sigma ^{\Omega }\). We are interested in the equilibrium that generates the highest ex-ante expected utility for the Sender.

4 Characteristics of Sender-optimal equilibria

ASB (p. 7) establish that all equilibria have the following structure: there is a partition \(( {B_{0} \equiv 0,A_{1} ,B_{1} ,\ldots ,B_{N} ,A_{N+1} \equiv 1} )\) with \(B_{i-1} \le A_{i} <B_{i} \le A_{i+1} \) for all \(i\in I=\{ {1,\ldots ,N} \}\), such that the Sender pools all states \(t\in ( {A_{i} ,B_{i} } )\) by sending a message \(m( t )=m_{i}^{\mathrm{P}} \) (with the meaning “the state t belongs to interval \(( {A_{i} ,B_{i} } )\)”) and by burning identical amounts of money \(b_{i}^{\mathrm{P}} \). If instead \(t\in ( {B_{i} ,A_{i+1} } )\), she separates by sending distinct messages \(m( t )=m^{\mathrm{S}}( t )\) (with the meaning “the state is t”) and by burning distinct amounts of money \(b( t )=b^{\mathrm{S}}( t )\). As any partition is uniquely defined by its collection of pooling intervals, we will abuse terminology by referring to the collection \(P( {{\Omega }} )=\{ {( {A_{i} ,B_{i} } )\vert i\in I} \}\) as the partition of equilibrium \({\Omega }\). For example, a separating equilibrium corresponds to \(P=\emptyset \), and a complete pooling equilibrium corresponds to \(P=\{ {( {0,1} )} \}\).

ASB establish the existence of a special class of equilibria that can be obtained from the pure cheap talk equilibria studied in CS by squeezing in “separating segments at the far end of any CS partition” (ASB, p. 7).Footnote 7 We are interested in how the Sender can best combine cheap talk and money burning and, therefore, cannot limit ourselves to a special class of equilibria. The next proposition is the first step towards a Sender-optimal equilibrium.

Proposition 1

For any partition \(P_{0}\), there is an equilibrium \({\Omega }_{0} \) such that \(P_{0} =P( {{\Omega }_{0} } )\).

The Sender can arbitrarily choose any number of non-overlapping pooling intervals. Let \(S^{\mathrm{P}}\) denote the union of these intervals and let \(S^{\mathrm{S}}=[ {0,1} ]\backslash S^{\mathrm{P}}\) denote the set of states for which the Sender separates. Proposition 1 means, for example, that full separation is an equilibrium.

According to Proposition 1, we can search for the Sender-optimal equilibrium by optimizing \(V^{\mathrm{S}}( {\sigma ,\alpha } )\) over the set of all possible partitions, subject to incentive compatibility constraints and non-negativity of burned money constraints. Note that Proposition 1 does not imply that the Sender can induce the Receiver to take any set of actions. For example, if the Sender wants to induce the Receiver to take a single action, then she must pool, and thus this action can only equal the Receiver’s optimal action in case of complete pooling.

On the current level of generality, when both players have utility functions u( atx ) and \(u^{\mathrm{R}}( {a,t} )\) that are completely independent from each other, the analysis is very complicated and results are not easy to interpret. We assume in what follows that the Receiver’s utility is a quadratic loss function \(u^{\mathrm{R}}( {a,t} )=-( {t-a} )^{2}\) while keeping the Senders’ utility u( atx ) arbitrary. We define:

$$\begin{aligned} {\varPsi } \left( t \right) \equiv \frac{2u_{tta} \left( {t,t,x} \right) +u_{taa} \left( {t,t,x} \right) }{2u_{ta} \left( {t,t,x} \right) }. \end{aligned}$$

Since function \({\varPsi } ( t )\) vanishes for quadratic utility functions u( atx ), it can be interpreted as a measure of a difference between utility function u( atx ) and its second-order approximation, i.e., as a measure of higher-order terms in the Tailor expansion of u( atx ). Suppose we have an equilibrium in which the Sender separates states in an interval. If a simple condition is satisfied, then we can construct another equilibrium in which the Sender pools some of the states in that interval and that gives her a higher expected utility.

Proposition 2

Consider an equilibrium \({\Omega }\) in which the Sender separates all states in an interval ( BA ). If the following condition holds for some \(t\in ( {B,A} )\):

$$\begin{aligned} - \frac{{f\left( t \right) }}{{F\left( t \right) }} + \frac{{f^{\prime }\left( t \right) }}{{f\left( t \right) }}< {\varPsi } \left( t \right) < \frac{{f\left( t \right) }}{{1 - F\left( t \right) }} + \frac{{f^{\prime }\left( t \right) }}{{f\left( t \right) }}, \end{aligned}$$
(3)

then the Sender benefits from pooling some states in that interval, so that \({\Omega }\) is not optimal.

Fig. 1
figure 1

Illustration for Proposition 2

To gain intuition for the condition in Proposition 2, and to understand how local changes in ( BA ) have global consequences—as suggested by the appearance of F(t) and \(1-F( t )\) in the condition—we consider an equilibrium \({\Omega }\) in which the Sender separates all states in ( BA ). The utility \(v^{\Omega }( t )\) that the Sender gets in ( BA ) is indicated in Fig. 1 by the bold line.

We now try to construct another equilibrium, \(\Omega ^{\varepsilon }\), that is obtained from \(\Omega \) by adding a pooling interval \(( {A-\varepsilon ,A} )\), i.e., \(P( {{\Omega }^{\varepsilon }} )=P( {{\Omega }} )\cup \{ {( {A-\varepsilon ,A} )} \}\), and by imposing that \(v^{\Omega ^{\varepsilon }}( {A-\varepsilon } )=v^{\Omega }( {A-\varepsilon } )\). In other words, we try to leave the Sender’s utility unchanged for \(t\in ( {B,A-\varepsilon } ]\) and, instead of separating, we let the Sender pool all states \(t\in ( {B,A-\varepsilon } ]\) in \({\Omega }^{\varepsilon }\). In \({\Omega }^{\varepsilon }\), the Receiver’s best response action to a message that the state is in \(( {A-\varepsilon ,A} )\) is strictly larger than his best response action to the Senders’ message that \(t=A-\varepsilon \). Because of \(u_{ta} >0\), this means that in a right neighborhood of \(t=A-\varepsilon \), the Sender benefits from pooling, see the thin (solid and dashed) lines representing two possibilities for \(v^{\Omega ^{\varepsilon }}( t )\) in Fig. 1.

Because of the equilibrium conditions, local changes of equilibrium strategies in the interval \(( {A-\varepsilon ,A} )\) have global consequences: pooling these states may make additional money burning necessary for states to the right or to the left of the pooling interval, and Fig. 1 explains why. For \(t=A\), the difference in utility \(\varphi ( A )=v^{\Omega }( A )-v^{\Omega ^{\varepsilon }}( A )\) depends on the Sender’s utility function and the distribution of states. In the proof of Proposition 2, we show that for a pooling interval of marginal length \(\varepsilon \), the change in utility is given by

$$\begin{aligned} \varphi \left( A \right) = \frac{{u_{{ta}} \left( {A,A,x} \right) }}{2}\left( {{\varPsi } \left( A \right) - \frac{{f^{\prime }\left( A \right) }}{{f\left( A \right) }}} \right) . \end{aligned}$$

Since \(\varphi ( A )=0\) for a uniform-quadratic model, the term \( ( {{\varPsi } ( A ) - f^{\prime }( A )/f( A )} )\) can be interpreted as a measure of a difference between the current model and its uniform-quadratic approximation. Suppose that \({\varPsi } ( A ) > f^{\prime }( A )/f( A ) \) so that \(\varphi ( A )>0\). Then, without any changes in the signaling strategy to the right of A, if the state is just below A, the Sender would have liked to slightly exaggerate and report a state just above A. One way to make sure that \({\Omega }^{\varepsilon }\) is an equilibrium is to adjust the Sender’s signaling strategy so that in all states \(t>A\), she burns an additional amount of money that is equal to \(\varphi ( A )\). This restores the continuity of the utility function at \(t=A\), and guarantees that the equilibrium conditions continue to hold for all \(t>A\). Hence, an equilibrium \({\Omega }^{\varepsilon }\) with a pooling segment of marginal length yields more utility than \({\Omega }\) if the additional amount of money burned \(( {1-F( A )} )\varphi ( A )\) is smaller than the gain in utility which turns out to be \(\frac{1}{2}u_{ta} ( {A,A,x} )f( A )\). That is, if the condition \( 0< {\varPsi } - \frac{{f^{\prime }}}{f} < \frac{f}{{1 - F}} \) holds for \(t=A\), the Sender benefits from pooling some states of the separating segment.

Alternatively, if \(\varphi ( A )<0\) because of \( {\varPsi } ( A ) < \frac{{f^{\prime }( A )}}{{f( A )}}\), the Sender’s utility gets increased at \(t=A\) due to pooling the states; Fig. 1 represents this case by the dashed line. Without any further changes to the Sender’s money burning strategy, she would now prefer to underreport the state if she were to observe values of t slightly above \(t=A\). One way to make sure that \({\Omega }^{\varepsilon }\) is an equilibrium is to increase amounts of money burnt in all states \(t<A\) (it may not be possible to burn less money for \(t>A\) because, e.g., in some states above A the Sender does not burn any money at all in \({\Omega }\)). For pooling some of the states to be optimal, the additional cost of money burning \(( {-F( A )\varphi ( A )} )\) should be smaller than the gain \(\frac{1}{2}u_{ta} ( {A,A,x} )f( A )\), i.e., \( - \frac{f}{F}< {\varPsi } - \frac{{f^{\prime }}}{f} < 0\) evaluated for \(t=A\).

Proposition 2 has two important consequences. First, by applying the condition to the whole interval [ 0, 1 ] it follows that full revelation (i.e., revealing all states) is never optimal. The Sender benefits from some vagueness by pooling at least some interval of states. Pooling some states give rise to a local gain in utility that more than compensates any possible additional money that needs to be burned globally (i.e., outside the interval of pooled states). Thus, combining Propositions 1 and 2 we conclude that:

Corollary 1

Even though the perfectly separating equilibrium is always feasible, it is never Sender-optimal.

The second consequence deals with the class of state distributions with log-concave densities and the much-used quadratic loss function. With such preferences, \({\varPsi } ( t )=0\) for all t. Moreover, a log-concave density has an increasing hazard rate and a decreasing reversed hazard rate, see Bagnoli and Bergstrom (2005). This suffices to show that \( - \frac{{f( t )}}{{F( t )}} + \frac{{f^{\prime }( t )}}{{f( t )}} < 0 \) and that \( \frac{{f( t )}}{{1 - F( t )}} + \frac{{f^{\prime }( t )}}{{f( t )}} > 0 \) for all t. Thus,

Corollary 2

Let the utility function u( atx ) of the Sender be a quadratic loss function, i.e., \(u( {a,t,x} )=-( {a-t-x} )^{2}\), and let the states have a log-concave density f(t). Then, in an optimal equilibrium the Sender does not separate any interval of states.

Corollary 2 implies that, in a large class of models (including the uniform-quadratic case), communication is garbled in a Sender-optimal equilibrium, for all states. What role can money burning play in an optimal equilibrium, if it cannot lead to separation? Logically, two possibilities remain. It is either not used at all, or it is used but only to re-arrange the pooling intervals in the equilibrium partition, relative to what they would have been in the absence of money burning. In the latter case, money burning may also give rise to a different number of pooling intervals. To better see whether and how the Sender uses money burning, in the next section we derive the optimal equilibrium for the well-known uniform-quadratic case.

Fig. 2
figure 2

Illustration for Proposition 3

5 Sender-optimal equilibrium in the uniform-quadratic case

We use asterisks to denote variables in the Sender’s optimal equilibrium \({\Omega }^{*}\). The next proposition characterizes the Sender’s maximization problem.

Proposition 3

In the uniform-quadratic case, an optimal equilibrium \({\Omega }^{*}\) always exists. Its equilibrium partition \(P^{*}=P( {{\Omega }^{*}} )\) contains a finite number \(N^{*}\) of pooling intervals \(( {A_{i} ,A_{i+1} } )\) that fully cover the state space [ 0, 1 ]. Number \(N^{*}\) and marginal states \(\{ {A_{i} } \}\), \(i=1,\ldots ,N^{*}\) solve the following maximization problem:

$$\begin{aligned}&\mathop {\max }\limits _{N,\left\{ {A_{i} } \right\} ,c} V^{\mathrm{S}}=-c-x^{2}-x+\frac{1}{6}\mathop {\sum }\limits _{i=1}^{N} \left( {A_{i+1} -A_{i} } \right) ^{3} \end{aligned}$$
(4)
$$\begin{aligned}&\text {subject to}:b_{i}^{\mathrm{P}} =\left( {A_{i+1} +A_{i} } \right) x-\frac{1}{4}\left( {A_{i+1} -A_{i} } \right) ^{2}+c\ge 0 \end{aligned}$$
(5)

In this optimization, the Sender chooses a number of intervals N, positions of \(( {N-1} )\) distinct marginal states, \(\{ {A_{i} } \}\), and a parameter c. Figure 2 shows that there is a trade-off in the above optimization. The bold solid curve represents a utility function \(v^{\Omega _{1} }( t )\) of the Sender in state t, for an equilibrium \({\Omega }_{1} \) with (at least) three pooling intervals in the partition, \(( {t_{i} ,t_{i+1} } )\), \(( {t_{i+1} ,t_{i+2} } )\), and \(( {t_{i+2} ,t_{i+3} } )\). In each pooling interval, the (smallest) vertical distance between the graph of \(v^{\Omega _{1} }( t )\) and the horizontal axis is equal to \(b_{i}^{\mathrm{P}} \). The ex-ante expected utility of the Sender is \(V^{\mathrm{S}}\) the (negative of) the shaded area above \(v^{\Omega _{1} }( t )\).

By merging two intervals \(( {t_{i} ,t_{i+1} } )\) and \(( {t_{i+1} ,t_{i+2} } )\) into one single pooling interval \(( {t_{i} ,t_{i+2} } )\), we get a new equilibrium \({\Omega }_{2}\), with the utility function \(v^{\Omega _{2} }( t )\) (bold dashed curve). As one can see, this has a positive effect on \(V^{\mathrm{S}}\). If however, we merge all three intervals into a single pooling interval \(( {t_{i} ,t_{i+3} } )\) without changing parameter c, the amount of money to be burned in states \(t\in ( {t_{i} ,t_{i+3} } )\) would have become negative (thin dotted curve, which goes above the horizontal axis). Thus, the value of c must be increased so as to just make the graph of \(v^{\Omega _{3} }( t )\) touching the horizontal axis, (thin solid line).

The trade-off the Sender faces can now be seen. On the one hand, she benefits from having fewer but longer pooling intervals in an equilibrium, since this increases the sum in (4). On the other hand, such a strategy would potentially require a larger value of parameter c, i.e., more burned money in all states, due to (5), which lowers her utility \(V^{\mathrm{S}}\) directly.

Proposition 4 below gives the main features of the equilibrium. It shows for which degree of misalignment x the Sender actually burns money in the optimal equilibrium. It also shows how the optimal equilibrium partition \(P( {{\Omega }^{*}} )\) is related to that of CS. Before we state the proposition, a small digression on the Crawford and Sobel (1982) model is needed. Recall that in CS, money burning is absent and communication can only take place through cheap talk. In equilibrium there is an upper bound on the number of distinct messages that the Sender can use. For every integer \(N\ge 2\), let us define \(x_{N} \) as follows:

$$\begin{aligned} x_{N} \equiv \frac{1}{2N\left( {N-1} \right) }. \end{aligned}$$
(6)

Without the option to burn money, if the degree of misalignment satisfies \(x\in [ {x_{N+1} ,x_{N}} )\), then for every value of \(n=1,\ldots ,N\) there exists an equilibrium in which the Sender can induce the Receiver to take exactly n actions. For every x, the equilibrium with the highest number of messages N, is called the most informative equilibrium, denoted by \({\Omega }^{\mathrm{CT}}\). For any given value of x, the Sender prefers the equilibrium \({\Omega }^{\mathrm{CT}}\) with the largest number of messages N. Let \(\sigma ^{\mathrm{CT}}\) denote the most informative cheap talk strategy.

It turns out that the structure of \(\sigma ^{*}\) is closely related to the structure of \(\sigma ^{\mathrm{CT}}\). The intuition is that an additional communication instrument—money burning—can be used to alleviate the bound on the number N of distinct actions that the Sender can induce the Receiver to take. This is particularly cost-effective for those values of x for which a slight reduction in x would already have led to an increase in the number of such actions in the absence of money burning. Alternatively, money burning can be used to change the lengths of the intervals of the pure cheap talk partition \(P^{\mathrm{CT}}\). This is particularly beneficial for values of x for which an additional interval has just emerged in \(P^{\mathrm{CT}}\). This intuition is borne out by the next proposition.

Proposition 4

For every \(N\ge 4\) there exist a \(\underline{y} _{N}\) and \(\bar{y}_{N}\) satisfying \(x_{{N + 1}}< \bar{y}_{{N + 1}}< \underline{y}_{N}< x_{N} < \bar{y}_{N}\), such that in an optimal equilibrium \(\Omega ^{*}\):

  1. (a)

    if \(x \in [ {x_{N} ,\bar{y}_{N} } )\), the Sender burns money. Compared with \({\Omega }^{\mathrm{CT}}\), she induces one additional action, \(N^{*}=N+1\), \(P^{*}\ne P^{\mathrm{CT}}\).

  2. (b)

    if \(x\in ( \underline{y}_{N} ,x_{N} )\), the Sender burns money. Compared with \({\Omega }^{\mathrm{CT}}\), she induces the same number of actions, \(N^{*}=N\) but re-arranges the partition, \(P^{*}\ne P^{CT}\).

Proposition 4 illustrates how the Sender uses money burning to adjust the partition in an optimal equilibrium even if she does not use it to reveal any specific state. This result will qualitatively carry over to situations with other log-concave densities.

In what follows we study in more depth how, for the uniform-quadratic case, money burning shapes the equilibrium partition.

It follows from the proof of Proposition 4 that in the uniform-quadratic case money burning is used in the second interval only. Figure 3 illustrates how, for different values of x, money burning is present or absent in the optimal equilibrium \({\Omega }^{*}\) in an alternating pattern. It shows the amount of money burned \(b_{2}^{\mathrm{P}} \) in the second interval of \({\Omega }^{*}\) (thin line, right scale), and the ex-ante probability that the Sender burns money, \(\Pr [ {b( t )>0} ]\) (thick line, left scale), which is the length of the second interval provided \(b_{2}^{\mathrm{P}} >0\). One can see, e.g., that at \( x = \bar{y}_{4} = \frac{1}{{22}}\), the length of this interval is zero, and increases with further reductions in x. At \(x=\underline{y}_{4} \approx 0.039\), the Sender burns money for nearly 20% of the states. The amount of money burned \(b_{2}^{\mathrm{P}} \) is strictly positive at \(x = \bar{y}_{4}\), and decreases with further reductions in x.

Fig. 3
figure 3

The amount of money burned in the second interval \(b_{2}^{\mathrm{P}} \), and the ex-ante probability that money will be burned \(\Pr [ {b^{*}( t )>0} ]\), as functions of x

If \(x > \bar{y}_{4} = \frac{1}{{22}}\), money burning cannot improve upon the most informative cheap talk equilibrium. That is, for money burning to improve the Sender’s utility, the interests between Sender and Receiver should be sufficiently well aligned such that unaided cheap talk communication allows her to induce at least three actions.

We illustrate the two parts of Proposition 4 with one figure each. Figure 4 is drawn for \(x=0.042\). Since \(x \in ( {x_{4} ,\bar{y}_{4} } )\), the most informative cheap talk equilibrium \({\Omega }^{\mathrm{CT}}\) has three intervals (with up to three digits of accuracy):

$$\begin{aligned} P^{\mathrm{CT}}=\left\{ {\left( {0,0.165} \right) ,\left( {0.165,0.499} \right) ,\left( {0.499,1} \right) } \right\} , \end{aligned}$$

whereas the optimal equilibrium \({\Omega }^{*}\) has four intervals:

$$\begin{aligned} P^{*}=\left\{ {\left( {0,0.066} \right) ,\left( {0.066,0.176} \right) ,\left( {0.176,0.504} \right) ,\left( {0.504,1} \right) } \right\} . \end{aligned}$$

Figure 4 shows the ex-post equilibrium utility functions \(v^{\Omega }( t )\) of the Sender in \({\Omega }^{\mathrm{CT}}\) and \({\Omega }^{*}\), and the money burning function \(b^{*}( t )\). She burns \(b_{2}^{P} \approx 5.5\times 10^{-3}\) in the second interval of \(P^{*}\).

Figure 5 is drawn for \(x=0.04\). Since \(x\in ( \underline{y}_{4} ,x_{4})\), it illustrates a situation in which the Sender does not burn money to induce the Receiver to take a larger number of actions, but to improve the existing actions. Indeed, the most informative cheap talk strategy \(\sigma ^{\mathrm{CT}}\) has four distinct messages,

$$\begin{aligned} P^{\mathrm{CT}}=\left\{ {\left( {0,0.01} \right) ,\left( {0.01,0.18} \right) ,\left( {0.118,0.51} \right) ,\left( {0.51,1} \right) } \right\} , \end{aligned}$$

and so does the optimal equilibrium strategy in the presence of money burning,

$$\begin{aligned} P^{*}=\left\{ {\left( {0,0.035} \right) ,\left( {0.035,0.185} \right) ,\left( {0.185,0.512} \right) ,\left( {0.512,1} \right) } \right\} . \end{aligned}$$

The figure shows ex-post equilibrium utility functions \(v^{\Omega }( t )\) of the Sender in \({\Omega }^{\mathrm{CT}}\) and \({\Omega }^{*}\), and the money burning function \(b^{*}( t )\). She burns \(b_{2}^{P} \approx 2.0\times 10^{-3}\) in the second interval of \(P^{*}\).

Fig. 4
figure 4

Equilibria \({\Omega }^{\mathrm{CT}}\) and \({\Omega }^{*}\) for \(x=0.042\)

Fig. 5
figure 5

Equilibria \({\Omega }^{\mathrm{CT}}\) and \({\Omega }^{*}\) for \(x=0.04\)

Above, we have derived the Sender’s optimal equilibrium. The Receiver’s optimal equilibrium is, clearly, the perfectly separating equilibrium. Indeed, the ex-ante expected utility \(V^{\mathrm{R}}( {{\Omega }} )\) of the Receiver equals:

$$\begin{aligned} V^{\mathrm{R}}\left( {{\Omega }} \right) =\text{ E }\left[ {u\left( {\alpha \left( {\sigma \left( t \right) } \right) ,t,0} \right) } \right] =-\frac{1}{12}\sum \limits _{i} \left( {B_{i} -A_{i}} \right) ^{3}. \end{aligned}$$

The Receiver experiences a utility loss from any pooling intervals and, therefore, obtains his first best level of utility in the perfectly separating equilibrium. A natural question that arises is what equilibrium is best from a social welfare perspective. Define the social welfare function as follows:

$$\begin{aligned} V^{\mathrm{SW}}\left( {{\Omega }} \right) \equiv \left( {1-\alpha } \right) V^{\mathrm{R}}\left( {{\Omega }} \right) +\alpha V^{\mathrm{S}}\left( {{\Omega }} \right) ,\quad \alpha \in \left( {0,1} \right) . \end{aligned}$$

Thus,

$$\begin{aligned} V^{\mathrm{SW}}\left( {{\Omega }} \right) =\alpha \left( {-\left( {c+x^{2}+x} \right) +\frac{3\alpha -1}{2\alpha }\cdot \frac{1}{6}\sum \limits _{i} \left( {B_{i} -A_{i}} \right) ^{3}} \right) , \end{aligned}$$

which only differs from \(V^{\mathrm{S}}( {{\Omega }} )\) by the multiplier \(\frac{3\alpha -1}{2\alpha }\). As a result of this similarity, Proposition 4 continues to hold for the efficient equilibrium \({\Omega }^{\mathrm{SW}}\), i.e., the equilibrium that maximizes the social welfare function, provided \(\alpha >1/3\). Thus, when the Sender’s contribution to the Social Welfare function is at least one third, the number of pooling intervals in \({\Omega }^{\mathrm{SW}}\) is finite, the Sender burns money in at most one interval; if she burns money, she does so in the second interval only; and the values of x for which she does so are close to the values of \(x_{N} \). In short, all results from this Section continue to hold qualitatively for this new objective function, although the exact values of the marginal states in the partition supporting the efficient equilibrium, and the exact amounts of money burnt, will be different from \({\Omega }^{*}\).

6 Sender-optimality and separation

In the previous two sections, we have derived conditions under which the Sender prefers communication that is garbled. In this section, we discuss situations in which she might prefer truthful revelation of her private information.

It logically follows from Corollary 1 that for the Sender to reveal truthfully all her private information in an optimal equilibrium, one needs to step outside the model assumed in Sect. 3. The explanation of condition (1) in Sect. 4 suggests one such step. We have explained there that the Sender benefits locally from garbling information, possibly at the expense of more money burning in other states. By garbling information, she induces the Receiver to take a higher action from which she benefits in the right neighborhood of \(t=\tilde{t}\equiv A-\varepsilon \), see Fig. 1. This benefit is due to the single-crossing property \(u_{ta} >0\). Formally, if we define the difference in the Sender’s utility between these two equilibria as \( \theta ( {\varepsilon ,t} ) \equiv v^{{{\Omega }}} ( t ) - v^{{{{\Omega }}^{\varepsilon } }} ( t )\) and \(a^{\varepsilon }=\text{ E }[t\vert t\in (\tilde{{t}},A)]\) as the Receiver’s best reply to the pooling message, one can obtain:Footnote 8

$$\begin{aligned} \frac{\partial \theta }{\partial t}(\varepsilon ,t)=-\int _{\tilde{{t}}}^{a^{\varepsilon }} {u_{at} (z,\tilde{{t}},x)dz.} \end{aligned}$$

Clearly, \(a^{\varepsilon }>\tilde{{t}}\) and \(u_{at}>0\) together imply \(\frac{\partial \theta }{\partial t}(\varepsilon ,\tilde{{t}})<0\). Hence, \(v^{\Omega }(t)<v^{\Omega ^{\varepsilon }}(t)\) in some right neighborhood of \(\tilde{t}\). Of course, this local benefit alone is not sufficient for \({\Omega }^{\varepsilon }\) to be a superior equilibrium to \({\Omega }\) from the Senders’ prospective. Nevertheless, eliminating this local benefit, as we will see, is a way to construct a model in which truthful revelation of all information is optimal for the Sender.

If one keeps the single-crossing property, local gains disappear if the Receiver does not take a higher action when information is garbled, i.e., when \(a^{\varepsilon }=\tilde{t}\). This implies that he must get a punishment if the state turns out to be lower than the chosen action. Moreover, it must be so large that even a tiny chance of such a punishment makes the Receiver worse off. Thus, marginal punishment must be infinite, and his utility function must be discontinuous.

We consider, therefore, a variation on the standard quadratic loss function. Let the Receiver’s utility \(u^{\mathrm{R}}( {a,t} )\) be given by

$$\begin{aligned} u^{\mathrm{R}}\left( {a,t} \right) =\left\{ {{\begin{array}{ll} -\left( {t-a} \right) ^{2}, &{}\quad if\,\,t\ge a \\ -\left( {t-a} \right) ^{2}-\gamma , &{}\quad if\,\,t<a, \\ \end{array}}} \right. \end{aligned}$$
(7)

with \(\gamma >1\). In other words, relative to the standard quadratic loss function, the Receiver experiences additional disutility \(\gamma \) if the state t turns out to be lower than the action a he takes.

This discontinuity in the utility function of the Receiver at \(a=t\) may arise in policy areas where any deviation from the right level of preparation is costly, but lack of preparation carries a separate, additional cost.Footnote 9 A key example is the reputational damage to the decision maker for being caught ‘unprepared.’ Another example is the loss to society from suffering floods due to dykes of insufficient height: even a small flood causes a discrete drop in welfare. The same discrete drop accompanies terrorism or health risks that have materialized due to insufficient investigative or detection capacity.

The Sender-optimal equilibrium for this uniform-quadratic model with discontinuous loss \(\gamma \) turns out to be the perfectly separating equilibrium, as the following proposition states.

Proposition 5

Let \(F( t )=t\), \(u( {a,t,x} )=-( {t+x-a} )^{2}\), and \(u^{\mathrm{R}}( {a,t} )\) be given by (7). Then, in the optimal equilibrium, the Sender burns money to truthfully reveal all states, \(P( {{\Omega }^{*}} )=\emptyset \).

Although there remains a conflict between the Sender and the Receiver as to the optimal action in a given state, the change in the utility function of the Receiver makes both of them experiencing a large disutility stemming from communicating ‘vague’ information (i.e., a message from pooling interval). With \(\gamma >1\), the Receiver’s best-reply to vague information is the lowest possible action consistent with the message. This aligns their preferences across equilibria, and the perfectly separating equilibrium is the unique Pareto-optimal equilibrium.

Can it be optimal for the Sender to burn money to reveal some of her private information without assuming a discontinuous utility function? This paper does not answer this question, as there is an important asymmetry between showing sub-optimality of pooling and sub-optimality of separation on an interval ( AB ).

There is a unique way to separate states from an interval ( AB ), namely by letting the Sender send different signals in all these states. Thus, in order to show that separation on ( AB ) is sub-optimal, it is sufficient to find one way to pool some of states in ( AB ) that generates a higher utility for the Sender than the unique way of separation. Our condition (3) represents one such a test.

There are, however, infinitely many ways of pooling states from an interval ( AB ) into sub-intervals: any partition of ( AB ) represents one such possibility. Therefore, in order to show that separation on ( AB ) is optimal, one needs to show that separation is better than any such way of pooling those states. This is much more demanding than what is required for deriving condition (3).

7 Conclusion

In this paper, we study how a privately informed Sender can influence a decision-maker by using both cheap talk messages and money burning. We focus on the Sender’s optimal equilibrium. We have shown that the Sender always garbles some information and that for a large class of cases—including the much studied uniform-quadratic case—the Sender garbles all information. Money burning, although it makes truthful revelation feasible, is either not used at all, or it is used to improve the actions that the Receiver takes by improving upon the partition that she would have used if the Sender could only use cheap talk. If money burning is used to improve the actions, it can also be used in the optimal equilibrium to induce an additional action. We have also shown that if garbled information induces the Receiver to act conservatively because of a discontinuity in his utility function, the Sender optimally reveals all her private information.

De Haan et al. (2015) study how subjects in a laboratory experiment use cheap talk and money burning in the uniform-quadratic case for various degrees of preference misalignment. It is unlikely that any lab experiment will be able to mimic the precise alternating pattern of the presence and absence of money burning as a function of the misalignment parameter that we have shown to be the best from the Sender’s perspective. They do find, however, that Senders have a strong preference for cheap talk, and that they do not burn money to support intervals of full separation.