1 Introduction

Economic agents often face strong incentives to engage in immoral/non-conforming/deceptive or fraudulent behaviour in the marketplace. Such decisions to cheat may be made at small-scale bilateral trade relationships but can also be made at the highest corporate level, as the recent Volkswagen emissions scandal has shown. In this case, one of the leading automotive manufacturing companies in the world, the Volkswagen Group, misled customers by manipulating emissions, in particular of nitrogen oxides (NOx), of their cars during testing. Dubbed, inter alia, ‘diesel dupe’ (Hotten 2015), ‘diesel fraud’ (Smith and Parloff 2016), or ‘Dieselgate’ (Gorzelany 2015) by the media, and referred to as ‘diesel issue’ by the company (Volkswagen 2015) this cheating scandal had substantial effects not only on the perpetrator itself, but also on industry standards as well as other competitors in the marketplace (who, after the scandal was made public at the end of September 2015 also became the subject of investigations due to suspected cheating). Figure 1 compares the evolution of cumulative fractional changes in the share prices for the Volkswagen with its main German competitors (BMW AG, Daimler AG) during 2012–2018.

Fig. 1
figure 1

Normalized share prices of the main German automobile manufacturers since 2012

The diesel scandal had a substantial (at least short term) impact on the whole industry. Of course, this is not surprising since the current stock price approximates the present value of future cash flows, which are influenced by information on the company and its assets, including consumers’ goodwill and trust. The combination of negative publicity due to the disclosed scandal and legal consequences, with their inherent uncertain outcomes, affected the stock price substantially. However, despite this major event the industry has rebounded surprisingly strongly, as recent performance indicators show [2017 was Volkswagen’s best year in terms of sales, only to be topped by the numbers of 2018, Volkswagen (2017, 2018)]. Still, since firms knowingly engaged in inappropriate or illegal activities towards their customers by promising non-existent product features, their so far excellent brand’s reputation is being tarred over time.

So why do companies occasionally trade off reputation capital for increased profits? Corporate reputation can be defined as “... a stakeholder’s overall evaluation of a company over time. This evaluation is based on the stakeholder’s direct experiences with the company, any other form of communication and symbolism that provides information about the firm’s actions and/or a comparison with the actions of other leading rivals” (Gotsi and Wilson 2001, p. 29). It is therefore a critical measure of trust for agents as well as a valuable intangible asset for corporations (Coombs 2010, 2015; Haskel and Westlake 2017). Reputation is also rivalry, since it can be represented as a continuum, where each subject can be compared and ultimately ranked to another one. Generally, firms recognize this fact and focus on taking actions that increase their respective image, which is well researched in the literature on advertising for goodwill (Nerlove and Arrow 1962; Fershtman 1984; Cellini and Lambertini 2003). However, just like most other resources, reputation capital can not only be built up, but can also be consumed and restored. Stakeholders have certain expectations with regard to the organization’s actions, which goes hand in hand with their perception of the firm. Clearly, organizational reputation suffers when such expectations are not fulfilled (Coombs 2015). This link is mediated by how much the organization is perceived to be responsible for a scandal. According to attribution theory, intentional corporate misbehavior leading to preventable scandals with substantial damages evoke some of the strongest perceptions of blame and responsibility, which results in a high potential for reputational damage (Coombs and Holladay 1996; Weiner 2013). Furthermore, one company’s crisis may spill over to the entire industry (compare Fig. 1). Such externalities may be positive or negative in nature. It is possible that exposed cheating weakens the cheater’s market position and hence favors his competitors. On the other hand, detected cheating could lead deceived victims to put whole industries under general suspicion, or at least increased scrutiny and, hence, to negative externalities. The case of Volkswagen highlights the Glauconian dilemma in business and here in particular with reference to the issue of Corporate Social Responsibility. Plato reports in The Republic how Glaucon challenges Socrates’ view by the thought experiment in which a ring (Gyges) makes the person invisible [the following is quoted from Haidt (2012)], “... no one, it seems, would be so incorruptible that he would stay on the path of justice or away of other people’s property .... Rather his actions would be in no way different from those of an unjust person and both would follow the same path.” In short, having the choice between doing good even at the risk of being considered a scoundrel, or appearing virtuous but actually being a scoundrel, most economic agents, and firms in particular, would presumably have to opt for the second. The contribution of this paper is to investigate the incentives for firms to cheat and in particular how competition induces or mitigates cheating. Of course, intensity of competition matters, however, the evidence in which direction this effect holds is mixed. Cartwright and Menezes (2014) report that intermediate levels of competition induce the highest level of cheating (misreporting rate in their study), whereas Kopel and Lamantia (2018) roughly report the opposite.Footnote 1 Schwieren and Weichselbaumer (2010) argue that competitive pressures increase nonethical conduct (compare Shleifer 2004). In this paper we emphasize the dynamics of firm cheating, accounting for the trust or mistrust consumers have about a particular firm and its relation to its competitors. The states of trust depend on past actions, deceits or honesty, of the firms. In short, we propose a differential game setup to study and compare outcomes under different market and information structures. A surprising finding of this paper is that dynamic strategic oligopolistic interactions (i.e., a Markov perfect equilibrium) mitigate cheating and in fact lead to a ranking opposite to the one obtained from the corresponding static game. Particularly open loop outcomes lead to excessive cheating, which could be somewhat mitigated by regular (even inconsequential) inspections, such that firms can adjust their policies and use the more honest Markov strategies.

The paper is organized as follows: the next section introduces the model in all its versions for the respective market structures (monopoly, cartel and duopoly). In Sect. 3 we derive the solutions for the static game. Afterwards, Sect. 4 is devoted to the dynamics, considering differing information structures. Section 5 compares the derived results. Finally, we conclude and offer directions for future research.

2 Model

What is crucial if a firm \(i=1,2\) cheats, greenwashes or, in general, exaggerates the quality of its product? Actual quality \((q_{0})\), is normalized to zero, while \(q_{i}>q_{0}=0\) denotes the extent to which firm i exaggerates the quality of its product. This generates a benefit on its own for the firm. However, this immediate benefit requires that cheating is not observable, at least not easily and directly, which holds in particular for experience (Nelson 1974) and credence goods (Darby and Karni 1973; Dullek and Kerschbamer 2006). Although consumers can judge the quality of an experience good once bought (e.g., the food served at an unknown restaurant or a movie in the cinema), this is not necessarily the case with a credence good. Here, consumers may also be unable to identify the properties of the good ex post, i.e., after purchase. Hence, for these types of goods it is often impossible, or at least very costly, to uncover cheating. This applies to many goods like the claims of fluoride in a toothpaste, organic food served in restaurants, to Volkswagen claiming to meet all emission standards (even by saving the costs for the more expensive SCR-catalytic converter, ‘BLUE’, that other companies installed) and to corporate social (ir)responsibility (Langevoort 2002; Lin-Hi and Müller 2013) in general.Footnote 2 Indeed, even claims that are known to be exaggerated matter. For example, it is common knowledge that the officially stated vehicle fuel consumption numbers will be exceeded by far in practice. Nevertheless, the relative position of claims still plays a role if car A is claimed to be more efficient than the competitor B, even though on paper the absolute numbers are implausible. However, cheating is not for free. Stated exaggerated claims can be costly because of the necessity of a camouflage (e.g., via software) or negligible if restricted to purely unverifiable technical claims. Misbehaviour may also cause moral costs for decision makers, including risk of fines and even of imprisonment. For instance, the stakes in the Dieselgate scandal are enormous. On the one hand, additional revenue streams from selling products with exaggerated quality (lemons) have been generated for many years. On the other hand, the costs range from allowing/ordering a software to fool inspectors, fines (recently, one billion dollars which Volkswagen accepted), full compensation for the cars sold in the US, costly upgrades and still pending law suits in Europe, to terms in jail (that Volkswagen’s managers face in the United States).

While cheating enhances (ceteris paribus) current sales and revenues this comes at the expense that a firm’s future reputation is tarred. We let negative reputation, a firm’s ‘badwill’ so to speak \(\left( B_{i}\right) \), evolve dynamically,

$$\begin{aligned} \dot{B}_{i}=q_{i}-\delta B_{i},\;i=1,2, \end{aligned}$$
(1)

i.e., badwill B is an exponentially weighted (by the parameter \(\delta \), \( 1/\delta \) being the time constant) average of past cheating. Higher values of \(B_{i}\) lower firm i’s profit \(\left( \pi ^{i}\right) \) but exaggerated claims \((q_{i})\) increase it. Our setup, in which truth is uncovered only slowly, seems to be particularly applicable to (imperfect) credence goods, so that the firms can hide truth only temporarily but not forever (‘you can fool all of the people some of the time, you can fool some of the people all of the time, but you can’t fool all of the people all of the time’, Abraham Lincoln). Not only the firm’s own actions and standing matter but also those of its competitors. Presumably, the relative position to the competitor j matters in both cases—the immediate gain from cheating as well as the reputational damage. Yet, instead of setting up firm i’s,

$$\begin{aligned} {\text {payoff}} = {\text {benefit (cheating)}} - {\text {loss (due to badwill)}} - {\text {costs (cheating)}}, \end{aligned}$$

in a more or less ad hoc manner, we motivate the firms’ payoffs from an underlying (linear) demand system. As a consequence, firms compete in cheating and production. We consider two market segments of identical size and their inverse demands, or respectively, their marginal willingness to pay, \(p_{i},\;i=1,2\),

$$\begin{aligned} p_{1}=1+aq_{1}-bq_{2}-\alpha B_{1}+\beta B_{2}-x_{1}-x_{2}, \end{aligned}$$
(2)
$$\begin{aligned} p_{2}=1+aq_{2}-bq_{1}-\alpha B_{2}+\beta B_{1}-x_{1}-x_{2}, \end{aligned}$$
(3)

where \(x_i\) denotes the quantity of firm i’s product and \(a,b,\alpha \) and \( \beta \) are positive constants. Thus, willingness to pay for an additional unit decreases with respect to total supply \(\left( x_{1}+x_{2}\right) \). Absent cheating, the maximum willingness to pay is normalized to unity. Technically, the two goods are homogeneous, however, from the consumers’ point of view they differ according to their respective perceived quality. More precisely, the difference between the two segments is that consumers from segment 1 are more susceptible to the claims and the reputation of firm 1 while consumers from segment 2 have the same symmetric feelings towards firm 2. Exaggerated (false) claims of the ‘own’ good are beneficial, yet similar claims of the competitor are harmful. It seems plausible to assume that the own effect dominates,

$$\begin{aligned} a>b. \end{aligned}$$
(4)

A firm’s negative reputation, \(B_{i}>0\), lowers consumers willingness to pay as the firm’s claims are mistrusted, while a negative reputation of the competitor is beneficial; again own effects dominate, \(\alpha >\beta \). Of course, one may allow for \(\beta <0\), as increased badwill of a firm can have a negative spillover on the competing firm since the entire industry faces increasing distrust. In order to render cheating demand enhancing and thus at least potentially profitable, the short run gain must exceed the associated (marginal) long run damage. More precisely, since cheating by one (infinitesimal) unit creates the benefit a but implies the (stationary) damage \(\alpha /\delta \), or respectively after accounting for discounting \( \left( r\right) \), we require

$$\begin{aligned} a>\frac{\alpha }{r+\delta }, \end{aligned}$$
(5)

and similarly for the cross effects. In particular, for a monopoly (or cartel) selling two brands we need,

$$\begin{aligned} b>\frac{\beta }{r+\delta }\wedge a-b>\frac{\alpha -\beta }{r+\delta }>0. \end{aligned}$$
(6)

These inequalities must hold strictly as the costs of cheating, for simplicity quadratic,

$$\begin{aligned} \frac{c}{2}q_{i}^{2},\;c>0, \end{aligned}$$
(7)

must be covered too.

Summarizing, if firms maximize their net present value of profits over an infinite horizon,

$$\begin{aligned} \underset{q_{i},x_{i}}{\max } \int _{0}^{\infty }e^{-rt}\pi ^{i}dt,\;i =1,2, \end{aligned}$$
(8)

with

$$\begin{aligned} \pi ^{i}= & {} p_{i}x_{i}-\frac{c}{2}q_{i}^{2}, \end{aligned}$$
(9)

then they must solve the above optimizations subject to the evolution of the (two) states, \(B_{i}\), \(i=1,2\), accounting for the competitor’s strategies; production costs are normalized to zero.

Remark

In this setup production and cheating are chosen simultaneously by both players and both move simultaneously in each period t. An alternative and also plausible assumption is that production \(\left( x_{i}\right) \) takes place after reputations (the states \( B_{i}\)) and the degrees of cheating (actions, controls, \(q_{i}\) ) are determined. Computing the Nash equilibrium of the intraperiod levels of production for given cheating and badwill would result in,

$$\begin{aligned} x_{i}=\frac{1+a\left( 2q_{i}-q_{j}\right) -b\left( 2q_{j}-q_{i}\right) -\alpha \left( 2B_{i}-B_{j}\right) -\beta \left( B_{i}-2B_{j}\right) }{3}, \end{aligned}$$

which implies the reduced form profit,

$$\begin{aligned} \pi ^{i}=\frac{\left( 1+\left( 2a+b\right) q_{i}-\left( a+2b\right) q_{j}-\left( 2\alpha +\beta \right) B_{i}+\left( \alpha +2\beta \right) B_{j}\right) ^{2}}{9}-\frac{c}{2}q_{i}^{2}, \end{aligned}$$

of firm i, with cheating as then the only control. This objective includes all linear, mixed and quadratic terms between firms’ actions and states. Of course, in this case a monopoly would keep the flexibility over both controls, cheating and production, since a sequential optimization would be suboptimal.

Returning to our simultaneous setup, we consider two different monopolistic solutions: a single-brand monopoly and a monopoly with different brands, or equivalently, a cartel of different firms. Selling different brands allows to benefit from the cross effects, even if symmetric strategies, to which the analysis is limited, are applied. Therefore, a monopoly or cartel can be modelled in two ways, even if assuming a symmetric allocation, i.e.,

$$\begin{aligned} q=q_{1}=q_{2},\;x=x_{1}=x_{2}, \end{aligned}$$

and thus,

$$\begin{aligned} B=B_{1}=B_{2}. \end{aligned}$$
  1. 1.

    A single-brand monopoly cannot benefit/lose from cross effects, thus,

    $$\begin{aligned} b=\beta =0. \end{aligned}$$
  2. 2.

    A multi-brand monopoly or cartel, such that the cross effects matter, i.e., \(b\ne 0\) and \(\beta \ne 0\) and both are presumably positive.

Therefore, a higher willingness to pay results if the monopoly commits to a single brand if the current loss induced on the other brand \(\left( b>0\right) \), is larger than the marginal gain due to the implied stationary badwill \(\left( \beta /\delta \right) \), and again \(\left( \beta /\left( r+\delta \right) \right) \) on a net present value basis; this is analogue to assumption (6). Combining this with the assumption that own effects dominate, inequality (4), introducing different brands will lower the willingness to pay and will lower cheating due to the overall negative cross effects. Therefore, more brands are, if at all, only preferable for the opposite inequalities, and will, surprisingly, result in less cheating.

3 Solution of the static game

Unfortunately, an analytical solution of the dynamic setup is either too cumbersome to draw analytical conclusions (in the case of a monopoly and a cartel) or not existing (at least, we could not obtain a closed-form solution). Therefore, we first consider the outcome using the steady state relationships and then use carefully calibrated relations to draw some inferences how competition, actual or inter-brand, (as in the case of different brands sold by a monopolist) affects the honesty of the firms, transiently and in the long run. Computing the symmetric equilibrium choices of \(x_{i}\) and \(B_{i}\), substituting for \(q_{i}=\delta B_{i}\), i.e., the steady states of (1), yields the following equilibria.

Result 1

For the single-brand monopoly (superscript m),

$$\begin{aligned} B^{m}= & {} \frac{a\delta -\alpha }{4c\delta ^{2}-\left( a\delta -\alpha \right) ^{2}}, \end{aligned}$$
(10)
$$\begin{aligned} x^{m}= & {} \frac{c\delta ^{2}}{4c\delta ^{2}-\left( a\delta -\alpha \right) ^{2}}, \end{aligned}$$
(11)

for the cartel, or equivalently a two-brands monopoly (superscript c),

$$\begin{aligned} B^{c}= & {} \frac{\left( a\delta -\alpha \right) -\left( b\delta -\beta \right) }{4c\delta ^{2}-\left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) ^{2}}, \end{aligned}$$
(12)
$$\begin{aligned} x^{c}= & {} \frac{c\delta ^{2}}{4c\delta ^{2}-\left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) ^{2}}, \end{aligned}$$
(13)

and for a duopoly (superscript d),

$$\begin{aligned} B^{d}= & {} \frac{a\delta -\alpha }{3c\delta ^{2}-\left( a\delta -\alpha \right) \left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) ^{2}}, \end{aligned}$$
(14)
$$\begin{aligned} x^{d}= & {} \frac{c\delta ^{2}}{3c\delta ^{2}-\left( a\delta -\alpha \right) \left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) ^{2}}. \end{aligned}$$
(15)

A single brand monopoly engages in more cheating than a monopoly selling two brands,

$$\begin{aligned} B^{m}>B^{c}, \end{aligned}$$
(16)

and a duopoly cheats even more than a single brand monopoly,

$$\begin{aligned} B^{d}>B^{m}\iff a<\frac{c\delta ^{2}+\alpha \left( b\delta -\beta \right) }{ \delta \left( b\delta -\beta \right) }, \end{aligned}$$
(17)

i.e., except high immediate gains from cheating (large a).

Proof

The first-order-conditions (FOC’s) of the cartel (assuming symmetry) result from equating the marginal costs of cheating to the induced marginal revenue, and production results from equating marginal revenues to the marginal production costs (assumed to be zero),

$$\begin{aligned}&c\delta ^{2}B =\left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) x, \\&1+\left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) B-2x =0. \end{aligned}$$

Solving these two equations for \(\left( B,x\right) \) yields the claim \( \left( B^{c},x^{c}\right) \) in (12) and (13). These conditions simplify for the monopoly, \(b=\beta =0\), to

$$\begin{aligned}&c\delta ^{2}B =\left( a\delta -\alpha \right) x, \\&1+\left( a\delta -\alpha \right) B-2x =0, \end{aligned}$$

and thus imply \(\left( B^{m},x^{m}\right) \) as reported in (10) and (11). The FOC’s of the duopoly (again after imposing symmetry),

$$\begin{aligned}&c\delta ^{2}B = \left( a\delta -\alpha \right) x, \\&1+\left( \left( a\delta -\alpha \right) -\left( b\delta -\beta \right) \right) B-3x =0, \end{aligned}$$

imply the same marginal revenue equals marginal costs condition for cheating as the monopoly. Solving those two equations verifies the claim \( \left( B^{d},x^{d}\right) \) in (14) and (15).

Compared to the cartel, the monopoly equates its marginal costs to higher marginal revenues from cheating (keeping output the same) and, similarly, from production (now keeping cheating constant). Directly, the numerator in the monopoly’s state, (10), is larger than the one of the cartel, (12), yet the denominator of (10) is smaller since a larger number is subtracted from the term \(4c\delta ^{2}\). This is the case under (6), hence, \(B^{m}>B^{c}\), as reported in (16). A comparison of the numerators of (10) and (14) yields

$$\begin{aligned} a\delta -\alpha>\left( a-b\right) \delta -\left( \alpha -\beta \right) \iff b\delta > \beta , \end{aligned}$$

i.e., if the immediate effect is dominating the intertemporal one, which is in accordance to our assumptions in (6). Now, comparing the denominators of (10) and (14) yields,

$$\begin{aligned}&4c\delta ^{2}-\left( \alpha -a\delta \right) ^{2}<4c\delta ^{2}-\left( \alpha -\beta -\left( a-b\right) \delta \right) ^{2}, \\&\iff (\beta -b \delta )(\beta -b \delta +2a\delta -2\alpha )<0, \end{aligned}$$

and the claim is equivalent to that this difference is negative, as \(B^{d}\) must have the smaller denominator. Solving this inequality for a verifies that \(B^d>B^m\) and yields the critical value reported in (17). \(\square \)

Cheating, \(B>0\), results for a duopoly and a single-brand monopoly only if the intuitively derived condition in assumption (5) holds. For a monopoly selling two brands, the analogous assumption (6) is necessary for cheating. The monopoly cheats more than the cartel (or a multi-brand monopoly), because the associated marginal revenue from cheating is diminished for a cartel due the cannibalization by the different brands (the cross effect). The duopoly’s high rate of cheating is a consequence of the larger production, because cheating is determined by a condition identical to the monopoly’s and the marginal revenues are scaled by the output.

4 Intertemporal equilibria

4.1 Monopoly and cartel

A symmetric cooperative equilibrium eliminates the gain from cheating that results from relative positions. This mitigates but does not eliminate the incentive for cheating. In spite of the two cases, we can restrict the analysis to the more general case of the cartel by solving the Hamilton–Jacobi–Bellman (HJB) equation for the value function J,

$$\begin{aligned} rJ\left( B\right) =\underset{q_i,x_i}{\max }\left\{ \sum _{i=1}^{2}\left[ p_ix_i-\frac{c}{2}q_i^2+J_{B_i}(q_i-\delta B_i)\right] \right\} , \end{aligned}$$
(18)

and obtain the monopoly solution by setting \(\beta =b=0\). The first-order-conditions of the maximization on the right side of (18),

$$\begin{aligned} ax_1-bx_2+J_{B_{1}}-cq_1= & {} 0, \\ 1+aq_1-bq_2-2(x_1+x_2)-\alpha B_1+\beta B_2= & {} 0, \end{aligned}$$

imply for the strategies of the monopoly (symmetry allows to confine the analysis to \(q=q_{1}=q_{2}\) and \(x=x_{1}=x_{2}\)),

$$\begin{aligned} q^m= & {} \frac{a\left( \alpha B-1\right) -4J^{\prime }}{a^{2}-4c}, \end{aligned}$$
(19)
$$\begin{aligned} x^m= & {} \frac{c\left( \alpha B-1\right) -aJ^{\prime }}{a^{2}-4c}. \end{aligned}$$
(20)

as well as the cartel,

$$\begin{aligned} q^c= & {} \frac{(a-b)\left[ (\alpha -\beta )B-1\right] -4J^{\prime }}{\left( a-b\right) ^{2}-4c}, \end{aligned}$$
(21)
$$\begin{aligned} x^c= & {} \frac{(\alpha -\beta )cB-c-(a-b)J^{\prime }}{\left( a-b\right) ^{2}-4c}. \end{aligned}$$
(22)

i.e., optimal cheating from a myopic perspective is adjusted for the intertemporal effect \(\left( J^{\prime }\right) \) of how cheating affects the future badwill. Although a closed-form solution exists, after all only a quadratic polynomial must be solved after proper substitutions and rearrangements when comparing coefficients of the Hamilton–Jacobi–Bellman equation for a quadratic guess of J, the result is very cumbersome and thus suppressed. Instead, we will present a numerical comparison in Sect. 5.

4.2 Duopoly

4.2.1 Open loop

The open loop equilibrium results if both firms solve their intertemporal optimization of (8) subject to (1), where the strategy \(\left\{ q_{i}\left( t\right) ,t\in [0,\infty )\right\} \) is a best response to the strategy of the opponent \(\left\{ q_{j}\left( t\right) ,t\in [0,\infty )\right\} \) and vice versa. Moreover, due to symmetry we have \(q_{i}=q_{j}\) in any Nash equilibrium. This setup ignores how the state, in particular of the competitor, evolves and is thus not subgame perfect if the opponent deviates, for whatever reason, from his optimal policy. However, the open loop strategy can make sense in this case, at least temporarily, since the badwill as well as cheating of the competitor may not be observable—or only occasionally, such as at the beginning of the game. Indeed, Volkswagen and presumably others, were able to hide their policy of cheating for quite some time.

Setting up the Hamiltonian of player i,

$$\begin{aligned} H^{i}=\pi ^{i}+\lambda ^{i}\left( q_{i}-\delta B_{i}\right) +\mu ^{i}\left( q_{j}-\delta B_{j}\right) ,\;i\ne i, \end{aligned}$$

the first-order-conditions for interior solutions are,

$$\begin{aligned} ax_{i}-cq_{i}+\lambda ^{i}= & {} 0, \\ 1+aq_{i}-bq_{j}-2x_{i}-x_{j}-\alpha B_{i}+\beta B_{j}= & {} 0, \\ \dot{\lambda }^{i}=\left( r+\delta \right) \lambda ^{i}+\alpha x_{i} ,\; \underset{t\rightarrow \infty }{\lim }e^{-rt}\lambda ^{i}B^{i}= & {} 0, \\ \dot{\mu }^{i}=\left( r+\delta \right) \mu ^{i}-\beta x_{i},\;\underset{ t\rightarrow \infty }{\lim }e^{-rt}\mu ^{i}B^{j}= & {} 0. \end{aligned}$$

The maximization of the Hamiltonian is independent of the competitor’s action and state and moreover, the second costate vanishes, \(\mu ^{i}=0\) for \(i=1,2\) due to the corresponding transversality condition. Assuming a symmetric equilibrium (thus dropping the index), the open loop strategies are given by,

$$\begin{aligned} q^o= & {} \frac{\left( \alpha -\beta \right) aB-a-3\lambda }{a^{2}-ab-3c}, \end{aligned}$$
(23)
$$\begin{aligned} x^o= & {} \frac{\left( \alpha -\beta \right) cB-c-\left( a+b\right) \lambda }{ a^{2}-ab-3c}. \end{aligned}$$
(24)

The symmetric, intertemporal open loop Nash equilibrium is characterized by the following pair of differential equations,

$$\begin{aligned} \dot{B}= & {} \frac{\left( \alpha -\beta \right) aB-a-3\lambda }{a^{2}-ab-3c} -\delta B, \\ \dot{\lambda }= & {} \left( r+\delta \right) \lambda +\frac{\left( \alpha -\beta \right) \alpha cB-\alpha c-\left( a+b\right) \alpha \lambda }{a^{2}-ab-3c}, \end{aligned}$$

which implies the steady state,

$$\begin{aligned} B^o_{\infty }=\frac{\alpha -a(r+\delta )}{A},\;\lambda ^o _{\infty }=\frac{ \alpha c\delta }{A}, \end{aligned}$$
(25)

with

$$\begin{aligned} A=(\alpha -\beta +b \delta )(\alpha -a(r+\delta ))-\alpha a \delta +\delta (r+\delta )(a^2-3c). \end{aligned}$$

4.2.2 Markov perfection

Finally, we consider the possibility of state-contingent strategies as well. In contrast to the time-dependent strategies in the open loop setup, the duopolistic outcome is now characterized by linear Markov strategies. The resulting equilibrium is strongly time-consistent, more precisely, subgame perfect such that the strategies remain optimal even if a player took an erroneous move. The Markov strategies are the natural outcome if applying (continuous time) dynamic programming. Therefore, we set up the Hamilton–Jacobi–Bellman equation (see Dockner et al. 2003) for the value function of player \(i=1\),

$$\begin{aligned} rV^{1}\left( B_{1},B_{2}\right) =\underset{q_{1},x_{1}}{\max }\left\{ \pi ^{i}+V_{B_{1}}^{1}\left( q_{1}-\delta B_{1}\right) +V_{B_{2}}^{1}\left( q_{2}-\delta B_{2}\right) \right\} , \end{aligned}$$
(26)

and analogously for player \(i=2\). The first-order-condition of the maximization of the right hand side for the instrument of cheating is,

$$\begin{aligned} cq_{i}=ax_{i}+V_{B_{i}}^{i},\ i=1,2, \end{aligned}$$
(27)

i.e., the marginal cost of cheating must equal the marginal benefit, which consists of the induced additional revenues \(\left( ax_{i}\right) \) minus the intertemporal loss from a damaged reputation, captured by \( V_{B_{i}}^{i}<0\). The production choice follows from the familiar condition that marginal revenue must equal marginal cost (zero in our case),

$$\begin{aligned} 1+aq_{i}-bq_{j}-\alpha B_{i}+\beta B_{j}-2x_{i}-x_{j}=0. \end{aligned}$$

As usual for LQ dynamic games, we guess a quadratic solution of the value functions,

$$\begin{aligned} V^{1}= & {} v_{0}+v_{1}B_{1}+v_{2}B_{2}+\frac{1}{2}\left( v_{11}B_{1}^{2}+2v_{12}B_{1}B_{2}+v_{22}B_{2}^{2}\right) , \end{aligned}$$
(28)
$$\begin{aligned} V^{2}= & {} w_{0}+w_{1}B_{1}+w_{2}B_{2}+\frac{1}{2}\left( w_{11}B_{1}^{2}+2w_{12}B_{1}B_{2}+w_{22}B_{2}^{2}\right) \end{aligned}$$
(29)

with coefficients \(v_{i},w_{i}\in \mathbb {R}\) and then assume symmetry,

$$\begin{aligned} w_{0}=v_{0},w_{1}=v_{2},w_{2}=v_{1},w_{11}=v_{22},w_{12}=v_{12},w_{22}=v_{11}, \end{aligned}$$

such that ultimately only the coefficients of \(V^{1}\) must be determined. Yet, even exploiting all these conditions does not allow for a meaningful analytical representation of the Markov perfect equilibrium. Therefore, the following numerical examples have to substitute for this shortcoming.

5 Comparison

We base our analysis on the following reference case:

$$\begin{aligned} a=0.2,\ b=0.1,\ \alpha =0.02,\ \beta =0.01,\ c=0.1,\ r=0.05,\ \delta =0.2. \end{aligned}$$
(30)

We let firms discount at \(5\%\) per annum and the consumers’ rate of forgetting/forgiving be given by \(20\%\), which assumes a rather weak memory. Turning to the demand parameters, cheating by one unit increases a consumer’s willingness to pay by \(a=0.2\) (for the preferred good/brand), i.e., by \(20\%\) relative to the otherwise maximum willingness to pay (which has been normalized to unity). This increase in willingness to pay is reduced by \(b=0.1\) if also the competitor exaggerates his claims by one unit, which is half of the own effect and in line with assumption (4). The long run effect due to the firm’s bad reputation amounts to \(\alpha =0.02\), which satisfies assumption (5), and the effect from the competitor’s badwill as half of that, \(\beta =0.01\). More precisely, the immediate effects of cheating are twice the long run effects, own and cross: cheating by one incremental unit triggers the gain \(a=0.2\) (and respectively the loss \(b=0.1\) from the competitor’s cheating) but also the long run reputational loss \(\alpha /\delta =0.1\) and \(\beta /\delta =0.05\) if ignoring discounting and transient behavior. That is, substantial incentives for cheating are assumed. The parameters also satisfy inequality (6). Finally, the cost parameter implies for hypothetical cheating levels of 1 / 2 , 1 and 2, marginal cheating costs of 0.05, 0.1, and 0.2.

Starting with the monopoly and the reference case, Fig. 2a and b compare the monopolistic strategies of a single brand with a multi-brand monopoly. This figure highlights two things: first, how much more cheating is employed by the single-brand monopoly because countereffects are missing. Second, both monopolistic strategies are increasing in their own badwill, i.e., the worse the reputation the more the monopoly tries to compensate by exaggerating claims and surprisingly, declining with respect to the other brand’s state of badwill. This is clearly counterintuitive and we may just quote as an explanation a proverb (which rhymes in German) that one can proceed unabashedly once one’s reputation is ruined. The same signs apply to the production strategy, and we note that both strategies (cheating and production) are fairly insensitive to changes in the state.

Fig. 2
figure 2

Strategies, cheating in a and c and production in b and d, in state space

Figure 2c and d compare the cartel (multi-brand) and the oligopolistic Markov perfect strategies also in state space. In contrast to the monopoly, the oligopolistic strategy has the expected sign: cheating decreases with respect to the own badwill but increases with respect to the opponent’s. Table 1 compares the steady states of the reference case across the different scenarios, including the outcomes of the static game. The insensitivity of all strategies with respect to the actual state of badwill is confirmed by how little production changes due to the dynamic analysis. However, compared to the cartel, the monopoly’s badwill increases and almost doubles, which rather shamefully exploits its monopoly position—not only with respect to output but also with respect to cheating.

Table 1 Steady state comparison of the controls and states including total discounted utility

In contrast, the dynamic analysis of reputational effects in duopolistic competition shows a dramatic reduction—stationary badwill is cut to below one half of the static game, which is the opposite for a single-brand monopoly. In fact, the duopoly ends up in less cheating than the monopoly (at least stationary and weighted over the evolution as the lowest level of badwill results). However, this positive characterization of competition is due to strong intertemporal strategic interactions (i.e., the linear Markov perfect equilibrium), as the open loop equilibrium of the dynamic game ends up at the highest level of badwill. The reason is that playing in open loop strategies ignores the evolution of the opponent’s badwill, which in this case leads to excessive cheating.

Fig. 3
figure 3

ad Sensitivity of stationary values with respect to model parameters around the reference case (vertical line) in (30). Production of the monopoly (constant at 0.24) and cartel (increasing but rather flat, from 0.25 to 0.26) are much lower and excluded in d in order to highlight the difference between the open loop and the Markov perfect equilibrium

Result 2

Accounting for dynamics results in an ambiguous effect on the long run behaviour of the firm, depending on the market and information structure. For the monopoly as well as the cartel (marginally) more cheating results, whereas a more nuanced picture emerges for actual competition. To summarize:

$$\begin{aligned} q_{dyn}^{m}>q_{stat}^{m},\ q_{dyn}^{c}>q_{stat}^{c}\ \ \text {and}\ \ q_{OL}^{d}>q_{stat}^{d}>q_{MP}^{d}. \end{aligned}$$

In particular, Markov strategies reduce cheating substantially compared to the open loop outcome.

That is, a dynamic consideration can turn the static results upside down if the firms play in the usually more aggressive Markov strategies. These findings are robust as the sensitivity analyses in Fig. 3, which compares stationary levels with respect to the model parameters around the reference case, document. Figure 3a displays the evolution of the long run state for all market structures with respect to \(\delta \) and shows that the relation is non-monotonic. However, increasing \(\delta \) (i.e., consumers’ memories become shorter and shorter), the dynamic solutions, including the Markov perfect one, approach the static ones. Therefore, the relation between monopoly and duopoly can be reversed at very high values of \(\delta \).Footnote 3 While long run cheating is declining in the marginal effect of one’s own state in all models (Fig. 3b ), this is not true for the opponent’s state (Fig. 3c). In the Markov perfect equilibrium cheating (as well as the state) is declining in \(\beta \) , which then also results in slightly less production due to a smaller inflated market size (Fig. 3d). These figures corroborate the conclusions from Fig. 2 and the previous results and further emphasize that Markovian strategies mitigate cheating. We note further that the monopoly’s cheating (and also of the cartel over certain domains of low values of \(\delta \) and over significant spillovers \( \beta \)) exceeds that of Markov perfect (but not open loop) competition.Footnote 4 This result—the cheating of German carmakers is due to their implicit cartelization—seems to fit hypotheses expressed in various media, including Domen and Hawranek (2017). As a consequence, further investigations by the EU commission in the direction of suspected long-lasting collusion among German carmakers have also been initiated recently.

Hence, information structure is crucial, i.e., the ability to monitor the state of the opponent (and the fact that this also holds for the opponent) leads to a dramatic reduction in cheating and therefore also to less reputational loss. On the other hand, lacking this knowledge or lacking the ability to condition one’s strategy on the evolution of the states leads to overshooting and therefore to the worst outcome with respect to consumers (i.e., by how much they are cheated, see Table 1). A trivial way of revealing the true underlying state is by inspections. These would make information public and thus would enable firms to credibly adjust their strategies in time. We note that even inconsequential audits, that is, audits devoid of any material consequences like a punishment fine, are sufficient in mitigating misbehaviour of firms. Indeed, this tightening of standards and regulations has followed after Dieselgate, but probably did not go far enough.

6 Final remarks

In this paper we introduced a simple dynamic game theoretic representation of cheating. Both aspects—dynamics as well as strategic interactions—are crucial. Indeed, a major result is that the analysis of the dynamic game turns the counterpart from the static game upside down. More precisely, a Markov perfect oligopoly equilibrium leads to the lowest level of badwill while a duopoly leads to the highest level of cheating (and thus also of badwill) in the static game and in the dynamic game in open loop strategies. Therefore, not only the consideration of dynamics is important but also the degree of strategic interaction. It is the interaction in the Markov perfect equilibrium that lowers cheating, which is in contrast to the resulting outcome if the duopolists compete in open loop strategies. The reason seems to be the countervailing effect of the opponent’s strategy, such that the Markov perfect strategies are fairly flat in the symmetric equilibrium.

Our model is general enough so that it may be applied to many examples. In regards to the game the German carmakers played, one may conjecture that those firms have not acted completely non-cooperatively. Or they colluded tacitly which was enabled by the reliance on common suppliers (e.g., Bosch). We observe that a cartel but even more so, a game played in commitment strategies, increases the incentives to cheat (i.e., the cheating need not be due to cartelization but due to the commitment, e.g., to certain technical solutions). This suggests that establishing competition (as is the case in most markets anyway) but coupled with sufficient public verification of claims should have the highest priority.

These observations are based to some extent on numerical examples and a theoretical and more robust verification of our claims could be part of future research. Of course, this is a tough task given a dynamic game with multiple states and their interaction in the objectives. Also, the analysis of possible multiple nonlinear equilibria is ignored, because in this setting, a partial differential equation implied by the Hamilton–Jacobi–Bellman equation rules out graphical phase plane techniques in order to derive and characterize nonlinear strategies. Therefore, this is also left for future research (and as the one above, hard to solve too). Other possible extensions would be to include spillovers in the dynamics of the competitor’s badwill and to include stochastic processes. Considering the discrete event of ‘Dieselgate’, Poisson jumps could capture the uncovering of cheating and its subsequent effect on the consumers’ perception (see Hirschmann 2014).