1 Introduction

This paper examines the welfare tradeoff between patent and antitrust law. Patent law enables the patent holder to obtain and exploit a monopoly lawfully. Antitrust law regulates the acquisition, maintenance, and, to some degree, the exploitation of monopoly power.

Since patent and antitrust law have superficially contradictory goals, the question that naturally arises is how one should choose between the two in instances where there appears to be a conflict. There are methods of exploiting patent monopolies that have been treated as antitrust violations. For example, the Federal Trade Commission recently sued Qualcomm, a manufacturer of communications technology, on the theory that the firm had abused its patent monopoly by adopting a two-part pricing scheme in the licensing and sale of its smartphone semiconductor chips.

One sensible approach to choosing between two legal standards, or between two proof standards with respect to evidence, is to consider the relative costs of errors. Over-enforcement generates “false positives”: cases where the regulated firm is punished or prohibited from taking a certain action when society should prefer that the action be taken. Under-enforcement generates “false negatives”: cases where society should prefer that the excused firm be punished or prohibited from taking action. The approach in this paper is to consider the ratio of false positives to false negatives in the patent antitrust area. A high error cost would imply that antitrust should be reluctant to restrain patentees. Moreover, proof standards and legal doctrines should be biased toward protecting innovation incentives. We find that the relevant error-cost ratio for patent antitrust is the ratio of the sum of the monopoly profit and the residual consumer surplus to the deadweight loss. This is different from the profit-deadweight loss (reward-social loss) ratio that has been advanced in the literature (Kaplow 1984; Gilbert and Shapiro 1990; Klemperer 1990). The reward-loss ratio is a doubtful measure of the patent-antitrust welfare tradeoff because it does not take into account the residual surplus to consumers, and thereby underweights the social value of innovation.

We find that the error-cost ratio—for a wide variety of deterministic demand functions—ranges from infinity to a low of roughly three. This supports a bias—when designing legal standards and proof standards in patent antitrust law—in favor of the patent holder.

2 Baseline model

We assume that patent protection gives the innovator a property right that sometimes yields a monopoly in a market that exists only because of a prior investment by the innovator. In the first period, the prospective patentee invests in an innovation; and in the second period, the innovator is awarded a patent (with probability one) that guarantees a monopoly in the market that is created by the innovation.Footnote 1 The costs of the innovation are sunk when the second period arrives.

The patentee-monopolist faces a downward-sloping inverse demand curve \( {\text{p}} = {\text{p}}\left( {\text{q}} \right) \) and constant marginal cost of c. The firm’s profit maximization problem is

$$ \mathop {{\text{max:}}}\limits_{q} \quad \uppi \left( {\text{q}} \right) = {\text{p}}\left( {\text{q}} \right){\text{q}} - {\text{cq}} $$

The monopolistic output \( q^{ *} \) satisfies the familiar optimality condition \( p^{\prime}\left( {q^{ *} } \right)q^{ *} + {\text{p}}\left( {q^{ *} } \right) = {\text{c}} \). Denoting demand elasticity at a price p by \( {\epsilon}\left( p \right) = - \frac{{q^{\prime}\left( p \right)}}{q/p} \), the monopoly price \( p^{ *} \) satisfies \( p^{{ \star }} \left[ {1 - \frac{1}{{{\epsilon}\left( {p^{{ \star }} } \right)}}} \right] = {\text{c}}. \) The patentee anticipates all of this when he invests in innovation in the first period.

Under competition, price equals marginal cost: \( p^{c} = {\text{c}} \); and the competitive quantity is \( q^{c} = p^{ - 1} \left( {p^{c} } \right) \). Figure 1 illustrates the standard monopoly outcome: with profit denoted by \( {{\uppi }} \), the residual consumer surplus is denoted by RS and the deadweight loss, from constraining output below the competitive level, is D.

Fig. 1
figure 1

Error-cost ratio

3 Error-Cost Ratio

In this part, we consider the welfare tradeoffs of antitrust enforcement in the intellectual property area. Although we focus on patents, the model applies equally to many other types of intellectual property (copyrights, trade secrets, trademarks) when a monopoly results.

Figure 1 illustrates our basic argument. In the figure, \( p_{BE} \) is the break-even price that is necessary for the innovator to recover the fixed (sunk) costs of innovation. If the price that the firm expects to receive in the second period is less than the break-even price, the firm will not invest in innovation (research and development, R&D).

We incorporate antitrust, in Fig. 1, as a mechanism that operates as a price regulation: \( \hat{p} \). This is different from the more traditional economic model of antitrust that operates as a penalty that is imposed on monopolizing firms.Footnote 2 Here, antitrust operates as an injunction that constrains the firm from choosing its preferred price-output combination along the demand curve.

If the anticipated antitrust-regulated price cap is greater than the break-even price, the firm will innovate and charge up to the price cap. If the anticipated antitrust price cap is less than the break-even price, the firm will not invest in the first period, and no entry will occur.

The most stringent antitrust regime is equivalent to a price cap equal to marginal cost. In this case, antitrust is so effective that the innovator will be forced to charge the competitive price in the second period. Investment will occur in the first period, in this case, only if the break-even price is equal to marginal cost. That will occur only if R&D is essentially costless, which is likely to be rare. An example of an injunctive policy that would implement a price cap equal to marginal cost is a rule that denies enforceability of the patent, which thus opens the market to competition and drives the price down to marginal cost.

The most relaxed antitrust regime would set the effective price cap at the monopoly price \( p^{ *} \). With an antitrust-regulated price cap that is greater than or equal to the monopoly price, the innovator would never be deterred by the threat of antitrust regulation from investing.

Perfect antitrust, in this model, involves setting the antitrust price cap equal to the break-even price. With the antitrust cap set at the break-even price, the innovator will invest, and society will get the benefit from innovation with the smallest possible deadweight loss.Footnote 3

If the antitrust price cap is less than the break-even price, society loses the gain from innovation. The firm will not invest in innovation, and the minimum social loss is the sum of consumer’s surplus and the firm’s profit in the unconstrained regime:

$$ {\text{RS}} + {{\uppi }} = \mathop \smallint \limits_{0}^{{q^{{ \star }} }} \left[ {p\left( q \right) - c} \right]dq. $$

Society does not lose the potential welfare captured by area \( {\text{D}} \) in Fig. 1 because this portion of the potential surplus from innovation would never have been available to society in the unconstrained regime.Footnote 4 The maximum society will lose is the deadweight loss \( {\text{D}} \), given by

$$ {\text{D}} = \mathop \smallint \limits_{{q^{{ \star }} }}^{{q^{c} }} \left[ {p\left( q \right) - c} \right]dq. $$

Errors, in this framework, are deviations from perfect antitrust enforcement. An error in the direction of excessive enforcement, where the antitrust price cap is below the break-even price, results in “false conviction” or “false positive” costs. Errors in the direction of too little enforcement, where the antitrust price cap is greater than the break-even price, result in “false acquittal” or “false negative” costs. For enforcement authorities, the error-cost ratio—the ratio of false positive costs to false negative costs provides guidance as to the optimal direction of any bias that is due to errors in enforcement. For adjudicatory tribunals, the error-cost ratio provides guidance as to the standard of proof that should govern disputes over whether a firm has violated the antitrust laws. More generally, the error-cost ratio also serves as a measure of the welfare tradeoff that is relevant to any conflict between the scope of the patent laws and the scope of the antitrust laws. This measure of the welfare tradeoff differs from the ratio test (reward-to-social-loss ratio) that has been advanced in parts of the patent antitrust literature.

Given the foregoing, the most conservative measure of the error-cost ratio is represented by

$$ {{\uprho }} \doteq \frac{{RS + {{\uppi }}}}{D}. $$

This quotient provides a lower-bound measure: It takes account of the different levels of stringency in antitrust enforcement: the ratio of the cost of excessive enforcement (false convictions) to the cost of inadequate enforcement (false acquittals) in the patent antitrust area.

Under traditional decision theory arguments (Kaplan 1968; Burtis et al. 2018), an error-cost ratio that is equal to one would justify a balanced approach to the standard of proof, such as the preponderance standard. Such a standard would treat prospective errors in the direction of excessive enforcement as equally costly as prospective errors in the direction of lenient enforcement. On the other hand, a ratio of one-third would justify an approach that favors patent challengers: say, by adopting a rebuttable presumption of guilt in patent-antitrust cases. A ratio that is considerably higher than one—such as three—would justify a proof standard that favors the innovator.

These implications of the error-cost ratio should be considered, while also taking into account that it is a minimum estimate (because \( {\text{D}} \) is a maximum estimate). Thus, if the ratio that was presented above is greater than one, given a particular market structure, the actual ratio in applications to specific regulatory interventions within the particular market typically will be even greater. If the ratio is greater than one, then the case for adopting a proof standard that favors the innovator is even stronger than is implied by the numerical value of the ratio.

The ratio formula itself, however, provides no guidance as to what the value of it might be in real-world situations. It seems plausible, initially, that some market structures might generate a high ratio, and others might generate a low ratio. In the abstract, a demand function could take a shape that could generate almost any positive numerical value as a ratio estimate. However, we limit the range of possibilities below by examining plausible and widely used functional forms.

We analyze the error-cost ratio first by a linear demand function. Then we relax the assumption to show that the qualitative results are robust over several commonly used demand assumptions.

3.1 Linear demand

Assume the demand function takes the form \( {\text{p}} = {\text{A}} - {\text{bq}} \), where \( {\text{A}} > 0 \) and \( {\text{b}} > 0 \). The basic results are summarized in Proposition 1. In the next subsection, we show that the qualitative result holds u:nder more robust assumptions on the demand function.

Proposition 1

(Linear demand) If the demand curve is linear and marginal cost is constant, then the error-cost ratio for patent antitrust is

$$ {{\uprho }} \doteq \frac{{RS + {{\uppi }}}}{D} = 3. $$

Under linear demand, the error-cost ratio is a constant value of three. If the error-cost ratio happened to be one, there would be a credible argument for adopting a balanced approach to patent antitrust where the risk of excessive enforcement is equated with the risk of inadequate enforcement. In terms of proof standards for adjudication, this would be equivalent to a preponderance test. However, an error-cost ratio of three is more consistent with a biased approach to the risk of excessive enforcement, where the bias favors leniency toward the regulated party (Burtis et al. 2018). Alternatively, the ratio of three suggests—in the adjudication context—a “clear and convincing” standard of proof. We consider below whether this implication is also valid for other common representations of consumer demand.

3.2 Power-Law Demand

In this part we consider power-law demand functions: such as the isoelastic, algebraic, and exponential forms. The advantage of these forms over the linear is that they better represent demand in settings of wealth inequality or where a relatively small number of consumers bid intensively for the good (e.g., medical care).

Consider the algebraic demand form, \( {\text{p}} = {{\alpha }}q^{{{\upbeta }}} - \), \( {{\upbeta }} \in \left( { - 1,0} \right) \). After that, we will examine the results under isoelastic and exponential functional form. Linear and isoelastic demand functions are special cases of the algebraic form. In particular, if \( {{\alpha }} = - {\text{b}},{{\upbeta }} = 1 \), and \( {{\upsigma }} = - {\text{A}}, \) the algebraic demand function turns out to be a linear demand function. If σ = 0, the algebraic demand function becomes an isoelastic demand function, \( {\text{p}} = {{\alpha }}q^{{{\upbeta }}} \).

Proposition 2

(Algebraic demand) If the demand curve is algebraic\( {\text{p}} = {{\alpha }}q^{{{\upbeta }}} - {{\upsigma }} \), \( {{\upbeta }} \in \left( { - 1,0} \right) \), and marginal cost is a constant\( {\text{c}} \), then the minimum false conviction cost is\( {\text{RS}} + {{\uppi }} = \left( {{{\upsigma }} + c} \right)\left[ {\frac{{ - {{\upbeta }}\left( {{{\upbeta }} + 2} \right)}}{{\left( {{{\upbeta }} + 1} \right)^{2} }}} \right]\left[ {\frac{{\left( {{{\upsigma }} + c} \right)}}{{{{\alpha }}\left( {{{\upbeta }} + 1} \right)}}} \right]^{{1/{{\upbeta }}}} \)and the maximum false acquittal cost is\( {\text{D}} = \left( {{{\upsigma }} + c} \right)\left[ {\frac{{ - {{\upbeta }}}}{{{{\upbeta }} + 1}}} \right]\left[ {\frac{{{{\upsigma }} + c}}{{{\alpha }}}} \right]^{{1/{{\upbeta }}}} - \left( {{{\upsigma }} + c} \right)\left[ {\frac{{ - {{\upbeta }}\left( {{{\upbeta }} + 2} \right)}}{{\left( {{{\upbeta }} + 1} \right)^{2} }}} \right]\left[ {\frac{{\left( {{{\upsigma }} + c} \right)}}{{{{\alpha }}\left( {{{\upbeta }} + 1} \right)}}} \right]^{{1/{{\upbeta }}}} \).Footnote 5Therefore, the error cost ratio is

$$ {{\uprho }} \doteq \frac{{RS + {{\uppi }}}}{D} = \frac{1}{{\frac{{\left( {{{\upbeta }} + 1} \right)^{{1 + 1/{{\upbeta }}}} }}{{{{\upbeta }} + 2}} - 1}}, $$

which ranges, as the elasticity of demand increases, from\( \infty \)to a lower bound of\( \frac{1}{{{\text{e}}/2 - 1}} \).

The algebraic demand form does not generate a constant elasticity. Elasticity is \( {\epsilon}\left( p \right) = - \frac{p}{{{{\upbeta }}\left( {p + {{\upsigma }}} \right)}} \), so introducing the parameter \( {\epsilon} \), where \( {\epsilon} = - \frac{1}{{{\upbeta }}} \)\( {\epsilon} \equiv - \frac{1}{{{\upbeta }}}, \) allows us to examine the behavior of the error cost ratio as demand elasticity goes to infinity.Footnote 6 As demand becomes more inelastic, the error-cost ratio approaches infinity. As demand becomes more elastic, the error-cost ratio falls to its lower bound of \( \frac{1}{{{\text{e}}/2 - 1}} \approx 2.8 \).

In more intuitive terms, Proposition 2 states that for relatively u:ncompetitive markets—where the elasticity of demand is still above but close to one—the error-cost ratio is extremely high. For such markets, the sum of profit and consumer surplus is very large relative to deadweight loss, and society loses much more than one dollar for each dollar of deadweight loss that is avoided through excessive antitrust regulation. The error-cost ratio falls toward its lower bound of roughly 2.8 as the market moves toward perfect competition.

Proposition 3

(Isoelastic demand) If the demand curve is isoelastic\( {\text{p}} = {{\alpha }}q^{{{\upbeta }}} , \)\( {{\upbeta }} \in \left( { - 1,0} \right) \), and marginal cost is a constant\( {\text{c}} \), then the minimum false conviction cost is\( {\text{RS}} + {{\uppi }} = {\text{c}}\left[ {\frac{{ - {{\upbeta }}\left( {{{\upbeta }} + 2} \right)}}{{\left( {{{\upbeta }} + 1} \right)^{2} }}} \right]\left[ {\frac{c}{{{{\alpha }}\left( {{{\upbeta }} + 1} \right)}}} \right]^{{1/{{\upbeta }}}} \), and the maximum false acquittal cost is\( {\text{D}} = {\text{c}}\left[ {\frac{{ - {{\upbeta }}}}{{{{\upbeta }} + 1}}} \right]\left[ {\frac{c}{{{\alpha }}}} \right]^{{1/{{\upbeta }}}} - {\text{c}}\left[ {\frac{{ - {{\upbeta }}\left( {{{\upbeta }} + 2} \right)}}{{\left( {{{\upbeta }} + 1} \right)^{2} }}} \right]\left[ {\frac{c}{{{{\alpha }}\left( {{{\upbeta }} + 1} \right)}}} \right]^{{1/{{\upbeta }}}} \). Therefore, the error cost ratio is

$$ \rho \doteq \frac{RS + \pi }{D} = \frac{1}{{\frac{{\left( {\beta + 1} \right)^{1 + 1/\beta } }}{\beta + 2} - 1}}. $$

This case delivers the same result as the algebraic demand case, and again the error-cost ratio ranges with the elasticity of demand from positive infinity to a limiting lower bound of \( \frac{1}{{{\text{e}}/2 - 1}} \). Figure 2 shows the relationship between the error-cost ratio and the elasticity of demand for the isoelastic case.Footnote 7

Fig. 2
figure 2

Error-cost ratio and demand elasticity

Proposition 4

(Exponential demand) If the demand curve is exponential form\( q= \gamma\,e^{\wedge} \{ { - \beta\,p} \},\,\beta > 0 \)and marginal cost is a constant c, then the minimum false conviction cost is\( RS + \pi = \frac{1}{\beta }\gamma e^{ - \beta c} \left( {1 - \frac{2}{e}} \right), \)the maximum false acquittal cost is and the error-cost ratio is\( D = \frac{1}{\beta }\gamma e^{ - \beta c} \left( {1 - \frac{2}{e}} \right) \), and the error cost ratio is

$$ {{\uprho }} = \frac{1}{{\frac{e}{2} - 1}}. $$

Like the linear case, exponential demand delivers an error-cost ratio that is independent of market structure as measured by the elasticity of demand. Of the demand functions that we consider in this part, the exponential generates the lowest error-cost ratio, which is a constant value of \( \frac{1}{{\frac{e}{2} - 1}} \approx 2.8. \) The linear and the exponential also share the feature that there is a maximum price that consumers are willing to pay, above which demand falls to zero. However, in most markets, there is always some consumer willing to bid up the price of a scarce item to relatively high levels. In particular, in markets that are characterized by wealth inequality among consumers, the wealthiest consumers can bid u:p the price of a scarce and highly desirable good (e.g., housing) to a level that is quite well beyond affordability for the average consumer. Given this, the algebraic form probably best captures the features of real markets.

4 Model Extension

This section presents an extended model that incorporates the probability of violating antitrust law to model explicitly the firm’s stage-1 investment decision and the error-cost ratio. The reason is to separate the effect of market structure on the probability of entry (investment) and on the error- cost ratio. By separating these two effects, we can capture both the static and dynamic effects of changes in market structure on the error-cost ratio.

At stage 1, the firm observes the R&D investment cost \( {\text{K}} \) and decides whether to invest in innovation. The choice variable at stage 1 is denoted as \( {\text{r}} \in 0,1 \), where \( {\text{r}} = 1 \) denotes that the firm invests in R&D and enters the subgame of stage 2, and \( {\text{r}} = 0 \) denotes that the firm does not enter and the game ends. If \( {\text{r}} = 1 \), the firm incurs the fixed cost of R&D and enters into stage 2. At stage 2, the firm is possibly faced with an antitrust challenge. Let \( {\text{v}} \) denote the probability that the firm is held to violate the antitrust law and receives zero economic profit in stage 2.

This framework differs from the previous part by treating antitrust enforcement as “all or nothing”. As a result, the error-cost ratio is the same as before, though this time it represents the consistent value, given a specific demand curve, rather than the minimum that takes account of different levels of enforcement stringency. However, the probability of enforcement affects the incentive to invest and the likelihood that deadweight loss occurs.

4.1 Stage-1 Choice

Assume that the probability of antitrust legal enforcement is fixed at a level v, v ∈ (0, 1). The innovation investment cost \( {\text{K}} \) follows a cumulative density function \( {\text{F}} \), which is differentiable and its derivative \( F^{\prime}\left( K \right) \doteq f\left( K \right) \) is positive everywhere. Assume that the firm is risk neutral

We solve the firm’s entry game by backward induction. From our proceeding result in the previous section, if the firm continues to stage 2 as a monopoly, it charges the monopoly price \( p^{ *} \) and receives the monopoly profit \( {{\uppi }}^{ *} \). Given that v is the probability that the firm is held in violation of the antitrust law, the expected stage-2 utility is

$$ {\text{E}}\left[ {U_{2} \left( {r = 1} \right)} \right] = \left( {1 - v} \right){{\uppi }}^{ *} $$

where \( {\text{r}} = 1 \) means that the firm invests in R&D in stage 1, and \( U_{2} \) denotes the firm’s utility at stage 2. If the firm does not invest in R&D, it earns zero. It optimizes by choosing whether to invest or not in stage 1. Thus the firm’s stage 1 maximal utility is

$$ U_{1} = \hbox{max} \left\{ {E\left[ {U_{2} \left( {r = 1} \right)} \right] - K,0} \right\}, $$

Therefore, the firm invests in R&D to continue to stage 2, if and only if \( {\text{E}}\left[ {U_{2} \left( {r = 1} \right)} \right] - {\text{K}} \ge 0 \), that is,

$$ {\text{r}} = 1,{\text{if}}\;{\text{and}}\;{\text{only}}\;{\text{if}}\;\left( {1 - v} \right){{\uppi }}^{ *} \ge K. $$

Let \( {{\uptheta }} \) denote the probability that the firm invests in R&D, and is called probability of entry in later analyses.Footnote 8 We have the following expression for \( {{\uptheta }} \),

$$ {{\uptheta }} \doteq {\text{Probability}}\left\{ {\left( {1 - v} \right){{\uppi }}^{ *} \ge K} \right\} = {\text{F}}\left[ {\left( {1 - v} \right){{\uppi }}^{ *} } \right] $$

To study the effect of market structure on the firm’s R&D investment decision, we study how market structure affects the monopoly profit \( {{\uppi }}^{ *} \) which then determines the firm’s entry decision.

$$ \begin{array}{*{20}c} {\frac{\partial \theta }{{\partial \pi^{*} }} = f\left( {\left( {1 - v} \right)\pi^{*} } \right) \cdot \left( {1 - v} \right) > 0} \\ \end{array} $$
(1)

The condition in Eq. (1) indicates, in accord with intuition, that the firm’s investment decision depends positively on the monopoly profit that is expected at stage 2.

5 The Probability of Entry and the Adjusted Error Cost Ratio

As was analyzed in the prveious part, if the firm does not invest in R&D, the society loses \( {\text{RS}} + {{\uppi }} \). If the firm invests in R&D and monopolizes in stage 2, the society loses the monopoly deadweight loss \( {\text{D}} \). If we take the firm’s R&D probability into consideration, the adjusted error cost ratio is

$$ {\tilde{{\uprho }}} \doteq \frac{{\left( {1 - {{\uptheta }}} \right)\left( {{\text{RS}} + {{\uppi }}} \right)}}{\uptheta {{\text{D}}}} = \frac{{1 - {{\uptheta }}}}{{{\uptheta }}}. $$

The effect of entry probability on error cost is captured by the partial derivative of \( {\tilde{{\uprho }}} \) with regard to \( {{\uptheta }} \), which is given by

$$ \frac{{\partial \tilde{\rho }}}{\partial \theta } = - \frac{{{\uprho }}}{{{{\uptheta }}^{2} }} < 0, $$
$$ \frac{{\partial^{2} {\tilde{{\uprho }}}}}{{\partial {{\uptheta }}^{2} }} = \frac{{2{{\uprho }}}}{{{{\uptheta }}^{3} }} > 0. $$

Thus, the error cost ratio decreases as the likelihood of innovation investment increases, and this negative relationship grows at an increasing rate. Other things equal, excessive enforcement is less harmful to society where there are strong incentives to invest because of high expected monopoly profits or low R&D costs. In addition, a small increase in the investment probability has a larger downward effect on the error cost ratio in an industry where firms have more and frequent R&D than in an industry with less R&D intensity.

5.1 Effect of Market Structure

The price elasticity of demand is an important feature of market structure, and usually varies across industries (Johnson and Helmberger 1967). A number of industry-specific characteristics contribute to the pattern of demand elasticity, such as the degree of substitutability between goods. If a product does not have substitutes, such as some drugs and treatments for rare diseases, the product’s demand is likely to be relatively inelastic. Path dependence (switching costs) is another factor that gives rise to inelastic demand (Klemperer 1987). Path dependence is a common feature that is observed in the demand for high tech products, such as online platforms and operating systems. Because consumers take time to adapt to a new technology, they tend to continue using related products from the same firm and are willing to pay a price premium for doing so.Footnote 9

5.1.1 Negative Relationship Between Elasticity and Profit

As a general matter, a low demand elasticity reveals the consumer’s greater tolerance for price increases, and the monopolist can exploit this high tolerance to earn more profits. In this sense, an industry with low demand elasticity is favorable for a monopoly firm (Kamien and Schwartz 1970). Of course, demand elasticity varies along the demand curve in most cases. To study the effect of demand elasticity on profit, we will first have to construct a parameter that tracks the elasticity measure at all points along the demand curve. By examining the relationship between such a measure of elasticity and profit, we can draw inferences on the relationship between market structure and profitability.

We start by considering the power demand functions: isoelastic, algebraic, and exponential. For the isoelastic form \( {\text{p}} = {{\alpha }}q^{{{\upbeta }}} , \)\( {{\upbeta }} < 0 \), the price elasticity of demand is simply which tracks elasticity at all points along the demand curve. The monopoly profit under isoelastic demand can be expressed as a function of demand elasticity

$$ {{\uppi }}^{ *} = \left( {\frac{{\epsilon} - 1}{c}} \right)^{{\epsilon} - 1} \left( {\frac{\alpha }{{\epsilon}}} \right)^{{\epsilon}} \;{\text{where}}\;\begin{array}{*{20}c} {\frac{{\partial \pi^{*} }}{\partial {\epsilon}} < 0.} \\ \end{array} $$
(2)

Generally, the elasticity of demand for each of the power demand functions can be expressed as \( {\epsilon}\left( p \right) \doteq {\epsilon}f\left( p \right) \), where \( {\epsilon} \) is a parameter that tracks the elasticity of demand. In the algebraic case, \( {\epsilon}\left( p \right) = - \frac{p}{{{{\upbeta }}\left( {p + {{\upsigma }}} \right)}} \), so that \( {\epsilon} = - \frac{1}{{{\upbeta }}} \) and \( {\text{f}}\left( p \right) = \frac{p}{{\left( {p + {{\upsigma }}} \right)}} \). In the isoelastic case \( {\epsilon} = - \frac{1}{{{\upbeta }}} \) and \( {\text{f}}\left( p \right) = 1 \). In the exponential case, \( {\epsilon}\left( p \right) = - {{\upbeta }}p \), so that \( {\epsilon}\left( p \right) = \), and \( {\text{f}}\left( p \right) = {\text{p}} \). For this class of demand functions, (2) holds.Footnote 10

Although the parameterization is not as straightforward, a similar decomposition can be accomplished with linear demand, where the same negative relationship between profit and the elasticity tracking parameter holds.Footnote 11

5.1.2 Dynamic Effect of Market Structure on the Error-Cost Ratio

Recall that in Sect. 3, we demonstrated that the probability of investment (entry) increases with monopoly profit. Combining this with the negative relationship between profit and demand elasticity, we expect that the less elastic market demand generates higher investment and entry probability.

$$ \begin{array}{*{20}c} {\frac{\partial \theta }{\partial {\epsilon}} = \frac{\partial \theta }{\partial \pi } \cdot \frac{\partial \pi }{\partial {\epsilon}} < 0.} \\ \end{array} $$
(3)

To consider the effect of market structure on the error-cost ratio, we analyze the derivative of the error-cost ratio with respect to elasticity

$$ \begin{array}{*{20}c} {\frac{{\partial \tilde{\rho }}}{\partial {\epsilon}} = \frac{{\partial \frac{{\left( {1 - \theta } \right)}}{\theta }\rho }}{\partial {\epsilon}} = \mathop {\mathop {\frac{1 - \theta }{\theta }\frac{\partial \rho }{\partial {\epsilon}}}\limits_{{}} }\limits_{static\,effect} + \left[ {\mathop {\mathop { - \frac{\rho }{{\theta^{2} }}\frac{\partial \theta }{\partial {\epsilon}}}\limits_{{}} }\limits_{\text{dynamic effect}} } \right],} \\ \end{array} $$
(4)

where the first part in Eq. (4), which is negative, is the static effect of elasticity on the error-cost ratio, as was discussed in Sect. 2. The second term, which is positive, captures the dynamic effect of the demand elasticity on the error-cost ratio. As the post-patent-award market becomes more competitive—because of the greater availability of substitutes—investment and entry are less likely, which increases the error-cost ratio. On the one hand, low elasticity leads to a larger monopoly profit and residual surplus relative to deadweight loss, which leads to a larger error cost as the direct impact, which we call the static effect in this model. On the other hand, the higher monopoly profit that is expected in stage 2 encourages entry, which partially offsets the static effect.

The conflicting static and dynamic effects suggest that the relationship between demand elas- ticity and error cost may not be negative as was suggested in some of our earlier analyses of demand functions. The limiting ratio that is derived for the algebraic and isoelastic cases is larger than when the entry/investment effect is taken into account. Indeed, for the linear and exponential cases, where the static effect is zero, increasing the demand elasticity (making the market more competitive) generates only a dynamic effect, raising the limiting error-cost ratio.

5.1.3 Example: Isoelastic Demand and Uniform Distribution

As an illustration, assume that demand is isoelastic and that the probability distribution that determines is \( {{\uptheta }} \) uniform. The R&D investment cost \( {\text{K}} \) follows a uniform distribution on \( \left( {0,\bar{K}} \right] \), where \( \bar{K} \) denotes the upper bound of investment cost. It follows that if expected profit is greater than the upper bound on investment cost, the firm is sure to enter. If expected profit is below the upper bound, then the entry probability is determined by the cumulative distribution function of \( {\text{K}} \). Formally, we have

$$ {{\uptheta }} = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {\left( {1 - v} \right)\pi^{*} \ge \bar{K}} \hfill \\ {\frac{{\left( {1 - v} \right)\pi^{*} }}{{\bar{K}}},} \hfill & {\left( {1 - v} \right)\pi^{*} < \bar{K}} \hfill \\ \end{array} } \right. $$

The adjusted error-cost ratio that takes into account the dynamic effect of enforcement can now be expressed as a function of demand elasticity

$$ \tilde{\rho } = \frac{1 - \theta }{\theta }\rho = \left\{ {\begin{array}{*{20}l} {\frac{{\bar{K} - \left( {1 - v} \right)\pi^{*} }}{{\left( {1 - v} \right)\pi^{*} }}\left[ {\left( {1 - \frac{1}{{\epsilon}}} \right)^{1 - {\epsilon}} \left( {2 - \frac{1}{{\epsilon}}} \right)^{ - 1} - 1} \right]^{ - 1} , } \hfill & {\left( {1 - v} \right)\pi^{*} < \bar{K}} \hfill \\ {0,} \hfill & { \left( {1 - v} \right)\pi^{*} \ge \bar{K}} \hfill \\ \end{array} } \right., $$

where \( \pi^{*} = \left( {\frac{{\epsilon} - 1}{c}} \right)^{{\epsilon} - 1} \left( {\frac{\alpha }{{\epsilon}}} \right)^{{\epsilon}} \). Differentiating with respect to elasticity,

$$ \begin{array}{*{20}c} {\frac{{\partial \tilde{\rho }}}{\partial {\epsilon}} = \left\{ {\left( {\frac{{\bar{K}}}{{\left( {1 - v} \right)\pi^{*} }} - 1} \right)\left[ { - \rho \left( {1 + \rho } \right)} \right]\left[ {\ln \left( {\frac{{\epsilon}}{{\epsilon} - 1}} \right) - \frac{1}{{\smallint - \frac{1}{2}}}} \right] + \frac{{\bar{K}}}{{\left( {1 - v} \right)\pi^{*} }}\rho \left[ {\ln \left( {\frac{{\epsilon}}{{\epsilon} - 1}} \right) - \ln \frac{\alpha }{c}} \right]} \right\}} \\ \end{array} $$
(5)

The first line in Eq. (5) reflects the static effect, which is negative,Footnote 12 while the second line is that of the dynamic effect, which is positive.Footnote 13 Whether the static effect dominates the dynamic effect depends on the comparison of the absolute values of the two lines in Eq. (5). Thus, the static effect is dominant if and only if

$$ \begin{array}{*{20}c} {\tilde{\rho }\left( {1 + \rho } \right)\left[ {\ln \left( {\frac{{\epsilon}}{{\epsilon} - 1}} \right) - \frac{1}{{{\epsilon} - \frac{1}{2}}}} \right] > \frac{{\bar{K}}}{{\left( {1 - v} \right)\pi^{*} }}\rho \left[ {\ln \left( {\frac{{\epsilon}}{{\epsilon} - 1}} \right) - \ln \frac{\alpha }{c}} \right].} \\ \end{array} $$
(6)

If this condition holds (6), the static effect is dominant, and the adjusted error-cost ratio decreases with elasticity. Otherwise, the error-cost ratio increases with elasticity—which is the opposite of our result in Sect. 2.

Figure 3 shows a simulation of the relationship between static and dynamic effects, and of the relationship between the adjusted and static error-cost ratios. Both figures assume a modest antitrust enforcement probability of 0.2.Footnote 14 As is shown in Fig. 3a, the dynamic effect overtakes the static effect after the (absolute) elasticity becomes greater than 1.06.

Fig. 3
figure 3

Adjusted error cost ratio

Figure 3b compares the error-cost ratio and the adjusted error-cost ratio curves. For relatively low elasticity values, the adjusted error-cost ratio is below the error-cost ratio. For relatively high elasticity levels, the adjusted curve is above the static error-cost ratio, and eventually goes to infinity. The intuition behind this pattern is the following: for high elasticity values, the market is relatively competitive, and profit expectations are low. As a result, the rate of entry/investment is low, and approaching zero as the market becomes perfectly competitive. Because entry is so low, and the expected deadweight loss from monopoly pricing is therefore low, the adjusted error-cost ratio steadily goes to infinity.

Conversely, when the demand elasticity is relatively low, so that the market is not competitive, firms are expected to earn relatively large profits. Now the likelihood of entry and investment is high, which reduces the adjusted error-cost ratio below its static counterpart.

Though the adjusted error-cost ratio dips below the (static) error-cost ratio for low elasticity values—for uncompetitive markets—note that it is always well above the asymptotic limit of \( \frac{1}{e/2 - 1} \) for the isoelastic demand case. Indeed, the minimum value of the adjusted error-cost ratio in this simulation is greater than five. In the “Appendix” we consider additional simulations with higher probabilities of enforcement: one intermediate with the probability enforcement set at 0.5; and the other a high-enforcement regime with a probability of 0.8. In both of the additional simulations the adjusted error-cost ratio is greater, at each elasticity value, than in the modest enforcement regime that is simulated in Fig. 3.

Although we believe that the assumptions in these simulations are reasonable, we do not intend to suggest that they provide a representation of the relationship of the error-cost ratio to the adjusted error-cost ratio for every conceivable demand function or investment cost distribution. It is possible to generate an example where the adjusted ratio falls below one. Indeed, in the case of exponential demand, where the error-cost ratio has a constant value, the adjusted ratio could start from a level below the error-cost ratio (for low elasticity) before going to infinity.Footnote 15 Given this, a measure of the average value of the adjusted error-cost ratio over a wide range of elasticity values might offer an alternative single measurement.

5.2 Implications

The foregoing analysis has examined the welfare tradeoffs of antitrust enforcement in the innovation setting. We have focused on the antitrust regulation of patentees, though the issues addressed here apply to any area where firms make investments that create new markets or substantially enhance existing markets.

The tradeoffs that we have examined here have implications for many facets of law enforcement: First, society must determine the level of resources to devote to the antitrust enforcement effort against patent holders. What is the optimal probability of enforcement when monopolists have obtained their status through innovation? Second, society must determine an optimal legal standard: for example, whether to apply a per se prohibition, a per se legality rule, or a rule of reason test. Third, society must choose the optimal standard of proof in trials, where the occurrence of an antitrust violation under the operable legal standard is uncertain.

For each of these questions, the tradeoff between the costs of excessive enforcement and the costs of inadequate enforcement should be considered in determining the features of an optimal enforcement system.

In this paper’s model, we begin with a perfect enforcement ideal, and consider the cost of deviating from the ideal toward excessive enforcement, and the cost of deviating from the ideal toward inadequate enforcement. The ratio of these two costs—which we label the error-cost ratio—constitutes the appropriate welfare ratio test for determining optimal antitrust enforcement in the patent context. The perfect enforcement ideal is met when antitrust enforcement enables the innovating monopolist to recover its R&D costs, and also prevents the firm from imposing unnecessary deadweight loss on society.

Using the foregoing approach to defining perfect enforcement, we find that the error-cost ratio for patent antitrust—or, alternatively, the appropriate welfare tradeoff ratio for patent antitrust—is equal to the sum of the monopoly profit and the residual consumer surplus, all divided by the deadweight loss. This finding contradicts the view suggested in the literature (Kaplow 1984; Gilbert and Shapiro 1990; Klemperer 1990) that the relevant welfare ratio for patent antitrust is the profit divided by the deadweight loss (reward-loss ratio).

The reward-loss ratio appears to be invalid for assessing the welfare tradeoffs in patent antitrust because it accords inadequate weight to the residual surplus that goes to consumers, and in doing so undercounts the social value of innovation. 0ne immediate implication is that prescriptions for patent antitrust law that is based on the reward-loss framework should be reconsidered from a perspective that gives greater consideration to the social incentive to innovate.

We find that the error-cost ratio is generally well above one, and declines as a function of market competitiveness as measured by the elasticity of demand. The minimum value of the ratio is roughly equal to three under commonly used demand functions. When the effects of enforcement on innovation are taken into account, the adjusted error-cost ratio is likely to be even greater, and tends toward infinity as the elasticity of demand increases. These results suggest that society should show a greater concern for the costs of excessive enforcement than the costs of inadequate enforcement of antitrust in the patent context. The law should err on the side of protecting innovation incentives.

These implications have immediate practical relevance: there are novel theories of antitrust that are being applied to patentees currently. In an ongoing lawsuit against Qualcomm, the FTC claims that the firm’s patent licensing fees are an abusive exercise of monopoly power. Such efforts to introduce antitrust regulation into areas that had u:ntil recently been controlled almost entirely by patent law should be assessed u:nder a consideration of the associated error costs. An attack on patent pricing as a form of monopoly abuse is equivalent to introducing price regulation through antitrust. Determination of the welfare-maximizing antitrust price cap—which encourages innovation and at the same time avoids unnecessary deadweight loss—is subject to uncertainty. Such an effort should err on the side of protecting innovation incentives, given the high ratio of false-positive costs to false-negative costs.

In other areas of litigation, courts must determine whether to apply a per se legality test, per se illegality test, or rule of reason test to alleged antitrust violations by patent holders.Footnote 16 For example, settlements of patent infringement lawsuits were u:ntil recently examined under antitrust according to a per se legality test. Such settlements have typically involved the patent holder transferring a share of the patent revenue to the challenger, which traditionally has been deemed within the power of a patent holder. The Supreme Court overturned the per se legality rule and replaced it with a rule of reason test, for pharmaceutical patent infringement settlements, in FTC v. Actavis.Footnote 17 The rule of reason test has led to numerous lawsuits against pharmaceutical patent holders for entering into settlements with generic drug makers. The error-cost ratios examined here suggest that the Actavis analysis should be conducted in a manner that takes into consideration the high error-cost ratio for patent antitrust.Footnote 18

Lastly, there is the question of the appropriate standard of proof in antitrust challenges of patent holders. The error-cost ratio, when applied to this question, would support a high burden of proof, such as requiring clear and convincing evidence to support antitrust theories.

6 Conclusion

The error-cost ratio—the ratio of false-positive to false-negative costs—for patent antitrust is equal to the sum of the monopoly profit and residual consumer surplus all divided by the deadweight loss. We find that this ratio ranges from infinity, in uncompetitive markets (no substitutes to the patent), to a low of roughly three, in competitive markets, for commonly used demand functions. When we extend the analysis to take enforcement’s effect on entry into account, we find that the range of values for the ratio is even higher under reasonable assumptions. This implies that patent antitrust rules, from substantive law to proof standards, should tilt generally in favor of patentees.