1 Introduction

Empirical evidence on licensing deals indicates that innovators often transfer their patented innovations to direct competitors (Jiang & Shi, 2018) and that most licensing contracts feature positive royalties (Bousquet et al., 1998; Lim & Veugelers, 2003).Footnote 1 In this paper we discuss two additional aspects relevant for licensing arrangements. First, that often one of the firms in the industry has accumulated an advantage over time, which is reflected in its current market dominance. This competitive advantage may derive from prior technological superiority leading to a large customer base, well-known trademarks, etc.; and in the case of incumbents against start-ups, also from the fact that new firms face constraints in their production capacity, must build their own supply chain and distribution network, etc. In our analysis below, we will represent this incumbency feature through a Stackelberg game, where the more established firm will be the first-mover player in the marketplace.

The second important aspect to consider is that, at a given time, any of the two firms can have the technological leadership. Hence the holder of a new technology can be either the leader in setting the output level (the most established firm) or the follower (the entrant) of the industry. There are many firms playing as leaders in their product markets and that frequently license their patented innovations to competitors. Jiang and Shi (2018) discuss the examples of Procter and Gamble and Ford. At the same time, the case of AMD in the semiconductor industry features an example of a licensor that plays as follower in the marketplace. AMD microprocessors for PCs compete with those from Intel,Footnote 2 and until recently Intel almost monopolized this market, but currently AMD has the technological leadership; as a consequence, while in 2015 Intel’s share for desktop and laptop microprocessors was at 92.4%, in 2021 AMD’s share was over 17%.Footnote 3 Notice that AMD technological advantage does not mean that it has overtaken Intel in terms of market share. Moreover, this rivalry does not exclude that AMD and Intel extensively cross-license each other’s technologies, so that both firms end up offering products of similar quality.

The empirical literature does not provide clear-cut results on the relationship between firm size and innovation, and although the tendency seems to be positive, it is not necessarily linear. Therefore, it can be assumed that both a leading firm and a follower firm in setting the quantity to produce, and thus having a different size, can become the licensor of a new technology. For example, according to Acs and Audretsch (1987, 1990), the relationship between firms’ size and innovation depends on industry characteristics, so that in highly concentrated sectors with high entry barriers, large firms tend to innovate (and become licensors) more than small firms, whereas the opposite holds for less concentrated sectors reflecting emerging or growing technologies. As for the relationship between innovation and firm age (another proxy for incumbency), the evidence indicates that challengers invest more in R&D than more established firms when the goal is to enter new markets (Reinganum, 1983; Czarnitzki & Kraft, 2004); this suggests that older firms may be less R&D-intensive than their younger counterparts. Finally, Huergo and Jaumandreu (2004) find that the probability of innovating varies widely by activity, that small size per se broadly reduces the probability of innovation and that entrant firms are more likely to innovate than older firms. Summing up, both the firm that plays as leader or follower in the product market can be the one that innovates and licenses the resulting technology.

Since the main motivation of innovative firms for licensing out their patents is to earn revenue, they will try to devise a licensing arrangement that ensures the maximum payoff. The empirical literature shows that contingent royalties—either per-unit royalty (non-negative uniform royalty per unit of production) or ad-valorem royalty (non-negative royalty based on a percentage of licensee sales)—are commonly included in licensing contracts (e.g., Bousquet et al., 1998; Lim & Veugelers, 2003; Trombini & Comacchio, 2012). In this respect, our work examines how the position of a firm in a market, namely its role as a leader or a follower in the Stackelberg game, affects the type of royalty chosen (per-unit or ad-valorem) when licensing a process technology.

The theoretical literature shows that the rationale for using contingent royalties in licensing deals lies in factors such as demand or cost uncertainty (Bousquet et al., 1998), product differentiation, a licensee’s new product development cost (San Martín and Saracho, 2016), or the relative efficiency of the licensee compared to the licensor (Fan et al., 2018a). Two papers have analysed the choice between per-unit and ad-valorem royalties in a Cournot setting: San Martín and Saracho (2010) find that, under full information, an ad-valorem royalty is the preferred licensing contract. This is because ad-valorem royalty compared to per-unit royalty allows the licensor to relax market competition. Fan et al. (2018a) show that per-unit licensing is more profitable if the licensor is more efficient than the licensee in using the innovation, whereas ad-valorem licensing is more profitable in the reverse scenario.

In this paper we describe the interaction between firms in a duopoly using the Stackelberg leadership model,Footnote 4 that we believe better reflects the characteristics we have described above rather than a Cournot duopoly, and we contribute three findings to the literature. First, the kind of royalty offered crucially depends on the licensor’s status as a leader or follower in the product market. In particular, the superiority of ad-valorem over per-unit royalty in a Cournot setting no longer holds when the licensor is the follower in the Stackelberg game. In this set-up, per-unit royalty is more collusive than ad-valorem royalty and allows the licensor to reap more license revenues.

Our second finding is related to the licensing impact on social welfare. While, in a Cournot setting, licensing as compared to no licensing unequivocally hurts both consumers and society (San Martín & Saracho, 2010), in a Stackelberg environment the licensing impact on consumers and society as a whole depends on the licensor’s status in the marketplace. The diffusion of an innovation harms consumers and society only when the licensor is the market leader; if the licensor plays as a follower, the consumer surplus remains unaltered and society as a whole is better off in welfare terms after licensing because there is an increase in production efficiency.Footnote 5

Finally, we also show that the incentive to undertake innovative activities and license an innovation largely depends on both the licensor’s status in the product market and the size of the innovation: when the size of the innovation is sufficiently small, the incentive to innovate and disseminate the resulting innovation is higher for a leader licensor than for a follower licensor, while the opposite holds for a large innovation. In this respect, our paper complements Wang and Yang (2004)’s research. These authors find that in a linear Stackelberg duopoly like ours, the follower innovator is more likely to license a cost-reducing innovation to the leader than the leader innovator is to the follower, regardless of whether licensing is in the form of a fixed fee or royalty per unit of output. Under fixed-fee licensing, the follower gains more from small innovations while the leader gains more from large non-drastic innovations. Under royalty licensing, however, it is the follower in the product market who always gains more than the leader from an innovation.

The rest of the paper is structured as follows. Section 2 outlines the model and Sect. 3 examines the preferred method—ad-valorem two-part or per-unit two-part contracting—for a licensor to license the innovation. Section 4 analyses the welfare impact of licensing, Sect. 5 studies the effect of the licensor’s status in the product market on the incentive to innovate, and finally, Sect. 6 concludes.

2 The model

Consider a Stackelberg duopoly industry with firms 1 and 2 producing an undifferentiated good. Consumers in this market exhibit the following linear piecewise demand function:

$$p\left(Q\right)=\mathrm{max }\{0, a-Q\} \quad \mathrm{with}\,Q={q}_{1}+{q}_{2}$$
(1)

where \({q}_{i}\) denotes the quantity produced by firm \(i\) (\(i=1, 2\)) and parameter \(a>0\) represents the market size. Firms produce the good with their existing technology, which leads them to borne the constant marginal cost \(c>0\). There are no fixed costs of production. In this context, one of the firms in the industry, either the leader or the follower in setting the output level, owns an R&D division that develops an innovation which reduces the marginal cost from \(c\) to 0.

Throughout this paper we make the following assumption concerning the cost reduction induced by the innovation:

Assumption 1

The size of the innovation, \(c\), is such that \(0<c<a/3\).

This assumption ensures that all firms are previously active even if licensing does not take place or, alternatively, that the innovation is non-drastic irrespective of whether the owner behaves as leader or follower in the product market. A consequence of Assumption 1 is that non-licensing the innovation does not give rise to a monopoly.

The innovative firm can license its innovation to the rival by means of a two-part contractFootnote 6 that includes either a non-negative uniform royalty per unit of production (per-unit royalty), or a non-negative royalty based on a percentage of sales (ad-valorem royalty), plus possibly a fee.Footnote 7 The marginal cost of selling the licence is zero. The licensor is assumed to have all the bargaining power, and the licensee therefore always obtains its outside option (i.e. its profit absent licensing). The potential licensee acquires the innovation if its profit after payments is larger than that obtained without the licence, in which case it is operating at marginal cost c against the innovative firm with zero marginal cost.

The analysis follows a four-stage non-cooperative game. In the first stage, the innovator, either the market leader or the market follower in setting the output level, decides whether to license the innovation to its rival and, in the case of licensing, offers either a per-unit two-part contract or an ad-valorem two-part contract. In the second stage, the potential licensee, either the follower or leader in setting the output level, accepts or refuses the licensor’s offer. If the offer is accepted, then, in the third stage, the leader—the licensor or the licensee, as the case may be—chooses its level of production. Finally, in the fourth stage, the follower—the licensee or the licensor, as the case may be—observes the output level of the leader—the licensor or the licensee, as the case may be—and decides its own quantity. As usual, we look for a Subgame Perfect Nash equilibrium for this licensing game.

3 The licensor’s decision

3.1 The licensor plays as leader in setting the output level

In this section we consider the leader firm in the marketplace to be the licensor. In this case, if the licensing contract takes the form of per-unit two-part tariff, i.e. a fixed-fee payment \(f\) plus a royalty rate per unit sold \(r\), \(r\le c\),Footnote 8 and the licensee accepts the licensor’s offer, then, in the fourth stage of the game, once the leader/licensor chooses to produce \({q}_{L}\), where subscript L indicates leadership status, the follower/licensee must choose the quantity \({q}_{F}\) (subscript F denotes followership status) that maximizes its profit, i.e.:

$${q}_{F}={\mathrm{argmax}}_{q} {\pi }_{F}^{u}=\left(a-r-{q}_{L}-q\right)q$$
(2)

where superscript \(u\) denotes per-unit royalty. Solving the problem defined in (2) affords \({q}_{F}\left({q}_{L}\right)=\left(a-r-{q}_{L}\right)/2\). The licensor’s optimal quantity is then given by:

$${{q}_{L}={\mathrm{argmax}}_{q} \pi }_{L}^{u}=\left(a-q-\left(a-r-q\right)/2\right)q+r\left(a-r-q\right)/2$$
(3)

which yields \({q}_{L}=a/2\). Thus, the licensee produces \({q}_{F}=(a-2r)/4\) and obtains profit \({\pi }_{F}^{u}\left(r\right){=\left(\frac{a-2r}{4}\right)}^{2}\).

In the second stage of the game, the follower accepts the licence contract whenever this profit is larger than that which would be achieved using the old technology, in which case the leader would produce \({q}_{L}^{n}=(a+c)/2\), and the follower would produce the strictly positive quantityFootnote 9\({q}_{F}^{n}=(a-3c)/4\), where superscript \(n\) stands for no licence. As result, the follower/licensee’s profit amounts to \({\pi }_{F}^{n}{=\left(\frac{a-3c}{4}\right)}^{2}\) and the leader/licensor chooses the two-part tariff contract \(\left(f, r\right)\) that solves:

$$\left.\begin{array}{c}\underset{(f,\mathit{ }r)}{\mathrm{max}}{\pi }_{L}^{u}=\left(f+r\frac{a-2r}{4}\right)+\frac{a\left(a+2r\right)}{8}\\ s.t:\left\{\begin{array}{c}{\pi }_{F}^{u}\left(r\right)-f\ge {\pi }_{F}^{n}\\ r\le c\end{array}\right.\end{array}\right\}$$
(4)

The follower/licensee’s participation constraint in (4) leads to \(f={\pi }_{F}^{u}\left(r\right)-{\pi }_{F}^{n}={\left(\frac{a-2r}{4}\right)}^{2}{-\left(\frac{a-3c}{4}\right)}^{2}\). Note that \(r\le c\) implies that \({\pi }_{F}^{u}\left(r\right)>{\pi }_{F}^{n}\) and, consequently, the leader/licensor can charge a positive fee, \(f>0\). Thus, the problem stated in (4) can be written as:

$$\underset{r}{\mathrm{max }} {\left(\frac{a-2r}{4}\right)}^{2}{-\left(\frac{a-3c}{4}\right)}^{2}+r\frac{a-2r}{4}+\frac{a\left(a+2r\right)}{8},\quad \mathrm{s}.\mathrm{t}:r\le c$$
(5)

While a (linear) per-unit contract would limit the rent extraction of the licensor and would force it to leave some rents to the licensee, the fixed payment in the per-unit two-part tariff allows it to reduce those rents. The solution to the problem stated (5) is therefore \({r}^{*}=c\), provided that Assumption 1 is satisfied. Thus, the fixed part of the contract amounts to \({f=\left(\frac{a-2c}{4}\right)}^{2}{-\left(\frac{a-3c}{4}\right)}^{2}\), which leads the leader/licensor’s payoff to be:

$${\pi }_{L}^{u}={\left(\frac{a-2c}{4}\right)}^{2}{-\left(\frac{a-3c}{4}\right)}^{2}+\frac{{a}^{2}+4c(a-c)}{8}$$
(6)

Assume now that the licence consists of an ad-valorem two-part tariff, i.e., a fixed payment \(f\) plus an ad-valorem royalty \(d\), \(0<d<1\), proportional to the follower/licensee’s sales. In this case, the licensee chooses to produce the quantity:

$${q}_{F}={\mathrm{argmax}}_{q} {\pi }_{F}^{v}=(1-d)(a-{q}_{L}-q)q$$
(7)

where superscript \(v\) denotes ad-valorem royalty regime. The solution to the problem stated in (7) leads to \({q}_{F}\left({q}_{L}\right)=\left(a-{q}_{L}\right)/2\). In turn, the leader/licensor chooses the output level:

$${q}_{L}={\mathrm{argmax}}_{q} {\pi }_{L}^{v}=\left(a-q-{q}_{F}\left(q\right)\right)q+d\left(a-q-{q}_{F}\left(q\right)\right){q}_{F}\left(q\right)$$
(8)

which yields quantities \({q}_{L}=a\left(1-d\right)/\left(2-d\right)\) and \({q}_{F}=a/\left(2(2-d)\right)\) for the licensor and licensee, respectively. Thus, the follower/licensee’s profit amounts to \({\pi }_{F}^{v}(d)=(1-d)\frac{{a}^{2}}{4{\left(2-d\right)}^{2}}\).

On the other hand, the leader/licensor’s payoff is made up of the fixed payment, \(f={\pi }_{F}^{v}\left(d\right)-{\pi }_{F}^{n}=\frac{(1-d){a}^{2}}{4{\left(2-d\right)}^{2}}-{\left(\frac{a-3c}{4}\right)}^{2}\), the royalty income, \(\frac{d {a}^{2}}{4{\left(2-d\right)}^{2}}\), and the profit \(\frac{{(1-d)a}^{2}}{2{\left(2-d\right)}^{2}}\) from its own production. Thus, \({\pi }_{L}^{v}(d)=\frac{(1-d){a}^{2}}{4{\left(2-d\right)}^{2}}-{\left(\frac{a-3c}{4}\right)}^{2}+\frac{d {a}^{2}}{4{\left(2-d\right)}^{2}}+\frac{{a}^{2}\left(1-d\right)}{2{\left(2-d\right)}^{2}}=\frac{{a}^{2}\left(3-2d\right)}{4{\left(2-d\right)}^{2}}-{\left(\frac{a-3c}{4}\right)}^{2}\) and, as result, the chosen ad-valorem royalty is that which solves:

$${\mathrm{max}}_{d} \left\{\frac{{a}^{2}\left(3-2d\right)}{4{\left(2-d\right)}^{2}}-{\left(\frac{a-3c}{4}\right)}^{2}\right\},\quad \mathrm{ s}.\mathrm{t}:{ \pi }_{F}^{v}(d)\ge {\pi }_{F}^{n}$$
(9)

Since the leader’s payoff given in (9) increases and the follower’s profit decreases with ad-valorem royalty \(d\), ending up being zero at \(d=1\), it follows that the optimal contract for the leader/licensor has no fixed payment \(f\) and features an ad-valorem royalty that fulfils the follower’s participation constraint, i.e. where the profit if the licence is accepted, \({\pi }_{F}^{v}(d)=\frac{(1-d){a}^{2}}{4{\left(2-d\right)}^{2}}\), equals that achieved if the licence is rejected, \({\pi }_{F}^{n}=\frac{{(a-3c)}^{2}}{16}\). Thus, the optimal ad-valorem royalty rate amounts to:

$${d}^{*}=2\left(\sqrt{s\left(1+s\right)}-s\right),\quad\mathrm{with}\,s\equiv {\left(\frac{a}{a-3c}\right)}^{2}-1$$
(10)

being \(0<{d}^{*}<1\), and, finally, the licensor’s profit is:

$${\pi }_{L}^{v}= \frac{{a}^{2}+a\sqrt{3c\left(2a-3c\right)}}{8}$$
(11)

which, once compared with the profit given in Eq. (6), yields the following result.

Proposition 1

If the licensor plays as the leader in setting the output level, then a pure ad-valorem royalty is the preferred licensing deal.

Proof.

According to Eqs. (6) and (11), the difference in the leader/licensor’s profit using ad-valorem royalty or per-unit two-part tariff is \({\pi }_{L}^{v}-{\pi }_{L}^{u}=\frac{a\sqrt{3c\left(2a-3c\right)}}{8}-\left({\left(\frac{a-2c}{4}\right)}^{2}{-\left(\frac{a-3c}{4}\right)}^{2}+\frac{{a}^{2}+4c(a-c)}{8}\right)\). This difference is strictly positive for \(c>0\), whenever \(12{a}^{2}\left(2a-3c\right)-c{\left(10a-13c\right)}^{2}>0\) holds. Since this expression is decreasing in \(c\in \left(0,a/3\right)\), and achieves a strictly positive value at \(c=a/3\), it follows that \({\pi }_{L}^{v}-{\pi }_{L}^{u}>0\) in all the range of admissible values of parameter c.

Both per-unit and ad-valorem royalties modify the profit for the licensor, adding the royalty revenue to the profit from its own production. The internalization of royalty revenues implies that the leader/licensor becomes less aggressive in the product market, while the follower/licensee increases production (because of the reduction in the leader’s production and also, in the case of ad-valorem royalty, because it becomes a more efficient firm). Since any increase in the follower’s profit can be captured by the licensor through the fixed-fee payment, the licensor prefers the royalty that increases the industry’s profits most. Both kind of royalties have a collusive effect, since total production decreases from \({Q}^{n}=\frac{3a-c}{4}\) to \({Q}^{u}=\frac{3a-2c}{4}\), \({Q}^{u}<{Q}^{n}\),when a per-unit royalty is used, and to \({Q}^{v}=\frac{3a-2{d}^{*}}{2(2-{d}^{*})}\), \({{Q}^{v}<Q}^{n}\), in the case of an ad-valorem royalty. Therefore, the result in Proposition 1 can be restated by stating that ad-valorem royalty is a more collusive device since total production becomes even lower than under per-unit royalty, \({Q}^{v}<{Q}^{u}\). This result extends the findings of San Martín and Saracho (2010) to a Stackelberg setting in which the licensor plays as the leader in the markeplace.

3.2 The licensor plays as follower in setting the output level

Assume now that the licensor chooses the quantity to produce after observing the rival’s pre-committed quantity; hence the licensor is a follower in the Stackelberg game. Under per-unit two-part licensing, the follower/licensor chooses to produce the quantity given by:

$${{q}_{F}=\mathrm{argmax}}_{q}{\pi }_{F}^{u}=\left(a-{q}_{L}-q\right)q+r{q}_{L}$$
(12)

where superscript \(u\) stands (as in Sect. 3.1) for per-unit royalty. Solving the problem stated in (12) affords \({q}_{F}\left({q}_{L}\right)=\left(a-{q}_{L}\right)/2\). Note that licensing through per-unit royalty does not change the strategic interaction between leader and follower; it merely affects market outcomes if the royalty is set below the marginal cost with the old technology (and the leader/licensee has therefore lower marginal costs than in the absence of a license contract). The licensee then chooses to produce:

$${q}_{L}={\mathrm{argmax}}_{q}{\pi }_{L}^{u}=\left(a-r-{q}_{F}\left(q\right)-q\right)q$$
(13)

which yields \({q}_{L}=\left(a-2r\right)/2\) and, consequently, \({q}_{F}=\left(a+2r\right)/4\). As a result, the follower/licensor solves:

$$\left.\begin{array}{c}\underset{(f,\mathit{ }r)}{\mathrm{max}}{\pi }_{F}^{u}=\left(f+r\frac{a-2r}{2}\right)+\frac{{\left(a+2r\right)}^{2}}{16}\\ s.t: \left\{\begin{array}{c}{\pi }_{L}^{u}\left(r\right)-f\ge {\pi }_{L}^{n}\\ r\le c\end{array}\right.\end{array}\right\}$$
(14)

If we saturate the leader/licensee’s participation constraint, then \(f={\pi }_{L}^{u}\left(r\right)-{\pi }_{L}^{n}=\frac{{\left(a-2r\right)}^{2}}{8}-\frac{{\left(a-2c\right)}^{2}}{8}\) and the problem stated in (14) becomes:

$$\underset{r}{\mathrm{max }}{\pi }_{F}^{u}\left(r\right)= \frac{{\left(a-2r\right)}^{2}}{8}-\frac{{\left(a-2c\right)}^{2}}{8}+r\frac{a-2r}{2}+\frac{{\left(a+2r\right)}^{2}}{16}, \quad \mathrm{s}.\mathrm{t}:r\le c$$
(15)

which yields \(r=c\) (provided that parameter \(c\) satisfies Assumption 1). Now the follower/licensor does not want a more efficient leader/licensee, because its market share would be reduced, and the decrease in its own profit in the product market is not compensated for by the increased royalty revenue. As a consequence, per-unit royalties do not reduce competition in the product market.Footnote 10 With equilibrium per-unit royalty at \(r=c\), production levels (and market shares) remain unchanged and the leader in the marketplace obtains the same profit as without a licence. Hence, it holds that \(f=0\) when the licensor is the market follower, and the licence is then reduced to a pure per-unit royalty deal, unlike what happens when the licensor plays as the market leader. The licensee is left indifferent between accepting or refusing the licence and the licensor’s profit amounts to:

$${\pi }_{L}^{u}=\frac{{a}^{2}+12c(a-c)}{16}$$
(16)

which is strictly larger than the profit without licensing, because, while the licensor’s own market profit does not change, additional royalty revenue of \(\frac{c(a-2c)}{2}\) is reaped.

If, on the other hand, the follower/licensor transfers the innovation through a two-part contract featuring ad-valorem royalty \(d\), it chooses to produce:

$${q}_{F}={\mathrm{argmax}}_{q}{\pi }_{F}^{v}=\left(a-{q}_{L}-q\right)q+d\left(a-{q}_{L}-q\right)q$$
(17)

that is, \({q}_{F}\left({q}_{L}\right)=\left(a-(1+d){q}_{L}\right)/2\). In the presence of ad-valorem royalty the follower/licensor reduces production since the effect of a higher market price on royalty revenue is internalized. In turn, the leader/licensee chooses the quantity:

$${q}_{L}={\mathrm{argmax}}_{q}={\pi }_{L}^{v}=(1-d)\left(a-{q}_{F}\left({q}_{L}\right)-q\right)q$$
(18)

which yields \({q}_{L}=a/(2\left(1-d\right))\), a larger production than without the royalty because the licensee knows that the licensor will reduce its production when compared to a Stackelberg game without a license, from \(\left(a-{q}_{L}\right)/2\) to \(\left(a-(1+d){q}_{L}\right)/2\). Thus, \({q}_{F}\left({q}_{L}\right)=a(1-3d)/(4\left(1-d\right))\), which is positive only if \(d<1/3\), and, as a result, the licensor’s profit amounts to \({\pi }_{L}^{v}(d)=\frac{\left(1-d\right){a}^{2}}{8\left(1-d\right)}=\frac{{a}^{2}}{8}\). The licensor can then charge a fixed fee equal to the increased profit of the licensee, \(f=\frac{{a}^{2}}{8}-\frac{{(a-2c)}^{2}}{8}\), and its payoff, i.e. the fixed payment plus royalty revenue and its own market profit, amounts to:

$${\pi }_{F}^{v}(d)=\left(\frac{{a}^{2}}{8}-\frac{{\left(a-2c\right)}^{2}}{8}\right)+\frac{d {a}^{2}}{8\left(1-d\right)}+\frac{{a}^{2}\left(1-3d\right)}{16\left(1-d\right)}=\frac{c(a-c)}{2}+\frac{{a}^{2}}{16}$$
(19)

Any ad-valorem royalty \(d\in \left(0,\right.\left. 1/3\right]\) leads the follower/licensor to obtain the same payoff, although different royalty rates in this interval lead to different distributions between its own market profit (internal profit) and licensing revenue. An ad-valorem royalty substantially reduces its internal profit from \(\frac{{(a+2c)}^{2}}{16}\) to \(\frac{{a}^{2}\left(1-3d\right)}{16\left(1-d\right)}\). This is because the leader/licensee becomes much more efficient (its marginal cost goes from c to zero), which leads to a direct increase in its production, and also because the licensor cuts its production for two reasons: the Stackelberg effect of an increase in the licensee’s production, and internalization by the licensor that licensing revenue is enhanced by an increased final price. In any case, the licensing revenue \(\left(\frac{{a}^{2}}{8}-\frac{{\left(a-2c\right)}^{2}}{8}\right)+d\frac{{a}^{2}}{8\left(1-d\right)}\) more than compensates for the reduction in its internal profit.

Summing up, the follower/licensor prefers per-unit royalty to ad-valorem royalty to license its innovation, which is recorded in the following proposition.

Proposition 2

If the licensor plays as the follower in setting the output level, then pure per-unit royalty is the preferred licensing deal.

Proposition 2 shows that, when the licensor is the follower in the marketplace, an ad-valorem royalty is not an anticompetitive device, but procompetitive. The ad-valorem royalty is counterproductive from the licensor’s point of view because it leads to a very competitive leader/licensee. Therefore, the follower/licensor settles for a per-unit royalty that leaves the market performance intact and allows profit to increase through royalty revenues.

Propositions 1 and 2 show that whether the per-unit or the ad-valorem royalty is more anticompetitive depends on whether the licensor is the leader or follower in the final market. These results depend crucially on two assumptions that we maintain throughout the paper. First, the licensor has all the bargaining power when establishing the terms of the licensing contract, whereby the licensee always obtains its outside option; thus, the licensor will choose the licensing contract that maximizes aggregate profit. Second, the contract can feature a fixed fee that allows the licensor to appropriate all the extra industry profit achieved through the agreement. Therefore, the licensor will choose the contract that maximizes industry profit (subject to the restriction that the rival cannot be shut down).

4 Welfare

Once examined the impact of the licensor’s market status (leader or follower in the Stackelberg game) on the kind of licensing arrangement, we now investigate the welfare consequences of licensing. To this end, we define aggregate welfare as the non-weighted sum of consumer surplus and firms’ profits, i.e.

$$W=\frac{1}{2}{Q}^{2}+{\pi }_{L}+{\pi }_{F}, \quad \mathrm{where}\, Q={q}_{L}+{q}_{F}$$
(20)

and the following result emerges.

Proposition 3

As compared to a no-licensing setting:

  1. (i)

    The diffusion of an innovation hurts consumers and society as a whole if the licensor is the leader in the marketplace.

  2. (ii)

    The diffusion of an innovation is innocuous for consumers and benefits society as a whole if the licensor is the follower in the marketplace.

Proof

(i) It is immediate that consumer surplus decreases when the licensor is the leader in the marketplace because total production decreases from \({Q}^{n}=\frac{3a-c}{4}\) to \({Q}^{v}=\frac{3a-2{d}^{*}}{2(2-{d}^{*})}\), \({Q}^{v}<{Q}^{n}\). Utility also decreases, since it is an increasing function of consumption in the relevant range. We must therefore check that this is not compensated for by increased production efficiency when the follower/licensee uses the new technology. With no licensing, consumer surplus amounts to \({CS}^{n}={\left(3a-c\right)}^{2}/32\), industry profits to \({\pi }^{n}=\left({3a}^{2}-2ac+11{c}^{2}\right)/16\), and, consequently, aggregate welfare to \({W}^{n}=\left({15a}^{2}-10ac+23{c}^{2}\right)/32\). In contrast, with licensing by means of pure ad-valorem royalty contract according to Proposition 1, consumer surplus amounts to \({CS}^{v}={{a}^{2}\left(3-2d\right)}^{2}/{8\left(2-d\right)}^{2}\), and industry profits to \({\pi }^{v}={a}^{2}\left(3-2d\right)/4{\left(2-d\right)}^{2}\), whereby total welfare is \({W}^{v}={a}^{2}\left(3-2d\right)\left(5-2d\right)/{8\left(2-d\right)}^{2}\). Comparison of \({W}^{n}\) with \({W}^{v}\) yields the stated result in part (i).

(ii) The fact that industry output is constant in \(d\) implies that consumer surplus is not affected when the follower in the marketplace licenses its technology to the leader (through a contract that features ad-valorem royalty). Aggregate welfare, on the other hand, improves with licensing because the leader/licensee produces with a more efficient technology; this translates into higher profit for the follower/licensor through licensing revenues.

As compared to the no-licensing setting, the diffusion of an innovation is innocuous for consumers when the licensor is the follower in the marketplace, but hurts consumers otherwise. Licensing an innovation in the hands of the market leader is a collusive device, and consumers end up paying a higher price than if licensing did not occur. An innovation in the hands of a follower in the product market, in contrast, leaves consumers unaffected and licensing is only a way to increase production efficiency (and industry profits as a consequence). We can thus argue than a new technology in the hands of a follower in the marketplace and licensed to a leader in that market has more social value than if its holder were a leader that licenses to a follower.

5 Incentives to innovate

What sort of firm—a leading firm or a following firm in the product market—has a greater incentive to obtain the new technology? To evaluate the incentive of a firm to innovate we use as an indicator the increased profit of the firm that already got the new technology and decides to license it to a rival. Although for most of the admissible values of parameter \(c\) that measures the size of the innovation, it is the market leader who has the most incentives to innovate, there are, however, admissible values of \(c\) for which it is the market follower who has the most incentives. This is recorded in the following proposition.

Proposition 4

There is a cut-off value for the size of the innovation, \({c}^{*}\approx 0.292\cdot a\) , being \(0<{c}^{*}<a/3\) , such that the following hold:

  1. (i)

    A leader in setting the output level has greater incentive to develop an innovation of size \({c<c}^{*}\) than a follower in setting the output level.

  2. (ii)

    A follower in setting the output level has greater incentive to develop an innovation of size \({c>c}^{*}\) than a leader in setting the output level.

Proof

For an innovator that plays as a leader in the product market and engages in R&D to obtain the new technology the increase in its profit as compared to the situation in which it does not innovate amounts to:

$$\frac{{a}^{2}+a\sqrt{3c(2a-3c)}}{8}-\frac{({a-c)}^{2}}{8}=\frac{a\sqrt{3c\left(2a-3c\right)}+c(2a-c)}{8}$$
(21)

On the other hand, for an innovator that plays as a follower in the product market and has the new technology the increase in its profit as compared to the situation in which it does not innovate is:

$$\frac{{a}^{2}+12c(a-c)}{16}-\frac{({a-c)}^{2}}{16}=\frac{12c(a-2c)+c(2a-c)}{16}$$
(22)

The value \({c}^{*}\) stated in Proposition 4 is the solution in the interval \((0,a/3)\) of equalling Eqs. (21) and (22), or the solution of the third-degree equation in c given by \(12\left(2a-3c\right)-c{\left(10a-11c\right)}^{2}=0\). Finally, the two statements in the proposition are the result of comparing Eqs. (21) and (22) above and below cut-off value \({c}^{*}\).

There are three channels through which an innovation creates value for an innovator/licensor that plays as a leader in the product market. The first one is the direct leadership effect in the Stackelberg model: the leader is a much larger firm than the follower in terms of market share, and the increased efficiency induced by the new technology implies a much greater impact on the leader’s profit. This channel only loses its relevance when the value of parameter c is very large, i.e. when the old technology is very inefficient, since the innovative firm would become a monopoly.

When the value of parameter c is very large, two other channels, both related with the incentives to disseminate the innovation through a licensing deal, become relevant to create value for the innovator/licensor. First, licensing increases the production efficiency of the licensee, and the licensor reaps royalty revenues in exchange for facilitating this improvement. This channel is less relevant when the licensor is the market leader than when it is the market follower, because it is already the firm with the greatest market share, and also when the marginal cost c is low, i.e. when the old technology is almost as efficient as the new one. Second, licensing an innovation creates value for the innovator/licensor when the licence acts as a collusive device. According to Proposition 2, this channel is not present when the innovator/licensor is the follower in the product market. As a result, for sufficiently high values of marginal cost c, it is important for industry profit if the market leader moves from the old to the new technology; hence, in this case it is the follower in setting the output level that has more incentives to innovate.

6 Final remarks

We have demonstrated that when an inside licensor licenses a cost-reducing innovation, its position in the product market plays a crucial role in the choice of per-unit or ad-valorem royalty for the licensing deal. This is because the effect of each kind of royalties on industry performance depends on whether the licensor is the leader or the follower in setting the output level. In other words, we have shown that we cannot claim that a per-unit royalty is a more anticompetitive device than an ad-valorem royalty, or the other way around, because, in fact, their effects will depend on the details of the industry.

One important aspect of the set-up developed in this paper is that we can easily evaluate the licensing impact on aggregate welfare. Since licensing tends to create a more collusive product market, in welfare terms the increase in productive efficiency never compensates for the reduction in consumer surplus. As a consequence, only licensing that is innocuous for consumers can lead to welfare improvement.

Regarding the incentives to engage in R&D activities that lead to a new technology we showed that they largely depend on the size of the innovation achieved and the market status of the innovator/licensor. Obviating any restriction to performing R&D, our model suggests that a firm playing as a leader (follower) in the product market is more incentivized to undertake and to license a small (large) innovation than a firm that plays as a follower (leader). In a sense, this could be understood as a form of decreasing relationship between a firm’s size and its innovative activity.

There are two extensions of the model handled in this article that would be worth exploring. First, we considered in our research the case of homogenous goods, so it would be useful to extend the Stackelberg framework to a set-up with differentiated products. In a product variety situation, we suspect that the anticompetitive impact of licensing would be reduced; if this were the case, then the increase in production efficiency from licensing could translate into improved consumer surplus.

Second, we used the Stackelberg model as a reduced form to represent that a firm (the leader) has a competitive advantage over a rival (the follower). For a more detailed discussion of any particular industry, it would be useful to build a leader/follower model with micro-foundations based on the technological and institutional details of the industry under study (e.g., the leader is a platform in an industry with network effects) and then to analyse the role of both kind of royalties (per-unit royalty and ad-valorem royalty) for that particular industry. The analysis of these extensions is left for future research.