1 Introduction

Reforms in the US patent system over the past few decades have caused an explosion in patent applications and grants (Gallini 2002; Jaffe and Lerner 2004). These reforms were aimed at strengthening the position of patent holders, and they were successful in increasing the productivity of research measured in patents. However, it has also been argued that the quality and importance of these patents have decreased and that the patent boom has not generated the economic growth that might have been expected (Jaffe and Lerner 2004). This has provoked a debate on the theoretical and empirical justifications for strengthening patent protection among policy-makers and academics.

The debate on patents is not new. In fact, for as long as patents have existed, scholars have debated the optimal length, strength, and breadth of protection. A strong rationale for more protection has been formalized in endogenous, innovation-driven growth models such as those put forth by Romer (1990), Aghion and Howitt (1992), Segerstrom et al. (1990), Grossman and Helpman (1991), Stokey (1995), and Young (1993). In these models knowledge creation drives economic growth in the long run. Consequently, intellectual property rights (IPR) protection is considered a key institution that allows inventors to market their inventions and thereby recover their costs. The logic in these models implies that stronger IPR protection stimulates investment in knowledge creation and consequently causes higher growth.

The empirical growth literature indeed strongly supports the notion that institutions in general (Barro 1996; Sala-I-Martin 1996; Acemoglu et al. 2001) and IPR protection in particular (Varsakelis 2001; Branstetter et al. 2006; Kanwar 2006; Allred and Park 2007) contribute to growth performance. However, this same literature does not support the premise that more and stronger protection is always better. Instead, evidence of an inverted-U-shaped relationship is growing (Gould and Gruben 1996), and some theoretical arguments for such a relationship have already been proposed; for example, Nordhaus (1969) pointed out that static efficiency losses need to be traded off against dynamic innovation gains, and several other mechanisms have been suggested in what one might label the patent literature.Footnote 1 This literature, however, relies largely on partial equilibrium modeling techniques. This makes it difficult to evaluate the importance of these mechanisms for overall economic growth and innovation. Analyzing the trade-offs in the context of general equilibrium, endogenous innovation-driven growth models is a recent research trajectory aimed at connecting these two literatures, and the area of focus in this paper.

Nordhaus’s arguments, for example, have been formalized in general equilibrium innovation-driven growth models by Kwan and Lai (2003) and Iwaisako and Futagami (2003). Both papers show that static losses can be weighed against dynamic gains and that an optimum level of protection therefore exists. Horii and Iwaisako (2007) and Furukawa (2007) focus on the reduced growth potential in an economy with more monopolized sectors. However, as O’Donoghue and Zweimüller (2004) observe, little further analysis of the role of IPR protection in knowledge-driven general equilibrium models has been done.Footnote 2

While these models show that IPR protection can be too much of a good thing, they remain strongly committed to the assumption that patents provide economic incentives for innovation. They weigh static efficiency costs (that increase in the level of protection) against dynamic benefits of innovation (that remain constant or increase at a decreasing rate in the level of protection) to find an optimum. We present a model in which more protection can also reduce the rate of innovation in equilibrium.Footnote 3

Moreover, we argue that IPR protection cannot be fully understood in the context of existing general equilibrium endogenous growth models, because these models assume commercialization to be trivial. Innovation-driven endogenous growth models collapse the process of innovation, i.e., the successive generation, exploration, and exploitation of the knowledge that constitutes a commercial opportunity, into one rational decision that is entirely motivated by downstream commercial rents.

In this paper, we follow the knowledge spillover theory of entrepreneurship and stress the distinction between invention and innovation, as was suggested by Carlsson et al. (2009). Knowledge, created by inventors, spills over to the economy at large through commercialization and the activity of entrepreneurs and, once commercialized, helps future invention elsewhere.Footnote 4 This makes knowledge creation and commercialization dynamic complements in generating economic growth. We build on the same intuition presented in Michelacci (2003), who already suggested that knowledge creation and commercialization reinforce each other in the growth process. There, intuition was operationalized by modeling the matching of new ideas to entrepreneurs. More intensively searching entrepreneurs then also generate more invention, because there are more matches and that increases the ex ante expected returns to knowledge creation. We take this one step further by analyzing a direct knowledge spillover that makes R&D more productive as well.Footnote 5 Our contribution to the literature is to show that, even in the absence of matching, a relatively simple two-way knowledge spillover structure already generates qualitatively similar outcomes, and we do so in a standard endogenous growth framework.

Our model closely resembles the basic Romer (1990) variety expansion model. However, in our specification it is the entrepreneur who holds the residual claim to any monopoly rents that a new intermediate variety may generate once commercially introduced. In placing the entrepreneur center-stage, we bring back Schumpeter’s (1934) original assumption that knowledge creation and commercialization are two separate activities. Furthermore, entrepreneurs and not inventors are driven by the prospect of capturing commercial rents from an innovation. These rents are the entrepreneur’s reward for seeing the commercial potential, taking the risks, investing the resources, and organizing the production necessary for a new (intermediate) product or service. To our knowledge, we present the first general equilibrium endogenous growth model that explicitly separates invention from innovation and models the knowledge spillovers between the two activities.

To prevent our model from reverting to the exogenous Solow-esque “manna from heaven” models, we introduce an additional private economic incentive to generate new knowledge. This incentive in our model comes primarily from cost competition among final goods producers. They will invest resources in R&D to improve upon existing product lines, and we assume that, in the course of that activity, they generate knowledge that is of no direct commercial value to them. That knowledge, however, presents an opportunity for entrepreneurs, who are willing and able to take the risks to develop and commercialize it. An entrepreneur will do so when the expected (risk-adjusted) returns justify that investment. Without IPR protection, the knowledge spillover is costless and the investment is set equal to the wages foregone in engaging in the venture.

Patent protection then shifts rents from the entrepreneur to the inventor: more patent protection creates incentives to generate new knowledge but also reduces the incentives to commercialize it. The former mechanism is well understood. Our model introduces the latter as an offsetting effect that explains why the relationship between innovation and IPR protection is not strictly positive. Consequently, there is an optimum level of protection that can be exceeded.

This question is highly relevant for modern knowledge-based economies. In a system without protection of intellectual property, invention may well be the bottleneck in the innovative cycle. Initially, patents were awarded to benefit royal favorites. When the connection between invention and exclusive property rights was introduced, it was an institutional revolution that helped spur invention and arguably paved the way for the Industrial Revolution.Footnote 6 So it is now the inventor, not the entrepreneur, who is allowed to establish legal ownership over an invention in current patent systems. We argue that a delicate balancing act is required once such a system is in place. In most Organisation for Economic Co-operation and Development (OECD) countries today, entrepreneurship and not invention seems to have become the bottleneck in the innovative process (Carlsson et al. 2010), and the balance may well be beyond the tipping point.

Jaffe and Lerner (2004) argue that, by enforcing patents more strictly and allowing inventors to patent much more easily, the US patent system has now exceeded the optimum and that rents should be redistributed to the entrepreneurs. However, in their analysis, it is not the static efficiency losses from monopoly that offset the dynamic gains. They argue that strengthening IPR protection where it was already strong has actually hurt the innovation process by killing incentives to commercialize.Footnote 7 Our paper embeds their narrative in a well-established general equilibrium framework with endogenous innovation-driven growth and firm decision-theoretical microfoundations. Following Schumpeter (1934), our model also places the entrepreneur at the heart of growth theory.

The structure of this paper is as follows: We present our model in Sect. 2 and derive the equilibrium properties and implications of intellectual property rights protection in Sect. 3. In Sect. 4 we examine comparative statics and the impact of stronger patent protection. We conclude the paper in Sect. 5.

2 The basic model

2.1 Final consumption and production

The basic structure of our model follows Romer (1990) and is standard in the literature (see, for example, Barro and Sala-I-Martin 2004). First assume that consumers are infinitely lived and choose consumption, C, to maximize their lifetime utility. We follow the textbook case where direct utility is given by U = log(C(t)). Under the standard intertemporal budget constraint, where income can be spent on the consumption of final goods or the purchase of new bonds that yield a return, r(t), we obtain that in equilibrium \( \dot{C}(t)/C(t) = r(t) - \rho, \) where a dot represents a time derivative and ρ is the discount rate.Footnote 8

In final goods production a mass 1 of identical firms j is assumed to have the same constant returns to scale production function

$$ X_{j} (t) = A_{j} (t)^{\alpha } L_{{\rm P}j} (t)^{\beta } \sum\limits_{i = 0}^{n(t)} {x_{j} (i,t)^{1 - \alpha - \beta } } \quad{\text{with}}\,0 < \alpha + \beta < 1\,{\text{and}}\,0 < \alpha ,\beta < 1, $$
(1)

where \( X_{j}(t) \) is the output of final goods by producer j at time t, \( L_{{\rm P}j}(t) \) is production labor that earns wage \( w_{{\rm P}}(t) \), and \( x_{j}(i,t) \) is the quantity of intermediate i bought at price χ(i,t). All these quantities are flows. \( A_{j}(t) \) represents the level of accumulated knowledge in the firm, and n(t) is the number of available varieties of intermediate goods at time t. Both are stock variables. By employing specialized R&D labor, \( L_{{\rm R}j}(t) \), that earns a wage \( w_{{\rm R}}(t) \), the firm’s knowledge base can be expanded according to

$$ \dot{A}_{j} (t) = \psi A_{j} (t)^{1 - \gamma } n(t)^{\gamma } L_{{\rm R}j} (t)\quad{\text{with}}\,0\, < \,\gamma \, < \, 1. $$
(2)

The presence of \( A_{j}(t) \) reflects an intertemporal knowledge spillover. R&D is more productive when a large knowledge base has been developed in the past, but at a decreasing rate. The presence of n(t) represents the positive spillover effect of more variety in intermediates on process R&D. With more variety in intermediates, the final goods-producing sector has more degrees of flexibility to organize the production process more efficiently and thereby generate more total factor-augmenting technical change for a given level of R&D effort. Alternatively, one can say that the relevant knowledge base for firm j’s R&D is assumed to be a Cobb–Douglas aggregate of public and private knowledge, proxied by n and \( A_{j} \), respectively. ψ is a scaling productivity parameter. We have chosen a linear specification in R&D labor, following Romer (1990).
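A small numerical sketch can make the two spillovers in Eq. 2 concrete. The function name and the parameter values (ψ = 0.1, γ = 0.5) are illustrative assumptions and not calibrations from the model.

```python
def knowledge_growth_rate(A, n, L_R, psi=0.1, gamma=0.5):
    """Return A_dot / A implied by Eq. 2 for one firm.

    psi and gamma are illustrative values, not taken from the paper.
    """
    A_dot = psi * A ** (1 - gamma) * n ** gamma * L_R  # Eq. 2
    return A_dot / A  # equals psi * (n / A) ** gamma * L_R


# More private knowledge A with n fixed lowers the proportional growth rate,
# while more variety n with A fixed raises it.
base = knowledge_growth_rate(A=4.0, n=4.0, L_R=1.0)
more_A = knowledge_growth_rate(A=8.0, n=4.0, L_R=1.0)
more_n = knowledge_growth_rate(A=4.0, n=8.0, L_R=1.0)
scaled = knowledge_growth_rate(A=8.0, n=8.0, L_R=1.0)
```

Under these assumed values the growth rate depends only on the ratio n/A, so scaling both stocks by the same factor leaves it unchanged.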

A higher stock of relevant production knowledge already provides an incentive to employ R&D workers. In addition, however, we introduce the possibility of patenting knowledge that is generated in the R&D process but not directly relevant to the firm itself. If these patents are licensed out, the total license income of the firm, \( Y_{j} \), depends on the profits that the licensees can generate, Π(t), on the strength of the relative bargaining position of licensor and licensee, ξ, and on the growth rate of the firm-specific knowledge base. We assume that license income is proportional to the rate of knowledge creation, relative bargaining power, and licensees’ profits and is given byFootnote 9

$$ Y_{j} (t) = {\frac{{\dot{A}_{j} (t)}}{{A_{j} (t)}}}\xi \Uppi (t). $$
(3)

Assuming perfect competition in final goods production and normalizing the price of the final good to 1, all firms then choose \( L_{{\rm P}j}(t) \), \( x_{j}(i,t) \), and \( L_{{\rm R}j}(t) \) to maximize the value function

$$ V_{j} = \int\limits_{0}^{\infty } {{\text {e}}^{ - rt} \left( {X_{j} (t) + Y_{j} (t) - w_{{\rm P}} (t)L_{{\rm P}j} (t) - w_{{\rm R}} (t)L_{{\rm R}j} (t) - \sum\limits_{i = 0}^{n(t)} {\chi (i,t)x_{j} (i,t)} } \right)} {\rm d}t. $$

As wages and prices as well as the number of intermediate varieties are given to the firm, the dynamic optimization problem has 2 + n control variables and 1 state variable. Dropping time arguments to save on notation and substituting for output, knowledge creation, and license income using Eqs. 1–3, this problem is characterized by the Hamiltonian

$$ H_{j} = {\text{e}}^{ - rt} \left( {A_{j}^{\alpha } L_{{\rm P}j}^{\beta } \sum\limits_{i = 0}^{n} {x_{j} (i)^{1 - \alpha - \beta } } + \psi A_{j}^{ - \gamma } n^{\gamma } L_{{\rm R}j} \xi \Uppi - w_{{\rm P}} L_{{\rm P}j} - w_{{\rm R}} L_{{\rm R}j} - \sum\limits_{i = 0}^{n} {\chi (i)x_{j} (i)} } \right) + \mu_{j} \left( {\psi A_{j}^{1 - \gamma } n^{\gamma } L_{{\rm R}j} } \right), $$

where the levels of employment, \( L_{{\rm P}j} \) and \( L_{{\rm R}j} \), and intermediate use, \( x_{j}(i) \), are control variables and the stock of firm-specific knowledge, \( A_{j} \), is the relevant state variable. The solution is therefore characterized by n + 5 first-order conditions. From the first-order conditions, we first obtain the standard Cobb–Douglas result that labor demand is given by

$$ L_{{\rm P}}^{{\rm D}} = \sum\limits_{j} {L_{{\rm P}j}^{{\rm D}} } = \sum\limits_{j} {\left( {{\frac{{\beta A_{j}^{\alpha } \sum\nolimits_{i = 0}^{n} {x_{j} (i)^{1 - \alpha - \beta } } }}{{w_{{\rm P}} }}}} \right)^{{{\frac{1}{1 - \beta }}}} } = \sum\limits_{j} {{\frac{{\beta X_{j} }}{{w_{{\rm P}} }}}} = {\frac{\beta X}{{w_{{\rm P}} }}}. $$
(4)

The total wage sum for production workers is then βX. For given employment levels, Eq. 4 shows that wages grow at the same rate as total output in equilibrium.Footnote 10 For intermediates the firm will choose the levels of each variety to satisfy n first-order conditions that yield isoelastic demand curves for each variety i by final goods producer j:

$$ x_{j} (i)^{{\rm D}} = {\frac{{\chi (i)^{{{\frac{ - 1}{\alpha + \beta }}}} }}{{\sum\nolimits_{i = 0}^{n} {\chi (i)^{{{\frac{\alpha + \beta - 1}{\alpha + \beta }}}} } }}}(1 - \alpha - \beta )X_{j}. $$
(5)

Multiplying (5) by χ(i) and summing over all varieties i shows that the total expenditure on intermediates by firm j is \( (1 - \alpha - \beta )X_{j} \).Footnote 11 Together with the result on the wage costs, this implies that the final goods sector makes an operating profit of αX.
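The share results above can be checked numerically. The sketch below draws arbitrary intermediate prices, solves the firm's first-order condition for each variety, and compares the outcome with the demand curve in Eq. 5. All parameter values and the helper name `factor_shares` are illustrative assumptions.

```python
import random


def factor_shares(alpha=0.3, beta=0.4, A=2.0, L_P=1.5, n=5, seed=0):
    """Check the income shares implied by Eqs. 1, 4, and 5 for one firm.

    All numerical values are illustrative assumptions.
    """
    rng = random.Random(seed)
    s = 1 - alpha - beta  # exponent on each intermediate in Eq. 1
    chi = [rng.uniform(0.5, 2.0) for _ in range(n)]  # arbitrary given prices

    # First-order condition for x(i): s * A^alpha * L_P^beta * x^(s - 1) = chi(i)
    scale = A ** alpha * L_P ** beta
    x = [(s * scale / p) ** (1 / (alpha + beta)) for p in chi]
    X = scale * sum(q ** s for q in x)  # Eq. 1

    # Demand written as in Eq. 5, for comparison with the direct solution.
    denom = sum(p ** ((alpha + beta - 1) / (alpha + beta)) for p in chi)
    x_eq5 = [p ** (-1 / (alpha + beta)) / denom * s * X for p in chi]

    w_P = beta * X / L_P  # wage at which the chosen L_P is optimal (Eq. 4)
    spend = sum(p * q for p, q in zip(chi, x))
    return {
        "intermediate_share": spend / X,
        "labor_share": w_P * L_P / X,
        "profit_share": (X - w_P * L_P - spend) / X,
        "max_gap_to_eq5": max(abs(a - b) for a, b in zip(x, x_eq5)),
    }


shares = factor_shares()
```

With α = 0.3 and β = 0.4, intermediates absorb a share 1 − α − β = 0.3 of output and labor a share 0.4, which leaves an operating profit share of α = 0.3.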

It can be verified that, at the value-maximizing levels of employment and intermediate use, the firm’s output is proportional to its knowledge stock, \( A_{j}(t) \).Footnote 12 Firms can therefore invest resources in R&D to increase output and operating profits. Moreover, R&D generates license income if the knowledge generated is commercially valuable [Π(t) > 0] and the patent system allows patent owners to capture some of the rents from commercialization (ξ > 0). Intuitively, final goods producers will increase R&D activity as long as the discounted future benefits of doing so exceed the current labor costs at the margin. As R&D is a deterministic process in our model, firms can decide to spend on R&D exactly up to that point. The solution is formally characterized by two first-order conditions, one transversality condition, and the law of motion for \( A_{j} \):Footnote 13

$$\begin{aligned} {\frac{{\partial H_{j} }}{{\partial L_{{\rm R}j} }}} &= 0 = {\text{e}}^{ - rt} \left( {\psi A_{j}^{ - \gamma } n^{\gamma } \xi \Uppi - w_{{\rm R}} } \right) + \mu_{j} \psi A_{j}^{1 - \gamma } n^{\gamma } \\ {\frac{{\partial H_{j} }}{{\partial A_{j} }}} &= - \dot{\mu }_{j} = {\text{e}}^{ - rt} \left( {\alpha A_{j}^{\alpha - 1} L_{{\rm P}j}^{\beta } \sum\limits_{i = 0}^{n} {x_{j} (i)^{1 - \alpha - \beta } } - \gamma\, \psi A_{j}^{ - \gamma - 1} n^{\gamma } \xi \Uppi L_{{\rm R}j} } \right) + (1 - \gamma )\mu_{j} \psi A_{j}^{ - \gamma } n^{\gamma } L_{{\rm R}j} \hfill \\ & \quad \mathop {\lim }\limits_{t \to \infty } \mu_{j} (t)A_{j} (t) = 0 \\ {\frac{{\partial H_{j} }}{{\partial \mu_{j} }}} &= \dot{A}_{j} = \psi A_{j}^{1 - \gamma } n^{\gamma } L_{{\rm R}j}, \\ \end{aligned} $$

where the first condition implies that firms will hire R&D labor until the marginal cost, \( w_{{\rm R}} \), equals the sum of the discounted present value of marginal benefits, which consist of the license income and the shadow value, \( \mu_{j} \), of a higher knowledge stock that a unit of R&D creates at the margin. Solving for that shadow value yields

$$ \mu_{j} = {\text{e}}^{ - rt} \left( {{\frac{{w_{{\rm R}} }}{{\psi A_{j}^{1 - \gamma } n^{\gamma } }}} - {\frac{\xi \Uppi }{{A_{j} }}}} \right). $$

Taking the time derivative and setting that expression equal to minus the right-hand side in the second condition equates the marginal return on \( A_{j} \) to the time derivative of this shadow value. Substituting the inverted law of motion (2) and the inverted production function (1) for \( L_{{\rm R}j} \) and \( L_{{\rm P}j} \), respectively, we obtain, after rearranging,

$$ \left( {r - {\frac{{\dot{w}_{{\rm R}} }}{{w_{{\rm R}} }}} + \gamma\, {\frac{{\dot{n}}}{n}}} \right){\frac{{w_{{\rm R}} }}{{\psi A_{j}^{1 - \gamma } n^{\gamma } }}} = {\frac{{\alpha X_{j} }}{{A_{j} }}} + \left( {r - {\frac{{\dot{\Uppi }}}{\Uppi }}} \right){\frac{\xi \Uppi }{{A_{j} }}}. $$
(6)
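The rearrangement behind Eq. 6 can be sketched as follows. The shorthand \( W_{j} \equiv w_{{\rm R}} /(\psi A_{j}^{1 - \gamma } n^{\gamma }) \) is introduced here only for brevity and is not part of the original notation. The sketch treats r as constant and uses \( \dot{A}_{j} /A_{j} = \psi A_{j}^{ - \gamma } n^{\gamma } L_{{\rm R}j} \) from (2) together with the definition of \( X_{j} \) in (1). Differentiating the shadow value with respect to time, and substituting the shadow value into the second first-order condition, gives

$$ \begin{aligned} - \dot{\mu }_{j} &= {\text{e}}^{ - rt} \left[ {r\left( {W_{j} - {\frac{\xi \Uppi }{{A_{j} }}}} \right) - W_{j} \left( {{\frac{{\dot{w}_{{\rm R}} }}{{w_{{\rm R}} }}} - (1 - \gamma ){\frac{{\dot{A}_{j} }}{{A_{j} }}} - \gamma {\frac{{\dot{n}}}{n}}} \right) + {\frac{\xi \Uppi }{{A_{j} }}}\left( {{\frac{{\dot{\Uppi }}}{\Uppi }} - {\frac{{\dot{A}_{j} }}{{A_{j} }}}} \right)} \right] \\ {\frac{{\partial H_{j} }}{{\partial A_{j} }}} &= {\text{e}}^{ - rt} \left[ {{\frac{{\alpha X_{j} }}{{A_{j} }}} - {\frac{{\dot{A}_{j} }}{{A_{j} }}}{\frac{\xi \Uppi }{{A_{j} }}} + (1 - \gamma ){\frac{{\dot{A}_{j} }}{{A_{j} }}}W_{j} } \right]. \end{aligned} $$

Equating the two expressions, the terms in \( \dot{A}_{j} /A_{j} \) appear identically on both sides and cancel. Collecting the remaining terms in \( W_{j} \) on the left and those in \( \xi \Uppi /A_{j} \) on the right reproduces Eq. 6.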

Equation 6 defines the wage level at which R&D workers will be employed by firm j. The wage level that solves (6) represents a horizontal demand function for R&D labor. If R&D wages exceed the threshold, no R&D workers will be employed by firm j. As long as R&D wages fall short, firm j will hire additional R&D workers.Footnote 14 We obtain for the threshold:

$$ \bar{w}_{{\rm R}j} = {\frac{{\left( {\alpha \left( {{\frac{\beta }{{w_{{\rm P}} }}}} \right)^{{{\frac{\beta }{\alpha }}}} \left( {{\frac{1 - \alpha - \beta }{{\bar{\chi }}}}} \right)^{{{\frac{1 - \alpha - \beta }{\alpha }}}} n^{{{\frac{\alpha + \beta }{\alpha }}}} A_{j} + \left( {r - \dot{\Uppi }/\Uppi } \right)\xi \Uppi } \right)\psi A_{j}^{ - \gamma } n^{\gamma } }}{{\left( {r - \dot{w}_{{\rm R}} /w_{{\rm R}} + \gamma \dot{n}/n} \right)}}}, $$
(7)

where we have substituted for \( X_{j} \) using (4) and (5) in (1).Footnote 15 As (7) holds for all firms j and all firms are price takers in input markets, an equilibrium in the R&D labor market requires that all firms that hire R&D labor pay the same wage. It can be verified in (7) that the threshold wage is firm specific and depends nonmonotonically on the value of \( A_{j} \). The slope of the right-hand side switches sign from negative to positive at

$$ A_{{\rm S}} \equiv {\frac{{(r - \dot{\Uppi }/\Uppi )\xi \Uppi }}{{\alpha \left( {{\frac{\beta }{{w_{{\rm P}} }}}} \right)^{{{\frac{\beta }{\alpha }}}} \left( {{\frac{1 - \alpha - \beta }{{\bar{\chi }}}}} \right)^{{{\frac{1 - \alpha - \beta }{\alpha }}}} n^{{{\frac{\alpha + \beta }{\alpha }}}} }}}{\frac{\gamma }{1 - \gamma }}. $$

This implies that the right-hand side of (7) is decreasing in \( A_{j} \) for \( A_{j} < A_{{\rm S}} \) and increasing in \( A_{j} \) for \( A_{j} > A_{{\rm S}} \). Assuming for simplicity that firms always start at an initial level of knowledge \( A_{0} > A_{{\rm S}} \), there is a unique level of \( A_{j} \) that all firms must attain to hire R&D labor.Footnote 16 The mechanism is that the firms with \( A_{j} = A_{\max } \) then also have the highest threshold wage for R&D. They will thus bid up R&D wages to this threshold level and employ a positive amount of R&D labor. Their level of A will then rise according to (2), whereas firms with \( A_{{\rm S}} \le A_{j} < A_{\max } \) will not hire any R&D labor and their \( A_{j} \) remains stable. The rise in \( A_{\max } \) pushes up the threshold but also increases the average A, causing production wages and intermediate prices to rise. In any equilibrium with R&D, only those firms that have \( A_{j} = A_{\max } \) can stay in the race, whereas others are forced to bring down their production employment and intermediate use levels to 0.Footnote 17 If we therefore assume that all firms start from the same initial level \( A_{j}(0) = A_{0} > A_{{\rm S}} \), the above implies that \( A_{j}(t) = A_{\max }(t) = A(t) \) for all j and we obtain for (7) that

$$ \bar{w}_{{\rm R}} = {\frac{{\left( {\alpha X + \left( {r - \dot{\Uppi }/\Uppi } \right)\xi \Uppi } \right)\psi A^{ - \gamma } n^{\gamma } }}{{\left( {r - \dot{w}_{{\rm R}} /w_{{\rm R}} + \gamma \dot{n}/n} \right)}}}. $$
(8)
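The switch point \( A_{{\rm S}} \) used in the argument leading to Eq. 8 can be checked with a short numerical sketch. The right-hand side of (7) depends on \( A_{j} \) only through a term of the form (cA + d)A^(−γ); the remaining factors are common to all firms and are assumed positive here. The constants c and d below are illustrative stand-ins, not values from the model.

```python
def rhs_shape(A, c=2.0, d=3.0, gamma=0.4):
    """A-dependent part of the right-hand side of Eq. 7: (c*A + d) * A**(-gamma).

    c stands in for the coefficient on A_j and d for (r - Pi_dot/Pi) * xi * Pi;
    both values are illustrative assumptions.
    """
    return (c * A + d) * A ** (-gamma)


def switch_point(c=2.0, d=3.0, gamma=0.4):
    """Closed form for A_S: (d / c) * gamma / (1 - gamma)."""
    return (d / c) * gamma / (1 - gamma)


A_S = switch_point()
below, at, above = rhs_shape(0.9 * A_S), rhs_shape(A_S), rhs_shape(1.1 * A_S)
```

The value at `A_S` is lower than at either neighboring point, consistent with a threshold wage that first falls and then rises in the firm's knowledge stock.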

We have shown above that a stable labor demand in production requires an equilibrium in which wages grow at the same rate as output. Equation 8 shows that the threshold wage level for R&D workers will also satisfy that constraint as long as A and n grow at the same rate and total profits grow in proportion to final output. We show below that these conditions are satisfied in the steady state, so there is no long-run relative wage divergence between R&D and production labor wages, and the income distribution is stable.

Interpreting IPR protection as a strengthening of the relative bargaining power of knowledge creators, i.e., increasing ξ, Eq. 8 shows the traditional innovation-enhancing effect of patent protection. Higher levels of IPR protection will cause the threshold wage level to go up for a given level of intermediate sector profits. If R&D then competes for its labor inputs with other activities in the model, stronger protection implies higher R&D and productivity improvement.Footnote 18
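The effect of ξ on the threshold wage in Eq. 8 can be illustrated with a minimal sketch. It holds output, profits, and all growth rates fixed, as the text does, and assumes r exceeds the growth rate of profits. Every default value and the function name are assumptions made for illustration.

```python
def rd_threshold_wage(xi, X=100.0, Pi=21.0, A=10.0, n=10.0, alpha=0.3,
                      psi=0.1, gamma=0.5, r=0.05, g_Pi=0.02, g_w=0.02, g_n=0.02):
    """Threshold R&D wage from Eq. 8; all default values are illustrative."""
    marginal_benefit = alpha * X + (r - g_Pi) * xi * Pi
    effective_discount = r - g_w + gamma * g_n
    return marginal_benefit * psi * A ** (-gamma) * n ** gamma / effective_discount


weak = rd_threshold_wage(xi=0.1)
strong = rd_threshold_wage(xi=0.5)
no_protection = rd_threshold_wage(xi=0.0)
```

For given profits, the threshold wage rises with ξ. With ξ = 0 only the productivity motive, αX, remains.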

To bring out our argument in the model we assume that R&D in final goods-producing firms competes over skilled labor with innovative activity (entry) in the intermediate sector. With all the differences one can think of, R&D and entrepreneurship are both largely non-routine and highly skilled activities that distinguish themselves more from routine unskilled production labor than from each other. We now turn to the intermediate goods sector.

2.2 Intermediate producers and entrepreneurs

The intermediate sector produces intermediate goods according to some specific process available to one intermediate firm only. We assume, however, that there are n varieties available to compete as imperfect substitutes and that new ones are allowed to enter.

One can think of the intermediate designs as being codified in a blueprint and protected by a patent, as in Romer (1990). Entrepreneurs, however, often bring a unique combination of tacit knowledge, training, talent, access to finance and support networks, etc., to their ventures, and by definition come up with a commercial opportunity that no-one recognized before.Footnote 19 Therefore, we can justify the assumption that, even in the absence of patent protection, every intermediate will be produced exclusively by one firm, and subsequent entry with a perfect substitute is not possible. The big difference now is that knowledge is excludable but not tradable. As in Romer (1990), the producers in this sector are monopolists that set their own price and compete only with imperfect substitutes.

By the assumed symmetry in the final goods-production function, all varieties face the same, isoelastic demand curve for their variety. All intermediates are produced using a simple one-for-one technology out of raw capital, K, and we assume that the monopolists are price takers in that input market, paying the market interest rate, r. The problem is then identical for every intermediate producer i. Formally, they solve the static and standard profit-maximization problem given by

$$ \mathop {\max }\limits_{\chi (i)} :\pi (i) = \chi (i)x(i) - rK(i) \,{\text{s}}.{\text{t}}.{:} \, \left( 5\right){\text{ and}}\,x\left( i \right) = K\left( i \right). $$

It follows that prices are set at \( \chi (i) = {\frac{r}{1 - \alpha - \beta }} \) and no longer vary across varieties i. As every intermediate producer sets his price equal to this value and faces the same demand function, all intermediates are demanded in the same quantity. This implies that, in equilibrium, the stock of raw capital is divided equally among all n varieties of intermediate goods and the capital share in income is given by \( rK = (1 - \alpha - \beta )^{2} X \), whereas the monopoly rents in the intermediate sector will be given by

$$ \begin{aligned} \pi (i) &= \frac{(\alpha + \beta )(1 - \alpha - \beta )X}{n} \\ \sum\limits_{i = 0}^{n} \pi (i) &= (\alpha + \beta )(1 - \alpha - \beta )X. \end{aligned} $$
(9)

These profits accrue to the entrepreneur who organized the intermediate production unit, as no other inputs or fixed (entry) costs have been assumed. However, not all rents can remain with the entrepreneur. If intermediate i is based on knowledge that is protected by a patent, then the patent holder can charge a license fee that reduces the flow of rents to the entrepreneur. As assumed above, the share retained by the entrepreneurs is (1 − ξ).
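The markup price and the rent share behind Eq. 9 can be verified with a small sketch. It assumes that each producer treats the price index in the denominator of (5) and the buyer's output as given, so that demand is isoelastic with elasticity 1/(α + β). The parameter values and function names are illustrative.

```python
def intermediate_profit(chi, r=0.05, alpha=0.3, beta=0.4, demand_scale=1.0):
    """Profit (chi - r) * x with isoelastic demand x = scale * chi**(-1/(alpha+beta)).

    demand_scale collects everything in Eq. 5 that a single producer takes as given.
    """
    x = demand_scale * chi ** (-1 / (alpha + beta))
    return (chi - r) * x


def monopoly_price(r=0.05, alpha=0.3, beta=0.4):
    """Markup price r / (1 - alpha - beta) stated in the text."""
    return r / (1 - alpha - beta)


chi_star = monopoly_price()
rent_share_of_revenue = (chi_star - 0.05) / chi_star  # equals alpha + beta
```

Profit at `chi_star` exceeds profit at nearby prices, and rents amount to a share α + β of revenue. Since total revenue from intermediates is (1 − α − β)X, this gives the total in Eq. 9.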

The positive (expected) flow of retained rents attracts new entrants. These entrants cannot enter an existing intermediate variety market, as we assume that these are protected by trade secrets, unique essential entrepreneurial traits or otherwise. However, the existence of these rents and the knowledge that there is a latent demand for new varieties makes it attractive to start one’s own venture and enter with a new intermediate variety. The value of such a new venture depends on the level of retainable profits. We assume for simplicity that all entrants receive their idea as a knowledge spillover from downstream final goods producers’ process R&D, such that all new entrants take into account the license fees that will be due.

In Eq. 3 we assumed that the final goods sector appropriates a share ξ of total intermediate profits for every 100% expansion of its knowledge base. This total license income is collected from the n intermediate firms that exist at time t. As all intermediate firms are fully symmetric, we assume that all contribute an equal amount to total license fees.Footnote 20 With total profits given in Eq. 9 as n times π(i) and taking the discounted present value of retained profits as the value of a new venture, we obtain the value of a new firm to the owner at time T asFootnote 21

$$ V_{{\rm E}} (T) = \int\limits_{T}^{\infty } {{\text{e}}^{ - rt} (1 - \xi \dot{A}/A)\pi (i,t){\rm d}t} = (\alpha + \beta )(1 - \alpha - \beta )\int\limits_{T}^{\infty } {{\text{e}}^{ - rt} } (1 - \xi \dot{A}(t)/A(t)){\frac{X(t)}{n(t)}}{\rm d}t, $$

where r is the discount rate that the entrepreneur applies.Footnote 22 We assume that the entrepreneur, as owner of the firm, can appropriate this value but also propose that this requires the investment of time and is therefore costly in terms of (skilled/R&D) wages foregone. The entry function is given by

$$ \dot{n}(t) = \varphi A(t)^{\delta } n(t)^{1 - \delta } L_{{\rm E}} (t)\quad{\text{where}}\,0\, < \,\delta \, < \, 1. $$
(10)

As the final production process is better understood, more ideas for new and further specialized intermediates are likely to emerge. We therefore assume that entry is increasing in the knowledge accumulated in final goods producers’ process R&D, A(t). The presence of n(t) reflects the fact that accumulated entrepreneurial experience increases the entry rate for given levels of activity and knowledge availability. Alternatively, one can interpret this specification as stating that entry is proportional to a Cobb–Douglas aggregate of accumulated public knowledge in entrepreneurship and R&D. φ is a scaling productivity parameter, and as before we have assumed constant returns to skilled entrepreneurial labor, \( L_{{\rm E}} \).

δ may be interpreted as a parameter that reflects the knowledge filter. This term was coined by Acs et al. (2004) to describe the institutional, informational, and other barriers to knowledge spillover between knowledge creators and commercializers. In the context of our model, one could think of nondisclosure agreements, labor contract limitations on moving to competing firms, and defensive patenting strategies in final goods-producing firms. Anything the final goods-producing firms do to limit the spillover of knowledge, including legal and other action, will reduce δ. This reduces the entry of new intermediates for given increases in knowledge and levels of entrepreneurial activity.Footnote 23 Rent income to the marginal entrepreneur at the time of entry at time T is then given by

$$ {\frac{{\partial \dot{n}(T)}}{{\partial L_{{\rm E}} (T)}}}V_{{\rm E}} (T) = \varphi A(T)^{\delta } n(T)^{1 - \delta } (\alpha + \beta )(1 - \alpha - \beta )\int\limits_{T}^{\infty } {{\text{e}}^{ - rt} } (1 - \xi \dot{A}/A){\frac{X(t)}{n(t)}}{\rm d}t = w_{{\rm E}} (T). $$
(11)

As this trade-off is identical for entrants over time, we can replace T by t and Eq. 11 can be rewritten as the flow of income for entrepreneurial labor, where we assume that, at the time of entry, entrepreneurs expect output and variety to expand at a constant rate (as they will in steady state), such that \( X(t) = X(T){\text{e}}^{{\dot{X}/X \times t}} \) and \( n(t) = n(T){\text{e}}^{{\dot{n}/n \times t}}. \) Dropping time arguments to save on notation, we obtainFootnote 24

$$ \bar{w}_{{\rm E}} = {\frac{{(\alpha + \beta )(1 - \alpha - \beta )(1 - \xi \dot{A}/A)}}{{r + \dot{n}/n - \dot{X}/X}}}\,\varphi A^{\delta } n^{ - \delta } X. $$
(12)
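The step from Eq. 11 to Eq. 12 is a standard present-value integral. As a sketch, write \( g_{X} \equiv \dot{X}/X \) and \( g_{n} \equiv \dot{n}/n \) (shorthand introduced here), treat r and \( \dot{A}/A \) as constant, and read the discount factor and the growth terms as measured from the entry date T, which is our assumption about the intended normalization. Provided \( r + g_{n} - g_{X} > 0 \),

$$ \int\limits_{T}^{\infty } {{\text{e}}^{ - r(t - T)} {\frac{{X(T){\text{e}}^{{g_{X} (t - T)}} }}{{n(T){\text{e}}^{{g_{n} (t - T)}} }}}{\rm d}t} = {\frac{X(T)}{n(T)}}\int\limits_{0}^{\infty } {{\text{e}}^{{ - (r + g_{n} - g_{X} )s}} {\rm d}s} = {\frac{X(T)}{n(T)}}\,{\frac{1}{{r + g_{n} - g_{X} }}}. $$

Multiplying by \( \varphi A^{\delta } n^{1 - \delta } (\alpha + \beta )(1 - \alpha - \beta )(1 - \xi \dot{A}/A) \) and collecting the powers of n gives the expression in Eq. 12.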

As we assume that entrepreneurship competes with R&D for skilled labor, no entry will take place if the skilled wage exceeds this level. The opportunity costs are too high, and all skilled labor is employed in R&D. If it falls below this level, however, all skilled labor will switch to entrepreneurial activity. We thus have a bang–bang equilibrium due to the constant returns to \( L_{{\rm E}} \) and \( L_{{\rm R}} \). Note that, in such a bang–bang equilibrium, either variety n or knowledge A increases, while the other is stable. This implies that A/n changes until the threshold wages in (12) and (8) equalize. We use this property to derive first the skilled labor market and then the steady-state equilibrium in Sect. 3. We analyze the relevant comparative statics in Sect. 4.

3 Equilibrium

3.1 The skilled labor market

The skilled labor market is in equilibrium when wages equate total exogenous supply to the demand in R&D and entrepreneurship and both activities earn the same income.Footnote 25 Formally, we have \( \bar{w}_{{\rm R}} = \bar{w}_{{\rm E}} \) and \( 1 = L_{{\rm R}} + L_{{\rm E}} \) to determine the equilibrium, but let us first consider what happens out of equilibrium. If \( \bar{w}_{{\rm R}} > \bar{w}_{{\rm E}}, \) then all skilled labor is allocated to R&D and none to entrepreneurship. This implies that A/n will rise. If \( \bar{w}_{{\rm E}} > \bar{w}_{{\rm R}} \) instead, all skilled labor is allocated to entrepreneurship and A/n will fall. Such changes in A/n will push the threshold wages in (8) and (12) towards each other. Only when \( \bar{w}_{{\rm R}} = \bar{w}_{{\rm E}} \) is the labor market allocation stable at positive levels of both activities.

Figure 1 plots the ratio \( \bar{w}_{{\rm R}} /\bar{w}_{{\rm E}} \) against A/n. The above implies that the labor market may clear at any ratio in the short run, but the corresponding allocation of labor over R&D or entrepreneurship implies that we will move towards the point where this ratio equals 1.

Fig. 1 The skilled labor market

Even then, however, the model is not in a steady state. The position of the convex curve still depends on the various growth rates in the model, as can be verified when we take the ratio of (8) over (12) and substitute for the growth rate and level of total profits using (9):

$$ {\frac{{\bar{w}_{{\rm R}} }}{{\bar{w}_{{\rm E}} }}} = {\frac{\psi }{\varphi }}\left( {\frac{A}{n}} \right)^{ - \gamma - \delta } {\frac{{\alpha + (\alpha + \beta )(1 - \alpha - \beta )(r - \dot{X}/X)\xi }}{{(\alpha + \beta )(1 - \alpha - \beta )(1 - \xi \dot{A}/A)}}}\,{\frac{{r + \dot{n}/n - \dot{X}/X}}{{r - \dot{w}_{{\rm R}} /w_{{\rm R}} + \gamma \dot{n}/n}}}. $$
(13)

Out of steady-state equilibrium, the labor market will thus first move A/n to (A/n)*. Because (13) depends on the interest rate and on the growth rates of output, skilled wages, and n, however, this (A/n)* is not necessarily the steady-state ratio. A steady state is reached at (A/n)* only when knowledge stocks have adjusted to such levels that A and n grow at the same positive rate and (A/n)* remains stable. We analyze the steady state below.
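To make the dependence of (A/n)* on the growth rates concrete, the following minimal Python sketch evaluates the right-hand side of Eq. 13 and the ratio A/n at which it equals 1. All parameter values, interest rates, and growth rates are illustrative assumptions rather than calibrations, and the function names are hypothetical.

```python
# Illustrative parameter values (assumptions, not taken from the paper).
PARAMS = dict(psi=1.0, phi=1.0, alpha=0.3, beta=0.3, gamma=0.5, delta=0.5, xi=0.5)


def wage_ratio(a_over_n, r, g_X, g_A, g_n, g_wR, p=PARAMS):
    """Right-hand side of Eq. 13: threshold wage in R&D relative to entrepreneurship."""
    omega = (p["alpha"] + p["beta"]) * (1 - p["alpha"] - p["beta"])
    rent_term = (p["alpha"] + omega * (r - g_X) * p["xi"]) / (omega * (1 - p["xi"] * g_A))
    discount_term = (r + g_n - g_X) / (r - g_wR + p["gamma"] * g_n)
    level_term = a_over_n ** (-(p["gamma"] + p["delta"]))
    return (p["psi"] / p["phi"]) * level_term * rent_term * discount_term


def clearing_ratio(r, g_X, g_A, g_n, g_wR, p=PARAMS):
    """A/n at which Eq. 13 equals 1, holding the interest and growth rates fixed."""
    scale = wage_ratio(1.0, r, g_X, g_A, g_n, g_wR, p)
    return scale ** (1 / (p["gamma"] + p["delta"]))


# Hypothetical out-of-steady-state interest and growth rates.
rates = dict(r=0.05, g_X=0.03, g_A=0.02, g_n=0.02, g_wR=0.03)
star = clearing_ratio(**rates)
print(star, wage_ratio(star, **rates))

# A different growth rate of n shifts the curve and hence the clearing ratio.
print(clearing_ratio(**{**rates, "g_n": 0.03}))
```

Under these assumed values the ratio in Eq. 13 falls in A/n, so it exceeds 1 below (A/n)* and lies below 1 above it, in line with the adjustment described in Sect. 3.1.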

3.2 The steady state

The model is in steady-state equilibrium when all variables expand at a constant rate and the skilled labor market allocation is stable. From the arbitrage Eqs. 8 and 12 and the analysis of the labor market above we can derive that the allocation of skilled labor is stable when A and n expand at the same rate.Footnote 26 Output, by the production function (1) and the fact that all intermediates are used at level K/n, will then grow at rate

$$ {\frac{{\dot{X}}}{X}} = \alpha {\frac{{\dot{A}}}{A}} + (\alpha + \beta ){\frac{{\dot{n}}}{n}} + (1 - \alpha - \beta ){\frac{{\dot{K}}}{K}}. $$
(14)

Using the fact that output in steady state grows at the same rate as wages, total wage income, and consumption, we know that asset income must also grow at that rate by the budget constraint of consumers. Hence, asset and raw capital accumulation must also take place at the growth rate of output. Using this fact and Eq. 14 we obtain

$$ {\frac{{\dot{X}}}{X}} = {\frac{\alpha }{\alpha + \beta }}\,{\frac{{\dot{A}}}{A}} + {\frac{{\dot{n}}}{n}}. $$

As a stable labor allocation requires a constant ratio A/n, the steady-state growth rates will be equal to:

$$ {\frac{{\dot{K}}}{K}} = {\frac{{\dot{X}}}{X}} = {\frac{{\dot{C}}}{C}} = {\frac{{\dot{B}}}{B}} = {\frac{{\dot{w}_{{\rm P}} }}{{w_{{\rm P}} }}} = {\frac{{\dot{w}_{{\rm R}} }}{{w_{{\rm R}} }}} = {\frac{{\dot{w}_{{\rm E}} }}{{w_{{\rm E}} }}} = r - \rho = {\frac{{\dot{n}}}{n}}\left( {{\frac{2\alpha + \beta }{\alpha + \beta }}} \right). $$

This solves the model if we can obtain the steady-state growth rate of n (and A). The first steady-state condition follows from rewriting Eq. 13 for the steady state. The ratio in Eq. 13 is 1 in equilibrium and can be solved for A/n to yield

$$ \frac{A}{n} = \left( {{\frac{\psi }{\varphi }}\,{\frac{\Upphi }{{\Upomega \Upxi (\dot{n}/n)}}}\,{\frac{1}{{\Upgamma (\dot{n}/n)}}}} \right)^{{{\frac{1}{\gamma + \delta }}}}, $$
(15)

where we define auxiliary parameters \( \Upomega \equiv (\alpha + \beta )(1 - \alpha - \beta ),\,\Upphi \equiv \alpha + \Upomega \rho \xi \), and functions \( \Upxi (\dot{n}/n) = 1 - \xi \dot{n}/n \) and \( \Upgamma (\dot{n}/n) = {\frac{{\rho + \gamma \dot{n}/n}}{{\rho + \dot{n}/n}}} \) to save on notation. Equation 15 can be solved in terms of parameters alone only in the special case where ξ = 0 (no license income) and ρ = 0 (no time preference). Using the condition that, in steady state, variety expansion, \( \dot{n}/n \), equals productivity growth, \( \dot{A}/A \), we can derive a second steady-state relation between entrepreneurial activity and R&D labor using Eqs. 2 and 10 as

$$ {\frac{{L_{{\rm R}} }}{{L_{{\rm E}} }}} = {\frac{\varphi }{\psi }}\left( {\frac{A}{n}} \right)^{\delta + \gamma }. $$

Using the labor market clearing condition \( 1 = L_{{\rm R}} + L_{{\rm E}} \) we can compute the steady-state level of entrepreneurial and R&D activity. We thus obtain the steady-state allocation of skilled labor as

$$ \begin{gathered} L_{{\rm E}}^{*} = {\frac{1}{{1 + {\frac{\varphi }{\psi }}\left( {\frac{A}{n}} \right)^{\delta + \gamma } }}} \hfill \\ L_{{\rm R}}^{*} = {\frac{{{\frac{\varphi }{\psi }}\left( {\frac{A}{n}} \right)^{\delta + \gamma } }}{{1 + {\frac{\varphi }{\psi }}\left( {\frac{A}{n}} \right)^{\delta + \gamma } }}}. \hfill \\ \end{gathered} $$
(16)
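As a numerical illustration of Eqs. 15 and 16, the sketch below computes A/n and the implied split of skilled labor for a given steady-state growth rate of n. The parameter values and the growth rate passed in are assumptions chosen only for illustration, and the function names are hypothetical.

```python
def steady_state_ratio(g_n, psi, phi, alpha, beta, gamma, delta, rho, xi):
    """Eq. 15: steady-state A/n for a given growth rate g_n of variety."""
    omega = (alpha + beta) * (1 - alpha - beta)   # Omega
    big_phi = alpha + omega * rho * xi            # Phi
    xi_fn = 1 - xi * g_n                          # Xi(g_n)
    gamma_fn = (rho + gamma * g_n) / (rho + g_n)  # Gamma(g_n)
    inner = (psi / phi) * big_phi / (omega * xi_fn) / gamma_fn
    return inner ** (1 / (gamma + delta))


def labor_allocation(a_over_n, psi, phi, gamma, delta):
    """Eq. 16: shares of skilled labor in entrepreneurship and in R&D."""
    k = (phi / psi) * a_over_n ** (delta + gamma)
    return 1 / (1 + k), k / (1 + k)  # (L_E*, L_R*)


# Illustrative values (assumptions, not calibrated to data).
p = dict(psi=0.1, phi=0.1, alpha=0.3, beta=0.3, gamma=0.5, delta=0.5, rho=0.03, xi=0.5)
ratio = steady_state_ratio(0.05, **p)
L_E, L_R = labor_allocation(ratio, p["psi"], p["phi"], p["gamma"], p["delta"])
print(ratio, L_E, L_R)
```

The two shares sum to 1 by construction, which mirrors the labor market clearing condition used to obtain Eq. 16.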

Plugging the level of entrepreneurship in (16) into the entry function in Eq. 10, dividing both sides by n, and using (15) to solve for the rate of variety expansion yields

$$ (\dot{n}/n)^{*} = {\frac{{\left( {\psi \Upphi } \right)^{{{\frac{\delta }{\gamma + \delta }}}} \left( {\varphi \Upomega } \right)^{{{\frac{\gamma }{\gamma + \delta }}}} }}{{\Upomega \left( {\Upgamma (\dot{n}/n)\Upxi (\dot{n}/n)} \right)^{{{\frac{\delta }{\gamma + \delta }}}} + \Upphi \left( {\Upgamma (\dot{n}/n)\Upxi (\dot{n}/n)} \right)^{{{\frac{ - \gamma }{\gamma + \delta }}}} }}}. $$
(17)

This equation determines the steady-state growth rate only implicitly, because the right-hand side is itself a function of that growth rate, and it cannot be solved analytically.Footnote 27 Equation 17 does allow us to make the following proposition:

Proposition I

There exists a positive, unique, and stable steady-state equilibrium growth rate.

The proof is presented in Appendix 1.
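Because Eq. 17 has no closed-form solution, the steady-state growth rate has to be found numerically. The following minimal sketch uses bisection on the gap between the right-hand side of Eq. 17 and the growth rate itself. It is one possible approach, not the authors' procedure. Parameter values are illustrative assumptions, and the bracket relies on ξ > 0 so that Ξ stays positive below 1/ξ.

```python
def rhs_eq17(g, p):
    """Right-hand side of Eq. 17 evaluated at a candidate growth rate g of n."""
    omega = (p["alpha"] + p["beta"]) * (1 - p["alpha"] - p["beta"])
    big_phi = p["alpha"] + omega * p["rho"] * p["xi"]
    gamma_fn = (p["rho"] + p["gamma"] * g) / (p["rho"] + g)  # Gamma(g)
    xi_fn = 1 - p["xi"] * g                                  # Xi(g)
    z = gamma_fn * xi_fn
    a = p["delta"] / (p["gamma"] + p["delta"])
    b = p["gamma"] / (p["gamma"] + p["delta"])
    numerator = (p["psi"] * big_phi) ** a * (p["phi"] * omega) ** b
    return numerator / (omega * z ** a + big_phi * z ** (-b))


def solve_growth(p, tol=1e-12):
    """Bisection for g = rhs_eq17(g, p); assumes xi > 0 and rho > 0."""
    lo, hi = 0.0, (1.0 - 1e-9) / p["xi"]  # keeps Xi(g) = 1 - xi * g positive
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if rhs_eq17(mid, p) > mid:
            lo = mid  # right-hand side still above g: root lies to the right
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative values (assumptions, not calibrated to data).
p = dict(psi=0.1, phi=0.1, alpha=0.3, beta=0.3, gamma=0.5, delta=0.5, rho=0.03, xi=0.5)
g_n = solve_growth(p)
g_X = g_n * (2 * p["alpha"] + p["beta"]) / (p["alpha"] + p["beta"])  # output growth
r = p["rho"] + g_X                                                   # from r - rho = g_X
print(g_n, g_X, r)
```

Bisection is chosen here only because it needs nothing beyond a sign change: the right-hand side is positive at a zero growth rate and vanishes as the growth rate approaches 1/ξ.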

4 Comparative statics and the impact of stronger IPR protection

4.1 The key result

The effects of stronger IPR protection can now be analyzed by deriving the impact of a higher ξ on this steady-state growth rate, and we formulate our key proposition.

Proposition II

Strengthening the level of patent protection as captured by an increase in ξ in our model will only generate increases in the overall rate of innovation if the initial level of protection is low enough. More patent protection is beneficial for economic growth as long as: \( \xi < {\frac{1 - \alpha }{{\psi^{{{\frac{\delta }{\gamma + \delta }}}} \varphi^{{{\frac{\gamma }{\gamma + \delta }}}}\, {\frac{\delta }{\gamma }} + {\frac{\rho }{\delta }}}}}. \)

Corollary I

An increase in patent protection when initial levels of patent protection are already high will result in a reduction of overall innovation. This negative effect will certainly arise when: \( \xi > {\frac{1 - \alpha \gamma }{{\psi^{{{\frac{\delta }{\gamma + \delta }}}} \varphi^{{{\frac{\gamma }{\gamma + \delta }}}} \,{\frac{\delta }{\gamma }} + {\frac{\rho \gamma }{\delta }}}}}. \)

Appendix 2 provides the proofs.

The threshold levels for ξ in Proposition II and Corollary I are reached faster when the output elasticity of knowledge in final goods production, α, is large. Intuitively, this means patent protection is less likely to be beneficial when private incentives to R&D are already strong. The effects of the knowledge spillover parameters in the R&D and the entry functions, γ and δ, are ambiguous, but the thresholds also show that more productive, highly skilled labor (higher φ and ψ) and more impatient consumers (higher ρ) unambiguously reduce the growth-maximizing level of patent protection. The intuition for these results is that more productive labor in innovation increases the rate of innovation without patents. Therefore, higher productivity reduces the effectiveness of patents in increasing R&D activity by shifting incentives from innovation to invention. Finally, impatience reduces innovation with and without patents in two ways. First, consumers’ willingness to finance investments in R&D is reduced, which reduces the benefits of strong patent protection for the incumbents. Second, the higher rental cost of capital reduces the profitability of the intermediate sector and thereby the incentives to invest in commercialization. Strong patent protection will reduce those incentives even more. Consequently the growth-maximizing level of protection is lower when consumers are less patient.
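The comparative statics of the two thresholds can be checked numerically. The sketch below codes the bounds stated in Proposition II and Corollary I and raises α, ψ, φ, and ρ by 10% one at a time. The baseline values are assumptions for illustration only (including γ < 1), and the function names are hypothetical.

```python
def threshold_prop2(psi, phi, alpha, gamma, delta, rho):
    """Bound on xi in Proposition II: below it, more protection raises growth."""
    scale = psi ** (delta / (gamma + delta)) * phi ** (gamma / (gamma + delta))
    return (1 - alpha) / (scale * delta / gamma + rho / delta)


def threshold_cor1(psi, phi, alpha, gamma, delta, rho):
    """Bound on xi in Corollary I: above it, more protection lowers innovation."""
    scale = psi ** (delta / (gamma + delta)) * phi ** (gamma / (gamma + delta))
    return (1 - alpha * gamma) / (scale * delta / gamma + rho * gamma / delta)


# Illustrative baseline (assumed values, with gamma < 1).
base = dict(psi=0.1, phi=0.1, alpha=0.3, gamma=0.5, delta=0.5, rho=0.03)
print(threshold_prop2(**base), threshold_cor1(**base))

# Raise one parameter at a time and report whether the Proposition II bound falls.
for name in ("alpha", "psi", "phi", "rho"):
    bumped = {**base, name: base[name] * 1.1}
    print(name, threshold_prop2(**bumped) < threshold_prop2(**base))
```

With γ < 1 the Corollary I bound lies above the Proposition II bound, so the two conditions leave an intermediate range of ξ in which the sign of the effect is not pinned down by these expressions alone.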

4.2 Discussion

We have introduced the parameter ξ to represent the strength, length, and breadth of patent protection. This parameter determines how much of the commercial rents of innovation the original generator of knowledge can expropriate from the commercializer of that knowledge. We argue that this parameter captures the essence of the patent system and the strength of patent protection. We envision patents as an instrument of the legislature to redistribute commercial rents from innovation between the creator and commercializer of knowledge. Stronger patents imply stronger bargaining power for the knowledge creator and hence allow him to extract a larger share of the rents. Longer patenting spells, patentability of a broader knowledge base in earlier stages of development, bias in patent infringement courts, and lower costs of patenting all work to increase the share of the knowledge creator versus the potential commercializer. Recent reforms in the US patent system (see Jaffe and Lerner 2004) are therefore largely covered by an increase in our parameter ξ. We have shown that there may be an offsetting effect of strengthening patent protection on the rate of innovation and growth, when invention and innovation draw on the same scarce resources.

These results strongly contradict the traditional idea-based growth models of Romer (1990) and others like him, who do not separate knowledge creation from commercialization. In the absence of this separation, one would conclude that internalization of spillovers through (re)enforcing intellectual property rights of R&D laboratories is always a good idea. Less spillover implies more appropriability and more R&D, which cause higher growth in the modern growth literature. This is not merely of academic interest, as these models lend strong and perhaps oversimplified support to claims made by patent lawyers, firms with large R&D laboratories, and developed countries in World Trade Organization (WTO) rounds. Our model demonstrates that support for more and better patent protection needs to at least be qualified.

As we have argued and shown above, our result emerges when commercialization and invention are no longer assumed to collapse into one decision. When commercialization of new opportunities has to take place outside the existing and inventing firm, then barriers to the knowledge spillover may reduce growth. The risks of being sued for patent infringement and losing that case in court can overturn the initial benefits of being able to legally protect monopoly profits.Footnote 28 This problem is aggravated when the patent office allows inventors to patent ideas and knowledge which they never intended to commercialize themselves. The public policy implications of this model are therefore straightforward but also unconventional. To facilitate the spilling over of knowledge, governments should stop enforcing nondisclosure agreements in labor contracts, stop enforcing defensive patenting, stop patenting knowledge unless a working prototype of a commercial product can be shown, encourage dissemination of knowledge and labor mobility between entrepreneurship and wage employment, and try to facilitate generation and diffusion of corporate R&D output.

Following the traditional endogenous growth theorists, we argue there is a case for R&D to be stimulated, for example through subsidies, but we add to that usual result the qualification that the subsidy must be used as leverage to promote commercialization of results inside and outside the firm. In this way, government can reduce deadweight losses (subsidizing R&D investments that incumbent firms would have undertaken anyway) and maximize resulting economic growth and innovation.

5 Conclusions

We have presented an endogenous growth model in which monopoly rents provide the incentive to innovate. In our model, rents motivate the commercialization of existing knowledge rather than the generation of new ideas. The model has entrepreneurs invest resources in commercialization and capture the rents from innovation. They do not, however, produce the opportunities themselves. Incumbent firms do R&D to maintain competitiveness through efficiency improvements on their final output, and in our model the commercial opportunities spill over from this R&D. We then analyze the impact of stronger IPR protection and patents in the context of our model.

The implications of this amended model are far from trivial. R&D spillovers contribute to growth but, as spinouts are growth enhancing, nondisclosure agreements and patenting may turn out to be growth inhibiting. Patent protection increases incentives to create and patent knowledge but reduces incentives to commercialize it. The latter effect may overtake the former and reduce the aggregate rate of growth. When IPR protection and patents shift a share of the rents from knowledge commercializers to knowledge generators, the resulting rate of innovation in the economy follows an inverted U-shape in the level of protection.

New growth theory correctly asserts that the knowledge generated by commercial R&D can be a source of steady-state growth, but inaccurately considers it a sufficient precondition or even the most important one. Protecting and giving incentives for generation of knowledge are useful and necessary, but doing so through mechanisms such as patents and IPR may shift the balance of power in the ex post bargain over rents too much in favor of knowledge creators. This can reduce incentives to commercialize to the extent that economic growth falls. As both the inventor and the innovator generate large positive spillovers to society, a more balanced approach to IPR protection is required.

Knowledge is only valuable to society when it is commercialized in new products and services. The patent system was never intended to enable large firms’ legal departments to bully small competitors out of adjacent market niches, or to enable individual inventors who lack the motivation, talent or means to commercialize their ideas to discourage others from doing so. As Jaffe and Lerner (2004) have argued forcefully, however, that is exactly what the most recent reforms in the US patent system have accomplished.

In our model we have abstracted from uncertainty and have introduced IPR protection at a very high level of abstraction as part of the bargain between knowledge creator and commercializer. That bargain and the relative bargaining power of the parties involved may have many other possible legal, institutional, and economic aspects to be considered. Possible extensions at this point include the role of intermediaries such as venture capitalists and university technology transfer offices. Also, our crude parameterization of IPR leaves much to be desired when it comes to the many dimensions of IPR protection. O’Donoghue and Zweimüller (2004), for example, distinguish leading and lagging breadth, patentability requirements, and patent length as relevant and distinct dimensions of patent protection systems. Stronger protection in one or another of these dimensions may have a quite different impact on the relative bargaining power of commercializing entrepreneurs vis-à-vis patent owners.Footnote 29 Optimization of patent design over these dimensions would require a more explicit model of the bargaining process to specify how patents affect relative bargaining strength and the consequent bargaining outcome that our parameter reflects. This extension, however, we leave for future research. In future work, we also aim to be more explicit on the issue of risk and to derive more precisely how the ex ante value of new ventures is shared among parties involved in the innovation process. Although, to our knowledge, our model assumptions do not contradict the empirical evidence, the model’s predictions are yet to be tested against the data.