Keywords

JEL Classification

1 Introduction

The danger that greenhouse gas (GHG) stocks cause environmental damage has led to a renewed interest in the problem of controlling emissions when there is asymmetric information about abatement costs. Although hybrid policies, e.g., cap and trade with a price ceiling, are more efficient than either the tax or quantity restriction (Pizer 1999), the comparison of taxes and quotas remains an important policy question. Since GHGs are a stock pollutant, the regulator’s problem is dynamic. Most of the current literature on this dynamic problem assumes that non-strategic firms solve a succession of static problems. If, however, a firm’s abatement costs depend on its stock of abatement capital, the firm makes a dynamic investment decision as well as the static emissions decision. We study the regulatory problem with asymmetric information when firms invest in abatement capital. We find that for general functional forms, quotas have an advantage over taxes that had not previously been recognized: quotas, but not taxes, are “information-constrained” optimal with respect to investment. We then consider a special functional form (linear-quadratic) where that advantage disappears, and here we find that taxes can increase abatement and welfare.

Earlier literature, e.g., Chichilnisky and Heal (1995), provided policymakers with the economic framework needed to compare different types of policies, thereby contributing to the creation of the carbon market currently used by some Kyoto signatories in the European Trading Scheme. Ellerman (2010) discusses the current state of this market. Our paper contributes to policy discussions by helping to illuminate a difference between taxes and quotas when forward-looking firms make investments that reduce abatement costs. In our setting, investment decreases costs by increasing the stock of abatement capital. We can also think of the firms’ activity as investment in (excludable) R&D.

For a variety of pollution problems, capital costs comprise a large part of total abatement costs (Vogan 1991) and investment in abatement capital depends on the regulatory environment. In these cases, the endogeneity of investment is an important aspect of the regulatory problem. Several recent papers (Buonanno et al. 2001; Goulder and Schneider 1999; Goulder and Mathai 2000; Norhaus 1999) assume that the regulator can choose emissions and also induce firms to provide the first-best level of investment, e.g., by means of an investment tax/subsidy.

We consider the situation where the regulator has a single policy instrument, either a sequence of emissions taxes or a sequence of quotas. This assumption is consistent with many regulations and proposals that involve an emissions policy but ignore endogenous investment (e.g., the Kyoto Protocol). In virtually any real-world problem, the regulator is likely to have fewer instruments than targets. Our model is an example of this general disparity between the number of instruments and targets, and therefore is empirically relevant. We identify a previously unrecognized difference between taxes and quantity restrictions, and we provide a simple means of solving the regulatory problem when a certain condition holds.Footnote 1

We now describe the problem in more detail. In each period the representative firm observes an abatement cost shock that is private information. If this cost shock is serially correlated, the regulator learns something about its current value by observing past behavior. The firm knows the current value of the cost shock and therefore is better informed than the regulator. Both the regulator and firms obtain information over time. We examine a subgame perfect equilibrium in which the regulator and firms condition their decisions on “directly payoff-relevant” information, as distinct, for example from the entire history of actions. That is, we consider a Markov Perfect Equilibrium (MPE).

For the regulator, the payoff-relevant information consists of the aggregate stock of abatement capital (which affects the industry-wide marginal abatement costs), the stock of pollution (which determines marginal damages) and the regulator’s beliefs about the current cost shock (which also affects the industry’s marginal abatement costs). For the firms, the payoff-relevant information consists of the current policy level (the tax or quota), the current cost shock, the individual firm’s level of abatement capital, the aggregate industry capital and the pollution stock. The last two state variables are payoff-relevant for the firm because the firm understands that these variables affect the evolution of aggregate capital stock and pollution stock, and the firm understands that the regulator conditions future policy levels on future values of those two stocks.

The assumption of Markov Perfection means that the regulator cannot make binding commitments regarding future policies. Firms have rational expectations; they take the current emissions policy as given and they understand how the regulator chooses future policies. The non-atomic representative firm is not able to affect the economy-wide variables that determine future policies. The representative firm therefore behaves non-strategically, but not myopically, and also uses Markov policies.

The regulator understands that future emissions policies affect the current shadow value of abatement capital and thus affect current investment. For example, firms’ anticipation that future emissions policies will be strict would increase the shadow value of abatement capital, thereby increasing the current level of investment. The regulator at the current time, t, might want to commit to a future policy, implemented at \(t^{\prime }>t\), as a means of affecting current investment in abatement capital. This incentive is the source of the familiar time-consistency problem. After time t, the time-t investment is predetermined, so the motivation for the time \(t^{\prime }\) policy choice that was optimal at time t has changed. Our setting has the usual ingredients that lead to this problem: the regulator with a second-best instrument (the emissions tax or quota) wants to influence forward-looking agents.

Subgame perfection is stronger than time-consistency; the former requires that no agent wants to deviate from the equilibrium strategy at any possible subgame, and the latter requires only that no agent wants to deviate from the equilibrium strategy at subgames that actually occur in equilibrium. The assumption of Markov perfection (a refinement of subgame perfection) therefore implies time-consistency.

In order to understand an important difference between emissions taxes and quotas, it is useful to determine whether the restriction to time-consistency is a binding constraint. To this end, we ask whether the solution to the regulator’s problem, in the absence of a time-consistency constraint, would in any case be time-consistent. If the private level of investment under the equilibrium emissions policy (the tax or quota) is socially optimal, then the regulator wants to use the emissions policy exclusively to influence emissions, without having to consider how the emissions policy affects investment. In this case, the regulator has no incentive to alter a previously announced policy rule. In this circumstance there is no time-consistency problem. Here we can obtain the MPE by solving an optimization problem that contains elements of the regulator’s and the firms’ problems. If, in contrast, firms’ investment decisions are not socially optimal given beliefs about the emissions trajectory, then the regulator would like to choose the emissions policy partly to influence investment. In this case, the Markov Perfect restriction, which implies time-consistency, actually constrains the regulator. Here, we need to solve an equilibrium problem (a dynamic game between the regulator and non-strategic firms) rather than a relatively simple dynamic optimization problem. In other words, the type of problem that we need to solve an equilibrium problem or an optimization problem depends on whether the equilibrium level of investment is socially optimal, conditional on the emissions trajectory.

There is another way of thinking about the time-consistency problem. The only market failure is that firms do not take into account the social damages arising from emissions. If there were no cost shock, or if the regulator and firms had symmetric information, either the emissions tax or the quota would be sufficient to induce firms to emit at the optimal level. In that case, firms’ investment decisions would be first best. Therefore, if in addition to the emissions policy the regulator were able to use an investment tax, the optimal level of that tax would be identically zero. However, when there is asymmetric information about abatement costs, there is no assurance that either the emissions tax or the quota leads to the first best level of emissions. Therefore, with asymmetric information, the equilibrium level of investment under the emissions policy might not be (information-constrained) socially optimal. In that case, the optimal level of an investment tax would be non-zero.

We can ask our basic question in two equivalent ways. (1) Is the optimal emissions tax or quota policy time-consistent? (2) Would a regulator who uses either the emissions tax or the quota increase welfare by additionally using an investment tax/subsidy? (In other words: Is the optimal investment tax/subsidy identically 0?).

We provide a simple answer to these questions. The optimal quota policy is time-consistent; equivalently, when the regulator can use both an emissions quota and an investment tax, the latter is identically 0. However, the optimal emissions tax policy is time-inconsistent, unless a particular “separability condition” holds; if this condition does not hold, the optimal investment tax, when used with the emissions tax, is not identically 0.

This result is useful for two reasons. First, under plausible circumstances the separability condition does not hold. In these circumstances, an emissions tax creates a secondary investment distortion, whereas the emissions quota does not. Thus, we have identified a difference between taxes and quotas that has previously been unnoticed.

Second, when the separability condition does hold, we can solve the dynamic game by solving a much simpler dynamic optimization problem that combines elements of the regulator’s and the firms’ optimization problems. The separability condition holds for an important special case, the linear-quadratic model, which has been previously used to study the problem of regulating both a flow and a stock pollutant under asymmetric information. Our separability result means that we can generalize the linear-quadratic model by including endogenous abatement capital. This generalization enables us to learn how the inclusion of endogenous abatement costs affects the ranking of the two policies. Our principal numerical finding is that, in the linear-quadratic framework, including abatement costs increases the advantage of taxes over quotas.

In summary, explicit treatment of the investment decision renders the firms’ decision problems dynamic. In general, this feature favors the use of quotas, because these, unlike taxes, introduce no distortion into firms’ investment decisions. However, in special cases that include the linear-quadratic model, this distortion disappears with taxes. For our calibration, the explicit treatment of the investment decision favors taxes in the linear-quadratic setting.

The next section discusses a static problem that provides the intuition for the separability condition. Subsequent sections describe the dynamic model and show the role of the separability condition. We then discuss the linear-quadratic specialization, and explain how endogenous investment affects the comparison of taxes and quotas in that setting. In closing the paper, we discuss other aspects of the tax versus quantity debate, as it applies to climate change policy.

Other papers in this Symposium discuss related aspects of the climate change problem. Asheim et al. (2016); Lauwers (2016); Figuieres and Tidball (2016); Chichilnisky (2016) consider the ethical foundations of criteria for sustainability, and the possibility of implementing such programs. Lecocq and Hourcade (2016) examine the relation between income levels and abatement expenditure in an efficient solution. Dutta and Radner (2016) show that capital accumulation can exacerbate the tragedy of the commons, and that income transfers can alleviate this problem. Rezai et al. (2016) explain why meaningful climate policy may require smaller (or even zero) sacrifice by the current generation, contrary to the conclusions of mainstream integrated assessment models. Burniaux and Martins (2016) use a computable general equilibrium model to evaluate the key parameters in determining the magnitude of “carbon leakage”. Ostrom (2016) emphasizes the importance of climate change policy at sub-global levels. Chipman and Tian (2016) elaborate on the role of the Coase Theorem in climate change policy.

2 The One-Period Example

This section uses a one-period model that demonstrates, in a simple setting, the difference between taxes and quotas when abatement costs are endogenous. We show that the emissions quota is always time-consistent; equivalently, if the regulator uses an emissions quota and also is able to use an investment tax/subsidy, the optimal investment tax is 0. In contrast, the emissions tax is not time-consistent; equivalently, if the regulator uses an emissions tax and also is able to use an investment tax/subsidy, the investment tax is not 0 in general. However, the optimal emissions tax is time-consistent if and only if the primitive functions satisfy a particular “separability condition”.

There is a simple explanation for this difference between taxes and quotas. An emissions policy chosen before investment has a “direct welfare effect” via its direct effect on emissions, and an “indirect welfare effect”, via its effect on investment. When the policy level is chosen after investment, the regulator takes into account only the direct welfare effect, since investment is fixed. The optimality condition for the policy chosen before investment requires that the sum of the direct and the indirect welfare effects is set equal to 0. The optimality condition for the policy chosen after investment requires that the direct welfare effect is set equal to 0. These two optimality conditions are equivalent if and only if the indirect effect is 0 whenever the direct effect is 0. That equivalence does not hold in general under taxes, but it does hold when the separability condition is satisfied. In contrast, the indirect welfare effect under quotas is always 0, so the optimal quota is the same regardless of whether it is chosen before or after taxes. All of this becomes obvious once we develop some notation and write down the optimality conditions.

We normalize the initial level of abatement capital to 0. The non-strategic but forward looking representative firm can buy k units of abatement capital at cost c(k); the firm obtains benefits \(B\left( x,k,\theta \right) \) by emitting x units of emissions when its stock of abatement capital is k and the cost shock is \(\theta \). We can think of the function \(B\left( \cdot \right) \) as a restricted profit function in which input and output prices are suppressed. Alternatively, we can interpret \(B\left( \cdot \right) \) as the amount of avoided abatement costs. For the latter interpretation, define \(x^{b}\) as the Business-as-Usual (BAU) level of emissions, i.e., the level of emissions under the status quo. Define \(a=x^{b}-x\) as the level of abatement, i.e., the reduction in emissions due to a new regulatory policy. The abatement costs associated with the new regulations are \(A=A\left( k,\theta ,a\right) \). If \(x^{b}\) is a function of \(\left( k,\theta \right) \), we can rewrite the abatement cost function as \(A\left( k,\theta ,a\right) =B\left( x,k,\theta \right) \), with \(A_{a}\left( \cdot \right) =B_{x}\left( \cdot \right) \): marginal abatement costs equal the marginal benefit of emissions.

The benefit function is increasing and concave in x and k and increasing in \(\theta \) (\(B_{k}>0\), \(B_{\theta }>0\), \(B_{x}>0\), \(B_{kk}<0\), \(B_{xx}<0\)). More abatement capital decreases the marginal cost of abatement and therefore lowers the marginal benefit of pollution, so \(B_{xk}<0\). A higher cost shock increases the marginal benefits of abatement capital and emissions: \(B_{k\theta }\ge 0\), \(B_{x\theta }\ge 0\).

The damage from emissions (external to the firm) is D(x). The regulator chooses either an emissions tax p or an emissions quota \(\bar{x}\). Throughout this paper, we assume that the emissions quota is binding for all realizations of \(\theta \). Both the regulator and the firm have the same information about the distribution of \(\theta \) before the firm observes its value.

Each firm has measure 0, and by choice of units the mass of firms has measure 1. With this normalization, in a symmetric equilibrium k and x represent the industry-wide capital stock and aggregate emissions, as well as the firm level values. The non-strategic firm chooses its (possibly constrained) level of k and x but takes the industry-wide levels as exogenous. In this section it is clear from the context whether we mean firm or aggregate level variables, but in a later section we modify the notation to avoid the possibility of misunderstanding.

We consider the following two time-lines:

Time Line A

Time Line B

1. The regulator chooses the policy level (p or \(\bar{x}\))

1. The firm chooses investment (k)

2. The firm chooses investment (k)

2. The regulator chooses the policy level (p or \(\bar{x}\))

3. Nature reveals the cost shock (\(\theta \))

3. Nature reveals the cost shock (\(\theta \))

4. The firm makes its emissions decision (x)

4. The firm makes its emissions decision (x)

With Time Line A, the emissions policy can influence both the levels of investment and emissions. With Time Line B, the emissions policy depends on the level of investment, and influences only the emissions level. In the one-period game, neither of the two time lines has a greater claim to plausibility, but the comparison of the two helps to understand the time-consistency problem in the dynamic setting.

If the optimal policy for the regulator is the same under both time lines, then it is obvious that the regulator uses that policy only to affect the emissions decision, not to influence the investment decision. In this case, the emissions policy does not create a secondary distortion in the investment decision; if we were to add a period after stage 2 and before stage 3 (“stage 2.5”) to Time Line A, at which the regulator were permitted to revise the policy announced in stage 1, the regulator would not want to make a revision when policies are time-consistent.

We show that the emissions quota is always time-consistent, but the emissions tax is time-consistent if and only if a particular separability condition holds. Equivalently, if we were to add a “stage 0” to either time line, at which the regulator announces an investment tax, the optimal investment tax is always 0 when the regulator uses an emissions quota, but it is 0 when the regulator uses an emissions tax if and only if the separability condition holds. To establish this claim, we examine each policy under both time lines.

Emissions taxes Under Time Line A, p is predetermined when the individual firm chooses k. Under Time Line B, p depends on aggregate capital, about which firms have rational point expectations. Since firms take aggregate capital as given when making their investment decision, and since they know the relation between p and aggregate capital, it is “as if” they take p as given under Time Line B as well. In short

Remark 1

Under both time lines, the individual firm takes p as given when choosing its level of capital.

This remark simplifies comparison of the two time lines because we need only consider how the choice of time lines affects the regulator’s strategic incentives; firms do not behave strategically, so the choice of time lines obviously does not affect their strategic incentives.

Consider Time Line A when the regulator uses a tax. The representative firm’s payoff in stage 2 is

$$\begin{aligned} E\left[ B(x,k,\theta )-px-c(k)\right] , \end{aligned}$$

where the expectation is with respect to \(\theta \). The firm chooses x in the last stage, conditional on k, p, and \(\theta \). It chooses k before it learns \(\theta \). The first-order conditions for x and k and the corresponding decision rules (denoted using \(*\)) in stages 4 and 2 are

$$\begin{aligned} B_{x}(x,k,\theta )-p=0\Rightarrow x=x^{*}\left( k,p,\theta \right) . \end{aligned}$$
(1)
$$\begin{aligned} E\left[ B_{k}(x^{*},k,\theta )-c^{\prime }(k)\right] =0\Rightarrow k=k^{*}\left( p\right) . \end{aligned}$$
(2)

Note that \(k^{*}\left( p\right) \) is independent of \(\theta \). Differentiating the first-order condition (1) gives the comparative statics result

$$\begin{aligned} \frac{\partial x^{*}}{\partial p}=\frac{-1}{B_{xx}\left( x,k,\theta \right) } \ \ \mathrm{{and}} \ \frac{\partial x^{*}}{\partial k}=\frac{-B_{xk}\left( x,k,\theta \right) }{B_{xx}\left( x,k,\theta \right) }. \end{aligned}$$
(3)

The regulator’s problem under Time Line A is

$$\begin{aligned} \max _{p}E\left[ B(x^{*},k^{*},\theta )-c(k^{*})-D\left( x^{*}\right) \right] , \end{aligned}$$

leading to the first-order condition

$$\begin{aligned}&E\left\{ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{\partial x^{*}}{\partial p}\right\} +E\left\{ \left[ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{ \partial x^{*}}{\partial k}\right. \right. \nonumber \\&\quad \,\, \left. \left. +B_{k}(*,\theta )-c^{\prime }(k^{*}) \right] \frac{dk^{*}}{dp}\right\} =0, \end{aligned}$$
(4)

(using the notation \(*\,=\,\left( x^{*},k^{*}\right) \)). Because \(k^{*}\left( p\right) \) is independent of \(\theta \), we can take \(\frac{dk^{*}}{dp}\) outside the expectations operator and use the firm’s optimality condition (2) to write the regulator’s optimality condition as

$$\begin{aligned} E\left\{ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{\partial x^{*}}{\partial p}\right\} +\frac{dk^{*}}{dp} E\left\{ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{\partial x^{*}}{\partial k}\right\} =0, \end{aligned}$$
(5)

The first term on the left side is the direct welfare effect of the tax and the second term is the indirect welfare effect, operating through investment.

The first-order condition for the firms’ emissions decision is the same under the two time lines, since in both cases the firm conditions its emissions decision on predetermined values of p and k and the realized value of \(\theta \). In view of Remark 1, the first order condition to the individual firm’s investment problem under Time Line B is (apart from an inessential notational difference) still given by Eq. (2). This notational difference is that instead of treating p as a predetermined variable, under Time Line B the firm treats p as a function of aggregate capital, which the firm takes as given.

The fact that (apart from the notational difference) the firm’s first order conditions are the same under the two time lines means that the functions \(x^{*}\left( k,p,\theta \right) \), and \(k^{*}\left( p\right) \) are also the same under the two time lines, although of course the values of p in the two scenarios (and therefore the equilibrium values of k and x) might differ. This possible difference is the key to the time-consistency issue. Under Time Line B the regulator takes (aggregate) k as given, so its first-order condition for the tax is

$$\begin{aligned} E\left\{ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{\partial x^{*}}{\partial p}\right\} =0. \end{aligned}$$
(6)

Note that this optimality condition equates the expected marginal benefits and costs of the tax, not the expected marginal benefits and costs of emissions. The two need not be the same, because in general \(\frac{\partial x^{*}}{\partial p}\) depends on \(\theta \).

Denote the optimal tax under Time Line B as \(\hat{p}\). We assume that the regulator’s problem is concave under both time lines, so that the solution to the respective first order condition is unique. Comparison of Eqs. (5) and (6) shows that the optimal emission tax is the same under the two time lines if and only if the indirect effect of the tax, evaluated at \(p=\hat{p}\), is zero, i.e., if

$$\begin{aligned} E\left\{ \left[ B_{x}(*,\theta )-D^{\prime }\left( x^{*}\right) \right] \frac{\partial x^{*}}{\partial k}\right\} _{\mid p=\hat{p}}=0. \end{aligned}$$
(7)

We refer to the following as the “separability condition”:

Condition 1

(Separability) \(B_{xx}\) and \(B_{xk}\), evaluated at the optimal \(x^{*}\), are independent of \(\theta \).

Remark 2

Equation (7) holds for all functions \(B\left( x,k,\theta \right) \) if and only if the separability condition holds.

Proof

In order to establish the sufficiency of Condition 1, note that it implies (using Eq. 3) that both \(\frac{\partial x^{*}}{\partial p}\) and \(\frac{\partial x^{*}}{\partial k}\) are independent of \(\theta \). This independence, together with Eq. (6) and the fact that \(\frac{\partial x^{*}}{\partial p}\ne 0\) imply that \(\hat{p}\) (the optimal tax under Time line B) satisfies

$$\begin{aligned} E\left[ B_{x}(x^{*}\left( k^{*},p,\theta \right) ,k^{*},\theta )-D^{\prime }(x^{*}\left( k^{*},p,\theta \right) )\right] =0. \end{aligned}$$
(8)

That is, under Condition 1 the tax equates expected marginal benefits of emissions with marginal damages. The independence of \(\frac{\partial x^{*}}{\partial k}\) and \(\theta \), and the fact that \(\frac{\partial x^{*}}{\partial k}\ne 0\), mean that Eq. (8) implies Eq. (7). Therefore, the optimal tax under the two lines is the same.

In order to establish necessity, note that if either \(B_{xx}\) or \(B_{xk}\) are not independent of \(\theta \), it is straightforward to construct examples under which the regulator’s first order conditions for p differ under the two time lines.\(\quad \square \)

It is also easy to show:

Remark 3

Suppose that the regulator uses an emissions tax. If we modify either time lines by adding a stage 0 at which the regulator is able to choose an investment tax/subsidy, the optimal level of this policy is identically 0 for all functions \(B\left( x,k,\theta \right) \) if and only if the separability condition holds.

We omit the proof, which parallels the proof of Remark 2.

We see that time-consistency requires that Eq. (6) implies Eq. (7). Equation (6) states that the first-order change in welfare due to a change in the tax, (holding investment fixed), is zero. Equation (7) states that the first-order change in welfare due to a change in investment (holding the tax fixed) is zero. In general, of course, there is no reason that one equation implies the other, so in general the tax chosen before investment is not time-consistent. However, the separability condition implies two things about the problem: (1) \(\frac{\partial x^{*}}{\partial p}\) is independent of \(\theta \), so that setting the expected net marginal benefit of the tax (holding investment fixed) equal to 0 is equivalent to setting the expected net marginal benefit of emissions equal to 0; and (2) \(\frac{\partial x^{*}}{\partial k}\) is independent of \(\theta \), so that setting the expected net marginal benefit of investment (holding the tax fixed) equal to 0 is equivalent to setting the expected marginal benefit of emissions equal to 0. When both Eqs. (6) and (7) are equivalent to setting expected net marginal benefit of emissions equal to 0, the two are equivalent to each other.

Emissions quotas Based on the same reasoning that led to Remark 1, we have

Remark 4

Under both time lines, the individual firm takes the emissions quota as given when choosing its level of capital.

If the regulator uses quotas (that by assumption are binding for all \(\theta \)) the firm’s emissions decision equals \(\bar{x}\), and \(\frac{\partial x^{*}}{\partial \bar{x}}=1\). In view of Remark 4, the firm’s first-order condition for the choice of k (for both of the two time lines) is

$$\begin{aligned} E\left[ B_{k}(\bar{x},k,\theta )-c^{\prime }(k)\right] =0\Rightarrow k=k^{*}\left( \bar{x}\right) . \end{aligned}$$
(9)

As was the case with taxes, there is an unimportant notational issue: with Time Line A, \(\bar{x}\) is literally predetermined, while with Time Line B, the firm treats \(\bar{x}\) as a known function of aggregate investment, and the firm takes aggregate investment as given.

Under Time Line A, the regulator’s first order condition for \(\bar{x}\) is

$$\begin{aligned} \left. \begin{array}{ll} &{}E\left\{ \left[ B_{x}(\bar{x},k^{*},\theta )-D^{\prime }\left( \bar{x} \right) \right] +\left[ B_{k}(\bar{x},k^{*},\theta )-c^{\prime }(k^{*})\right] \frac{dk^{*}}{d\bar{x}}\right\} \\ &{}\quad \,\,=E\left\{ \left[ B_{x}(\bar{x},k^{*},\theta )-D^{\prime }\left( \bar{x} \right) \right] \right\} =0, \end{array} \right. \end{aligned}$$
(10)

where the first equality uses Eq. (9), the fact that \(k^{*}\) is independent of \(\theta \), and \(\frac{dk^{*}}{d\bar{x}}\ne 0\). The first order condition under Time Line B is identical to the second equality in Eq. (10). Thus, when non-strategic firms have rational expectations, the optimal quota is the same under the two time lines. The regulator uses the quota to target only emissions, and the firm’s investment decision is information-constrained socially optimal. There is no social value in using an investment tax when the regulator uses an emissions quota.

The time-consistency of quotas is due to the fact that the indirect welfare effect of the quota is always 0. Although the quota does affect the level of investment, the fact that the quota is always binding means that this investment does not affect the level of emissions. The only remaining indirect effect comes via the change in total cost; however, the private optimality of investment insures that it chosen so that expected marginal costs savings due to an extra unit of investment equals the marginal cost of investment. Thus, the private optimality of the investment decision insures that investment is also socially optimal.

3 Basics of the Dynamic Model

The stock of pollution at the beginning of period t is \(S_{t-1}\) and the flow of emissions in period t is \(x_{t}\). The fraction \(0\le \Delta \le 1\) of the pollution stock lasts into the next period, so the growth equation for \(S_{t}\) is:

$$\begin{aligned} S_{t}=\Delta S_{t-1}+x_{t}. \end{aligned}$$
(11)

The period t stock-related environmental damage equals \(D_{t}=D\left( S_{t-1}\right) \), with \(D^{^{\prime }}>0,\,\) \(D^{^{\prime \prime }}>0.\)

At time t the representative firm’s level of abatement capital is \(K_{t-1}\) and its cost shock is \(\theta _{t}\); when it emits at \(x_{t}\) its benefit is \(B_{t}=B\left( K_{t-1},\theta _{t},x_{t}\right) \). At time t only the firm knows the value of the random cost shock \(\theta _{t}\); there is persistent asymmetric information. All agents know the stochastic process for the cost shock, which we assume is AR(1):

$$\begin{aligned} \theta _{t}=\rho \theta _{t-1}+\mu _{t},\qquad \mu _{t}\sim iid\left( 0,\sigma _{\mu }^{2}\right) ,\qquad \forall t\ge 1, \end{aligned}$$
(12)

with \(-1<\rho <1\).Footnote 2 The sequence \(\left\{ \mu _{t}\right\} \) (\(t\ge 1\)) is generated by an i.i.d. random process with zero mean and common variance \(\sigma _{\mu }^{2}\). At time 0 the regulator knows \(\theta _{-1}\), so the subjective expectation and variance of \(\theta _{0}\) is \(\left( \rho \theta _{-1},\sigma _{\mu }^{2}\right) \). This assumption about the regulator’s initial priors makes the problem stationary; it has no bearing on our results, but merely simplifies the notation. At time \(t\ge 1\) the regulator’s variance for the current shock is \(\sigma _{\mu }^{2}\) provided that he has learned the value of the previous shock, \(\theta _{t-1}\).

The representative firm invests in abatement capital to reduce future abatement costs, i.e., to increase future benefits from pollution. The flow of investment in period t is \(I_{t}\). The fraction of abatement capital that lasts into the next period is \(0\le \delta \le 1\), so the growth equation for \(K_{t}\) is:

$$\begin{aligned} K_{t}=\delta K_{t-1}+I_{t}. \end{aligned}$$
(13)

The cost of investment, \(C_{t}=C\left( I_{t},K_{t-1}\right) \), is increasing and convex in \(I_{t}\). This convexity means that abatement capital does not adjust instantaneously.

The endogeneity of the investment decision means that the marginal abatement cost function, \(B_{x}\left( \cdot \right) \), changes endogenously. Slower adjustment of abatement capital means that it is optimal to adjust emissions more slowly.

4 The Game

In this section it is helpful to distinguish between the representative firm’s level of capital and the aggregate level of capital. We denote the former by k and the latter by \(k^{A}\). Where there is no danger of confusion, we denote both using K. Since we normalize the number of representative firms to 1, \(k^{A}=k=K\) in a symmetric equilibrium. The representative firm understands that it controls k, and that this variable affects its payoff directly, via the function \(B\left( \cdot \right) \). This firm takes the aggregate level of capital \(k^{A}\) as exogenous; \(k^{A}\) has no direct effect on the firm’s payoff. However, in a Markov Perfect equilibrium, where the regulator conditions policies on payoff-relevant information, \(k^{A}\) affects the firm’s beliefs about future policies.

To avoid a proliferation of notation, we do not distinguish between the firm’s level of emissions and the aggregate level of emissions. However, it is important to bear in mind that the firm treats aggregate emissions, and therefore the aggregate pollution stock, as exogenous.

The regulator always uses taxes or always uses quotas. The period t policy is the tax \(p_{t}\) or the quota \(x_{t}\). At time t the regulator knows the aggregate capital stock \(k_{t-1}^{A}\), the pollution stock \(S_{t-1} \) and (as we explain below), the lagged cost shock \(\theta _{t-1}\). These are the payoff-relevant variables for the regulator. In a Markov Perfect rational expectations equilibrium, the representative firm takes the current level of the regulatory policy (at time t) as given; it understands that the policy at time \(\tau >t\) will be a function of \(\left( k_{\tau -1}^{A},S_{\tau -1},\theta _{\tau -1}\right) \). Since the firm takes these conditioning variables to be exogenous, it treats future policies as exogenous. This firm chooses investment \(I_{t}\) under both policies, and it chooses the level of emissions if the regulator uses a tax.

In view of the timing conventions in the model, the regulator’s current (tax or quota) policy influences the firm’s current emission, but not the current level of investment. Investment depends on the firm’s beliefs about future policies (as was the case with Time Line B in Sect. 2).

4.1 The Firm’s Emission and Investment Responses

The firm wants to maximize the expectation of the present value of the stream of cost saving from polluting (B) minus investment cost (C) minus pollution tax payments (under taxes). The constant discount factor is \(\beta \), and we use the superscripts T and Q to distinguish functions and variables under taxes and quotas.

Taxes The firm’s value function under taxes, \(V^{T}\left( k_{t-1},\theta _{t},p_{t};S_{t-1},k_{t-1}^{A}\right) \), solves the dynamic programming equation (DPE)

$$\begin{aligned} \left. \begin{array}{cl} V^{T}\left( k_{t-1},\theta _{t},p_{t};S_{t-1},k_{t-1}^{A}\right) =&{} \mathop {\max }\limits _{{x_{t},I_{t}}} \left\{ B\left( k_{t-1},\theta _{t},x_{t}\right) -p_{t}x_{t}-C\left( I_{t},k_{t-1}\right) \right. \\ &{} \, \left. +\,\beta E_{_{t}}\left[ V^{T}\left( k_{t},\theta _{t+1},p_{t+1;}S_{t},k_{t}^{A}\right) \right] \right\} , \end{array} \right. \end{aligned}$$

subject to the equation of motion for the cost shock (12), the capital stock (13), and the pollution stock (11). The firm’s expectation at t of \(\theta _{t+1}\) and \(p_{t+1}\) is conditioned on the payoff-relevant variables \(\left( k_{t-1}^{A},\theta _{t},S_{t-1}\right) \).

The optimal level of emissions solves a static problem with the following first-order condition

$$\begin{aligned} B_{x}\left( k_{t-1},\theta _{t},x_{t}\right) -p_{t}=0. \end{aligned}$$
(14)

Solving for x, we obtain the optimal emission response

$$\begin{aligned} x_{t}^{*}=\chi \left( k_{t-1},\theta _{t},p_{t}\right) \equiv \chi _{t}. \end{aligned}$$
(15)

The optimal level of investment equates the marginal cost of investment and the discounted shadow value of abatement capital. Setting \(k^{A}=k=K\), the stochastic Euler equation isFootnote 3

$$\begin{aligned} \beta E_{_{t}}\left\{ B_{K}\left( K_{t},\theta _{t+1},\chi _{t+1}\right) -C_{K}\left( I_{t+1},K_{t}\right) +\delta C_{I}\left( I_{t+1},K_{t}\right) \right\} -C_{I}\left( I_{t},K_{t-1}\right) =0. \end{aligned}$$
(16)

This second-order difference equation has two boundary conditions, the current abatement capital \(K_{t-1}\), and the transversality condition

$$\begin{aligned} \lim _{T\rightarrow \infty }E_{t}\left\{ \beta ^{T-t}C_{I}\left( I_{T},K_{T-1}\right) K_{T}\right\} =0. \end{aligned}$$
(17)

Quotas Firms are homogeneous and quotas are not bankable. Thus, under a quota policy, there is no incentive to trade permits. The firm solves the DPE

$$\begin{aligned} \left. \begin{array}{cl} V^{Q}\left( k_{t-1},\theta _{t},x_{t};S_{t-1},k_{t-1}^{A}\right) =&{} \max _{I_{t}}\{B\left( k_{t-1},\theta _{t},x_{t}\right) -C\left( I_{t},k_{t-1}\right) \\ &{}\, \left. +\,\beta E_{_{t}}V^{Q}\left( k_{t},\theta _{t+1},x_{t+1};S_{t},k_{t}^{A}\right) \right\} . \end{array} \right. \end{aligned}$$

Again, the firm’s beliefs about the quota in the next period depend on \(\left( k_{t-1}^{A},\theta _{t},S_{t-1}\right) \).

The optimal level of investment solves the stochastic Euler equation

$$\begin{aligned} \beta E_{_{t}}\left\{ B_{K}\left( K_{t},\theta _{t+1},x_{t+1}\right) -C_{K}\left( I_{t+1},K_{t}\right) +\delta C_{I}\left( I_{t+1},K_{t}\right) \right\} -C_{I}\left( I_{t},K_{t-1}\right) =0, \end{aligned}$$
(18)

and the transversality condition (17).

The investment rule Under both taxes and quotas, the current level of investment depends on the firm’s beliefs about future policy levels, but it does not depend on the current policy level. The firm has rational expectations about future policies; we discuss this policy rule in the next section. Under either taxes or quotas, the representative firm’s equilibrium investment rule at time t is a function of \(\left( k_{t-1},\theta _{t};S_{t-1},k_{t-1}^{A}\right) \). When there is no danger of confusion, we write the investment rule as \(I^{j}\left( K_{t-1},\theta _{t},S_{t-1}\right) \), \(j=T,Q\) (for tax or quota).

4.2 The Regulator’s Problem

The regulator’s payoff equals the payoff to the representative firm net of taxes, minus environmental damages. The regulator maximizes the expectation of the present discounted value of the flow of the payoff, i.e., the expectation of

$$\begin{aligned} \sum _{t=0}^{\infty }\beta ^{t}\left( B\left( K_{t-1},\theta _{t},x_{t}\right) -C\left( I_{t},K_{t-1}\right) -D(S_{t-1})\right) . \end{aligned}$$

His policy (always a tax or always a quota) can be a function of (only) payoff-relevant variables: the current stocks of pollution and capital, and the regulator’s current information about the cost shock. Under taxes the regulator knows that Eq. (15) determines emissions. Under either policy, he knows that investment is given by \(I^{j}\left( K_{t-1},\theta _{t},S_{t-1}\right) \), \(j=T,Q\).

The regulator takes as given the investment rule and (under taxes) the emissions rule. At time t the regulator observes the aggregate stocks \(S_{t-1},K_{t-1}\). If \(\rho =0\), the regulator learns nothing about the current cost shock by observing firms’ past behavior. The past cost shock provides information about the current shock if and only if \(\rho \ne 0.\) Under taxes, the regulator learns the previous cost by observing the response to the previous tax (via Eq. (15)). With tradable quotas, the regulator learns the previous cost by observing the previous quota price.Footnote 4

The regulator’s decision rule is a function \(z^{j}\left( K_{t-1},\theta _{t-1},S_{t-1}\right) , j=T,Q\) that determines the current tax (\(j=T\)) or quota (\(j=Q\)) as a function of his current information, given his beliefs about the firm’s decision rules.

4.3 The Equilibrium

Both the regulator and the representative firm solve stochastic control problems; the exact problem that one agent solves depends on the solution to the other agent’s problem. The rational expectations equilibrium investment rule for the firm depends on the regulator’s policy rule, and that policy rule depends on the equilibrium investment rule. The investment and the regulatory decision rules generate a random sequence of pollution and capital stocks. Agents have rational expectations about these random variables.

An equilibrium consists of a (possibly non-unique) pair of decision rules \(I^{j*}\left( K_{t-1},\theta _{t},S_{t-1}\right) \) and \(z^{j*}\left( K_{t-1},\theta _{t-1},S_{t-1}\right) \) for \(j=T,Q\) that are mutually consistent; the superscript “\(*\)” indicates equilibrium functions. Hereafter, we refer to \(I^{j*}( K_{t-1},\theta _{t},S_{t-1}) \), and \(z^{i*}\left( K_{t-1},\theta _{t-1},S_{t-1}\right) \) as Markov Perfect policy rules.

Modern computational methods make it possible to (approximately) solve these kinds of dynamic equilibrium problems, i.e., to find a fixed point in function space (Judd 1998; Marcet and Marimon 1998; Miranda and Fackler 2002). These fixed point problems are not trivial, especially when the state space has more than one dimension—it has three in our problem.

5 Finding the Markov Perfect Equilibrium

In many cases, the type of model described in the previous section must be solved as an equilibrium problem rather than as an optimization problem. The next subsection explains why this complication might arise. Using an auxiliary control problem in which the regulator has two policy instruments, we then identify conditions under which the model can be solved as a straightforward optimization problem.

5.1 The Time-Consistency Problem

In general, the regulator might want to announce a rule that would determine future levels of the tax or quota. The purpose of such an announcement would be to alter the firm’s investment rule—as distinct from altering a stock that appears as an argument of the investment rule. The inability to make binding commitments, and the Markov assumption, exclude this possibility. In a rational expectations equilibrium, current investment depends on beliefs about future policies, and these beliefs and policies depend on the pollution stock. By choice of the current quota or tax level, the regulator affects the future pollution stock, which can affect future investment. Under our assumptions, the only means by which this period’s policy can influence future investment is by influencing the future level of the pollution stock.

Consider a simpler problem without asymmetric information, where a representative firm with rational expectations makes investment decisions. The firm’s optimal decisions depend on its beliefs about future regulations, and the regulator wants to influence the firm’s decisions. If the regulator has a first best policy (defined as one that does not cause secondary distortions), he can induce the firm to select exactly the decisions that the regulator would have used, had he been in a position to choose them directly. In that case, the regulatory problem can be solved as standard optimization problem. If, however, the regulator has only a second-best policy, the familiar time-consistency problem arises. (See Xie 1997; Karp and Lee 2003 for discussions of this problem, and references.) The Markov restriction is binding in this setting, so finding the equilibrium requires solving an equilibrium problem rather than a standard optimization problem.

The presence of asymmetric information in our model leads to the possibility of time-inconsistency of the optimal emissions tax or quota. We know from the literature on principal-agent problems that with asymmetric information, non-linear policies are generally superior to either the linear tax or the quota. We noted in Sect. 4.1 that the firm’s investment depends on its beliefs about future policies. Since the regulator has two targets, (emissions and investment) and only one instrument, it appears that the regulator might want to use future emissions taxes or quotas to influence the firm’s current investment decision. In that case, the information-constrained first best tax or quota would be time-inconsistent: the ability to make commitments about future taxes or quota decision rules would enable the regulator to achieve a higher payoff than under the Markov restriction. If this were the case, we would not be able to obtain a Markov Perfect equilibrium merely by solving a dynamic optimization problem, but would instead have to solve the equilibrium problem described in the previous section.

5.2 An Auxiliary Control Problem

This subsection describes an auxiliary control problem that helps identify conditions under which the Markov Perfect equilibrium can be obtained by solving an optimization problem. In this auxiliary control problem, in each period the regulator sets an emissions tax or quota using the same information as in the game; later in the same period he observes the current cost shock and then chooses investment directly. In contrast, in the game the regulator chooses only an emissions policy. The regulator’s ability (in the auxiliary problem) to control current investment directly, knowing the current cost shock, eliminates any incentive to use future emissions policies to control current investment.

In this setting, it does not matter whether the regulator chooses investment directly (e.g., by command and control), or decentralizes this decision by means of an investment tax/subsidy. In the former case, firms make no investment decision, and in the latter case, firms merely carry out the optimal investment decision induced by the investment tax/subsidy. However, the model with the investment tax/subsidy is more helpful with intuition, so we emphasize that model. The optimal investment tax/subsidy is identically 0 if and only if Markov Perfect rules in the game are equivalent to the optimal policy rules in the auxiliary problem. With an identically zero investment tax, agents in the auxiliary problem have exactly the same optimization problem as in the game. It is optimal to use a non-zero investment tax/subsidy if and only if the Markov Perfect policies do not solve the auxiliary problem.

In the auxiliary problem we need to consider a two-stage optimization within each period. At the beginning of the period the regulator knows \(\left( K_{t-1},\theta _{t-1},S_{t-1}\right) \) and chooses the emissions policy (a tax or quota); the regulator then learns \(\theta _{t}\) and chooses the level of investment or the investment tax/subsidy. It does not matter whether this time-line is plausible. We use this problem only as a means of finding conditions under which the Markov Perfect rules can be obtained by solving a control problem.

Suppose we find that the investment tax/subsidy is identically 0 in the auxiliary control problem. In this case, the regulator is willing to allow firms to choose investment, given that the regulator chooses the emissions policy. The regulator in the game is therefore also willing to allow firms to choose investment. That is, the regulator in the game also has no wish to use an investment tax/subsidy.

5.2.1 Quotas in the Auxiliary Problem

When the regulator chooses emissions quotas in the auxiliary problem he solves the following DPE:

$$\begin{aligned} \left. \begin{array}{cl} \mathcal {J}^{Q}\left( K_{t-1},S_{t-1},\theta _{t-1}\right) =&{} \max _{x_{t}}E_{\theta _{t}|\theta _{t-1}} \left\{ B\left( K_{t-1},\theta _{t},x_{t}\right) -D\left( S_{t-1}\right) \right. \\ &{} \,\left. +\,\max _{I_{t}}\left[ -C\left( I_{t},K_{t-1}\right) +\beta \mathcal {J} ^{Q}\left( K_{t},S_{t},\theta _{t}\right) \right] \right\} \end{array} \right. \end{aligned}$$
(19)

subject to Eqs. (11) and (13). The first-order condition for the optimal quota is

$$\begin{aligned} E_{\theta _{t}|\theta _{t-1}}\{B_{x}\left( K_{t-1},\theta _{t},x_{t}\right) +\beta \mathcal {J}_{S}^{Q}\left( K_{t},S_{t},\theta _{t}\right) \}=0 \end{aligned}$$
(20)

and the Euler equation for investment under quotas is

$$\begin{aligned}&\beta E_{\theta _{t+1}|\theta _{t}}\left\{ B_{K}\left( K_{t},\theta _{t+1},x_{t+1}\right) -C_{K}\left( I_{t+1},K_{t}\right) +\delta C_{I}\left( I_{t+1},K_{t}\right) \right\} \nonumber \\&\, -\,C_{I}\left( I_{t},K_{t-1}\right) =0. \end{aligned}$$
(21)

The transversality condition is

$$\begin{aligned} \lim _{T\rightarrow \infty }E_{\theta _{T}|\theta _{t}}\left\{ \beta ^{T-t}C_{I}\left( I_{T},K_{T-1}\right) K_{T}\right\} =0. \end{aligned}$$
(22)

5.2.2 Taxes in the Auxiliary Problem

Using the firm’s emission response function (15), the regulator in the auxiliary problem with emissions taxes solves the following DPE

$$\begin{aligned} \left. \begin{array}{cl} \mathcal {J}^{T}\left( K_{t-1},S_{t-1},\theta _{t-1}\right) =&{} \max _{p_{t}}E_{\theta _{t}|\theta _{t-1}}\{B\left( K_{t-1},\theta _{t},x_{t}^{*}\right) -D\left( S_{t-1}\right) \\ &{} + \, \max _{I_{t}}\left[ -C\left( I_{t},K_{t-1}\right) +\beta \mathcal {J} ^{T}\left( K_{t},S_{t},\theta _{t}\right) \right] \} \end{array} \right. \end{aligned}$$
(23)

subject to Eqs. (11), (13) and (15). We use the definition

$$\begin{aligned} H_{t}\equiv \left[ B_{x}\left( K_{t-1},\theta _{t},x_{t}^{*}\right) +\beta \mathcal {J}_{S}^{T}\left( K_{t},S_{t},\theta _{t}\right) \right] , \end{aligned}$$

and the abbreviation \(\chi _{t}\equiv \chi \left( K_{t-1},\theta _{t},p_{t}\right) =x_{t}^{*}\). The function \(H_{t}\) is the social benefit of an additional unit of emissions, and recall that \(\chi \) is the firm’s decision rule for emissions under emissions taxes (Eq. (15)). With this notation, we can write the first-order condition with respect to \(p_{t}\) as

$$\begin{aligned} E_{\theta _{t}|\theta _{t-1}}\left\{ H_{t}\frac{\partial \chi _{t}}{\partial p_{t}}\right\} =0, \end{aligned}$$
(24)

and the stochastic Euler equation for investment as

$$\begin{aligned} \left. \begin{array}{cl} &{} \beta E_{\theta _{t+1}|\theta _{t}}\left\{ B_{K}\left( K_{t},\theta _{t+1},x_{t+1}^{*}\right) -C_{K}\left( I_{t+1},K_{t}\right) +\delta C_{I}\left( I_{t+1},K_{t}\right) +H_{t+1}\frac{\partial \chi _{t+1}}{\partial K_{t}}\right\} \\ &{} -\,C_{I}\left( I_{t},K_{t-1}\right) \;=\;0. \end{array} \right. \end{aligned}$$
(25)

The transversality condition is Eq. (22).

5.3 Social Optimality of the Markov Perfect Rules

Differentiating the first-order condition (14) implies the following:

Lemma 1

Condition 1 is equivalent to the following two conditions: (1) \(\frac{\partial \chi \left( K_{t-1},\theta _{t},p_{t}\right) }{\partial p_{t}}\) is independent of \(\theta _{t}\). (2) \(\frac{\partial \chi \left( K_{t-1},\theta _{t},p_{t}\right) }{\partial K_{t-1}}\) is independent of \(\theta _{t}\), where \(p_{t}\) is the time t emissions tax.

Our main result is the following:

Proposition 1

(i) When the regulator uses emissions quotas, the solution to the auxiliary problem (19) is a Markov Perfect equilibrium to the original game. (ii) When the regulator uses emissions taxes, the solution to the auxiliary problem (23) is a Markov Perfect equilibrium to the original game if and only if the separability condition holds.

The proof, contained in Appendix 1, verifies that the equilibrium conditions in the games and in the auxiliary problems are identical under the conditions stated in the Proposition.

5.3.1 Significance of the Proposition

When the regulator uses quotas to control emissions, the Markov Perfect investment rule is always information-constrained socially optimal. With emissions quotas, the ability to use an additional policy instrument to influence investment does not increase social welfare.

If the regulator uses emissions taxes to control emissions, the Markov Perfect investment rule is socially optimal if and only if Condition 1 is satisfied. This condition depends only on the benefit function \(B\left( \cdot \right) \), not on the damage or the investment cost function. If the separability condition holds, the investment tax that supports optimal investment in the auxiliary problem is identically 0.

Proposition 1 identifies a previously unnoticed difference between taxes and quotas. When the separability condition does not hold, the regulator who uses an emissions tax to control pollution creates a secondary distortion in investment. In these circumstances, private investment is optimal under an emissions quota but not under an emissions tax. The emissions tax, but not the quota, creates the need for an investment tax/subsidy.

The Proposition also provides a simple way of obtaining the equilibrium for the game when the separability condition holds. This method requires only solving a dynamic optimization problem rather than a dynamic equilibrium problem.

5.3.2 Interpretation of the Separability Condition

We first identify the secondary distortion under emissions taxes, and we explain why it vanishes if the separability condition holds. This discussion also explains why emissions taxes and quotas typically have different effects, as regards the secondary distortion.

In order to identify the secondary distortion, we follow the standard procedure of computing the investment tax/subsidy that supports the information-constrained first best investment policy. Suppose that firms face an investment tax \(s_{t}\), so their single period payoff is \(B(\cdot )-C\left( \cdot \right) -s_{t}I_{t}-p_{t}x_{t}\). We can write the Euler equation for the capital stock corresponding to this problem, and compare it to the optimal investment policy under an emissions tax, Eq. (25). We omit the details, but the comparison implies that the investment tax supports the socially optimal level of investment if and only ifFootnote 5

$$\begin{aligned} -s_{t}+\beta \delta E_{\theta _{t+1}|\theta _{t}}s_{t+1}=\beta E_{\theta _{t+1}|\theta _{t}}\left\{ H_{t+1}\frac{\partial \chi _{t+1}}{\partial K_{t}} \right\} . \end{aligned}$$
(26)

The left side of Eq. (26) equals the effect of the tax sequence on the marginal incentive to invest in the current period. Under the investment tax, an additional unit of investment costs the firm \(s_{t}\) in the current period, but reduces the expected cost of tax payments by \(\delta E_{t}s_{t+1}\) in the next period. The right side of Eq. (26) is the present value of the expectation of the secondary distortion. \(H_{t+1}\) is the marginal value to society of an additional unit of emissions in the next period, and \(\frac{\partial \chi _{t+1}}{\partial K_{t}}\) equals the change in emissions in the next period caused by an additional unit of investment in the current period. Thus, the term in brackets in Eq. (26) is the value to society of the lower future emissions caused by the additional investment. This benefit is external to the firm. The optimal investment tax sequence induces the firm to internalize the present value of the expectation of this additional social benefit of investment, i.e., to internalize the externality.

The optimal emission quota does not create a secondary distortion. Under the quota, the expected social benefit of an additional unit of emissions is zero in each period (Eq. (20)). The socially optimal rule for determining investment, Eq. (21), involves only the current and future expected marginal investment and abatement costs. The socially optimal balance of these costs is identical to the balance that firms choose.

The optimal emissions tax, in contrast, requires that a marginal change in the tax has zero expected social value (Eq. (24)). This condition is not, in general, equivalent to the requirement that the expected social marginal benefit of emissions (\(H_{t}\)) is zero. The expected social marginal benefit of an additional unit of emissions is zero if and only if \(B_{xx}\) is independent of \(\theta \) (equivalently, if and only if \(\frac{\partial \chi }{\partial p}\) is independent of \(\theta \)). This independence implies that \(E_{t}H_{t}=0\).

Even if this independence holds, \(H_{t}\) is a random variable, a function of \(\theta \). If \(\frac{\partial \chi }{\partial K}\) is also a function of \(\theta \) (i.e., if \(B_{xK}\) is not independent of \(\theta \)), then the social marginal benefit of emissions is correlated with \(\frac{\partial \chi }{\partial K}\). In that case, the expected marginal value to society of the lower future emissions caused by the additional investment (i.e., the secondary distortion, measured by the right side of Eq. (26)) is non-zero. Here, the investment externality is non-zero. Consequently, both \(B_{xx}\) and \(B_{xK}\) must be independent of \(\theta \) in order for the investment externality to vanish under emissions taxes.

6 The Linear-Quadratic Model

The static model with a flow pollutant shows that a simple comparison of taxes and quotas requires strong functional assumptions: quadratic abatement costs and quadratic damages and additive uncertainty (the “linear-quadratic model”) (Weitzman 1974). Without these assumptions, the ranking of taxes and quotas depends on parameters such as the variance of the cost uncertainty, for which we have very poor (if any) estimates. The major insight from the static linear-quadratic model is that taxes dominate quotas when the marginal abatement cost function is steeper than the marginal environmental damage function.

Analytical comparisons of the two policies in the climate change literature use the linear-quadratic model in which damages arise from the pollution stock, rather than the flow of emissions. Some commentators have claimed that for the regulation of GHGs, taxes obviously dominate quotas, because the marginal damage function for GHGs is so flat relative to the marginal abatement cost function. This reasoning is faulty, because in the dynamic setting the marginal abatement cost depends on the flow of emissions, while the marginal damage depends on the stock of pollution. The two slopes have different units in the dynamic problem, whereas they have the same units in the static problem.Footnote 6 In the dynamic setting it is not sensible to simply compare magnitudes of the two slopes. The dynamic optimization problem has to be solved in order to know how to compare these slopes, i.e., to know what constitutes a “large slope” and a “small slope” with GHGs.

This analysis has been undertaken with the competing assumptions that the regulator announces the entire sequence of future policies today (the open loop assumption) (Newell and Pizer 2003) or that the regulator conditions future policies on future information (the feedback assumption) (Hoel and Karp 2002; Karp and Zhang 2005), and under the assumption that the regulator expects to learn about a damage parameter (Karp and Zhang 2006). This analysis, together with available estimates of parameter values, supports the view that taxes dominate quotas for climate policy. Numerical results that do not use the linear-quadratic model also support this conclusion (Pizer 1999; Hoel and Karp 2001; Pizer 2002).

To determine the effect of endogenous abatement capital, it makes sense to use the linear-quadratic model. This functional form allows us to compare our results with those of earlier papers in this literature. With different functional assumptions, we would not be able to isolate the effect of endogenous capital. The linear-quadratic model satisfies the separability assumption—a fortunate circumstance, because it means that the effect of endogenous capital is not confounded by differences (under taxes and quotas) in strategic interactions. Both the information constrained optimal tax and quota are subgame perfect. Moreover, Proposition 1 enables us to obtain this equilibrium by solving the auxiliary control problem introduced in Sect. 5.2, instead of solving a dynamic game.

The representative firm’s benefit function is

$$\begin{aligned} B\left( K_{t-1},\theta _{t},x_{t}\right) =f_{0}+\left( f_{1}+\psi \theta _{t}\right) K_{t-1}-\frac{f_{2}}{2}K_{t-1}^{2}+\left( a-\phi K_{t-1}+\theta _{t}\right) x_{t}-\frac{b}{2}x_{t}^{2} \end{aligned}$$

with \(f_{1}>0\), \(f_{2}>0\), \(b>0\), \(\psi \ge 0\), \(\phi \ge 0\). The function \(B\left( \cdot \right) \) (which includes the rental cost of capital) satisfies the separability condition. The cost of changing the level of capital isFootnote 7

$$\begin{aligned} C\left( I_{t}\right) =\frac{d}{2}\left( I_{t}\right) ^{2}, \ \ \ d>0 \text {.} \end{aligned}$$

Environmental damages are also quadratic:

$$\begin{aligned} D(S_{t-1})=\frac{g}{2}\left( S_{t-1}-\bar{S}\right) ^{2} \end{aligned}$$

where \(\bar{S}\) is the stock level that minimizes damages.

The following Remark collects a number of useful facts about the comparison of policies. These results will be obvious to readers familiar with the linear-quadratic control problem, so we state them without proof:

Remark 5

In this linear-quadratic model with additive errors, the Principle of Certainty Equivalence holds. The expected trajectories of all stock and flow variables are the same under taxes and quotas. The higher moments of these trajectories differ under the two policies. Neither the policy ranking nor the magnitude of the payoff difference depends on the information state \(\left( K_{t-1},S_{t-1},\theta _{t-1}\right) \). The magnitude (but not the sign) of the difference in payoffs depends on the variance of cost, \(\sigma _{\mu }^{2}\).

6.1 Regulated Emissions and Investment

For the linear-quadratic model we obtain an explicit equation for the emissions rule (Eq. 15) under taxes:

$$\begin{aligned} x_{t}^{*}=e_{t}-\frac{\phi }{b}K_{t-1}+\frac{\theta _{t}}{b}; \ \ e_{t}\equiv \frac{a-p_{t}}{b}. \end{aligned}$$

A higher cost realization increases current emissions, and a higher tax or a higher stock of abatement capital decreases emissions.

Using standard methods (e.g., Chap. 14 of Sargent 1987) we can solve the firm’s Euler equation (16) under taxes and (18) under quotas) to write current investment as a linear function of current capital (\(K_{t-1}\)) and the firm’s expectations of the future cost variables and policies (taxes or quotas). The optimal investment under emissions taxes is

$$\begin{aligned} \left. \begin{array}{cl} &{} I_{t}^{*}=\frac{\lambda \beta f_{1}}{d\delta \left( 1-\lambda \beta \right) }+\left( \lambda -\delta \right) K_{t-1} \\ &{}\qquad +\,\frac{\lambda \beta }{d\delta }E_{t}\left[ \left( \psi -\frac{\phi }{b} \right) \displaystyle \sum _{j=0}^{\infty }\left( \lambda \beta \right) ^{j}\theta _{t+1+j}-\phi \sum _{j=0}^{\infty }\left( \lambda \beta \right) ^{j}e_{t+1+j} \right] . \end{array} \right. \end{aligned}$$
(27)

where \(0<\lambda <1\) is the smaller root of the quadratic equation \(\lambda ^{2}+\frac{h}{\beta }\lambda +\frac{1}{\beta }=0\) and \(h\equiv -\left[ \frac{1}{\delta }+\frac{\beta }{d\delta }\left( f_{2}-\frac{\phi ^{2}}{b}\right) +\beta \delta \right] \). A lower expected future tax (i.e., a higher value of \(e_{t+j}\)) decreases current investment. A higher expected future cost shock increases (decreases) current investment if \(\psi -\frac{\phi }{b}\) is positive (negative). Since \(B_{K\theta }=\psi >0\), a higher expected cost shock increases the expected marginal benefit of capital—and thus increases the marginal shadow value of capital. This effect encourages investment. However, a higher expected cost shock increases expected emissions, reducing the expected marginal benefit of capital (\(B_{xK}=-\phi <0\)) and discouraging investment. These offsetting effects are exactly balanced if \(\psi =\frac{\phi }{b}\), in which case the cost shock has no effect on investment, under emissions taxes.

The optimal investment under emissions quotas isFootnote 8

$$\begin{aligned} \left. \begin{array}{cl} I_{t}^{*}&{} =\frac{\mu \beta f_{1}}{d\delta \left( 1-\mu \beta \right) } +\left( \mu -\delta \right) K_{t-1} \\ &{} {\quad }+\,\frac{\mu \beta }{d\delta }E_{t}\left[ \psi \sum \limits _{j=0}^{\infty }\left( \mu \beta \right) ^{j}\theta _{t+1+j}-\phi \sum \limits _{j=0}^{\infty }\left( \mu \beta \right) ^{j}x_{t+1+j}\right] . \end{array} \right. \end{aligned}$$
(28)

where \(0<\mu <1\) is the smaller root of the quadratic equation \(\mu ^{2}+\frac{w}{\beta }\mu +\frac{1}{\beta }=0\) and \(w\equiv -\left( \frac{1}{\delta }+\frac{\beta f_{2}}{d\delta }+\beta \delta \right) \). Higher expected quotas decrease investment, and higher expected cost shocks increase investment. With quotas, cost shocks have an unambiguous effect, because the firm treats future emissions quotas as exogenous.

6.2 A Limiting Case: Flow Externality

If \(\Delta =0\) all of the pollution stock decays in a single period, and the model collapses to the case of a flow externality. In this case, emissions in the current period cause damages only in the next period: \(D(S_{t-1})=D(x_{t-1})\).Footnote 9 By defining \(\tilde{D}(x_{t})=\beta D\left( x_{t}\right) \) we can write the difference between the benefits and costs of current emissions as \(B\left( K_{t-1},\theta _{t},x_{t}\right) -\tilde{D}(x_{t})\). This simplification eliminates a state variable (S), making it possible to obtain some analytic results. We can solve the dynamic programming equations under taxes and quotas and compare the payoffs.

We show that in two cases, the policy ranking does not depend on the parameters associated with abatement capital: (1) \(\rho =0\); or (2) \(\rho \ne 0\) and \(\psi \ne 0\). If neither of these two conditions hold, so that \(\rho \ne 0\) and \(\psi =0\), the policy ranking does depend on the parameters associated with abatement costs.

If \(\rho =0\), or if \(\rho \ne 0\) and \(\psi \ne 0\), the payoff difference under taxes and quotas, is

$$\begin{aligned} \mathcal {J}^{T}-\mathcal {J}^{Q}=\frac{\sigma _{\mu }^{2}}{2b\left( 1-\beta \right) }\left( 1-\frac{\beta g}{b}\right) . \end{aligned}$$

This expression reproduces a result in Weitzman (1974)’s static model and in two dynamic models (Hoel and Karp 2002; Karp and Zhang 2005).

If \(\rho \ne 0\) and \(\psi =0\), the payoff difference equals

$$\begin{aligned} \mathcal {J}^{T}-\mathcal {J}^{Q}=\frac{\sigma _{u}^{2}}{2b\left( 1-\beta \right) }\left[ \Gamma +\left( 1-\frac{\beta g}{b}\right) \right] . \end{aligned}$$
(29)

The function \(\Gamma >0\) depends on \(f_{2}\), d and \(\delta \) (among other parameters).

We summarize the implications of these expressions in the following:

Remark 6

For a flow pollutant (\(\Delta =0\)): (i) If (a) \(\rho =0\), or if (b) \(\rho \ne 0\) and \(\psi \ne 0\)), the policy ranking depends only on the relative slopes (appropriately discounted) of the marginal benefit and damage functions. (ii) When neither conditions (a) or (b) in part (i) hold the policy ranking also depends on the parameters associated with abatement capital.

The next section considers the problem of a stock-related pollutant with \(\rho \ne 0\) and \(\psi \ne 0\); there the policy ranking does depend on the parameters associated with abatement capital—in contrast to Remark 6.i. Here we explain why stock and flow pollutants have this qualitative difference.

As Remark 5 notes, the expected levels of emissions and of investment are the same under taxes and quotas. The first-order condition for investment (using Eqs. 19 or 23) is

$$\begin{aligned} -C_{I}\left( I_{t},K_{t-1}\right) +\beta \mathcal {J}_{K}^{i}\left( K_{t},S_{t},\theta _{t}\right) =0, \ \ \ \ i=T,Q. \end{aligned}$$

The linear-quadratic structure with additive uncertainty implies that \(\mathcal {J}_{K}^{T}\left( K_{t},S_{t},\theta _{t}\right) \equiv \mathcal {J}_{K}^{Q}\left( K_{t},S_{t},\theta _{t}\right) \): the shadow value of capital and therefore the investment rules under taxes and quotas, conditional on \(\left( K_{t-1},S_{t},\theta _{t}\right) \), are identical.

For a stock pollutant, \(\mathcal {J}_{K,S}^{i}\ne 0\), so investment at time t depends on the pollution stock at the beginning of the next period, \(S_{t} \). That pollution stock depends on current emissions; therefore, emissions in period t affect investment in period t. Conditional on the regulator’s information at the beginning of a period, the current level of emissions is random under taxes and is a choice variable under quotas. Therefore, conditional on the information at the beginning of a period, the distribution function for the current level of investment differs under the two policies. The expected payoff difference therefore depends on the parameters associated with abatement capital.

In contrast, with a flow pollutant, the current level of emissions has no effect on future payoffs. The shadow value of capital \(\mathcal {J}_{K}^{i}\) depends only on \(\left( K_{t},\theta _{t}\right) \). With a flow pollutant, the current investment and current emissions decisions are decoupled. Therefore, the value to the regulator of the difference in emissions under taxes and quotas does not depend on investment costs.

7 An Application to Climate Change

With a stock externality problem such as greenhouse gasses, we have three state variables (greenhouse gasses, the capital stock, and the expected cost shock) and therefore cannot obtain an analytic solution. However, using Proposition 1, it is straightforward to solve the tax and quota problems numerically. The resulting control problem is almost standard, except that new information arrives within a period, so there are two stages of optimization within a period. This fact accounts for the nested maximization in Eqs. (19) and (23). For the linear-quadratic model, we can solve each of these dynamic programming problems by solving a matrix Riccati equation.

Table 1 The model of global warming

7.1 Model Calibration

Table 1 describes the model. In order to calibrate the general linear-quadratic model described in the previous section, we assume that benefits are equal to the value of abatement cost that the firm avoids by increasing emissions. Abatement costs are a quadratic function of abatement, \(x_{t}^{b}-x_{t}\) (row 6), where the BAU emissions \(x_{t}^{b}\) is a decreasing linear function of abatement capital (row 5). A higher level of abatement capital makes it cheaper to reduce emissions, and also decreases the marginal abatement costs. The cost variable \(\tilde{\theta }\) (which is proportional to the random variable \(\theta \) used above) changes the level of BAU emissions and therefore changes marginal abatement costs. Row 7 of Table 1 repeats the general linear-quadratic model; the final row gives the parameter restrictions under which this general model reproduces the special model described in the rows 2–6 of the table.Footnote 10 If \(m_{1}=0\), capital does not affect abatement costs. This limiting case reproduces previous linear-quadratic models of a stock pollutant (Karp and Zhang 2005).

Table 2 lists baseline parameter values. In presenting the simulation results, we use the parameter \(\pi \), defined as the percentage loss in Gross World Product due to a doubling of greenhouse gasses. This parameter is linearly related to g, the slope of marginal damages. Our baseline parameters assume that \(\pi =1.33\), an estimate that has been widely used. For comparison, we also discuss results when \(\pi =3.6\) (the average of expert opinions, reported in Nordhaus 1994) and \(\pi =21\) (the maximum of these expert opinions).

Table 2 Parameter values for the baseline model

Appendix 3 explains our calibration of the abatement costs (rows 3–6 of Table 1). Our companion paper (Karp and Zhang 2006)Footnote 11 describes the calibration of the growth and damage functions (rows 1 and 2 of Table 1) and of the equation for the random shock (Eq. 12).

7.2 Numerical Results

We begin by summarizing results from earlier static and dynamic models that exclude abatement capital. We then discuss new results—those directly related to abatement capital.

7.2.1 Previous Results

Previous papers study the relation between the policy ranking and parameters in the linear-quadratic model with additive errors (Hoel and Karp 2002; Newell and Pizer 2003; Karp and Zhang 2005). Those papers show that the difference in payoffs under optimal taxes and quotas, \(\mathcal {J}^{T}-\mathcal {J}^{Q}\), is decreasing in \(\frac{g}{b}\). The intuition is the same as in Weitzman (1974)’s static model. A larger value of g means that damages are more convex in S. In view of Jensen’s inequality, as damages become more convex it becomes more important to control emissions exactly (as under a quota) rather than to choose only the expected value of emissions (as under a tax). A higher value of b makes it more important for the firm to be able to respond to changes in the cost variable by changing emissions. It is able to respond under a tax but not under a quota.

There is a critical value of \(\frac{g}{b}\) above which quotas are preferred. This critical value is decreasing in both \(\beta \) and \(\Delta \). When more weight is put on future costs and benefits (higher \(\beta \)), or when the stock is more persistent (higher \(\Delta \)), it is more important to control the exact level of emissions (as under quotas) rather than the first moment of emissions (as under taxes).

The previous papers calibrate models using parameter values that are consistent with published estimates of the abatement costs and environmental damages associated with greenhouse gasses. These studies find that taxes dominate quotas for the control of greenhouse gasses.

These qualitative results also hold for our parameterization of the model with endogenous abatement capital. This robustness is worth noting, but our analysis adds nothing to the intuition for these results, and therefore we do not discuss them further. Instead, we emphasize the comparative statics and dynamics associated with endogenous abatement costs.

7.2.2 The Role of Abatement Capital

There are three important parameters related to abatement capital: \(\delta \) (capital stock persistence), d (slope of marginal investment cost), and \(m_{1}\) (marginal effect of capital on BAU emissions) We consider the first two briefly, and then concentrate on the third. In all cases, we perform the obvious experiment of varying one of these parameters, holding all others constant. This experiment has a shortcoming that we discuss later, where we consider a second type of experiment.

We explained why a more durable pollution stock (higher \(\Delta \)) decreases the preference for taxes. However, a more durable capital stock (higher \(\delta \)) increases the preference for taxes. Under taxes, the firm responds to a cost shock by changing the level of emissions. For \(\rho \ne 0\),Footnote 12 the firm responds to a cost shock by changing the level of investment, thereby changing the future level of capital under both taxes and quotas. The adjustment mechanism via capital provides a partial substitute for the inability to change emissions under quotas. A large value of \(\delta \) means that current investment has long-lasting effects, tending to make capital less flexible. The decreased flexibility associated with larger values of \(\delta \) increases the value of being able to respond to the cost shock by changing emissions. A larger value of \(\delta \) therefore increases the advantage of taxes.

Fig. 1
figure 1

Dependence of expected payoff difference on cost-related parameters

A lower value of \(m_{1}\) (a decrease in the marginal effect of capital on BAU emissions) or a larger value of d (an increase in the adjustment cost for abatement capital), favors quotas. Figure 1 shows the relation between the difference in payoffs (the value of using taxes minus the value of using quotas) and the parameters d and \(m_{1}\) for three values of \(\pi \), holding all other parameters constant. (Recall that \(\pi \) is the percentage loss in global world product due to a doubling of greenhouse gasses.) When environmental damages are moderate (\(\pi =1.33\) or \(\pi =3.6\)) the difference in payoffs is insensitive to changes in d and \(m_{1}\); for large environmental damages (\(\pi =21\)) the change in either parameter has a noticeable affect on the payoff difference. Previous linear-quadratic models that do not include investment capital are a special case of the model here, obtained by letting \(d\rightarrow \infty \) or \(m_{1}\rightarrow 0\). Those models tend to understate (slightly) the advantage of using taxes.

As d increases, capital increasingly resembles a fixed input; as \(m_{1}\) decreases, abatement capital has less effect on the marginal benefit of pollution. A larger value of d or a smaller value of \(m_{1}\) both imply less flexibility of marginal abatement costs. This diminished flexibility favors quotas, just as does the diminished flexibility in marginal abatement costs associated with a smaller value of b (the slope of \(B_{x}\)).

In all cases, the present discounted value of the payoff difference under taxes and quotas is approximately 1 billion dollars, implying an annualized cost of about 50 million dollars. Our parameterization of abatement costs assumes that the annualized cost of stabilizing emissions is about 1 % of income, or 290 billion dollars. Thus, the payoff difference of the two policies is less than 0.02 of the estimated costs of stabilizing emissions.

The small difference in the expected payoffs may be due largely to the Principle of Certainty Equivalence, mentioned in Sect. 5: the expected stock trajectories are identical under taxes and quotas—only higher moments differ. Uncertainty in our calibrated model (but not in the general formulation) arises only because BAU emissions are uncertain. Given the (small) magnitude of this particular type of uncertainty, the higher moments of stocks simply are not very important. Models that do not satisfy the Principle of Certainty Equivalence find a larger payoff difference under taxes and quotas (Pizer 1999; Hoel and Karp 2001).

The relations between the equilibrium decision rules and levels of the state variables are as expected. The optimal quota (which equals the expected level of emissions under the optimal tax) decreases with the level of pollution and with the capital stock and increases with the lagged cost shock (for \(\rho >0\), as in our calibration). Equilibrium investment is an increasing function of the stock of pollution and a decreasing function of capital stock. Firms understand that a higher pollution stock will lead to lower future equilibrium emissions, increasing the marginal value of investment. A higher aggregate capital stock encourages the regulator to reduce future emissions, increasing the value of investment. However, the representative firm’s level of capital equals the aggregate level. For a given quota or tax, a higher capital stock reduces the marginal value of investment. The net effect of higher capital stocks is to reduce investment.

As we mentioned above, the comparative dynamics associated with a change in a single parameter value might be misleading. For example, when we decrease \(m_{1}\) holding other parameters constant, we change the BAU level of emissions and the abatement costs associated with a particular emissions trajectory, in addition to changing the marginal effect of capital on abatement costs. Here we consider a slightly different experiment: When we vary \(m_{1}\) we make offsetting changes in \(m_{0}\) in order to maintain current BAU emissions at 5.2, and we require that the year 2100 BAU emissions are consistent with a particular IPCC scenario.

Our baseline calibration (\(m_{1}=0.7266\)) makes our model consistent with the IPCC IS92a scenario that projects BAU CO\(_{2}\) stocks of 1500 GtC in the year 2100—an approximate doubling of stocks relative to pre-industrial levels. For comparison we also choose parameters that are consistent with the IS92c scenario of a 35 % increase in CO\(_{2}\) concentration (\(m_{1}=0.0416\)) and with the IS92e scenario of a 170 % increase in CO\(_{2}\) concentration (\(m_{1}=1.6622\)).

Fig. 2
figure 2

Changes in expected pollution flows and stocks relative to BAU levels

Figure 2 graphs optimal abatement levels, i.e., the difference in the BAU and the optimal levels of emissions (the left panel) and the difference between BAU and the regulated pollution stock (the right panel), as a function of time. The three graphs in each panel correspond to the three values of \(m_{1}\). In all cases, abatement increases over time. Both the level and the change over time of abatement is greatest when abatement capital has a large effect on marginal abatement costs (\(m_{1}\) is large). This result is further evidence that the consideration of endogenous investment in abatement capital increases the optimal level of abatement.

8 Discussion and Conclusion

The previous literature that compares taxes and quotas assumes that firms solve a sequence of static problems. Our paper recognizes that firms also make investment decisions which affect their future abatement costs. The value of this investment depends on the severity of future environmental restrictions, so the policymaker might have an incentive to announce future environmental policies in order to influence current investment. When this incentive arises, the firms’ investment decisions are not constrained optimal, so the regulator would increase welfare if he were able to use an investment tax/subsidy together with the emissions policy. We showed that for general functional forms, when the regulator uses a quota (cap and trade), the competitive firms’ investment policy is information-constrained efficient. In contrast, for general functional forms, when the regulator uses an emissions tax, the firms’ investment policy is not information-constrained efficient. In this sense, there is an advantage to quotas, relative to emissions taxes, that had not previously been recognized.

This particular advantage disappears under a “separability condition” on the primitive functions. The linear-quadratic model, generalized to include endogenous investment, satisfies this condition. Using a calibrated model and a numerical solution, we found that making capital more durable or more effective in reducing the cost of abatement, or reducing the marginal adjustment cost of capital, all favor the use of taxes rather than quotas. These numerical results and the previously described analytic result lead to a mixed message for the comparison of policies. Within the functional assumptions that most previous studies have used, we find that the inclusion of endogenous investment increases the advantage of taxes. However, for more general functional forms, quotas have an entirely different type of advantage. We do not know anything about the magnitude of the latter advantage; its measurement would require a more complicated (i.e., non-linear quadratic) model, which presents problems of calibration, and it would also require the solution to a dynamic game rather than an optimization problem.

We close by discussing several other views of the relative efficiency of taxes and quotas. One view is that the risk of extreme environmental damages, associated with high GHG stocks, means that over some range damages are likely to be very convex in stocks, i.e., the slope of marginal damages is actually very large. In addition, over a long enough time span, given the opportunities for the development and adoption of new technologies, the marginal abatement cost curve is actually rather flat. Based on these (in our view, plausible) observations, and reasoning from the standard static model, Dietz and Stern (2007) conclude that quantity restrictions are more efficient than taxes for climate policy. We have three reasons for doubting this conclusion. First, the use of the static framework (or the open loop assumption in a dynamic setting) is not appropriate for studying climate policy, because the current policymaker cannot choose policy levels decades into the future. More rapid adjustment of policy, i.e., a decrease in the length of period between policy adjustments, favors the use of taxes. Second, even if the possibility of extreme events makes the marginal damage function much steeper than current estimates suggest, the magnitude of the slope of damages would have to be implausibly large to favor quotas. (Hoel and Karp 2002 demonstrate both of these claims.) Third, the current paper shows that endogenous investment in abatement capital is likely to increase the advantage of taxes, given the linear-quadratic framework.

A second view, which we have heard propounded orally but not in writing, is that the existing models inaccurately describe the abatement problem and are therefore simply inappropriate for comparing policies. The objection is that firms will first exhaust the cheapest abatement opportunities; once these are used, they are unavailable in the future. There are (at least) two ways to respond to this objection. First, a stationary upward sloping marginal abatement cost curve (used in most previous analyses) is obviously consistent with the claim that firms first use the cheapest way of reducing emissions, and then use more expensive means when regulation becomes stricter. However, because abatement is a flow decision, the fact that the cheap abatement opportunities were used early in the program does not mean that they are unavailable later in the program. The firms move up their marginal abatement curves as the policy becomes stricter. A second response interprets the objection as a call to use a model in which abatement is a stock rather than a flow decision—specifically, a model with endogenous investment in abatement capital, in which there is a sequence of increasingly expensive technologies that reduce emissions. It would be fairly straightforward to produce that kind of model, using a slight modification of the model in this paper. We assumed that the cost of investment is a function of gross investment. To address the objection, we could modify the cost function so that the cost of an additional unit of capital increases with the current level of capital. With this formulation, the firms’s level of capital is a proxy for it’s stage of technology. Because it first adopts the cheapest (most efficient) technologies, it becomes increasingly expensive to make further reductions in abatement costs. It is not clear how this change affects the policy ranking.

There are several other model variations that would address other interesting questions. For example, network externalities may cause the productivity of a firm’s capital to increase with the level of aggregate capital. Also, if we think of investment as being R&D rather than the installation of new capital, there are likely to be important spillovers. At least in the linear-quadratic framework, it would be straightforward to include such spillovers. There may be intra-firm increasing returns to scale. There might also be learning by doing, so that an increase in cumulative abatement decreases abatement costs. The inclusion of intertemporal trade (banking and borrowing) under quantity restrictions would be even more interesting. Because GHGs are a stock pollutant, the stream of damages can be sensitive to the cumulative emissions over a long period of time without being sensitive to the precise timing of emissions. Intertemporal trading allows firms to optimally allocate over time a given cumulative level of emissions. The introduction of banking and borrowing (under the quantity restriction) would likely significantly erode the advantage of taxes. The effect of banking and borrowing on the incentive to invest is not clear. These questions, and the model variations that they entail, are the subject of current research.