1 Introduction

Committed emissions from existing and proposed energy infrastructure represent more than the entire remaining carbon budget if the 1.5 °C target is to be achieved, and perhaps two-thirds of the remaining carbon budget if mean global warming is to be limited below 2 °C (Tong et al. 2019). Energy infrastructure has substantial (and partly sunk) investment costs and can remain operative for decades once in place. Hence, it is of crucial importance to direct investment away from emission intensive fossil fuels and towards low-emission alternatives if we are to curb global warming.

In this paper, I examine emission regulation and transition dynamics in the presence of long-lived capital in a competitive partial equilibrium model with resource scarcity. The modeling framework includes capital accumulation and convex investment costs to capture the sluggish response to regulation caused by long-lived capital. I use regulation of the electric power industry as an example throughout the paper.

The socially optimal time trajectory can be implemented in competitive equilibrium in a setting with convex investment cost and resource scarcity by a standard Pigouvian emission tax if the firms’ private discount rate equals the social discount rate. Otherwise, i.e., if the firms are more impatient than the social planner, production capacity adjusts too slowly to the emission tax.Footnote 1 Therefore, a tax or subsidy on investment is needed to implement the socially optimal time trajectory in this case. This provides a rationale for using emission taxes and investment subsidies simultaneously, which is relevant for public policy because many countries have or consider regulation featuring both instruments (see, e.g., IPCC 2012). Further, emission taxes above the Pigouvian level may be optimal if the private discount rate exceeds the social discount rate and an emission tax is the only policy instrument available. The explanation is that a higher emission tax helps speed up the transition, which increases welfare if the firms discount the future too strongly. A caveat is that such policies tend to be time inconsistent. Last, the time lags of environmental policy may be substantial, and I also examine current effects of increasing (or introducing) future emission taxes.Footnote 2 A key research question here is whether the green paradox (Sinclair 1992; Sinn 2008) holds when the model includes capital accumulation, convex investment costs and resource scarcity.Footnote 3

The analysis highlights that large investments towards a less emission intensive production capacity mix is needed early on. The reason is that current investment in relatively clean production technologies decreases the cost of future emission reductions. This corroborates a key result in Vogt-Schilb et al. (2018), who explicitly models investment in abatement equipment and finds that it is optimal to start a long-term emission–reduction strategy with significant and early abatement investment, even if the optimal carbon price starts low and grows progressively over time.Footnote 4

Regarding current effects of anticipated future emission taxes, future taxes have three key dynamic effects on current emissions from electricity generation in a model with exhaustible resources and long-lived capital. First, future taxes increase the future cost of combusting fossil fuels. This reduces the profitability of upkeep and investment in emission intensive power plants, which again reduces the demand for fossil fuels. Second, future emission taxes increase future residual demand for low-emission electricity. This increases the profitability of investment in low-emission power plants, which again increases the electricity supply from these plants to the market. This reduces the equilibrium consumption of fossil fuels. Because of convex investment costs, it is cost efficient for the firms to begin adaptation to anticipated future emission taxes immediately. Therefore, emissions may decrease even before the tax has been implemented. Third, owners of scarce fossil resources can pre-empt a future emission tax by accelerating production of fossil energy such that more extraction take place before the tax is enacted. This is the well-known green paradox (see, e.g., Sinclair 1992; Sinn 2008). Whereas the two first mechanisms reduce the demand for fossil fuels, the third mechanism increases the supply of fossil fuels. Therefore, it is theoretically ambiguous whether the market equilibrium will feature increased or decreased fossil fuel consumption (before the emission tax is implemented), as compared to the case without future taxes.

I illustrate the analytical findings with a stylized numerical model of the US electricity market. The numerical model uses the Path solver in GAMS to solve the theoretical model as a mixed complementarity problem, given assumptions about quadratic functional forms for utility, environmental damage and production costs. The electricity demand function does not change over time in the numerical simulations, and technical change is either omitted or exogenous. The numerical simulations suggest that the green paradox does not hold in the presence of long-lived capital: Early emissions decrease in all cases following announcement of future emission taxes, except when investment costs are very low (the model then collapses towards the standard exhaustible resource model without long lived capital; see, e.g., Sinclair 1992, and Sinn 2008).

The analysis predicts that forward-looking firms will reduce current use of inputs subject to stringent future regulation. In this respect, it is interesting to observe the current struggle of publicly traded US coal companies.Footnote 5 Clearly, there are several factors behind this, like slower economic growth, cheap natural gas and current environmental regulation. Nevertheless, it seems reasonable that also bleaker prospects caused by future environmental regulation and increased competition from renewable power partly explain the investors’ vanishing interest in coal.Footnote 6

The presence of adjustment costs is an important premise for the present paper. Adjustment costs was early recognized, both related to firms’ net capital investment decisions (Gould 1968; Lucas 1976) and related to changing the number of employees (Holt et al. 1960; Oi 1962). Capital adjustment costs arise, e.g., if the price of capital increases in the rate of investment. Labor adjustment costs include costs related to hiring, training and layoff. Bellofatto and Besfamille (2018) use the probability that a project is finished early and without the need of refinancing as a proxy for administrative capacity. Whereas capital adjustment cost is the type of adjustment cost explicitly modeled in the present paper, the results may be relevant also in the case of labor adjustment cost or administrative capacity constraints. In the macroeconomics literature, Kydland and Prescott (1982) assumes that it takes time to install new equipment, and Wickens (2008, p. 33) assume that the cost of a unit of investment depends on how large it is in relation to the size of the existing capital stock.

The paper also relates to a body of literature which examines resource extraction under capacity constraints, but without pollution (Kemp and Van Long 1980; Amigues et al. 1998; Holland 2003). Particularly relevant to the present paper, Amigues et al. (2015) show that optimal investment in renewables starts before the end of fossil fuel usage in a setting with adjustment costs and endogenous capacity constraints. Further, Coulomb et al. (2019) examines the optimal transition from coal to gas and renewables in a model with capacity constraints and adjustment costs. Similarly to the present paper, they find that different energy sources should be used together in order to smooth out adjustment costs. Whereas the present paper’s modeling of the supply side shares many similarities with this body of literature, this paper adds to the literature by examining regulation and including environmental constraints (except for Coulomb et al. 2019, which also models pollution).

Another relevant branch of literature examines optimal use of energy sources, given emission constraints and exhaustible fossil fuels. This literature models renewable energy as a clean backstop and pays limited attention to adjustment costs. A general result without adjustment costs is that the clean backstop is kept on hold until the use of fossil fuels ceases (Chakravorty et al. 2006, 2008; van der Ploeg and Withagen 2012).Footnote 7 The transition is different in the present paper, as renewables (and nuclear power) is phased in early on and exploited together with emission intensive fossil fuels along the socially optimal time trajectory.

Section 2.1 characterizes the competitive partial equilibrium. The tax scheme that can implement the socially optimal time trajectory is presented in Sect. 2.2. Section 2.3 examines dynamic effects following announcement of future emission taxes. Section 2.4 investigates second-best taxation when an emission tax is the only policy instrument available and firms discount the future too strongly. The numerical illustration is in Sects. 3 and 4 concludes.

2 Theoretical Analysis

Let the vector \(\varvec{x}_{t}=\left( x_{t}^{1},x_{t}^{2},\ldots ,x_{t}^{\bar{i}}\right) \) denote a representative consumer’s consumption bundle of goods \(i\in I=\left\{ 1,2,\ldots ,\bar{i}\right\} \) in period \(t\in T=\left\{ 1,2,\ldots ,\overline{t}\right\} \). The associated benefit is given by the increasing and strictly concave utility function \(u\left( \varvec{x}_{t}\right) \). Each good \(x_{t}^{i}\) is produced by a representative firm (or sector) i. I assume market clearing such that production of \(x_{t}^{i}\) equals consumption of \(x_{t}^{i}\) for all \(i\in I\) and \(t\in T\). The firms’ discount factor is given by \(\delta \in \left( 0,1\right] \) and all derivatives are assumed to be finite.

One interpretation of this model setup is an economy with concave utility from electricity consumption, where electricity may be derived from \(\bar{i}\) energy sources: coal, gas, hydro power, and so forth. I will use this as an example throughout the paper, i.e., we have one representative firm for each type of electricity generation technology.

The investment costs of power generation are essentially capital construction costs and land, including “regulatory costs” for obtaining siting permits, environmental approvals, and so on. These costs may increase substantially in the presence of economy wide capacity constraints, like limited availability of skilled labor or raw materials. I assume that the investment cost function, \(\chi ^{i}\left( y_{t}^{i}\right) \), is strictly convex and increasing in investment \(y_{t}^{i}\), with minimum at \(\chi ^{i}(0)=0\).Footnote 8 The model framework allows the representative firm to actively reduce capacity faster than capital depreciation (\(y_{t}^{i}<0\)).Footnote 9

Operating costs for power plants include fuel, labor and maintenance costs. I divide these costs into fixed and variable operating costs. Fixed operating and maintenance costs, denoted \(f^{i}(Y_{t}^{i})\), include, e.g., salaries for facility staff and maintenance that is scheduled on a calendar basis. They do not vary significantly with a plant’s electricity generation, but increase in capacity; i.e., we have \(\partial f^{i}(Y_{t}^{i})/\partial Y_{t}^{i}\equiv f_{Y_{t}^{i}}^{i}\left( \cdot \right) >0\). The variable operating costs include the cost of consumable materials and maintenance that may be scheduled based on the number of operating hours or start-stop cycles of the plant. These costs are captured by the variable cost function \(k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\). Variable operating costs increase in production \(x_{t}^{i}\) and decrease in the capacity measure \(Y_{t}^{i}\). This is captured by the first order derivatives \(k_{x_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), second order derivatives \(k_{x_{t}^{i}x_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}Y_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\le 0\) and cross derivative \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\). The firms may reduce their current emissions, \(e_{t}^{i}\ge 0\), by (flow) abatement activities. Abatement, measured as \(e_{t}^{i,BaU}-e_{t}^{i}\), is not free and production cost decreases in emissions if emissions fall below the business as usual emissions, \(e_{t}^{i,BaU}\) (associated with no emission regulation); i.e., we have \(k_{e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), \(k_{e_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\) and \(k_{x_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\) for \(e_{t}^{i}<e_{t}^{i,BaU}\).Footnote 10 Note that abatement within a given technology is modeled as a flow variable; i.e. the current emission intensity does not matter for the future emission intensity. This differs from the emission reductions that may be achieved with a larger share of low-emission technology capacity (e.g., replacing coal capacity with renewables). Electricity generation (flow) abatement is most relevant for fossil fuels, where it may involve, e.g., switching to cleaner types of coal or use of combined cycle power plants or scrubbers (sulfur dioxide).Footnote 11

Production capacity evolves following the state equation:

$$\begin{aligned} Y_{t+1}^{i}=\beta Y_{t}^{i}+y_{t}^{i}, Y_{0}^{i}=\overline{Y}^{i},\ \forall i,\forall t, \end{aligned}$$
(1)

where \(\beta \in \left( 0,1\right] \) is a capital depreciation factor and \(\bar{Y}^{i}\) is initial capacity (a constant determined by history).

Assume that a subset of the representative firms \(j\in J=\left\{ \tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\right\} \) use a scarce resource as an input factor in production (\(J\subseteq I=\left\{ 1,2,\ldots ,\tilde{i},\tilde{i}+1,\ldots ,\bar{i}\right\} \)). These firms have an additional term \(h^{j}\left( S_{t}^{j}\right) x_{t}^{j}\) added to their variable operating cost function, where the remaining resource stock, \(S_{t}^{j}\), evolves following the state equation:

$$\begin{aligned} S_{t+1}^{j}=S_{t}^{j}-x_{t}^{j}, S_{0}^{j}=\bar{S}^{j},\ \forall j,\forall t. \end{aligned}$$
(2)

Here \(\bar{S}^{j}\) is an exogenous constant and I have normalized units in (2) such that one unit of production requires one unit of resource. We have the resource stock constraint \(S_{t}^{j}\ge 0\). Further, resource scarcity implies that unit operating costs decrease in the remaining resource stock; i.e., we have \(h_{S_{t}^{j}}^{j}\left( S^{j}\right) <0\); e.g., because the cheapest resource deposits are extracted first. Note that firm \(j\in J\) is an integrated firm that extracts the fossil fuels needed for electricity generation by itself. I assume that \(lim_{S_{t}^{j}\rightarrow 0}h\left( S_{t}^{j}\right) =\infty \); i.e., the cost of resource extraction approaches infinity as the resource stock is completely exhausted.Footnote 12

Total operating costs are given by:

$$\begin{aligned} c^{i}\left( \varvec{z}_{t}^{i}\right) ={\left\{ \begin{array}{ll} \begin{array}{l} f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i}),\\ f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})+h^{i}\left( S_{t}^{i}\right) x_{t}^{i}, \end{array} & \begin{array}{l} \forall i\le \tilde{i},\forall t,\\ \forall i>\tilde{i},\forall t, \end{array}\end{array}\right. } \end{aligned}$$
(3)

with \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i}\right) \) for \(\forall i\le \tilde{i}\) and \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i},S_{t}^{i}\right) \) for \(\forall i>\tilde{i}\). For example, the first \(i=1,2,\ldots ,\tilde{i}\) may denote firms using non-exhaustible energy sources like renewables and nuclear, whereas the remaining \(j=\tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\) are firms combusting fossil energy sources like petroleum and gas. This cost structure implies that unit operating costs, \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}\), have the familiar skewed U-shape with minimum at \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}=c_{x_{t}^{i}}^{i}\left( \varvec{z}^{i}\right) \) (for any given \(e_{t}^{i}\)).Footnote 13

Clearly, the relationship between \(f^{i}\left( \cdot \right) \) and \(k^{i}\left( \cdot \right) \) may differ markedly across technologies. For example, nuclear power plants feature high fixed costs relative to the variable operating costs, as compared with gas-fueled power plants. The strict convexity of \(\chi ^{i}\left( \cdot \right) \) and \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(\cdot )<0\) implies that the cost associated with any given change in the energy mix \(\varvec{x}\) may be reduced by increasing the number of time periods during which the change occurs. Specifically, the cost of reducing GHG emissions increases with the speed of emission reductions. I will discuss the results in a setting where the initial capacity mix \(\left( \overline{Y}^{1},\overline{Y}^{2},\ldots ,\overline{Y}^{\bar{i}}\right) \) is characterized by too much dirty capacity, such that the socially optimal capacity declines over time for relatively emission intensive production technologies, and increases for relatively clean energies. Whereas this eases the discussion of the results, and arguably is the case for the electricity industry in many countries today, it is not necessary for the validity of the analytical results.

The emissions stock evolves following the state equation:

$$\begin{aligned} E_{t+1}=\alpha E_{t}+\sum _{i\in I}e_{t}^{i}, E_{0}=\bar{E},\ \forall t, \end{aligned}$$
(4)

where \(\bar{E}\) is a constant determined by history and \(\alpha \in \left[ 0,1\right) \) denotes the stock depreciation factor from one period to the next. Environmental damage from emissions is given by \(d\left( E_{t}\right) \), where \(d(\cdot )\) is weakly convex and increasing.Footnote 14

Let \(p_{t}^{i}\) denote the endogenous consumer price on \(x_{t}^{i}\) (net of taxes). I assume the regulator has access to three regulatory instruments: an emission tax, \(\tau _{t}\), a tax on investment, \(\theta _{t}^{i}\), and a tax on extraction of exhaustible resources, \(\phi _{t}^{i}\). The representative firms \(i\in I\) may be interpreted as representing different sectors of the electric power industry; i.e., the wind power sector, the nuclear power sector, and so forth. I will therefore use the term sector specific taxes when referring to \(\theta _{t}^{i}\) and \(\phi _{t}^{i}\).Footnote 15 The investment tax \(\theta _{t}^{i}\) may take three forms: (i) a standard unit tax on investment if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}>0\), (ii) a subsidy to decommissioning if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}<0\), and (iii) a subsidy on investment if \(\theta _{t}^{i}<0\). The extraction tax \(\phi _{t}^{i}\) is needed to slow down extraction of scarce resources if the private discount rate is higher than the social discount rate (see Sect. 2.2); i.e., \(\phi _{t}^{i}>0\) is enacted to preserve scarce resources for use in the future.

2.1 Market Equilibrium

The competitive representative firm \(i\in I\) maximizes the present value of profits over the remaining time horizon solving:

$$\begin{aligned} \max _{x_{t}^{i},y_{t}^{i},e_{t}^{i}}\sum _{t\in T}\delta ^{t-1}\left[ p_{t}^{i}x_{t}^{i}-\left( c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\right) -\left( \chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\right) -e_{t}^{i}\tau _{t}\right] ,\ \forall t, \end{aligned}$$
(5)

where \(c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\) is operating costs including the extraction tax, \(\chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\) is the cost of investment, including investment taxes, and \(e_{t}^{i}\tau _{t}\) is the emission tax payment. The maximization is subject to equations (1), (2), (3) and the resource constraint \(S_{\overline{t}}^{j}\ge 0\).Footnote 16

A price-taking representative consumer maximizes net utility solving:

$$\begin{aligned} \varvec{x}_{t}=\arg \max _{\varvec{x}_{t}}\left[ u\left( \varvec{x}_{t}\right) -\varvec{p}_{t}\varvec{x}_{t}^{\prime }\right] ,\ \forall t, \end{aligned}$$
(6)

where \(\varvec{p}_{t}=\left( p_{t}^{1},p_{t}^{2},\ldots,p_{t}^{\bar{i}}\right) \) and \(\varvec{x}_{t}^{\prime }\) is the transpose of \(\varvec{x}_{t}\). The associated first order condition is \(u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right) \le p_{t}^{i}\) for all \(i\in I\) and \(t\in T\).

We have the following result:

Lemma 1

The competitive partial equilibrium sequence triple \(\left\{ x_{t}^{i,*},y_{t}^{i,*},e_{t}^{i,*}\right\} \), solving (5) and (6) subject to equations (1), (2) and (3), satisfies:

$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\phi _{t}^{i},\ \forall i\le \tilde{i},\forall t,\end{aligned}$$
(7a)
$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\mu _{t}^{i}+\phi _{t}^{i},\ \forall i>\tilde{i},\forall t,\end{aligned}$$
(7b)
$$\begin{aligned} \lambda _{t}^{i,*}&\le \chi _{y_{t}^{i}}^{i}\left( y_{t}^{i,*}\right) +\theta _{t}^{i},\ \forall i,\forall t,\end{aligned}$$
(7c)
$$\begin{aligned} \lambda _{t}^{i,*}&=-\delta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \delta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,*}\right) ,\ \forall i,\forall t<\bar{t,}\end{aligned}$$
(7d)
$$\begin{aligned} \mu _{t}^{i,*}&=-\sum _{r=t+1}^{\overline{t}}\delta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,*}\right) x_{r}^{i,*},\ \forall i>\tilde{i,}\forall t<\bar{t,}\end{aligned}$$
(7e)
$$\begin{aligned} \tau _{t}&\le -c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) ,\ \forall i,\forall t, \end{aligned}$$
(7f)

with \(Y_{t}^{i,*}\) and \(S_{t}^{i,*}\) as given by equations (1) and (2), respectively. We have \(\lambda _{\bar{t}}^{i,*}=\mu _{\bar{t}}^{i,*}=0\). The weak inequalities are strict if and only if we have a corner solution for the relevant decision variable.Footnote 17

Proof

See "Appendix B". \(\square \)

We see from Lemma 1 that production of \(x_{t}^{i}\) increases in capacity \(Y_{t}^{i}\), marginal utility from consumption and the remaining resource stock (\(\forall i>\tilde{i}\)), whereas it decreases in production cost and extraction taxes. Note that Lemma 1 implies \(p_{t}^{1}=p_{t}^{2}=\ldots =p_{t}^{\bar{i}}\) if the goods are perfect substitutes. This is relevant if I is a set of electricity producers.

The variable \(\lambda _{t}^{i}\) is a (endogenous) shadow price representing the present value of the change in future profits caused by a marginal increase in current capacity. In the case where optimal production capacity declines towards a new and lower level (faster than capacity depreciation), higher capacity today induces too high fixed operating costs in the future. Hence, the shadow price \(\lambda _{t}^{i}\) is negative. Conversely, \(\lambda _{t}^{i}\) is positive if optimal capacity shifts upwards.Footnote 18

For sectors with scarce resources (\(\forall i>\tilde{i}\)), more extraction today implies less extraction in the future. Hence, the resource owners must not only decide whether to extract the resource, but also when to extract. This consideration is captured by the non-negative shadow price on the remaining resource stock, \(\mu _{t}^{j}\) , in Lemma 1 (the endogenous shadow price \(\mu _{t}^{j}\) is often referred to as the ’scarcity rent’ or ’Hotelling rent’). It is the present value change in future profits caused by a marginal increase in the remaining resource stock \(S_{t}^{j}\). Lemma 1 implies that each resource owner equalizes marginal discounted profits from extraction over time. Otherwise, the resource owners could increase the present value of their resource by moving production between periods. The resource rents typically vary between the different resources.

Whereas the isolated effect of increased emission taxes is to decrease production (given \(e_{t}^{i}>0\)), production of \(x_{t}^{i}\) in competitive equilibrium may increase in the emission tax \(\tau _{t}\) if \(x_{t}^{i}\) is a low-emission good. The reason is that the upward shift in each firm’s supply cost functions, caused by the emission tax, increases in the emission intensity of the firm. Therefore, residual demand and equilibrium production of relatively low-emission goods increases. Equation (7f) in Lemma 1 states the familiar result that marginal costs of emission reductions (i.e., flow abatement for each type of technology) equal the emission tax in the interior solution.

2.2 The Socially Optimal Tax Scheme

Let welfare W be measured as the present value of utility from consumption net of environmental damages, production costs and investment costs:

$$\begin{aligned} W=\sum _{t\in T}\zeta ^{t-1}\left[ u\left( \varvec{x}_{t}\right) -d\left( E_{t}\right) -\sum _{i\in I}\left[ c^{i}\left( \varvec{z}_{t}^{i}\right) +\chi ^{i}\left( y_{t}^{i}\right) \right] \right] , \end{aligned}$$
(8)

where \(1\ge \zeta \ge \delta \) is the social discount factor. The regulator faces a trade-off in the presence of convex investment costs. On the one hand, fast emission reductions reduce environmental damage. On the other hand, the convexity of investment costs imply that the cost of emission reductions can always be reduced by extending the time horizon over which emission reductions take place. We have the following result:

Proposition 1

Let the socially optimal sequence triple \(\left\{ x_{t}^{i,sp},y_{t}^{i,sp},e_{t}^{i,sp}\right\} \) maximize welfare (8) subject to equations (1) to (4). Then, the socially optimal time trajectory can be implemented in partial competitive equilibrium with the following taxes:

$$\begin{aligned} \tau _{t}^{sp}&=\zeta \sum _{r=t+1}^{r=\overline{t}}\left( \alpha \zeta \right) ^{r-t-1}d_{E_{r}}(E_{r}^{sp}),\ \forall t<\overline{t},\\ \theta _{t}^{i}&=-\lambda _{t}^{i,sp}+\lambda _{t}^{*,i},\ \forall i,\forall t,\\ \phi _{t}^{i}&=\mu _{t}^{i,sp}-\mu _{t}^{*,i},\ \forall i>\tilde{i},\forall t, \end{aligned}$$

where:

$$\begin{aligned} \lambda _{t}^{i,sp}&=-\zeta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \zeta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,sp}\right) ,\ \forall i,\forall t<\bar{t,}\\ \mu _{t}^{i,sp}&=-\sum _{r=t+1}^{\overline{t}}\zeta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,sp}\right) x_{r}^{i,sp},\ \forall i>\tilde{i},\forall t<\bar{t,} \end{aligned}$$

with \(\lambda _{t}^{*,i}\) and \(\mu _{t}^{*,i}\)as given in Lemma 1, \(\tau _{\bar{t}}^{sp}=\theta _{\bar{t}}^{i}=\phi _{\bar{t}}^{i}=0\) and \(\phi _{t}^{i,sp}=0\text { for } \forall i\le \tilde{i}\).

Proof

See "Appendix B".

We first examine the case where the social discount factor equals the private discount factor, such that \(\delta =\zeta \). In this case \(\theta _{t}^{i,sp}=\phi _{t}^{i,sp}=0\), given the optimal emission tax \(\tau _{t}^{sp}\). We observe that the expression for \(\tau _{t}^{sp}\) is the present value of the stream of future marginal stock damages following one additional unit of emissions along the socially optimal time trajectory. This is sometimes referred to as the social cost of carbon in the case of greenhouse gases.

The Pigou tax \(\tau _{t}^{sp}\) is only indirectly affected by the explicit modeling of production capacity and convex investment costs (the production capacity mix influence emissions and, hence, the optimal tax). Specifically, \(\tau _{t}^{sp}\) is not reduced during the first years to give firms time to adjust. On the contrary, higher investment costs cause slower development of relatively clean production capacity, which again entails higher emissions, a higher absolute value shadow price on the emissions stock, and higher optimal emissions taxes (see also Fig. 2 in Sect. 3.2). More expensive decommissioning of dirty production capacity (e.g., coal) has the same effect (see Figure 10 in “Appendix A”).

Assuming an interior solution for (flow) abatement, \(e_{t}^{i,sp}<e_{t}^{i,BaU}\), we have \(\tau _{t}=k_{e_{t}^{i}}^{i}\left( \cdot \right) \) in each time period. Marginal abatement cost then increases over time if the optimal stock of carbon in the atmosphere increases over time (because marginal environmental damage and, thereby, the optimal tax \(\tau _{t}^{sp}\), increases over time). Whereas this abatement profile agrees with Nordhaus (1991; 1992), the present analysis highlights that substantial investments may be necessary early on (see Fig. 1 in Sect. 3.2). The reason is that it takes time to implement the emission reductions necessary to curb global warming. The well-known result that lower (flow) abatement costs reduces the optimal emission tax remains valid in the present model setup (Weitzman 1974). Cheaper abatement also reduces the importance of redirecting investment towards less-emission intensive production capacity.

A common assumption when deriving the socially optimal time trajectory is that the firms’ private discount rate equals the discount rate of the social planner (see, e.g., Nordhaus 1991, 1992; Golosov et al. 2014). This assumption may be questionable, at least when applied to major environmental challenges like greenhouse gas emissions from power plants and climate change. The Stern Review (Stern 2007), and the following discussion about appropriate social discount rates in cost-benefit analysis, indicates that the social discount rate may be below capital market interest rates, at least in the case of climate change (Weitzman 2007; Tol and Yohe 2006). Indeed, the Stern Reviews’s conclusions about the need for decisive immediate action hinges on the assumption of a near-zero pure time preference discount rate, which are inconsistent with today’s marketplace real interest rates and savings rates (Nordhaus 2007). Goulder and Williams (2012) argue that we should distinguish between a social-welfare-equivalent discount rate appropriate for determining whether a given policy would augment social welfare and a finance-equivalent discount rate suitable for determining whether the policy would offer a potential Pareto improvement.Footnote 19

The case where the social discount factor is larger than the private discount factor (\(\delta <\zeta \)) involves \(\theta _{t}^{i,sp}\ne 0\) (\(\forall i\)) and \(\phi _{t}^{i,sp}>0\) for sectors with scarce resources (\(\forall i>\tilde{i}\)). The optimal investment tax, \(\theta _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on capacity \(Y_{t}^{i}\) along the socially optimal time trajectory. Note that the socially optimal investment tax is positive (negative) if production decrease (increase) over time. For example, Proposition 1 may imply decommissioning subsidies to coal plants, whereas investment in renewable energy is subsidized; see Fig. 3 in the numerical Sect. 3.2.

The optimal extraction tax, \(\phi _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on scarce resources along the socially optimal time trajectory. This tax is needed because owners of scarce resources put too low value on the future resource bases in their current extraction decisions when \(\delta <\zeta \). Hence, the social planner taxes current extraction to conserve more of the resource stocks for future use.

The following corollary summarizes the above discussion:

Corollary 1

The socially optimal time trajectory can be implemented in competitive partial equilibrium with a standard Pigou tax, \(\tau _{t}^{sp}\), if and only if \(\delta =\zeta \). Otherwise, the Pigou tax must be combined with sector specific investment taxes and subsidies, \(\theta _{t}^{i,sp}\ne 0\), to implement the social optimum (\(\forall i\)). Further, an extraction tax, \(\phi _{t}^{i,sp}>0\), is needed on extraction of scarce resources (\(\forall i>\tilde{i}\)).

Proof

The corollary follows directly from Proposition 1.

Note that the sector specific taxes and subsidies, \(\theta _{t}^{i,sp}\) and \(\phi _{t}^{i,sp}\), in general differs between sectors (or technologies), even though all sectors have the same private discount factor \(\delta \).

2.3 Dynamic Effects of Future Emission Taxes

In practice, it may be hard for lawmakers to enact an emission tax immediately (Di Maria et al. 2017). In this Sect. 1 analyze dynamic effects of future emission taxes. The other taxes are set to zero (\(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\)). Lemma 1 implies that a credible announcement of increased future emission taxes has three key effects in the electricity market:

(a) Reduced fossil fuel demand from power plants Future emission taxes increase the future cost of burning fossil fuels. The decline in future emission intensive fossil-fueled electricity generation implies that optimal fossil-fueled power plant capacity will be lower in the future. This reduces the profitability of investment in, e.g., coal-fired power plants, and thereby the demand for coal (cf., a lower \(\lambda _{t}^{i}\) for emission intensive energy in Lemma 1).

(b) Increased supply of electricity generated from (sufficiently) low-emission energy sources Future emission taxes imply higher supply costs for emission intensive fossil-fueled power plants. Hence, low-emission electricity generation sources, like renewables or nuclear power, gain a competitive advantage when the tax is implemented. This increases the profitability of investing in low-emission electricity generation capacity (cf., a higher \(\lambda _{t}^{i}\) for low-emission energy in Lemma 1). The associated increase in non-fossil electricity generation capacity reduces the electricity market equilibrium consumption of fossil fuels.Footnote 20

(c) Increased current supply of fossil fuels Future taxes decrease the future value of the fossil fuel resource (cf., a lower value on the Hotelling rent \(\mu _{t}^{i}\) for \(\forall i>\tilde{i}\) in Lemma 1). Hence, it is profitable with faster extraction. This is the well-known (weak) green paradox (see, e.g., Sinclair 1992; Sinn 2008; Gerlagh 2011). In particular, Sinclair (1992) and Sinn (2008) caution against environmental policies that become more stringent with the passage of time, because such policies will accelerate resource extraction and, thereby, accelerate global warming.

Whereas the resource scarcity dynamic (c) suggest that exhaustible fossil fuel extraction accelerates following signaling of future environmental policies, the capacity stock dynamics (a) and (b) have the opposite effect. From a theoretical point of view, it is therefore ambiguous whether current emissions increase or decrease following signaling of stringent future climate policy, given that resource exhaustibility, capacity constraints and convex investment costs are present. The capacity stock mechanisms (a) and (b) strongly dominate the supply side mechanism (c) put forth by the green paradox literature in the numerical Sect. 3.3 below. One reason is that mechanism (c) only really matters for oil and gas fueled power plants (the resource rent is small for coal). The above discussion suggests that emissions will unambiguously decline following the tax announcement if scarcity (mechanism c) is negligible or non-existent.Footnote 21 The mechanics discussed above act on current production via the shadow prices on capacity (\(\lambda _{t}^{i}\), mechanisms a and b) and the resource stock (\(\mu _{t}^{i}\), mechanism c). Figure 11 in “Appendix A” graphs how these shadow prices are affected by the tax announcement in the numerical simulations.

2.4 Second-Best Emission Taxes

The optimal tax scheme given in Proposition 1 involves taxes or subsidies that differs across sectors (\(\theta _{t}^{i}\) and \(\phi _{t}^{i}\)). In this section, I consider the case where the regulator is constrained to \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and the representative firms’ discount rate exceeds the social discount rate (\(\delta <\zeta \)).Footnote 22 Then, whereas the Pigou tax \(\tau _{t}^{sp}\) still perfectly balances environmental damages with (flow) abatement cost, the transition towards a less emission intensive production capacity mix is too slow as compared with the socially optimal trajectory (cf., the need for investment taxes and subsidies \(\theta _{t}^{i,sp}\) when \(\delta <\zeta \) in Proposition 1). The transition towards a cleaner capacity mix can then be accelerated by announcing a future emission tax that is above the Pigouvian tax.

We observe that a policy involving announcement of a future tax above the Pigouvian tax will be subject to the mechanisms discussed in Sect. 2.3 (i.e., we have an increase from the Pigouvian tax to an emission tax that is higher during the transition). Of particular interest, higher future emission cost decreases future production from relatively emission intensive power plants, which again decreases their shadow price on capacity \(\lambda _{t}^{i}\) and, hence, investment in emission intensive production capacity. By the same reasoning, investment in (sufficiently) low-emission investment capacity increases. Hence, a future emission tax above the Pigouvian level has similar effects as the optimal investment tax in Proposition 1. This suggests that welfare may be increased by implementing a tax above the Pigouvian level during the transition period, given that \(\delta <\zeta \) and that mechanisms (a) and (b) dominate mechanism (c). Note that this argument only applies to future emission taxes; i.e., the regulator has no incentive to tax emissions above marginal environmental damages in the current time period. On the contrary, taxing current emissions above \(\tau _{t}^{sp}\) unambiguously reduces welfare, because marginal abatement cost is then larger than marginal environmental damages.

Assume mechanisms (a) and (b) dominate mechanism (c). Then, the regulator faces the following trade-off: On the one hand side, a tax above the Pigouvian level increases welfare by accelerating the change in production capacity necessary for the transition towards less emission intensive electricity generation. On the other hand side, there is a loss by taxing current emissions above the Pigouvian level, because marginal abatement cost is then higher than marginal environmental damage. It will increase welfare to tax emissions above the Pigouvian level if and only if the former effect dominates the latter. These dynamics are examined numerically in Sect. 3.2, where the second-best emissions tax trajectory turns out to be markedly above the Pigouvian tax during the transition period; see Fig. 4.

A caveat is that this policy is likely to be time inconsistent. To see this, let \(t=\left\{ 1,2,3\right\} \), \(\delta <\zeta \), \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and suppose mechanisms (a) and (b) dominate mechanism (c). Consider a policy where the regulator in period \(t=1\) announces the following emission tax sequence \(\left\{ \tau _{1}^{sp},\tau _{2}^{sp}+\epsilon ,\tau _{3}^{sp}\right\} \), where \(\epsilon \) is a small positive constant and \(\tau _{t}^{sp}\) is given by Proposition 1. Then, the period \(t=1\) shadow price on capacity decreases in \(\epsilon \) for relatively emission intensive sectors, and increases in \(\epsilon \) for (sufficiently) low-emission sectors. Hence, the isolated effect of \(\epsilon >0\) is to increase welfare (unless \(\epsilon \) is too large). In period \(t=2\), the regulator will have an incentive to break the commitment to \(\epsilon >0\), however, because in period \(t=2\) welfare is unambiguously maximized by \(\epsilon =0\). Therefore, this policy requires that the regulator can credibly commit to policies that, even though they increase present value welfare (8), will be less than optimal in the future time period in which they are enacted.Footnote 23

3 Numerical Illustration: The US Electricity Market

In this section I substantiate the analytical findings with complementary numerical results based on a stylized model for the US electricity market. The numerical model runs over the time horizon \(T=\left\{ 2016,2017,\ldots ,2115\right\} \) and uses the Path solver in GAMS (numerical software) to solve the theory model as a mixed complementarity problem.Footnote 24 Sect. 3.1 briefly summarizes the main characteristics of the numerical model, along with the data sources and estimation and calibration procedures used. The numerical results are given in sects. 3.2 and 3.3. See "Appendix C" for more details about the numerical model and data sources, including a test of model fit against history.

3.1 Parameterization and Functional Forms

The United States generated about 4 thousand terawatt hours of electricity in 2016, of which 30 percent came from coal plants, 34 percent from natural gas and petroleum, 20% from nuclear power and 15% from renewables.Footnote 25 I model electricity from these four energy sources, and coal-fired power plants with carbon capture and storage (CCS). Electricity is a homogeneous good and electricity generated from the different sources are modeled as perfect substitutes in consumption. I assume throughout that the US government allows increased nuclear energy production.

I use data from the US Energy Information Administration (EIA), IMF and British Petroleum to estimate a quadratic utility function for electricity consumption; see "Appendix C". This utility function yields the linear demand function for US electricity used in the numerical simulations. The electricity demand function does not change over time in the numerical simulations.Footnote 26

Real capital depreciation is set to 6% per year.Footnote 27 Technology specific investment costs are fetched from EIA. Technology specific operating costs are calibrated using historic figures from EIA, cost estimates of fossil fueled power plant ramp-up costs (Kumar et al. 2012), and figures for remaining fossil fuel reserves from British Petroleum. Supply of electricity generated from fossil fueled power plants, and gas in particular, is modeled quite flexible, whereas nuclear and renewables must invest in production capacity in order to increase production (with more than a few percent above the 2015 level). Emission reductions are possible either by lower electricity consumption or through substitution from fossil energy to renewables, nuclear energy or implementation of CCS. The stylized numerical model treats the emission intensities of each production technology as exogenous constants.Footnote 28

The highly stylized quadratic environmental damage function is calibrated such that the Obama Administration’s 80% emission reduction target in 2050 (as compared with 2015 emissions) is socially optimal, given the other parameters in the model.Footnote 29 I assume social and private discount rates equal to 4 percent per year, unless otherwise stated.

3.2 The Socially Optimal Time Trajectory

Figure 1 graphs net investment along the socially optimal time trajectory in the numerical model, and the (undiscounted) emission tax that implements this trajectory in competitive equilibrium (cf., Proposition 1 with \(\delta =\zeta \)). The socially optimal time trajectory is characterized by substantial investment in low-emission electricity generation capacity and decommissioning of fossil fueled power plants. The lion’s share of investment occurs during the first years.Footnote 30 Whereas CCS plays an important part during the transition towards a clean energy mix, the use of CCS declines along the socially optimal time trajectory, as renewable and nuclear capacity increases after a couple of decades. The explanation is that CCS has relatively high operating costs and emissions, as compared to renewables and nuclear power plants. The results suggests that CCS can reduce the need for expensive investment in renewable capacity in the short term.Footnote 31

Fig. 1
figure 1

The socially optimal time trajectory: Net investment by technology and optimal emission taxes

Figure 2 graphs optimal emission taxes and emissions along the socially optimal time trajectory for different assumptions about the magnitude of investment costs. Emissions and optimal taxes increase in investment cost, because the transition towards a cleaner energy mix slows down when investment costs increase, which again implies higher emissions and larger environmental damage.Footnote 32

Fig. 2
figure 2

Total emissions and emission taxes for various assumptions about investment costs. 0.5=halved; 1=baseline (i.e., EIA 2016); 2=doubled

Proposition 1 states that the socially optimal time trajectory can be implemented by a Pigou tax alone only if the private discount factor equals the socially optimal discount factor. Otherwise, the Pigou tax must be supplemented with taxes on investment and production. Figure 3 shows the investment taxes/subsidies necessary to induce the socially optimal time trajectory when the private discount factor is 0.9, whereas the social discount factor is 0.96. For comparison, the representative overnight capital costs used in the model calibration are 3636, 6084, 978, 5945 and 2557 USD per kW for coal, CCS, gas, nuclear and renewables, respectively (EIA 2016). The optimal extraction tax (\(\phi _{t}^{i}\)) is close to zero for coal and small but positive for gas and petroleum.Footnote 33

Fig. 3
figure 3

Optimal investment taxes with private (sector) discount factor equal to 0.9 and a social discount factor equal to 0.96. A negative tax indicates a subsidy

As discussed in the theory section, the regulator may be constrained to \(\theta _{t}^{i}=\phi _{t}^{i}=0\), such that an emission tax is the only available policy instrument. If so, it may increase welfare to tax emissions above the Pigouvian level if \(\delta <\zeta \). The theory is ambiguous on this, however, because two effects counteract each other (see Sect. 2.4). Hence, I investigate this topic numerically. Let the regulator implement a tax of the form \(\tau _{t}^{sp'}=\tau _{t}^{sp}+\varPhi _{t}\) for all \(t\in T\), where \(\tau _{t}^{sp}\) is given by Proposition 1 and the regulator choose the level on \(\varPhi _{t}\) that maximizes welfare (8). The second-best tax is graphed Fig. 4.Footnote 34 Whereas the cost of the extra tax element in time period \(s\in T\), \(\varPhi _{s}>0\), is incurred in the period it is enacted (because marginal abatement costs exceeds the social cost of carbon), the benefits via investment occur over the time interval \(\left\{ 2016,2017,\ldots ,s-1\right\} \). For example, \(\varPhi _{2028}>0\) acts on investment in the period 2016–2027, whereas \(\varPhi _{2017}>0\) only acts on investment in 2016. This is why \(\varPhi _{t}\) gradually increases over time (before declining as the transition is completed).

Fig. 4
figure 4

Second-best emission tax when \(\theta _{t}^{i}=\phi _{t}^{i}=0\), \(\delta =0.9\) and \(\zeta =0.9\)6

Technical change is omitted in the main part of present paper, but “Appendix A” features two different technology scenarios. In the first scenario, renewable investment cost declines by 5 percent each year. This implies, e.g., that renewable investment costs are halved by 2030, and only one tenth of baseline cost in 2060. As expected, exogenous technological change implies a delay in investment. Nevertheless, Fig. 8 in “Appendix A” shows that the optimal trajectory features large investments early on also in this technology optimistic scenario. The second scenario features a clean technology breakthrough. The new technology emerges in 2025, has potential to supply 1500 GWh per year at a marginal supply cost equal to half the 2015 electricity price, and the whole US electricity market at a marginal supply cost equal to the 2015 price. Otherwise, it shares characteristics with the renewable energy sector. As expected, the emergence of revolutionary clean technology has a major impact on the socially optimal time trajectory; see Fig. 9 in “Appendix A”.Footnote 35

Irreversible investment and vintage models are examined by, e.g., Hart (2004), Caparros et al. (2015) and Rozenberg et al. (2019). Whereas I have assumed that decommissioning of capacity is possible in the main analysis, a simulation which approximates irreversible investment by strongly increasing decommissioning costs is included in “Appendix A”. As expected, irreversibility implies a larger share of electricity generated by coal-fired power plants, with associated higher environmental damages and emission taxes; see Fig. 10.

3.3 Current Effects of Future Emission Taxes

We know from Sect. 2.3 that it is ambiguous from a purely theoretical perspective whether current emissions will increase or decrease following announcement of a future emission tax. In this section I investigate this topic using numerical analysis.

Consider an emission tax that is announced in the beginning of year 2016. The tax is zero for the period 2016–2024 and equals the Pigou tax thereafter (\(\tau _{t}^{i,sp}\) in Proposition 1 with \(\theta _{t}^{i}=\phi _{t}^{i}=0\)).

Fig. 5
figure 5

Effects of tax announcement on net investment (tax minus no-tax simulation values) and the Pigou tax

Figure 5 graphs the changes in net investment (investment minus capital depreciation) induced by the tax announcement in the period 2016–2050. As expected, investment in generation capacity from low-emission sources (renewables, nuclear and CCS) increases when the tax is announced. The reason is that future residual demand for electricity from low-emission plants will increase when electricity from coal plants is taxed. In terms of Lemma 1, the emission tax induces a higher future producer price for low-emission plants, with an associated higher shadow price on capacity. Furthermore, it is not profitable to invest in coal-fired power plants in the face of the future emission tax. Therefore, net investment is negative for coal. The results for gas are less clear. The reason is that gas is less emission intensive than coal, but more emission intensive than the other energy sources. The shadow price on gas the gas resource (i.e., the Hotelling rent) is lower in the tax simulation, as compared with the no-tax case; see Fig. 11 in “Appendix A”.

Fig. 6
figure 6

Effects of tax announcement on production and emissions. Production by source and total yearly emissions. Tax minus no-tax simulation values. The Pigouvian tax implemented in 2025

Figure 6 shows changes in electricity production and emissions following announcement of the future tax, as compared to the case with no tax. The lower capacity of coal-fired power plants, implied by the figures for net investment graphed in Fig. 5, causes early production and emission from coal to decline. In addition, the increased capacity of low-emission power plants crowds out electricity from coal-fired power plants, also in the years before the tax is implemented. The black line in Fig. 6 shows the associated decline in aggregate yearly emissions. Emissions decline in all periods, except for a minuscule increase in 2016, which occurs because the capacity stock mechanics operates with a one period time lag.Footnote 36 Overall, the cumulative decline in emissions over the period 2016–2024, i.e., before the tax is implemented, constitutes 69 percent of total emissions in 2015. Because the sectors adjust optimally to the future regulation, Figs. 5 and 6 illustrate that immediate action is optimal even when the emission reduction targets are several years into the future.

It is interesting to examine how sensitive the results in Fig. 6 are with respect to the magnitude of investment costs. In a sensitivity analysis I multiply the model baseline investment costs (\(\chi \left( \cdot \right) \)) with \(\phi \in \left\{ 0,0.05,0.1,0.2,0.4,\ldots ,2\right\} \); see Fig. 12 in “Appendix A”. Here, \(\phi =0\) is the case with free investment in production capacity (no adjustment costs), whereas \(\phi =2\) indicates that marginal investment costs are doubled. In the numerical model, total emissions from 2016 to 2024 declines unless investment costs are less than 5% of the baseline capital costs calibrated from figures given by the US Energy Administration (EIA 2016). The model collapses towards a standard exhaustible resource model without capacity constraints as investment costs approach zero.

Further sensitivity analysis was conducted for remaining fossil fuel reserves, discounting, the fuel shares of generation capacity in 2015, and the time lag between tax announcement and tax implementation (see Fig. 7 in “Appendix A” for the time lag sensitivity). Emissions in the time period between tax announcement and tax implementation decreased in all cases, given that the time-lag was two years or more. Total emissions over the period 2016–2024 remained lower in the tax simulation (as compared with the no-tax simulation) even when scarcity costs were multiplied with five, and when initial capacity was adjusted such that all electricity in 2015 was generated from gas and petroleum fired power plants.

4 Conclusion

This paper examined regulation in the presence of convex investment costs and technology specific capacity stocks. Four key results emerged: First, future emission reductions may require substantial investment in low emission energy sources today, because it takes time to build up clean production capacity and phase out dirty capacity. Second, the Pigou tax must be coupled with technology (or sector) specific investment taxes or subsidies to induce the socially optimal trajectory if the private discount rate differs from the social discount rate. Third, if such investment taxes or subsidies are unavailable, a second-best alternative may be to tax emissions above the Pigouvian level during the transition phase. A caveat is that this second-best tax policy is time-inconsistent, however. Fourth, announcement of future emission taxes reduces current emissions unless fossil fuels are scarce, in which case the effect is ambiguous in theory. The theory was complemented with a stylized numerical model of the US electricity market. The numerical model suggested that early emissions will decrease following the tax announcement in the combined presence of resource scarcity and long-lived capital.

The results in the present paper hinge on the assumption of strictly convex investment costs. Whereas this is reasonable in the presence of economy wide capacity constraints, or expedited construction of power plants (see the literature on adjustment costs cited in the introduction), there exists plausible scenarios where this convexity is non-existent or negligible.

The analysis features several simplifications and the results should be interpreted cautiously. Among them, the paper does not account for endogenous technical change and knowledge accumulation, which will be important in the transition towards a low emission economy (Goulder and Mathai 2000; Popp 2004; Kverndokk and Rosendahl 2007; Acemoglu et al. 2012). Further, the analysis does not feature general equilibrium effects and the environmental damage function in the numerical model is highly stylized. The results were derived under the assumption of perfect information, including knowledge about future prices. Whereas this is a very strong assumption, the economic rationale behind the discussion in Sect. 2.3 is straightforward: Expectations about a future emission tax reduces the incentives to maintain and invest in emission intensive production capacity, and increases the incentives to invest in clean alternatives. The associated change in the production capacity mix (i.e., a larger share of clean capacity) causes emissions to decline. As such, the essential assumption is that the tax announcement can induce an increase in expected future emission taxes. Proposition 1, on the other hand, is not valid without the perfect foresight assumption.

I have assumed that the regulator can commit credibly to future emission taxes. This matters because optimal regulation in the future depends on current investment levels. Specifically, the optimal future emission taxes prescribed by the tax rule in Proposition 1 typically change if the firms do not believe in the future tax levels as announced by the regulator today and, hence, choose different current investment levels than those targeted by the regulator (see, e.g., Kydland and Prescott 1977, on commitment and credibility).