Skip to main content

Emission Regulation of Markets with Sluggish Supply Structures


I examine regulation in the presence of convex investment costs and technology specific capacity stocks. Announcement of future emission taxes reduces current emissions unless fossil fuels are scarce, in which case the effect is ambiguous. Substantial future emission reductions require action today, because it takes time to build up clean production capacity and phase out dirty capacity. The Pigou tax must be coupled with sector specific investment taxes or subsidies to induce the socially optimal trajectory if the private discount rate differs from the social discount rate. If such investment taxes or subsidies are unavailable, a (time-inconsistent) second-best alternative may be to tax emissions above the Pigouvian level during the transition phase. The theory is complemented with a stylized numerical model of the US electricity market.


Committed emissions from existing and proposed energy infrastructure represent more than the entire remaining carbon budget if the 1.5 °C target is to be achieved, and perhaps two-thirds of the remaining carbon budget if mean global warming is to be limited below 2 °C (Tong et al. 2019). Energy infrastructure has substantial (and partly sunk) investment costs and can remain operative for decades once in place. Hence, it is of crucial importance to direct investment away from emission intensive fossil fuels and towards low-emission alternatives if we are to curb global warming.

In this paper, I examine emission regulation and transition dynamics in the presence of long-lived capital in a competitive partial equilibrium model with resource scarcity. The modeling framework includes capital accumulation and convex investment costs to capture the sluggish response to regulation caused by long-lived capital. I use regulation of the electric power industry as an example throughout the paper.

The socially optimal time trajectory can be implemented in competitive equilibrium in a setting with convex investment cost and resource scarcity by a standard Pigouvian emission tax if the firms’ private discount rate equals the social discount rate. Otherwise, i.e., if the firms are more impatient than the social planner, production capacity adjusts too slowly to the emission tax.Footnote 1 Therefore, a tax or subsidy on investment is needed to implement the socially optimal time trajectory in this case. This provides a rationale for using emission taxes and investment subsidies simultaneously, which is relevant for public policy because many countries have or consider regulation featuring both instruments (see, e.g., IPCC 2012). Further, emission taxes above the Pigouvian level may be optimal if the private discount rate exceeds the social discount rate and an emission tax is the only policy instrument available. The explanation is that a higher emission tax helps speed up the transition, which increases welfare if the firms discount the future too strongly. A caveat is that such policies tend to be time inconsistent. Last, the time lags of environmental policy may be substantial, and I also examine current effects of increasing (or introducing) future emission taxes.Footnote 2 A key research question here is whether the green paradox (Sinclair 1992; Sinn 2008) holds when the model includes capital accumulation, convex investment costs and resource scarcity.Footnote 3

The analysis highlights that large investments towards a less emission intensive production capacity mix is needed early on. The reason is that current investment in relatively clean production technologies decreases the cost of future emission reductions. This corroborates a key result in Vogt-Schilb et al. (2018), who explicitly models investment in abatement equipment and finds that it is optimal to start a long-term emission–reduction strategy with significant and early abatement investment, even if the optimal carbon price starts low and grows progressively over time.Footnote 4

Regarding current effects of anticipated future emission taxes, future taxes have three key dynamic effects on current emissions from electricity generation in a model with exhaustible resources and long-lived capital. First, future taxes increase the future cost of combusting fossil fuels. This reduces the profitability of upkeep and investment in emission intensive power plants, which again reduces the demand for fossil fuels. Second, future emission taxes increase future residual demand for low-emission electricity. This increases the profitability of investment in low-emission power plants, which again increases the electricity supply from these plants to the market. This reduces the equilibrium consumption of fossil fuels. Because of convex investment costs, it is cost efficient for the firms to begin adaptation to anticipated future emission taxes immediately. Therefore, emissions may decrease even before the tax has been implemented. Third, owners of scarce fossil resources can pre-empt a future emission tax by accelerating production of fossil energy such that more extraction take place before the tax is enacted. This is the well-known green paradox (see, e.g., Sinclair 1992; Sinn 2008). Whereas the two first mechanisms reduce the demand for fossil fuels, the third mechanism increases the supply of fossil fuels. Therefore, it is theoretically ambiguous whether the market equilibrium will feature increased or decreased fossil fuel consumption (before the emission tax is implemented), as compared to the case without future taxes.

I illustrate the analytical findings with a stylized numerical model of the US electricity market. The numerical model uses the Path solver in GAMS to solve the theoretical model as a mixed complementarity problem, given assumptions about quadratic functional forms for utility, environmental damage and production costs. The electricity demand function does not change over time in the numerical simulations, and technical change is either omitted or exogenous. The numerical simulations suggest that the green paradox does not hold in the presence of long-lived capital: Early emissions decrease in all cases following announcement of future emission taxes, except when investment costs are very low (the model then collapses towards the standard exhaustible resource model without long lived capital; see, e.g., Sinclair 1992, and Sinn 2008).

The analysis predicts that forward-looking firms will reduce current use of inputs subject to stringent future regulation. In this respect, it is interesting to observe the current struggle of publicly traded US coal companies.Footnote 5 Clearly, there are several factors behind this, like slower economic growth, cheap natural gas and current environmental regulation. Nevertheless, it seems reasonable that also bleaker prospects caused by future environmental regulation and increased competition from renewable power partly explain the investors’ vanishing interest in coal.Footnote 6

The presence of adjustment costs is an important premise for the present paper. Adjustment costs was early recognized, both related to firms’ net capital investment decisions (Gould 1968; Lucas 1976) and related to changing the number of employees (Holt et al. 1960; Oi 1962). Capital adjustment costs arise, e.g., if the price of capital increases in the rate of investment. Labor adjustment costs include costs related to hiring, training and layoff. Bellofatto and Besfamille (2018) use the probability that a project is finished early and without the need of refinancing as a proxy for administrative capacity. Whereas capital adjustment cost is the type of adjustment cost explicitly modeled in the present paper, the results may be relevant also in the case of labor adjustment cost or administrative capacity constraints. In the macroeconomics literature, Kydland and Prescott (1982) assumes that it takes time to install new equipment, and Wickens (2008, p. 33) assume that the cost of a unit of investment depends on how large it is in relation to the size of the existing capital stock.

The paper also relates to a body of literature which examines resource extraction under capacity constraints, but without pollution (Kemp and Van Long 1980; Amigues et al. 1998; Holland 2003). Particularly relevant to the present paper, Amigues et al. (2015) show that optimal investment in renewables starts before the end of fossil fuel usage in a setting with adjustment costs and endogenous capacity constraints. Further, Coulomb et al. (2019) examines the optimal transition from coal to gas and renewables in a model with capacity constraints and adjustment costs. Similarly to the present paper, they find that different energy sources should be used together in order to smooth out adjustment costs. Whereas the present paper’s modeling of the supply side shares many similarities with this body of literature, this paper adds to the literature by examining regulation and including environmental constraints (except for Coulomb et al. 2019, which also models pollution).

Another relevant branch of literature examines optimal use of energy sources, given emission constraints and exhaustible fossil fuels. This literature models renewable energy as a clean backstop and pays limited attention to adjustment costs. A general result without adjustment costs is that the clean backstop is kept on hold until the use of fossil fuels ceases (Chakravorty et al. 2006, 2008; van der Ploeg and Withagen 2012).Footnote 7 The transition is different in the present paper, as renewables (and nuclear power) is phased in early on and exploited together with emission intensive fossil fuels along the socially optimal time trajectory.

Section 2.1 characterizes the competitive partial equilibrium. The tax scheme that can implement the socially optimal time trajectory is presented in Sect. 2.2. Section 2.3 examines dynamic effects following announcement of future emission taxes. Section 2.4 investigates second-best taxation when an emission tax is the only policy instrument available and firms discount the future too strongly. The numerical illustration is in Sects. 3 and 4 concludes.

Theoretical Analysis

Let the vector \(\varvec{x}_{t}=\left( x_{t}^{1},x_{t}^{2},\ldots ,x_{t}^{\bar{i}}\right) \) denote a representative consumer’s consumption bundle of goods \(i\in I=\left\{ 1,2,\ldots ,\bar{i}\right\} \) in period \(t\in T=\left\{ 1,2,\ldots ,\overline{t}\right\} \). The associated benefit is given by the increasing and strictly concave utility function \(u\left( \varvec{x}_{t}\right) \). Each good \(x_{t}^{i}\) is produced by a representative firm (or sector) i. I assume market clearing such that production of \(x_{t}^{i}\) equals consumption of \(x_{t}^{i}\) for all \(i\in I\) and \(t\in T\). The firms’ discount factor is given by \(\delta \in \left( 0,1\right] \) and all derivatives are assumed to be finite.

One interpretation of this model setup is an economy with concave utility from electricity consumption, where electricity may be derived from \(\bar{i}\) energy sources: coal, gas, hydro power, and so forth. I will use this as an example throughout the paper, i.e., we have one representative firm for each type of electricity generation technology.

The investment costs of power generation are essentially capital construction costs and land, including “regulatory costs” for obtaining siting permits, environmental approvals, and so on. These costs may increase substantially in the presence of economy wide capacity constraints, like limited availability of skilled labor or raw materials. I assume that the investment cost function, \(\chi ^{i}\left( y_{t}^{i}\right) \), is strictly convex and increasing in investment \(y_{t}^{i}\), with minimum at \(\chi ^{i}(0)=0\).Footnote 8 The model framework allows the representative firm to actively reduce capacity faster than capital depreciation (\(y_{t}^{i}<0\)).Footnote 9

Operating costs for power plants include fuel, labor and maintenance costs. I divide these costs into fixed and variable operating costs. Fixed operating and maintenance costs, denoted \(f^{i}(Y_{t}^{i})\), include, e.g., salaries for facility staff and maintenance that is scheduled on a calendar basis. They do not vary significantly with a plant’s electricity generation, but increase in capacity; i.e., we have \(\partial f^{i}(Y_{t}^{i})/\partial Y_{t}^{i}\equiv f_{Y_{t}^{i}}^{i}\left( \cdot \right) >0\). The variable operating costs include the cost of consumable materials and maintenance that may be scheduled based on the number of operating hours or start-stop cycles of the plant. These costs are captured by the variable cost function \(k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\). Variable operating costs increase in production \(x_{t}^{i}\) and decrease in the capacity measure \(Y_{t}^{i}\). This is captured by the first order derivatives \(k_{x_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), second order derivatives \(k_{x_{t}^{i}x_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}Y_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\le 0\) and cross derivative \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\). The firms may reduce their current emissions, \(e_{t}^{i}\ge 0\), by (flow) abatement activities. Abatement, measured as \(e_{t}^{i,BaU}-e_{t}^{i}\), is not free and production cost decreases in emissions if emissions fall below the business as usual emissions, \(e_{t}^{i,BaU}\) (associated with no emission regulation); i.e., we have \(k_{e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), \(k_{e_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\) and \(k_{x_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\) for \(e_{t}^{i}<e_{t}^{i,BaU}\).Footnote 10 Note that abatement within a given technology is modeled as a flow variable; i.e. the current emission intensity does not matter for the future emission intensity. This differs from the emission reductions that may be achieved with a larger share of low-emission technology capacity (e.g., replacing coal capacity with renewables). Electricity generation (flow) abatement is most relevant for fossil fuels, where it may involve, e.g., switching to cleaner types of coal or use of combined cycle power plants or scrubbers (sulfur dioxide).Footnote 11

Production capacity evolves following the state equation:

$$\begin{aligned} Y_{t+1}^{i}=\beta Y_{t}^{i}+y_{t}^{i}, Y_{0}^{i}=\overline{Y}^{i},\ \forall i,\forall t, \end{aligned}$$

where \(\beta \in \left( 0,1\right] \) is a capital depreciation factor and \(\bar{Y}^{i}\) is initial capacity (a constant determined by history).

Assume that a subset of the representative firms \(j\in J=\left\{ \tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\right\} \) use a scarce resource as an input factor in production (\(J\subseteq I=\left\{ 1,2,\ldots ,\tilde{i},\tilde{i}+1,\ldots ,\bar{i}\right\} \)). These firms have an additional term \(h^{j}\left( S_{t}^{j}\right) x_{t}^{j}\) added to their variable operating cost function, where the remaining resource stock, \(S_{t}^{j}\), evolves following the state equation:

$$\begin{aligned} S_{t+1}^{j}=S_{t}^{j}-x_{t}^{j}, S_{0}^{j}=\bar{S}^{j},\ \forall j,\forall t. \end{aligned}$$

Here \(\bar{S}^{j}\) is an exogenous constant and I have normalized units in (2) such that one unit of production requires one unit of resource. We have the resource stock constraint \(S_{t}^{j}\ge 0\). Further, resource scarcity implies that unit operating costs decrease in the remaining resource stock; i.e., we have \(h_{S_{t}^{j}}^{j}\left( S^{j}\right) <0\); e.g., because the cheapest resource deposits are extracted first. Note that firm \(j\in J\) is an integrated firm that extracts the fossil fuels needed for electricity generation by itself. I assume that \(lim_{S_{t}^{j}\rightarrow 0}h\left( S_{t}^{j}\right) =\infty \); i.e., the cost of resource extraction approaches infinity as the resource stock is completely exhausted.Footnote 12

Total operating costs are given by:

$$\begin{aligned} c^{i}\left( \varvec{z}_{t}^{i}\right) ={\left\{ \begin{array}{ll} \begin{array}{l} f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i}),\\ f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})+h^{i}\left( S_{t}^{i}\right) x_{t}^{i}, \end{array} & \begin{array}{l} \forall i\le \tilde{i},\forall t,\\ \forall i>\tilde{i},\forall t, \end{array}\end{array}\right. } \end{aligned}$$

with \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i}\right) \) for \(\forall i\le \tilde{i}\) and \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i},S_{t}^{i}\right) \) for \(\forall i>\tilde{i}\). For example, the first \(i=1,2,\ldots ,\tilde{i}\) may denote firms using non-exhaustible energy sources like renewables and nuclear, whereas the remaining \(j=\tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\) are firms combusting fossil energy sources like petroleum and gas. This cost structure implies that unit operating costs, \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}\), have the familiar skewed U-shape with minimum at \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}=c_{x_{t}^{i}}^{i}\left( \varvec{z}^{i}\right) \) (for any given \(e_{t}^{i}\)).Footnote 13

Clearly, the relationship between \(f^{i}\left( \cdot \right) \) and \(k^{i}\left( \cdot \right) \) may differ markedly across technologies. For example, nuclear power plants feature high fixed costs relative to the variable operating costs, as compared with gas-fueled power plants. The strict convexity of \(\chi ^{i}\left( \cdot \right) \) and \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(\cdot )<0\) implies that the cost associated with any given change in the energy mix \(\varvec{x}\) may be reduced by increasing the number of time periods during which the change occurs. Specifically, the cost of reducing GHG emissions increases with the speed of emission reductions. I will discuss the results in a setting where the initial capacity mix \(\left( \overline{Y}^{1},\overline{Y}^{2},\ldots ,\overline{Y}^{\bar{i}}\right) \) is characterized by too much dirty capacity, such that the socially optimal capacity declines over time for relatively emission intensive production technologies, and increases for relatively clean energies. Whereas this eases the discussion of the results, and arguably is the case for the electricity industry in many countries today, it is not necessary for the validity of the analytical results.

The emissions stock evolves following the state equation:

$$\begin{aligned} E_{t+1}=\alpha E_{t}+\sum _{i\in I}e_{t}^{i}, E_{0}=\bar{E},\ \forall t, \end{aligned}$$

where \(\bar{E}\) is a constant determined by history and \(\alpha \in \left[ 0,1\right) \) denotes the stock depreciation factor from one period to the next. Environmental damage from emissions is given by \(d\left( E_{t}\right) \), where \(d(\cdot )\) is weakly convex and increasing.Footnote 14

Let \(p_{t}^{i}\) denote the endogenous consumer price on \(x_{t}^{i}\) (net of taxes). I assume the regulator has access to three regulatory instruments: an emission tax, \(\tau _{t}\), a tax on investment, \(\theta _{t}^{i}\), and a tax on extraction of exhaustible resources, \(\phi _{t}^{i}\). The representative firms \(i\in I\) may be interpreted as representing different sectors of the electric power industry; i.e., the wind power sector, the nuclear power sector, and so forth. I will therefore use the term sector specific taxes when referring to \(\theta _{t}^{i}\) and \(\phi _{t}^{i}\).Footnote 15 The investment tax \(\theta _{t}^{i}\) may take three forms: (i) a standard unit tax on investment if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}>0\), (ii) a subsidy to decommissioning if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}<0\), and (iii) a subsidy on investment if \(\theta _{t}^{i}<0\). The extraction tax \(\phi _{t}^{i}\) is needed to slow down extraction of scarce resources if the private discount rate is higher than the social discount rate (see Sect. 2.2); i.e., \(\phi _{t}^{i}>0\) is enacted to preserve scarce resources for use in the future.

Market Equilibrium

The competitive representative firm \(i\in I\) maximizes the present value of profits over the remaining time horizon solving:

$$\begin{aligned} \max _{x_{t}^{i},y_{t}^{i},e_{t}^{i}}\sum _{t\in T}\delta ^{t-1}\left[ p_{t}^{i}x_{t}^{i}-\left( c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\right) -\left( \chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\right) -e_{t}^{i}\tau _{t}\right] ,\ \forall t, \end{aligned}$$

where \(c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\) is operating costs including the extraction tax, \(\chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\) is the cost of investment, including investment taxes, and \(e_{t}^{i}\tau _{t}\) is the emission tax payment. The maximization is subject to equations (1), (2), (3) and the resource constraint \(S_{\overline{t}}^{j}\ge 0\).Footnote 16

A price-taking representative consumer maximizes net utility solving:

$$\begin{aligned} \varvec{x}_{t}=\arg \max _{\varvec{x}_{t}}\left[ u\left( \varvec{x}_{t}\right) -\varvec{p}_{t}\varvec{x}_{t}^{\prime }\right] ,\ \forall t, \end{aligned}$$

where \(\varvec{p}_{t}=\left( p_{t}^{1},p_{t}^{2},\ldots,p_{t}^{\bar{i}}\right) \) and \(\varvec{x}_{t}^{\prime }\) is the transpose of \(\varvec{x}_{t}\). The associated first order condition is \(u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right) \le p_{t}^{i}\) for all \(i\in I\) and \(t\in T\).

We have the following result:

Lemma 1

The competitive partial equilibrium sequence triple \(\left\{ x_{t}^{i,*},y_{t}^{i,*},e_{t}^{i,*}\right\} \), solving (5) and (6) subject to equations (1), (2) and (3), satisfies:

$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\phi _{t}^{i},\ \forall i\le \tilde{i},\forall t,\end{aligned}$$
$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\mu _{t}^{i}+\phi _{t}^{i},\ \forall i>\tilde{i},\forall t,\end{aligned}$$
$$\begin{aligned} \lambda _{t}^{i,*}&\le \chi _{y_{t}^{i}}^{i}\left( y_{t}^{i,*}\right) +\theta _{t}^{i},\ \forall i,\forall t,\end{aligned}$$
$$\begin{aligned} \lambda _{t}^{i,*}&=-\delta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \delta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,*}\right) ,\ \forall i,\forall t<\bar{t,}\end{aligned}$$
$$\begin{aligned} \mu _{t}^{i,*}&=-\sum _{r=t+1}^{\overline{t}}\delta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,*}\right) x_{r}^{i,*},\ \forall i>\tilde{i,}\forall t<\bar{t,}\end{aligned}$$
$$\begin{aligned} \tau _{t}&\le -c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) ,\ \forall i,\forall t, \end{aligned}$$

with \(Y_{t}^{i,*}\) and \(S_{t}^{i,*}\) as given by equations (1) and (2), respectively. We have \(\lambda _{\bar{t}}^{i,*}=\mu _{\bar{t}}^{i,*}=0\). The weak inequalities are strict if and only if we have a corner solution for the relevant decision variable.Footnote 17


See "Appendix B". \(\square \)

We see from Lemma 1 that production of \(x_{t}^{i}\) increases in capacity \(Y_{t}^{i}\), marginal utility from consumption and the remaining resource stock (\(\forall i>\tilde{i}\)), whereas it decreases in production cost and extraction taxes. Note that Lemma 1 implies \(p_{t}^{1}=p_{t}^{2}=\ldots =p_{t}^{\bar{i}}\) if the goods are perfect substitutes. This is relevant if I is a set of electricity producers.

The variable \(\lambda _{t}^{i}\) is a (endogenous) shadow price representing the present value of the change in future profits caused by a marginal increase in current capacity. In the case where optimal production capacity declines towards a new and lower level (faster than capacity depreciation), higher capacity today induces too high fixed operating costs in the future. Hence, the shadow price \(\lambda _{t}^{i}\) is negative. Conversely, \(\lambda _{t}^{i}\) is positive if optimal capacity shifts upwards.Footnote 18

For sectors with scarce resources (\(\forall i>\tilde{i}\)), more extraction today implies less extraction in the future. Hence, the resource owners must not only decide whether to extract the resource, but also when to extract. This consideration is captured by the non-negative shadow price on the remaining resource stock, \(\mu _{t}^{j}\) , in Lemma 1 (the endogenous shadow price \(\mu _{t}^{j}\) is often referred to as the ’scarcity rent’ or ’Hotelling rent’). It is the present value change in future profits caused by a marginal increase in the remaining resource stock \(S_{t}^{j}\). Lemma 1 implies that each resource owner equalizes marginal discounted profits from extraction over time. Otherwise, the resource owners could increase the present value of their resource by moving production between periods. The resource rents typically vary between the different resources.

Whereas the isolated effect of increased emission taxes is to decrease production (given \(e_{t}^{i}>0\)), production of \(x_{t}^{i}\) in competitive equilibrium may increase in the emission tax \(\tau _{t}\) if \(x_{t}^{i}\) is a low-emission good. The reason is that the upward shift in each firm’s supply cost functions, caused by the emission tax, increases in the emission intensity of the firm. Therefore, residual demand and equilibrium production of relatively low-emission goods increases. Equation (7f) in Lemma 1 states the familiar result that marginal costs of emission reductions (i.e., flow abatement for each type of technology) equal the emission tax in the interior solution.

The Socially Optimal Tax Scheme

Let welfare W be measured as the present value of utility from consumption net of environmental damages, production costs and investment costs:

$$\begin{aligned} W=\sum _{t\in T}\zeta ^{t-1}\left[ u\left( \varvec{x}_{t}\right) -d\left( E_{t}\right) -\sum _{i\in I}\left[ c^{i}\left( \varvec{z}_{t}^{i}\right) +\chi ^{i}\left( y_{t}^{i}\right) \right] \right] , \end{aligned}$$

where \(1\ge \zeta \ge \delta \) is the social discount factor. The regulator faces a trade-off in the presence of convex investment costs. On the one hand, fast emission reductions reduce environmental damage. On the other hand, the convexity of investment costs imply that the cost of emission reductions can always be reduced by extending the time horizon over which emission reductions take place. We have the following result:

Proposition 1

Let the socially optimal sequence triple \(\left\{ x_{t}^{i,sp},y_{t}^{i,sp},e_{t}^{i,sp}\right\} \) maximize welfare (8) subject to equations (1) to (4). Then, the socially optimal time trajectory can be implemented in partial competitive equilibrium with the following taxes:

$$\begin{aligned} \tau _{t}^{sp}&=\zeta \sum _{r=t+1}^{r=\overline{t}}\left( \alpha \zeta \right) ^{r-t-1}d_{E_{r}}(E_{r}^{sp}),\ \forall t<\overline{t},\\ \theta _{t}^{i}&=-\lambda _{t}^{i,sp}+\lambda _{t}^{*,i},\ \forall i,\forall t,\\ \phi _{t}^{i}&=\mu _{t}^{i,sp}-\mu _{t}^{*,i},\ \forall i>\tilde{i},\forall t, \end{aligned}$$


$$\begin{aligned} \lambda _{t}^{i,sp}&=-\zeta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \zeta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,sp}\right) ,\ \forall i,\forall t<\bar{t,}\\ \mu _{t}^{i,sp}&=-\sum _{r=t+1}^{\overline{t}}\zeta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,sp}\right) x_{r}^{i,sp},\ \forall i>\tilde{i},\forall t<\bar{t,} \end{aligned}$$

with \(\lambda _{t}^{*,i}\) and \(\mu _{t}^{*,i}\)as given in Lemma 1, \(\tau _{\bar{t}}^{sp}=\theta _{\bar{t}}^{i}=\phi _{\bar{t}}^{i}=0\) and \(\phi _{t}^{i,sp}=0\text { for } \forall i\le \tilde{i}\).


See "Appendix B".

We first examine the case where the social discount factor equals the private discount factor, such that \(\delta =\zeta \). In this case \(\theta _{t}^{i,sp}=\phi _{t}^{i,sp}=0\), given the optimal emission tax \(\tau _{t}^{sp}\). We observe that the expression for \(\tau _{t}^{sp}\) is the present value of the stream of future marginal stock damages following one additional unit of emissions along the socially optimal time trajectory. This is sometimes referred to as the social cost of carbon in the case of greenhouse gases.

The Pigou tax \(\tau _{t}^{sp}\) is only indirectly affected by the explicit modeling of production capacity and convex investment costs (the production capacity mix influence emissions and, hence, the optimal tax). Specifically, \(\tau _{t}^{sp}\) is not reduced during the first years to give firms time to adjust. On the contrary, higher investment costs cause slower development of relatively clean production capacity, which again entails higher emissions, a higher absolute value shadow price on the emissions stock, and higher optimal emissions taxes (see also Fig. 2 in Sect. 3.2). More expensive decommissioning of dirty production capacity (e.g., coal) has the same effect (see Figure 10 in “Appendix A”).

Assuming an interior solution for (flow) abatement, \(e_{t}^{i,sp}<e_{t}^{i,BaU}\), we have \(\tau _{t}=k_{e_{t}^{i}}^{i}\left( \cdot \right) \) in each time period. Marginal abatement cost then increases over time if the optimal stock of carbon in the atmosphere increases over time (because marginal environmental damage and, thereby, the optimal tax \(\tau _{t}^{sp}\), increases over time). Whereas this abatement profile agrees with Nordhaus (1991; 1992), the present analysis highlights that substantial investments may be necessary early on (see Fig. 1 in Sect. 3.2). The reason is that it takes time to implement the emission reductions necessary to curb global warming. The well-known result that lower (flow) abatement costs reduces the optimal emission tax remains valid in the present model setup (Weitzman 1974). Cheaper abatement also reduces the importance of redirecting investment towards less-emission intensive production capacity.

A common assumption when deriving the socially optimal time trajectory is that the firms’ private discount rate equals the discount rate of the social planner (see, e.g., Nordhaus 1991, 1992; Golosov et al. 2014). This assumption may be questionable, at least when applied to major environmental challenges like greenhouse gas emissions from power plants and climate change. The Stern Review (Stern 2007), and the following discussion about appropriate social discount rates in cost-benefit analysis, indicates that the social discount rate may be below capital market interest rates, at least in the case of climate change (Weitzman 2007; Tol and Yohe 2006). Indeed, the Stern Reviews’s conclusions about the need for decisive immediate action hinges on the assumption of a near-zero pure time preference discount rate, which are inconsistent with today’s marketplace real interest rates and savings rates (Nordhaus 2007). Goulder and Williams (2012) argue that we should distinguish between a social-welfare-equivalent discount rate appropriate for determining whether a given policy would augment social welfare and a finance-equivalent discount rate suitable for determining whether the policy would offer a potential Pareto improvement.Footnote 19

The case where the social discount factor is larger than the private discount factor (\(\delta <\zeta \)) involves \(\theta _{t}^{i,sp}\ne 0\) (\(\forall i\)) and \(\phi _{t}^{i,sp}>0\) for sectors with scarce resources (\(\forall i>\tilde{i}\)). The optimal investment tax, \(\theta _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on capacity \(Y_{t}^{i}\) along the socially optimal time trajectory. Note that the socially optimal investment tax is positive (negative) if production decrease (increase) over time. For example, Proposition 1 may imply decommissioning subsidies to coal plants, whereas investment in renewable energy is subsidized; see Fig. 3 in the numerical Sect. 3.2.

The optimal extraction tax, \(\phi _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on scarce resources along the socially optimal time trajectory. This tax is needed because owners of scarce resources put too low value on the future resource bases in their current extraction decisions when \(\delta <\zeta \). Hence, the social planner taxes current extraction to conserve more of the resource stocks for future use.

The following corollary summarizes the above discussion:

Corollary 1

The socially optimal time trajectory can be implemented in competitive partial equilibrium with a standard Pigou tax, \(\tau _{t}^{sp}\), if and only if \(\delta =\zeta \). Otherwise, the Pigou tax must be combined with sector specific investment taxes and subsidies, \(\theta _{t}^{i,sp}\ne 0\), to implement the social optimum (\(\forall i\)). Further, an extraction tax, \(\phi _{t}^{i,sp}>0\), is needed on extraction of scarce resources (\(\forall i>\tilde{i}\)).


The corollary follows directly from Proposition 1.

Note that the sector specific taxes and subsidies, \(\theta _{t}^{i,sp}\) and \(\phi _{t}^{i,sp}\), in general differs between sectors (or technologies), even though all sectors have the same private discount factor \(\delta \).

Dynamic Effects of Future Emission Taxes

In practice, it may be hard for lawmakers to enact an emission tax immediately (Di Maria et al. 2017). In this Sect. 1 analyze dynamic effects of future emission taxes. The other taxes are set to zero (\(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\)). Lemma 1 implies that a credible announcement of increased future emission taxes has three key effects in the electricity market:

(a) Reduced fossil fuel demand from power plants Future emission taxes increase the future cost of burning fossil fuels. The decline in future emission intensive fossil-fueled electricity generation implies that optimal fossil-fueled power plant capacity will be lower in the future. This reduces the profitability of investment in, e.g., coal-fired power plants, and thereby the demand for coal (cf., a lower \(\lambda _{t}^{i}\) for emission intensive energy in Lemma 1).

(b) Increased supply of electricity generated from (sufficiently) low-emission energy sources Future emission taxes imply higher supply costs for emission intensive fossil-fueled power plants. Hence, low-emission electricity generation sources, like renewables or nuclear power, gain a competitive advantage when the tax is implemented. This increases the profitability of investing in low-emission electricity generation capacity (cf., a higher \(\lambda _{t}^{i}\) for low-emission energy in Lemma 1). The associated increase in non-fossil electricity generation capacity reduces the electricity market equilibrium consumption of fossil fuels.Footnote 20

(c) Increased current supply of fossil fuels Future taxes decrease the future value of the fossil fuel resource (cf., a lower value on the Hotelling rent \(\mu _{t}^{i}\) for \(\forall i>\tilde{i}\) in Lemma 1). Hence, it is profitable with faster extraction. This is the well-known (weak) green paradox (see, e.g., Sinclair 1992; Sinn 2008; Gerlagh 2011). In particular, Sinclair (1992) and Sinn (2008) caution against environmental policies that become more stringent with the passage of time, because such policies will accelerate resource extraction and, thereby, accelerate global warming.

Whereas the resource scarcity dynamic (c) suggest that exhaustible fossil fuel extraction accelerates following signaling of future environmental policies, the capacity stock dynamics (a) and (b) have the opposite effect. From a theoretical point of view, it is therefore ambiguous whether current emissions increase or decrease following signaling of stringent future climate policy, given that resource exhaustibility, capacity constraints and convex investment costs are present. The capacity stock mechanisms (a) and (b) strongly dominate the supply side mechanism (c) put forth by the green paradox literature in the numerical Sect. 3.3 below. One reason is that mechanism (c) only really matters for oil and gas fueled power plants (the resource rent is small for coal). The above discussion suggests that emissions will unambiguously decline following the tax announcement if scarcity (mechanism c) is negligible or non-existent.Footnote 21 The mechanics discussed above act on current production via the shadow prices on capacity (\(\lambda _{t}^{i}\), mechanisms a and b) and the resource stock (\(\mu _{t}^{i}\), mechanism c). Figure 11 in “Appendix A” graphs how these shadow prices are affected by the tax announcement in the numerical simulations.

Second-Best Emission Taxes

The optimal tax scheme given in Proposition 1 involves taxes or subsidies that differs across sectors (\(\theta _{t}^{i}\) and \(\phi _{t}^{i}\)). In this section, I consider the case where the regulator is constrained to \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and the representative firms’ discount rate exceeds the social discount rate (\(\delta <\zeta \)).Footnote 22 Then, whereas the Pigou tax \(\tau _{t}^{sp}\) still perfectly balances environmental damages with (flow) abatement cost, the transition towards a less emission intensive production capacity mix is too slow as compared with the socially optimal trajectory (cf., the need for investment taxes and subsidies \(\theta _{t}^{i,sp}\) when \(\delta <\zeta \) in Proposition 1). The transition towards a cleaner capacity mix can then be accelerated by announcing a future emission tax that is above the Pigouvian tax.

We observe that a policy involving announcement of a future tax above the Pigouvian tax will be subject to the mechanisms discussed in Sect. 2.3 (i.e., we have an increase from the Pigouvian tax to an emission tax that is higher during the transition). Of particular interest, higher future emission cost decreases future production from relatively emission intensive power plants, which again decreases their shadow price on capacity \(\lambda _{t}^{i}\) and, hence, investment in emission intensive production capacity. By the same reasoning, investment in (sufficiently) low-emission investment capacity increases. Hence, a future emission tax above the Pigouvian level has similar effects as the optimal investment tax in Proposition 1. This suggests that welfare may be increased by implementing a tax above the Pigouvian level during the transition period, given that \(\delta <\zeta \) and that mechanisms (a) and (b) dominate mechanism (c). Note that this argument only applies to future emission taxes; i.e., the regulator has no incentive to tax emissions above marginal environmental damages in the current time period. On the contrary, taxing current emissions above \(\tau _{t}^{sp}\) unambiguously reduces welfare, because marginal abatement cost is then larger than marginal environmental damages.

Assume mechanisms (a) and (b) dominate mechanism (c). Then, the regulator faces the following trade-off: On the one hand side, a tax above the Pigouvian level increases welfare by accelerating the change in production capacity necessary for the transition towards less emission intensive electricity generation. On the other hand side, there is a loss by taxing current emissions above the Pigouvian level, because marginal abatement cost is then higher than marginal environmental damage. It will increase welfare to tax emissions above the Pigouvian level if and only if the former effect dominates the latter. These dynamics are examined numerically in Sect. 3.2, where the second-best emissions tax trajectory turns out to be markedly above the Pigouvian tax during the transition period; see Fig. 4.

A caveat is that this policy is likely to be time inconsistent. To see this, let \(t=\left\{ 1,2,3\right\} \), \(\delta <\zeta \), \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and suppose mechanisms (a) and (b) dominate mechanism (c). Consider a policy where the regulator in period \(t=1\) announces the following emission tax sequence \(\left\{ \tau _{1}^{sp},\tau _{2}^{sp}+\epsilon ,\tau _{3}^{sp}\right\} \), where \(\epsilon \) is a small positive constant and \(\tau _{t}^{sp}\) is given by Proposition 1. Then, the period \(t=1\) shadow price on capacity decreases in \(\epsilon \) for relatively emission intensive sectors, and increases in \(\epsilon \) for (sufficiently) low-emission sectors. Hence, the isolated effect of \(\epsilon >0\) is to increase welfare (unless \(\epsilon \) is too large). In period \(t=2\), the regulator will have an incentive to break the commitment to \(\epsilon >0\), however, because in period \(t=2\) welfare is unambiguously maximized by \(\epsilon =0\). Therefore, this policy requires that the regulator can credibly commit to policies that, even though they increase present value welfare (8), will be less than optimal in the future time period in which they are enacted.Footnote 23

Numerical Illustration: The US Electricity Market

In this section I substantiate the analytical findings with complementary numerical results based on a stylized model for the US electricity market. The numerical model runs over the time horizon \(T=\left\{ 2016,2017,\ldots ,2115\right\} \) and uses the Path solver in GAMS (numerical software) to solve the theory model as a mixed complementarity problem.Footnote 24 Sect. 3.1 briefly summarizes the main characteristics of the numerical model, along with the data sources and estimation and calibration procedures used. The numerical results are given in sects. 3.2 and 3.3. See "Appendix C" for more details about the numerical model and data sources, including a test of model fit against history.

Parameterization and Functional Forms

The United States generated about 4 thousand terawatt hours of electricity in 2016, of which 30 percent came from coal plants, 34 percent from natural gas and petroleum, 20% from nuclear power and 15% from renewables.Footnote 25 I model electricity from these four energy sources, and coal-fired power plants with carbon capture and storage (CCS). Electricity is a homogeneous good and electricity generated from the different sources are modeled as perfect substitutes in consumption. I assume throughout that the US government allows increased nuclear energy production.

I use data from the US Energy Information Administration (EIA), IMF and British Petroleum to estimate a quadratic utility function for electricity consumption; see "Appendix C". This utility function yields the linear demand function for US electricity used in the numerical simulations. The electricity demand function does not change over time in the numerical simulations.Footnote 26

Real capital depreciation is set to 6% per year.Footnote 27 Technology specific investment costs are fetched from EIA. Technology specific operating costs are calibrated using historic figures from EIA, cost estimates of fossil fueled power plant ramp-up costs (Kumar et al. 2012), and figures for remaining fossil fuel reserves from British Petroleum. Supply of electricity generated from fossil fueled power plants, and gas in particular, is modeled quite flexible, whereas nuclear and renewables must invest in production capacity in order to increase production (with more than a few percent above the 2015 level). Emission reductions are possible either by lower electricity consumption or through substitution from fossil energy to renewables, nuclear energy or implementation of CCS. The stylized numerical model treats the emission intensities of each production technology as exogenous constants.Footnote 28

The highly stylized quadratic environmental damage function is calibrated such that the Obama Administration’s 80% emission reduction target in 2050 (as compared with 2015 emissions) is socially optimal, given the other parameters in the model.Footnote 29 I assume social and private discount rates equal to 4 percent per year, unless otherwise stated.

The Socially Optimal Time Trajectory

Figure 1 graphs net investment along the socially optimal time trajectory in the numerical model, and the (undiscounted) emission tax that implements this trajectory in competitive equilibrium (cf., Proposition 1 with \(\delta =\zeta \)). The socially optimal time trajectory is characterized by substantial investment in low-emission electricity generation capacity and decommissioning of fossil fueled power plants. The lion’s share of investment occurs during the first years.Footnote 30 Whereas CCS plays an important part during the transition towards a clean energy mix, the use of CCS declines along the socially optimal time trajectory, as renewable and nuclear capacity increases after a couple of decades. The explanation is that CCS has relatively high operating costs and emissions, as compared to renewables and nuclear power plants. The results suggests that CCS can reduce the need for expensive investment in renewable capacity in the short term.Footnote 31

Fig. 1
figure 1

The socially optimal time trajectory: Net investment by technology and optimal emission taxes

Figure 2 graphs optimal emission taxes and emissions along the socially optimal time trajectory for different assumptions about the magnitude of investment costs. Emissions and optimal taxes increase in investment cost, because the transition towards a cleaner energy mix slows down when investment costs increase, which again implies higher emissions and larger environmental damage.Footnote 32

Fig. 2
figure 2

Total emissions and emission taxes for various assumptions about investment costs. 0.5=halved; 1=baseline (i.e., EIA 2016); 2=doubled

Proposition 1 states that the socially optimal time trajectory can be implemented by a Pigou tax alone only if the private discount factor equals the socially optimal discount factor. Otherwise, the Pigou tax must be supplemented with taxes on investment and production. Figure 3 shows the investment taxes/subsidies necessary to induce the socially optimal time trajectory when the private discount factor is 0.9, whereas the social discount factor is 0.96. For comparison, the representative overnight capital costs used in the model calibration are 3636, 6084, 978, 5945 and 2557 USD per kW for coal, CCS, gas, nuclear and renewables, respectively (EIA 2016). The optimal extraction tax (\(\phi _{t}^{i}\)) is close to zero for coal and small but positive for gas and petroleum.Footnote 33

Fig. 3
figure 3

Optimal investment taxes with private (sector) discount factor equal to 0.9 and a social discount factor equal to 0.96. A negative tax indicates a subsidy

As discussed in the theory section, the regulator may be constrained to \(\theta _{t}^{i}=\phi _{t}^{i}=0\), such that an emission tax is the only available policy instrument. If so, it may increase welfare to tax emissions above the Pigouvian level if \(\delta <\zeta \). The theory is ambiguous on this, however, because two effects counteract each other (see Sect. 2.4). Hence, I investigate this topic numerically. Let the regulator implement a tax of the form \(\tau _{t}^{sp'}=\tau _{t}^{sp}+\varPhi _{t}\) for all \(t\in T\), where \(\tau _{t}^{sp}\) is given by Proposition 1 and the regulator choose the level on \(\varPhi _{t}\) that maximizes welfare (8). The second-best tax is graphed Fig. 4.Footnote 34 Whereas the cost of the extra tax element in time period \(s\in T\), \(\varPhi _{s}>0\), is incurred in the period it is enacted (because marginal abatement costs exceeds the social cost of carbon), the benefits via investment occur over the time interval \(\left\{ 2016,2017,\ldots ,s-1\right\} \). For example, \(\varPhi _{2028}>0\) acts on investment in the period 2016–2027, whereas \(\varPhi _{2017}>0\) only acts on investment in 2016. This is why \(\varPhi _{t}\) gradually increases over time (before declining as the transition is completed).

Fig. 4
figure 4

Second-best emission tax when \(\theta _{t}^{i}=\phi _{t}^{i}=0\), \(\delta =0.9\) and \(\zeta =0.9\)6

Technical change is omitted in the main part of present paper, but “Appendix A” features two different technology scenarios. In the first scenario, renewable investment cost declines by 5 percent each year. This implies, e.g., that renewable investment costs are halved by 2030, and only one tenth of baseline cost in 2060. As expected, exogenous technological change implies a delay in investment. Nevertheless, Fig. 8 in “Appendix A” shows that the optimal trajectory features large investments early on also in this technology optimistic scenario. The second scenario features a clean technology breakthrough. The new technology emerges in 2025, has potential to supply 1500 GWh per year at a marginal supply cost equal to half the 2015 electricity price, and the whole US electricity market at a marginal supply cost equal to the 2015 price. Otherwise, it shares characteristics with the renewable energy sector. As expected, the emergence of revolutionary clean technology has a major impact on the socially optimal time trajectory; see Fig. 9 in “Appendix A”.Footnote 35

Irreversible investment and vintage models are examined by, e.g., Hart (2004), Caparros et al. (2015) and Rozenberg et al. (2019). Whereas I have assumed that decommissioning of capacity is possible in the main analysis, a simulation which approximates irreversible investment by strongly increasing decommissioning costs is included in “Appendix A”. As expected, irreversibility implies a larger share of electricity generated by coal-fired power plants, with associated higher environmental damages and emission taxes; see Fig. 10.

Current Effects of Future Emission Taxes

We know from Sect. 2.3 that it is ambiguous from a purely theoretical perspective whether current emissions will increase or decrease following announcement of a future emission tax. In this section I investigate this topic using numerical analysis.

Consider an emission tax that is announced in the beginning of year 2016. The tax is zero for the period 2016–2024 and equals the Pigou tax thereafter (\(\tau _{t}^{i,sp}\) in Proposition 1 with \(\theta _{t}^{i}=\phi _{t}^{i}=0\)).

Fig. 5
figure 5

Effects of tax announcement on net investment (tax minus no-tax simulation values) and the Pigou tax

Figure 5 graphs the changes in net investment (investment minus capital depreciation) induced by the tax announcement in the period 2016–2050. As expected, investment in generation capacity from low-emission sources (renewables, nuclear and CCS) increases when the tax is announced. The reason is that future residual demand for electricity from low-emission plants will increase when electricity from coal plants is taxed. In terms of Lemma 1, the emission tax induces a higher future producer price for low-emission plants, with an associated higher shadow price on capacity. Furthermore, it is not profitable to invest in coal-fired power plants in the face of the future emission tax. Therefore, net investment is negative for coal. The results for gas are less clear. The reason is that gas is less emission intensive than coal, but more emission intensive than the other energy sources. The shadow price on gas the gas resource (i.e., the Hotelling rent) is lower in the tax simulation, as compared with the no-tax case; see Fig. 11 in “Appendix A”.

Fig. 6
figure 6

Effects of tax announcement on production and emissions. Production by source and total yearly emissions. Tax minus no-tax simulation values. The Pigouvian tax implemented in 2025

Figure 6 shows changes in electricity production and emissions following announcement of the future tax, as compared to the case with no tax. The lower capacity of coal-fired power plants, implied by the figures for net investment graphed in Fig. 5, causes early production and emission from coal to decline. In addition, the increased capacity of low-emission power plants crowds out electricity from coal-fired power plants, also in the years before the tax is implemented. The black line in Fig. 6 shows the associated decline in aggregate yearly emissions. Emissions decline in all periods, except for a minuscule increase in 2016, which occurs because the capacity stock mechanics operates with a one period time lag.Footnote 36 Overall, the cumulative decline in emissions over the period 2016–2024, i.e., before the tax is implemented, constitutes 69 percent of total emissions in 2015. Because the sectors adjust optimally to the future regulation, Figs. 5 and 6 illustrate that immediate action is optimal even when the emission reduction targets are several years into the future.

It is interesting to examine how sensitive the results in Fig. 6 are with respect to the magnitude of investment costs. In a sensitivity analysis I multiply the model baseline investment costs (\(\chi \left( \cdot \right) \)) with \(\phi \in \left\{ 0,0.05,0.1,0.2,0.4,\ldots ,2\right\} \); see Fig. 12 in “Appendix A”. Here, \(\phi =0\) is the case with free investment in production capacity (no adjustment costs), whereas \(\phi =2\) indicates that marginal investment costs are doubled. In the numerical model, total emissions from 2016 to 2024 declines unless investment costs are less than 5% of the baseline capital costs calibrated from figures given by the US Energy Administration (EIA 2016). The model collapses towards a standard exhaustible resource model without capacity constraints as investment costs approach zero.

Further sensitivity analysis was conducted for remaining fossil fuel reserves, discounting, the fuel shares of generation capacity in 2015, and the time lag between tax announcement and tax implementation (see Fig. 7 in “Appendix A” for the time lag sensitivity). Emissions in the time period between tax announcement and tax implementation decreased in all cases, given that the time-lag was two years or more. Total emissions over the period 2016–2024 remained lower in the tax simulation (as compared with the no-tax simulation) even when scarcity costs were multiplied with five, and when initial capacity was adjusted such that all electricity in 2015 was generated from gas and petroleum fired power plants.


This paper examined regulation in the presence of convex investment costs and technology specific capacity stocks. Four key results emerged: First, future emission reductions may require substantial investment in low emission energy sources today, because it takes time to build up clean production capacity and phase out dirty capacity. Second, the Pigou tax must be coupled with technology (or sector) specific investment taxes or subsidies to induce the socially optimal trajectory if the private discount rate differs from the social discount rate. Third, if such investment taxes or subsidies are unavailable, a second-best alternative may be to tax emissions above the Pigouvian level during the transition phase. A caveat is that this second-best tax policy is time-inconsistent, however. Fourth, announcement of future emission taxes reduces current emissions unless fossil fuels are scarce, in which case the effect is ambiguous in theory. The theory was complemented with a stylized numerical model of the US electricity market. The numerical model suggested that early emissions will decrease following the tax announcement in the combined presence of resource scarcity and long-lived capital.

The results in the present paper hinge on the assumption of strictly convex investment costs. Whereas this is reasonable in the presence of economy wide capacity constraints, or expedited construction of power plants (see the literature on adjustment costs cited in the introduction), there exists plausible scenarios where this convexity is non-existent or negligible.

The analysis features several simplifications and the results should be interpreted cautiously. Among them, the paper does not account for endogenous technical change and knowledge accumulation, which will be important in the transition towards a low emission economy (Goulder and Mathai 2000; Popp 2004; Kverndokk and Rosendahl 2007; Acemoglu et al. 2012). Further, the analysis does not feature general equilibrium effects and the environmental damage function in the numerical model is highly stylized. The results were derived under the assumption of perfect information, including knowledge about future prices. Whereas this is a very strong assumption, the economic rationale behind the discussion in Sect. 2.3 is straightforward: Expectations about a future emission tax reduces the incentives to maintain and invest in emission intensive production capacity, and increases the incentives to invest in clean alternatives. The associated change in the production capacity mix (i.e., a larger share of clean capacity) causes emissions to decline. As such, the essential assumption is that the tax announcement can induce an increase in expected future emission taxes. Proposition 1, on the other hand, is not valid without the perfect foresight assumption.

I have assumed that the regulator can commit credibly to future emission taxes. This matters because optimal regulation in the future depends on current investment levels. Specifically, the optimal future emission taxes prescribed by the tax rule in Proposition 1 typically change if the firms do not believe in the future tax levels as announced by the regulator today and, hence, choose different current investment levels than those targeted by the regulator (see, e.g., Kydland and Prescott 1977, on commitment and credibility).


  1. The Stern Review (Stern 2007), and the following discussion about appropriate social discount rates in cost-benefit analysis, indicates that the social discount rate may be below capital market interest rates, at least in the case of climate change (Weitzman 2007; Tol and Yohe 2006).

  2. As pointed out by Di Maria et al. (2017), the time lags of environmental policy may be substantial; cf., e.g., the Kyoto Protocol, which was signed in 1997, entered into force in 2005, and had its first commitment period in 2008.

  3. One unit of resource (e.g., a barrel of oil) cannot be extracted and sold twice. As shown by Hotelling (1931), intertemporal profit maximization involves that marginal present value profits from extraction are equalized through time. Because future emission taxes reduce profits from future resource extraction, announcement of such taxes causes the resource owners to move extraction forward in time. This mechanism was recognized early on by Sinclair (1992), who pointed out that present value carbon taxes should decline over time, as increasing carbon taxes accelerates emissions. Using similar reasoning, Sinn (2008) argues that demand-side climate policies might increase emissions, at least in the short run, and terms this effect the “green paradox.” There is a large literature following up on this phenomenon (see Jensen et al. 2015, for a survey).

  4. There are several key differences between the present paper and Vogt-Schilb et al. (2018). For example, the present paper features exhaustible resources, examine dynamic effects of future taxes, and derive the taxes and subsidies that can implement the socially optimal time trajectory in competitive equilibrium.

  5. According to Bloomberg (March 17, 2016), the combined market capitalization of US coal miners since 2011 has plunged from over $70 billion to barely $6 billion. In the past two years, at least six US coal-mining companies have filed for bankruptcy. Their struggle to find rescue in the financial and capital markets underscores Wall Street’s vanishing interest in coal companies (

  6. The International Energy Administration (IEA) states, referring to the 2015 Paris Climate Conference, that climate policy has emerged as a major driver for the future of coal in large parts of the world (

  7. A well-known result by Herfindahl (1967) states that the low cost exhaustible resource should be exploited first if exhaustible resources differ by their cost of extraction. Chakravorty et al. (2006) shows that fossil fuels may return to the fuel mix when the emission ceiling is no longer binding.

  8. The strict convexity of \(\chi ^{i}\left( \cdot \right) \) is a key assumption in the literature on adjustment costs referred in Sect. 1. It implicitly assumes that at least one factor used for expanding capacity is scarce. Joskow and Parsons (2009) point out that the human and manufacturing infrastructure required to produce major nuclear plant components, perform detailed engineering, and construct new nuclear plants is limited. Hence, a surge in nuclear plant orders will run up against capacity constraints on the supply of key components and labor, leading to higher component manufacturing costs and higher construction costs. Regarding petroleum, Osmundsen et al. (2015) and Skjerpen et al. (2018) finds that increased capacity utilization in the rig market increases the rig rates and, hence, the cost of capacity construction in the Gulf of Mexico and on the Norwegian continental shelf, respectively. Last, the modern-day gold rush of oil companies and contractors converging on western Canada’s oil-sands markets bogged down as high materials costs and outstripped labor resources forced project delays and budget overruns around the year 2007 (see

  9. The costs of decommissioning power plants depends, e.g., on the extent of environmental remediation required, the physical location of the plant, and the potential salvage value of equipment and scrap; see, e.g., Raimi (2017).

  10. Business as usual emissions solve \(k_{e_{t}^{i}}^{i}\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i,BaU}\right) =0\) (for any levels of \(x_{t}^{i}\) and \(Y_{t}^{i}\)). I also assume that \(k_{e_{t}^{i}}^{i}(\cdot )>0\) for \(e_{t}^{i}>e_{t}^{i,BaU}\), because a unique solution requires emission intensities above business as usual to be costly. The requirements for the hessian matrix associated with the firms’ Hamiltonian to be negative definite are \(k_{x_{t}^{i}x_{t}^{i}}^{i}(\cdot )>0\), \(\chi _{y_{t}^{i}y_{t}^{i}}^{i}(\cdot )>0\) and \(k_{x_{t}^{i}x_{t}^{i}}^{i}(\cdot )k_{e_{t}^{i}e_{t}^{i}}^{i}(\cdot )>\left( k_{x_{t}^{i}e_{t}^{i}}^{i}(\cdot )\right) ^{2}\).

  11. The modeling of abatement within a particular type of technology as a pure flow activity abstracts from the fact that most types of abatement action requires some sort of investment. This is common in the economic literature (cf., e.g., Nordhaus 1991, 1992).

  12. A framework where extraction costs increase with accumulated extraction is frequently used in the resource economics literature; see, e.g., Heal (1976), Hanson (1980) and Hoel (2012). Economic exhaustibility is arguably the relevant condition for most scarce energy resources. For example, before enhanced oil recovery (EOR), typically only around 30% of the oil in the reservoir has been recovered and around 70% remains in the ground. In some fields, recovery rates greater than 60% have been achieved using advanced EOR (e.g. Prudhoe Bay in Alaska); see

  13. We may have idle capacity if the ratio \(x_{t}^{i}/Y_{t}^{i}\) is low. The model abstracts from hourly, daily and seasonal variations in electricity demand (which is relevant for the transition from relatively flexible fossil fuel plants to less flexible renewable or nuclear energy). The capacity measure \(Y_{t}^{i}\) must account for power plant downtime caused by maintenance or weather conditions (renewables); see "Appendix C" on the numerical model.

  14. Stock damage is most relevant for carbon and sulfur dioxides, but associate emissions may cause flow damages. For example, coal plants also emit nitrogen oxides and particulate matter which causes smog. The theoretical framework omits flow damages for simplicity.

  15. Many countries tax or subsidize power plants based on their energy source; see, e.g., EIA (2018) for US energy-specific subsidies in the US fiscal years 2010, 2013 and 2016.

  16. Capacity \(Y_{\overline{t}}^{i}\) is endogenously determined by the intertemporal optimization problem. Note that \(\bar{t}\) may be arbitrarily far into the future.

  17. I.e., we have strict inequalities in Eqs. (7a), (7b) or (7f) if and only if \(x_{t}^{i}=0\), \(x_{t}^{i}=0\) or \(e_{t}^{i}=0\), respectively. Whereas \(y_{t}^{i}=0\) and \(\theta _{t}^{i}\ne 0\) are necessary conditions for a strict inequality in Eq. (7c), \(y_{t}^{i}=0\) is also a possible interior solution.

  18. We always have an interior solution \(\lambda _{t}^{i,*}=\chi _{y_{t}^{i}}^{i}\left( \cdot \right) \) if \(\theta _{t}^{i}=0\) in Lemma 1. We have \(\lambda _{t}^{i}<(>)0\) if capacity \(Y^{i}\) declines (increases) over time. The first order condition for \(y_{t}^{i}\) then states that \(\chi _{y}^{i}\left( \cdot \right) <(>)0\), implying that \(y_{t}^{i}<(>)0\) because \(\chi ^{i}\left( \cdot \right) \) is strictly convex with minimum at \(\chi ^{i}\left( 0\right) =0\) (for an interior solution).

  19. See also Grout (2003), and Fredrick et al. (2002) for an overview and discussion of the literature on time discounting and time preference.

  20. Mechanism (b) requires the low-emissions sectors to have sufficiently low emission intensity relative to the emission intensive sectors. The reason is that the future emission tax applies to all sectors (which in itself reduces the value of future production capacity for all sectors with positive emissions).

  21. The abatement cost of a sector also matters here. Consider, e.g., a sector with relatively high BaU emissions and cheap abatement possibilities. This emission intensive sector may invest in capacity (and hence increase early emissions) because its’ cheap abatement opportunities gives it a comparative advantage when the future emission tax is enacted.

  22. An alternative setting is the case where the only available policy instrument is investment subsidies \(\theta _{t}^{i}>0\); i.e., the regulator is constrained to \(\tau _{t}=0\). We see from Lemma 1 that such a policy causes the electricity price to be below the price along the socially optimal time trajectory. The reason is that emission pricing reduces emissions by increasing the operating costs of emission intensive power plants, whereas renewable investment subsidies increase the capacity of low emission power plants. See Abrell et al. (2019) about subsidies to renewables versus emission pricing.

  23. Whereas we know from game theory that commitment to strategies involving suboptimal payoffs in single periods may be possible (given that present value payoff is increased by this strategy), the presence of a finite time horizon in the present model setup poses challenges; see., e.g., Osborne and Rubinstein (1994), pp. 134–136 and 155–160.

  24. See Dirkse and Ferris (1995) and ’’ for information about the Path solver and GAMS.

  25. Petroleum constituted less than 1%. In the “renewables” category we have the following shares: Hydro = 6.5%, biomass = 1.5%, geothermal = 0.4%, solar = 0.9% and wind = 5.6%. Figures are for net electricity generation. Emissions from the electric power industry constituted about 35% of US energy-related CO2 emissions in 2016. See

  26. GDP enters as a significant explanatory variable in the econometric modeling of US electricity demand. The demand function in the numerical simulations is kept at the level corresponding to GDP in 2016 over the whole time horizon, however. The reason is twofold: First, US electricity demand (and emissions) grows gigantic in the later modeling years if we extrapolate the trends in electricity demand since the 1950s. Second, the picture has been somewhat different the last decade or so: From 2005 to 2016, US GDP has increased with 16%, whereas US electricity consumption has increased with 0.45% (IMF World Economic Outlook Database; EIA November 2017 Monthly Energy Review).

  27. Nadiri and Prucha (1993) estimates the depreciation rates for physical and R&D capital in the US manufacturing sector to 0.059 and 0.12, respectively.

  28. The model does not feature retrofitting existing coal plants with CCS technology. According to EIA, the main application of CO2 capture in the long term is expected to be at new power plants ( Further, CO2 coefficients (measured in kilograms per MBtu) for key types of coal are: anthracite 104; bituminous, 93; lignite, 97; subbituminous, 98; see Hence, the emission reductions available by fuel switching are not that substantial.

  29. According to the Obama Administration (US presidential administration from 2009 to 2017), the United States intended to roughly double its pace of carbon pollution reduction, from 1.2% per year on average during the period 2005–2020 to 2.3–2.8% per year on average between 2020 and 2025. This target was grounded in analysis of cost-effective carbon pollution reductions achievable under existing law and was intended to keep the US on the pathway to achieve deep economy-wide reductions of 80% or more by 2050. The Trump administration succeeding Barrack Obama has implemented several changes that roll back Obama-era policies aiming to curb climate change, however. See, e.g.,

  30. This result also appeared in a previous version of the numerical model with flow damages only (no emission stock).

  31. Coulomb et al. (2019) finds that gas can reduce the need for renewables in the short term (their model does not include CCS).

  32. We know from Montgomery (1972) that emission taxes and quotas are equivalent in a setting without uncertainty. Figure 2 could therefore also be interpreted as graphing optimal emission quotas for various assumptions about investment costs (the ’Pigou tax’ is then the endogenous quota price).

  33. I do not calculate \(\phi _{t}^{gas,sp}\) explicitly, but the numerical simulations suggests that it is small (the shadow price on gas and petroleum resources is relatively small compared to the other cost elements).

  34. The second-best tax in Fig. 4 is an approximation obtained by comparing welfare levels in the numerical model for different values on \(\varPhi _{t}\).

  35. The presence of endogenous technological change in the form of learning by doing pulls in the direction of more early abatement; see, e.g., and Bramoullé and Olson (2005), and Kverndokk and Rosendahl (2007).

  36. The very small increase in emissions in 2016 is caused by resource scarcity dynamics (mechanism (c)). The capacity constraint mechanisms (a) and (b) dominate thereafter. Representative lead times for new power plants are, e.g., 2–4 years for renewables, 2–3 years for gas, around 4 years for coal, and 6 years for nuclear (EIA 2017). Major maintenance decisions and investment in new equipment, like new hydro turbines or replacing old boilers with new steam generators may, however, affect capacity significantly faster.

  37. I use the following data sources: Electricity prices and consumption: Energy Information Administration (EIA) (; US GDP: IMF World Economic Outlook Database (; gas, oil and coal prices: British Petroleum (; wage index: US social security administration (; Interest rate: Federal reserve (; Inflation: (

  38. NEB:; BP:

  39. CCS has the potential to reduce CO2 emissions from a coal or natural gas-fueled power plant by 90%; see


  • Abrell J, Rausch S, Streitberger C (2019) The economics of renewable energy support. J Public Econ 176:94–117

    Google Scholar 

  • Acemoglu D, Philippe A, Leonardo B, David H (2012) The environment and directed technical change. Am Econ Rev 102(1):131–166

    Google Scholar 

  • Amigues J-P, Favard P, Gaudet G, Moreaux M (1998) On the optimal order of natural resource use when the capacity of the inexhaustible substitute is limited. J Econ Theory 80(1):153–170

    Google Scholar 

  • Amigues J-P, Kama AAL, Moreaux M (2015) Equilibrium transitions from non-renewable energy to renewable energy under capacity constraints. J Econ Dyn Control 55:89–112

    Google Scholar 

  • Bellofatto AA, Besfamille M (2018) Regional state capacity and the optimal degree of fiscal decentralization. J Public Econ 159:225–243

    Google Scholar 

  • Bramoullé Y, Olson LJ (2005) Allocation of pollution abatement under learning by doing. J Public Econ 89(9):1935–1960

    Google Scholar 

  • Chakravorty U, Magne B, Moreaux M (2006) A hotelling model with a ceiling on the stock of pollution. J Econ Dyn Control 30(12):2875–2904

    Google Scholar 

  • Chakravorty U, Moreaux M, Tidball M (2008) Ordering the extraction of polluting nonrenewable resources. Am Econ Rev 98(3):1128–1144

    Google Scholar 

  • Caparros A, Just R Zilberman (2015) Dynamic relative standards versus emission taxes in a putty-clay model. J Assoc Environ Resour Econ 2(2:277–308

    Google Scholar 

  • Coulomb R, Lecuyer O, Vogt-Schilb A (2019) Optimal transition from coal to gas and renewable power under capacity constraints and adjustment costs. Environ Resour Econ 73:557–590

    Google Scholar 

  • Di Maria C, Smulders S, van der Werf E (2017) Climate policy with tied hands: optimal resource taxation under implementation lags. Environ Resour Econ 66(3):537–551

    Google Scholar 

  • Dirkse SP, Ferris MC (1995) The path solver: a non-monotone stabilization scheme for mixed complementarity problems. Optim Methods Software 5(2):123–156

    Google Scholar 

  • EIA (2016) Capital cost estimates for utility scale electricity generating plants, Technical report. US, Energy Information Administration, Washington DC

  • EIA (2017) Cost and performance characteristics of new generating technologies, Annual Energy Outlook 2017, Technical report. US, Energy Information Administration, Washington DC

  • EIA (2018) Direct federal financial interventions and subsidies in energy in fiscal year 2016, Technical report. US, Energy Information Administration, Washington DC

  • Fredrick S, Loewenstein G, O’Donoghue T (2002) Time discounting and time preference: a critical review. J Econ Literat 40(2):351–401

    Google Scholar 

  • Gerlagh R (2011) Too much oil. CESifo Econ Stud 57(1):79–102

    Google Scholar 

  • Gould JP (1968) Adjustment costs in the theory of investment of the firm. Rev Econ Stud 35(1):47–55

    Google Scholar 

  • Golosov M, Hassler J, Krusell P, Tsyvinski A (2014) Optimal taxes on fossil fuel in general equilibrium. Econometrica 82(1):31–88

    Google Scholar 

  • Goulder LH, Mathai K (2000) Optimal CO2 abatement in the presence of induced technological change. J Environ Econ Manag 39(1):1

    Google Scholar 

  • Goulder LH, Williams RC III (2012) The choice of discount rate for climate change policy evaluation. Discussion paper 12–43. Resources for the Future, Washington, DC

  • Grout PA (2003) Public and private sector discount rates in public-private partnerships. Econ J 113(486):C62–68

    Google Scholar 

  • Hart R (2004) Growth, environment and innovation: a model with production vintages and environmentally oriented research. J Environ Econ Manag 48:1078–98

    Google Scholar 

  • Hanson DA (1980) Increasing extraction costs and resource prices: some further results. Bell J Econ 11(1):335–342

    Google Scholar 

  • Heal G (1976) The relationship between price and extraction cost for a resource with a backstop technology. Bell J Econ 7(2):371–378

    Google Scholar 

  • Herfindahl OC (1967) Depletion and economic theory. In: Gaffney M (ed) Extractive resources and taxation. University of Wisconsin Press, Madison, pp 63–90

    Google Scholar 

  • Hoel M (2012) Carbon taxes and the green paradox. In: Hahn R, Ulph A (eds) Climate change and common sense: essays in honor of Tom Schelling. Oxford University Press, Oxford chapter 11

    Google Scholar 

  • Holland SP (2003) Extraction capacity and the optimal order of extraction. J Environ Econ Manag 45(3):569–588

    Google Scholar 

  • Holt CC, Modigliani F, Muth JF, Simon HA (1960) Planning production, inventories, and work force. Prentice-Hall, New Jersey

    Google Scholar 

  • Hotelling H (1931) The economics of exhaustible resources. J Polit Econ 39(2):137–175

    Google Scholar 

  • IPCC (2012) Special report on renewable energy sources and climate change mitigation. Technical report, Intergovernmental Panel on Climate Change

  • Jensen S, Mohlin K, Pittel K, Sterner T (2015) An introduction to the green paradox: the unintended consequences of climate policies. Rev Environ Econ Policy 9(2):246–265

    Google Scholar 

  • Joskow PL, Parsons JE (2009) The economic future of nuclear power. Daedalus 138(4):45–59

    Google Scholar 

  • Kemp MC, Van Long N (1980) On two folk theorems concerning the extraction of exhaustible resources. Econometrica 48(3):663–673

    Google Scholar 

  • Kverndokk S, Rosendahl KE (2007) Climate policies and learning by doing: Impacts and timing of technology subsidies. Resour Energy Econ 29:58–82

    Google Scholar 

  • Kumar N, Besuner P, Lefton S, Agan D, Hilleman D (2012) Power Plant Cycling Costs. Technical report, NREL Technical Monitor, Subcontract report NREL/SR-5500-55433 July 2012

  • Kydland FE, Prescott EC (1977) Rules rather than discretion: the inconsistency of optimal plans. J Polit Econ 85(3):473–491

    Google Scholar 

  • Kydland FE, Prescott EC (1982) Time to build and aggregate fluctuations. Econometrica 50:1345–1370

    Google Scholar 

  • Lucas R (1976) Optimal investment policy and the flexible accelerator. Int Econ Rev 8(1):78–85

    Google Scholar 

  • Montgomery WD (1972) Markets in licenses and efficient pollution control programs. J Econ Theory 5(3):395–418

    Google Scholar 

  • Nadiri MI, Prucha IR (1993) Estimation of the depreciation rate of physical and R&D capital in the US total manufacturing sector. NBER working paper series. WP. no. 4591, National Bureau of Economic Research

  • Nordhaus WD (1991) To slow or not to slow: the economics of the greenhouse effect. Econ J 101(407):920–937

    Google Scholar 

  • Nordhaus WD (1992) An optimal transition path for controlling greenhouse gases. Science 258(5086):1315–1319

    Google Scholar 

  • Nordhaus WD (2007) A review of the Stern review on the economics of climate change. J Econ Literat 45(3):686–702

    Google Scholar 

  • Oi WY (1962) Labor as a quasi-fixed factor. J Polit Econ 70(6):538–538

    Google Scholar 

  • Osborne MJ, Rubinstein A (1994) A course in game theory. MIT Press, Cambridge

    Google Scholar 

  • Osmundsen P, Rosendahl KE, Skjerpen T (2015) Understanding rig rate formation in the Gulf of Mexico. Energy Econ 49:430–439

    Google Scholar 

  • Popp D (2004) ENTICE: endogenous technological change in the DICE model of global warming. J Environ Econ Manag 48(1):742–768

    Google Scholar 

  • Raimi D (2017) Decommissioning US Power Plants. Decisions, Costs, and Key Issues’. RFF Report, (October 2017) Resources for the Future., Washington, DC

  • Rozenberg J, Vogt-Schilb A, Hallegatte S (2019) Instrument choice and stranded assets in the transition to clean capital. Forthcoming J Environ Econ Manag 100:102183

    Google Scholar 

  • Sinclair PJN (1992) High does nothing and rising is worse: carbon taxes should keep declining to cut harmful emissions. Manchester School Econ Social Stud 60(1):41–52

    Google Scholar 

  • Sinn H-W (2008) Public policies against global warming: a supply side approach. Int Tax Public Finance 15(4):360–394

    Google Scholar 

  • Skjerpen T, Rosendahl KE, Osmundsen P, Storrøsten HB (2018) Modelling and forecasting rig rates on the Norwegian Continental Shelf. Resour Energy Econ 53:220–239

    Google Scholar 

  • Stern NH (2007) The economics of climate change: the Stern review. Cambridge University Press, Cambridge

    Google Scholar 

  • Sydsæter K, Hammond P, Seierstad A, Strøm A (2008) Further mathematics for economic analysis. Pearson Education Limited, Harlow

    Google Scholar 

  • Tol RSJ, Yohe GW (2006) A review of the stern review. World Econ 7(4):233–50

    Google Scholar 

  • Tong D, Zhang Q, Zheng Y, Caldeira K, Shearer C, Hong C, Qin Y, Davis SJ (2019) Committed emissions from existing energy infrastructure jeopardize 1.5 \(\circ \)C climate target. Nature.

    Article  Google Scholar 

  • van der Ploeg F, Withagen C (2012) Too much coal, too little oil. J Publ Econ 96(12):62–77

    Google Scholar 

  • Vogt-Schilb A, Meunier G, Hallegatte S (2018) When starting with the most expensive option makes sense: optimal timing, cost and sectoral allocation of abatement investment. J Environ Econ Manag 88:210–233

    Google Scholar 

  • Weitzman ML (1974) Prices versus quantitites. Rev Econ Stud 41:477–491

    Google Scholar 

  • Weitzman ML (2007) A review of the stern review on the economics of climate change. J Econ Literat 45(3):703

    Google Scholar 

  • Wickens M (2008) Macroeconomic theory. A dynamic general equilibrium approach. Princeton University Press, Princeton

    Google Scholar 

Download references


Open Access funding provided by Statistics Norway.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Halvor B. Storrøsten.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


Appendix A: Figures

This appendix presents figures referred to in the text.

I have tested the model fit by running the model from 2005. Figure 7 shows model projections and historic figures for the period 2005–2015. Production levels, investment levels and electricity prices are endogenous. The simulation features actual values for GDP, coal prices and gas prices (I use constant values based on historic averages after 2015 in Sect. 3). All in all the model does reasonably well, but it struggles with the shale gas revolution. Specifically, the supply of coal is too high, whereas the supply of gas is too low in the last years of the sample. Figure 7 also graphs results from a sensitivity analysis w.r.t. the lag between tax announcement and tax implementation (x-axis). We observe that emissions in the time period between tax announcement and tax implementation decrease in all cases; unless the tax is implemented in the next year 2017 (cf., the one-period lag caused by Eq. 1).

The left diagram in Fig. 8 replicates Fig. 1 in Sect. 3.2 in the case of fast technical change, modeled as a 5% yearly reduction in the investment cost parameters \(k_{1}^{ren}\) and \(k_{2}^{ren}\). We observe that, even though a slower change in the energy mix is optimal in the presence of very fast technological progress, a large share if the investment still occurs in the early years. The right diagram in Fig. 8 is a sensitivity analysis w.r.t. resource scarcity (the scarcity parameter \(c_{6}^{i}\) is 0, 105 and 210 in the simulations denoted with 0, 1 and 2, respectively). Note that the social cost of carbon decreases in resource scarcity.

Figure 9 examines the socially optimal time trajectory in the case of an emerging clean future technology (FuTech). I have omitted CCS and used T = 70 in this simulation. Further, I allow FuTech to produce 25 GWh per year in the period 2016–2024. This was necessary for the numerical model to solve. The left diagram in Fig. 9 replicates Fig. 1 in Sect. 3.2. The right diagram graphs production levels (both with FuTech present).

Figure 10 graphs sensitivity results in the case of very high decommissioning costs (\(k_{3}^{i}=100{,}000\)).

Fig. 7
figure 7

Left diagram: Sensitivity w.r.t. timing of tax implementation. Effects of tax announcement on production by source (left axis) and emissions (right axis) summed over the relevant number of years (given along the x-axis) before the tax is implemented. Tax minus no-tax simulation values. The emission tax is zero for the period 2016–2024 and 50 USD per ton CO2 thereafter. Right diagram: Model fitted equilibrium values (dotted lines) and actual historic values (solid lines)

Fig. 8
figure 8

Left diagram: The socially optimal time trajectory with fast renewable technology growth. Net investment by technology (left axis) and optimal emission taxes (right axis). Right diagram: Emissions (bars) and the social cost of carbon (lines) when scarcity is zero (0), baseline (1) or doubled (2)

Fig. 9
figure 9

The socially optimal time trajectory with a clean technology revolution (FuTech available in 2025). Left diagram: Net investment and optimal tax. Right diagram: Production levels

Fig. 10
figure 10

Simulation with very high decommissioning costs (less reversible investment). Left diagram: net investment by technology (left axis) and optimal emission taxes (right axis) along the socially optimal time trajectory with very high decommissioning costs. Right diagram: comparison of reference model vs. simulation with very high decommissioning costs (irreversibility)

Fig. 11
figure 11

Changes in shadow prices following tax announcement. Left diagram: Change in shadow prices on capacity (\(\lambda \)) following tax announcement (tax minus no-tax simulation). Right diagram: Shadow prices on cumulative production (\(\mu \)) in tax and no-tax simulations (rents are zero for ’nuc’ and ’ren’)

Fig. 12
figure 12

Sensitivity w.r.t. investment costs. Effects of tax announcement on production by technology (left axis) and emissions (right axis) summed over the 9 years before the tax is implemented (2016–2024). Tax minus no-tax simulation values. The emission tax is zero for the period 2016–2024 and 50 USD per ton CO2 thereafter

Appendix B: Proofs and Derivations

Derivation of Lemma 1. Representative firm \(i\in I\) solves (5) s.t. Eqs. (1), (2) and (3). The associated present value Hamiltonian is:

$$\begin{aligned} H^{x}={\left\{ \begin{array}{ll} \begin{array}{l} \delta ^{t-1}\left[ p_{t}^{i}x_{t}^{i}-c^{i}\left( z_{t}^{i},Y_{t}^{i},e_{t}^{i}\right) -\chi ^{i}\left( y_{t}^{i}\right) -\theta _{t}^{i}y_{t}^{i}-e_{t}^{i}\tau _{t}\right] +\hat{\lambda }_{t}^{i}\left( \beta Y_{t}^{i}+y_{t}^{i}\right) ,\\ \delta ^{\overline{t}-1}\left[ p_{\overline{t}}^{i}x_{\overline{t}}^{i}-c^{i}\left( x_{\overline{t}}^{i},Y_{\overline{t}}^{i},e_{\overline{t}}^{i}\right) -\chi ^{i}\left( y_{\overline{t}}^{i}\right) -\theta _{\overline{t}}^{i}y_{\overline{t}}^{i}-e_{\overline{t}}^{i}\tau _{\overline{t}}^{i}\right] , \end{array} &{} \begin{array}{l} \forall i\le \tilde{i},\forall t\ne \bar{t},\\ \forall i\le \tilde{i}, \end{array}\end{array}\right. } \end{aligned}$$


$$\begin{aligned} H^{x}={\left\{ \begin{array}{ll} \begin{array}{l} \delta ^{t-1}\left[ p_{t}^{i}x_{t}^{i}-c^{i}\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i},S_{t}^{i}\right) -\chi ^{i}\left( y_{t}^{i}\right) -\phi _{t}^{i}x_{t}^{i}-\theta _{t}^{i}y_{t}^{i}-e_{t}^{i}\tau _{t}\right] +\hat{\lambda }_{t}^{i}\left( \beta Y_{t}^{i}+y_{t}^{i}\right) +\hat{\mu }_{t}^{i}\left( S_{t}^{i}-x_{t}^{i}\right) ,\\ \delta ^{\overline{t}-1}\left[ p_{\overline{t}}^{i}x_{\overline{t}}^{i}-c^{i}\left( x_{\overline{t}}^{i},Y_{\overline{t}}^{i},e_{\overline{t}}^{i},S_{\overline{t}}^{i}\right) -\phi _{\overline{t}}^{i}x_{\overline{t}}^{i}-\chi ^{i}\left( y_{\overline{t}}^{i}\right) -\theta _{\overline{t}}^{i}y_{\overline{t}}^{i}-e_{\overline{t}}^{i}\tau _{\overline{t}}^{i}\right] , \end{array} &{} \begin{array}{l} \forall i>\tilde{i},\forall t\ne \bar{t,}\\ \forall i>\tilde{i}, \end{array}\end{array}\right. } \end{aligned}$$

where \(\hat{\lambda }_{t}^{i}\) and \(\hat{\mu }_{t}^{j}\) are the shadow prices on production capacity \(Y_{t}^{i}\) and the resource stock \(S_{t}^{j}\), respectively. The maximum principle for discrete time optimization states that the solution to (5) must satisfy the following necessary conditions for all \(i\in I\) (see, e.g., Sydsæter et al. 2008, p. 445):

$$\begin{aligned} H_{x_{t}^{i}}^{x}= & {} p_{t}^{i,*}-c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) \le 0,\ \forall i\le \tilde{i}, \forall t,\end{aligned}$$
$$\begin{aligned} H_{x_{t}^{i}}^{x}= & {} p_{t}^{i,*}-c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) -\phi _{t}^{i}-\hat{\mu }_{t}^{i}/\delta ^{t-1}\le 0,\ \forall i>\tilde{i,} \forall t,\end{aligned}$$
$$\begin{aligned} H_{e_{t}^{i}}^{x}= & {} -\tau _{t}-c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) \le 0,\ \forall i, \forall t,\end{aligned}$$
$$\begin{aligned} H_{y_{t}^{i}}^{x}= & {} -\delta ^{t-1}\chi _{y_{t}^{i}}^{i}\left( y_{t}^{i*}\right) -\theta _{t}^{i}+\hat{\lambda }_{t}^{i,*}=0\ \forall i, \forall t,\end{aligned}$$
$$\begin{aligned} \hat{\lambda }_{t-1}^{i,*}= & {} H_{Y_{t}^{i}}^{x}=-\delta ^{t-1}c_{Y_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\beta \hat{\lambda }_{t}^{i,*},\ \forall i, \forall t\ne \overline{t},\end{aligned}$$
$$\begin{aligned} \hat{\mu }_{t-1}^{j,*}= & {} H_{S_{t}^{j}}^{x}=-\delta ^{t-1}c_{S_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\hat{\mu }_{t}^{i,*},\ \forall i>\tilde{i}, \forall t,\end{aligned}$$
$$\begin{aligned} \hat{\lambda }_{\overline{t}}^{i,*}= & {} 0,\,\hat{\mu }_{\overline{t}}^{j,*}\ge 0 (=0\text { if } S_{\bar{t}}^{j}>0\text {)}, \quad \forall i, \end{aligned}$$

where (9g) is the transversality conditions for the state variables \(Y_{\overline{t}}^{i}\) (free) and \(X_{\overline{t}}^{i}\) (non-negative). Finite partial derivatives in the utility function and \(lim_{S_{t}^{i}\rightarrow 0}h\left( S_{t}^{i}\right) =\infty \) implies that \(\hat{\mu }_{\overline{t}}^{j,*}=0\) in equation (9g). The assumptions imposed on \(c^{i}\left( \cdot \right) \) and \(\chi ^{i}\left( \cdot \right) \) ensure that the Hamiltonian is concave along the optimal trajectory for all \(t\in T\) . Hence, the necessary conditions above, and the state movement equations (1) and (2), are also sufficient to solve (5) (cf., Arrow’s sufficiency theorem).

The solution to \(\hat{\lambda }_{t-1}^{i}=-\delta ^{t-1}c_{Y_{t}^{i}}^{i}\left( \cdot \right) +\beta \hat{\lambda }_{t}^{i}\) in (9e) is \(\hat{\lambda }_{t}^{i}=\frac{\hat{\lambda }_{0}^{i}}{\beta ^{t}}+\sum _{r=t+1}^{r=\bar{t}}\frac{\delta ^{r-1}}{\beta ^{t-r+1}}c_{Y_{r}^{i}}^{i}\left( \cdot \right) \). The transversality condition \(\hat{\lambda }_{\overline{t}}^{i}=0\) then implies \(\hat{\lambda }_{0}^{i}=-\sum _{r=1}^{r=\bar{t}}\left( \beta \delta \right) ^{r-1}c_{Y_{r}^{i}}^{i}\left( \cdot \right) \). Inserting in the equation for \(\hat{\lambda }_{t}^{i}\) above yields \(\hat{\lambda }_{t}^{i}=-\sum _{r=t+1}^{r=\bar{t}}\delta ^{r-1}\beta ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \cdot \right) \) (\(t<\overline{t}\)). The current value shadow price on capacity is then given by:

$$\begin{aligned} \lambda _{t}^{i,*}\equiv \frac{\hat{\lambda }_{t}^{i,*}}{\delta ^{t-1}} =-\delta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \delta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i} \left( \varvec{z}_{r}^{i,*}\right) , \end{aligned}$$

with \(\lambda _{\overline{t}}^{*}=0\).

The solution to \(\hat{\mu }_{t-1}^{i}=-\delta ^{t-1}c_{S_{r}^{j}}^{i}\left( \cdot \right) +\hat{\mu }_{t}^{i}\) in (9f) is \(\hat{\mu }_{t}^{i}=\hat{\mu }_{0}^{i}+\sum _{r=1}^{r=\overline{t}}\delta ^{r-1}c_{S_{r}^{j}}^{i}\left( \cdot \right) \) for \(t<\overline{t}\). The transversality condition \(\hat{\mu }_{\overline{t}}^{i}=0\) then implies \(\hat{\mu }_{0}^{i}=-\sum _{r=1}^{r=\overline{t}}\delta ^{r-1}c_{S_{r}^{j}}^{i}\left( \cdot \right) \). Hence, we have \(\hat{\mu }_{t}^{i}=-\sum _{r=t+1}^{\overline{t}}\delta ^{r-1}c_{S_{r}^{j}}^{i}\left( \cdot \right) \) (\(t<\overline{t}\)). The current value shadow price on the resource stock \(S_{t}^{i}\) is then given by:

$$\begin{aligned} \mu _{t}^{i,*}\equiv \frac{\hat{\mu }_{t}^{i,*}}{\delta ^{t-1}} =-\sum _{r=t+1}^{\overline{t}}\delta ^{r-t}c_{S_{r}^{i}}^{i}\left( x_{r}^{i,*}, Y_{r}^{i,*},S_{r}^{i,*}\right) =-\sum _{r=t+1}^{\overline{t}} \delta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,*}\right) x_{r}^{i,*}, t<\overline{t}, \end{aligned}$$

with \(\mu _{\overline{t}}^{i,*}=0\).

Multiply (9a) and (9b) with \(\delta ^{t-1}\) and insert the consumer’s first order condition \(u_{x_{t}^{i}}\left( \cdot \right) =p_{t}^{i}\) from (6). Lemma 1 then follows from the equations system (9a), (9b), (9c), (9d), (10) and (11).

Proof of Proposition 1

A benevolent social planner maximizes the present value of welfare maximizing Eq. (8) subject to Eqs. (1), (2), (3), (6) and (4) with no constraints on the state variables in the last period. The maximization is carried out with respect to all \(i\in I\). The Hamiltonian is:

$$\begin{aligned} H^{W}={\left\{ \begin{array}{ll} \begin{array}{l} \zeta ^{t-1}\left[ u\left( \varvec{x}_{t}\right) -d\left( E_{t}\right) -\sum _{i\in I}\left[ c^{i}\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i}\right) +\chi ^{i} \left( y_{t}^{i}\right) \right] \right] \\ +\sum _{i\in I}\left[ \hat{\lambda }_{t}^{i}\left( \beta Y_{t}^{i}+y_{t}^{i}\right) +\hat{\gamma }_{t}\left( \alpha E_{t}+\sum _{i\in I}e_{t}^{i}\right) \right] , \;\forall i\le \tilde{i}, \forall t\ne \bar{t},\\ \zeta ^{\overline{t}-1}\left[ u\left( \varvec{x}_{\overline{t}}\right) -d\left( E_{\overline{t}}\right) -\sum _{i\in I}\left[ c^{i}\left( x_{\overline{t}}^{i}, Y_{\overline{t}}^{i},e_{\overline{t}}^{i}\right) +\chi ^{i}\left( y_{\overline{t}}^{i} \right) \right] \right] ,\;\forall i\le \tilde{i}, \end{array}\end{array}\right. } \end{aligned}$$


$$\begin{aligned} H^{W}={\left\{ \begin{array}{ll} \begin{array}{l} \zeta ^{t-1}\left[ u\left( \varvec{x}_{t}\right) -d\left( E_{t}\right) -\sum _{i\in I}\left[ c^{i}\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i},S_{t}^{i}\right) +\chi ^{i}\left( y_{t}^{i}\right) \right] \right] \\ +\sum _{i\in I}\left[ \hat{\lambda }_{t}^{i}\left( \beta Y_{t}^{i}+y_{t}^{i}\right) +\hat{\mu }_{t}^{i}\left( S_{t}^{i}-x_{t}^{i}\right) +\hat{\gamma }_{t}\left( \alpha E_{t} +\sum _{i\in I}e_{t}^{i}\right) \right] ,\;\forall i>\tilde{i}, \forall t \ne \bar{t},\\ \zeta ^{\overline{t}-1}\left[ u\left( \varvec{x}_{\overline{t}}\right) -d\left( E_{\overline{t}}\right) -\sum _{i\in I}\left[ c^{i}\left( x_{\overline{t}}^{i}, Y_{\overline{t}}^{i},e_{\overline{t}}^{i},S_{\overline{t}}^{i}\right) +\chi ^{i} \left( y_{\overline{t}}^{i}\right) \right] \right] ,\;\forall i>\tilde{i}. \end{array}\end{array}\right. } \end{aligned}$$

The socially optimal sequence \(\left\{ \varvec{x}_{t}^{sp},\varvec{y}_{t}^{sp},\varvec{e}_{t}^{sp}\right\} \) satisfies:

$$\begin{aligned} H_{x_{t}^{i}}^{W}= & {} \zeta ^{t-1}\left( u_{x_{t}^{i}}\left( \varvec{x}_{t}^{sp}\right) -c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) \right) \le 0,\ \forall i\le \tilde{i}, \forall t\\ H_{x_{t}^{i}}^{W}= & {}\,\zeta ^{t-1}\left( u_{x_{t}^{i}}\left( \varvec{x}_{t}^{sp}\right) -c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) \right) -\hat{\mu }_{t}^{i,sp}\le 0,\ \forall i>\tilde{i}, \forall t,\\ H_{e_{t}^{i}}^{W}= & {}\, \zeta ^{t-1}\left( -c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) \right) +\hat{\gamma }_{t}\le 0,\ \forall i, \forall t,\\ H_{y_{t}^{i}}^{W}= & {} \,-\zeta ^{t-1}\chi _{y_{t}^{i}}^{i}\left( y_{t}^{isp}\right) +\hat{\lambda }_{t}^{i,sp}\le 0,\ \forall t,\\ \hat{\lambda }_{t-1}^{i,sp}= & {}\, H_{Y_{t}^{i}}^{W,sp}=-\zeta ^{t-1}c_{Y_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) +\beta \hat{\lambda }_{t}^{i,sp},\ \forall t\ne \overline{t},\\ \hat{\mu }_{t-1}^{i,sp}= & {}\, H_{S_{t}^{i}}^{W,sp}=-\zeta ^{t-1}c_{S_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) +\hat{\mu }_{t}^{i,sp},\ \forall i>\tilde{i}, \forall t\ne \overline{t},\\ \hat{\gamma }_{t-1}^{sp}= & {} \,H_{E_{t}^{i}}^{W,sp}=-\zeta ^{t-1}d_{E_{t}}\left( E_{t}^{sp}\right) +\alpha \hat{\gamma }_{t}^{sp},\ \forall t\ne \overline{t},\\ 0= & {} \,\hat{\lambda }_{\overline{t}}^{i,sp}=\hat{\gamma }_{\overline{t}}^{sp},\hat{\mu }_{\overline{t}}^{j,sp}\ge 0 (=0\text { if }S_{\bar{t}}^{j}>0)\ \forall i\in I, \forall j\in J, \end{aligned}$$

where superscript sp denote the variable values along the social planner’s optimal time trajectory. The solution to \(\widehat{\gamma }_{t-1}^{sp}=-\zeta ^{t-1}d_{E_{t}}(E_{t})+\alpha \hat{\gamma }_{t}\) in the above equation system is \(\widehat{\gamma }_{t}=\frac{1}{\alpha ^{\overline{t}}}\hat{\gamma }_{0}+\sum _{r=1}^{r=t}\frac{\zeta ^{r-1}}{\alpha ^{t-r+1}}d_{E_{r}}(E_{r})\). The transversality condition \(\hat{\gamma }_{\overline{t}}=0\) then implies \(\hat{\gamma }_{0}=-\alpha ^{\overline{t}}\sum _{r=1}^{r=\overline{t}}\frac{\zeta ^{r-1}}{\alpha ^{t-r+1}}d_{E_{r}}(E_{r})\). Inserting in the equation for \(\hat{\gamma }_{t}\) above yields \(\hat{\gamma }_{t}=-\sum _{r=t+1}^{\overline{t}}\zeta ^{r-1}\alpha ^{r-t-1}d_{E_{r}}(E_{r})\) (\(t<\overline{t}\)). The current value shadow price on the emission stock \(E_{t}\) is then:

$$\begin{aligned} \gamma _{t}^{sp}\equiv \frac{\hat{\gamma }_{t}}{\zeta ^{t-1}}=-\zeta \sum _{r=t+1}^{r=\overline{t}}\left( \alpha \zeta \right) ^{r-t-1}d_{E}(E_{r}^{sp}), t<\overline{t,} \end{aligned}$$

with \(\gamma _{\overline{t}}^{sp}=0\). Inserting in the equation system with necessary conditions above and rearranging we get:

$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{sp}\right)&\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) ,\ \forall i\le \tilde{i}, \forall t,\\ u_{x_{t}^{i}}\left( \varvec{x}_{t}^{sp}\right)&\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,sp}\right) +\hat{\mu }_{t}^{i,sp},\ \forall i>\tilde{i}, \forall t,\\ \lambda _{t}^{i,*}&\le \chi _{y_{t}^{i}}^{i}\left( y_{t}^{i,sp}\right) ,\ \forall i, \forall t,\\ \hat{\gamma }_{t}&\le c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) ,\ \forall i, \forall t, \end{aligned}$$

with \(\lambda _{t}^{i,sp}\), \(\mu _{t}^{i,sp}\) and \(\gamma _{t}^{sp}\) given by equations (10), (11) and (12), and \(Y_{t}^{i,sp}\), \(E_{t}^{sp}\) and \(S_{t}^{i,sp}\) as given by equations (1), (4) and (2), respectively. These conditions state the well-known result that current marginal utility from consumption equals the sum of marginal production cost (including the shadow price \(\mu _{t}^{i,sp}\)) and marginal environmental damage. It is straightforward to show that the competitive equilibrium equals the socially optimal time trajectory with a tax equal to the social cost of carbon as given in Proposition 1 when \(\delta =\zeta \). Regarding the case with \(\delta >\zeta \), we observe that the investment taxes (\(\theta _{t}^{i}\)) and extraction taxes (\(\phi _{t}^{i}\)) causes the representative firms’ shadow values on future capacity (\(\lambda _{t}^{*,i}\)), and on future resource base (\(\mu _{t}^{*,i}\)), to be replaced with that of the social planner (\(\lambda _{t}^{sp,i}\) and \(\mu _{t}^{sp,i}\)). It follows that the firms investment and extraction decisions equals that of the social planner. \(\square \)

Appendix C: The Numerical Model

Let the set of electricity generation sources be \(I=\left\{ nuclear,renewables,coal,gas,CCS\right\} \), such that \(x_{t}^{i}\) denotes US electricity produced (and consumed) in year \(t\in T\) from energy source \(i\in I\). The subset of fossil fuel owners is \(J=\left\{ coal,gas,CCS\right\} \). The numerical model runs over the time horizon \(T=\left\{ 2016,2017,\ldots ,2115\right\} \) and uses the Path solver in GAMS solve the systems of equations in Appendix B as mixed complementarity problems.

The utility function from electricity consumption is given by \(u\left( \varvec{x}_{t}\right) =u_{1}\left( \sum _{i\in I}x_{t}^{i}\right) -\left( u_{2}/2\right) \left( \sum _{i\in I}x_{t}^{i}\right) ^{2}\), where \(u_{1}\) and \(u_{2}\) are estimated parameters. The first order condition associated with Eq. (6) implicitly yields the demand function, given this utility function. I estimate US electricity demand based on yearly figures for US electricity sales to ultimate customers and average yearly prices from the US Energy Information Administration (EIA) over the period 1990–2014, including GDP and the US Henry hub gas price in the regression.Footnote 37 I let the electricity price in this equation be endogenous and dependent on the US oil price (West Texas Intermediate) and the supply of electricity. The fitted equation system is (standard errors in parentheses):

$$\begin{aligned} \begin{array}{l} ElC\\ {\scriptstyle } \end{array}&\begin{array}{l} =\\ {\scriptstyle } \end{array}&\begin{array}{l} 1806\\ {\scriptstyle (185)} \end{array}\begin{array}{l} -\\ {\scriptstyle } \end{array}\begin{array}{l} 2.992\\ {\scriptstyle (1.39)} \end{array}\begin{array}{l} *ElP+\\ {\scriptstyle } \end{array}\begin{array}{l} 131.021\\ {\scriptstyle (4.49)} \end{array}\begin{array}{l} *GDP+\\ {\scriptstyle } \end{array}\begin{array}{l} 13.501\\ {\scriptstyle (3.33)} \end{array}\begin{array}{l} *GasP,\\ {\scriptstyle } \end{array}\end{aligned}$$
$$\begin{aligned} \begin{array}{c} ElP\\ {\scriptstyle } \end{array}&\begin{array}{c} =\\ {\scriptstyle } \end{array}&\begin{array}{c} 197.094\\ {\scriptstyle (7.05)} \end{array}\begin{array}{c} -\\ {\scriptstyle } \end{array}\begin{array}{c} 0.033\\ {\scriptstyle (.002)} \end{array}\begin{array}{c} *ElC+\\ {\scriptstyle } \end{array}\begin{array}{c} 0.313\\ {\scriptstyle (.030)} \end{array}\begin{array}{c} *OilP.\\ {\scriptstyle } \end{array} \end{aligned}$$

Here electricity consumption (ElC) is measured in TWh, GDP is in trillions of USD (2014), electricity prices (ElP) are in USD (2014) per MWh, gas prices (GasP) are in USD (2014) per million Btu, and oil prices (OilP) are USD (2014) per barrel. All variables are significant at a 5 percent confidence level and the \(R^{2}\) values are 0.986 and 0.879 for Eqs. (13a) and (13b), respectively (remember that time series data with strong time trends can produce very high \(R^{2}\)). Note the negative sign on electricity consumption (ElC) in (13b). Alternative estimations featuring the real interest rate, wage index and US coal prices give very similar results. One lag Dickey-Fuller unit root tests suggest that US energy consumption and GDP are non-stationary (MacKinnon approximate p values are 0.37 and 0.73, respectively—the null hypothesis is unit root). However, the one lag Dickey fuller test statistic on the regression residuals is \(-\)2.850, implying that we can reject the hypothesis of unit root residuals at a 10 percent confidence level (p value is 0.0515). This suggests that US GDP and electricity consumption are cointegrated. I derive \(u_{1}\) and \(u_{2}\) from Eq. (13a) and the quadratic utility functional form. The values on \(u_{1}\) and \(u_{2}\) are given in Table 1. Their positive values imply that the estimated utility from electricity consumption is increasing and concave.

The numerical model employs a cost function that captures the trade-off between fixed and variable operating costs directly (emissions in the model are given by the emission intensity multiplied with production):

$$\begin{aligned} c(x_{t}^{i},Y_{t}^{i},X_{t}^{i})=k(x_{t}^{i})+f(x_{t}^{i},Y_{t}^{i})+h(X_{t}^{i})x_{t}^{i}+c_{8}^{i}x_{t}^{i}. \end{aligned}$$

Here, the ’standard’ part of \(c\left( \cdot \right) \) is

$$\begin{aligned} k^{i}(x_{t}^{i})=c_{1}^{i}x_{t}^{i}+\frac{c_{2}^{i}}{2}\left( x_{t}^{i}\right) ^{2}, \end{aligned}$$

where \(c_{1}^{i}\) and \(c_{2}^{i}\) are technology specific parameters calibrated such that, for each source, \(k_{x_{t}^{i}}^{i}\left( \cdot \right) \) equals the average of 1990–2014 real US electricity prices at generation equal to 2015, and doubles at supply equal to total 2015 electricity consumption. \(f(\cdot )\) captures the cost of producing at a level that differs from the minimum efficient scale of invested capacity (measured by \(Y_{t}^{i}\)):

$$\begin{aligned} f^{i}\left( x_{t}^{i},Y_{t}^{i}\right) =g^{i}(\cdot )\left( c_{4}^{i}+\left( 1-c_{4}^{i}\right) C^{i}(\cdot )\right) , \end{aligned}$$


$$\begin{aligned} g^{i}\left( \cdot \right)&=\frac{c_{3}^{i}}{2}\left( \frac{x_{t}^{i}-Y_{t}^{i}}{0.05Y_{0}^{i}+0.95Y_{t}^{i}}\right) ^{2},\\ C^{i}\left( \cdot \right)&=\frac{1}{\pi }\sum _{i\in I}\left[ arctan\left( \frac{x_{t}^{i}-\left( 0.05Y_{0}^{i}+0.95Y_{t}^{i}\right) }{c_{5}}\right) +\frac{1}{2}\right] . \end{aligned}$$

Here \(c_{3}^{i}\) determines the magnitude of the convex investment costs, \(c_{4}^{i}\) is the share of investment costs that is incurred when production is declining, and \(c_{5}\) determines the shape of \(C^{i}\left( \cdot \right) \). The function \(C^{i}\left( \cdot \right) \) is derived using the cumulative Cauchy distribution function. Note that \(C^{i}(\cdot )\in \left( 0,1\right) \) and increases steeply from near zero to near 1 around \(x_{t}^{i}=Y_{t}^{i}\), given a low value on \(c_{5}\). The use of \(0.05Y_{0}^{i}+0.95Y_{t}^{i}\) (in place of only \(Y_{t}^{i}\)) is needed for the numerical model to solve. The parameters \(c_{3}^{i}\) are calibrated based on historic production and generation capacity figures from EIA, cost estimates of fossil fueled power plant ramp-up costs (including increased capital depreciation from power plant cycling ) (Kumar et al. 2012), and technology specific power plant characteristics. Besides costs related to investment in non-fossil electricity production capacity and shut-down of fossil fueled power plants, relevant investment costs may be power grid investments and energy security issues related to renewable energy intermittency. Fossil fuel purchasing costs, \(c_{8}^{i}\), are assumed constant and equal to figures from EIA for 2015 (\(c_{8}^{i}\) is zero for renewables and nuclear). Figure 13 graphs the investment costs used in the numerical simulations.

Fig. 13
figure 13

Calibrated investment costs \(\chi \left( y_{t}^{i}\right) \) on the left and ’adjustment costs’ \(f^{i}\left( x_{t}^{i},Y_{t}^{i}\right) \) on the right

The numerical model let extraction costs increase in cumulative production. Whereas this is mathematically equivalent to the constraint used previously (for non-binding resource stock constraints), it switches the sign on the shadow rent (\(\mu _{t}^{j}\) is now the present value following a marginal decrease in the remaining resource stock, not a marginal increase as in the theory model). I replace \(h^{i}\left( S_{t}^{i}\right) \) with \(\tilde{h}^{i}\left( X_{t}^{i}\right) \), where \(X_{t+1}^{i}=X_{t}^{i}+x_{t}^{i}\), \(\tilde{h}_{X_{t}^{i}}^{i}\left( X_{t}^{i}\right) >0\) and \(X_{0}^{i}=\bar{X}^{i}\) is given by history. I use figures for proved US coal and natural gas reserves from BP Statistics 2016, along with conversion factors and energy content from the Canadian National Energy Board, to derive the resource scarcity unit cost function \(\tilde{h}^{i}\left( X_{t}^{i}\right) =c_{6}^{i}X_{t}^{i}\).Footnote 38 I assume zero US net imports of coal and gas, and that all US coal and gas resources are available for US electricity production. \(c_{7}\) is calibrated such that supply costs of coal and gas doubles when cumulative production \(X_{t}^{i}\) equals proven reserves (\(c_{7}^{i}\) is zero for renewables and nuclear). Coal and coal-fired CCS draws from the same fossil fuel resource base.

Technology specific investment costs are given by:

$$\begin{aligned} \chi \left( y_{t}^{i}\right) =\left( k_{1}^{i}+k_{2}^{i}y_{t}^{i}\right) y_{t}^{i}\left( k_{3}^{i}+\left( 1-k_{3}^{i}\right) K^{i}(\cdot )\right) , \end{aligned}$$


$$\begin{aligned} K^{i}\left( \cdot \right) =\frac{1}{\pi }\sum _{i\in I}\left[ arctan\left( \frac{y_{t}^{i}}{k_{4}}\right) +\frac{1}{2}\right] , \end{aligned}$$

where \(K\left( \cdot \right) \) is derived using the cumulative Cauchy distribution (see comments on \(C^{i}(\cdot )\) above). The technology specific investment costs parameters \(k_{1}^{i}\) and \(k_{2}^{i}\) are calibrated using figures from IEA (2016); i.e., overnight capital costs are 3636, 6084, 978, 5945 and 2557 $/kW for coal, CCS, gas, nuclear and renewables, respectively. Power plants cannot operate all hours during the year, e.g. because of maintenance requirements. Renewables are also dependent on weather conditions. I assume that all power plants can operate 90% of the time, except renewables which has a utilization rate equal to 0.5. l calibrate \(k_{1}^{i}\) and \(k_{2}^{i}\) such that unit investment costs equals those given by EIA when the investment level equals average yearly investment during the period 2005–2014. Furthermore, unit investment costs are quadrupled if the investment level is doubled from the average 2005–2014 level. Note that investment tends to occur simultaneously for different technologies in the numerical simulation, implying that economy wide capacity constraints may occur. Last, the cost of reducing capacity is lower than that of increasing capacity for all fuels except nuclear. The calibrated investment costs are graphed in Fig. 13.

Table 1 Parameter values in the numerical illustration. \(c_{3}^{i}\) is measured in thousand USD

Fuel specific emission intensities are based on EIA figures for electricity generation and emissions. I calculate the CCS emissions intensity under the assumption that CCS plants reduce emissions with 90%.Footnote 39 Environmental damage is given by \(d\left( \varvec{e}_{t}\varvec{x}_{t}^{\prime },E_{t}\right) =d_{1}\varvec{e}_{t}\varvec{x}_{t}^{\prime }+d_{2}E_{t}+\frac{d_{3}}{2}E_{t}^{2}\), where \(d_{1}\), \(d_{2}\) and \(d_{3}\) are calibrated parameters. I set \(d_{1}=1\), \(d_{3}=0\) and calibrate \(d_{2}\) such that the Obama Administration’s 80% emission reduction target in 2050 (as compared with 2015 emissions) is socially optimal (given the other parameters in the model). See Table 1 for exact parameter values. All prices and costs are measured in 2016 USD in this paper (except for the estimation above). The numerical model solves the systems of equations in Appendix B (social planner and competitive equilibrium) as mixed complementarity problems, given these functional forms and parameter values.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Storrøsten, H.B. Emission Regulation of Markets with Sluggish Supply Structures. Environ Resource Econ 77, 1–33 (2020).

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


  • Adjustment costs
  • Investment
  • Pollution
  • Regulation

JEL Classification

  • H21
  • H23
  • Q35
  • Q41
  • Q54