Let the vector \(\varvec{x}_{t}=\left( x_{t}^{1},x_{t}^{2},\ldots ,x_{t}^{\bar{i}}\right) \) denote a representative consumer’s consumption bundle of goods \(i\in I=\left\{ 1,2,\ldots ,\bar{i}\right\} \) in period \(t\in T=\left\{ 1,2,\ldots ,\overline{t}\right\} \). The associated benefit is given by the increasing and strictly concave utility function \(u\left( \varvec{x}_{t}\right) \). Each good \(x_{t}^{i}\) is produced by a representative firm (or sector) i. I assume market clearing such that production of \(x_{t}^{i}\) equals consumption of \(x_{t}^{i}\) for all \(i\in I\) and \(t\in T\). The firms’ discount factor is given by \(\delta \in \left( 0,1\right] \) and all derivatives are assumed to be finite.
One interpretation of this model setup is an economy with concave utility from electricity consumption, where electricity may be derived from \(\bar{i}\) energy sources: coal, gas, hydro power, and so forth. I will use this as an example throughout the paper, i.e., we have one representative firm for each type of electricity generation technology.
The investment costs of power generation are essentially capital construction costs and land, including “regulatory costs” for obtaining siting permits, environmental approvals, and so on. These costs may increase substantially in the presence of economy wide capacity constraints, like limited availability of skilled labor or raw materials. I assume that the investment cost function, \(\chi ^{i}\left( y_{t}^{i}\right) \), is strictly convex and increasing in investment \(y_{t}^{i}\), with minimum at \(\chi ^{i}(0)=0\).Footnote 8 The model framework allows the representative firm to actively reduce capacity faster than capital depreciation (\(y_{t}^{i}<0\)).Footnote 9
Operating costs for power plants include fuel, labor and maintenance costs. I divide these costs into fixed and variable operating costs. Fixed operating and maintenance costs, denoted \(f^{i}(Y_{t}^{i})\), include, e.g., salaries for facility staff and maintenance that is scheduled on a calendar basis. They do not vary significantly with a plant’s electricity generation, but increase in capacity; i.e., we have \(\partial f^{i}(Y_{t}^{i})/\partial Y_{t}^{i}\equiv f_{Y_{t}^{i}}^{i}\left( \cdot \right) >0\). The variable operating costs include the cost of consumable materials and maintenance that may be scheduled based on the number of operating hours or start-stop cycles of the plant. These costs are captured by the variable cost function \(k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\). Variable operating costs increase in production \(x_{t}^{i}\) and decrease in the capacity measure \(Y_{t}^{i}\). This is captured by the first order derivatives \(k_{x_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), second order derivatives \(k_{x_{t}^{i}x_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\), \(k_{Y_{t}^{i}Y_{t}^{i}}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})\le 0\) and cross derivative \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\). The firms may reduce their current emissions, \(e_{t}^{i}\ge 0\), by (flow) abatement activities. Abatement, measured as \(e_{t}^{i,BaU}-e_{t}^{i}\), is not free and production cost decreases in emissions if emissions fall below the business as usual emissions, \(e_{t}^{i,BaU}\) (associated with no emission regulation); i.e., we have \(k_{e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\), \(k_{e_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})>0\) and \(k_{x_{t}^{i}e_{t}^{i}}^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})<0\) for \(e_{t}^{i}<e_{t}^{i,BaU}\).Footnote 10 Note that abatement within a given technology is modeled as a flow variable; i.e. the current emission intensity does not matter for the future emission intensity. This differs from the emission reductions that may be achieved with a larger share of low-emission technology capacity (e.g., replacing coal capacity with renewables). Electricity generation (flow) abatement is most relevant for fossil fuels, where it may involve, e.g., switching to cleaner types of coal or use of combined cycle power plants or scrubbers (sulfur dioxide).Footnote 11
Production capacity evolves following the state equation:
$$\begin{aligned} Y_{t+1}^{i}=\beta Y_{t}^{i}+y_{t}^{i}, Y_{0}^{i}=\overline{Y}^{i},\ \forall i,\forall t, \end{aligned}$$
(1)
where \(\beta \in \left( 0,1\right] \) is a capital depreciation factor and \(\bar{Y}^{i}\) is initial capacity (a constant determined by history).
Assume that a subset of the representative firms \(j\in J=\left\{ \tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\right\} \) use a scarce resource as an input factor in production (\(J\subseteq I=\left\{ 1,2,\ldots ,\tilde{i},\tilde{i}+1,\ldots ,\bar{i}\right\} \)). These firms have an additional term \(h^{j}\left( S_{t}^{j}\right) x_{t}^{j}\) added to their variable operating cost function, where the remaining resource stock, \(S_{t}^{j}\), evolves following the state equation:
$$\begin{aligned} S_{t+1}^{j}=S_{t}^{j}-x_{t}^{j}, S_{0}^{j}=\bar{S}^{j},\ \forall j,\forall t. \end{aligned}$$
(2)
Here \(\bar{S}^{j}\) is an exogenous constant and I have normalized units in (2) such that one unit of production requires one unit of resource. We have the resource stock constraint \(S_{t}^{j}\ge 0\). Further, resource scarcity implies that unit operating costs decrease in the remaining resource stock; i.e., we have \(h_{S_{t}^{j}}^{j}\left( S^{j}\right) <0\); e.g., because the cheapest resource deposits are extracted first. Note that firm \(j\in J\) is an integrated firm that extracts the fossil fuels needed for electricity generation by itself. I assume that \(lim_{S_{t}^{j}\rightarrow 0}h\left( S_{t}^{j}\right) =\infty \); i.e., the cost of resource extraction approaches infinity as the resource stock is completely exhausted.Footnote 12
Total operating costs are given by:
$$\begin{aligned} c^{i}\left( \varvec{z}_{t}^{i}\right) ={\left\{ \begin{array}{ll} \begin{array}{l} f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i}),\\ f^{i}(Y_{t}^{i})+k^{i}(x_{t}^{i},Y_{t}^{i},e_{t}^{i})+h^{i}\left( S_{t}^{i}\right) x_{t}^{i}, \end{array} & \begin{array}{l} \forall i\le \tilde{i},\forall t,\\ \forall i>\tilde{i},\forall t, \end{array}\end{array}\right. } \end{aligned}$$
(3)
with \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i}\right) \) for \(\forall i\le \tilde{i}\) and \(\varvec{z}_{t}^{i}=\left( x_{t}^{i},Y_{t}^{i},e_{t}^{i},S_{t}^{i}\right) \) for \(\forall i>\tilde{i}\). For example, the first \(i=1,2,\ldots ,\tilde{i}\) may denote firms using non-exhaustible energy sources like renewables and nuclear, whereas the remaining \(j=\tilde{i}+1,\tilde{i}+2,\ldots ,\bar{i}\) are firms combusting fossil energy sources like petroleum and gas. This cost structure implies that unit operating costs, \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}\), have the familiar skewed U-shape with minimum at \(c^{i}\left( \varvec{z}^{i}\right) /x_{t}^{i}=c_{x_{t}^{i}}^{i}\left( \varvec{z}^{i}\right) \) (for any given \(e_{t}^{i}\)).Footnote 13
Clearly, the relationship between \(f^{i}\left( \cdot \right) \) and \(k^{i}\left( \cdot \right) \) may differ markedly across technologies. For example, nuclear power plants feature high fixed costs relative to the variable operating costs, as compared with gas-fueled power plants. The strict convexity of \(\chi ^{i}\left( \cdot \right) \) and \(k_{x_{t}^{i}Y_{t}^{i}}^{i}(\cdot )<0\) implies that the cost associated with any given change in the energy mix \(\varvec{x}\) may be reduced by increasing the number of time periods during which the change occurs. Specifically, the cost of reducing GHG emissions increases with the speed of emission reductions. I will discuss the results in a setting where the initial capacity mix \(\left( \overline{Y}^{1},\overline{Y}^{2},\ldots ,\overline{Y}^{\bar{i}}\right) \) is characterized by too much dirty capacity, such that the socially optimal capacity declines over time for relatively emission intensive production technologies, and increases for relatively clean energies. Whereas this eases the discussion of the results, and arguably is the case for the electricity industry in many countries today, it is not necessary for the validity of the analytical results.
The emissions stock evolves following the state equation:
$$\begin{aligned} E_{t+1}=\alpha E_{t}+\sum _{i\in I}e_{t}^{i}, E_{0}=\bar{E},\ \forall t, \end{aligned}$$
(4)
where \(\bar{E}\) is a constant determined by history and \(\alpha \in \left[ 0,1\right) \) denotes the stock depreciation factor from one period to the next. Environmental damage from emissions is given by \(d\left( E_{t}\right) \), where \(d(\cdot )\) is weakly convex and increasing.Footnote 14
Let \(p_{t}^{i}\) denote the endogenous consumer price on \(x_{t}^{i}\) (net of taxes). I assume the regulator has access to three regulatory instruments: an emission tax, \(\tau _{t}\), a tax on investment, \(\theta _{t}^{i}\), and a tax on extraction of exhaustible resources, \(\phi _{t}^{i}\). The representative firms \(i\in I\) may be interpreted as representing different sectors of the electric power industry; i.e., the wind power sector, the nuclear power sector, and so forth. I will therefore use the term sector specific taxes when referring to \(\theta _{t}^{i}\) and \(\phi _{t}^{i}\).Footnote 15 The investment tax \(\theta _{t}^{i}\) may take three forms: (i) a standard unit tax on investment if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}>0\), (ii) a subsidy to decommissioning if \(\theta _{t}^{i}>0\) and \(y_{t}^{i}<0\), and (iii) a subsidy on investment if \(\theta _{t}^{i}<0\). The extraction tax \(\phi _{t}^{i}\) is needed to slow down extraction of scarce resources if the private discount rate is higher than the social discount rate (see Sect. 2.2); i.e., \(\phi _{t}^{i}>0\) is enacted to preserve scarce resources for use in the future.
Market Equilibrium
The competitive representative firm \(i\in I\) maximizes the present value of profits over the remaining time horizon solving:
$$\begin{aligned} \max _{x_{t}^{i},y_{t}^{i},e_{t}^{i}}\sum _{t\in T}\delta ^{t-1}\left[ p_{t}^{i}x_{t}^{i}-\left( c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\right) -\left( \chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\right) -e_{t}^{i}\tau _{t}\right] ,\ \forall t, \end{aligned}$$
(5)
where \(c^{i}\left( \varvec{z}_{t}^{i}\right) +\phi _{t}^{i}x_{t}^{i}\) is operating costs including the extraction tax, \(\chi ^{i}\left( y_{t}^{i}\right) +\theta _{t}^{i}y_{t}^{i}\) is the cost of investment, including investment taxes, and \(e_{t}^{i}\tau _{t}\) is the emission tax payment. The maximization is subject to equations (1), (2), (3) and the resource constraint \(S_{\overline{t}}^{j}\ge 0\).Footnote 16
A price-taking representative consumer maximizes net utility solving:
$$\begin{aligned} \varvec{x}_{t}=\arg \max _{\varvec{x}_{t}}\left[ u\left( \varvec{x}_{t}\right) -\varvec{p}_{t}\varvec{x}_{t}^{\prime }\right] ,\ \forall t, \end{aligned}$$
(6)
where \(\varvec{p}_{t}=\left( p_{t}^{1},p_{t}^{2},\ldots,p_{t}^{\bar{i}}\right) \) and \(\varvec{x}_{t}^{\prime }\) is the transpose of \(\varvec{x}_{t}\). The associated first order condition is \(u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right) \le p_{t}^{i}\) for all \(i\in I\) and \(t\in T\).
We have the following result:
Lemma 1
The competitive partial equilibrium sequence triple \(\left\{ x_{t}^{i,*},y_{t}^{i,*},e_{t}^{i,*}\right\} \), solving (5) and (6) subject to equations (1), (2) and (3), satisfies:
$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\phi _{t}^{i},\ \forall i\le \tilde{i},\forall t,\end{aligned}$$
(7a)
$$\begin{aligned} u_{x_{t}^{i}}\left( \varvec{x}_{t}^{*}\right)&\le p_{i}^{i}\le c_{x_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) +\mu _{t}^{i}+\phi _{t}^{i},\ \forall i>\tilde{i},\forall t,\end{aligned}$$
(7b)
$$\begin{aligned} \lambda _{t}^{i,*}&\le \chi _{y_{t}^{i}}^{i}\left( y_{t}^{i,*}\right) +\theta _{t}^{i},\ \forall i,\forall t,\end{aligned}$$
(7c)
$$\begin{aligned} \lambda _{t}^{i,*}&=-\delta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \delta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,*}\right) ,\ \forall i,\forall t<\bar{t,}\end{aligned}$$
(7d)
$$\begin{aligned} \mu _{t}^{i,*}&=-\sum _{r=t+1}^{\overline{t}}\delta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,*}\right) x_{r}^{i,*},\ \forall i>\tilde{i,}\forall t<\bar{t,}\end{aligned}$$
(7e)
$$\begin{aligned} \tau _{t}&\le -c_{e_{t}^{i}}^{i}\left( \varvec{z}_{t}^{i,*}\right) ,\ \forall i,\forall t, \end{aligned}$$
(7f)
with \(Y_{t}^{i,*}\) and \(S_{t}^{i,*}\) as given by equations (1) and (2), respectively. We have \(\lambda _{\bar{t}}^{i,*}=\mu _{\bar{t}}^{i,*}=0\). The weak inequalities are strict if and only if we have a corner solution for the relevant decision variable.Footnote 17
Proof
See "Appendix B". \(\square \)
We see from Lemma 1 that production of \(x_{t}^{i}\) increases in capacity \(Y_{t}^{i}\), marginal utility from consumption and the remaining resource stock (\(\forall i>\tilde{i}\)), whereas it decreases in production cost and extraction taxes. Note that Lemma 1 implies \(p_{t}^{1}=p_{t}^{2}=\ldots =p_{t}^{\bar{i}}\) if the goods are perfect substitutes. This is relevant if I is a set of electricity producers.
The variable \(\lambda _{t}^{i}\) is a (endogenous) shadow price representing the present value of the change in future profits caused by a marginal increase in current capacity. In the case where optimal production capacity declines towards a new and lower level (faster than capacity depreciation), higher capacity today induces too high fixed operating costs in the future. Hence, the shadow price \(\lambda _{t}^{i}\) is negative. Conversely, \(\lambda _{t}^{i}\) is positive if optimal capacity shifts upwards.Footnote 18
For sectors with scarce resources (\(\forall i>\tilde{i}\)), more extraction today implies less extraction in the future. Hence, the resource owners must not only decide whether to extract the resource, but also when to extract. This consideration is captured by the non-negative shadow price on the remaining resource stock, \(\mu _{t}^{j}\) , in Lemma 1 (the endogenous shadow price \(\mu _{t}^{j}\) is often referred to as the ’scarcity rent’ or ’Hotelling rent’). It is the present value change in future profits caused by a marginal increase in the remaining resource stock \(S_{t}^{j}\). Lemma 1 implies that each resource owner equalizes marginal discounted profits from extraction over time. Otherwise, the resource owners could increase the present value of their resource by moving production between periods. The resource rents typically vary between the different resources.
Whereas the isolated effect of increased emission taxes is to decrease production (given \(e_{t}^{i}>0\)), production of \(x_{t}^{i}\) in competitive equilibrium may increase in the emission tax \(\tau _{t}\) if \(x_{t}^{i}\) is a low-emission good. The reason is that the upward shift in each firm’s supply cost functions, caused by the emission tax, increases in the emission intensity of the firm. Therefore, residual demand and equilibrium production of relatively low-emission goods increases. Equation (7f) in Lemma 1 states the familiar result that marginal costs of emission reductions (i.e., flow abatement for each type of technology) equal the emission tax in the interior solution.
The Socially Optimal Tax Scheme
Let welfare W be measured as the present value of utility from consumption net of environmental damages, production costs and investment costs:
$$\begin{aligned} W=\sum _{t\in T}\zeta ^{t-1}\left[ u\left( \varvec{x}_{t}\right) -d\left( E_{t}\right) -\sum _{i\in I}\left[ c^{i}\left( \varvec{z}_{t}^{i}\right) +\chi ^{i}\left( y_{t}^{i}\right) \right] \right] , \end{aligned}$$
(8)
where \(1\ge \zeta \ge \delta \) is the social discount factor. The regulator faces a trade-off in the presence of convex investment costs. On the one hand, fast emission reductions reduce environmental damage. On the other hand, the convexity of investment costs imply that the cost of emission reductions can always be reduced by extending the time horizon over which emission reductions take place. We have the following result:
Proposition 1
Let the socially optimal sequence triple \(\left\{ x_{t}^{i,sp},y_{t}^{i,sp},e_{t}^{i,sp}\right\} \) maximize welfare (8) subject to equations (1) to (4). Then, the socially optimal time trajectory can be implemented in partial competitive equilibrium with the following taxes:
$$\begin{aligned} \tau _{t}^{sp}&=\zeta \sum _{r=t+1}^{r=\overline{t}}\left( \alpha \zeta \right) ^{r-t-1}d_{E_{r}}(E_{r}^{sp}),\ \forall t<\overline{t},\\ \theta _{t}^{i}&=-\lambda _{t}^{i,sp}+\lambda _{t}^{*,i},\ \forall i,\forall t,\\ \phi _{t}^{i}&=\mu _{t}^{i,sp}-\mu _{t}^{*,i},\ \forall i>\tilde{i},\forall t, \end{aligned}$$
where:
$$\begin{aligned} \lambda _{t}^{i,sp}&=-\zeta \sum _{r=t+1}^{r=\bar{t}}\left( \beta \zeta \right) ^{r-t-1}c_{Y_{r}^{i}}^{i}\left( \varvec{z}_{r}^{i,sp}\right) ,\ \forall i,\forall t<\bar{t,}\\ \mu _{t}^{i,sp}&=-\sum _{r=t+1}^{\overline{t}}\zeta ^{r-t}h_{S_{r}^{i}}^{i}\left( S_{r}^{i,sp}\right) x_{r}^{i,sp},\ \forall i>\tilde{i},\forall t<\bar{t,} \end{aligned}$$
with \(\lambda _{t}^{*,i}\) and \(\mu _{t}^{*,i}\)as given in Lemma 1, \(\tau _{\bar{t}}^{sp}=\theta _{\bar{t}}^{i}=\phi _{\bar{t}}^{i}=0\) and \(\phi _{t}^{i,sp}=0\text { for } \forall i\le \tilde{i}\).
Proof
See "Appendix B".
We first examine the case where the social discount factor equals the private discount factor, such that \(\delta =\zeta \). In this case \(\theta _{t}^{i,sp}=\phi _{t}^{i,sp}=0\), given the optimal emission tax \(\tau _{t}^{sp}\). We observe that the expression for \(\tau _{t}^{sp}\) is the present value of the stream of future marginal stock damages following one additional unit of emissions along the socially optimal time trajectory. This is sometimes referred to as the social cost of carbon in the case of greenhouse gases.
The Pigou tax \(\tau _{t}^{sp}\) is only indirectly affected by the explicit modeling of production capacity and convex investment costs (the production capacity mix influence emissions and, hence, the optimal tax). Specifically, \(\tau _{t}^{sp}\) is not reduced during the first years to give firms time to adjust. On the contrary, higher investment costs cause slower development of relatively clean production capacity, which again entails higher emissions, a higher absolute value shadow price on the emissions stock, and higher optimal emissions taxes (see also Fig. 2 in Sect. 3.2). More expensive decommissioning of dirty production capacity (e.g., coal) has the same effect (see Figure 10 in “Appendix A”).
Assuming an interior solution for (flow) abatement, \(e_{t}^{i,sp}<e_{t}^{i,BaU}\), we have \(\tau _{t}=k_{e_{t}^{i}}^{i}\left( \cdot \right) \) in each time period. Marginal abatement cost then increases over time if the optimal stock of carbon in the atmosphere increases over time (because marginal environmental damage and, thereby, the optimal tax \(\tau _{t}^{sp}\), increases over time). Whereas this abatement profile agrees with Nordhaus (1991; 1992), the present analysis highlights that substantial investments may be necessary early on (see Fig. 1 in Sect. 3.2). The reason is that it takes time to implement the emission reductions necessary to curb global warming. The well-known result that lower (flow) abatement costs reduces the optimal emission tax remains valid in the present model setup (Weitzman 1974). Cheaper abatement also reduces the importance of redirecting investment towards less-emission intensive production capacity.
A common assumption when deriving the socially optimal time trajectory is that the firms’ private discount rate equals the discount rate of the social planner (see, e.g., Nordhaus 1991, 1992; Golosov et al. 2014). This assumption may be questionable, at least when applied to major environmental challenges like greenhouse gas emissions from power plants and climate change. The Stern Review (Stern 2007), and the following discussion about appropriate social discount rates in cost-benefit analysis, indicates that the social discount rate may be below capital market interest rates, at least in the case of climate change (Weitzman 2007; Tol and Yohe 2006). Indeed, the Stern Reviews’s conclusions about the need for decisive immediate action hinges on the assumption of a near-zero pure time preference discount rate, which are inconsistent with today’s marketplace real interest rates and savings rates (Nordhaus 2007). Goulder and Williams (2012) argue that we should distinguish between a social-welfare-equivalent discount rate appropriate for determining whether a given policy would augment social welfare and a finance-equivalent discount rate suitable for determining whether the policy would offer a potential Pareto improvement.Footnote 19
The case where the social discount factor is larger than the private discount factor (\(\delta <\zeta \)) involves \(\theta _{t}^{i,sp}\ne 0\) (\(\forall i\)) and \(\phi _{t}^{i,sp}>0\) for sectors with scarce resources (\(\forall i>\tilde{i}\)). The optimal investment tax, \(\theta _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on capacity \(Y_{t}^{i}\) along the socially optimal time trajectory. Note that the socially optimal investment tax is positive (negative) if production decrease (increase) over time. For example, Proposition 1 may imply decommissioning subsidies to coal plants, whereas investment in renewable energy is subsidized; see Fig. 3 in the numerical Sect. 3.2.
The optimal extraction tax, \(\phi _{t}^{i,sp}\), is the difference between the social planner’s and the representative firm’s shadow price on scarce resources along the socially optimal time trajectory. This tax is needed because owners of scarce resources put too low value on the future resource bases in their current extraction decisions when \(\delta <\zeta \). Hence, the social planner taxes current extraction to conserve more of the resource stocks for future use.
The following corollary summarizes the above discussion:
Corollary 1
The socially optimal time trajectory can be implemented in competitive partial equilibrium with a standard Pigou tax, \(\tau _{t}^{sp}\), if and only if \(\delta =\zeta \). Otherwise, the Pigou tax must be combined with sector specific investment taxes and subsidies, \(\theta _{t}^{i,sp}\ne 0\), to implement the social optimum (\(\forall i\)). Further, an extraction tax, \(\phi _{t}^{i,sp}>0\), is needed on extraction of scarce resources (\(\forall i>\tilde{i}\)).
Proof
The corollary follows directly from Proposition 1.
Note that the sector specific taxes and subsidies, \(\theta _{t}^{i,sp}\) and \(\phi _{t}^{i,sp}\), in general differs between sectors (or technologies), even though all sectors have the same private discount factor \(\delta \).
Dynamic Effects of Future Emission Taxes
In practice, it may be hard for lawmakers to enact an emission tax immediately (Di Maria et al. 2017). In this Sect. 1 analyze dynamic effects of future emission taxes. The other taxes are set to zero (\(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\)). Lemma 1 implies that a credible announcement of increased future emission taxes has three key effects in the electricity market:
(a) Reduced fossil fuel demand from power plants Future emission taxes increase the future cost of burning fossil fuels. The decline in future emission intensive fossil-fueled electricity generation implies that optimal fossil-fueled power plant capacity will be lower in the future. This reduces the profitability of investment in, e.g., coal-fired power plants, and thereby the demand for coal (cf., a lower \(\lambda _{t}^{i}\) for emission intensive energy in Lemma 1).
(b) Increased supply of electricity generated from (sufficiently) low-emission energy sources Future emission taxes imply higher supply costs for emission intensive fossil-fueled power plants. Hence, low-emission electricity generation sources, like renewables or nuclear power, gain a competitive advantage when the tax is implemented. This increases the profitability of investing in low-emission electricity generation capacity (cf., a higher \(\lambda _{t}^{i}\) for low-emission energy in Lemma 1). The associated increase in non-fossil electricity generation capacity reduces the electricity market equilibrium consumption of fossil fuels.Footnote 20
(c) Increased current supply of fossil fuels Future taxes decrease the future value of the fossil fuel resource (cf., a lower value on the Hotelling rent \(\mu _{t}^{i}\) for \(\forall i>\tilde{i}\) in Lemma 1). Hence, it is profitable with faster extraction. This is the well-known (weak) green paradox (see, e.g., Sinclair 1992; Sinn 2008; Gerlagh 2011). In particular, Sinclair (1992) and Sinn (2008) caution against environmental policies that become more stringent with the passage of time, because such policies will accelerate resource extraction and, thereby, accelerate global warming.
Whereas the resource scarcity dynamic (c) suggest that exhaustible fossil fuel extraction accelerates following signaling of future environmental policies, the capacity stock dynamics (a) and (b) have the opposite effect. From a theoretical point of view, it is therefore ambiguous whether current emissions increase or decrease following signaling of stringent future climate policy, given that resource exhaustibility, capacity constraints and convex investment costs are present. The capacity stock mechanisms (a) and (b) strongly dominate the supply side mechanism (c) put forth by the green paradox literature in the numerical Sect. 3.3 below. One reason is that mechanism (c) only really matters for oil and gas fueled power plants (the resource rent is small for coal). The above discussion suggests that emissions will unambiguously decline following the tax announcement if scarcity (mechanism c) is negligible or non-existent.Footnote 21 The mechanics discussed above act on current production via the shadow prices on capacity (\(\lambda _{t}^{i}\), mechanisms a and b) and the resource stock (\(\mu _{t}^{i}\), mechanism c). Figure 11 in “Appendix A” graphs how these shadow prices are affected by the tax announcement in the numerical simulations.
Second-Best Emission Taxes
The optimal tax scheme given in Proposition 1 involves taxes or subsidies that differs across sectors (\(\theta _{t}^{i}\) and \(\phi _{t}^{i}\)). In this section, I consider the case where the regulator is constrained to \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and the representative firms’ discount rate exceeds the social discount rate (\(\delta <\zeta \)).Footnote 22 Then, whereas the Pigou tax \(\tau _{t}^{sp}\) still perfectly balances environmental damages with (flow) abatement cost, the transition towards a less emission intensive production capacity mix is too slow as compared with the socially optimal trajectory (cf., the need for investment taxes and subsidies \(\theta _{t}^{i,sp}\) when \(\delta <\zeta \) in Proposition 1). The transition towards a cleaner capacity mix can then be accelerated by announcing a future emission tax that is above the Pigouvian tax.
We observe that a policy involving announcement of a future tax above the Pigouvian tax will be subject to the mechanisms discussed in Sect. 2.3 (i.e., we have an increase from the Pigouvian tax to an emission tax that is higher during the transition). Of particular interest, higher future emission cost decreases future production from relatively emission intensive power plants, which again decreases their shadow price on capacity \(\lambda _{t}^{i}\) and, hence, investment in emission intensive production capacity. By the same reasoning, investment in (sufficiently) low-emission investment capacity increases. Hence, a future emission tax above the Pigouvian level has similar effects as the optimal investment tax in Proposition 1. This suggests that welfare may be increased by implementing a tax above the Pigouvian level during the transition period, given that \(\delta <\zeta \) and that mechanisms (a) and (b) dominate mechanism (c). Note that this argument only applies to future emission taxes; i.e., the regulator has no incentive to tax emissions above marginal environmental damages in the current time period. On the contrary, taxing current emissions above \(\tau _{t}^{sp}\) unambiguously reduces welfare, because marginal abatement cost is then larger than marginal environmental damages.
Assume mechanisms (a) and (b) dominate mechanism (c). Then, the regulator faces the following trade-off: On the one hand side, a tax above the Pigouvian level increases welfare by accelerating the change in production capacity necessary for the transition towards less emission intensive electricity generation. On the other hand side, there is a loss by taxing current emissions above the Pigouvian level, because marginal abatement cost is then higher than marginal environmental damage. It will increase welfare to tax emissions above the Pigouvian level if and only if the former effect dominates the latter. These dynamics are examined numerically in Sect. 3.2, where the second-best emissions tax trajectory turns out to be markedly above the Pigouvian tax during the transition period; see Fig. 4.
A caveat is that this policy is likely to be time inconsistent. To see this, let \(t=\left\{ 1,2,3\right\} \), \(\delta <\zeta \), \(\theta _{t}^{i}\equiv \phi _{t}^{i}\equiv 0\) and suppose mechanisms (a) and (b) dominate mechanism (c). Consider a policy where the regulator in period \(t=1\) announces the following emission tax sequence \(\left\{ \tau _{1}^{sp},\tau _{2}^{sp}+\epsilon ,\tau _{3}^{sp}\right\} \), where \(\epsilon \) is a small positive constant and \(\tau _{t}^{sp}\) is given by Proposition 1. Then, the period \(t=1\) shadow price on capacity decreases in \(\epsilon \) for relatively emission intensive sectors, and increases in \(\epsilon \) for (sufficiently) low-emission sectors. Hence, the isolated effect of \(\epsilon >0\) is to increase welfare (unless \(\epsilon \) is too large). In period \(t=2\), the regulator will have an incentive to break the commitment to \(\epsilon >0\), however, because in period \(t=2\) welfare is unambiguously maximized by \(\epsilon =0\). Therefore, this policy requires that the regulator can credibly commit to policies that, even though they increase present value welfare (8), will be less than optimal in the future time period in which they are enacted.Footnote 23