Keywords

1 Practice of the EU ETS

A properly designed emission trading system should help reducing pollution reduction with low social burden.

In this paper we understand it as a burden to the society, caused by energy production. We assume that it can be measured in monetary units including both, the overall production costs and an appropriately quantified environmental impact of energy production.

Originated from this idea, and based on the theoretical work of environmental economists, cap-and-trade systems have been put into operations all over the world.

The problem of design optimization for emission trading schemes has been addressed in [4]. This work shows that, in general, a traditional architecture of environmental markets is far from being optimal, meaning that appropriate alterations may provide significant improvements in emission reduction performances at lower social burden. Such improvements can be achieved by extending a regulatory framework, which we address below as extended scheme.

Let us explain this.

In the traditional scheme, it is assumed that the administrator allocates a pre-determined allowance number to the market and sets a compliance date at which a penalty must be paid for each unit of pollutant not covered by allowances. Hence, the policy maker can exercise merely two controls, the so-called cap (total amount of allowances allocated to the market) and the penalty size. In theory, a desired pollution reduction can be reached at some costs for the society by an appropriate choice of these parameters. However, in practice, there is not much flexibility, since the cap is motivated politically and the penalty is determined to provide enough incentives for the required pollution reduction. As a result, the performance of the traditional scheme could be very poor in terms of social burden for the achieved reduction.

In an extended scheme, the policy maker has much more influence. The regulator can tax or subsidize the production in terms of monetary units or in terms of emission certificates. These additional controls can be implemented in a technology-sensitive way. Doing so, the merit order of technologies can be changed significantly. On this account, emission savings, triggered by certificate prices, also become controllable. The work [4] illustrates that, by an appropriate choice of additional controls, the market can reach a targeted pollution reduction at much lower social burden.

Although these theoretical findings are sound, intuitive, and practically important, the optimization of environmental market architectures could not be brought to a level suitable for practical implementation. There are two reasons for this.

  1. (1)

    The existing approach [4] is based on the unrealistic assumption that each of the market players is non-risk-averse in the sense that it realizes a linear utility function. This assumption is not conform with the modern view and creates a number of singularities in the model. A priori, it is not even clear which conclusions of this work do hold under risk-aversion.

  2. (2)

    Although the practical advantage of such market design optimization is obvious, policy makers hardly can use the theoretical findings of [4], because their quantitative assessment requires optimal control techniques whose numerics is difficult.

In this work, we address both issues, namely:  

(1\(^\prime \)):

We assume a non-linear utility function for market agents and show several properties of the market equilibrium which make market design optimization possible. With this, our model is brought in line with standard economic theory and is appropriate for further developments. We also emphasize that, to capture risk-aversion, a completely new argumentation has been developed.

(2\(^\prime \)):

We provide our study in a one-period setting. Being accessible without optimal control techniques, the results become evident and potentially usable for a broad audience, including practitioners and decision makers.

 

The paper is organized as follows. Section 2 discusses the literature developed concerning markets of emission certificates. In Sect. 3 we introduce our equilibrium model. Section 4 deepens the analysis of the equilibrium. Section 5 studies the social optimality of the equilibrium and proves that it corresponds to the overall minimum-cost policy under a risk-neutral probability distribution. Section 6 discusses some perspectives of optimal market design. The final Sect. 7 provides conclusions.

2 Theory of Marketable Pollution Rights

The efficiency properties of environmental markets have been first addressed in [6, 10], who first advocated the principle that the “environment” is a good that can not be “consumed” for free. In particular, Montgomery describes a system of tradable certificates issued by a public authority coupled with fixing a cap to the total emissions, and, doing so, to force polluting companies paying proportionally to the environmental damage generated by their production activity. An emission certificate is representative of the permission to emit a given quantity of pollutant without being penalized. Companies with low environmental impact can sell excess certificates and the resulting revenue represents a general incentive to reduce pollution. Montgomery shows that the equilibrium price for a certificate must be driven by the cost of the most virtuous company to abate its marginal unit of pollutant. The key result of his analysis is that such a system guarantees that the reduction of pollution is distributed among the companies efficiently, that is minimizing their total costs.

After the seminal analysis of Montgomery, which is based on a deterministic and static model, the following research has taken the direction to the stochastic and multi-period settings. A literature review on the research which has developed after Montgomery’s work can be found in [14]. A common result shared by all the analyses developed so far is that cap-and-trade systems indeed represent the most efficient way to reduce and control the environmental damage generated by the industrial activity.

Let us mention the contributions which are directly related to our analysis. A majority of relatively recent papers [15, 11, 13] are related to equilibrium models, where risk-neutral individuals optimize the expected value of their profit or cost function. The hypothesis of risk-neutrality of the agents is gracefully assumed in those contributions, since it significantly simplifies the proof that environmental markets are efficient. Some papers have considered explicitly risk-averse decision makers. One of them, [9], develops a pricing model for the spot and derivative pricing of environmental certificates in a single-period economy. In [7], the authors also develop a (multi-period) equilibrium pricing model for contingent claims depending on environmental certificates, where risk-averse agents maximize the expected utility of their profit function.

3 One-Period Equilibrium of Emission Market

To explain the emission price mechanism, we present a market model where a finite number of agents, indexed by the set I, is confronted with abatement of pollution. The key assumptions are:

  • We consider a trading scheme in isolation, within a time horizon [0, T], without credit transfer from and to other markets. That is, unused emission allowances expire worthless.

  • There is no production strategy adjustment within the compliance period [0, T]. This means that the agents schedule their production plans for the entire period [0, T] at the beginning. Allowances can be traded twice: at time \(t=0\) at the beginning and at time \(t=T\) immediately before emission reports are surrendered to the regulator.

  • For the sake of simplicity, we set the interest rate to zero.

  • Each agent decides how much energy to produce and how many allowances to trade.

Note that this one-period model is best suited for our needs to explain the core mechanism of market operation and to discuss its properties. A generalization to a multi-period framework is possible, but it gives no additional insights related to the goal of this work.

The ith agent is specified by the set \(\varXi ^{i}\) of feasible production plans for the generation of energy (electricity) within one time period from \(t=0\) to \(t=T\). Further, we consider the following mappings, defined on \(\varXi ^{i}\), for each agent \(i \in I\):

$$ \begin{array}{c} \xi ^{i}_{0} \mapsto V_{0}^{i}(\xi ^{i}_{0}), C_{0}^{i}(\xi ^{i}_{0}), E_{T}^{i}(\xi ^{i}_{0})\text {,} \end{array} $$

with the interpretation that for production plan \(\xi ^{i}_{0} \in \varXi ^{i}\), the values \(V_{0}^{i}(\xi ^{i}_{0})\), \(C_{0}^{i}(\xi ^{i}_{0})\), and \(E_{0}^{i}(\xi ^{i}_{0})\) stand for the total production volume, the total production costs, and the total carbon dioxide emission, respectively.

Production: At time \(t=0\), each agent \(i \in I\) faces the energy demand \(D_{0} \in \mathbb R_{+}\) of the entire market, the realized electricity price \(P_{0} \in \mathbb R_{+}\), and the emission allowance price \(A_{0} \in \mathbb R_{+}\). Based on this information, each agent decides on its production plan \(\xi ^{i}_{0} \in \varXi ^{i}\), where \(\varXi ^{i}\) is the set of feasible production plans. Given \(\xi ^{i}_{0} \in \varXi ^{i}\), at time T, agent realizes the total production costs,

$$\begin{aligned} C_{0}^{i}(\xi ^{i}_{0}) \in \mathbb R\text {,} \end{aligned}$$
(1)

the production volume

$$\begin{aligned} V_{0}^{i}(\xi ^{i}_{0}) \in \mathbb R\text {,} \end{aligned}$$
(2)

and the total revenue, \( P_{0}V_{0}^{i}(\xi ^{i}_{0})\text {,} \) from the electricity sold.

Allowance allocation: We assume that the administrator allocates a pre-determined number \(\gamma _{0}^{i} \in [0, \infty [\) of allowances to each agent i.

So far, we have introduced deterministic quantities. Let us now turn to uncertainties modeled by random variables on the probability space \((\varOmega ,{\mathscr {F}}, {\mathbb P})\).

Emission from production: Following the production plan \(\xi ^{i}_{0}\), the total pollution of agent i is expressed as \( E_{T}^{i}(\xi ^{i}_{0})\text {.} \)

Remark

(Randomness in demand and production) The question of randomness in energy demand and production deserves a careful argumentation. The reader may be confused by the assumption that in our one-period modeling, the time unit may correspond to the entire compliance period (which suggests a rather long time), such that our assumption on deterministic demand and unflexible production schedule appears unrealistic. To ease understanding, one shall imagine an artificial emission market model for short time period, say one day until compliance. The point of our proposal is that the elements, the arguments and the techniques required to define the optimal production plan on a daily basis are the same of those required to identify the plan \(\xi ^{i}_{0}\) over a generalized period [0, T]. The value of this toy model is that it allows a straight-forward generalization to the multi-period situation. In our one-period modeling, we assume that the nominal energy demand \(D_{0}\) is non-random and is observed at the time \(t=0\) when production decisions are made. We also suppose that the production plan \(\xi ^{i}_{0}\) of each agent is deterministically scheduled at time \(t=0\). This view is in line with the current practice in energy business, where a nominal energy production volume along with a detailed schedule of production units is planed non-randomly in advance. Of course, the realized energy consumption deviates from what has been predicted. However, based on our experience in energy markets, it does not make sense to include this random factor into equilibrium modeling, since all decisions are made on the basis of a non-random demand anticipation and non-random customer’s requests for energy delivery. To maintain energy consumption fluctuations in real-time, diverse auxiliary mechanisms are used. They can be considered as purely technical measures (security of supply by reserve margins). For this reason, we believe that it is natural to assume that, although the energy demand \(D_{0}\) is known and production plan \(\xi ^{i}_{0}\) is deterministically scheduled at time \(t=0\), the total emission, associated with this production can not be predicted with certainty at time \(t=0\) when the production and trading decisions are made. In fact, in practice the producers have to manage diverse source of randomness while following production, (demand fluctuation, outages of generators) which yields usually small but unpredictable deviations \(N^{i}\) from the nominal emission \(E^{i}_{0}(\xi ^{i}_{0}) \) associated with production plan \(\xi ^{i}_{0}\). Thus, let us agree that \(E_{T}^{i}(\xi ^{i}_{0})\) is modeled as a random variable given as a sum

$$\begin{aligned} E_{T}^{i}(\xi ^{i}_{0})=E^{i}_{0}(\xi ^{i}_{0})+N^{i}, \qquad \xi ^{i}_{0} \in \varXi ^{i}, i \in I \end{aligned}$$
(3)

with deterministic function

$$\xi ^{i}_{0}: \varXi ^{i} \rightarrow \mathbb R, \quad \xi ^{i}_{0} \mapsto E^{i}_{0}(\xi ^{i}_{0})$$

describing the dependence of the nominal emission on the production plan and a random variable \(N^{i}\) standing for the deviation from the nominal emission. Note that the random emission \( E_{T}^{i}(\xi ^{i}_{0})\) will be the only source of uncertainty in our model.

To ease our analysis, let us agree on the natural assumption that for production schedules \(\xi ^{i}_{0} \in \varXi _{i}\) \(i \in I\) the total market emission \(\sum _{i \in I}E_{T}^{i}(\xi ^{i}_{0})\) possesses no point masses:

$$\begin{aligned} {\mathbb P}( \sum _{i \in I}E_{T}^{i}(\xi ^{i}_{0})=z) =0 \qquad \text {for all} \, z \in \mathbb R. \end{aligned}$$
(4)

Allowance trading: At times \(t=0, T\) the allowance permits can be exchanged between agents by trading at the prices \(A_{0}\) and \(A_{T}\), respectively. Denote by \(\vartheta ^{i}_{0}, \vartheta ^{i}_{T}\) the change at times \(t=0, T\) of the allowance number held by agent \(i \in I\). Such trading yields a revenue, which is

$$\begin{aligned} -\vartheta ^{i}_{0}A_{0}- \vartheta ^{i}_{T}A_{T}. \end{aligned}$$
(5)

Note that \(\vartheta ^{i}_{0}\) and \(A_{0}\) are deterministic, whereas \(\vartheta ^{i}_{T}\) and \(A_{T}\) are modeled as random variables. Observe that sales are described by negative values of \(\vartheta ^{i}_{0}, \vartheta ^{i}_{T}\), therefore (5) is non-negative random variable, if permits are sold.

Penalty payment: As mentioned above, the penalty \(\pi \in [0, +\infty [\) must be paid at maturity T for each unit of pollutant not covered by allowances. Given the changes at times \(t=0,T\) due to allowance trading, i.e. \(\vartheta ^{i}_{0}\) and \(\vartheta ^{i}_{T}\), the production \(\xi ^{i}_{0}\), and the total number \(\gamma ^{i}_{0}\) of allowances allocated to agent \(i \in I\), the loss of agent i due to potential penalty payment is given by

$$\begin{aligned} \pi ( E_{0}^{i}(\xi ^{i}_{T}) -\vartheta ^{i}_{0}- \vartheta ^{i}_{T} -\gamma _{0}^{i} )^{+} \text {.} \end{aligned}$$
(6)

Individual profit: In view of (1)–(6), the profit of agent \(i \in I\) following trading and production strategy \((\vartheta ^{i}, \xi ^{i})=(\vartheta ^{i}_{0}, \vartheta ^{i}_{T}, \xi ^{i}_{0})\) depends on the market prices \((A, P)=(A_{0}, A_{T}, P_{0})\) for allowances and energy and is given by

$$\begin{aligned} L^{A, P, i}(\vartheta ^{i}, \xi ^{i})= & {} - \vartheta ^{i}_{0}A_{0} -\vartheta ^{i}_{T}A_{T}- C_{0}^{i}(\xi ^{i}_{0}) + P_{0}V_{0}^{i}(\xi ^{i}_{0}) \nonumber \\&-\pi ( E_{T}^{i}(\xi ^{i}_{0}) -\vartheta ^{i}_{0}-\vartheta ^{i}_{T} -\gamma _{0}^{i} )^{+}\text {.} \end{aligned}$$

Note that the individual profit could be negative.

Risk-aversion and rational behavior: Suppose that the risk attitudes of each agent \(i \in I\) are described by a pre-specified strictly increasing utility function \(U^{i}: \mathbb R\rightarrow \mathbb R\). With this, the rational behavior of the agent i is targeted on the maximization of the functional

$$\begin{aligned} ( \vartheta ^{i}, \xi ^{i}) \mapsto {\mathbb E}(U^{i}(L^{A, P, i}( \vartheta ^{i},\xi ^{i}))) \end{aligned}$$

over all the possible trading and production strategies \(( \vartheta ^{i}, \xi ^{i})=( \vartheta ^{i}_{0}, \vartheta ^{i}_{T}, \xi ^{i}_{0})\).

Energy demand: Suppose that at time \(t=0\) all agents observe the total energy demand, which is described by \(D_{0} \in \mathbb R_{+}\). Let us agree that the demand must be covered.

Market equilibrium: Following standard apprehension, a realistic market state is described by the so-called equilibrium—a situation where all allowance prices, all allowance positions, and all production decisions are such that each agent is satisfied by its own policy and, at the same time, natural restrictions are fulfilled.

Definition 1

Given energy demand \(D_{0} \in \mathbb R_{+}\), the prices \((A^{*}, P^{*}) = (A^{*}_{0}, A^{*}_{T}, P^{*}_{0})\) are called equilibrium prices, if, for each agent \(i \in I\text {,}\) there exists a strategy \((\vartheta ^{i*},\xi ^{i*})=(\vartheta ^{i*}_{0},\vartheta ^{i*}_{T}, \xi ^{i*}_{0})\) such that:

  1. (i)

    the energy demand is covered

    $$ \sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0})= D_{0}\text {,} $$
  2. (ii)

    the emission certificates are in zero net supply

    $$\begin{aligned} \sum _{i \in I}\vartheta _{t}^{i*}=0 \quad \text {almost surely for} \, t=0 \, \text {and} \, t=T, \end{aligned}$$
    (7)
  3. (iii)

    each agent \(i \in I\) is satisfied by its own policy in the sense that

    $$\begin{aligned} {\mathbb E}(U^{i}(L^{A^{*}, P^{*}, i} (\vartheta ^{i*}, \xi ^{i*}))) \ge {\mathbb E}(U^{i}(L^{A^{*}, P^{*}, i} (\vartheta ^{i}, \xi ^{i}))) \quad \end{aligned}$$
    (8)

    holds for any alternative strategy \((\vartheta ^{i}, \xi ^{i})\).

The main objective of this section is to prove that in the present model the electricity price formation is determined by the usual merit order arguments, where the effect of emission regulation causes emission allowance prices to enter the costs of production at the specific emission rate. This issue can be considered as the core mechanism of any cap-and-trade system, since including pollution costs into final product prices causes a change of the merit order of production technologies towards a cleaner production. To formulate this result, let us elaborate on the opportunity costs and introduce additional definitions.

Opportunity costs: In the economic literature, they stand for the forgone benefit from using a certain strategy compared to the next best alternative. For example, the opportunity costs of farming own land is the amount which could be obtained by renting the land to someone else. Let us explain how the opportunity costs necessarily lead to windfall profits.

When facing energy (electricity) generation, producers consider a profit, which could be potentially realized when, instead of production, unused emission allowances were sold to the market. For instance, if the price of the emission certificates is 12 € per tonne of CO2 and the production of one Megawatt-hour (MWh) emits two tonnes of CO2 (say, using a coal-fired steam turbine), then the producer must decide between two strategies which are equivalent in terms of their emission certificate balance:

  • produce and sell one MWh to the market,

  • do not produce this MWh and sell allowances covering two tonnes of CO2.

In this situation, the opportunity costs of producing one MWh are \(2\times 12=24\) €. Obviously, the agent produces energy only if the first strategy is at least as profitable as the second one. Thereby, both the production and the opportunity costs must be considered in the formation of the electricity market price. Clearly, if the production costs of electricity are 30 € per MWh, then the energy will be produced only if its price covers both the production and the opportunity costs. Thus electricity can only be delivered at a price exceeding \(30+2 \times 12=54\) €. That is, in order to trigger the electricity production, the opportunity costs must be added to the production costs.

In the scientific community, this phenomenon is well-known under the name of cost-pass-through. An empirical analysis, see [12] confirms that the strategy of cost-pass-through is currently followed by the European energy producers. Furthermore, the detailed investigation of mathematical market models shows that the cost-pass-through is the only possible strategy in the so-called equilibrium state of the market. This can be interpreted as follows: when behaving optimally, the energy producers must pass the allowance price on to the consumers. Note that the producer obtains a windfall profit of 24 € in anycase: if electricity price is higher than 54, by passing-through the price of the certificates (that he has received for free); if the price is less than 54, by selling 2 certificates at 12 € on the emission market.

More importantly, it turns out that the cost-pass-through is nothing but the core mechanism responsible for the emission savings. Namely, due to the opportunity costs, clean technologies appear cheaper than emission-intense production strategies. For instance, the alternative generation technology represented by a gas turbine, which yields energy at the price of 40 € and emits only one tonne of CO2, hardly competes with a coal-fired steam turbine under generic regime (without emissions regulation). Namely, if there is no regulatory framework, then the coal-fired steam turbine is scheduled first and the gas turbine has to wait until the energy demand can not be covered by coal-fired steam technologies. However, given an emission regulation, the opposite is true: say, if the allowance price is equal to 12 € per tonne of CO2 as above, then the gas technology appears cheaper, operating at full costs of \(40+ 1\times 12=52\) €. Thus, the gas turbine is scheduled first, followed by the coal-fired steam turbine, which runs only if the installed gas turbine capacity does not cover the energy demand.

In the next section, we will show that the only rational behavior in equilibrium is to pass the opportunity costs on to the consumers. For this, we require additional notions.

Definition 2

Consider a given energy amount \(d \in \mathbb R_{+}\) and a given allowance price \(a \in \mathbb R_{+}\).

(i) Introduce the individual opportunity merit order costs of agent \(i \in I\) as

$$ {\mathscr {C}}^{i}(d, a)=\inf \{C_{0}^{i}(\xi _{0}^{i})+ a E_{0}^{i}(\xi _{0}^{i}) \, : \, \xi _{0}^{i} \in \varXi ^{i}, V_{0}^{i}(\xi _{0}^{i}) \ge d\}. $$

An individual production plan \(\xi _{0}^{i} \in \varXi ^{i}\) is called conform with opportunity costs at emission price \(a \in \mathbb R_{+}\) if

$$\begin{aligned} {\mathscr {C}}^{i}(V_{0}^{i}(\xi ^{i}_{0}),a)=C_{0}^{i}(\xi _{0}^{i})+ a E_{0}^{i}(\xi _{0}^{i}). \end{aligned}$$

that is to say \(\xi _{0}^{i}\) is confirm if it minimizes the production and emission costs among all the alternative plans offering the same generation and given emission price a.

(ii) Introduce the cumulative opportunity merit order costs as

$$ {\mathscr {C}}(d, a)=\inf \{\sum _{i \in I}(C_{0}^{i}(\xi _{0}^{i})+ a E_{0}^{i}(\xi _{0}^{i})) \, : \, \xi _{0}^{i}\in \varXi ^{i}, i \in I, \sum _{i \in I} V_{0}^{i}(\xi _{0}^{i}) \ge d \}. $$

The production plans \(\xi _{0}^{i} \in \varXi ^{i}\text {,}\) \(i \in I\text {,}\) are called conform with opportunity costs at emission price \(a \in \mathbb R_{+} \) if

$$ {\mathscr {C}}(\sum _{i \in I}V_{0}^{i}(\xi ^{i}_{0}), a)=\sum _{i \in I}\left( C_{0}^{i}(\xi ^{i}_{0})+ a E_{0}^{i}(\xi _{0}^{i}) \right) . $$

(iii) Any price \(p \in \mathbb R_{+}\) with the property that

$$\begin{aligned} -{\mathscr {C}}(\tilde{d}, a)+ p \tilde{d} \le -{\mathscr {C}}(d, a)+ p d \, \text {for all} \, \tilde{d} \in \mathbb R_{+} \end{aligned}$$
(9)

is referred to as an opportunity merit order electricity price at (da) and it can be understand as the marginal cost for the entire generation sector given the level of demand d and emission price a.

4 Properties of Equilibrium

With these definitions, we now show that, within any equilibrium, the production plans are always conform with opportunity costs. Furthermore, the equilibrium electricity price is always an opportunity merit order price.

Proposition 1

Given energy demand \(D_{0}\), let \((A^{*}, P^{*})=(A^{*}_{0},A^{*}_{T}, P^{*}_{0})\) be the equilibrium prices with the corresponding strategies \( (\vartheta ^{i*},\xi ^{i*})\text {,} i \in I\text {,} \) then the following points hold:

(i) For each agent \(i \in I\text {,}\) the individual production plan \(\xi ^{i*}_{0}\) is conform with opportunity costs at emission price \(A_{0}^{*}\):

$$\begin{aligned} {\mathscr {C}}^{i}(V_{0}^{i}(\xi ^{i*}_{0}), A^{*}_{0})=C_{0}^{i}(\xi ^{i*}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i*}_{0}). \end{aligned}$$
(10)

(ii) The market production schedule \(\xi ^{i*}_{0}\text {,}\) \(i \in I\text {,}\) is conform with opportunity costs at emission price \(A_{0}^{*}\):

$$\begin{aligned} {\mathscr {C}}(\sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}), A^{*}_{0})=\sum _{i \in I}(C_{0}^{i}(\xi ^{i*}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i*}_{0})). \end{aligned}$$
(11)

(iii) \(P^{*}_{0}\) is an opportunity merit order price at \((\sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}) , A^{*}_{0})\).

The direct economic consequence of this mathematical result is that each individual will organize its own production strategy by scheduling power production units in an increasing price order. Thereby their variable costs (which include the opportunity costs of using the emission certificates) are considered. Within such a schedule, a demand d is satisfied by gradually turning on the most economic plants, until a generation level matching d is reached. Furthermore, the above proposition states that such schedule is reached not only on the individual level, but also for the entire market. Namely, an overall demand d is satisfied by gradually turning on the most economic plants until reaching a production which covers the demand d. Such aggregate ordering is usually called merit order, for this reason we call \({\mathscr {C}}^{i}(d, a)\) and \({\mathscr {C}}(d, a)\) (agent’s i) opportunity merit order costs and cumulative opportunity merit order costs, respectively.

It is worth noticing that the opportunity merit order electricity price as defined in (9) is equal to the marginal cost of generating electricity when the level of demand is d given certificate price a. Coupling this property with the merit order production in the electricity sector, implies that the most expensive production in the plan \(\xi _0^i\) will determine the marginal cost at demand level d and emission price a. The opportunity merit order electricity price at (da) is defined as the lowest price, which is able to trigger the required production level.

Proof

(i) Consider the equilibrium strategy \((\vartheta ^{i*}_{0},\xi ^{i*}_{0})\) of agent \(i \in I\). Assume that the agent deviates from this strategy following an alternative production plan \(\xi ^{i}_{0} \in \varXi ^{i}\). However, to keep the same emission credit balance, the difference \(E_{0}^{i}(\xi ^{i}_{0})-E_{0}^{i}( \xi ^{i*}_{0})\) is traded at the market in addition to \(\vartheta ^{i*}_{0}\). That is, we change the equilibrium trading strategy \((\vartheta ^{i*}_{0}, \vartheta ^{i*}_{T})\) to an alternative trading strategy \((\vartheta ^{i}_{0}, \vartheta ^{i}_{T})\) given by

$$ \vartheta ^{i}_{0}= \vartheta ^{i*}_{0}+E_{0}^{i}(\xi ^{i}_{0})-E_{0}^{i}( \xi ^{i*}_{0}), \qquad \vartheta ^{i}_{T}=\vartheta ^{i*}_{T}. $$

Note that we have changed only the initial position, from \(\vartheta ^{i*}_{0}\) to \(\vartheta ^{i}_{0}\), whereas the final position is the same \(\vartheta ^{i}_{T}=\vartheta ^{i*}_{T}\). A direct calculation shows that the profit of this alternative strategy \((\vartheta ^{i}, \xi ^{i})=(\vartheta ^{i}_{0}, \vartheta ^{i}_{T}, \xi ^{i}_{0})\) can be written as

$$\begin{aligned} L^{A^{*}, P^{*}, i} (\vartheta ^{i}, \xi ^{i})= L^{A^{*}, P^{*}, i} (\vartheta ^{i*} , \xi ^{i*})+R(\xi ^{i}_{0}, \xi ^{i*}_{0})\text {,} \end{aligned}$$

i.e. it differs form the original profit \(L^{A^{*}, P^{*}, i} (\vartheta ^{i*} , \xi ^{i*})\) by the amount

$$ R(\xi ^{i}_{0}, \xi ^{i*}_{0})= P^{*}_{0}(V_{0}^{i}(\xi ^{i}_{0})-V_{0}^{i}(\xi ^{i*}_{0})) + (C_{0}^{i}(\xi ^{i*}_{0}) -C_{0}^{i}(\xi ^{i}_{0}) ) + A^{*}_{0}(E_{0}^{i}(\xi ^{i*}_{0})-E_{0}^{i}( \xi ^{i}_{0})). $$

Note that \(R(\xi ^{i}_{0}, \xi ^{i*}_{0})\) can not be positive, since otherwise

$$\begin{aligned} L^{A^{*}, P^{*}, i} ( \vartheta ^{i} , \xi ^{i})> & {} L^{A^{*}, P^{*}, i} (\vartheta ^{i*} , \xi ^{i*}) \end{aligned}$$

would yield

$$ {\mathbb E}(U^{i}(L^{A^{*}, P^{*}, i} (\vartheta ^{i},\ \xi ^{i})))> {\mathbb E}(U^{i}(L^{A^{*}, P^{*}, i} (\vartheta ^{i*} , \xi ^{i*})))\text {,} $$

thus contradicting the optimality of the equilibrium strategy \((\vartheta ^{i*} , \xi ^{i*})\) (see (8)). Now, from \( R(\xi ^{i}_{0}, \xi ^{i*}_{0})\le 0\) we conclude that

$$\begin{aligned} -C_{0}^{i}(\xi ^{i*}_{0})-A^{*}_{0}E_{0}^{i}(\xi ^{i*}_{0}) + P^{*}_{0}V_{0}^{i}(\xi ^{i*}_{0}) \ge -C_{0}^{i}(\xi ^{i}_{0})-A^{*}_{0}E_{0}^{i}(\xi ^{i}_{0}) + P^{*}_{0}V_{0}^{i}(\xi ^{i}_{0}) \end{aligned}$$
(12)

for each \(\xi ^{i}_{0} \in \varXi ^{i}\). With this, we conclude the desired assertion (10) as follows: any alternative production plan \(\xi ^{i}_{0}\) which produces an energy amount \(V_{0}^{i}(\xi ^{i}_{0})\) at least equal to \(V_{0}^{i}(\xi ^{i*}_{0})\) must satisfy

$$ C_{0}^{i}(\xi ^{i*}_{0})+A^{*}_{0}E_{0}^{i}(\xi ^{i*}_{0}) \le C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}). $$

Thus,

$$ C_{0}^{i}(\xi ^{i*}_{0})+A^{*}_{0}E_{0}^{i}(\xi ^{i*}_{0})=\inf \{C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}): \xi ^{i}_{0} \in \varXi ^{i}, V_{0}^{i}(\xi ^{i}_{0}) \ge V_{0}^{i}(\xi ^{i*}_{0})\}. $$

(ii) Summing up (12) over \(i \in I\text {,}\) yields, for arbitrary choices of \(\xi ^{i}_{0} \in \varXi ^{i}\text {,}\) \(i \in I\text {,}\)

$$\begin{aligned}&- \sum _{i\in I}\left( C_{0}^{i}(\xi ^{i*}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i*}_{0}) \right) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i*}_{0})\qquad \qquad \qquad \nonumber \\&\qquad \qquad \ge -\sum _{i\in I}\left( C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}) \right) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i}_{0}). \end{aligned}$$
(13)

From this, we conclude that, for any choice \(\xi ^{i}_{0} \in \varXi ^{i}\text {,}\) \(i \in I\text {,}\) of production plans satisfying

$$ \sum _{i \in I}V_{0}^{i}(\xi ^{i}_{0}) \ge \sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}), $$

it holds

$$ \sum _{i\in I}\left( C_{0}^{i}(\xi ^{i*}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i*}_{0}) \right) \le \sum _{i\in I}\left( C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}) \right) , $$

implying the desired assertion (11).

(iii) We need to prove that, for any \(\tilde{d} \in \mathbb R_{+}\),

$$ -{\mathscr {C}}(\tilde{d}, A^{*}_{0}) + P^{*}_{0}\tilde{d} \le -{\mathscr {C}}( \sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}) , A^{*}_{0}) + P^{*}_{0} \sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}). $$

For each choice of production strategies \(\xi ^{i}_{0} \in \varXi ^{i}\text {,}\) \(i \in I\text {,}\) estimate (13), combined with (11), gives

$$\begin{aligned}&- {\mathscr {C}}(\sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}), A^{*}_{0}) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i*}_{0}) \qquad \qquad \qquad \\&\qquad \qquad \ge -\sum _{i\in I}\left( C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}) \right) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i}_{0}). \end{aligned}$$

In particular, if the strategies are chosen from

$$\begin{aligned} \{ (\xi ^{i}_{0})_{i \in I}\, : \, \xi ^{i}_{0} \in \varXi ^{i}, \quad i \in I, \qquad \sum _{i \in I}V_{0}^{i}(\xi ^{i}_{0}) \ge \tilde{d}\} \text {,} \end{aligned}$$

then it holds that

$$ - {\mathscr {C}}(\sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}), A^{*}_{0}) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i*}_{0}) \ge -\sum _{i\in I}\left( C_{0}^{i}(\xi ^{i}_{0})+ A^{*}_{0} E_{0}^{i}(\xi ^{i}_{0}) \right) +P^{*}_{0}\tilde{d}. $$

Passing on the right-hand side of this inequality to

$$ {\mathscr {C}}(\tilde{d}, A^{*}_{0}):=\inf \{\sum _{i \in I}(C_{0}^{i}(\xi _{0}^{i})+ A^{*}_{0} E_{0}^{i}(\xi _{0}^{i})) \, : \, \xi _{0}^{i}\in \varXi ^{i}, i \in I, \sum _{i \in I} V_{0}^{i}(\xi _{0}^{i}) \ge \tilde{d} \}\text {,} $$

yields the desired assertion

$$ - {\mathscr {C}}(\sum _{i \in I}V_{0}^{i}(\xi ^{i*}_{0}), A^{*}_{0}) +P^{*}_{0}\sum _{i\in I}V_{0}^{i}(\xi ^{i*}_{0}) \ge - {\mathscr {C}}(\tilde{d}, A^{*}_{0}) +P^{*}_{0}\tilde{d}. $$

Remark

The statement (ii) of the above proposition characterizes equilibrium in terms of aggregated quantities. Once the equilibrium is reached, the production schedule represents the cheapest way to satisfy the demand. From this perspective, the reader may conclude that the equilibrium production schedule can be obtained as a production plan which minimizes the overall costs among those which cover a given demand, indicating that only aggregated quantities do influence the equilibrium. However, we shall emphasize that the equilibrium still heavily depends on individual ingredients (such as initial endowments and risk aversion), which enter through the initial allowance price.

Now, we show another natural property of the equilibrium allowance prices. It turns out that there is no arbitrage allowance trading and that the terminal allowance price is digital.

Proposition 2

Given energy demand \(D_{0}\), let \((A^{*}, P^{*})=(A^{*}_{0},A^{*}_{T}, P^{*}_{0})\) be the equilibrium prices with the corresponding strategies \((\vartheta ^{i*},\xi ^{i*})\text {,}\) \(i \in I\text {,}\). It holds:

(i) There exists a risk-neutral measure \(\mathbb Q^{*} \sim \mathbb P\) such that \(A^{*}=(A^{*}_{0}, A^{*}_{T})\) follows a martingale with respect to \({\mathbb Q}^{*}\).

(ii) The terminal allowance price in equilibrium is digital

$$\begin{aligned} A^{*}_{T}=\pi 1_{\{\sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \ge 0\}}. \end{aligned}$$
(14)

Proof

(i) According to the first fundamental theorem of asset pricing, see [8] in discrete-time setting, the existence of the so-called equivalent martingale measure satisfying \( A_{0}^{*}={\mathbb E}^{\mathbb Q^{*}}(A_{T}) \) is ensured by the absence of arbitrage. Fortunately, in our framework, the absence of arbitrage follows from the equilibrium notion, as we show next. We thus conclude (i) of the above theorem and it remains to verify that the equilibrium rules out all arbitrage opportunities for allowance trading. Let us follow an indirect proof, assuming that \(\nu _{0}\) is an arbitrage allowance trading, meaning that

$$\begin{aligned} {\mathbb P}(\nu _{0}(A^{*}_{T}-A^{*}_{0}) \ge 0)=1, \quad {\mathbb P}(\nu _{0}(A^{*}_{T}-A^{*}_{0})>0)>0. \end{aligned}$$
(15)

Based on this we obtain a contradiction by showing that each agent i can change its own policy \((\vartheta ^{i*}, \xi ^{i*})\) to an improved strategy \(({\tilde{\vartheta }}^{i}, \xi ^{i*})\) satisfying

$$\begin{aligned} {\mathbb E}\left( U^{i}\left( L^{A^{*}, i} (\vartheta ^{i*}, \xi ^{i*})\right) \right) <{\mathbb E}\left( U^{i} (L^{A^{*}, i} ({\tilde{\vartheta }}^{i}, \xi ^{i*}) ) \right) . \end{aligned}$$
(16)

The improvement is achieved by incorporating arbitrage \(\nu _{0}\) into the allowance trading of each agent i as follows:

$$ {\tilde{\vartheta }}^{i}_{0}:=\vartheta ^{i*}_{0}+ \nu _{0}, \qquad {\tilde{\vartheta }}^{i}_{T}:=\vartheta ^{i*}_{T}- \nu _{0}. $$

Indeed, the revenue improvement from allowance trading is

$$ -{\tilde{\vartheta }}^{i}_{0}A^{*}_{0}-{\tilde{\vartheta }}^{i}_{T}A^{*}_{T} =- \vartheta ^{i}_{0}A^{*}_{0}- \vartheta ^{i}_{T}A^{*}_{T}+ \nu _{0}(A^{*}_{T}-A^{*}_{0} )\text {,} $$

which we combine with (15) to see that

$$\begin{aligned}{\mathbb P}\left( L^{A,i}(\vartheta ^{i*}, \xi ^{i*}) \le L^{A,i}({\tilde{\vartheta }}^{i}, \xi ^{i*}) \right) =1, \qquad {\mathbb P}\left( L^{A,i}(\vartheta ^{i*}, \xi ^{i})<L^{A,i}({\tilde{\vartheta }}^{i}, \xi ^{i*}) \right) >0\text {,} \end{aligned}$$

which implies (16), therefore contradicting the optimality of \((\vartheta ^{i*}, \xi ^{i*})\).

(ii) From equilibrium property (8), it follows that for almost each \(\omega \in \varOmega \) the terminal allowance position adjustment \(\vartheta _{T}(\omega )\) is a maximizer on \(\mathbb R\) to

$$\begin{aligned} z \mapsto - z A^{*}_{T}(\omega ) - \pi (E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}-z)^{+}. \end{aligned}$$
(17)

Note that a maximizer of this mapping exists only if \(0 \le A^{*}_{T}(\omega ) \le \pi \). That is, the terminal allowance price in equilibrium must be within the interval \(A^{*}_{T} \in [0, \pi ]\) almost surely. Let us show now that the price actually attains only boundary values almost surely, i.e.

$$\begin{aligned} A^{*}_{T} \in \{0, \pi \} \ \quad \text {almost surely.} \end{aligned}$$
(18)

Suppose that an intermediate value \(A^{*}_{T}(\omega ) \in ]0, \pi [\) is taken, then the unique maximizer of function (17) is attained on \(E_{T}^{i}(\xi ^{i*}_{T})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}\). This implies that \(\vartheta ^{i*}_{T}(\omega )=E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}\) holds for each \(i \in I\text {,}\) and a summation over i yields

$$\begin{aligned}\sum _{i \in I}\vartheta ^{i*}_{T}(\omega )=\sum _{i \in I}\left( E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}\right) =\sum _{i \in I}E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\gamma _{0}. \end{aligned}$$

Note that equilibrium property (7) ensures that the random variable on the left-hand side of the above equality is zero almost surely. Thus, the inclusion

$$\begin{aligned} \{{\omega :}A^{*}_{T} \in ]0, \pi [\} \subseteq \{{\omega :}\sum _{i \in I}E_{T}^{i}(\xi ^{i*}_{0})-\gamma _{0}=0\} \end{aligned}$$
(19)

holds almost surely. Because of (4), the probability of the event on the right-hand side of the above inclusion is zero, which shows (18).

If \(A^{*}_{T}(\omega )=0\), then a maximizer \(\vartheta ^{i*}_{T}(\omega )\) to the function (17) is attained on \([E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}, +\infty [\). Hence

$$ \{{\omega :} A^{*}_{T}=0\} \subseteq \{{\omega :} E_{T}^{i}(\xi ^{i*}_{0}) -\vartheta ^{i*}_{0}-\gamma _{0}^{i} \le \vartheta ^{i*}_{T} \} $$

holds almost surely for each \(i \in I\text {,}\) which implies that

$$ \{{\omega :} A^{*}_{T}=0\} \subseteq \{{\omega :} \sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \le \sum _{i \in I}\vartheta ^{i*}_{T} \} $$

holds almost surely. Now, because of the equilibrium property (7), we obtain the almost sure inclusion

$$ \{{\omega :} A^{*}_{T}=0\} \subseteq \{{\omega :} \sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \le 0\}\text {.} $$

Since the probability of \(A^{*}_{T}\in ]0, \pi [\) is zero (19), we conclude for the complementary event that

$$\begin{aligned} \{{\omega :} A^{*}_{T}=\pi \} \supseteq \{{\omega :} \sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \ge 0\} \end{aligned}$$
(20)

holds almost surely. Let us show the opposite inclusion. If \(A^{*}_{T}(\omega )=\pi \), then a maximizer \(\vartheta ^{i*}_{T}(\omega )\) to function (17) is attained on \(] - \infty , E_{T}^{i}(\xi ^{i*}_{0})(\omega )-\vartheta ^{i*}_{0}-\gamma _{0}^{i}]\). Hence,

$$ \{{\omega :} A^{*}_{T}=\pi \} \subseteq \{ E_{T}^{i}(\xi ^{i*}_{0}) -\vartheta ^{i*}_{0}-\gamma _{0}^{i} \ge \vartheta ^{i*}_{T} \} $$

holds almost surely for each \(i \in I\text {,}\) which implies that

$$ \{{\omega :} A^{*}_{T}=\pi \} \subseteq \{{\omega :} \sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \ge \sum _{i \in I}\vartheta ^{i*}_{T} \} $$

holds almost surely. Now, because of the equilibrium property (7), we obtain

$$\begin{aligned} \{{\omega :} A^{*}_{T}=\pi \} \subseteq \{{\omega :} \sum _{i \in I} E_{T}^{i}(\xi ^{i*}_{0}) -\gamma _{0} \ge 0\}\text {.} \end{aligned}$$
(21)

Finally, combine inclusions (20) and (21) to obtain assertion (14).

5 Social Optimality

To formulate social optimality, we require additional notations. Given production strategies \(\xi ^{i}_{0} \in \varXi ^{i}\text {,}\) \(i \in I\text {,}\) we denote the overall market production schedule by \(\xi _{0}=(\xi ^{i}_{0})_{i \in I} \) and introduce the total production costs \(C_{0}\), the total production volume \(V_{0}\), and the total carbon dioxide emission \(E_{T}\) and the total nominal carbon dioxide emission, defined by

$$\begin{aligned} C_{0}(\xi _{0})=\sum _{i \in I}C_{0}^{i}(\xi _{0}^{i}), \quad V(\xi _{0})=\sum _{i \in I}V_{0}^{i}(\xi _{0}^{i}), \quad E_{T}(\xi _{0})=\sum _{i \in I}E_{T}^{i}(\xi _{0}^{i}) , \quad E_{0}(\xi _{0})=\sum _{i \in I}E_{0}^{i}(\xi _{0}^{i}). \end{aligned}$$

Having in mind that \(C_{0}(\xi _{0})\) stands for the overall costs of the production and interpreting \(\pi (E_{T}(\xi _{0})-\gamma _{0})^{+}\) as a proxy of the environmental impact of the production schedule \(\xi _{0}\), let us agree that

$$ B(\xi _{0})=C_{0}(\xi _{0})+\pi (E_{T}(\xi _{0})-\gamma _{0})^{+} $$

expresses the social burden caused by the overall production plan \(\xi _{0} \in \times _{i \in I} \varXi ^{i}\).

It turns out that the equilibrium strategy minimizes the social burden among all production strategies which cover a given demand.

Proposition 3

Given energy demand \(D_{0}\), let \((A^{*}, P^{*})=(A^{*}_{0},A^{*}_{T}, P^{*}_{0})\) be the equilibrium prices with the corresponding strategies \( (\vartheta ^{i*},\xi ^{i*})\), \(i \in I \). Let \({\mathbb Q}^{*}\) be a risk-neutral measure whose existence is shown in Proposition 2. Then

$$\begin{aligned} E_{0}^{\mathbb Q^{*}}(B(\xi ^{*}_{0})) \le E_{0}^{\mathbb Q^{*}}(B(\xi _{0})) \end{aligned}$$
(22)

holds for each production schedule \(\xi _{0}=(\xi ^{i}_{0})_{i \in I}\in \times _{i \in I}\varXi ^{i}\) which yields at least the same production volume, \(V_{0}(\xi _{0}) \ge V_{0}(\xi ^{*}_{0})=D_{0}\).

Proof

For each convex function \(f: \mathbb R\rightarrow \mathbb R\), \(x \mapsto f(x)\), it holds \( f(x)+\nabla f(x) h \le f(x+h)\), \(h \in \mathbb R\), where \(\nabla f(x)\) stands for one of the sub-gradients of f at the point x. In particular, for convex function \(f: \mathbb R\rightarrow \mathbb R_{+}\), \(x \mapsto x^{+}\), we obtain \( x^{+}+ 1_{\{x \ge 0\}} h \le (x+\,h)^{+}\) for all \(x, h \in \mathbb R\). With the equilibrium production strategy \(\xi _{0}^{*}=(\xi _{0}^{i*})_{i \in I}\), we conclude that

$$ (E_{T}(\xi ^{*}_{0}) -\gamma _{0})^{+} + 1_{\{E_{T}(\xi ^{*}_{0})-\gamma _{0} \ge 0\}}(E_{T}(\xi _{0})-E_{T}(\xi ^{*}_{0})) \le (E_{T}(\xi _{0}) -\gamma _{0})^{+} $$

holds almost surely for any production strategy \(\xi _{0} \in \times _{i \in I} \varXi ^{i}\). Using our model assumption (3) we deduce \( E_{T}(\xi _{0})-E_{T}(\xi ^{*}_{0})=E_{0}(\xi _{0})-E_{0}(\xi ^{*}_{0}) \) which gives

$$ (E_{T}(\xi ^{*}_{0}) -\gamma _{0})^{+} + 1_{\{E_{T}(\xi ^{*}_{0})-\gamma _{0} \ge 0\}}(E_{0}(\xi _{0})-E_{0}(\xi ^{*}_{0})) \le (E_{T}(\xi _{0}) -\gamma _{0})^{+}. $$

Calculating the expectations with respect to \(\mathbb Q^{*}\) on both sides and multiplying both sides by \(\pi \), we obtain

$$\begin{aligned}&\pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi ^{*}_{0}) -\gamma _{0})^{+} \right) + \pi {\mathbb E}^{\mathbb Q^{*}}\left( 1_{\{E_{T}(\xi _{0}^{*})-\gamma _{0} \ge 0\}}\right) (E_{0}(\xi _{0})-E_{0}(\xi ^{*}_{0})) \le \\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \le \pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi _{0}) -\gamma _{0})^{+}\right) .&\end{aligned}$$

Using the martingale property and the digital terminal value of the equilibrium allowance prices shown in Proposition 2, we finally obtain

$$\begin{aligned} \pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi ^{*}_{0}) -\gamma _{0})^{+} \right) + A^{*}_{0}(E_{0}(\xi _{0})-E_{0}(\xi ^{*}_{0})) \le \pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi _{0}) -\gamma _{0})^{+}\right) \text {.} \end{aligned}$$
(23)

For the case that the strategy \(\xi _{0}\) yields at least the total production volume of the equilibrium strategy, \( V_{0}(\xi _{0}) \ge V_{0}(\xi ^{*}_{0})\text {,} \) assertion (11) in Proposition 1 yields the estimate

$$ C_{0}(\xi ^{*}_{0}) + A^{*}_{0}E_{0}(\xi ^{*}_{0}) \le C_{0}(\xi _{0})+ A^{*}_{0}E_{0}(\xi _{0})\text {,} $$

which is equivalent to

$$\begin{aligned} C_{0}(\xi ^{*}_{0}) -C_{0}(\xi _{0}) \le A^{*}_{0}(E_{0}(\xi _{0}) - E_{0}(\xi ^{*}_{0}))\text {.} \end{aligned}$$

Now, combining the last inequality with (23), we obtain

$$ C_{0}(\xi ^{*}_{0})+\pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi ^{*}_{0}) -\gamma _{0})^{+} \right) \le C_{0}(\xi _{0})+\pi {\mathbb E}^{\mathbb Q^{*}}\left( (E_{T}(\xi _{0}) -\gamma _{0})^{+}\right) \text {,} $$

which proves our claim (22).

In Proposition 3, the equilibrium production schedule \(\xi _{0}^{*}\) was characterized as a solution to the minimization problem

$$\begin{aligned} \min \{{\mathbb E}^{\mathbb Q^{*}}(B(\xi _{0}))\, : \, \xi _{0} \in \times _{i \in I}\varXi ^{i}, \quad V_{0}(\xi _{0}) \ge D_{0}\}\text {.} \end{aligned}$$
(24)

Although this fact is about minimization of social burden, it should not be interpreted as one of the classical welfare results, which typically follow from equilibrium considerations.

An interesting point here is that this type of cost-optimality needs to be taken with great care: due to the opportunity cost-pass-through, the consumers can not expect that an (inappropriately designed) cap-and-trade mechanism indeed implements the cheapest way of emission reduction, from their perspective.

To see this point, remember that the price per unit of electricity under the merit order system includes the opportunity costs of consuming the emission certificates. Therefore, given emission price \(A_{0}\) and an overall production schedule \(\xi _{0}\), the consumers pay the costs \(\sum _{i \in I}A_{0}E^i(\xi _0^i)\) to switch in the merit order and to reduce emissions. From a global perspective, this costs stands for a wealth re-distribution. From the consumer’s perspective, it is associated with a burden.

6 Equilibrium-Like Risk-Neutral Modeling

Another interesting observation from Proposition 3 is that the expectation \({\mathbb E}^{\mathbb Q^{*}}(B(\xi _{0}))\) of the social burden \(B(\xi _{0})\) is minimized with respect to a risk-neutral measure \(\mathbb Q^{*}\) which differs from the objective measure \({\mathbb P}\). The measure \(\mathbb Q^{*}\) is an outcome of the equilibrium, and, as such, it heavily depends on the many model components, for instance on the risk-aversions, on the certificate endowments, and on the production technologies of the agents. However, it is surprising that, once the measure \(\mathbb Q^{*}\) is known, other important equilibrium outcomes can be deduced from aggregated quantities only.

In particular, given \(\mathbb Q^{*}\), the equilibrium production schedule \(\xi _{0}^{*}\) can be obtained as the solution of optimization problem (24). Such solution is determined by aggregated quantities, since the social burden is by definition \(B(\xi _{0})=C_{0}(\xi _{0})+\pi (E_{T}(\xi _{0})-\gamma _{0})^{+}\) and, apart the quantities \(\gamma _{0}\) and \(\pi \) decided by the authority, it depends only on technologies present in the market. Having obtained the equilibrium production schedule \(\xi _{0}^{*}\) as the solution of optimization problem (24), the equilibrium allowance price \(A_{0}^{*}\) is calculated applying martingale pricing:

$$ A^{*}_{0}= \pi {\mathbb E}^{\mathbb Q^{*}}(1_{\{E_{T}(\xi _{0})- \gamma _{0} \ge 0\}}). $$

Finally, given the production schedule \(\xi _{0}^{*}\) and the allowance price \(A^{*}_{0}\), also the electricity price \(P^{*}_{0}\) is determined as the marginal price of the most expensive technology, which is active in the schedule \(\xi ^{*}_{0}\). Note that the opportunity costs must be included when identifying the most expensive active technology.

Summarizing, we conclude that given \(\mathbb Q^{*}\), merely aggregated market parameters are needed to obtain \(\xi ^{*}_{0}\), \(A^{*}_{0}\), and \(P^{*}_{0}\). This observation can be used to establish and to analyze realistic equilibrium-like emission market models. Such models are needed, since in real emission trading it is nearly impossible to estimate the equilibrium from a market model, because the individual parameters are highly undetermined. For instance, within the EU ETS, there are more than 25, 000 agents, each with a specific production, its own certificate endowment, and a completely unknown risk-aversion. On the contrary, the aggregated quantities are well-known, since high-quality market data on total allowance allocation and electricity production, including capacities, costs, and emission rates, are available.

In view of this, we suggest an alternative way to estimate the market equilibrium based on aggregated quantities and using an exogenously specified proxy for risk-neutral measure \(\mathbb Q^{*}\). This general approach follows the standard methodology of financial mathematics, which successfully describes the stochastic evolution of equilibrium prices on financial markets under an appropriately chosen risk-neutral measure.

6.1 Market Equilibrium Under a Risk-Neutral Measure

We sketch the following program for equilibrium-type modeling of emission markets:

(1) Determine a risk-neutral measure \(\mathbb Q^{*}\), which corresponds to an equilibrium situation of the emission market in the sense of (i) of Proposition 2.

(2) Observe that, because of Proposition 3, the equilibrium production schedule \(\xi _{0}^{*}\) must be a solution to the deterministic optimization problem

$$\begin{aligned} \begin{array}{c} \text {minimize} \, C_{0}(\xi _{0}) + \pi {\mathbb E}^{\mathbb Q^{*}}((E_{T}(\xi _{0})-\gamma _{0})^{+} ) \\ \text {subject to}\, V_{0}(\xi _{0}) \ge D_{0}, \, \text {over} \, \xi _{0} \in \times _{i \in I}\varXi ^{i}. \end{array} \end{aligned}$$
(25)

To address the problem further, a specification of the space \(\times _{i \in I}\varXi ^{i}\) of market production strategies along with the functions \(C_{0}\), \(V_{0}\) and \(E_{T}\) is required.

(3) Given the equilibrium production schedule, calculate the total production costs \(C_{0}(\xi _{0}^{*})\), the total carbon dioxide emission \(E_{T}(\xi _{0}^{*})\), and the energy price \(P^{*}\) to assess the performance in emission reduction of the current market architecture.

Remark

Note that a risk neutral measure is not unique. Clearly, finding a realistic candidate for the risk-neutral measure \(\mathbb Q^{*}\) can be difficult. However, notice that one merely needs to specify the fluctuations of the non-predictable emissions under a risk-neutral measure. This distribution can be described in a parameter-dependent way, which adds flexibility to the model. For instance, having assumed a Gaussian framework under objective measure and modeling the density of the risk-neutral measure in terms of a Girsanov kernel. Given theoretical initial emission price depending on the parameters of the Girsanov kernel, these parameters shall be adjusted to match the observed emission prices. Similar techniques have been applied in financial modeling under the framework of implicit model calibration. Being one of the central questions in quantitative finance, the connection between risk-neutral and objective measures has been successfully addressed over the recent decades. In view of this development, modeling from a risk-neutral measure perspective can be based on a variety of different methods, ranging from benchmark approach, estimation of risk premia, state price density from portfolio optimization theory, to several econometric methods for the estimation of the so-called market price of risk.

Finally, the performance of the cap-and-trade mechanism can be examined leveraging on the dependence of the major economic indicators, i.e. total consumers’ costs \(P^{*}_{0}D_{0}\), total (producers’) production costs \(C_{0}(\xi ^{i*}_{0})\), and total carbon dioxide emission \(E_{0}(\xi ^{i*}_{0})\), on the controls available to the regulator.

Note that in the standard scheme the regulator controls two key parameters: the total allowance allocation \(\gamma _{0}\) and the penalty size \(\pi \). The performance of regulation could be assessed in terms of relation between the increase of consumers’ costs versus the achieved emission reduction. Such analysis may uncover and visualize inappropriate market architectures, where unlucky choices of \(\gamma _{0}\) and \(\pi \) cause consumers to pay too much, compared to emission savings. Complementary or supplementary policies can be evaluated at this point next to the cap-and-trade system. In particular, different forms of subsidies and carbon tax mechanisms can have a strong impact on the merit order of different technologies.

7 Conclusions

In this paper, we show how equilibrium analysis and optimization of an environmental market can be carried out under the realistic assumption of risk-averse market players. This generalization is based on a novel approach. Thereby, we obtain a number of interesting observations, which allow studying equilibrium market situations in terms of aggregated market quantities under a risk-neutral measure. Our findings show how market design optimization can be achieved incorporating risk-aversion. The choice to develop our approach in one-period setting, yields explicit results which constructively contribute to better understand the working principles of financial instruments and to improve both effectiveness and efficiency of environmental policy.